Search Results for author: Fan Wang

Found 165 papers, 67 papers with code

PLATO-KAG: Unsupervised Knowledge-Grounded Conversation via Joint Modeling

no code implementations EMNLP (NLP4ConvAI) 2021 Xinxian Huang, Huang He, Siqi Bao, Fan Wang, Hua Wu, Haifeng Wang

Large-scale conversation models are turning to leveraging external knowledge to improve the factual accuracy in response generation.

Response Generation

New Threats against Object Detector with Non-local Block

no code implementations ECCV 2020 Yi Huang, Fan Wang, Adams Wai-Kin Kong, Kwok-Yan Lam

The experiments show that the universal patches are able to mislead the detector with greater probabilities.

Object

SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer

no code implementations4 Apr 2024 Zijie Wu, Chaohui Yu, Yanqin Jiang, Chenjie Cao, Fan Wang, Xiang Bai

Recent advances in 2D/3D generative models enable the generation of dynamic 3D objects from a single-view video.

motion prediction

Uncovering the Text Embedding in Text-to-Image Diffusion Models

no code implementations1 Apr 2024 Hu Yu, Hao Luo, Fan Wang, Feng Zhao

The correspondence between input text and the generated image exhibits opacity, wherein minor textual modifications can induce substantial deviations in the generated image.

Text Data-Centric Image Captioning with Interactive Prompts

no code implementations28 Mar 2024 Yiyu Wang, Hao Luo, Jungang Xu, Yingfei Sun, Fan Wang

Among them, the mainstream solution is to project image embeddings into the text embedding space with the assistance of consistent representations between image-text pairs from the CLIP model.

Image Captioning

XScale-NVS: Cross-Scale Novel View Synthesis with Hash Featurized Manifold

no code implementations28 Mar 2024 Guangyu Wang, Jinzhi Zhang, Fan Wang, Ruqi Huang, Lu Fang

We also introduce a novel dataset, namely GigaNVS, to benchmark cross-scale, high-resolution novel view synthesis of realworld large-scale scenes.

Neural Rendering Novel View Synthesis

Learning-based Multi-continuum Model for Multiscale Flow Problems

no code implementations21 Mar 2024 Fan Wang, Yating Wang, Wing Tat Leung, Zongben Xu

Multiscale problems can usually be approximated through numerical homogenization by an equation with some effective parameters that can capture the macroscopic behavior of the original system on the coarse grid to speed up the simulation.

Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation

1 code implementation18 Mar 2024 Wangbo Zhao, Jiasheng Tang, Yizeng Han, Yibing Song, Kai Wang, Gao Huang, Fan Wang, Yang You

Existing parameter-efficient fine-tuning (PEFT) methods have achieved significant success on vision transformers (ViTs) adaptation by improving parameter efficiency.

Semantic Segmentation Video Recognition

Neural radiance fields-based holography [Invited]

no code implementations2 Mar 2024 Minsung Kang, Fan Wang, Kai Kumano, Tomoyoshi Ito, Tomoyoshi Shimobaba

NeRF is a state-of-the-art technique for 3D light-field reconstruction from 2D images based on volume rendering.

Accelerating Parallel Sampling of Diffusion Models

no code implementations15 Feb 2024 Zhiwei Tang, Jiasheng Tang, Hao Luo, Fan Wang, Tsung-Hui Chang

Our experiments demonstrate that ParaTAA can decrease the inference steps required by common sequential sampling algorithms such as DDIM and DDPM by a factor of 4~14 times.

Image Generation

Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach

1 code implementation28 Jan 2024 Shaofeng Zhang, Jinfa Huang, Qiang Zhou, Zhibin Wang, Fan Wang, Jiebo Luo, Junchi Yan

At inference, we generate images with arbitrary expansion multiples by inputting an anchor image and its corresponding positional embeddings.

Image Outpainting

DMT: Comprehensive Distillation with Multiple Self-supervised Teachers

no code implementations19 Dec 2023 Yuang Liu, Jing Wang, Qiang Zhou, Fan Wang, Jun Wang, Wei zhang

Numerous self-supervised learning paradigms, such as contrastive learning and masked image modeling, have been proposed to acquire powerful and general representations from unlabeled data.

Contrastive Learning Model Compression +1

CL2CM: Improving Cross-Lingual Cross-Modal Retrieval via Cross-Lingual Knowledge Transfer

no code implementations14 Dec 2023 Yabing Wang, Fan Wang, Jianfeng Dong, Hao Luo

Cross-lingual cross-modal retrieval has garnered increasing attention recently, which aims to achieve the alignment between vision and target language (V-T) without using any annotated V-T data pairs.

Cross-Lingual Transfer Cross-Modal Retrieval +4

Towards a Psychological Generalist AI: A Survey of Current Applications of Large Language Models and Future Prospects

no code implementations1 Dec 2023 Tianyu He, Guanghui Fu, Yijing Yu, Fan Wang, Jianqiang Li, Qing Zhao, Changwei Song, Hongzhi Qi, Dan Luo, Huijing Zou, Bing Xiang Yang

The complexity of psychological principles underscore a significant societal challenge, given the vast social implications of psychological problems.

Language-guided Few-shot Semantic Segmentation

no code implementations23 Nov 2023 Jing Wang, Yuang Liu, Qiang Zhou, Fan Wang

Few-shot learning is a promising way for reducing the label cost in new categories adaptation with the guidance of a small, well labeled support set.

Few-Shot Semantic Segmentation Segmentation +1

OmniSeg3D: Omniversal 3D Segmentation via Hierarchical Contrastive Learning

1 code implementation20 Nov 2023 Haiyang Ying, Yixuan Yin, Jinzhi Zhang, Fan Wang, Tao Yu, Ruqi Huang, Lu Fang

Towards holistic understanding of 3D scenes, a general 3D segmentation method is needed that can segment diverse objects without restrictions on object quantity or categories, while also reflecting the inherent hierarchical structure.

Contrastive Learning Novel View Synthesis +1

Pre-Training on Large-Scale Generated Docking Conformations with HelixDock to Unlock the Potential of Protein-ligand Structure Prediction Models

no code implementations21 Oct 2023 Lihang Liu, Donglong He, Xianbin Ye, Jingbo Zhou, Shanzhuo Zhang, Xiaonan Zhang, Jun Li, Hua Chai, Fan Wang, Jingzhou He, Liang Zheng, Yonghui Li, Xiaomin Fang

In this work, we show that by pre-training a geometry-aware SE(3)-Equivariant neural network on a large-scale docking conformation generated by traditional physics-based docking tools and then fine-tuning with a limited set of experimentally validated receptor-ligand complexes, we can achieve outstanding performance.

Drug Discovery Molecular Docking

SingleInsert: Inserting New Concepts from a Single Image into Text-to-Image Models for Flexible Editing

no code implementations12 Oct 2023 Zijie Wu, Chaohui Yu, Zhen Zhu, Fan Wang, Xiang Bai

To utilize the abundant visual priors in the off-the-shelf T2I models, a series of methods try to invert an image to proper embedding that aligns with the semantic space of the T2I model.

Image Generation Novel View Synthesis

SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels

2 code implementations15 Sep 2023 Henry Hengyuan Zhao, Pichao Wang, Yuyang Zhao, Hao Luo, Fan Wang, Mike Zheng Shou

Recently, many parameter-efficient fine-tuning (PEFT) methods have been proposed, and their experiments demonstrate that tuning only 1% of extra parameters could surpass full fine-tuning in low-data resource scenarios.

Domain Generalization Few-Shot Learning

Viewpoint Integration and Registration with Vision Language Foundation Model for Image Change Understanding

no code implementations15 Sep 2023 Xiaonan Lu, Jianlong Yuan, Ruigang Niu, Yuan Hu, Fan Wang

Therefore, they cannot be directly applied to cope with image change understanding (ICU), which requires models to capture actual changes between multiple images and describe them in language.

Temporal compressive edge imaging enabled by a lensless diffuser camera

no code implementations13 Sep 2023 Ze Zheng, Baolei Liu, Jiaqi Song, Lei Ding, Xiaolan Zhong, David Mcgloin, Fan Wang

Lensless imagers based on diffusers or encoding masks enable high-dimensional imaging from a single shot measurement and have been applied in various applications.

Edge Detection

Punctate White Matter Lesion Segmentation in Preterm Infants Powered by Counterfactually Generative Learning

no code implementations7 Sep 2023 Zehua Ren, Yongheng Sun, Miaomiao Wang, Yuying Feng, Xianjun Li, Chao Jin, Jian Yang, Chunfeng Lian, Fan Wang

In this paper, we propose to leverage the idea of counterfactual reasoning coupled with the auxiliary task of brain tissue segmentation to learn fine-grained positional and morphological representations of PWMLs for accurate localization and segmentation.

counterfactual Counterfactual Reasoning +2

Region Generation and Assessment Network for Occluded Person Re-Identification

no code implementations7 Sep 2023 Shuting He, Weihua Chen, Kai Wang, Hao Luo, Fan Wang, Wei Jiang, Henghui Ding

Then, to measure the importance of each generated region, we introduce a Region Assessment Module (RAM) that assigns confidence scores to different regions and reduces the negative impact of the occlusion regions by lower scores.

Person Re-Identification

Forensic Histopathological Recognition via a Context-Aware MIL Network Powered by Self-Supervised Contrastive Learning

no code implementations27 Aug 2023 Chen Shen, Jun Zhang, Xinggong Liang, Zeyi Hao, Kehan Li, Fan Wang, Zhenyuan Wang, Chunfeng Lian

Forensic pathology is critical in analyzing death manner and time from the microscopic aspect to assist in the establishment of reliable factual bases for criminal investigation.

Contrastive Learning Domain Generalization +3

Graph-Segmenter: Graph Transformer with Boundary-aware Attention for Semantic Segmentation

no code implementations15 Aug 2023 Zizhang Wu, Yuanzhu Gan, Tianhao Xu, Fan Wang

To address this issue, we propose a Graph-Segmenter, including a Graph Transformer and a Boundary-aware Attention module, which is an effective network for simultaneously modeling the more profound relation between windows in a global view and various pixels inside each window as a local one, and for substantial low-cost boundary adjustment.

Relation Segmentation +1

Revisiting Vision Transformer from the View of Path Ensemble

no code implementations ICCV 2023 Shuning Chang, Pichao Wang, Hao Luo, Fan Wang, Mike Zheng Shou

Therefore, we propose the path pruning and EnsembleScale skills for improvement, which cut out the underperforming paths and re-weight the ensemble components, respectively, to optimize the path combination and make the short paths focus on providing high-quality representation for subsequent paths.

Dynamic Token-Pass Transformers for Semantic Segmentation

no code implementations3 Aug 2023 Yuang Liu, Qiang Zhou, Jing Wang, Fan Wang, Jun Wang, Wei zhang

Vision transformers (ViT) usually extract features via forwarding all the tokens in the self-attention layers from top to toe.

Segmentation Semantic Segmentation

RegionBLIP: A Unified Multi-modal Pre-training Framework for Holistic and Regional Comprehension

1 code implementation3 Aug 2023 Qiang Zhou, Chaohui Yu, Shaofeng Zhang, Sitong Wu, Zhibing Wang, Fan Wang

To this end, we propose to extract features corresponding to regional objects as soft prompts for LLM, which provides a straightforward and scalable approach and eliminates the need for LLM fine-tuning.

Image Comprehension

Improved Neural Radiance Fields Using Pseudo-depth and Fusion

no code implementations27 Jul 2023 Jingliang Li, Qiang Zhou, Chaohui Yu, Zhengda Lu, Jun Xiao, Zhibin Wang, Fan Wang

To make the constructed volumes as close as possible to the surfaces of objects in the scene and the rendered depth more accurate, we propose to perform depth prediction and radiance field reconstruction simultaneously.

Depth Estimation Depth Prediction +1

Points-to-3D: Bridging the Gap between Sparse Points and Shape-Controllable Text-to-3D Generation

no code implementations26 Jul 2023 Chaohui Yu, Qiang Zhou, Jingliang Li, Zhe Zhang, Zhibin Wang, Fan Wang

To better utilize the sparse 3D points, we propose an efficient point cloud guidance loss to adaptively drive the NeRF's geometry to align with the shape of the sparse 3D points.

3D Generation Text to 3D

Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal Contrastive Training

1 code implementation15 Jun 2023 Chong Liu, Yuqi Zhang, Hongsong Wang, Weihua Chen, Fan Wang, Yan Huang, Yi-Dong Shen, Liang Wang

Most previous works either simply learn coarse-grained representations of the overall image and text, or elaborately establish the correspondence between image regions or pixels and text words.

Representation Learning Retrieval +1

Graph Convolution Based Efficient Re-Ranking for Visual Retrieval

1 code implementation15 Jun 2023 Yuqi Zhang, Qi Qian, Hongsong Wang, Chong Liu, Weihua Chen, Fan Wang

In particular, the plain GCR is extended for cross-camera retrieval and an improved feature propagation formulation is presented to leverage affinity relationships across different cameras.

Distributed Computing Image Retrieval +3

SwinRDM: Integrate SwinRNN with Diffusion Model towards High-Resolution and High-Quality Weather Forecasting

no code implementations5 Jun 2023 Lei Chen, Fei Du, Yuan Hu, Fan Wang, Zhibin Wang

Recurrent predictions for future atmospheric fields are firstly performed at 1. 40625-degree resolution, and then a diffusion-based super-resolution model is leveraged to recover the high spatial resolution and finer-scale atmospheric details.

Super-Resolution Weather Forecasting

MetaModulation: Learning Variational Feature Hierarchies for Few-Shot Learning with Fewer Tasks

1 code implementation17 May 2023 Wenfang Sun, Yingjun Du, XianTong Zhen, Fan Wang, Ling Wang, Cees G. M. Snoek

To account for the uncertainty caused by the limited training tasks, we propose a variational MetaModulation where the modulation parameters are treated as latent variables.

Few-Shot Learning

NIKI: Neural Inverse Kinematics with Invertible Neural Networks for 3D Human Pose and Shape Estimation

1 code implementation CVPR 2023 Jiefeng Li, Siyuan Bian, Qi Liu, Jiasheng Tang, Fan Wang, Cewu Lu

In this work, we present NIKI (Neural Inverse Kinematics with Invertible Neural Network), which models bi-directional errors to improve the robustness to occlusions and obtain pixel-aligned accuracy.

3D human pose and shape estimation

UniNeXt: Exploring A Unified Architecture for Vision Recognition

1 code implementation26 Apr 2023 Fangjian Lin, Jianlong Yuan, Sitong Wu, Fan Wang, Zhibin Wang

Interestingly, the ranking of these spatial token mixers also changes under our UniNeXt, suggesting that an excellent spatial token mixer may be stifled due to a suboptimal general architecture, which further shows the importance of the study on the general architecture of vision backbone.

Spatial Token Mixer

DOAD: Decoupled One Stage Action Detection Network

no code implementations1 Apr 2023 Shuning Chang, Pichao Wang, Fan Wang, Jiashi Feng, Mike Zheng Show

Specifically, one branch focuses on detection representation for actor detection, and the other one for action recognition.

Action Detection Action Recognition +1

Beyond Appearance: a Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual Tasks

4 code implementations CVPR 2023 Weihua Chen, Xianzhe Xu, Jian Jia, Hao Luo, Yaohua Wang, Fan Wang, Rong Jin, Xiuyu Sun

Unlike the existing self-supervised learning methods, prior knowledge from human images is utilized in SOLIDER to build pseudo semantic labels and import more semantic information into the learned representation.

Human Parsing Pedestrian Attribute Recognition +6

ARMBench: An Object-centric Benchmark Dataset for Robotic Manipulation

no code implementations29 Mar 2023 Chaitanya Mitash, Fan Wang, Shiyang Lu, Vikedo Terhuja, Tyler Garaas, Felipe Polido, Manikantan Nambi

This paper introduces Amazon Robotic Manipulation Benchmark (ARMBench), a large-scale, object-centric benchmark dataset for robotic manipulation in the context of a warehouse.

Defect Detection Object +1

EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation

2 code implementations22 Mar 2023 Hansheng Chen, Wei Tian, Pichao Wang, Fan Wang, Lu Xiong, Hao Li

In this paper, we propose the EPro-PnP, a probabilistic PnP layer for general end-to-end pose estimation, which outputs a distribution of pose with differentiable probability density on the SE(3) manifold.

3D Object Detection 6D Pose Estimation using RGB +1

Making Vision Transformers Efficient from A Token Sparsification View

1 code implementation CVPR 2023 Shuning Chang, Pichao Wang, Ming Lin, Fan Wang, David Junhao Zhang, Rong Jin, Mike Zheng Shou

In this work, we propose a novel Semantic Token ViT (STViT), for efficient global and local vision transformers, which can also be revised to serve as backbone for downstream tasks.

Efficient ViTs Instance Segmentation +4

Revisit Parameter-Efficient Transfer Learning: A Two-Stage Paradigm

no code implementations14 Mar 2023 Hengyuan Zhao, Hao Luo, Yuyang Zhao, Pichao Wang, Fan Wang, Mike Zheng Shou

In view of the practicality of PETL, previous works focus on tuning a small set of parameters for each downstream task in an end-to-end manner while rarely considering the task distribution shift issue between the pre-training task and the downstream task.

Transfer Learning Vocal Bursts Valence Prediction

A Practical Upper Bound for the Worst-Case Attribution Deviations

no code implementations CVPR 2023 Fan Wang, Adams Wai-Kin Kong

Model attribution is a critical component of deep neural networks (DNNs) for its interpretability to complex models.

D2Q-DETR: Decoupling and Dynamic Queries for Oriented Object Detection with Transformers

no code implementations1 Mar 2023 Qiang Zhou, Chaohui Yu, Zhibin Wang, Fan Wang

In this paper, we propose an end-to-end framework for oriented object detection, which simplifies the model pipeline and obtains superior performance.

Object object-detection +3

LMSeg: Language-guided Multi-dataset Segmentation

no code implementations27 Feb 2023 Qiang Zhou, Yuang Liu, Chaohui Yu, Jingliang Li, Zhibin Wang, Fan Wang

Instead of relabeling each dataset with the unified taxonomy, a category-guided decoding module is designed to dynamically guide predictions to each datasets taxonomy.

Image Augmentation Panoptic Segmentation +1

Dual-mode adaptive-SVD ghost imaging

no code implementations14 Feb 2023 Dajing Wang, Baolei Liu, Jiaqi Song, Yao Wang, Xuchen Shan, Fan Wang

In this paper, we present a dual-mode adaptive singular value decomposition ghost imaging (A-SVD GI), which can be easily switched between the modes of imaging and edge detection.

Edge Detection

Head-Free Lightweight Semantic Segmentation with Linear Transformer

1 code implementation11 Jan 2023 Bo Dong, Pichao Wang, Fan Wang

On the ADE20K dataset, our model achieves 41. 8 mIoU and 4. 6 GFLOPs, which is 4. 4 mIoU higher than Segformer, with 45% less GFLOPs.

Segmentation Semantic Segmentation

NeuroExplainer: Fine-Grained Attention Decoding to Uncover Cortical Development Patterns of Preterm Infants

no code implementations1 Jan 2023 Chenyu Xue, Fan Wang, Yuanzhuo Zhu, Hui Li, Deyu Meng, Dinggang Shen, Chunfeng Lian

Deploying reliable deep learning techniques in interdisciplinary applications needs learned models to output accurate and (even more importantly) explainable predictions.

MHPL: Minimum Happy Points Learning for Active Source Free Domain Adaptation

no code implementations CVPR 2023 Fan Wang, Zhongyi Han, Zhiyan Zhang, Rundong He, Yilong Yin

Source free domain adaptation (SFDA) aims to transfer a trained source model to the unlabeled target domain without accessing the source data.

Active Learning Source-Free Domain Adaptation

Query Enhanced Knowledge-Intensive Conversation via Unsupervised Joint Modeling

1 code implementation19 Dec 2022 Mingzhu Cai, Siqi Bao, Xin Tian, Huang He, Fan Wang, Hua Wu

In this paper, we propose an unsupervised query enhanced approach for knowledge-intensive conversations, namely QKConv.

Conversational Question Answering Retrieval

CAT: Learning to Collaborate Channel and Spatial Attention from Multi-Information Fusion

no code implementations13 Dec 2022 Zizhang Wu, Man Wang, Weiwei Sun, Yuchen Li, Tianhao Xu, Fan Wang, Keke Huang

Channel and spatial attention mechanism has proven to provide an evident performance boost of deep convolution neural networks (CNNs).

Image Classification Instance Segmentation +3

Complete Solution for Vehicle Re-ID in Surround-view Camera System

no code implementations8 Dec 2022 Zizhang Wu, Tianhao Xu, Fan Wang, Xiaoquan Wang, Jing Song

Vehicle re-identification (Re-ID) is a critical component of the autonomous driving perception system, and research in this area has accelerated in recent years.

Autonomous Driving Vehicle Re-Identification

Surround-view Fisheye BEV-Perception for Valet Parking: Dataset, Baseline and Distortion-insensitive Multi-task Framework

no code implementations8 Dec 2022 Zizhang Wu, Yuanzhu Gan, Xianzhi Li, Yunzhe Wu, Xiaoquan Wang, Tianhao Xu, Fan Wang

Most existing networks based on public datasets may generalize suboptimal results on these valet parking scenes, also affected by the fisheye distortion.

Autonomous Driving

A Unified Multimodal De- and Re-coupling Framework for RGB-D Motion Recognition

1 code implementation16 Nov 2022 Benjia Zhou, Pichao Wang, Jun Wan, Yanyan Liang, Fan Wang

Although improving motion recognition to some extent, these methods still face sub-optimal situations in the following aspects: (i) Data augmentation, i. e., the scale of the RGB-D datasets is still limited, and few efforts have been made to explore novel data augmentation strategies for videos; (ii) Optimization mechanism, i. e., the tightly space-time-entangled network structure brings more challenges to spatiotemporal information modeling; And (iii) cross-modal knowledge fusion, i. e., the high similarity between multimodal representations caused to insufficient late fusion.

Action Recognition Data Augmentation +2

PLATO-K: Internal and External Knowledge Enhanced Dialogue Generation

no code implementations2 Nov 2022 Siqi Bao, Huang He, Jun Xu, Hua Lu, Fan Wang, Hua Wu, Han Zhou, Wenquan Wu, Zheng-Yu Niu, Haifeng Wang

Recently, the practical deployment of open-domain dialogue systems has been plagued by the knowledge issue of information deficiency and factual inaccuracy.

Dialogue Generation Memorization +1

VTC-LFC: Vision Transformer Compression with Low-Frequency Components

1 code implementation NIPS 2022 Zhenyu Wang, Hao Luo, Pichao Wang, Feng Ding, Fan Wang, Hao Li

Although Vision transformers (ViTs) have recently dominated many vision tasks, deploying ViT models on resource-limited devices remains a challenging problem.

Behavioral Intention Prediction in Driving Scenes: A Survey

no code implementations1 Nov 2022 Jianwu Fang, Fan Wang, Jianru Xue, Tat-Seng Chua

Behavioral Intention Prediction (BIP) simulates such a human consideration process and fulfills the early prediction of specific behaviors.

Trajectory Prediction

Q-TOD: A Query-driven Task-oriented Dialogue System

1 code implementation14 Oct 2022 Xin Tian, Yingzhan Lin, Mengfei Song, Siqi Bao, Fan Wang, Huang He, Shuqi Sun, Hua Wu

Firstly, as the query is in the form of natural language and not confined to the schema of the knowledge base, the issue of domain adaption is alleviated remarkably in Q-TOD.

Domain Adaptation Response Generation +2

Effective Vision Transformer Training: A Data-Centric Perspective

no code implementations29 Sep 2022 Benjia Zhou, Pichao Wang, Jun Wan, Yanyan Liang, Fan Wang

To achieve these two purposes, we propose a novel data-centric ViT training framework to dynamically measure the ``difficulty'' of training samples and generate ``effective'' samples for models at different training stages.

Towards Boosting the Open-Domain Chatbot with Human Feedback

1 code implementation30 Aug 2022 Hua Lu, Siqi Bao, Huang He, Fan Wang, Hua Wu, Haifeng Wang

Many open-domain dialogue models pre-trained with social media comments can generate coherent replies but have difficulties producing engaging responses when interacting with real users.

Chatbot

FAKD: Feature Augmented Knowledge Distillation for Semantic Segmentation

1 code implementation30 Aug 2022 Jianlong Yuan, Qian Qi, Fei Du, Zhibin Wang, Fan Wang, Yifan Liu

Inspired by the recent progress on semantic directions on feature-space, we propose to include augmentations in feature space for efficient distillation.

Knowledge Distillation Segmentation +1

GEM-2: Next Generation Molecular Property Prediction Network by Modeling Full-range Many-body Interactions

1 code implementation11 Aug 2022 Lihang Liu, Donglong He, Xiaomin Fang, Shanzhuo Zhang, Fan Wang, Jingzhou He, Hua Wu

Full-range many-body interactions between electrons have been proven effective in obtaining an accurate solution of the Schr"odinger equation by classical computational chemistry methods, although modeling such interactions consumes an expensive computational cost.

Drug Discovery Graph Regression +2

HelixFold-Single: MSA-free Protein Structure Prediction by Using Protein Language Model as an Alternative

1 code implementation28 Jul 2022 Xiaomin Fang, Fan Wang, Lihang Liu, Jingzhou He, Dayong Lin, Yingfei Xiang, Xiaonan Zhang, Hua Wu, Hui Li, Le Song

Our proposed method, HelixFold-Single, first pre-trains a large-scale protein language model (PLM) with thousands of millions of primary sequences utilizing the self-supervised learning paradigm, which will be used as an alternative to MSAs for learning the co-evolution information.

Protein Language Model Protein Structure Prediction +1

Dynamic Gradient Reactivation for Backward Compatible Person Re-identification

no code implementations12 Jul 2022 Xiao Pan, Hao Luo, Weihua Chen, Fan Wang, Hao Li, Wei Jiang, Jianming Zhang, Jianyang Gu, Peike Li

To address this issue, we propose the Ranking-based Backward Compatible Learning (RBCL), which directly optimizes the ranking metric between new features and old features.

Person Re-Identification Retrieval

HelixFold: An Efficient Implementation of AlphaFold2 using PaddlePaddle

1 code implementation12 Jul 2022 Guoxia Wang, Xiaomin Fang, Zhihua Wu, Yiqun Liu, Yang Xue, Yingfei Xiang, dianhai yu, Fan Wang, Yanjun Ma

Due to the complex model architecture and large memory consumption, it requires lots of computational resources and time to implement the training and inference of AlphaFold2 from scratch.

Protein Structure Prediction

TCR: A Transformer Based Deep Network for Predicting Cancer Drugs Response

no code implementations10 Jul 2022 Jie Gao, Jing Hu, Wanqing Sun, Yili Shen, Xiaonan Zhang, Xiaomin Fang, Fan Wang, Guodong Zhao

Our study highlights the prediction power of TCR and its potential value for cancer drug repurpose and precision oncology treatment.

Link the World: Improving Open-domain Conversation with Dynamic Spatiotemporal-aware Knowledge

no code implementations28 Jun 2022 Han Zhou, Xinchao Xu, Wenquan Wu, Zheng-Yu Niu, Hua Wu, Siqi Bao, Fan Wang, Haifeng Wang

Making chatbots world aware in a conversation like a human is a crucial challenge, where the world may contain dynamic knowledge and spatiotemporal state.

Informativeness

Active Source Free Domain Adaptation

no code implementations22 May 2022 Fan Wang, Zhongyi Han, Zhiyan Zhang, Yilong Yin

We then propose minimum happy points learning (MHPL) to actively explore and exploit MH points.

Source-Free Domain Adaptation

HelixADMET: a robust and endpoint extensible ADMET system incorporating self-supervised knowledge transfer

no code implementations17 May 2022 Shanzhuo Zhang, Zhiyuan Yan, Yueyang Huang, Lihang Liu, Donglong He, Wei Wang, Xiaomin Fang, Xiaonan Zhang, Fan Wang, Hua Wu, Haifeng Wang

Additionally, the pre-trained model provided by H-ADMET can be fine-tuned to generate new and customised ADMET endpoints, meeting various demands of drug research and development requirements.

Drug Discovery Self-Supervised Learning +1

Exploiting the Relationship Between Kendall's Rank Correlation and Cosine Similarity for Attribution Protection

no code implementations15 May 2022 Fan Wang, Adams Wai-Kin Kong

In this paper, we first show that the expected Kendall's rank correlation is positively correlated to cosine similarity and then indicate that the direction of attribution is the key to attribution robustness.

Adversarial Robustness

An empirical equilibrium model of formal and informal credit markets in developing countries

no code implementations26 Apr 2022 Fan Wang

I develop and estimate a dynamic equilibrium model of risky entrepreneurs' borrowing and savings decisions incorporating both formal and local-informal credit markets.

Optimal allocations to heterogeneous agents with an application to stimulus checks

no code implementations8 Apr 2022 Vegard M. Nygaard, Bent E. Sørensen, Fan Wang

A planner allocates discrete transfers of size $D_g$ to $N$ heterogeneous groups labeled $g$ and has CES preferences over the resulting outcomes, $H_g(D_g)$.

Structure-aware Protein Self-supervised Learning

1 code implementation6 Apr 2022 Can Chen, Jingbo Zhou, Fan Wang, Xue Liu, Dejing Dou

Furthermore, we propose to leverage the available protein language model pretrained on protein sequences to enhance the self-supervised learning.

Protein Language Model Representation Learning +1

Early life height and weight production functions with endogenous energy and protein inputs

no code implementations6 Apr 2022 Esteban Puentes, Fan Wang, Jere R. Behrman, Flávio Cunha, John Hoddinott, John A. Maluccio, Linda S. Adair, Judith B. Borja, Reynaldo Martorell, Aryeh D. Stein

We examine effects of protein and energy intakes on height and weight growth for children between 6 and 24 months old in Guatemala and the Philippines.

You are what your parents expect: Height and local reference points

no code implementations5 Apr 2022 Fan Wang, Esteban Puentes, Jere R. Behrman, Flávio Cunha

We explore the exogenous variation in reference height produced by a protein-supplementation experiment in Guatemala to estimate our model's parameters.

Fewer, better pathways for all? Intersectional impacts of rural school consolidation in China's minority regions

no code implementations4 Apr 2022 Emily Hannum, Fan Wang

Much more than Han youth, ethnic minority youth were negatively affected by closure, in terms of its impact on both educational attainment and written Mandarin facility.

Same environment, stratified impacts? Air pollution, extreme temperatures, and birth weight in south China

no code implementations1 Apr 2022 Xiaoying Liu, Jere R. Behrman, Emily Hannum, Fan Wang, Qingguo Zhao

This paper investigates whether associations between birth weight and prenatal ambient environmental conditions--pollution and extreme temperatures--differ by 1) maternal education; 2) children's innate health; and 3) interactions between these two.

Estimating the Effects of Educational System Consolidation: The Case of China's Rural School Closure Initiative

no code implementations31 Mar 2022 Emily Hannum, Xiaoying Liu, Fan Wang

We estimate the impact of educational infrastructure consolidation on educational attainment using the case of China's rural primary school closure policies in the early 2000s.

EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation

1 code implementation CVPR 2022 Hansheng Chen, Pichao Wang, Fan Wang, Wei Tian, Lu Xiong, Hao Li

The 2D-3D coordinates and corresponding weights are treated as intermediate variables learned by minimizing the KL divergence between the predicted and target pose distribution.

3D Object Detection 6D Pose Estimation using RGB +1

Controllable energy angular spectrum method

no code implementations18 Mar 2022 Fan Wang, Tomoyoshi Shimobaba, Takashi Kakue, Tomoyoshi Ito

A controllable energy method, which considers the undersampling issue of the transfer function and valid spectral energy of a source signal, is proposed to implement angular spectrum diffraction calculation in near and far fields.

valid

Information retrieval for label noise document ranking by bag sampling and group-wise loss

no code implementations12 Mar 2022 Chunyu Li, Jiajia Ding, Xing Hu, Fan Wang

To fit bag sampling well, after query and document are encoded, the global features of each group are extracted by convolutional layer and max-pooling to improve the model's resistance to the impact of labeling noise, finally, calculate the LCE group-wise loss.

Document Ranking Information Retrieval +2

Detecting Owner-member Relationship with Graph Convolution Network in Fisheye Camera System

no code implementations28 Jan 2022 Zizhang Wu, Jason Wang, Tianhao Xu, Fan Wang

The owner-member relationship between wheels and vehicles contributes significantly to the 3D perception of vehicles, especially in embedded environments.

Graph Attention

Image-to-Video Re-Identification via Mutual Discriminative Knowledge Transfer

no code implementations21 Jan 2022 Pichao Wang, Fan Wang, Hao Li

During the KD process, the TCL loss transfers the local structure, exploits the higher order information, and mitigates the misalignment of the heterogeneous output of teacher and student networks.

Knowledge Distillation Transfer Learning

Network-ELAA Beamforming and Coverage Analysis for eMBB/URLLC in Spatially Non-Stationary Rician Channels

no code implementations19 Jan 2022 Jinfei Wang, Yi Ma, Na Yi, Rahim Tafazolli, Fan Wang

Finally, it is shown that the network-ELAA can offer significant coverage extension (50% or more in most of cases) when comparing with the single-AP scenario.

Exploring Domain-Invariant Parameters for Source Free Domain Adaptation

no code implementations CVPR 2022 Fan Wang, Zhongyi Han, Yongshun Gong, Yilong Yin

In contrast, we provide a fascinating insight: rather than attempting to learn domain-invariant representations, it is better to explore the domain-invariant parameters of the source model.

Privacy Preserving Source-Free Domain Adaptation

Memory-Augmented Deep Conditional Unfolding Network for Pan-Sharpening

1 code implementation CVPR 2022 Gang Yang, Man Zhou, Keyu Yan, Aiping Liu, Xueyang Fu, Fan Wang

Pan-sharpening aims to obtain high-resolution multispectral (MS) images for remote sensing systems and deep learning-based methods have achieved remarkable success.

Denoising

TAGPerson: A Target-Aware Generation Pipeline for Person Re-identification

1 code implementation28 Dec 2021 Kai Chen, Weihua Chen, Tao He, Rong Du, Fan Wang, Xiuyu Sun, Yuchen Guo, Guiguang Ding

In TAGPerson, we extract information from target scenes and use them to control our parameterized rendering process to generate target-aware synthetic images, which would hold a smaller gap to the real images in the target domain.

Person Re-Identification

ELSA: Enhanced Local Self-Attention for Vision Transformer

1 code implementation23 Dec 2021 Jingkai Zhou, Pichao Wang, Fan Wang, Qiong Liu, Hao Li, Rong Jin

Self-attention is powerful in modeling long-range dependencies, but it is weak in local finer-level feature learning.

Image Classification Instance Segmentation +2

Multimodal Pre-Training Model for Sequence-based Prediction of Protein-Protein Interaction

no code implementations9 Dec 2021 Yang Xue, Zijing Liu, Xiaomin Fang, Fan Wang

However, neither sequences nor contact maps can fully characterize structures and functions of the proteins, which are closely related to the PPI problem.

Drug Discovery

TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation

1 code implementation2 Dec 2021 Zhaoyuan Yin, Pichao Wang, Fan Wang, Xianzhe Xu, Hanling Zhang, Hao Li, Rong Jin

Unsupervised semantic segmentation aims to obtain high-level semantic representation on low-level visual features without manual annotations.

Ranked #2 on Unsupervised Semantic Segmentation on COCO-Stuff-171 (using extra training data)

Segmentation Self-Supervised Learning +1

HelixMO: Sample-Efficient Molecular Optimization in Scene-Sensitive Latent Space

no code implementations30 Nov 2021 ZhiYuan Chen, Xiaomin Fang, Zixu Hua, Yueyang Huang, Fan Wang, Hua Wu

Efficient exploration of the chemical space to search the candidate drugs that satisfy various constraints is a fundamental task of drug discovery.

Drug Discovery Efficient Exploration

Self-Supervised Pre-Training for Transformer-Based Person Re-Identification

2 code implementations23 Nov 2021 Hao Luo, Pichao Wang, Yi Xu, Feng Ding, Yanxin Zhou, Fan Wang, Hao Li, Rong Jin

We first investigate self-supervised learning (SSL) methods with Vision Transformer (ViT) pretrained on unlabelled person images (the LUPerson dataset), and empirically find it significantly surpasses ImageNet supervised pre-training models on ReID tasks.

 Ranked #1 on Unsupervised Person Re-Identification on Market-1501 (using extra training data)

Self-Supervised Learning Unsupervised Domain Adaptation +1

Amendable Generation for Dialogue State Tracking

1 code implementation EMNLP (NLP4ConvAI) 2021 Xin Tian, Liankai Huang, Yingzhan Lin, Siqi Bao, Huang He, Yunyi Yang, Hua Wu, Fan Wang, Shuqi Sun

In this paper, we propose a novel Amendable Generation for Dialogue State Tracking (AG-DST), which contains a two-pass generation process: (1) generating a primitive dialogue state based on the dialogue of the current turn and the previous dialogue state, and (2) amending the primitive dialogue state from the first pass.

Dialogue State Tracking Multi-domain Dialogue State Tracking +1

Do What Nature Did To Us: Evolving Plastic Recurrent Neural Networks For Generalized Tasks

no code implementations29 Sep 2021 Fan Wang, Hao Tian, Haoyi Xiong, Hua Wu, Yang Cao, Yu Kang, Haifeng Wang

While artificial neural networks (ANNs) have been widely adopted in machine learning, researchers are increasingly obsessed by the gaps between ANNs and natural neural networks (NNNs).

Meta-Learning

PLATO-XL: Exploring the Large-scale Pre-training of Dialogue Generation

3 code implementations20 Sep 2021 Siqi Bao, Huang He, Fan Wang, Hua Wu, Haifeng Wang, Wenquan Wu, Zhihua Wu, Zhen Guo, Hua Lu, Xinxian Huang, Xin Tian, Xinchao Xu, Yingzhan Lin, Zheng-Yu Niu

To explore the limit of dialogue generation pre-training, we present the models of PLATO-XL with up to 11 billion parameters, trained on both Chinese and English social media conversations.

Dialogue Generation

Reinforcement Learning with Evolutionary Trajectory Generator: A General Approach for Quadrupedal Locomotion

1 code implementation14 Sep 2021 Haojie Shi, Bo Zhou, Hongsheng Zeng, Fan Wang, Yueqiang Dong, Jiangyong Li, Kang Wang, Hao Tian, Max Q. -H. Meng

However, due to the complex nonlinear dynamics in quadrupedal robots and reward sparsity, it is still difficult for RL to learn effective gaits from scratch, especially in challenging tasks such as walking over the balance beam.

reinforcement-learning Reinforcement Learning (RL)

CDTrans: Cross-domain Transformer for Unsupervised Domain Adaptation

2 code implementations ICLR 2022 Tongkun Xu, Weihua Chen, Pichao Wang, Fan Wang, Hao Li, Rong Jin

Along with the pseudo labels, a weight-sharing triple-branch transformer framework is proposed to apply self-attention and cross-attention for source/target feature learning and source-target domain alignment, respectively.

Unsupervised Domain Adaptation

Scaled ReLU Matters for Training Vision Transformers

no code implementations8 Sep 2021 Pichao Wang, Xue Wang, Hao Luo, Jingkai Zhou, Zhipeng Zhou, Fan Wang, Hao Li, Rong Jin

In this paper, we further investigate this problem and extend the above conclusion: only early convolutions do not help for stable training, but the scaled ReLU operation in the \textit{convolutional stem} (\textit{conv-stem}) matters.

Evolving Decomposed Plasticity Rules for Information-Bottlenecked Meta-Learning

2 code implementations8 Sep 2021 Fan Wang, Hao Tian, Haoyi Xiong, Hua Wu, Jie Fu, Yang Cao, Yu Kang, Haifeng Wang

In contrast, biological neural networks (BNNs) can adapt to various new tasks by continually updating the neural connections based on the inputs, which is aligned with the paradigm of learning effective learning rules in addition to static parameters, e. g., meta-learning.

Memorization Meta-Learning

ADER:Adapting between Exploration and Robustness for Actor-Critic Methods

no code implementations8 Sep 2021 Bo Zhou, Kejiao Li, Hongsheng Zeng, Fan Wang, Hao Tian

Combining off-policy reinforcement learning methods with function approximators such as neural networks has been found to lead to overestimation of the value function and sub-optimal solutions.

Continuous Control

Exploring the Quality of GAN Generated Images for Person Re-Identification

no code implementations23 Aug 2021 Yiqi Jiang, Weihua Chen, Xiuyu Sun, Xiaoyu Shi, Fan Wang, Hao Li

Recently, GAN based method has demonstrated strong effectiveness in generating augmentation data for person re-identification (ReID), on account of its ability to bridge the gap between domains and enrich the data variety in feature space.

Person Re-Identification Unsupervised Domain Adaptation

Structure-aware Interactive Graph Neural Networks for the Prediction of Protein-Ligand Binding Affinity

1 code implementation21 Jul 2021 Shuangli Li, Jingbo Zhou, Tong Xu, Liang Huang, Fan Wang, Haoyi Xiong, Weili Huang, Dejing Dou, Hui Xiong

To this end, we propose a structure-aware interactive graph neural network (SIGN) which consists of two components: polar-inspired graph attention layers (PGAL) and pairwise interactive pooling (PiPool).

Drug Discovery Graph Attention +1

Graph Convolution for Re-ranking in Person Re-identification

1 code implementation5 Jul 2021 Yuqi Zhang, Qian Qi, Chong Liu, Weihua Chen, Fan Wang, Hao Li, Rong Jin

In this work, we propose a graph-based re-ranking method to improve learned features while still keeping Euclidean distance as the similarity metric.

Person Re-Identification Re-Ranking +1

KVT: k-NN Attention for Boosting Vision Transformers

1 code implementation28 May 2021 Pichao Wang, Xue Wang, Fan Wang, Ming Lin, Shuning Chang, Hao Li, Rong Jin

A key component in vision transformers is the fully-connected self-attention which is more powerful than CNNs in modelling long range dependencies.

An Empirical Study of Vehicle Re-Identification on the AI City Challenge

1 code implementation20 May 2021 Hao Luo, Weihua Chen, Xianzhe Xu, Jianyang Gu, Yuqi Zhang, Chong Liu, Yiqi Jiang, Shuting He, Fan Wang, Hao Li

We mainly focus on four points, i. e. training data, unsupervised domain-adaptive (UDA) training, post-processing, model ensembling in this challenge.

Re-Ranking Retrieval +1

City-Scale Multi-Camera Vehicle Tracking Guided by Crossroad Zones

1 code implementation14 May 2021 Chong Liu, Yuqi Zhang, Hao Luo, Jiasheng Tang, Weihua Chen, Xianzhe Xu, Fan Wang, Hao Li, Yi-Dong Shen

Multi-Target Multi-Camera Tracking has a wide range of applications and is the basis for many advanced inferences and predictions.

Clustering Vehicle Re-Identification

TopoTxR: A Topological Biomarker for Predicting Treatment Response in Breast Cancer

1 code implementation13 May 2021 Fan Wang, Saarthak Kapse, Steven Liu, Prateek Prasanna, Chao Chen

Characterization of breast parenchyma on dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) is a challenging task owing to the complexity of underlying tissue structures.

A Unified Pre-training Framework for Conversational AI

1 code implementation6 May 2021 Siqi Bao, Bingjin Chen, Huang He, Xin Tian, Han Zhou, Fan Wang, Hua Wu, Haifeng Wang, Wenquan Wu, Yingzhan Lin

In this work, we explore the application of PLATO-2 on various dialogue systems, including open-domain conversation, knowledge grounded dialogue, and task-oriented conversation.

Chatbot Interactive Evaluation of Dialog +1

Augmented Transformer with Adaptive Graph for Temporal Action Proposal Generation

no code implementations30 Mar 2021 Shuning Chang, Pichao Wang, Fan Wang, Hao Li, Jiashi Feng

Temporal action proposal generation (TAPG) is a fundamental and challenging task in video understanding, especially in temporal action detection.

Action Detection Temporal Action Proposal Generation +1

Molecular Representation Learning by Leveraging Chemical Information

1 code implementation NA 2021 Weibin Li, Shanzhuo Zhang, Lihang Liu, Zhengjie Huang, Jieqiong Lei, Xiaomin Fang, Shikun Feng, Fan Wang

As graph neural networks have achieved great success in many domains, some studies apply graph neural networks to molecular property prediction and regard each molecule as a graph.

Graph Property Prediction Molecular Property Prediction +3

Intelligent Electric Vehicle Charging Recommendation Based on Multi-Agent Reinforcement Learning

1 code implementation15 Feb 2021 Weijia Zhang, Hao liu, Fan Wang, Tong Xu, Haoran Xin, Dejing Dou, Hui Xiong

Electric Vehicle (EV) has become a preferable choice in the modern transportation system due to its environmental and energy sustainability.

Multi-agent Reinforcement Learning reinforcement-learning +1

Learning to Select External Knowledge with Multi-Scale Negative Sampling

1 code implementation3 Feb 2021 Huang He, Hua Lu, Siqi Bao, Fan Wang, Hua Wu, ZhengYu Niu, Haifeng Wang

The Track-1 of DSTC9 aims to effectively answer user requests or questions during task-oriented dialogues, which are out of the scope of APIs/DB.

Response Generation

1st Place Solution to ECCV-TAO-2020: Detect and Represent Any Object for Tracking

1 code implementation20 Jan 2021 Fei Du, Bo Xu, Jiasheng Tang, Yuqi Zhang, Fan Wang, Hao Li

We extend the classical tracking-by-detection paradigm to this tracking-any-object task.

Ranked #7 on Multi-Object Tracking on TAO (using extra training data)

Multi-Object Tracking Object

Multi-object Tracking with a Hierarchical Single-branch Network

no code implementations6 Jan 2021 Fan Wang, Lei Luo, En Zhu, Siwei Wang, Jun Long

Recent Multiple Object Tracking (MOT) methods have gradually attempted to integrate object detection and instance re-identification (Re-ID) into a united network to form a one-stage solution.

Multi-Object Tracking Multiple Object Tracking +4

1st Place Solution to VisDA-2020: Bias Elimination for Domain Adaptive Pedestrian Re-identification

1 code implementation25 Dec 2020 Jianyang Gu, Hao Luo, Weihua Chen, Yiqi Jiang, Yuqi Zhang, Shuting He, Fan Wang, Hao Li, Wei Jiang

Considering the large gap between the source domain and target domain, we focused on solving two biases that influenced the performance on domain adaptive pedestrian Re-ID and proposed a two-stage training procedure.

Domain Adaptation Pseudo Label

Besov and Triebel-Lizorkin Spaces on Spaces of Homogeneous Type with Applications to Boundedness of Calderón-Zygmund Operators

no code implementations24 Dec 2020 Fan Wang, Yongsheng Han, Ziyi He, Dachun Yang

In this article, the authors introduce Besov and Triebel-Lizorkin spaces on spaces of homogeneous type in the sense of Coifman and Weiss, prove that these (in)homogeneous Besov and Triebel-Lizorkin spaces are independent of the choices of both exp-ATIs (or exp-IATIs) and underlying spaces of distributions, and give some basic properties of these spaces.

Functional Analysis Analysis of PDEs Classical Analysis and ODEs Primary 46E35, Secondary 42B25, 42B20, 42B35, 30L99

Distance-aware Molecule Graph Attention Network for Drug-Target Binding Affinity Prediction

1 code implementation17 Dec 2020 Jingbo Zhou, Shuangli Li, Liang Huang, Haoyi Xiong, Fan Wang, Tong Xu, Hui Xiong, Dejing Dou

The hierarchical attentive aggregation can capture spatial dependencies among atoms, as well as fuse the position-enhanced information with the capability of discriminating multiple spatial relations among atoms.

Drug Discovery Graph Attention +2

Heterochromatic nonlinear optical responses in upconversion nanoparticles for point spread function engineering

no code implementations12 Dec 2020 Chaohao Chen, Baolei Liu, Yongtao Liu, Jiayan Liao, Xuchen Shan, Fan Wang, Dayong Jin

Point spread function (PSF) engineering of the emitter can code higher spatial frequency information of an image to break diffraction limit but suffer from the complexed optical systems.

Optics

Boosting Image Super-Resolution Via Fusion of Complementary Information Captured by Multi-Modal Sensors

no code implementations7 Dec 2020 Fan Wang, Jiangxin Yang, Yanlong Cao, Yanpeng Cao, Michael Ying Yang

Image Super-Resolution (SR) provides a promising technique to enhance the image quality of low-resolution optical sensors, facilitating better-performing target detection and autonomous navigation in a wide range of robotics applications.

3D Reconstruction Autonomous Navigation +1

Infrared small target detection based on isotropic constraint under complex background

no code implementations24 Nov 2020 Fan Wang

Infrared search and tracking (IRST) system has been widely concerned and applied in the area of national defence.

Neural Video Coding using Multiscale Motion Compensation and Spatiotemporal Context Model

no code implementations9 Jul 2020 Haojie Liu, Ming Lu, Zhan Ma, Fan Wang, Zhihuang Xie, Xun Cao, Yao Wang

Over the past two decades, traditional block-based video coding has made remarkable progress and spawned a series of well-known standards such as MPEG-4, H. 264/AVC and H. 265/HEVC.

Motion Compensation MS-SSIM +2

SUPER: A Novel Lane Detection System

no code implementations14 May 2020 Pingping Lu, Chen Cui, Shaobing Xu, Huei Peng, Fan Wang

AI-based lane detection algorithms were actively studied over the last few years.

Lane Detection Scene Understanding +1

PSDet: Efficient and Universal Parking Slot Detection

no code implementations12 May 2020 Zizhang Wu, Weiwei Sun, Man Wang, Xiaoquan Wang, Lizhu Ding, Fan Wang

\romannumeral2, Expert knowledge for parking slot detection is under-estimated.

A Quantitative Analytical Model for Predicting and Optimizing the Rate Performance of Battery Cells

1 code implementation20 Apr 2020 Fan Wang, Ming Tang

An important objective of designing lithium-ion rechargeable battery cells is to maximize their rate performance without compromising the energy density, which is mainly achieved through computationally expensive numerical simulations at present.

Materials Science Applied Physics

Efficient and Robust Reinforcement Learning with Uncertainty-based Value Expansion

no code implementations10 Dec 2019 Bo Zhou, Hongsheng Zeng, Fan Wang, Yunxiang Li, Hao Tian

By integrating dynamics models into model-free reinforcement learning (RL) methods, model-based value expansion (MVE) algorithms have shown a significant advantage in sample efficiency as well as value estimation.

reinforcement-learning Reinforcement Learning (RL)

MBCAL: Sample Efficient and Variance Reduced Reinforcement Learning for Recommender Systems

no code implementations6 Nov 2019 Fan Wang, Xiaomin Fang, Lihang Liu, Hao Tian, Zhiming Peng

The proposed method takes advantage of the characteristics of recommender systems and draws ideas from the model-based reinforcement learning method for higher sample efficiency.

counterfactual Model-based Reinforcement Learning +3

Risk Averse Value Expansion for Sample Efficient and Robust Policy Learning

no code implementations25 Sep 2019 Bo Zhou, Fan Wang, Hongsheng Zeng, Hao Tian

A promising direction is to combine model-based reinforcement learning with model-free reinforcement learning, such as model-based value expansion(MVE).

Model-based Reinforcement Learning reinforcement-learning +1

Hyperspectral City V1.0 Dataset and Benchmark

no code implementations24 Jul 2019 Shaodi You, Erqi Huang, Shuaizhe Liang, Yongrong Zheng, Yunxiang Li, Fan Wang, Sen Lin, Qiu Shen, Xun Cao, Diming Zhang, Yuanjiang Li, Yu Li, Ying Fu, Boxin Shi, Feng Lu, Yinqiang Zheng, Robby T. Tan

This document introduces the background and the usage of the Hyperspectral City Dataset and the benchmark.

Generating Multiple Diverse Responses with Multi-Mapping and Posterior Mapping Selection

1 code implementation5 Jun 2019 Chaotao Chen, Jinhua Peng, Fan Wang, Jun Xu, Hua Wu

In this paper, we propose a multi-mapping mechanism to better capture the one-to-many relationship, where multiple mapping modules are employed as latent mechanisms to model the semantic mappings from an input post to its diverse responses.

Know More about Each Other: Evolving Dialogue Strategy via Compound Assessment

1 code implementation ACL 2019 Siqi Bao, Huang He, Fan Wang, Rongzhong Lian, Hua Wu

In this paper, a novel Generation-Evaluation framework is developed for multi-turn conversations with the objective of letting both participants know more about each other.

Informativeness

Learning to Select Knowledge for Response Generation in Dialog Systems

1 code implementation13 Feb 2019 Rongzhong Lian, Min Xie, Fan Wang, Jinhua Peng, Hua Wu

Specifically, a posterior distribution over knowledge is inferred from both utterances and responses, and it ensures the appropriate selection of knowledge during the training process.

Response Generation

Sequential Evaluation and Generation Framework for Combinatorial Recommender System

1 code implementation1 Feb 2019 Fan Wang, Xiaomin Fang, Lihang Liu, Yaxue Chen, Jiucheng Tao, Zhiming Peng, Cihang Jin, Hao Tian

On the one hand of this framework, an evaluation model is trained to evaluate the expected overall utility, by fully considering the user, item information and the correlations among the co-exposed items.

Recommendation Systems

Seeds Cleansing CNMF for Spatiotemporal Neural Signals Extraction of Miniscope Imaging Data

1 code implementation3 Apr 2017 Jinghao Lu, Chunyuan Li, Fan Wang

Miniscope calcium imaging is increasingly being used to monitor large populations of neuronal activities in freely behaving animals.

Neurons and Cognition Quantitative Methods

3D-Assisted Feature Synthesis for Novel Views of an Object

no code implementations ICCV 2015 Hao Su, Fan Wang, Eric Yi, Leonidas J. Guibas

Comparing two images from different views has been a long-standing challenging problem in computer vision, as visual features are not stable under large view point changes.

Image Retrieval Object +1

Integrating Dashcam Views Through Inter-Video Mapping

no code implementations ICCV 2015 Hsin-I Chen, Yi-Ling Chen, Wei-Tse Lee, Fan Wang, Bing-Yu Chen

In this paper, an inter-video mapping approach is proposed to integrate video footages from two dashcams installed on a preceding and its following vehicle to provide the illusion that the driver of the following vehicle can see-through the preceding one.

Motion Estimation

3D-Assisted Image Feature Synthesis for Novel Views of an Object

no code implementations26 Nov 2014 Hao Su, Fan Wang, Li Yi, Leonidas Guibas

In this paper, given a single input image of an object, we synthesize new features for other views of the same object.

Image Retrieval Object +1

Unsupervised Multi-Class Joint Image Segmentation

no code implementations CVPR 2014 Fan Wang, Qi-Xing Huang, Maks Ovsjanikov, Leonidas J. Guibas

Joint segmentation of image sets is a challenging problem, especially when there are multiple objects with variable appearance shared among the images in the collection and the set of objects present in each particular image is itself varying and unknown.

Image Segmentation Segmentation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.