Search Results for author: Fan Wang

Found 185 papers, 80 papers with code

New Threats against Object Detector with Non-local Block

no code implementations ECCV 2020 Yi Huang, Fan Wang, Adams Wai-Kin Kong, Kwok-Yan Lam

The experiments show that the universal patches are able to mislead the detector with greater probabilities.

Object

PLATO-KAG: Unsupervised Knowledge-Grounded Conversation via Joint Modeling

no code implementations EMNLP (NLP4ConvAI) 2021 Xinxian Huang, Huang He, Siqi Bao, Fan Wang, Hua Wu, Haifeng Wang

Large-scale conversation models are turning to leveraging external knowledge to improve the factual accuracy in response generation.

Response Generation

MentalGLM Series: Explainable Large Language Models for Mental Health Analysis on Chinese Social Media

no code implementations14 Oct 2024 Wei Zhai, Nan Bai, Qing Zhao, Jianqiang Li, Fan Wang, Hongzhi Qi, Meng Jiang, Xiaoqin Wang, Bing Xiang Yang, Guanghui Fu

The proposed models were evaluated on three downstream tasks and achieved better or comparable performance compared to deep learning models, generalized LLMs, and task fine-tuned LLMs.

Deeper Insights Without Updates: The Power of In-Context Learning Over Fine-Tuning

1 code implementation7 Oct 2024 Qingyu Yin, Xuzheng He, Luoao Deng, Chak Tou Leong, Fan Wang, Yanzhao Yan, Xiaoyu Shen, Qiang Zhang

Fine-tuning and in-context learning (ICL) are two prevalent methods in imbuing large language models with task-specific knowledge.

In-Context Learning

Dynamic Diffusion Transformer

1 code implementation4 Oct 2024 Wangbo Zhao, Yizeng Han, Jiasheng Tang, Kai Wang, Yibing Song, Gao Huang, Fan Wang, Yang You

In addition, we design a Spatial-wise Dynamic Token (SDT) strategy to avoid redundant computation at unnecessary spatial locations.

Image Generation

AnyLogo: Symbiotic Subject-Driven Diffusion System with Gemini Status

no code implementations26 Sep 2024 Jinghao Zhang, Wen Qian, Hao Luo, Fan Wang, Feng Zhao

Diffusion models have made compelling progress on facilitating high-throughput daily production.

Denoising Image Generation

RealisHuman: A Two-Stage Approach for Refining Malformed Human Parts in Generated Images

no code implementations5 Sep 2024 Benzhi Wang, Jingkai Zhou, Jingqi Bai, Yang Yang, Weihua Chen, Fan Wang, Zhen Lei

First, it generates realistic human parts, such as hands or faces, using the original malformed parts as references, ensuring consistent details with the original image.

LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs

1 code implementation24 Aug 2024 Chansung Park, Juyong Jiang, Fan Wang, Sayak Paul, Jing Tang

The widespread adoption of cloud-based proprietary large language models (LLMs) has introduced significant challenges, including operational dependencies, privacy concerns, and the necessity of continuous internet connectivity.

Language Modelling

A Generic Review of Integrating Artificial Intelligence in Cognitive Behavioral Therapy

no code implementations28 Jul 2024 Meng Jiang, Qing Zhao, Jianqiang Li, Fan Wang, Tianyu He, Xinyan Cheng, Bing Xiang Yang, Grace W. K. Ho, Guanghui Fu

Cognitive Behavioral Therapy (CBT) is a well-established intervention for mitigating psychological issues by modifying maladaptive cognitive and behavioral patterns.

MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence

no code implementations23 Jul 2024 Canyu Zhao, MingYu Liu, Wen Wang, Weihua Chen, Fan Wang, Hao Chen, Bo Zhang, Chunhua Shen

Our approach utilizes autoregressive models for global narrative coherence, predicting sequences of visual tokens that are subsequently transformed into high-quality video frames through diffusion rendering.

Video Generation

Animate3D: Animating Any 3D Model with Multi-view Video Diffusion

no code implementations16 Jul 2024 Yanqin Jiang, Chaohui Yu, Chenjie Cao, Fan Wang, Weiming Hu, Jin Gao

The core idea is two-fold: 1) We propose a novel multi-view video diffusion model (MV-VDM) conditioned on multi-view renderings of the static 3D object, which is trained on our presented large-scale multi-view video dataset (MV-Video).

Exploring the Causality of End-to-End Autonomous Driving

1 code implementation9 Jul 2024 Jiankun Li, Hao Li, JiangJiang Liu, Zhikang Zou, Xiaoqing Ye, Fan Wang, Jizhou Huang, Hua Wu, Haifeng Wang

Deep learning-based models are widely deployed in autonomous driving areas, especially the increasingly noticed end-to-end solutions.

Autonomous Driving counterfactual

VCD-Texture: Variance Alignment based 3D-2D Co-Denoising for Text-Guided Texturing

no code implementations5 Jul 2024 Shang Liu, Chaohui Yu, Chenjie Cao, Wen Qian, Fan Wang

Recent research on texture synthesis for 3D shapes benefits a lot from dramatically developed 2D text-to-image diffusion models, including inpainting-based and optimization-based approaches.

Denoising Texture Synthesis

A Survey on Mixture of Experts

1 code implementation26 Jun 2024 Weilin Cai, Juyong Jiang, Fan Wang, Jing Tang, Sunghun Kim, Jiayi Huang

Large language models (LLMs) have garnered unprecedented advancements across diverse fields, ranging from natural language processing to computer vision and beyond.

In-Context Learning Survey

A Survey on Large Language Models for Code Generation

no code implementations1 Jun 2024 Juyong Jiang, Fan Wang, Jiasi Shen, Sungju Kim, Sunghun Kim

Despite the active exploration of LLMs for a variety of code tasks, either from the perspective of natural language processing (NLP) or software engineering (SE) or both, there is a noticeable absence of a comprehensive and up-to-date literature review dedicated to LLM for code generation.

Code Generation HumanEval +1

Inference-Time Alignment of Diffusion Models with Direct Noise Optimization

no code implementations29 May 2024 Zhiwei Tang, Jiangweizhi Peng, Jiasheng Tang, Mingyi Hong, Fan Wang, Tsung-Hui Chang

In this work, we focus on the alignment problem of diffusion models with a continuous reward function, which represents specific objectives for downstream tasks, such as increasing darkness or improving the aesthetics of images.

Benchmarking General-Purpose In-Context Learning

1 code implementation27 May 2024 Fan Wang, Chuan Lin, Yang Cao, Yu Kang

In-context learning (ICL) empowers generative models to address new tasks effectively and efficiently on the fly, without relying on any artificially crafted optimization techniques.

Benchmarking Decision Making +5

Certified $\ell_2$ Attribution Robustness via Uniformly Smoothed Attributions

no code implementations10 May 2024 Fan Wang, Adams Wai-Kin Kong

Model attribution is a popular tool to explain the rationales behind model predictions.

SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer

1 code implementation4 Apr 2024 Zijie Wu, Chaohui Yu, Yanqin Jiang, Chenjie Cao, Fan Wang, Xiang Bai

Recent advances in 2D/3D generative models enable the generation of dynamic 3D objects from a single-view video.

motion prediction

Uncovering the Text Embedding in Text-to-Image Diffusion Models

no code implementations1 Apr 2024 Hu Yu, Hao Luo, Fan Wang, Feng Zhao

The correspondence between input text and the generated image exhibits opacity, wherein minor textual modifications can induce substantial deviations in the generated image.

XScale-NVS: Cross-Scale Novel View Synthesis with Hash Featurized Manifold

no code implementations CVPR 2024 Guangyu Wang, Jinzhi Zhang, Fan Wang, Ruqi Huang, Lu Fang

We also introduce a novel dataset, namely GigaNVS, to benchmark cross-scale, high-resolution novel view synthesis of realworld large-scale scenes.

Neural Rendering Novel View Synthesis

Text Data-Centric Image Captioning with Interactive Prompts

no code implementations28 Mar 2024 Yiyu Wang, Hao Luo, Jungang Xu, Yingfei Sun, Fan Wang

Among them, the mainstream solution is to project image embeddings into the text embedding space with the assistance of consistent representations between image-text pairs from the CLIP model.

Image Captioning

Learning-based Multi-continuum Model for Multiscale Flow Problems

no code implementations21 Mar 2024 Fan Wang, Yating Wang, Wing Tat Leung, Zongben Xu

Multiscale problems can usually be approximated through numerical homogenization by an equation with some effective parameters that can capture the macroscopic behavior of the original system on the coarse grid to speed up the simulation.

Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation

1 code implementation18 Mar 2024 Wangbo Zhao, Jiasheng Tang, Yizeng Han, Yibing Song, Kai Wang, Gao Huang, Fan Wang, Yang You

Existing parameter-efficient fine-tuning (PEFT) methods have achieved significant success on vision transformers (ViTs) adaptation by improving parameter efficiency.

parameter-efficient fine-tuning Semantic Segmentation +1

Neural radiance fields-based holography [Invited]

no code implementations2 Mar 2024 Minsung Kang, Fan Wang, Kai Kumano, Tomoyoshi Ito, Tomoyoshi Shimobaba

NeRF is a state-of-the-art technique for 3D light-field reconstruction from 2D images based on volume rendering.

Accelerating Parallel Sampling of Diffusion Models

1 code implementation15 Feb 2024 Zhiwei Tang, Jiasheng Tang, Hao Luo, Fan Wang, Tsung-Hui Chang

Our experiments demonstrate that ParaTAA can decrease the inference steps required by common sequential sampling algorithms such as DDIM and DDPM by a factor of 4$\sim$14 times.

Image Generation

Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach

1 code implementation28 Jan 2024 Shaofeng Zhang, Jinfa Huang, Qiang Zhou, Zhibin Wang, Fan Wang, Jiebo Luo, Junchi Yan

At inference, we generate images with arbitrary expansion multiples by inputting an anchor image and its corresponding positional embeddings.

Image Outpainting

DMT: Comprehensive Distillation with Multiple Self-supervised Teachers

no code implementations19 Dec 2023 Yuang Liu, Jing Wang, Qiang Zhou, Fan Wang, Jun Wang, Wei zhang

Numerous self-supervised learning paradigms, such as contrastive learning and masked image modeling, have been proposed to acquire powerful and general representations from unlabeled data.

Contrastive Learning Model Compression +1

CL2CM: Improving Cross-Lingual Cross-Modal Retrieval via Cross-Lingual Knowledge Transfer

no code implementations14 Dec 2023 Yabing Wang, Fan Wang, Jianfeng Dong, Hao Luo

Cross-lingual cross-modal retrieval has garnered increasing attention recently, which aims to achieve the alignment between vision and target language (V-T) without using any annotated V-T data pairs.

Cross-Lingual Transfer Cross-Modal Retrieval +4

Towards a Psychological Generalist AI: A Survey of Current Applications of Large Language Models and Future Prospects

no code implementations1 Dec 2023 Tianyu He, Guanghui Fu, Yijing Yu, Fan Wang, Jianqiang Li, Qing Zhao, Changwei Song, Hongzhi Qi, Dan Luo, Huijing Zou, Bing Xiang Yang

The complexity of psychological principles underscore a significant societal challenge, given the vast social implications of psychological problems.

Language-guided Few-shot Semantic Segmentation

no code implementations23 Nov 2023 Jing Wang, Yuang Liu, Qiang Zhou, Fan Wang

Few-shot learning is a promising way for reducing the label cost in new categories adaptation with the guidance of a small, well labeled support set.

Few-Shot Semantic Segmentation Segmentation +1

OmniSeg3D: Omniversal 3D Segmentation via Hierarchical Contrastive Learning

1 code implementation CVPR 2024 Haiyang Ying, Yixuan Yin, Jinzhi Zhang, Fan Wang, Tao Yu, Ruqi Huang, Lu Fang

Towards holistic understanding of 3D scenes, a general 3D segmentation method is needed that can segment diverse objects without restrictions on object quantity or categories, while also reflecting the inherent hierarchical structure.

Contrastive Learning Novel View Synthesis +1

Pre-Training on Large-Scale Generated Docking Conformations with HelixDock to Unlock the Potential of Protein-ligand Structure Prediction Models

no code implementations21 Oct 2023 Lihang Liu, Shanzhuo Zhang, Donglong He, Xianbin Ye, Jingbo Zhou, Xiaonan Zhang, Yaoyao Jiang, Weiming Diao, Hang Yin, Hua Chai, Fan Wang, Jingzhou He, Liang Zheng, Yonghui Li, Xiaomin Fang

In this work, we show that by pre-training on a large-scale docking conformation generated by traditional physics-based docking tools and then fine-tuning with a limited set of experimentally validated receptor-ligand complexes, we can obtain a protein-ligand structure prediction model with outstanding performance.

Drug Discovery Molecular Docking

SingleInsert: Inserting New Concepts from a Single Image into Text-to-Image Models for Flexible Editing

no code implementations12 Oct 2023 Zijie Wu, Chaohui Yu, Zhen Zhu, Fan Wang, Xiang Bai

To utilize the abundant visual priors in the off-the-shelf T2I models, a series of methods try to invert an image to proper embedding that aligns with the semantic space of the T2I model.

Image Generation Novel View Synthesis

SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels

2 code implementations15 Sep 2023 Henry Hengyuan Zhao, Pichao Wang, Yuyang Zhao, Hao Luo, Fan Wang, Mike Zheng Shou

Experiments on 19 visual transfer learning downstream tasks demonstrate that our SCT outperforms full fine-tuning on 18 out of 19 tasks by adding only 0. 11M parameters of the ViT-B, which is 780$\times$ fewer than its full fine-tuning counterpart.

Domain Generalization Few-Shot Learning +2

Viewpoint Integration and Registration with Vision Language Foundation Model for Image Change Understanding

no code implementations15 Sep 2023 Xiaonan Lu, Jianlong Yuan, Ruigang Niu, Yuan Hu, Fan Wang

Therefore, they cannot be directly applied to cope with image change understanding (ICU), which requires models to capture actual changes between multiple images and describe them in language.

Temporal compressive edge imaging enabled by a lensless diffuser camera

no code implementations13 Sep 2023 Ze Zheng, Baolei Liu, Jiaqi Song, Lei Ding, Xiaolan Zhong, David Mcgloin, Fan Wang

Lensless imagers based on diffusers or encoding masks enable high-dimensional imaging from a single shot measurement and have been applied in various applications.

Edge Detection

DiffAug: Enhance Unsupervised Contrastive Learning with Domain-Knowledge-Free Diffusion-based Data Augmentation

1 code implementation10 Sep 2023 Zelin Zang, Hao Luo, Kai Wang, Panpan Zhang, Fan Wang, Stan. Z Li, Yang You

With the help of iterative training of the semantic encoder and diffusion model, DiffAug improves the representation ability in an uninterrupted and unsupervised manner.

Contrastive Learning Data Augmentation +2

Punctate White Matter Lesion Segmentation in Preterm Infants Powered by Counterfactually Generative Learning

no code implementations7 Sep 2023 Zehua Ren, Yongheng Sun, Miaomiao Wang, Yuying Feng, Xianjun Li, Chao Jin, Jian Yang, Chunfeng Lian, Fan Wang

In this paper, we propose to leverage the idea of counterfactual reasoning coupled with the auxiliary task of brain tissue segmentation to learn fine-grained positional and morphological representations of PWMLs for accurate localization and segmentation.

counterfactual Counterfactual Reasoning +2

Region Generation and Assessment Network for Occluded Person Re-Identification

no code implementations7 Sep 2023 Shuting He, Weihua Chen, Kai Wang, Hao Luo, Fan Wang, Wei Jiang, Henghui Ding

Then, to measure the importance of each generated region, we introduce a Region Assessment Module (RAM) that assigns confidence scores to different regions and reduces the negative impact of the occlusion regions by lower scores.

Occluded Person Re-Identification

Forensic Histopathological Recognition via a Context-Aware MIL Network Powered by Self-Supervised Contrastive Learning

no code implementations27 Aug 2023 Chen Shen, Jun Zhang, Xinggong Liang, Zeyi Hao, Kehan Li, Fan Wang, Zhenyuan Wang, Chunfeng Lian

Forensic pathology is critical in analyzing death manner and time from the microscopic aspect to assist in the establishment of reliable factual bases for criminal investigation.

Contrastive Learning Domain Generalization +3

Graph-Segmenter: Graph Transformer with Boundary-aware Attention for Semantic Segmentation

no code implementations15 Aug 2023 Zizhang Wu, Yuanzhu Gan, Tianhao Xu, Fan Wang

To address this issue, we propose a Graph-Segmenter, including a Graph Transformer and a Boundary-aware Attention module, which is an effective network for simultaneously modeling the more profound relation between windows in a global view and various pixels inside each window as a local one, and for substantial low-cost boundary adjustment.

Relation Segmentation +1

Revisiting Vision Transformer from the View of Path Ensemble

no code implementations ICCV 2023 Shuning Chang, Pichao Wang, Hao Luo, Fan Wang, Mike Zheng Shou

Therefore, we propose the path pruning and EnsembleScale skills for improvement, which cut out the underperforming paths and re-weight the ensemble components, respectively, to optimize the path combination and make the short paths focus on providing high-quality representation for subsequent paths.

Dynamic Token-Pass Transformers for Semantic Segmentation

no code implementations3 Aug 2023 Yuang Liu, Qiang Zhou, Jing Wang, Fan Wang, Jun Wang, Wei zhang

Vision transformers (ViT) usually extract features via forwarding all the tokens in the self-attention layers from top to toe.

Segmentation Semantic Segmentation

RegionBLIP: A Unified Multi-modal Pre-training Framework for Holistic and Regional Comprehension

1 code implementation3 Aug 2023 Qiang Zhou, Chaohui Yu, Shaofeng Zhang, Sitong Wu, Zhibing Wang, Fan Wang

To this end, we propose to extract features corresponding to regional objects as soft prompts for LLM, which provides a straightforward and scalable approach and eliminates the need for LLM fine-tuning.

Image Comprehension

Improved Neural Radiance Fields Using Pseudo-depth and Fusion

no code implementations27 Jul 2023 Jingliang Li, Qiang Zhou, Chaohui Yu, Zhengda Lu, Jun Xiao, Zhibin Wang, Fan Wang

To make the constructed volumes as close as possible to the surfaces of objects in the scene and the rendered depth more accurate, we propose to perform depth prediction and radiance field reconstruction simultaneously.

Depth Estimation Depth Prediction +1

Points-to-3D: Bridging the Gap between Sparse Points and Shape-Controllable Text-to-3D Generation

no code implementations26 Jul 2023 Chaohui Yu, Qiang Zhou, Jingliang Li, Zhe Zhang, Zhibin Wang, Fan Wang

To better utilize the sparse 3D points, we propose an efficient point cloud guidance loss to adaptively drive the NeRF's geometry to align with the shape of the sparse 3D points.

3D Generation Text to 3D

Graph Convolution Based Efficient Re-Ranking for Visual Retrieval

1 code implementation15 Jun 2023 Yuqi Zhang, Qi Qian, Hongsong Wang, Chong Liu, Weihua Chen, Fan Wang

In particular, the plain GCR is extended for cross-camera retrieval and an improved feature propagation formulation is presented to leverage affinity relationships across different cameras.

Distributed Computing Image Retrieval +3

Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal Contrastive Training

1 code implementation15 Jun 2023 Chong Liu, Yuqi Zhang, Hongsong Wang, Weihua Chen, Fan Wang, Yan Huang, Yi-Dong Shen, Liang Wang

Most previous works either simply learn coarse-grained representations of the overall image and text, or elaborately establish the correspondence between image regions or pixels and text words.

Image-text Retrieval Representation Learning +1

SwinRDM: Integrate SwinRNN with Diffusion Model towards High-Resolution and High-Quality Weather Forecasting

no code implementations5 Jun 2023 Lei Chen, Fei Du, Yuan Hu, Fan Wang, Zhibin Wang

Recurrent predictions for future atmospheric fields are firstly performed at 1. 40625-degree resolution, and then a diffusion-based super-resolution model is leveraged to recover the high spatial resolution and finer-scale atmospheric details.

Super-Resolution Weather Forecasting

MetaModulation: Learning Variational Feature Hierarchies for Few-Shot Learning with Fewer Tasks

1 code implementation17 May 2023 Wenfang Sun, Yingjun Du, XianTong Zhen, Fan Wang, Ling Wang, Cees G. M. Snoek

To account for the uncertainty caused by the limited training tasks, we propose a variational MetaModulation where the modulation parameters are treated as latent variables.

Diversity Few-Shot Learning

NIKI: Neural Inverse Kinematics with Invertible Neural Networks for 3D Human Pose and Shape Estimation

1 code implementation CVPR 2023 Jiefeng Li, Siyuan Bian, Qi Liu, Jiasheng Tang, Fan Wang, Cewu Lu

In this work, we present NIKI (Neural Inverse Kinematics with Invertible Neural Network), which models bi-directional errors to improve the robustness to occlusions and obtain pixel-aligned accuracy.

3D human pose and shape estimation

UniNeXt: Exploring A Unified Architecture for Vision Recognition

1 code implementation26 Apr 2023 Fangjian Lin, Jianlong Yuan, Sitong Wu, Fan Wang, Zhibin Wang

Interestingly, the ranking of these spatial token mixers also changes under our UniNeXt, suggesting that an excellent spatial token mixer may be stifled due to a suboptimal general architecture, which further shows the importance of the study on the general architecture of vision backbone.

Spatial Token Mixer

DOAD: Decoupled One Stage Action Detection Network

no code implementations1 Apr 2023 Shuning Chang, Pichao Wang, Fan Wang, Jiashi Feng, Mike Zheng Show

Specifically, one branch focuses on detection representation for actor detection, and the other one for action recognition.

Action Detection Action Recognition +1

Beyond Appearance: a Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual Tasks

4 code implementations CVPR 2023 Weihua Chen, Xianzhe Xu, Jian Jia, Hao Luo, Yaohua Wang, Fan Wang, Rong Jin, Xiuyu Sun

Unlike the existing self-supervised learning methods, prior knowledge from human images is utilized in SOLIDER to build pseudo semantic labels and import more semantic information into the learned representation.

Human Parsing Pedestrian Attribute Recognition +6

ARMBench: An Object-centric Benchmark Dataset for Robotic Manipulation

no code implementations29 Mar 2023 Chaitanya Mitash, Fan Wang, Shiyang Lu, Vikedo Terhuja, Tyler Garaas, Felipe Polido, Manikantan Nambi

This paper introduces Amazon Robotic Manipulation Benchmark (ARMBench), a large-scale, object-centric benchmark dataset for robotic manipulation in the context of a warehouse.

Defect Detection Instance Segmentation +2

EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation

2 code implementations22 Mar 2023 Hansheng Chen, Wei Tian, Pichao Wang, Fan Wang, Lu Xiong, Hao Li

In this paper, we propose the EPro-PnP, a probabilistic PnP layer for general end-to-end pose estimation, which outputs a distribution of pose with differentiable probability density on the SE(3) manifold.

3D Object Detection 6D Pose Estimation using RGB +1

Making Vision Transformers Efficient from A Token Sparsification View

1 code implementation CVPR 2023 Shuning Chang, Pichao Wang, Ming Lin, Fan Wang, David Junhao Zhang, Rong Jin, Mike Zheng Shou

In this work, we propose a novel Semantic Token ViT (STViT), for efficient global and local vision transformers, which can also be revised to serve as backbone for downstream tasks.

Efficient ViTs Instance Segmentation +4

Revisit Parameter-Efficient Transfer Learning: A Two-Stage Paradigm

no code implementations14 Mar 2023 Hengyuan Zhao, Hao Luo, Yuyang Zhao, Pichao Wang, Fan Wang, Mike Zheng Shou

In view of the practicality of PETL, previous works focus on tuning a small set of parameters for each downstream task in an end-to-end manner while rarely considering the task distribution shift issue between the pre-training task and the downstream task.

Transfer Learning Vocal Bursts Valence Prediction

Time series anomaly detection with reconstruction-based state-space models

1 code implementation6 Mar 2023 Fan Wang, Keli Wang, Boyu Yao

In this work, we propose a novel unsupervised anomaly detection method for time series data.

Decoder State Space Models +3

A Practical Upper Bound for the Worst-Case Attribution Deviations

no code implementations CVPR 2023 Fan Wang, Adams Wai-Kin Kong

Model attribution is a critical component of deep neural networks (DNNs) for its interpretability to complex models.

D2Q-DETR: Decoupling and Dynamic Queries for Oriented Object Detection with Transformers

no code implementations1 Mar 2023 Qiang Zhou, Chaohui Yu, Zhibin Wang, Fan Wang

In this paper, we propose an end-to-end framework for oriented object detection, which simplifies the model pipeline and obtains superior performance.

Decoder Object +4

LMSeg: Language-guided Multi-dataset Segmentation

no code implementations27 Feb 2023 Qiang Zhou, Yuang Liu, Chaohui Yu, Jingliang Li, Zhibin Wang, Fan Wang

Instead of relabeling each dataset with the unified taxonomy, a category-guided decoding module is designed to dynamically guide predictions to each datasets taxonomy.

Image Augmentation Panoptic Segmentation +1

Dual-mode adaptive-SVD ghost imaging

no code implementations14 Feb 2023 Dajing Wang, Baolei Liu, Jiaqi Song, Yao Wang, Xuchen Shan, Fan Wang

In this paper, we present a dual-mode adaptive singular value decomposition ghost imaging (A-SVD GI), which can be easily switched between the modes of imaging and edge detection.

Edge Detection

Head-Free Lightweight Semantic Segmentation with Linear Transformer

1 code implementation11 Jan 2023 Bo Dong, Pichao Wang, Fan Wang

On the ADE20K dataset, our model achieves 41. 8 mIoU and 4. 6 GFLOPs, which is 4. 4 mIoU higher than Segformer, with 45% less GFLOPs.

Decoder Segmentation +1

NeuroExplainer: Fine-Grained Attention Decoding to Uncover Cortical Development Patterns of Preterm Infants

no code implementations1 Jan 2023 Chenyu Xue, Fan Wang, Yuanzhuo Zhu, Hui Li, Deyu Meng, Dinggang Shen, Chunfeng Lian

Deploying reliable deep learning techniques in interdisciplinary applications needs learned models to output accurate and (even more importantly) explainable predictions.

MHPL: Minimum Happy Points Learning for Active Source Free Domain Adaptation

no code implementations CVPR 2023 Fan Wang, Zhongyi Han, Zhiyan Zhang, Rundong He, Yilong Yin

Source free domain adaptation (SFDA) aims to transfer a trained source model to the unlabeled target domain without accessing the source data.

Active Learning Source-Free Domain Adaptation

Query Enhanced Knowledge-Intensive Conversation via Unsupervised Joint Modeling

1 code implementation19 Dec 2022 Mingzhu Cai, Siqi Bao, Xin Tian, Huang He, Fan Wang, Hua Wu

In this paper, we propose an unsupervised query enhanced approach for knowledge-intensive conversations, namely QKConv.

Conversational Question Answering Retrieval

CAT: Learning to Collaborate Channel and Spatial Attention from Multi-Information Fusion

no code implementations13 Dec 2022 Zizhang Wu, Man Wang, Weiwei Sun, Yuchen Li, Tianhao Xu, Fan Wang, Keke Huang

Channel and spatial attention mechanism has proven to provide an evident performance boost of deep convolution neural networks (CNNs).

Image Classification Instance Segmentation +3

Surround-view Fisheye BEV-Perception for Valet Parking: Dataset, Baseline and Distortion-insensitive Multi-task Framework

no code implementations8 Dec 2022 Zizhang Wu, Yuanzhu Gan, Xianzhi Li, Yunzhe Wu, Xiaoquan Wang, Tianhao Xu, Fan Wang

Most existing networks based on public datasets may generalize suboptimal results on these valet parking scenes, also affected by the fisheye distortion.

Autonomous Driving

Complete Solution for Vehicle Re-ID in Surround-view Camera System

no code implementations8 Dec 2022 Zizhang Wu, Tianhao Xu, Fan Wang, Xiaoquan Wang, Jing Song

Vehicle re-identification (Re-ID) is a critical component of the autonomous driving perception system, and research in this area has accelerated in recent years.

Autonomous Driving Vehicle Re-Identification

A Unified Multimodal De- and Re-coupling Framework for RGB-D Motion Recognition

1 code implementation16 Nov 2022 Benjia Zhou, Pichao Wang, Jun Wan, Yanyan Liang, Fan Wang

Although improving motion recognition to some extent, these methods still face sub-optimal situations in the following aspects: (i) Data augmentation, i. e., the scale of the RGB-D datasets is still limited, and few efforts have been made to explore novel data augmentation strategies for videos; (ii) Optimization mechanism, i. e., the tightly space-time-entangled network structure brings more challenges to spatiotemporal information modeling; And (iii) cross-modal knowledge fusion, i. e., the high similarity between multimodal representations caused to insufficient late fusion.

Action Recognition Data Augmentation +2

PLATO-K: Internal and External Knowledge Enhanced Dialogue Generation

no code implementations2 Nov 2022 Siqi Bao, Huang He, Jun Xu, Hua Lu, Fan Wang, Hua Wu, Han Zhou, Wenquan Wu, Zheng-Yu Niu, Haifeng Wang

Recently, the practical deployment of open-domain dialogue systems has been plagued by the knowledge issue of information deficiency and factual inaccuracy.

Dialogue Generation Memorization +1

VTC-LFC: Vision Transformer Compression with Low-Frequency Components

1 code implementation NIPS 2022 Zhenyu Wang, Hao Luo, Pichao Wang, Feng Ding, Fan Wang, Hao Li

Although Vision transformers (ViTs) have recently dominated many vision tasks, deploying ViT models on resource-limited devices remains a challenging problem.

Behavioral Intention Prediction in Driving Scenes: A Survey

no code implementations1 Nov 2022 Jianwu Fang, Fan Wang, Jianru Xue, Tat-Seng Chua

Behavioral Intention Prediction (BIP) simulates such a human consideration process and fulfills the early prediction of specific behaviors.

Survey Trajectory Prediction

Q-TOD: A Query-driven Task-oriented Dialogue System

1 code implementation14 Oct 2022 Xin Tian, Yingzhan Lin, Mengfei Song, Siqi Bao, Fan Wang, Huang He, Shuqi Sun, Hua Wu

Firstly, as the query is in the form of natural language and not confined to the schema of the knowledge base, the issue of domain adaption is alleviated remarkably in Q-TOD.

Domain Adaptation Response Generation +2

Effective Vision Transformer Training: A Data-Centric Perspective

no code implementations29 Sep 2022 Benjia Zhou, Pichao Wang, Jun Wan, Yanyan Liang, Fan Wang

To achieve these two purposes, we propose a novel data-centric ViT training framework to dynamically measure the ``difficulty'' of training samples and generate ``effective'' samples for models at different training stages.

FAKD: Feature Augmented Knowledge Distillation for Semantic Segmentation

1 code implementation30 Aug 2022 Jianlong Yuan, Qian Qi, Fei Du, Zhibin Wang, Fan Wang, Yifan Liu

Inspired by the recent progress on semantic directions on feature-space, we propose to include augmentations in feature space for efficient distillation.

Knowledge Distillation Segmentation +1

Towards Boosting the Open-Domain Chatbot with Human Feedback

1 code implementation30 Aug 2022 Hua Lu, Siqi Bao, Huang He, Fan Wang, Hua Wu, Haifeng Wang

Many open-domain dialogue models pre-trained with social media comments can generate coherent replies but have difficulties producing engaging responses when interacting with real users.

Chatbot

GEM-2: Next Generation Molecular Property Prediction Network by Modeling Full-range Many-body Interactions

1 code implementation11 Aug 2022 Lihang Liu, Donglong He, Xiaomin Fang, Shanzhuo Zhang, Fan Wang, Jingzhou He, Hua Wu

Full-range many-body interactions between electrons have been proven effective in obtaining an accurate solution of the Schr"odinger equation by classical computational chemistry methods, although modeling such interactions consumes an expensive computational cost.

Computational chemistry Drug Discovery +3

HelixFold-Single: MSA-free Protein Structure Prediction by Using Protein Language Model as an Alternative

1 code implementation28 Jul 2022 Xiaomin Fang, Fan Wang, Lihang Liu, Jingzhou He, Dayong Lin, Yingfei Xiang, Xiaonan Zhang, Hua Wu, Hui Li, Le Song

Our proposed method, HelixFold-Single, first pre-trains a large-scale protein language model (PLM) with thousands of millions of primary sequences utilizing the self-supervised learning paradigm, which will be used as an alternative to MSAs for learning the co-evolution information.

Protein Language Model Protein Structure Prediction +1

Dynamic Gradient Reactivation for Backward Compatible Person Re-identification

no code implementations12 Jul 2022 Xiao Pan, Hao Luo, Weihua Chen, Fan Wang, Hao Li, Wei Jiang, Jianming Zhang, Jianyang Gu, Peike Li

To address this issue, we propose the Ranking-based Backward Compatible Learning (RBCL), which directly optimizes the ranking metric between new features and old features.

Person Re-Identification Retrieval

HelixFold: An Efficient Implementation of AlphaFold2 using PaddlePaddle

1 code implementation12 Jul 2022 Guoxia Wang, Xiaomin Fang, Zhihua Wu, Yiqun Liu, Yang Xue, Yingfei Xiang, dianhai yu, Fan Wang, Yanjun Ma

Due to the complex model architecture and large memory consumption, it requires lots of computational resources and time to implement the training and inference of AlphaFold2 from scratch.

Protein Structure Prediction

TCR: A Transformer Based Deep Network for Predicting Cancer Drugs Response

no code implementations10 Jul 2022 Jie Gao, Jing Hu, Wanqing Sun, Yili Shen, Xiaonan Zhang, Xiaomin Fang, Fan Wang, Guodong Zhao

Our study highlights the prediction power of TCR and its potential value for cancer drug repurpose and precision oncology treatment.

Link the World: Improving Open-domain Conversation with Dynamic Spatiotemporal-aware Knowledge

no code implementations28 Jun 2022 Han Zhou, Xinchao Xu, Wenquan Wu, Zheng-Yu Niu, Hua Wu, Siqi Bao, Fan Wang, Haifeng Wang

Making chatbots world aware in a conversation like a human is a crucial challenge, where the world may contain dynamic knowledge and spatiotemporal state.

Informativeness

Active Source Free Domain Adaptation

no code implementations22 May 2022 Fan Wang, Zhongyi Han, Zhiyan Zhang, Yilong Yin

We then propose minimum happy points learning (MHPL) to actively explore and exploit MH points.

Source-Free Domain Adaptation

HelixADMET: a robust and endpoint extensible ADMET system incorporating self-supervised knowledge transfer

no code implementations17 May 2022 Shanzhuo Zhang, Zhiyuan Yan, Yueyang Huang, Lihang Liu, Donglong He, Wei Wang, Xiaomin Fang, Xiaonan Zhang, Fan Wang, Hua Wu, Haifeng Wang

Additionally, the pre-trained model provided by H-ADMET can be fine-tuned to generate new and customised ADMET endpoints, meeting various demands of drug research and development requirements.

Drug Discovery Self-Supervised Learning +1

Exploiting the Relationship Between Kendall's Rank Correlation and Cosine Similarity for Attribution Protection

no code implementations15 May 2022 Fan Wang, Adams Wai-Kin Kong

In this paper, we first show that the expected Kendall's rank correlation is positively correlated to cosine similarity and then indicate that the direction of attribution is the key to attribution robustness.

Adversarial Robustness

An empirical equilibrium model of formal and informal credit markets in developing countries

no code implementations26 Apr 2022 Fan Wang

I develop and estimate a dynamic equilibrium model of risky entrepreneurs' borrowing and savings decisions incorporating both formal and local-informal credit markets.

Optimal allocations to heterogeneous agents with an application to stimulus checks

no code implementations8 Apr 2022 Vegard M. Nygaard, Bent E. Sørensen, Fan Wang

A planner allocates discrete transfers of size $D_g$ to $N$ heterogeneous groups labeled $g$ and has CES preferences over the resulting outcomes, $H_g(D_g)$.

Structure-aware Protein Self-supervised Learning

1 code implementation6 Apr 2022 Can Chen, Jingbo Zhou, Fan Wang, Xue Liu, Dejing Dou

Furthermore, we propose to leverage the available protein language model pretrained on protein sequences to enhance the self-supervised learning.

Graph Neural Network Protein Language Model +2

Early life height and weight production functions with endogenous energy and protein inputs

no code implementations6 Apr 2022 Esteban Puentes, Fan Wang, Jere R. Behrman, Flávio Cunha, John Hoddinott, John A. Maluccio, Linda S. Adair, Judith B. Borja, Reynaldo Martorell, Aryeh D. Stein

We examine effects of protein and energy intakes on height and weight growth for children between 6 and 24 months old in Guatemala and the Philippines.

You are what your parents expect: Height and local reference points

no code implementations5 Apr 2022 Fan Wang, Esteban Puentes, Jere R. Behrman, Flávio Cunha

We explore the exogenous variation in reference height produced by a protein-supplementation experiment in Guatemala to estimate our model's parameters.

Fewer, better pathways for all? Intersectional impacts of rural school consolidation in China's minority regions

no code implementations4 Apr 2022 Emily Hannum, Fan Wang

Much more than Han youth, ethnic minority youth were negatively affected by closure, in terms of its impact on both educational attainment and written Mandarin facility.

Same environment, stratified impacts? Air pollution, extreme temperatures, and birth weight in south China

no code implementations1 Apr 2022 Xiaoying Liu, Jere R. Behrman, Emily Hannum, Fan Wang, Qingguo Zhao

This paper investigates whether associations between birth weight and prenatal ambient environmental conditions--pollution and extreme temperatures--differ by 1) maternal education; 2) children's innate health; and 3) interactions between these two.

Estimating the Effects of Educational System Consolidation: The Case of China's Rural School Closure Initiative

no code implementations31 Mar 2022 Emily Hannum, Xiaoying Liu, Fan Wang

We estimate the impact of educational infrastructure consolidation on educational attainment using the case of China's rural primary school closure policies in the early 2000s.

EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation

1 code implementation CVPR 2022 Hansheng Chen, Pichao Wang, Fan Wang, Wei Tian, Lu Xiong, Hao Li

The 2D-3D coordinates and corresponding weights are treated as intermediate variables learned by minimizing the KL divergence between the predicted and target pose distribution.

3D Object Detection 6D Pose Estimation using RGB +1

Controllable energy angular spectrum method

no code implementations18 Mar 2022 Fan Wang, Tomoyoshi Shimobaba, Takashi Kakue, Tomoyoshi Ito

A controllable energy method, which considers the undersampling issue of the transfer function and valid spectral energy of a source signal, is proposed to implement angular spectrum diffraction calculation in near and far fields.

valid

Information retrieval for label noise document ranking by bag sampling and group-wise loss

no code implementations12 Mar 2022 Chunyu Li, Jiajia Ding, Xing Hu, Fan Wang

To fit bag sampling well, after query and document are encoded, the global features of each group are extracted by convolutional layer and max-pooling to improve the model's resistance to the impact of labeling noise, finally, calculate the LCE group-wise loss.

Document Ranking Information Retrieval +2

Detecting Owner-member Relationship with Graph Convolution Network in Fisheye Camera System

no code implementations28 Jan 2022 Zizhang Wu, Jason Wang, Tianhao Xu, Fan Wang

The owner-member relationship between wheels and vehicles contributes significantly to the 3D perception of vehicles, especially in embedded environments.

Graph Attention

Image-to-Video Re-Identification via Mutual Discriminative Knowledge Transfer

no code implementations21 Jan 2022 Pichao Wang, Fan Wang, Hao Li

During the KD process, the TCL loss transfers the local structure, exploits the higher order information, and mitigates the misalignment of the heterogeneous output of teacher and student networks.

Knowledge Distillation Transfer Learning +1

Network-ELAA Beamforming and Coverage Analysis for eMBB/URLLC in Spatially Non-Stationary Rician Channels

no code implementations19 Jan 2022 Jinfei Wang, Yi Ma, Na Yi, Rahim Tafazolli, Fan Wang

Finally, it is shown that the network-ELAA can offer significant coverage extension (50% or more in most of cases) when comparing with the single-AP scenario.

Exploring Domain-Invariant Parameters for Source Free Domain Adaptation

no code implementations CVPR 2022 Fan Wang, Zhongyi Han, Yongshun Gong, Yilong Yin

In contrast, we provide a fascinating insight: rather than attempting to learn domain-invariant representations, it is better to explore the domain-invariant parameters of the source model.

Privacy Preserving Source-Free Domain Adaptation

Memory-Augmented Deep Conditional Unfolding Network for Pan-Sharpening

1 code implementation CVPR 2022 Gang Yang, Man Zhou, Keyu Yan, Aiping Liu, Xueyang Fu, Fan Wang

Pan-sharpening aims to obtain high-resolution multispectral (MS) images for remote sensing systems and deep learning-based methods have achieved remarkable success.

Denoising

TAGPerson: A Target-Aware Generation Pipeline for Person Re-identification

1 code implementation28 Dec 2021 Kai Chen, Weihua Chen, Tao He, Rong Du, Fan Wang, Xiuyu Sun, Yuchen Guo, Guiguang Ding

In TAGPerson, we extract information from target scenes and use them to control our parameterized rendering process to generate target-aware synthetic images, which would hold a smaller gap to the real images in the target domain.

Person Re-Identification

ELSA: Enhanced Local Self-Attention for Vision Transformer

1 code implementation23 Dec 2021 Jingkai Zhou, Pichao Wang, Fan Wang, Qiong Liu, Hao Li, Rong Jin

Self-attention is powerful in modeling long-range dependencies, but it is weak in local finer-level feature learning.

Image Classification Instance Segmentation +2

Multimodal Pre-Training Model for Sequence-based Prediction of Protein-Protein Interaction

no code implementations9 Dec 2021 Yang Xue, Zijing Liu, Xiaomin Fang, Fan Wang

However, neither sequences nor contact maps can fully characterize structures and functions of the proteins, which are closely related to the PPI problem.

Drug Discovery

TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation

1 code implementation2 Dec 2021 Zhaoyuan Yin, Pichao Wang, Fan Wang, Xianzhe Xu, Hanling Zhang, Hao Li, Rong Jin

Unsupervised semantic segmentation aims to obtain high-level semantic representation on low-level visual features without manual annotations.

Ranked #2 on Unsupervised Semantic Segmentation on COCO-Stuff-171 (using extra training data)

Segmentation Self-Supervised Learning +1

HelixMO: Sample-Efficient Molecular Optimization in Scene-Sensitive Latent Space

no code implementations30 Nov 2021 ZhiYuan Chen, Xiaomin Fang, Zixu Hua, Yueyang Huang, Fan Wang, Hua Wu

Efficient exploration of the chemical space to search the candidate drugs that satisfy various constraints is a fundamental task of drug discovery.

Drug Discovery Efficient Exploration

Self-Supervised Pre-Training for Transformer-Based Person Re-Identification

3 code implementations23 Nov 2021 Hao Luo, Pichao Wang, Yi Xu, Feng Ding, Yanxin Zhou, Fan Wang, Hao Li, Rong Jin

We first investigate self-supervised learning (SSL) methods with Vision Transformer (ViT) pretrained on unlabelled person images (the LUPerson dataset), and empirically find it significantly surpasses ImageNet supervised pre-training models on ReID tasks.

 Ranked #1 on Unsupervised Person Re-Identification on Market-1501 (using extra training data)

Self-Supervised Learning Unsupervised Domain Adaptation +1

Amendable Generation for Dialogue State Tracking

1 code implementation EMNLP (NLP4ConvAI) 2021 Xin Tian, Liankai Huang, Yingzhan Lin, Siqi Bao, Huang He, Yunyi Yang, Hua Wu, Fan Wang, Shuqi Sun

In this paper, we propose a novel Amendable Generation for Dialogue State Tracking (AG-DST), which contains a two-pass generation process: (1) generating a primitive dialogue state based on the dialogue of the current turn and the previous dialogue state, and (2) amending the primitive dialogue state from the first pass.

Dialogue State Tracking Multi-domain Dialogue State Tracking +1

Do What Nature Did To Us: Evolving Plastic Recurrent Neural Networks For Generalized Tasks

no code implementations29 Sep 2021 Fan Wang, Hao Tian, Haoyi Xiong, Hua Wu, Yang Cao, Yu Kang, Haifeng Wang

While artificial neural networks (ANNs) have been widely adopted in machine learning, researchers are increasingly obsessed by the gaps between ANNs and natural neural networks (NNNs).

Meta-Learning

PLATO-XL: Exploring the Large-scale Pre-training of Dialogue Generation

3 code implementations20 Sep 2021 Siqi Bao, Huang He, Fan Wang, Hua Wu, Haifeng Wang, Wenquan Wu, Zhihua Wu, Zhen Guo, Hua Lu, Xinxian Huang, Xin Tian, Xinchao Xu, Yingzhan Lin, Zheng-Yu Niu

To explore the limit of dialogue generation pre-training, we present the models of PLATO-XL with up to 11 billion parameters, trained on both Chinese and English social media conversations.

Dialogue Generation

Reinforcement Learning with Evolutionary Trajectory Generator: A General Approach for Quadrupedal Locomotion

1 code implementation14 Sep 2021 Haojie Shi, Bo Zhou, Hongsheng Zeng, Fan Wang, Yueqiang Dong, Jiangyong Li, Kang Wang, Hao Tian, Max Q. -H. Meng

However, due to the complex nonlinear dynamics in quadrupedal robots and reward sparsity, it is still difficult for RL to learn effective gaits from scratch, especially in challenging tasks such as walking over the balance beam.

reinforcement-learning Reinforcement Learning +1

CDTrans: Cross-domain Transformer for Unsupervised Domain Adaptation

2 code implementations ICLR 2022 Tongkun Xu, Weihua Chen, Pichao Wang, Fan Wang, Hao Li, Rong Jin

Along with the pseudo labels, a weight-sharing triple-branch transformer framework is proposed to apply self-attention and cross-attention for source/target feature learning and source-target domain alignment, respectively.

Unsupervised Domain Adaptation

Scaled ReLU Matters for Training Vision Transformers

no code implementations8 Sep 2021 Pichao Wang, Xue Wang, Hao Luo, Jingkai Zhou, Zhipeng Zhou, Fan Wang, Hao Li, Rong Jin

In this paper, we further investigate this problem and extend the above conclusion: only early convolutions do not help for stable training, but the scaled ReLU operation in the \textit{convolutional stem} (\textit{conv-stem}) matters.

Diversity

Evolving Decomposed Plasticity Rules for Information-Bottlenecked Meta-Learning

2 code implementations8 Sep 2021 Fan Wang, Hao Tian, Haoyi Xiong, Hua Wu, Jie Fu, Yang Cao, Yu Kang, Haifeng Wang

In contrast, biological neural networks (BNNs) can adapt to various new tasks by continually updating the neural connections based on the inputs, which is aligned with the paradigm of learning effective learning rules in addition to static parameters, e. g., meta-learning.

Memorization Meta-Learning

ADER:Adapting between Exploration and Robustness for Actor-Critic Methods

no code implementations8 Sep 2021 Bo Zhou, Kejiao Li, Hongsheng Zeng, Fan Wang, Hao Tian

Combining off-policy reinforcement learning methods with function approximators such as neural networks has been found to lead to overestimation of the value function and sub-optimal solutions.

Continuous Control

Exploring the Quality of GAN Generated Images for Person Re-Identification

no code implementations23 Aug 2021 Yiqi Jiang, Weihua Chen, Xiuyu Sun, Xiaoyu Shi, Fan Wang, Hao Li

Recently, GAN based method has demonstrated strong effectiveness in generating augmentation data for person re-identification (ReID), on account of its ability to bridge the gap between domains and enrich the data variety in feature space.

Diversity Person Re-Identification +1

Structure-aware Interactive Graph Neural Networks for the Prediction of Protein-Ligand Binding Affinity

1 code implementation21 Jul 2021 Shuangli Li, Jingbo Zhou, Tong Xu, Liang Huang, Fan Wang, Haoyi Xiong, Weili Huang, Dejing Dou, Hui Xiong

To this end, we propose a structure-aware interactive graph neural network (SIGN) which consists of two components: polar-inspired graph attention layers (PGAL) and pairwise interactive pooling (PiPool).

Drug Discovery Graph Attention +2

Graph Convolution for Re-ranking in Person Re-identification

1 code implementation5 Jul 2021 Yuqi Zhang, Qian Qi, Chong Liu, Weihua Chen, Fan Wang, Hao Li, Rong Jin

In this work, we propose a graph-based re-ranking method to improve learned features while still keeping Euclidean distance as the similarity metric.

Person Re-Identification Re-Ranking +1

KVT: k-NN Attention for Boosting Vision Transformers

1 code implementation28 May 2021 Pichao Wang, Xue Wang, Fan Wang, Ming Lin, Shuning Chang, Hao Li, Rong Jin

A key component in vision transformers is the fully-connected self-attention which is more powerful than CNNs in modelling long range dependencies.

An Empirical Study of Vehicle Re-Identification on the AI City Challenge

1 code implementation20 May 2021 Hao Luo, Weihua Chen, Xianzhe Xu, Jianyang Gu, Yuqi Zhang, Chong Liu, Yiqi Jiang, Shuting He, Fan Wang, Hao Li

We mainly focus on four points, i. e. training data, unsupervised domain-adaptive (UDA) training, post-processing, model ensembling in this challenge.

Diversity Re-Ranking +2

City-Scale Multi-Camera Vehicle Tracking Guided by Crossroad Zones

1 code implementation14 May 2021 Chong Liu, Yuqi Zhang, Hao Luo, Jiasheng Tang, Weihua Chen, Xianzhe Xu, Fan Wang, Hao Li, Yi-Dong Shen

Multi-Target Multi-Camera Tracking has a wide range of applications and is the basis for many advanced inferences and predictions.

Clustering Vehicle Re-Identification

TopoTxR: A Topological Biomarker for Predicting Treatment Response in Breast Cancer

1 code implementation13 May 2021 Fan Wang, Saarthak Kapse, Steven Liu, Prateek Prasanna, Chao Chen

Characterization of breast parenchyma on dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) is a challenging task owing to the complexity of underlying tissue structures.

A Unified Pre-training Framework for Conversational AI

1 code implementation6 May 2021 Siqi Bao, Bingjin Chen, Huang He, Xin Tian, Han Zhou, Fan Wang, Hua Wu, Haifeng Wang, Wenquan Wu, Yingzhan Lin

In this work, we explore the application of PLATO-2 on various dialogue systems, including open-domain conversation, knowledge grounded dialogue, and task-oriented conversation.

Chatbot Interactive Evaluation of Dialog +1

Augmented Transformer with Adaptive Graph for Temporal Action Proposal Generation

no code implementations30 Mar 2021 Shuning Chang, Pichao Wang, Fan Wang, Hao Li, Jiashi Feng

Temporal action proposal generation (TAPG) is a fundamental and challenging task in video understanding, especially in temporal action detection.

Action Detection Temporal Action Proposal Generation +1

Molecular Representation Learning by Leveraging Chemical Information

1 code implementation NA 2021 Weibin Li, Shanzhuo Zhang, Lihang Liu, Zhengjie Huang, Jieqiong Lei, Xiaomin Fang, Shikun Feng, Fan Wang

As graph neural networks have achieved great success in many domains, some studies apply graph neural networks to molecular property prediction and regard each molecule as a graph.

Graph Property Prediction Molecular Property Prediction +3