Search Results for author: Chen Qian

Found 171 papers, 93 papers with code

Local Correlation Consistency for Knowledge Distillation

no code implementations ECCV 2020 Xiaojie Li, Jianlong Wu, Hongyu Fang, Yue Liao, Fei Wang, Chen Qian

Sufficient knowledge extraction from the teacher network plays a critical role in the knowledge distillation task to improve the performance of the student network.

Knowledge Distillation

Explainable and Interpretable Multimodal Large Language Models: A Comprehensive Survey

no code implementations3 Dec 2024 Yunkai Dang, Kaichen Huang, Jiahao Huo, Yibo Yan, Sirui Huang, Dongrui Liu, Mengxi Gao, Jie Zhang, Chen Qian, Kun Wang, Yong liu, Jing Shao, Hui Xiong, Xuming Hu

The rapid development of Artificial Intelligence (AI) has revolutionized numerous fields, with large language models (LLMs) and computer vision (CV) systems driving advancements in natural language understanding and visual processing, respectively.

Cross-Modal Retrieval Natural Language Understanding +4

Quantum Hamiltonian Descent for Graph Partition

no code implementations22 Nov 2024 Jinglei Cheng, Ruilin Zhou, Yuhang Gan, Chen Qian, Junyu Liu

We introduce Quantum Hamiltonian Descent as a novel approach to solve the graph partition problem.

From Transparent to Opaque: Rethinking Neural Implicit Surfaces with $α$-NeuS

1 code implementation8 Nov 2024 Haoran Zhang, Junkai Deng, Xuhui Chen, Fei Hou, Wencheng Wang, Hong Qin, Chen Qian, Ying He

Our method leverages the observation that transparent surfaces induce local extreme values in the learned distance fields during neural volumetric rendering, contrasting with opaque surfaces that align with zero level sets.

3D Shape Reconstruction Transparent objects

KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension

no code implementations4 Nov 2024 Jie Yang, Wang Zeng, Sheng Jin, Lumin Xu, Wentao Liu, Chen Qian, Ruimao Zhang

To bridge this gap, we introduce the novel challenge of Semantic Keypoint Comprehension, which aims to comprehend keypoints across different task scenarios, including keypoint semantic understanding, visual prompt-based keypoint detection, and textual prompt-based keypoint detection.

Keypoint Detection Language Modelling +1

Quasi-Medial Distance Field (Q-MDF): A Robust Method for Approximating and Discretizing Neural Medial Axis

no code implementations23 Oct 2024 Jiayi Kong, Chen Zong, Jun Luo, Shiqing Xin, Fei Hou, Hanqing Jiang, Chen Qian, Ying He

The medial axis, a lower-dimensional shape descriptor, plays an important role in the field of digital geometry processing.

DEAN: Deactivating the Coupled Neurons to Mitigate Fairness-Privacy Conflicts in Large Language Models

1 code implementation22 Oct 2024 Chen Qian, Dongrui Liu, Jie Zhang, Yong liu, Jing Shao

Extensive experimental results demonstrate that DEAN eliminates the trade-off phenomenon and significantly improves LLMs' fairness and privacy awareness simultaneously, \eg improving Qwen-2-7B-Instruct's fairness awareness by 12. 2\% and privacy awareness by 14. 0\%.

Fairness

REEF: Representation Encoding Fingerprints for Large Language Models

1 code implementation18 Oct 2024 Jie Zhang, Dongrui Liu, Chen Qian, Linfeng Zhang, Yong liu, Yu Qiao, Jing Shao

Therefore, model owners and third parties need to identify whether a suspect model is a subsequent development of the victim model.

Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System

no code implementations10 Oct 2024 Weize Chen, Jiarui Yuan, Chen Qian, Cheng Yang, Zhiyuan Liu, Maosong Sun

Large Language Model (LLM) based multi-agent systems (MAS) show remarkable potential in collaborative problem-solving, yet they still face critical challenges: low communication efficiency, poor scalability, and a lack of effective parameter-updating optimization methods.

Large Language Model Question Answering

Improving Data Efficiency via Curating LLM-Driven Rating Systems

no code implementations9 Oct 2024 Jinlong Pang, Jiaheng Wei, Ankit Parag Shah, Zhaowei Zhu, Yaxuan Wang, Chen Qian, Yang Liu, Yujia Bao, Wei Wei

Instruction tuning is critical for adapting large language models (LLMs) to downstream tasks, and recent studies have demonstrated that small amounts of human-curated data can outperform larger datasets, challenging traditional data scaling laws.

Diversity

Can Large Language Models Analyze Graphs like Professionals? A Benchmark, Datasets and Models

1 code implementation29 Sep 2024 Xin Li, Weize Chen, Qizhi Chu, Haopeng Li, Zhaojun Sun, Ran Li, Chen Qian, Yiwei Wei, Zhiyuan Liu, Chuan Shi, Maosong Sun, Cheng Yang

Our results underscore that the capabilities of LLMs in handling structured data are still under-explored, and show the effectiveness of LLM4Graph in enhancing LLMs' proficiency of graph analysis.

Recommendation Systems

CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications

1 code implementation7 Aug 2024 Tianfang Zhang, Lei LI, Yang Zhou, Wentao Liu, Chen Qian, Xiangyang Ji

In this paper, we introduce CAS-ViT: Convolutional Additive Self-attention Vision Transformers, to achieve a balance between efficiency and performance in mobile applications.

Image Classification Instance Segmentation +3

MAO: A Framework for Process Model Generation with Multi-Agent Orchestration

no code implementations4 Aug 2024 Leilei Lin, Yumeng Jin, Yingming Zhou, Wenlong Chen, Chen Qian

Our framework MAO leverages large language models as the cornerstone for multi-agent, employing an innovative prompt strategy to ensure efficient collaboration among multi-agent.

Hallucination software testing

The Better Angels of Machine Personality: How Personality Relates to LLM Safety

1 code implementation17 Jul 2024 Jie Zhang, Dongrui Liu, Chen Qian, Ziyue Gan, Yong liu, Yu Qiao, Jing Shao

In this paper, we discover that LLMs' personality traits are closely related to their safety abilities, i. e., toxicity, privacy, and fairness, based on the reliable MBTI-M scale.

Fairness Safety Alignment

TCFormer: Visual Recognition via Token Clustering Transformer

1 code implementation16 Jul 2024 Wang Zeng, Sheng Jin, Lumin Xu, Wentao Liu, Chen Qian, Wanli Ouyang, Ping Luo, Xiaogang Wang

Our dynamic tokens possess two crucial characteristics: (1) Representing image regions with similar semantic meanings using the same vision token, even if those regions are not adjacent, and (2) concentrating on regions with valuable details and represent them using fine tokens.

Clustering Image Classification +4

When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset

1 code implementation14 Jul 2024 Yi Zhang, Wang Zeng, Sheng Jin, Chen Qian, Ping Luo, Wentao Liu

With multi-modal joint training, our model achieves state-of-the-art performance on a wide range of pedestrian detection benchmarks, surpassing leading models tailored for specific sensor modality.

3D Object Detection Multispectral Object Detection +1

Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence

1 code implementation9 Jul 2024 Weize Chen, Ziming You, Ran Li, Yitong Guan, Chen Qian, Chenyang Zhao, Cheng Yang, Ruobing Xie, Zhiyuan Liu, Maosong Sun

The rapid advancement of large language models (LLMs) has paved the way for the development of highly capable autonomous agents.

MIRReS: Multi-bounce Inverse Rendering using Reservoir Sampling

no code implementations24 Jun 2024 Yuxin Dai, Qi Wang, Jingsen Zhu, Dianbing Xi, Yuchi Huo, Chen Qian, Ying He

We present MIRReS, a novel two-stage inverse rendering framework that jointly reconstructs and optimizes the explicit geometry, material, and lighting from multi-view images.

Inverse Rendering

Autonomous Agents for Collaborative Task under Information Asymmetry

2 code implementations21 Jun 2024 Wei Liu, Chenxi Wang, Yifei Wang, Zihao Xie, Rennai Qiu, Yufan Dang, Zhuoyun Du, Weize Chen, Cheng Yang, Chen Qian

Together with InfoNav, iAgents organizes human information in a mixed memory to provide agents with accurate and comprehensive information for exchange.

Language Modelling Large Language Model +1

Self-Supervised Vision Transformer for Enhanced Virtual Clothes Try-On

no code implementations15 Jun 2024 Lingxiao Lu, Shengyi Wu, Haoxuan Sun, Junhong Gou, Jianlou Si, Chen Qian, Jianfu Zhang, Liqing Zhang

Virtual clothes try-on has emerged as a vital feature in online shopping, offering consumers a critical tool to visualize how clothing fits.

Virtual Try-on

Multi-Agent Software Development through Cross-Team Collaboration

1 code implementation13 Jun 2024 Zhuoyun Du, Chen Qian, Wei Liu, Zihao Xie, Yifei Wang, Yufan Dang, Weize Chen, Cheng Yang

We anticipate that our work will guide LLM agents towards a cross-team paradigm and contribute to their significant growth in but not limited to software development.

Story Generation

Scaling Large-Language-Model-based Multi-Agent Collaboration

1 code implementation11 Jun 2024 Chen Qian, Zihao Xie, Yifei Wang, Wei Liu, Yufan Dang, Zhuoyun Du, Weize Chen, Cheng Yang, Zhiyuan Liu, Maosong Sun

Pioneering advancements in large language model-powered agents have underscored the design pattern of multi-agent collaboration, demonstrating that collective intelligence can surpass the capabilities of each individual.

Language Modelling Large Language Model

KerasCV and KerasNLP: Vision and Language Power-Ups

no code implementations30 May 2024 Matthew Watson, Divyashree Shivakumar Sreepathihalli, Francois Chollet, Martin Gorner, Kiranbir Sodhia, Ramesh Sampath, Tirth Patel, Haifeng Jin, Neel Kovelamudi, Gabriel Rasskin, Samaneh Saadat, Luke Wood, Chen Qian, Jonathan Bischof, Ian Stenbit, Abheesht Sharma, Anshuman Mishra

We present the Keras domain packages KerasCV and KerasNLP, extensions of the Keras API for Computer Vision and Natural Language Processing workflows, capable of running on either JAX, TensorFlow, or PyTorch.

Iterative Experience Refinement of Software-Developing Agents

no code implementations7 May 2024 Chen Qian, Jiahao Li, Yufan Dang, Wei Liu, Yifei Wang, Zihao Xie, Weize Chen, Cheng Yang, Yingli Zhang, Zhiyuan Liu, Maosong Sun

We propose two fundamental patterns: the successive pattern, refining based on nearest experiences within a task batch, and the cumulative pattern, acquiring experiences across all previous task batches.

UniFS: Universal Few-shot Instance Perception with Point Representations

1 code implementation30 Apr 2024 Sheng Jin, Ruijie Yao, Lumin Xu, Wentao Liu, Chen Qian, Ji Wu, Ping Luo

In this paper, we propose UniFS, a universal few-shot instance perception model that unifies a wide range of instance perception tasks by reformulating them into a dynamic point representation learning framework.

Few-Shot Learning Few-Shot Object Detection +4

LocalMamba: Visual State Space Model with Windowed Selective Scan

1 code implementation14 Mar 2024 Tao Huang, Xiaohuan Pei, Shan You, Fei Wang, Chen Qian, Chang Xu

This paper posits that the key to enhancing Vision Mamba (ViM) lies in optimizing scan directions for sequence modeling.

Mamba State Space Models

DEMOS: Dynamic Environment Motion Synthesis in 3D Scenes via Local Spherical-BEV Perception

no code implementations4 Mar 2024 Jingyu Gong, Min Wang, Wentao Liu, Chen Qian, Zhizhong Zhang, Yuan Xie, Lizhuang Ma

To handle this problem, we propose the first Dynamic Environment MOtion Synthesis framework (DEMOS) to predict future motion instantly according to the current scene, and use it to dynamically update the latent motion for final motion synthesis.

motion prediction Motion Synthesis

Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period of Large Language Models

1 code implementation29 Feb 2024 Chen Qian, Jie Zhang, Wei Yao, Dongrui Liu, Zhenfei Yin, Yu Qiao, Yong liu, Jing Shao

This research provides an initial exploration of trustworthiness modeling during LLM pre-training, seeking to unveil new insights and spur further developments in the field.

Fairness Mutual Information Estimation

Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication

1 code implementation28 Feb 2024 Weize Chen, Chenfei Yuan, Jiarui Yuan, Yusheng Su, Chen Qian, Cheng Yang, Ruobing Xie, Zhiyuan Liu, Maosong Sun

Natural language (NL) has long been the predominant format for human cognition and communication, and by extension, has been similarly pivotal in the development and application of Large Language Models (LLMs).

A Study on the Vulnerability of Test Questions against ChatGPT-based Cheating

no code implementations21 Feb 2024 Shanker Ram, Chen Qian

In this paper, we try to provide an answer to an important question: how well ChatGPT can answer test questions and how we can detect whether the questions of a test can be answered correctly by ChatGPT.

Chatbot

Fairness Without Harm: An Influence-Guided Active Sampling Approach

1 code implementation20 Feb 2024 Jinlong Pang, Jialu Wang, Zhaowei Zhu, Yuanshun Yao, Chen Qian, Yang Liu

In this work, we aim to train models that mitigate group fairness disparity without causing harm to model accuracy.

Active Learning Attribute +1

MetaTra: Meta-Learning for Generalized Trajectory Prediction in Unseen Domain

no code implementations13 Feb 2024 Xiaohe Li, Feilong Huang, Zide Fan, Fangli Mou, Yingyan Hou, Chen Qian, Lijie Wen

Trajectory prediction has garnered widespread attention in different fields, such as autonomous driving and robotic navigation.

Autonomous Driving Domain Generalization +2

DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer

1 code implementation8 Feb 2024 Zhiyuan Ma, Xiangyu Zhu, GuoJun Qi, Chen Qian, Zhaoxiang Zhang, Zhen Lei

We suspect this is due to a shortage of paired audio-4D data, which is crucial for the Transformer to effectively perform as a denoiser within the Diffusion framework.

Lens: A Foundation Model for Network Traffic

no code implementations6 Feb 2024 Qineng Wang, Chen Qian, Xiaochang Li, Ziyu Yao, Gang Zhou, Huajie Shao

Network traffic refers to the amount of data being sent and received over the internet or any system that connects computers.

Decoder Traffic Prediction

Topology-Aware Latent Diffusion for 3D Shape Generation

no code implementations31 Jan 2024 Jiangbei Hu, Ben Fei, Baixin Xu, Fei Hou, Weidong Yang, Shengfa Wang, Na lei, Chen Qian, Ying He

By strategically incorporating topological features into the diffusion process, our generative module is able to produce a richer variety of 3D shapes with different topological structures.

3D Shape Generation Diversity +1

Experiential Co-Learning of Software-Developing Agents

1 code implementation28 Dec 2023 Chen Qian, Yufan Dang, Jiahao Li, Wei Liu, Zihao Xie, Yifei Wang, Weize Chen, Cheng Yang, Xin Cong, Xiaoyin Che, Zhiyuan Liu, Maosong Sun

Recent advancements in large language models (LLMs) have brought significant changes to various domains, especially through LLM-driven autonomous agents.

Robust Geometry and Reflectance Disentanglement for 3D Face Reconstruction from Sparse-view Images

no code implementations11 Dec 2023 Daisheng Jin, Jiangbei Hu, Baixin Xu, Yuxin Dai, Chen Qian, Ying He

This paper presents a novel two-stage approach for reconstructing human faces from sparse-view images, a task made challenging by the unique geometry and complex skin reflectance of each individual.

3D Face Reconstruction Disentanglement +1

You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception

1 code implementation9 Dec 2023 Sheng Jin, Shuhuai Li, Tong Li, Wentao Liu, Chen Qian, Ping Luo

Human-centric perception (e. g. detection, segmentation, pose estimation, and attribute analysis) is a long-standing problem for computer vision.

Attribute Multi-Task Learning +1

Identity-Obscured Neural Radiance Fields: Privacy-Preserving 3D Facial Reconstruction

no code implementations7 Dec 2023 Jiayi Kong, Baixin Xu, Xurui Song, Chen Qian, Jun Luo, Ying He

Neural radiance fields (NeRF) typically require a complete set of images taken from multiple camera perspectives to accurately reconstruct geometric details.

Privacy Preserving

Minimum Snap Trajectory Generation and Control for an Under-actuated Flapping Wing Aerial Vehicle

no code implementations2 Nov 2023 Chen Qian, Rui Chen, Peiyao Shen, Yongchun Fang, Jifu Yan, Tiefeng Li

This work firstly achieves the closed-loop integration of trajectory generation and control for real 3-dimensional flight of an underactuated FWAV to a practical level.

Virtual Accessory Try-On via Keypoint Hallucination

no code implementations26 Oct 2023 Junhong Gou, Bo Zhang, Li Niu, Jianfu Zhang, Jianlou Si, Chen Qian, Liqing Zhang

Specifically, our approach learns the human body priors and hallucinates the target locations of specified foreground keypoints in the background.

Hallucination Semantic Segmentation +1

Secure Decentralized Learning with Blockchain

no code implementations10 Oct 2023 Xiaoxue Zhang, Yifan Hua, Chen Qian

To avoid the single point of failure problem in FL, decentralized federated learning (DFL) has been proposed to use peer-to-peer communication for model aggregation, which has been considered an attractive solution for machine learning tasks on distributed personal devices.

Federated Learning

Parameterization-driven Neural Surface Reconstruction for Object-oriented Editing in Neural Rendering

no code implementations9 Oct 2023 Baixin Xu, Jiangbei Hu, Fei Hou, Kwan-Yee Lin, Wayne Wu, Chen Qian, Ying He

The advancements in neural rendering have increased the need for techniques that enable intuitive editing of 3D objects represented as neural implicit surfaces.

3D geometry Neural Rendering +1

Bloch Equation Enables Physics-informed Neural Network in Parametric Magnetic Resonance Imaging

no code implementations21 Sep 2023 Qingrui Cai, Liuhong Zhu, Jianjun Zhou, Chen Qian, Di Guo, Xiaobo Qu

PINN enables learning the Bloch equation, estimating the T2 parameter, and generating a series of physically synthetic data.

Network Interpretation

GKGNet: Group K-Nearest Neighbor based Graph Convolutional Network for Multi-Label Image Recognition

1 code implementation28 Aug 2023 Ruijie Yao, Sheng Jin, Lumin Xu, Wang Zeng, Wentao Liu, Chen Qian, Ping Luo, Ji Wu

Multi-Label Image Recognition (MLIR) is a challenging task that aims to predict multiple object labels in a single image while modeling the complex relationships between labels and image regions.

graph construction Multi-Label Classification +1

CoNe: Contrast Your Neighbours for Supervised Image Classification

1 code implementation21 Aug 2023 Mingkai Zheng, Shan You, Lang Huang, Xiu Su, Fei Wang, Chen Qian, Xiaogang Wang, Chang Xu

Moreover, to further boost the performance, we propose ``distributional consistency" as a more informative regularization to enable similar instances to have a similar probability distribution.

Classification Image Classification +1

AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors

1 code implementation21 Aug 2023 Weize Chen, Yusheng Su, Jingwei Zuo, Cheng Yang, Chenfei Yuan, Chi-Min Chan, Heyang Yu, Yaxi Lu, Yi-Hsin Hung, Chen Qian, Yujia Qin, Xin Cong, Ruobing Xie, Zhiyuan Liu, Maosong Sun, Jie zhou

Autonomous agents empowered by Large Language Models (LLMs) have undergone significant improvements, enabling them to generalize across a broad spectrum of tasks.

Taming the Power of Diffusion Models for High-Quality Virtual Try-On with Appearance Flow

1 code implementation11 Aug 2023 Junhong Gou, Siyu Sun, Jianfu Zhang, Jianlou Si, Chen Qian, Liqing Zhang

Our approach, namely Diffusion-based Conditional Inpainting for Virtual Try-ON (DCI-VTON), effectively utilizes the power of the diffusion model, and the incorporation of the warping module helps to produce high-quality and realistic virtual try-on results.

Denoising Image Generation +1

ChatDev: Communicative Agents for Software Development

1 code implementation16 Jul 2023 Chen Qian, Wei Liu, Hongzhang Liu, Nuo Chen, Yufan Dang, Jiahao Li, Cheng Yang, Weize Chen, Yusheng Su, Xin Cong, Juyuan Xu, Dahai Li, Zhiyuan Liu, Maosong Sun

Numerous studies used deep learning to improve specific phases in a waterfall model, such as design, coding, and testing.

Decision Making

Can Large Language Models Empower Molecular Property Prediction?

1 code implementation14 Jul 2023 Chen Qian, Huayi Tang, Zhirui Yang, Hong Liang, Yong liu

Molecular property prediction has gained significant attention due to its transformative potential in multiple scientific disciplines.

Molecular Property Prediction Property Prediction

Knowledge Diffusion for Distillation

1 code implementation NeurIPS 2023 Tao Huang, Yuan Zhang, Mingkai Zheng, Shan You, Fei Wang, Chen Qian, Chang Xu

To address this, we propose to denoise student features using a diffusion model trained by teacher features.

Denoising Image Classification +4

RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars

1 code implementation NeurIPS 2023 Dongwei Pan, Long Zhuo, Jingtan Piao, Huiwen Luo, Wei Cheng, Yuxin Wang, Siming Fan, Shengqi Liu, Lei Yang, Bo Dai, Ziwei Liu, Chen Change Loy, Chen Qian, Wayne Wu, Dahua Lin, Kwan-Yee Lin

It is a large-scale digital library for head avatars with three key attributes: 1) High Fidelity: all subjects are captured by 60 synchronized, high-resolution 2K cameras in 360 degrees.

2k Image Matting +2

Can GPT-4 Perform Neural Architecture Search?

1 code implementation21 Apr 2023 Mingkai Zheng, Xiu Su, Shan You, Fei Wang, Chen Qian, Chang Xu, Samuel Albanie

We investigate the potential of GPT-4~\cite{gpt4} to perform Neural Architecture Search (NAS) -- the task of designing effective neural architectures.

Navigate Neural Architecture Search

Deformable Model-Driven Neural Rendering for High-Fidelity 3D Reconstruction of Human Heads Under Low-View Settings

2 code implementations ICCV 2023 Baixin Xu, Jiarui Zhang, Kwan-Yee Lin, Chen Qian, Ying He

To address this, we propose geometry decomposition and adopt a two-stage, coarse-to-fine training strategy, allowing for progressively capturing high-frequency geometric details.

3D Reconstruction Neural Rendering +1

A Dynamics Theory of Implicit Regularization in Deep Low-Rank Matrix Factorization

no code implementations29 Dec 2022 Jian Cao, Chen Qian, Yihui Huang, Dicheng Chen, Yuncheng Gao, Jiyang Dong, Di Guo, Xiaobo Qu

Recent theory starts to explain implicit regularization with the model of deep matrix factorization (DMF) and analyze the trajectory of discrete gradient dynamics in the optimization process.

Adaptive Control of Client Selection and Gradient Compression for Efficient Federated Learning

no code implementations19 Dec 2022 Zhida Jiang, Yang Xu, Hongli Xu, Zhiyuan Wang, Chen Qian

Federated learning (FL) allows multiple clients cooperatively train models without disclosing local data.

Federated Learning

CloudBrain-ReconAI: An Online Platform for MRI Reconstruction and Image Quality Evaluation

no code implementations4 Dec 2022 Yirong Zhou, Chen Qian, Jiayu Li, Zi Wang, Yu Hu, Biao Qu, Liuhong Zhu, Jianjun Zhou, Taishan Kang, Jianzhong Lin, Qing Hong, Jiyang Dong, Di Guo, Xiaobo Qu

Efficient collaboration between engineers and radiologists is important for image reconstruction algorithm development and image quality evaluation in magnetic resonance imaging (MRI).

Cloud Computing MRI Reconstruction

A Faithful Deep Sensitivity Estimation for Accelerated Magnetic Resonance Imaging

no code implementations23 Oct 2022 Zi Wang, Haoming Fang, Chen Qian, Boxuan Shi, Lijun Bao, Liuhong Zhu, Jianjun Zhou, Wenping Wei, Jianzhong Lin, Di Guo, Xiaobo Qu

To understand the behavior of the network, the mutual promotion of sensitivity estimation and image reconstruction is revealed through the visualization of network intermediate results.

MRI Reconstruction

Weak-shot Semantic Segmentation via Dual Similarity Transfer

1 code implementation5 Oct 2022 Junjie Chen, Li Niu, Siyuan Zhou, Jianlou Si, Chen Qian, Liqing Zhang

Proposal segmentation allows proposal-pixel similarity transfer from base classes to novel classes, which enables the mask learning of novel classes.

Segmentation Semantic Segmentation +2

ZoomNAS: Searching for Whole-body Human Pose Estimation in the Wild

1 code implementation23 Aug 2022 Lumin Xu, Sheng Jin, Wentao Liu, Chen Qian, Wanli Ouyang, Ping Luo, Xiaogang Wang

We propose a single-network approach, termed ZoomNet, to take into account the hierarchical structure of the full human body and solve the scale variation of different body parts.

2D Human Pose Estimation Neural Architecture Search +1

3D Interacting Hand Pose Estimation by Hand De-occlusion and Removal

1 code implementation22 Jul 2022 Hao Meng, Sheng Jin, Wentao Liu, Chen Qian, Mengxiang Lin, Wanli Ouyang, Ping Luo

Unlike most previous works that directly predict the 3D poses of two interacting hands simultaneously, we propose to decompose the challenging interacting hand pose estimation task and estimate the pose of each hand separately.

3D Interacting Hand Pose Estimation Hand Pose Estimation

Pose for Everything: Towards Category-Agnostic Pose Estimation

1 code implementation21 Jul 2022 Lumin Xu, Sheng Jin, Wang Zeng, Wentao Liu, Chen Qian, Wanli Ouyang, Ping Luo, Xiaogang Wang

In this paper, we introduce the task of Category-Agnostic Pose Estimation (CAPE), which aims to create a pose estimation model capable of detecting the pose of any class of object given only a few samples with keypoint definition.

Category-Agnostic Pose Estimation Pose Estimation

Structure-aware Editable Morphable Model for 3D Facial Detail Animation and Manipulation

1 code implementation19 Jul 2022 Jingwang Ling, Zhibo Wang, Ming Lu, Quan Wang, Chen Qian, Feng Xu

Previous works on morphable models mostly focus on large-scale facial geometry but ignore facial details.

Dual Adaptive Transformations for Weakly Supervised Point Cloud Segmentation

no code implementations19 Jul 2022 Zhonghua Wu, Yicheng Wu, Guosheng Lin, Jianfei Cai, Chen Qian

Weakly supervised point cloud segmentation, i. e. semantically segmenting a point cloud with only a few labeled points in the whole 3D scene, is highly desirable due to the heavy burden of collecting abundant dense annotations for the model training.

Point Cloud Segmentation Segmentation

ScaleNet: Searching for the Model to Scale

1 code implementation15 Jul 2022 Jiyang Xie, Xiu Su, Shan You, Zhanyu Ma, Fei Wang, Chen Qian

Recently, community has paid increasing attention on model scaling and contributed to developing a model family with a wide spectrum of scales.

LightViT: Towards Light-Weight Convolution-Free Vision Transformers

1 code implementation12 Jul 2022 Tao Huang, Lang Huang, Shan You, Fei Wang, Chen Qian, Chang Xu

Vision transformers (ViTs) are usually considered to be less light-weight than convolutional neural networks (CNNs) due to the lack of inductive bias.

Image Classification Inductive Bias +3

HEAD: HEtero-Assists Distillation for Heterogeneous Object Detectors

1 code implementation12 Jul 2022 Luting Wang, Xiaojie Li, Yue Liao, Zeren Jiang, Jianlong Wu, Fei Wang, Chen Qian, Si Liu

We observe that the core difficulty for heterogeneous KD (hetero-KD) is the significant semantic gap between the backbone features of heterogeneous detectors due to the different optimization manners.

Knowledge Distillation Object +3

Submission to Generic Event Boundary Detection Challenge@CVPR 2022: Local Context Modeling and Global Boundary Decoding Approach

no code implementations30 Jun 2022 Jiaqi Tang, Zhaoyang Liu, Jing Tan, Chen Qian, Wayne Wu, LiMin Wang

Local context modeling sub-network is proposed to perceive diverse patterns of generic event boundaries, and it generates powerful video representations and reliable boundary confidence.

Boundary Detection Generic Event Boundary Detection +1

Masked Distillation with Receptive Tokens

1 code implementation29 May 2022 Tao Huang, Yuan Zhang, Shan You, Fei Wang, Chen Qian, Jian Cao, Chang Xu

To obtain a group of masks, the receptive tokens are learned via the regular task loss but with teacher fixed, and we also leverage a Dice loss to enrich the diversity of learned masks.

object-detection Object Detection +1

Green Hierarchical Vision Transformer for Masked Image Modeling

1 code implementation26 May 2022 Lang Huang, Shan You, Mingkai Zheng, Fei Wang, Chen Qian, Toshihiko Yamasaki

We present an efficient approach for Masked Image Modeling (MIM) with hierarchical Vision Transformers (ViTs), allowing the hierarchical ViTs to discard masked patches and operate only on the visible ones.

Object Detection

Knowledge Distillation from A Stronger Teacher

3 code implementations21 May 2022 Tao Huang, Shan You, Fei Wang, Chen Qian, Chang Xu

In this paper, we show that simply preserving the relations between the predictions of teacher and student would suffice, and propose a correlation-based loss to capture the intrinsic inter-class relations from the teacher explicitly.

Ranked #3 on Knowledge Distillation on ImageNet (using extra training data)

Image Classification Knowledge Distillation +2

StyleGAN-Human: A Data-Centric Odyssey of Human Generation

4 code implementations25 Apr 2022 Jianglin Fu, Shikai Li, Yuming Jiang, Kwan-Yee Lin, Chen Qian, Chen Change Loy, Wayne Wu, Ziwei Liu

In addition, a model zoo and human editing applications are demonstrated to facilitate future research in the community.

Image Generation

Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing

2 code implementations25 Apr 2022 Haoyue Cheng, Zhaoyang Liu, Hang Zhou, Chen Qian, Wayne Wu, LiMin Wang

This paper focuses on the weakly-supervised audio-visual video parsing task, which aims to recognize all events belonging to each modality and localize their temporal boundaries.

Denoising valid

Generalizable Neural Performer: Learning Robust Radiance Fields for Human Novel View Synthesis

1 code implementation25 Apr 2022 Wei Cheng, Su Xu, Jingtan Piao, Chen Qian, Wayne Wu, Kwan-Yee Lin, Hongsheng Li

Specifically, we compress the light fields for novel view human rendering as conditional implicit neural radiance fields from both geometry and appearance aspects.

Novel View Synthesis

A Keypoint-based Global Association Network for Lane Detection

1 code implementation CVPR 2022 Jinsheng Wang, Yinchao Ma, Shaofei Huang, Tianrui Hui, Fei Wang, Chen Qian, Tianzhu Zhang

Earlier works follow a top-down roadmap to regress predefined anchors into various shapes of lane lines, which lacks enough flexibility to fit complex shapes of lanes due to the fixed anchor shapes.

Ranked #4 on Lane Detection on TuSimple (F1 score metric)

Keypoint Estimation Lane Detection

A Paired Phase and Magnitude Reconstruction for Advanced Diffusion-Weighted Imaging

no code implementations28 Mar 2022 Chen Qian, Zi Wang, Xinlin Zhang, Boxuan Shi, Boyu Jiang, Ran Tao, Jing Li, Yuwei Ge, Taishan Kang, Jianzhong Lin, Di Guo, Xiaobo Qu

Conclusion: The explicit phase model PAIR with complementary priors has a good performance on challenging reconstructions under inter-shot motions between shots and a low signal-to-noise ratio.

Learning Where to Learn in Cross-View Self-Supervised Learning

1 code implementation CVPR 2022 Lang Huang, Shan You, Mingkai Zheng, Fei Wang, Chen Qian, Toshihiko Yamasaki

In this paper, we present a new approach, Learning Where to Learn (LEWEL), to adaptively aggregate spatial information of features, so that the projected embeddings could be exactly aligned and thus guide the feature learning better.

object-detection Object Detection +3

Searching for Network Width with Bilaterally Coupled Network

1 code implementation25 Mar 2022 Xiu Su, Shan You, Jiyang Xie, Fei Wang, Chen Qian, ChangShui Zhang, Chang Xu

In BCNet, each channel is fairly trained and responsible for the same amount of network widths, thus each network width can be evaluated more accurately.

Fairness

Bailando: 3D Dance Generation by Actor-Critic GPT with Choreographic Memory

1 code implementation CVPR 2022 Li SiYao, Weijiang Yu, Tianpei Gu, Chunze Lin, Quan Wang, Chen Qian, Chen Change Loy, Ziwei Liu

With the learned choreographic memory, dance generation is realized on the quantized units that meet high choreography standards, such that the generated dancing sequences are confined within the spatial constraints.

Motion Synthesis

DyRep: Bootstrapping Training with Dynamic Re-parameterization

2 code implementations CVPR 2022 Tao Huang, Shan You, Bohan Zhang, Yuxuan Du, Fei Wang, Chen Qian, Chang Xu

Structural re-parameterization (Rep) methods achieve noticeable improvements on simple VGG-style networks.

Weak Augmentation Guided Relational Self-Supervised Learning

1 code implementation16 Mar 2022 Mingkai Zheng, Shan You, Fei Wang, Chen Qian, ChangShui Zhang, Xiaogang Wang, Chang Xu

Self-supervised Learning (SSL) including the mainstream contrastive learning has achieved great success in learning visual representations without data annotations.

Contrastive Learning Relation +2

Pseudo-Labeled Auto-Curriculum Learning for Semi-Supervised Keypoint Localization

no code implementations ICLR 2022 Can Wang, Sheng Jin, Yingda Guan, Wentao Liu, Chen Qian, Ping Luo, Wanli Ouyang

PL approaches apply pseudo-labels to unlabeled data, and then train the model with a combination of the labeled and pseudo-labeled data iteratively.

Efficient and Reliable Overlay Networks for Decentralized Federated Learning

no code implementations12 Dec 2021 Yifan Hua, Kevin Miller, Andrea L. Bertozzi, Chen Qian, Bao Wang

As such, our proposed overlay networks accelerate convergence, improve generalization, and enhance robustness to clients failures in DFL with theoretical guarantees.

Federated Learning Generalization Bounds +2

Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection

3 code implementations CVPR 2022 Jiaqi Tang, Zhaoyang Liu, Chen Qian, Wayne Wu, LiMin Wang

Generic event boundary detection is an important yet challenging task in video understanding, which aims at detecting the moments where humans naturally perceive event boundaries.

Boundary Detection Diversity +2

One-dimensional Deep Low-rank and Sparse Network for Accelerated MRI

no code implementations9 Dec 2021 Zi Wang, Chen Qian, Di Guo, Hongwei Sun, Rushuai Li, Bo Zhao, Xiaobo Qu

Deep learning has shown astonishing performance in accelerated magnetic resonance imaging (MRI).

Deep Learning

GreedyNASv2: Greedier Search with a Greedy Path Filter

no code implementations CVPR 2022 Tao Huang, Shan You, Fei Wang, Chen Qian, ChangShui Zhang, Xiaogang Wang, Chang Xu

In this paper, we leverage an explicit path filter to capture the characteristics of paths and directly filter those weak ones, so that the search can be thus implemented on the shrunk space more greedily and efficiently.

Weak-shot Semantic Segmentation by Transferring Semantic Affinity and Boundary

no code implementations4 Oct 2021 Siyuan Zhou, Li Niu, Jianlou Si, Chen Qian, Liqing Zhang

As a result, we find that pixel-level annotation of base categories can facilitate affinity learning and propagation, leading to higher-quality CAMs of novel categories.

Segmentation Weakly supervised Semantic Segmentation +1

Counterfactual Inference for Text Classification Debiasing

1 code implementation ACL 2021 Chen Qian, Fuli Feng, Lijie Wen, Chunping Ma, Pengjun Xie

In inference, given a factual input document, Corsair imagines its two counterfactual counterparts to distill and mitigate the two biases captured by the poisonous model.

counterfactual Counterfactual Inference +3

ReSSL: Relational Self-Supervised Learning with Weak Augmentation

2 code implementations NeurIPS 2021 Mingkai Zheng, Shan You, Fei Wang, Chen Qian, ChangShui Zhang, Xiaogang Wang, Chang Xu

Self-supervised Learning (SSL) including the mainstream contrastive learning has achieved great success in learning visual representations without data annotations.

Contrastive Learning Relation +2

ViTAS: Vision Transformer Architecture Search

1 code implementation25 Jun 2021 Xiu Su, Shan You, Jiyang Xie, Mingkai Zheng, Fei Wang, Chen Qian, ChangShui Zhang, Xiaogang Wang, Chang Xu

Vision transformers (ViTs) inherited the success of NLP but their structures have not been sufficiently investigated and optimized for visual tasks.

Inductive Bias Neural Architecture Search

Pareidolia Face Reenactment

no code implementations CVPR 2021 Linsen Song, Wayne Wu, Chaoyou Fu, Chen Qian, Chen Change Loy, Ran He

We present a new application direction named Pareidolia Face Reenactment, which is defined as animating a static illusory face to move in tandem with a human face in the video.

Face Reenactment Texture Synthesis

K-shot NAS: Learnable Weight-Sharing for NAS with K-shot Supernets

no code implementations11 Jun 2021 Xiu Su, Shan You, Mingkai Zheng, Fei Wang, Chen Qian, ChangShui Zhang, Chang Xu

The operation weight for each path is represented as a convex combination of items in a dictionary with a simplex code.

BCNet: Searching for Network Width with Bilaterally Coupled Network

no code implementations CVPR 2021 Xiu Su, Shan You, Fei Wang, Chen Qian, ChangShui Zhang, Chang Xu

In BCNet, each channel is fairly trained and responsible for the same amount of network widths, thus each network width can be evaluated more accurately.

When Human Pose Estimation Meets Robustness: Adversarial Algorithms and Benchmarks

1 code implementation CVPR 2021 Jiahang Wang, Sheng Jin, Wentao Liu, Weizhong Liu, Chen Qian, Ping Luo

However, unlike human vision that is robust to various data corruptions such as blur and pixelation, current pose estimators are easily confused by these corruptions.

Knowledge Distillation Pose Estimation

Invariance and Contraction in Geometrically Periodic Systems with Differential Inclusions

no code implementations29 Apr 2021 Chen Qian, Yongchun Fang

The objective of this paper is to derive the essential invariance and contraction properties for the geometric periodic systems, which can be formulated as a category of differential inclusions, and primarily rendered in the phase coordinate, or the cycle coordinate.

XCloud-pFISTA: A Medical Intelligence Cloud for Accelerated MRI

no code implementations18 Apr 2021 Yirong Zhou, Chen Qian, Yi Guo, Zi Wang, Jian Wang, Biao Qu, Di Guo, Yongfu You, Xiaobo Qu

Machine learning and artificial intelligence have shown remarkable performance in accelerated magnetic resonance imaging (MRI).

Cloud Computing Image Reconstruction

Everything's Talkin': Pareidolia Face Reenactment

1 code implementation7 Apr 2021 Linsen Song, Wayne Wu, Chaoyou Fu, Chen Qian, Chen Change Loy, Ran He

We present a new application direction named Pareidolia Face Reenactment, which is defined as animating a static illusory face to move in tandem with a human face in the video.

Face Reenactment Texture Synthesis

Prioritized Architecture Sampling with Monto-Carlo Tree Search

1 code implementation CVPR 2021 Xiu Su, Tao Huang, Yanxi Li, Shan You, Fei Wang, Chen Qian, ChangShui Zhang, Chang Xu

One-shot neural architecture search (NAS) methods significantly reduce the search cost by considering the whole search space as one network, which only needs to be trained once.

Neural Architecture Search

Reformulating HOI Detection as Adaptive Set Prediction

1 code implementation CVPR 2021 Mingfei Chen, Yue Liao, Si Liu, ZhiYuan Chen, Fei Wang, Chen Qian

To attain this, we map a trainable interaction query set to an interaction prediction set with a transformer.

Ranked #30 on Human-Object Interaction Detection on HICO-DET (using extra training data)

Human-Object Interaction Detection

Locally Free Weight Sharing for Network Width Search

no code implementations ICLR 2021 Xiu Su, Shan You, Tao Huang, Fei Wang, Chen Qian, ChangShui Zhang, Chang Xu

In this paper, to better evaluate each width, we propose a locally free weight sharing strategy (CafeNet) accordingly.

Towards Improving the Consistency, Efficiency, and Flexibility of Differentiable Neural Architecture Search

no code implementations CVPR 2021 Yibo Yang, Shan You, Hongyang Li, Fei Wang, Chen Qian, Zhouchen Lin

Our method enables differentiable sparsification, and keeps the derived architecture equivalent to that of Engine-cell, which further improves the consistency between search and evaluation.

Neural Architecture Search

EnTranNAS: Towards Closing the Gap between the Architectures in Search and Evaluation

no code implementations1 Jan 2021 Yibo Yang, Shan You, Hongyang Li, Fei Wang, Chen Qian, Zhouchen Lin

The Engine-cell is differentiable for architecture search, while the Transit-cell only transits the current sub-graph by architecture derivation.

Neural Architecture Search

Explicit Learning Topology for Differentiable Neural Architecture Search

no code implementations1 Jan 2021 Tao Huang, Shan You, Yibo Yang, Zhuozhuo Tu, Fei Wang, Chen Qian, ChangShui Zhang

Differentiable neural architecture search (NAS) has gained much success in discovering more flexible and diverse cell types.

Neural Architecture Search

Learning With Privileged Tasks

no code implementations ICCV 2021 Yuru Song, Zan Lou, Shan You, Erkun Yang, Fei Wang, Chen Qian, ChangShui Zhang, Xiaogang Wang

Concretely, we introduce a privileged parameter so that the optimization direction does not necessarily follow the gradient from the privileged tasks, but concentrates more on the target tasks.

Multi-Task Learning

GAHNE: Graph-Aggregated Heterogeneous Network Embedding

no code implementations23 Dec 2020 Xiaohe Li, Lijie Wen, Chen Qian, Jianmin Wang

Heterogeneous network embedding aims to embed nodes into low-dimensional vectors which capture rich intrinsic information of heterogeneous networks.

Network Embedding

Agree to Disagree: Adaptive Ensemble Knowledge Distillation in Gradient Space

1 code implementation NeurIPS 2020 Shangchen Du, Shan You, Xiaojie Li, Jianlong Wu, Fei Wang, Chen Qian, ChangShui Zhang

In this paper, we examine the diversity of teacher models in the gradient space and regard the ensemble knowledge distillation as a multi-objective optimization problem so that we can determine a better optimization direction for the training of student network.

Diversity Knowledge Distillation

Directed Graph Attention Neural Network Utilizing 3D Coordinates for Molecular Property Prediction

no code implementations1 Dec 2020 Chen Qian, Yunhai Xiong, Xiang Chen

DGANN distinguishes from previous models with those features: (1) It learns the local chemical environment encoding by graph attention mechanism on chemical bonds.

Graph Attention Molecular Property Prediction +2

Stretchable Cells Help DARTS Search Better

no code implementations18 Nov 2020 Tao Huang, Shan You, Yibo Yang, Zhuozhuo Tu, Fei Wang, Chen Qian, ChangShui Zhang

However, even for this consistent search, the searched cells often suffer from poor performance, especially for the supernet with fewer layers, as current DARTS methods are prone to wide and shallow cells, and this topology collapse induces sub-optimal searched cells.

Neural Architecture Search

AOT: Appearance Optimal Transport Based Identity Swapping for Forgery Detection

no code implementations NeurIPS 2020 Hao Zhu, Chaoyou Fu, Qianyi Wu, Wayne Wu, Chen Qian, Ran He

However, due to the lack of Deepfakes datasets with large variance in appearance, which can be hardly produced by recent identity swapping methods, the detection algorithm may fail in this situation.

Data Agnostic Filter Gating for Efficient Deep Networks

no code implementations28 Oct 2020 Xiu Su, Shan You, Tao Huang, Hongyan Xu, Fei Wang, Chen Qian, ChangShui Zhang, Chang Xu

To deploy a well-trained CNN model on low-end computation edge devices, it is usually supposed to compress or prune the model under certain computation budget (e. g., FLOPs).

HMOR: Hierarchical Multi-Person Ordinal Relations for Monocular Multi-Person 3D Pose Estimation

no code implementations ECCV 2020 Jiefeng Li, Can Wang, Wentao Liu, Chen Qian, Cewu Lu

The HMOR encodes interaction information as the ordinal relations of depths and angles hierarchically, which captures the body-part and joint level semantic and maintains global consistency at the same time.

3D Multi-Person Pose Estimation (absolute) 3D Multi-Person Pose Estimation (root-relative) +2

Whole-Body Human Pose Estimation in the Wild

2 code implementations ECCV 2020 Sheng Jin, Lumin Xu, Jin Xu, Can Wang, Wentao Liu, Chen Qian, Wanli Ouyang, Ping Luo

This paper investigates the task of 2D human whole-body pose estimation, which aims to localize dense landmarks on the entire human body including face, hands, body, and feet.

2D Human Pose Estimation Facial Landmark Detection +2

Differentiable Hierarchical Graph Grouping for Multi-Person Pose Estimation

no code implementations ECCV 2020 Sheng Jin, Wentao Liu, Enze Xie, Wenhai Wang, Chen Qian, Wanli Ouyang, Ping Luo

The modules of HGG can be trained end-to-end with the keypoint detection network and is able to supervise the grouping process in a hierarchical manner.

2D Human Pose Estimation Clustering +5

Effects of Horizons on Entanglement Harvesting

no code implementations2 Jun 2020 Wan Cong, Chen Qian, Michael R. R. Good, Robert B. Mann

We study the effects of horizons on the entanglement harvested between two Unruh-DeWitt detectors via the use of moving mirrors with and without strict horizons.

General Relativity and Quantum Cosmology High Energy Physics - Theory

TAM: Temporal Adaptive Module for Video Recognition

2 code implementations ICCV 2021 Zhao-Yang Liu, Li-Min Wang, Wayne Wu, Chen Qian, Tong Lu

Video data is with complex temporal dynamics due to various factors such as camera motion, speed variation, and different activities.

Action Recognition Video Recognition

Multiple uncertainty relation for accelerated quantum information

no code implementations21 Apr 2020 Chen Qian, Ya-Dong Wu, Jia-Wei Ji, Yunlong Xiao, Barry C. Sanders

The uncertainty principle, first introduced by Heisenberg in inertial frames, clearly distinguishes quantum theories from classical mechanics.

Quantum Physics General Relativity and Quantum Cosmology

TransMoMo: Invariance-Driven Unsupervised Video Motion Retargeting

no code implementations CVPR 2020 Zhuoqian Yang, Wentao Zhu, Wayne Wu, Chen Qian, Qiang Zhou, Bolei Zhou, Chen Change Loy

We present a lightweight video motion retargeting approach TransMoMo that is capable of transferring motion of a person in a source video realistically to another video of a target person.

motion retargeting

GreedyNAS: Towards Fast One-Shot NAS with Greedy Supernet

no code implementations CVPR 2020 Shan You, Tao Huang, Mingmin Yang, Fei Wang, Chen Qian, Chang-Shui Zhang

The training efficiency is thus boosted since the training space has been greedily shrunk from all paths to those potentially-good ones.