Search Results for author: Chen Chen

Found 400 papers, 176 papers with code

Learning to See in the Dark

19 code implementations • CVPR 2018 • Chen Chen, Qifeng Chen, Jia Xu, Vladlen Koltun

Imaging in low light is challenging due to low photon count and low SNR.

Ranked #4 on Image Denoising on SID x300

Deblurring Image Denoising

5,409

Paper
Code

Seeing Motion in the Dark

1 code implementation • ICCV 2019 • Chen Chen, Qifeng Chen, Minh N. Do, Vladlen Koltun

By carefully designing a learning-based pipeline and introducing a new loss function to encourage temporal stability, we train a siamese network on static raw videos, for which ground truth is available, such that the network generalizes to videos of dynamic scenes at test time.

Low-Light Image Enhancement

5,409

Paper
Code

TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities

3 code implementations • 13 Dec 2022 • Zhe Zhao, Yudong Li, Cheng Hou, Jing Zhao, Rong Tian, Weijie Liu, Yiren Chen, Ningyuan Sun, Haoyan Liu, Weiquan Mao, Han Guo, Weigang Guo, Taiqiang Wu, Tao Zhu, Wenhang Shi, Chen Chen, Shan Huang, Sihong Chen, Liqun Liu, Feifei Li, Xiaoshuai Chen, Xingwu Sun, Zhanhui Kang, Xiaoyong Du, Linlin Shen, Kimmo Yan

The proposed pre-training models of different modalities are showing a rising trend of homogeneity in their model structures, which brings the opportunity to implement different pre-training models within a uniform framework.

2,985

Paper
Code

Semantic Image Inpainting with Deep Generative Models

7 code implementations • CVPR 2017 • Raymond A. Yeh, Chen Chen, Teck Yian Lim, Alexander G. Schwing, Mark Hasegawa-Johnson, Minh N. Do

In this paper, we propose a novel method for semantic image inpainting, which generates the missing content by conditioning on the available data.

Image Inpainting

1,308

Paper
Code

Less is More: Removing Text-regions Improves CLIP Training Efficiency and Robustness

1 code implementation • 8 May 2023 • Liangliang Cao, BoWen Zhang, Chen Chen, Yinfei Yang, Xianzhi Du, Wencong Zhang, Zhiyun Lu, Yantao Zheng

In this paper, we discuss two effective approaches to improve the efficiency and robustness of CLIP training: (1) augmenting the training dataset while maintaining the same number of optimization steps, and (2) filtering out samples that contain text regions in the image.

Adversarial Text Retrieval

883

Paper
Code

VEGA: Towards an End-to-End Configurable AutoML Pipeline

1 code implementation • 3 Nov 2020 • Bochao Wang, Hang Xu, Jiajin Zhang, Chen Chen, Xiaozhi Fang, Yixing Xu, Ning Kang, Lanqing Hong, Chenhan Jiang, Xinyue Cai, Jiawei Li, Fengwei Zhou, Yong Li, Zhicheng Liu, Xinghao Chen, Kai Han, Han Shu, Dehua Song, Yunhe Wang, Wei zhang, Chunjing Xu, Zhenguo Li, Wenzhi Liu, Tong Zhang

Automated Machine Learning (AutoML) is an important industrial solution for automatic discovery and deployment of the machine learning models.

BIG-bench Machine Learning Data Augmentation +3

834

Paper
Code

Real-world Anomaly Detection in Surveillance Videos

8 code implementations • CVPR 2018 • Waqas Sultani, Chen Chen, Mubarak Shah

To avoid annotating the anomalous segments or clips in training videos, which is very time consuming, we propose to learn anomaly through the deep multiple instance ranking framework by leveraging weakly labeled training videos, i. e. the training labels (anomalous or normal) are at video-level instead of clip-level.

Ranked #2 on Abnormal Event Detection In Video on UBI-Fights

Activity Recognition Anomaly Detection In Surveillance Videos +2

492

Paper
Code

3D Human Pose Estimation with Spatial and Temporal Transformers

3 code implementations • ICCV 2021 • Ce Zheng, Sijie Zhu, Matias Mendieta, Taojiannan Yang, Chen Chen, Zhengming Ding

Transformer architectures have become the model of choice in natural language processing and are now being introduced into computer vision tasks such as image classification, object detection, and semantic segmentation.

Ranked #13 on Monocular 3D Human Pose Estimation on Human3.6M

Image Classification Monocular 3D Human Pose Estimation +3

470

Paper
Code

Deep Learning-Based Human Pose Estimation: A Survey

1 code implementation • 24 Dec 2020 • Ce Zheng, Wenhan Wu, Chen Chen, Taojiannan Yang, Sijie Zhu, Ju Shen, Nasser Kehtarnavaz, Mubarak Shah

Furthermore, 2D and 3D human pose estimation datasets and evaluation metrics are included.

2D Human Pose Estimation 3D Human Pose Estimation +1

444

Paper
Code

PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation

2 code implementations • CVPR 2023 • Qitao Zhao, Ce Zheng, Mengyuan Liu, Pichao Wang, Chen Chen

However, in real scenarios, the performance of PoseFormer and its follow-ups is limited by two factors: (a) The length of the input joint sequence; (b) The quality of 2D joint detection.

Ranked #8 on 3D Human Pose Estimation on MPI-INF-3DHP

3D Human Pose Estimation Human Dynamics

444

Paper
Code

TransBTS: Multimodal Brain Tumor Segmentation Using Transformer

2 code implementations • 7 Mar 2021 • Wenxuan Wang, Chen Chen, Meng Ding, Jiangyun Li, Hong Yu, Sen Zha

To capture the local 3D context information, the encoder first utilizes 3D CNN to extract the volumetric spatial feature maps.

Brain Tumor Segmentation Image Classification +3

369

Paper
Code

TransBTSV2: Towards Better and More Efficient Volumetric Segmentation of Medical Images

1 code implementation • 30 Jan 2022 • Jiangyun Li, Wenxuan Wang, Chen Chen, Tianxiang Zhang, Sen Zha, Jing Wang, Hong Yu

Different from TransBTS, the proposed TransBTSV2 is not limited to brain tumor segmentation (BTS) but focuses on general medical image segmentation, providing a stronger and more efficient 3D baseline for volumetric segmentation of medical images.

Brain Tumor Segmentation Image Segmentation +3

369

Paper
Code

Self-Supervision with Superpixels: Training Few-shot Medical Image Segmentation without Annotation

3 code implementations • ECCV 2020 • Cheng Ouyang, Carlo Biffi, Chen Chen, Turkay Kart, Huaqi Qiu, Daniel Rueckert

Few-shot semantic segmentation (FSS) has great potential for medical imaging applications.

Cardiac Segmentation Few-Shot Semantic Segmentation +5

360

Paper
Code

RenderIH: A Large-scale Synthetic Dataset for 3D Interacting Hand Pose Estimation

1 code implementation • ICCV 2023 • Lijun Li, Linrui Tian, Xindi Zhang, Qi Wang, Bang Zhang, Mengyuan Liu, Chen Chen

The current interacting hand (IH) datasets are relatively simplistic in terms of background and texture, with hand joints being annotated by a machine annotator, which may result in inaccuracies, and the diversity of pose distribution is limited.

3D Interacting Hand Pose Estimation Hand Pose Estimation

299

Paper
Code

BiomedGPT: A Unified and Generalist Biomedical Generative Pre-trained Transformer for Vision, Language, and Multimodal Tasks

1 code implementation • 26 May 2023 • Kai Zhang, Jun Yu, Eashan Adhikarla, Rong Zhou, Zhiling Yan, Yixin Liu, Zhengliang Liu, Lifang He, Brian Davison, Xiang Li, Hui Ren, Sunyang Fu, James Zou, Wei Liu, Jing Huang, Chen Chen, Yuyin Zhou, Tianming Liu, Xun Chen, Yong Chen, Quanzheng Li, Hongfang Liu, Lichao Sun

Conventional task- and modality-specific artificial intelligence (AI) models are inflexible in real-world deployment and maintenance for biomedicine.

Ranked #1 on Text Summarization on MeQSum

Image Captioning Medical Visual Question Answering +5

287

Paper
Code

GlyphDraw: Seamlessly Rendering Text with Intricate Spatial Structures in Text-to-Image Generation

3 code implementations • 31 Mar 2023 • Jian Ma, Mingjun Zhao, Chen Chen, Ruichen Wang, Di Niu, Haonan Lu, Xiaodong Lin

Recent breakthroughs in the field of language-guided image generation have yielded impressive achievements, enabling the creation of high-quality and diverse images based on user instructions. Although the synthesis performance is fascinating, one significant limitation of current image generation models is their insufficient ability to generate text coherently within images, particularly for complex glyph structures like Chinese characters.

Optical Character Recognition (OCR) Text-to-Image Generation

255

Paper
Code

Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning

1 code implementation • 21 Jul 2023 • Jian Ma, Junhao Liang, Chen Chen, Haonan Lu

In this paper, we propose Subject-Diffusion, a novel open-domain personalized image generation model that, in addition to not requiring test-time fine-tuning, also only requires a single reference image to support personalized generation of single- or multi-subject in any domain.

Diffusion Personalization Tuning Free Text-to-Image Generation

255

Paper
Code

AIM: Adapting Image Models for Efficient Video Action Recognition

1 code implementation • 6 Feb 2023 • Taojiannan Yang, Yi Zhu, Yusheng Xie, Aston Zhang, Chen Chen, Mu Li

Recent vision transformer based video models mostly follow the ``image pre-training then finetuning" paradigm and have achieved great success on multiple video benchmarks.

Ranked #2 on Action Recognition on Diving-48 (using extra training data)

Action Classification Action Recognition +2

240

Paper
Code

Lightweight image super-resolution with enhanced CNN

1 code implementation • 8 Jul 2020 • Chunwei Tian, Ruibin Zhuge, Zhihao Wu, Yong Xu, WangMeng Zuo, Chen Chen, Chia-Wen Lin

Finally, the IRB uses coarse high-frequency features from the RB to learn more accurate SR features and construct a SR image.

Ranked #45 on Image Super-Resolution on Set14 - 4x upscaling

Image Super-Resolution

217

Paper
Code

3D Dilated Multi-Fiber Network for Real-time Brain Tumor Segmentation in MRI

1 code implementation • 6 Apr 2019 • Chen Chen, Xiaopeng Liu, Meng Ding, Junfeng Zheng, Jiangyun Li

In this work, we aim to segment brain MRI volumes.

Brain Tumor Segmentation Segmentation +1

176

Paper
Code

MutualNet: Adaptive ConvNet via Mutual Learning from Network Width and Resolution

2 code implementations • ECCV 2020 • Taojiannan Yang, Sijie Zhu, Chen Chen, Shen Yan, Mi Zhang, Andrew Willis

We propose the width-resolution mutual learning method (MutualNet) to train a network that is executable at dynamic resource constraints to achieve adaptive accuracy-efficiency trade-offs at runtime.

Instance Segmentation object-detection +3

158

Paper
Code

MutualNet: Adaptive ConvNet via Mutual Learning from Different Model Configurations

1 code implementation • 14 May 2021 • Taojiannan Yang, Sijie Zhu, Matias Mendieta, Pu Wang, Ravikumar Balakrishnan, Minwoo Lee, Tao Han, Mubarak Shah, Chen Chen

MutualNet is a general training methodology that can be applied to various network structures (e. g., 2D networks: MobileNets, ResNet, 3D networks: SlowFast, X3D) and various tasks (e. g., image classification, object detection, segmentation, and action recognition), and is demonstrated to achieve consistent improvements on a variety of datasets.

Action Recognition Image Classification +2

158

Paper
Code

ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback

1 code implementation • 11 Apr 2024 • Ming Li, Taojiannan Yang, Huafeng Kuang, Jie Wu, Zhaoning Wang, Xuefeng Xiao, Chen Chen

To this end, we propose ControlNet++, a novel approach that improves controllable generation by explicitly optimizing pixel-level cycle consistency between generated images and conditional controls.

SSIM

157

Paper
Code

Multi-Scale Progressive Fusion Network for Single Image Deraining

3 code implementations • CVPR 2020 • Kui Jiang, Zhongyuan Wang, Peng Yi, Chen Chen, Baojin Huang, Yimin Luo, Jiayi Ma, Junjun Jiang

In this work, we explore the multi-scale collaborative representation for rain streaks from the perspective of input image scales and hierarchical deep features in a unified framework, termed multi-scale progressive fusion network (MSPFN) for single image rain streak removal.

Ranked #6 on Single Image Deraining on Test2800

Single Image Deraining

154

Paper
Code

Enhanced 3D Human Pose Estimation from Videos by using Attention-Based Neural Network with Dilated Convolutions

1 code implementation • 4 Mar 2021 • Ruixu Liu, Ju Shen, He Wang, Chen Chen, Sen-ching Cheung, Vijayan K. Asari

In this work, we show a systematic design (from 2D to 3D) for how conventional networks and other forms of constraints can be incorporated into the attention framework for learning long-range dependencies for the task of pose estimation.

Ranked #6 on 3D Human Pose Estimation on HumanEva-I

2D Pose Estimation 3D Human Pose Estimation

152

Paper
Code

GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators

1 code implementation • 10 Feb 2024 • Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Dong Zhang, Zhehuai Chen, Eng Siong Chng

Leveraging the rich linguistic knowledge and strong reasoning abilities of LLMs, our new paradigm can integrate the rich information in N-best candidates to generate a higher-quality translation result.

Machine Translation Translation

146

Paper
Code

AlignDet: Aligning Pre-training and Fine-tuning in Object Detection

1 code implementation • ICCV 2023 • Ming Li, Jie Wu, Xionghui Wang, Chen Chen, Jie Qin, Xuefeng Xiao, Rui Wang, Min Zheng, Xin Pan

To this end, we propose AlignDet, a unified pre-training framework that can be adapted to various existing detectors to alleviate the discrepancies.

object-detection Object Detection

129

Paper
Code

Learning Normal Dynamics in Videos with Meta Prototype Network

1 code implementation • CVPR 2021 • Hui Lv, Chen Chen, Zhen Cui, Chunyan Xu, Yong Li, Jian Yang

Frame reconstruction (current or future frame) based on Auto-Encoder (AE) is a popular method for video anomaly detection.

Anomaly Detection Meta-Learning +1

124

Paper
Code

Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition

1 code implementation • AAAI 2022 2021 • Yue He, Chen Chen, Jing Zhang, Juhua Liu, Fengxiang He, Chaoyue Wang, Bo Du

Technically, given the character segmentation maps predicted by a VR model, we construct a subgraph for each instance, where nodes represent the pixels in it and edges are added between nodes based on their spatial similarity.

Ranked #9 on Scene Text Recognition on ICDAR2015 (using extra training data)

Language Modelling Scene Text Recognition

105

Paper
Code

Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information

1 code implementation • 29 Mar 2022 • Heqing Zou, Yuke Si, Chen Chen, Deepu Rajan, Eng Siong Chng

In this paper, we propose an end-to-end speech emotion recognition system using multi-level acoustic information with a newly designed co-attention module.

Speech Emotion Recognition

103

Paper
Code

TransGeo: Transformer Is All You Need for Cross-view Image Geo-localization

1 code implementation • CVPR 2022 • Sijie Zhu, Mubarak Shah, Chen Chen

It does not rely on polar transform and infers faster than CNN-based methods.

Ranked #3 on Image-Based Localization on VIGOR Cross Area

Image-Based Localization

Paper
Code

GradAug: A New Regularization Method for Deep Neural Networks

1 code implementation • NeurIPS 2020 • Taojiannan Yang, Sijie Zhu, Chen Chen

The key idea is utilizing randomly transformed training samples to regularize a set of sub-networks, which are originated by sampling the width of the original network, in the training process.

Instance Segmentation object-detection +2

Paper
Code

Density Map Guided Object Detection in Aerial Images

1 code implementation • 12 Apr 2020 • Changlin Li, Taojiannan Yang, Sijie Zhu, Chen Chen, Shanyue Guan

Specifically, we propose a Density-Map guided object detection Network (DMNet), which is inspired from the observation that the object density map of an image presents how objects distribute in terms of the pixel intensity of the map.

Image Cropping Object +3

Paper
Code

SASA: Semantics-Augmented Set Abstraction for Point-based 3D Object Detection

1 code implementation • 6 Jan 2022 • Chen Chen, Zhe Chen, Jing Zhang, DaCheng Tao

We observe that the prevailing set abstraction design for down-sampling points may maintain too much unimportant background information that can affect feature learning for detecting objects.

3D Object Detection object-detection

Paper
Code

Large Language Models are Efficient Learners of Noise-Robust Speech Recognition

1 code implementation • 19 Jan 2024 • Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Chao Zhang, Pin-Yu Chen, EnSiong Chng

To this end, we propose to extract a language-space noise embedding from the N-best list to represent the noise conditions of source speech, which can promote the denoising process in GER.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +6

Paper
Code

Visual Explanation for Deep Metric Learning

1 code implementation • 27 Sep 2019 • Sijie Zhu, Taojiannan Yang, Chen Chen

This work explores the visual explanation for deep metric learning and its applications.

Metric Learning Retrieval

Paper
Code

Causality-inspired Single-source Domain Generalization for Medical Image Segmentation

1 code implementation • 24 Nov 2021 • Cheng Ouyang, Chen Chen, Surui Li, Zeju Li, Chen Qin, Wenjia Bai, Daniel Rueckert

In this work, we investigate the single-source domain generalization problem: training a deep network that is robust to unseen domains, under the condition that training data is only available from one source domain, which is common in medical imaging applications.

Data Augmentation Domain Generalization +4

Paper
Code

Noise-aware Speech Enhancement using Diffusion Probabilistic Model

1 code implementation • 16 Jul 2023 • Yuchen Hu, Chen Chen, Ruizhe Li, Qiushi Zhu, Eng Siong Chng

Specifically, we design a noise classification (NC) model to produce acoustic embedding as a noise conditioner for guiding the reverse denoising process.

Denoising Multi-Task Learning +2

Paper
Code

Consistency-based Active Learning for Object Detection

1 code implementation • 18 Mar 2021 • Weiping Yu, Sijie Zhu, Taojiannan Yang, Chen Chen

Unlike most recent works that focused on applying active learning for image classification, we propose an effective Consistency-based Active Learning method for object Detection (CALD), which fully explores the consistency between original and augmented data.

Active Learning Classification +5

Paper
Code

The Multi-Agent Behavior Dataset: Mouse Dyadic Social Interactions

1 code implementation • 6 Apr 2021 • Jennifer J. Sun, Tomomi Karigo, Dipam Chakraborty, Sharada P. Mohanty, Benjamin Wild, Quan Sun, Chen Chen, David J. Anderson, Pietro Perona, Yisong Yue, Ann Kennedy

Multi-agent behavior modeling aims to understand the interactions that occur between agents.

Style Transfer

Paper
Code

SoccerNet 2023 Challenges Results

2 code implementations • 12 Sep 2023 • Anthony Cioppa, Silvio Giancola, Vladimir Somers, Floriane Magera, Xin Zhou, Hassan Mkhallati, Adrien Deliège, Jan Held, Carlos Hinojosa, Amir M. Mansourian, Pierre Miralles, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdullah Kamal, Adrien Maglo, Albert Clapés, Amr Abdelaziz, Artur Xarles, Astrid Orcesi, Atom Scott, Bin Liu, Byoungkwon Lim, Chen Chen, Fabian Deuser, Feng Yan, Fufu Yu, Gal Shitrit, Guanshuo Wang, Gyusik Choi, Hankyul Kim, Hao Guo, Hasby Fahrudin, Hidenari Koguchi, Håkan Ardö, Ibrahim Salah, Ido Yerushalmy, Iftikar Muhammad, Ikuma Uchida, Ishay Be'ery, Jaonary Rabarisoa, Jeongae Lee, Jiajun Fu, Jianqin Yin, Jinghang Xu, Jongho Nang, Julien Denize, Junjie Li, Junpei Zhang, Juntae Kim, Kamil Synowiec, Kenji Kobayashi, Kexin Zhang, Konrad Habel, Kota Nakajima, Licheng Jiao, Lin Ma, Lizhi Wang, Luping Wang, Menglong Li, Mengying Zhou, Mohamed Nasr, Mohamed Abdelwahed, Mykola Liashuha, Nikolay Falaleev, Norbert Oswald, Qiong Jia, Quoc-Cuong Pham, Ran Song, Romain Hérault, Rui Peng, Ruilong Chen, Ruixuan Liu, Ruslan Baikulov, Ryuto Fukushima, Sergio Escalera, Seungcheon Lee, Shimin Chen, Shouhong Ding, Taiga Someya, Thomas B. Moeslund, Tianjiao Li, Wei Shen, Wei zhang, Wei Li, Wei Dai, Weixin Luo, Wending Zhao, Wenjie Zhang, Xinquan Yang, Yanbiao Ma, Yeeun Joo, Yingsen Zeng, Yiyang Gan, Yongqiang Zhu, Yujie Zhong, Zheng Ruan, Zhiheng Li, Zhijian Huang, Ziyu Meng

More information on the tasks, challenges, and leaderboards are available on https://www. soccer-net. org.

Action Spotting Camera Calibration +3

Paper
Code

SoccerNet 2022 Challenges Results

7 code implementations • 5 Oct 2022 • Silvio Giancola, Anthony Cioppa, Adrien Deliège, Floriane Magera, Vladimir Somers, Le Kang, Xin Zhou, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdulrahman Darwish, Adrien Maglo, Albert Clapés, Andreas Luyts, Andrei Boiarov, Artur Xarles, Astrid Orcesi, Avijit Shah, Baoyu Fan, Bharath Comandur, Chen Chen, Chen Zhang, Chen Zhao, Chengzhi Lin, Cheuk-Yiu Chan, Chun Chuen Hui, Dengjie Li, Fan Yang, Fan Liang, Fang Da, Feng Yan, Fufu Yu, Guanshuo Wang, H. Anthony Chan, He Zhu, Hongwei Kan, Jiaming Chu, Jianming Hu, Jianyang Gu, Jin Chen, João V. B. Soares, Jonas Theiner, Jorge De Corte, José Henrique Brito, Jun Zhang, Junjie Li, Junwei Liang, Leqi Shen, Lin Ma, Lingchi Chen, Miguel Santos Marques, Mike Azatov, Nikita Kasatkin, Ning Wang, Qiong Jia, Quoc Cuong Pham, Ralph Ewerth, Ran Song, RenGang Li, Rikke Gade, Ruben Debien, Runze Zhang, Sangrok Lee, Sergio Escalera, Shan Jiang, Shigeyuki Odashima, Shimin Chen, Shoichi Masui, Shouhong Ding, Sin-wai Chan, Siyu Chen, Tallal El-Shabrawy, Tao He, Thomas B. Moeslund, Wan-Chi Siu, Wei zhang, Wei Li, Xiangwei Wang, Xiao Tan, Xiaochuan Li, Xiaolin Wei, Xiaoqing Ye, Xing Liu, Xinying Wang, Yandong Guo, YaQian Zhao, Yi Yu, YingYing Li, Yue He, Yujie Zhong, Zhenhua Guo, Zhiheng Li

The SoccerNet 2022 challenges were the second annual video understanding challenges organized by the SoccerNet team.

Action Spotting Camera Calibration +3

Paper
Code

Unsupervised Anomaly Detection in Medical Images Using Masked Diffusion Model

1 code implementation • 31 May 2023 • Hasan Iqbal, Umar Khalid, Jing Hua, Chen Chen

It can be challenging to identify brain MRI anomalies using supervised deep-learning techniques due to anatomical heterogeneity and the requirement for pixel-level labeling.

Anatomy Unsupervised Anomaly Detection

Paper
Code

VIGOR: Cross-View Image Geo-localization beyond One-to-one Retrieval

1 code implementation • CVPR 2021 • Sijie Zhu, Taojiannan Yang, Chen Chen

In this paper, we redefine this problem with a more realistic assumption that the query image can be arbitrary in the area of interest and the reference images are captured before the queries emerge.

Image-Based Localization Image Retrieval

Paper
Code

Stock price prediction using Generative Adversarial Networks

1 code implementation • Journal of Computer Science 2021 • HungChun Lin, Chen Chen, Gaofeng Huang, Amir Jafari

In this paper, it proposes a stock prediction model using Generative Adversarial Network (GAN) with Gated Recurrent Units (GRU) used as a generator that inputs historical stock price and generates future stock price and Convolutional Neural Network (CNN) as a discriminator to discriminate between the real stock price and generated stock price.

Fraud Detection Generative Adversarial Network +5

Paper
Code

LatentEditor: Text Driven Local Editing of 3D Scenes

1 code implementation • 14 Dec 2023 • Umar Khalid, Hasan Iqbal, Nazmul Karim, Jing Hua, Chen Chen

Our approach achieves faster editing speeds and superior output quality compared to existing 3D editing models, bridging the gap between textual instructions and high-quality 3D scene editing in latent space.

3D scene Editing Denoising

Paper
Code

Geometric Transformers for Protein Interface Contact Prediction

2 code implementations • ICLR 2022 • Alex Morehead, Chen Chen, Jianlin Cheng

Computational methods for predicting the interface contacts between proteins come highly sought after for drug discovery as they can significantly advance the accuracy of alternative approaches, such as protein-protein docking, protein function analysis tools, and other computational methods for protein bioinformatics.

Drug Discovery Translation

Paper
Code

MSINet: Twins Contrastive Search of Multi-Scale Interaction for Object ReID

1 code implementation • CVPR 2023 • Jianyang Gu, Kai Wang, Hao Luo, Chen Chen, Wei Jiang, Yuqiang Fang, Shanghang Zhang, Yang You, Jian Zhao

Neural Architecture Search (NAS) has been increasingly appealing to the society of object Re-Identification (ReID), for that task-specific architectures significantly improve the retrieval performance.

Ranked #8 on Vehicle Re-Identification on VehicleID Large

Image Classification Neural Architecture Search +3

Paper
Code

SuperPCA: A Superpixelwise PCA Approach for Unsupervised Feature Extraction of Hyperspectral Imagery

1 code implementation • 26 Jun 2018 • Junjun Jiang, Jiayi Ma, Chen Chen, Zhongyuan Wang, Zhihua Cai, Lizhe Wang

(1) Unlike the traditional PCA method based on a whole image, SuperPCA takes into account the diversity in different homogeneous regions, that is, different regions should have different projections.

Dimensionality Reduction General Classification

Paper
Code

Realistic Adversarial Data Augmentation for MR Image Segmentation

1 code implementation • 23 Jun 2020 • Chen Chen, Chen Qin, Huaqi Qiu, Cheng Ouyang, Shuo Wang, Liang Chen, Giacomo Tarroni, Wenjia Bai, Daniel Rueckert

In this work, we propose an adversarial data augmentation method for training neural networks for medical image segmentation.

Data Augmentation Image Segmentation +3

Paper
Code

Enhancing MR Image Segmentation with Realistic Adversarial Data Augmentation

1 code implementation • 7 Aug 2021 • Chen Chen, Chen Qin, Cheng Ouyang, Zeju Li, Shuo Wang, Huaqi Qiu, Liang Chen, Giacomo Tarroni, Wenjia Bai, Daniel Rueckert

The success of neural networks on medical image segmentation tasks typically relies on large labeled datasets for model training.

Cardiac Segmentation Data Augmentation +3

Paper
Code

MOFI: Learning Image Representations from Noisy Entity Annotated Images

1 code implementation • 13 Jun 2023 • Wentao Wu, Aleksei Timofeev, Chen Chen, BoWen Zhang, Kun Duan, Shuangning Liu, Yantao Zheng, Jonathon Shlens, Xianzhi Du, Zhe Gan, Yinfei Yang

Our approach involves employing a named entity recognition model to extract entities from the alt-text, and then using a CLIP model to select the correct entities as labels of the paired image.

Image Classification Image Retrieval +3

Paper
Code

Fairness in Graph Mining: A Survey

2 code implementations • 21 Apr 2022 • Yushun Dong, Jing Ma, Song Wang, Chen Chen, Jundong Li

Recently, algorithmic fairness has been extensively studied in graph-based applications.

Fairness Graph Mining

Paper
Code

Towards Resolving the Challenge of Long-tail Distribution in UAV Images for Object Detection

1 code implementation • 7 Nov 2020 • Weiping Yu, Taojiannan Yang, Chen Chen

To this end, we rethink long-tailed object detection in UAV images and propose the Dual Sampler and Head detection Network (DSHNet), which is the first work that aims to resolve long-tail distribution in UAV images.

Head Detection Image Cropping +2

Paper
Code

Towards Geospatial Foundation Models via Continual Pretraining

2 code implementations • ICCV 2023 • Matias Mendieta, Boran Han, Xingjian Shi, Yi Zhu, Chen Chen

Geospatial technologies are becoming increasingly essential in our world for a wide range of applications, including agriculture, urban planning, and disaster response.

Change Detection Continual Pretraining +4

Paper
Code

Cross-View Image Matching for Geo-localization in Urban Environments

1 code implementation • CVPR 2017 • Yicong Tian, Chen Chen, Mubarak Shah

Next, for each building in the query image, we retrieve the $k$ nearest neighbors from the reference buildings using a Siamese network trained on both positive matching image pairs and negative pairs.

Cross-View Image-to-Image Translation Image Classification +2

Paper
Code

Local Learning Matters: Rethinking Data Heterogeneity in Federated Learning

1 code implementation • CVPR 2022 • Matias Mendieta, Taojiannan Yang, Pu Wang, Minwoo Lee, Zhengming Ding, Chen Chen

To alleviate this issue, many FL algorithms focus on mitigating the effects of data heterogeneity across clients by introducing a variety of proximal terms, some incurring considerable compute and/or memory overheads, to restrain local updates with respect to the global model.

Federated Learning Privacy Preserving

Paper
Code

HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models

1 code implementation • NeurIPS 2023 • Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Sabato Macro Siniscalchi, Pin-Yu Chen, Eng Siong Chng

We make our results publicly accessible for reproducible pipelines with released pre-trained models, thus providing a new evaluation paradigm for ASR error correction with LLMs.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Code

POSTER: A Pyramid Cross-Fusion Transformer Network for Facial Expression Recognition

1 code implementation • 8 Apr 2022 • Ce Zheng, Matias Mendieta, Chen Chen

In this paper, we propose a two-stream Pyramid crOss-fuSion TransformER network (POSTER), that aims to holistically solve all three issues.

Facial Expression Recognition Facial Expression Recognition (FER)

Paper
Code

POTTER: Pooling Attention Transformer for Efficient Human Mesh Recovery

1 code implementation • CVPR 2023 • Ce Zheng, Xianpeng Liu, Guo-Jun Qi, Chen Chen

In this paper, we propose a pure transformer architecture named POoling aTtention TransformER (POTTER) for the HMR task from single images.

Ranked #33 on 3D Human Pose Estimation on 3DPW

3D Human Pose Estimation Human Mesh Recovery

Paper
Code

CEFHRI: A Communication Efficient Federated Learning Framework for Recognizing Industrial Human-Robot Interaction

1 code implementation • 29 Aug 2023 • Umar Khalid, Hasan Iqbal, Saeed Vahidian, Jing Hua, Chen Chen

Machine learning plays a vital role in industrial HRI by enhancing the adaptability and autonomy of robots in complex environments.

Federated Learning Image Classification +1

Paper
Code

DIPS-Plus: The Enhanced Database of Interacting Protein Structures for Interface Prediction

1 code implementation • 6 Jun 2021 • Alex Morehead, Chen Chen, Ada Sedova, Jianlin Cheng

In this work, we expand on a dataset recently introduced for this task, the Database of Interacting Protein Structures (DIPS), to present DIPS-Plus, an enhanced, feature-rich dataset of 42, 112 complexes for geometric deep learning of protein interfaces.

Drug Discovery Protein Interface Prediction

Paper
Code

MGANet: A Robust Model for Quality Enhancement of Compressed Video

2 code implementations • 22 Nov 2018 • Xiandong Meng, Xuan Deng, Shuyuan Zhu, Shuaicheng Liu, Chuan Wang, Chen Chen, Bing Zeng

In video compression, most of the existing deep learning approaches concentrate on the visual quality of a single frame, while ignoring the useful priors as well as the temporal information of adjacent frames.

Video Compression

Paper
Code

RaSa: Relation and Sensitivity Aware Representation Learning for Text-based Person Search

1 code implementation • 23 May 2023 • Yang Bai, Min Cao, Daming Gao, Ziqiang Cao, Chen Chen, Zhenfeng Fan, Liqiang Nie, Min Zhang

RA offsets the overfitting risk by introducing a novel positive relation detection task (i. e., learning to distinguish strong and weak positive pairs).

Ranked #2 on Text based Person Retrieval on RSTPReid

Person Search Relation +2

Paper
Code

MMM: Generative Masked Motion Model

1 code implementation • 6 Dec 2023 • Ekkasit Pinyoanuntapong, Pu Wang, Minwoo Lee, Chen Chen

MMM consists of two key components: (1) a motion tokenizer that transforms 3D human motion into a sequence of discrete tokens in latent space, and (2) a conditional masked motion transformer that learns to predict randomly masked motion tokens, conditioned on the pre-computed text tokens.

Ranked #5 on Motion Synthesis on KIT Motion-Language

Motion Synthesis

Paper
Code

A Large-scale Study of Spatiotemporal Representation Learning with a New Benchmark on Action Recognition

1 code implementation • ICCV 2023 • Andong Deng, Taojiannan Yang, Chen Chen

The goal of building a benchmark (suite of datasets) is to provide a unified protocol for fair evaluation and thus facilitate the evolution of a specific area.

Action Recognition Representation Learning +3

Paper
Code

R2Former: Unified Retrieval and Reranking Transformer for Place Recognition

1 code implementation • CVPR 2023 • Sijie Zhu, Linjie Yang, Chen Chen, Mubarak Shah, Xiaohui Shen, Heng Wang

Visual Place Recognition (VPR) estimates the location of query images by matching them with images in a reference database.

Feature Correlation Retrieval +1

Paper
Code

Cooperative Training and Latent Space Data Augmentation for Robust Medical Image Segmentation

2 code implementations • 2 Jul 2021 • Chen Chen, Kerstin Hammernik, Cheng Ouyang, Chen Qin, Wenjia Bai, Daniel Rueckert

In this paper, we present a cooperative framework for training image segmentation models and a latent space augmentation method for generating hard examples.

Data Augmentation Image Reconstruction +4

Paper
Code

Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition

2 code implementations • 11 Oct 2021 • Yuchen Hu, Nana Hou, Chen Chen, Eng Siong Chng

Speech enhancement (SE) aims to suppress the additive noise from a noisy speech signal to improve the speech's perceptual quality and intelligibility.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Code

Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition

1 code implementation • 28 Mar 2022 • Yuchen Hu, Nana Hou, Chen Chen, Eng Siong Chng

Then, we propose style learning to map the fused feature close to clean feature, in order to learn latent speech information from the latter, i. e., clean "speech style".

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Code

MaxStyle: Adversarial Style Composition for Robust Medical Image Segmentation

1 code implementation • 2 Jun 2022 • Chen Chen, Zeju Li, Cheng Ouyang, Matt Sinclair, Wenjia Bai, Daniel Rueckert

We propose a novel data augmentation framework called MaxStyle, which maximizes the effectiveness of style augmentation for model OOD performance.

Data Augmentation Image Segmentation +2

Paper
Code

Efficient Distribution Similarity Identification in Clustered Federated Learning via Principal Angles Between Client Data Subspaces

1 code implementation • 21 Sep 2022 • Saeed Vahidian, Mahdi Morafah, Weijia Wang, Vyacheslav Kungurtsev, Chen Chen, Mubarak Shah, Bill Lin

This small set of principal vectors is provided to the server so that the server can directly identify distribution similarities among the clients to form clusters.

Federated Learning

Paper
Code

Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models

1 code implementation • 23 May 2023 • Ruichen Wang, Zekang Chen, Chen Chen, Jian Ma, Haonan Lu, Xiaodong Lin

Our approach produces a more semantically accurate synthesis by constraining the attention regions of each token in the prompt to the image.

Attribute Image Generation

Paper
Code

Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation

1 code implementation • 22 Feb 2023 • Yuchen Hu, Chen Chen, Heqing Zou, Xionghu Zhong, Eng Siong Chng

To alleviate this problem, we propose a novel network to unify speech enhancement and separation with gradient modulation to improve noise-robustness.

Multi-Task Learning Speech Enhancement +2

Paper
Code

Multi-view Self-supervised Disentanglement for General Image Denoising

1 code implementation • ICCV 2023 • Hao Chen, Chenyuan Qu, Yu Zhang, Chen Chen, Jianbo Jiao

It is understandable as the model is designed to learn paired mapping (e. g. from a noisy image to its clean version).

Ranked #1 on Denoising on CBSD68 sigm75

Disentanglement Image Denoising +1

Paper
Code

Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning

1 code implementation • 6 Dec 2023 • Xinshun Wang, Zhongbin Fang, Xia Li, Xiangtai Li, Chen Chen, Mengyuan Liu

Under this setting, the model can perceive tasks from prompts and accomplish them without any extra task-specific head predictions or model fine-tuning.

In-Context Learning motion prediction +1

Paper
Code

Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive Learning

1 code implementation • 29 Sep 2020 • Haotian Fu, Hongyao Tang, Jianye Hao, Chen Chen, Xidong Feng, Dong Li, Wulong Liu

How to collect informative trajectories of which the corresponding context reflects the specification of tasks?

Contrastive Learning Meta Reinforcement Learning +2

Paper
Code

What About Inputing Policy in Value Function: Policy Representation and Policy-extended Value Function Approximator

1 code implementation • NeurIPS 2021 • Hongyao Tang, Zhaopeng Meng, Jianye Hao, Chen Chen, Daniel Graves, Dong Li, Changmin Yu, Hangyu Mao, Wulong Liu, Yaodong Yang, Wenyuan Tao, Li Wang

We study Policy-extended Value Function Approximator (PeVFA) in Reinforcement Learning (RL), which extends conventional value function approximator (VFA) to take as input not only the state (and action) but also an explicit policy representation.

Continuous Control Contrastive Learning +3

Paper
Code

Annotating Columns with Pre-trained Language Models

1 code implementation • 5 Apr 2021 • Yoshihiko Suhara, Jinfeng Li, Yuliang Li, Dan Zhang, Çağatay Demiralp, Chen Chen, Wang-Chiew Tan

Inferring meta information about tables, such as column headers or relationships between columns, is an active research topic in data management as we find many tables are missing some of this information.

Ranked #1 on Column Type Annotation on VizNet-Sato-MultiColumn

Columns Property Annotation Column Type Annotation +3

Paper
Code

A Lightweight Graph Transformer Network for Human Mesh Reconstruction from 2D Human Pose

1 code implementation • 24 Nov 2021 • Ce Zheng, Matias Mendieta, Pu Wang, Aidong Lu, Chen Chen

We propose a pose analysis module that uses graph transformers to exploit structured and implicit joint correlations, and a mesh regression module that combines the extracted pose feature with the mesh template to reconstruct the final human mesh.

Ranked #60 on 3D Human Pose Estimation on 3DPW

3D Human Pose Estimation 3D Human Shape Estimation +2

Paper
Code

Knowledge Is Flat: A Seq2Seq Generative Framework for Various Knowledge Graph Completion

1 code implementation • COLING 2022 • Chen Chen, YuFei Wang, Bing Li, Kwok-Yan Lam

To remedy the KG structure information loss from the "flat" text, we further improve the input representations of entities and relations, and the inference algorithm in KG-S2S.

Knowledge Graph Completion

Paper
Code

OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition

1 code implementation • 30 Nov 2023 • Tongjia Chen, Hongshan Yu, Zhengeng Yang, Zechuan Li, Wei Sun, Chen Chen

Due to the resource-intensive nature of training vision-language models on expansive video data, a majority of studies have centered on adapting pre-trained image-language models to the video domain.

Ranked #2 on Zero-Shot Action Recognition on Kinetics

Descriptive Language Modelling +5

Paper
Code

GAMa: Cross-view Video Geo-localization

1 code implementation • 6 Jul 2022 • Shruti Vyas, Chen Chen, Mubarak Shah

There are no existing datasets for this problem, therefore we propose GAMa dataset, a large-scale dataset with ground videos and corresponding aerial images.

Paper
Code

Memory Attention Networks for Skeleton-based Action Recognition

1 code implementation • 23 Apr 2018 • Chunyu Xie, Ce Li, Baochang Zhang, Chen Chen, Jungong Han, Changqing Zou, Jianzhuang Liu

Specifically, the TARM is deployed in a residual learning module that employs a novel attention learning network to recalibrate the temporal attention of frames in a skeleton sequence.

Ranked #89 on Skeleton Based Action Recognition on NTU RGB+D

Action Recognition Skeleton Based Action Recognition +1

Paper
Code

Degrade is Upgrade: Learning Degradation for Low-light Image Enhancement

1 code implementation • 19 Mar 2021 • Kui Jiang, Zhongyuan Wang, Zheng Wang, Chen Chen, Peng Yi, Tao Lu, Chia-Wen Lin

Different from existing methods tending to accomplish the relighting task directly by ignoring the fidelity and naturalness recovery, we investigate the intrinsic degradation and relight the low-light image while refining the details and color in two steps.

Low-Light Image Enhancement

Paper
Code

Balanced Knowledge Distillation for Long-tailed Learning

1 code implementation • 21 Apr 2021 • Shaoyu Zhang, Chen Chen, Xiyuan Hu, Silong Peng

Existing methods usually modify the classification loss to increase the learning focus on tail classes, which unexpectedly sacrifice the performance on head classes.

Knowledge Distillation

Paper
Code

Dipping PLMs Sauce: Bridging Structure and Text for Effective Knowledge Graph Completion via Conditional Soft Prompting

2 code implementations • 4 Jul 2023 • Chen Chen, YuFei Wang, Aixin Sun, Bing Li, Kwok-Yan Lam

However, the fine-tuned PLMs often overwhelmingly focus on the textual information and overlook structural knowledge.

Knowledge Graph Completion

Paper
Code

Adaptive FSS: A Novel Few-Shot Segmentation Framework via Prototype Enhancement

2 code implementations • 25 Dec 2023 • Jing Wang, Jinagyun Li, Chen Chen, Yisi Zhang, Haoran Shen, Tianxiang Zhang

In this paper, we propose a novel framework based on the adapter mechanism, namely Adaptive FSS, which can efficiently adapt the existing FSS model to the novel classes.

Meta-Learning

Paper
Code

Rethinking Data Heterogeneity in Federated Learning: Introducing a New Notion and Standard Benchmarks

1 code implementation • 30 Sep 2022 • Mahdi Morafah, Saeed Vahidian, Chen Chen, Mubarak Shah, Bill Lin

Though successful, federated learning presents new challenges for machine learning, especially when the issue of data heterogeneity, also known as Non-IID data, arises.

Federated Learning

Paper
Code

Improving Image Captioning with Conditional Generative Adversarial Nets

1 code implementation • 18 May 2018 • Chen Chen, Shuai Mu, Wanpeng Xiao, Zexiong Ye, Liesi Wu, Qi Ju

In this paper, we propose a novel conditional-generative-adversarial-nets-based image captioning framework as an extension of traditional reinforcement-learning (RL)-based encoder-decoder architecture.

Image Captioning Reinforcement Learning (RL)

Paper
Code

Multi-Task Learning for Left Atrial Segmentation on GE-MRI

1 code implementation • 31 Oct 2018 • Chen Chen, Wenjia Bai, Daniel Rueckert

Segmentation of the left atrium (LA) is crucial for assessing its anatomy in both pre-operative atrial fibrillation (AF) ablation planning and post-operative follow-up studies.

Anatomy General Classification +2

Paper
Code

Efficient Deep Learning of Non-local Features for Hyperspectral Image Classification

1 code implementation • 2 Aug 2020 • Yu Shen, Sijie Zhu, Chen Chen, Qian Du, Liang Xiao, Jianyu Chen, Delu Pan

Therefore, to incorporate the long-range contextual information, a deep fully convolutional network (FCN) with an efficient non-local module, named ENL-FCN, is proposed for HSI classification.

General Classification Hyperspectral Image Classification

Paper
Code

Attention guided global enhancement and local refinement network for semantic segmentation

1 code implementation • 9 Apr 2022 • Jiangyun Li, Sen Zha, Chen Chen, Meng Ding, Tianxiang Zhang, Hong Yu

First, commonly used upsampling methods in the decoder such as interpolation and deconvolution suffer from a local receptive field, unable to encode global contexts.

Semantic Segmentation

Paper
Code

TARGET: Federated Class-Continual Learning via Exemplar-Free Distillation

1 code implementation • ICCV 2023 • Jie Zhang, Chen Chen, Weiming Zhuang, LingJuan Lv

This paper focuses on an under-explored yet important problem: Federated Class-Continual Learning (FCCL), where new classes are dynamically added in federated learning.

Continual Learning Federated Learning

Paper
Code

UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning

1 code implementation • 16 May 2023 • Heqing Zou, Meng Shen, Chen Chen, Yuchen Hu, Deepu Rajan, Eng Siong Chng

Multimodal learning aims to imitate human beings to acquire complementary information from multiple modalities for various downstream tasks.

Contrastive Learning Image-text Classification +2

Paper
Code

Pseudo-label Alignment for Semi-supervised Instance Segmentation

1 code implementation • ICCV 2023 • Jie Hu, Chen Chen, Liujuan Cao, Shengchuan Zhang, Annan Shu, Guannan Jiang, Rongrong Ji

Through extensive experiments conducted on the COCO and Cityscapes datasets, we demonstrate that PAIS is a promising framework for semi-supervised instance segmentation, particularly in cases where labeled data is severely limited.

Instance Segmentation Pseudo Label +3

Paper
Code

Hyperspectral Image Classification in the Presence of Noisy Labels

1 code implementation • 12 Sep 2018 • Junjun Jiang, Jiayi Ma, Zheng Wang, Chen Chen, Xian-Ming Liu

The key idea of RLPA is to exploit knowledge (e. g., the superpixel based spectral-spatial constraints) from the observed hyperspectral images and apply it to the process of label propagation.

Classification General Classification +1

Paper
Code

BDANet: Multiscale Convolutional Neural Network with Cross-directional Attention for Building Damage Assessment from Satellite Images

1 code implementation • 16 May 2021 • Yu Shen, Sijie Zhu, Taojiannan Yang, Chen Chen, Delu Pan, Jianyu Chen, Liang Xiao, Qian Du

With a pair of pre- and post-disaster satellite images, building damage assessment aims at predicting the extent of damage to buildings.

Ranked #2 on 2D Semantic Segmentation on xBD

2D Semantic Segmentation Data Augmentation +1

Paper
Code

Biomechanics-informed Neural Networks for Myocardial Motion Tracking in MRI

1 code implementation • 8 Jun 2020 • Chen Qin, Shuo Wang, Chen Chen, Huaqi Qiu, Wenjia Bai, Daniel Rueckert

The learnt VAE regulariser then can be coupled with any deep learning based registration network to regularise the solution space to be biomechanically plausible.

Image Registration

Paper
Code

DENSE: Data-Free One-Shot Federated Learning

1 code implementation • 23 Dec 2021 • Jie Zhang, Chen Chen, Bo Li, Lingjuan Lyu, Shuang Wu, Shouhong Ding, Chunhua Shen, Chao Wu

One-shot Federated Learning (FL) has recently emerged as a promising approach, which allows the central server to learn a model in a single communication round.

Federated Learning

Paper
Code

SPAct: Self-supervised Privacy Preservation for Action Recognition

1 code implementation • CVPR 2022 • Ishan Rajendrakumar Dave, Chen Chen, Mubarak Shah

Existing approaches for mitigating privacy leakage in action recognition require privacy labels along with the action labels from the video dataset.

Ranked #1 on Action Classification on UCF101

Action Classification Action Recognition +2

Paper
Code

Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports

1 code implementation • 3 Jan 2024 • Haopeng Li, Andong Deng, Qiuhong Ke, Jun Liu, Hossein Rahmani, Yulan Guo, Bernt Schiele, Chen Chen

Reasoning over sports videos for question answering is an important task with numerous applications, such as player training and information retrieval.

Action Understanding counterfactual +4

Paper
Code

GaitMixer: Skeleton-based Gait Representation Learning via Wide-spectrum Multi-axial Mixer

1 code implementation • 27 Oct 2022 • Ekkasit Pinyoanuntapong, Ayman Ali, Pu Wang, Minwoo Lee, Chen Chen

Most existing gait recognition methods are appearance-based, which rely on the silhouettes extracted from the video data of human walking activities.

Ranked #6 on Multiview Gait Recognition on CASIA-B

Multiview Gait Recognition Representation Learning

Paper
Code

ProSparse: Introducing and Enhancing Intrinsic Activation Sparsity within Large Language Models

1 code implementation • 21 Feb 2024 • Chenyang Song, Xu Han, Zhengyan Zhang, Shengding Hu, Xiyu Shi, Kuai Li, Chen Chen, Zhiyuan Liu, Guangli Li, Tao Yang, Maosong Sun

Some recent efforts have explored introducing ReLU or its variants as the substitutive activation function to help LLMs achieve activation sparsity and inference acceleration, but few can simultaneously obtain high sparsity and comparable model performance.

Paper
Code

MLP: Motion Label Prior for Temporal Sentence Localization in Untrimmed 3D Human Motions

1 code implementation • 21 Apr 2024 • Sheng Yan, Mengyuan Liu, Yong Wang, Yang Liu, Chen Chen, Hong Liu

In this paper, we address the unexplored question of temporal sentence localization in human motions (TSLM), aiming to locate a target moment from a 3D human motion that semantically corresponds to a text query.

Moment Retrieval Sentence

Paper
Code

IDEAL: Query-Efficient Data-Free Learning from Black-box Models

1 code implementation • 23 May 2022 • Jie Zhang, Chen Chen, Lingjuan Lyu

Knowledge Distillation (KD) is a typical method for training a lightweight student model with the help of a well-trained teacher model.

Knowledge Distillation

Paper
Code

Task-Adaptive Few-shot Node Classification

1 code implementation • 23 Jun 2022 • Song Wang, Kaize Ding, Chuxu Zhang, Chen Chen, Jundong Li

Then we transfer such knowledge to the classes with limited labeled nodes via our proposed task-adaptive modules.

Classification Few-Shot Learning +2

Paper
Code

Synthetic optical coherence tomography angiographs for detailed retinal vessel segmentation without human annotations

1 code implementation • 19 Jun 2023 • Linus Kreitner, Johannes C. Paetzold, Nikolaus Rauch, Chen Chen, Ahmed M. Hagag, Alaa E. Fayed, Sobha Sivaprasad, Sebastian Rausch, Julian Weichsel, Bjoern H. Menze, Matthias Harders, Benjamin Knier, Daniel Rueckert, Martin J. Menten

To address this issue, recent work has employed transfer learning, where a segmentation network is trained on synthetic OCTA images and is then applied to real data.

Domain Adaptation Retinal Vessel Segmentation +2

Paper
Code

M-FLAG: Medical Vision-Language Pre-training with Frozen Language Models and Latent Space Geometry Optimization

1 code implementation • 17 Jul 2023 • Che Liu, Sibo Cheng, Chen Chen, Mengyun Qiao, Weitong Zhang, Anand Shah, Wenjia Bai, Rossella Arcucci

The proposed method, named Medical vision-language pre-training with Frozen language models and Latent spAce Geometry optimization (M-FLAG), leverages a frozen language model for training stability and efficiency and introduces a novel orthogonality loss to harmonize the latent space geometry.

Image Classification Language Modelling +3

Paper
Code

Multimodal Transformer for Nursing Activity Recognition

1 code implementation • 9 Apr 2022 • Momal Ijaz, Renato Diaz, Chen Chen

Our method achieves state-of-the-art performance of 81. 8% accuracy on the benchmark dataset available for nurse activity recognition from the Nurse Care Activity Recognition Challenge.

Activity Recognition

Paper
Code

Context Label Learning: Improving Background Class Representations in Semantic Segmentation

1 code implementation • 16 Dec 2022 • Zeju Li, Konstantinos Kamnitsas, Cheng Ouyang, Chen Chen, Ben Glocker

The results demonstrate that CoLab can guide the segmentation model to map the logits of background samples away from the decision boundary, resulting in significantly improved segmentation accuracy.

Segmentation Semantic Segmentation

Paper
Code

Edit Everything: A Text-Guided Generative System for Images Editing

1 code implementation • 27 Apr 2023 • Defeng Xie, Ruichen Wang, Jian Ma, Chen Chen, Haonan Lu, Dong Yang, Fobo Shi, Xiaodong Lin

We introduce a new generative system called Edit Everything, which can take image and text inputs and produce image outputs.

Paper
Code

A Neural State-Space Model Approach to Efficient Speech Separation

1 code implementation • 26 May 2023 • Chen Chen, Chao-Han Huck Yang, Kai Li, Yuchen Hu, Pin-Jui Ku, Eng Siong Chng

In this work, we introduce S4M, a new efficient speech separation framework based on neural state-space models (SSM).

Representation Learning Speech Separation

Paper
Code

Deep Fisher Discriminant Learning for Mobile Hand Gesture Recognition

1 code implementation • 12 Jul 2017 • Chunyu Xie, Ce Li, Baochang Zhang, Chen Chen, Jungong Han

Gesture recognition is a challenging problem in the field of biometrics.

Ranked #1 on Hand Gesture Recognition on MGB

Hand Gesture Recognition Hand-Gesture Recognition

Paper
Code

Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition

1 code implementation • 16 May 2023 • Yuchen Hu, Ruizhe Li, Chen Chen, Heqing Zou, Qiushi Zhu, Eng Siong Chng

However, most existing AVSR approaches simply fuse the audio and visual features by concatenation, without explicit interactions to capture the deep correlations between them, which results in sub-optimal multimodal representations for downstream speech recognition task.

Audio-Visual Speech Recognition Automatic Speech Recognition +3

Paper
Code

First Place Solution to the CVPR'2023 AQTC Challenge: A Function-Interaction Centric Approach with Spatiotemporal Visual-Language Alignment

1 code implementation • 23 Jun 2023 • Tom Tongjia Chen, Hongshan Yu, Zhengeng Yang, Ming Li, Zechuan Li, Jingwen Wang, Wei Miao, Wei Sun, Chen Chen

Affordance-Centric Question-driven Task Completion (AQTC) has been proposed to acquire knowledge from videos to furnish users with comprehensive and systematic instructions.

Human-Object Interaction Detection

Paper
Code

LAMP: Leveraging Language Prompts for Multi-person Pose Estimation

1 code implementation • 21 Jul 2023 • Shengnan Hu, Ce Zheng, Zixiang Zhou, Chen Chen, Gita Sukthankar

Human-centric visual understanding is an important desideratum for effective human-robot interaction.

Language Modelling Multi-Person Pose Estimation +1

Paper
Code

Federated Few-shot Learning

1 code implementation • 17 Jun 2023 • Song Wang, Xingbo Fu, Kaize Ding, Chen Chen, Huiyuan Chen, Jundong Li

In this way, the server can exploit the computational power of all clients and train the model on a larger set of data samples among all clients.

Federated Learning Few-Shot Learning

Paper
Code

An Analysis of Adversarial Attacks and Defenses on Autonomous Driving Models

1 code implementation • 6 Feb 2020 • Yao Deng, Xi Zheng, Tianyi Zhang, Chen Chen, Guannan Lou, Miryung Kim

We derive several implications for system and middleware builders: (1) when adding a defense component against adversarial attacks, it is important to deploy multiple defense methods in tandem to achieve a good coverage of various attacks, (2) a blackbox attack is much less effective compared with a white-box attack, implying that it is important to keep model details (e. g., model architecture, hyperparameters) confidential via model obfuscation, and (3) driving models with a complex architecture are preferred if computing resources permit as they are more resilient to adversarial attacks than simple models.

Autonomous Driving

Paper
Code

RoPGen: Towards Robust Code Authorship Attribution via Automatic Coding Style Transformation

1 code implementation • 12 Feb 2022 • Zhen Li, Guenevere, Chen, Chen Chen, Yayi Zou, Shouhuai Xu

Recent studies show that current source code authorship attribution methods can be compromised by attackers exploiting adversarial examples and coding style manipulation.

Authorship Attribution Bug fixing +1

Paper
Code

Part Aware Contrastive Learning for Self-Supervised Action Recognition

1 code implementation • 1 May 2023 • Yilei Hua, Wenhan Wu, Ce Zheng, Aidong Lu, Mengyuan Liu, Chen Chen, Shiqian Wu

This paper proposes an attention-based contrastive learning framework for skeleton representation learning, called SkeAttnCLR, which integrates local similarity and global features for skeleton-based action representations.

Contrastive Learning Data Augmentation +3

Paper
Code

MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition

1 code implementation • 18 Jun 2023 • Yuchen Hu, Chen Chen, Ruizhe Li, Heqing Zou, Eng Siong Chng

In this paper, we aim to learn the shared representations across modalities to bridge their gap.

Audio-Visual Speech Recognition Representation Learning +3

Paper
Code

PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation

1 code implementation • 28 Nov 2023 • Jian Ma, Chen Chen, Qingsong Xie, Haonan Lu

In this paper, we are inspired to propose a simple plug-and-play language transfer method based on knowledge distillation.

Cross-lingual Text-to-Image Generation Knowledge Distillation +1

Paper
Code

Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning

1 code implementation • NeurIPS 2023 • Jianzhun Shao, Yun Qu, Chen Chen, Hongchang Zhang, Xiangyang Ji

Offline multi-agent reinforcement learning is challenging due to the coupling effect of both distribution shift issue common in offline setting and the high dimension issue common in multi-agent setting, making the action out-of-distribution (OOD) and value overestimation phenomenon excessively severe.

counterfactual Multi-agent Reinforcement Learning +3

Paper
Code

Robust Image Segmentation Using Contour-Guided Color Palettes

1 code implementation • ICCV 2015 • Xiang Fu, Chien-Yi Wang, Chen Chen, Changhu Wang, C. -C. Jay Kuo

The contour-guided color palette (CCP) is proposed for robust image segmentation.

Image Segmentation Segmentation +1

Paper
Code

Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition

1 code implementation • 18 Jun 2023 • Yuchen Hu, Ruizhe Li, Chen Chen, Chengwei Qin, Qiushi Zhu, Eng Siong Chng

In this work, we investigate the noise-invariant visual modality to strengthen robustness of AVSR, which can adapt to any testing noises while without dependence on noisy training data, a. k. a., unsupervised noise adaptation.

Audio-Visual Speech Recognition speech-recognition +1

Paper
Code

GeoSegNet: Point Cloud Semantic Segmentation via Geometric Encoder-Decoder Modeling

1 code implementation • 14 Jul 2022 • Chen Chen, Yisen Wang, Honghua Chen, Xuefeng Yan, Dayong Ren, Yanwen Guo, Haoran Xie, Fu Lee Wang, Mingqiang Wei

Semantic segmentation of point clouds, aiming to assign each point a semantic category, is critical to 3D scene understanding. Despite of significant advances in recent years, most of existing methods still suffer from either the object-level misclassification or the boundary-level ambiguity.

Object Segmentation +1

Paper
Code

Estimating Model Performance under Domain Shifts with Class-Specific Confidence Scores

1 code implementation • 20 Jul 2022 • Zeju Li, Konstantinos Kamnitsas, Mobarakol Islam, Chen Chen, Ben Glocker

If we could estimate the performance that a pre-trained model would achieve on data from a specific deployment setting, for example a certain clinic, we could judge whether the model could safely be deployed or if its performance degrades unacceptably on the specific data.

Image Segmentation Semantic Segmentation

Paper
Code

Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition

1 code implementation • 22 Feb 2023 • Yuchen Hu, Chen Chen, Ruizhe Li, Qiushi Zhu, Eng Siong Chng

In this paper, we propose a simple yet effective approach called gradient remedy (GR) to solve interference between task gradients in noise-robust speech recognition, from perspectives of both angle and magnitude.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Paper
Code

FreMIM: Fourier Transform Meets Masked Image Modeling for Medical Image Segmentation

1 code implementation • 21 Apr 2023 • Wenxuan Wang, Jing Wang, Chen Chen, Jianbo Jiao, Yuanxiu Cai, Shanshan Song, Jiangyun Li

The research community has witnessed the powerful potential of self-supervised Masked Image Modeling (MIM), which enables the models capable of learning visual representation from unlabeled data.

Image Segmentation Medical Image Segmentation +2

Paper
Code

A Global Benchmark of Algorithms for Segmenting Late Gadolinium-Enhanced Cardiac Magnetic Resonance Imaging

1 code implementation • 26 Apr 2020 • Zhaohan Xiong, Qing Xia, Zhiqiang Hu, Ning Huang, Cheng Bian, Yefeng Zheng, Sulaiman Vesal, Nishant Ravikumar, Andreas Maier, Xin Yang, Pheng-Ann Heng, Dong Ni, Caizi Li, Qianqian Tong, Weixin Si, Elodie Puybareau, Younes Khoudli, Thierry Geraud, Chen Chen, Wenjia Bai, Daniel Rueckert, Lingchao Xu, Xiahai Zhuang, Xinzhe Luo, Shuman Jia, Maxime Sermesant, Yashu Liu, Kuanquan Wang, Davide Borra, Alessandro Masci, Cristiana Corsi, Coen de Vente, Mitko Veta, Rashed Karim, Chandrakanth Jayachandran Preetha, Sandy Engelhardt, Menyun Qiao, Yuanyuan Wang, Qian Tao, Marta Nunez-Garcia, Oscar Camara, Nicolo Savioli, Pablo Lamata, Jichao Zhao

Segmentation of cardiac images, particularly late gadolinium-enhanced magnetic resonance imaging (LGE-MRI) widely used for visualizing diseased cardiac structures, is a crucial first step for clinical diagnosis and treatment.

Benchmarking Left Atrium Segmentation +1

Paper
Code

Magic ELF: Image Deraining Meets Association Learning and Transformer

1 code implementation • 21 Jul 2022 • Kui Jiang, Zhongyuan Wang, Chen Chen, Zheng Wang, Laizhong Cui, Chia-Wen Lin

Convolutional neural network (CNN) and Transformer have achieved great success in multimedia applications.

Rain Removal

Paper
Code

GaitSADA: Self-Aligned Domain Adaptation for mmWave Gait Recognition

1 code implementation • 31 Jan 2023 • Ekkasit Pinyoanuntapong, Ayman Ali, Kalvik Jakkala, Pu Wang, Minwoo Lee, Qucheng Peng, Chen Chen, Zhi Sun

mmWave radar-based gait recognition is a novel user identification method that captures human gait biometrics from mmWave radar return signals.

Contrastive Learning Domain Adaptation +1

Paper
Code

Byzantine-Robust Learning on Heterogeneous Data via Gradient Splitting

1 code implementation • 13 Feb 2023 • Yuchen Liu, Chen Chen, Lingjuan Lyu, Fangzhao Wu, Sai Wu, Gang Chen

In order to address this issue, we propose GAS, a \shorten approach that can successfully adapt existing robust AGRs to non-IID settings.

Federated Learning

Paper
Code

Generative Myocardial Motion Tracking via Latent Space Exploration with Biomechanics-informed Prior

1 code implementation • 8 Jun 2022 • Chen Qin, Shuo Wang, Chen Chen, Wenjia Bai, Daniel Rueckert

In contrast to most existing approaches which impose explicit generic regularization such as smoothness, in this work we propose a novel method that can implicitly learn an application-specific biomechanics-informed prior and embed it into a neural network-parameterized transformation model.

Image Registration

Paper
Code

FedPerfix: Towards Partial Model Personalization of Vision Transformers in Federated Learning

1 code implementation • ICCV 2023 • Guangyu Sun, Matias Mendieta, Jun Luo, Shandong Wu, Chen Chen

Personalized Federated Learning (PFL) represents a promising solution for decentralized learning in heterogeneous data environments.

Personalized Federated Learning

Paper
Code

Regress Before Construct: Regress Autoencoder for Point Cloud Self-supervised Learning

1 code implementation • 25 Sep 2023 • Yang Liu, Chen Chen, Can Wang, Xulin King, Mengyuan Liu

The proposed method decouples functions between the decoder and the encoder by introducing a mask regressor, which predicts the masked patch representation from the visible patch representation encoded by the encoder and the decoder reconstructs the target from the predicted masked patch representation.

Ranked #3 on Few-Shot 3D Point Cloud Classification on ModelNet40 10-way (20-shot) (using extra training data)

Few-Shot 3D Point Cloud Classification Representation Learning +1

Paper
Code

FAITH: Few-Shot Graph Classification with Hierarchical Task Graphs

1 code implementation • 5 May 2022 • Song Wang, Yushun Dong, Xiao Huang, Chen Chen, Jundong Li

Specifically, these works propose to accumulate meta-knowledge across diverse meta-training tasks, and then generalize such meta-knowledge to the target task with a disjoint label set.

Few-Shot Learning Graph Classification

Paper
Code

ConPET: Continual Parameter-Efficient Tuning for Large Language Models

1 code implementation • 26 Sep 2023 • Chenyang Song, Xu Han, Zheni Zeng, Kuai Li, Chen Chen, Zhiyuan Liu, Maosong Sun, Tao Yang

First, Static ConPET can adapt former continual learning methods originally designed for relatively smaller models to LLMs through PET and a dynamic replay strategy, which largely reduces the tuning costs and alleviates the over-fitting and forgetting issue.

Continual Learning

Paper
Code

Adversarial Attacks on Fairness of Graph Neural Networks

1 code implementation • 20 Oct 2023 • Binchi Zhang, Yushun Dong, Chen Chen, Yada Zhu, Minnan Luo, Jundong Li

Fairness-aware graph neural networks (GNNs) have gained a surge of attention as they can reduce the bias of predictions on any demographic group (e. g., female) in graph-based applications.

Fairness

Paper
Code

A Coarse-to-Fine Indoor Layout Estimation (CFILE) Method

1 code implementation • 3 Jul 2016 • Yuzhuo Ren, Chen Chen, Shang-Wen Li, C. -C. Jay Kuo

The task of estimating the spatial layout of cluttered indoor scenes from a single RGB image is addressed in this work.

Paper
Code

Improving the generalizability of convolutional neural network-based segmentation on CMR images

1 code implementation • 2 Jul 2019 • Chen Chen, Wenjia Bai, Rhodri H. Davies, Anish N. Bhuva, Charlotte Manisty, James C. Moon, Nay Aung, Aaron M. Lee, Mihir M. Sanghvi, Kenneth Fung, Jose Miguel Paiva, Steffen E. Petersen, Elena Lukaschuk, Stefan K. Piechnik, Stefan Neubauer, Daniel Rueckert

We demonstrate that a neural network trained on a single-site single-scanner dataset from the UK Biobank can be successfully applied to segmenting cardiac MR images across different sites and different scanners without substantial loss of accuracy.

Image Segmentation Segmentation +1

Paper
Code

Revisiting Training-free NAS Metrics: An Efficient Training-based Method

1 code implementation • 16 Nov 2022 • Taojiannan Yang, Linjie Yang, Xiaojie Jin, Chen Chen

In this paper, we revisit these training-free metrics and find that: (1) the number of parameters (\#Param), which is the most straightforward training-free metric, is overlooked in previous works but is surprisingly effective, (2) recent training-free metrics largely rely on the \#Param information to rank networks.

Neural Architecture Search

Paper
Code

PGFed: Personalize Each Client's Global Objective for Federated Learning

1 code implementation • ICCV 2023 • Jun Luo, Matias Mendieta, Chen Chen, Shandong Wu

Based on our observation, in this work, we propose Personalized Global Federated Learning (PGFed), a novel personalized FL framework that enables each client to personalize its own global objective by explicitly and adaptively aggregating the empirical risks of itself and other clients.

Personalized Federated Learning Transfer Learning

Paper
Code

Plan To Predict: Learning an Uncertainty-Foreseeing Model for Model-Based Reinforcement Learning

1 code implementation • 20 Jan 2023 • Zifan Wu, Chao Yu, Chen Chen, Jianye Hao, Hankz Hankui Zhuo

In Model-based Reinforcement Learning (MBRL), model learning is critical since an inaccurate model can bias policy learning via generating misleading samples.

Decision Making Model-based Reinforcement Learning

Paper
Code

SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for Text-driven Video Editing

1 code implementation • 30 May 2023 • Nazmul Karim, Umar Khalid, Mohsen Joneidi, Chen Chen, Nazanin Rahnavard

Text-to-Image (T2I) diffusion models have achieved remarkable success in synthesizing high-quality images conditioned on text prompts.

Style Transfer Video Editing

Paper
Code

Source-free Domain Adaptive Human Pose Estimation

1 code implementation • ICCV 2023 • Qucheng Peng, Ce Zheng, Chen Chen

To this end, we propose a new task, named source-free domain adaptive HPE, which aims to address the challenges of cross-domain learning of HPE without access to source data during the adaptation process.

Contrastive Learning Domain Adaptation +1

Paper
Code

Robust and Scalable Model Editing for Large Language Models

1 code implementation • 26 Mar 2024 • Yingfa Chen, Zhengyan Zhang, Xu Han, Chaojun Xiao, Zhiyuan Liu, Chen Chen, Kuai Li, Tao Yang, Maosong Sun

Large language models (LLMs) can make predictions using parametric knowledge--knowledge encoded in the model weights--or contextual knowledge--knowledge presented in the context.

Model Editing

Paper
Code

Tube Convolutional Neural Network (T-CNN) for Action Detection in Videos

1 code implementation • ICCV 2017 • Rui Hou, Chen Chen, Mubarak Shah

A video is first divided into equal length clips and for each clip a set of tube proposals are generated next based on 3D Convolutional Network (ConvNet) features.

Ranked #1 on Action Detection on UCF Sports

Action Detection Image Classification +2

Paper
Code

Category Guided Attention Network for Brain Tumor Segmentation in MRI

1 code implementation • 29 Mar 2022 • Jiangyun Li, Hong Yu, Chen Chen, Meng Ding, Sen Zha

In this model, we design a Supervised Attention Module (SAM) based on the attention mechanism, which can capture more accurate and stable long-range dependency in feature maps without introducing much computational cost.

Brain Tumor Segmentation Segmentation +1

Paper
Code

DIAGNOSIS: Detecting Unauthorized Data Usages in Text-to-image Diffusion Models

1 code implementation • 6 Jul 2023 • Zhenting Wang, Chen Chen, Lingjuan Lyu, Dimitris N. Metaxas, Shiqing Ma

To address this issue, we propose a method for detecting such unauthorized data usage by planting the injected memorization into the text-to-image diffusion models trained on the protected dataset.

Memorization

Paper
Code

A Dual-Augmentor Framework for Domain Generalization in 3D Human Pose Estimation

1 code implementation • 17 Mar 2024 • Qucheng Peng, Ce Zheng, Chen Chen

Furthermore, the pose estimator's optimization is not exposed to domain shifts, limiting its overall generalization ability.

Ranked #2 on Weakly-supervised 3D Human Pose Estimation on Human3.6M

Domain Generalization Weakly-supervised 3D Human Pose Estimation

Paper
Code

Refined Semantic Enhancement towards Frequency Diffusion for Video Captioning

1 code implementation • 28 Nov 2022 • Xian Zhong, Zipeng Li, Shuqin Chen, Kui Jiang, Chen Chen, Mang Ye

In this paper, we introduce a novel Refined Semantic enhancement method towards Frequency Diffusion (RSFD), a captioning model that constantly perceives the linguistic representation of the infrequent tokens.

FAD Video Captioning

Paper
Code

Reconciling Object-Level and Global-Level Objectives for Long-Tail Detection

1 code implementation • ICCV 2023 • Shaoyu Zhang, Chen Chen, Silong Peng

Specifically, complementary to the object-level classification loss for model discrimination, we design a generalized average precision (GAP) loss to explicitly optimize the global-level score ranking across different objects.

Multi-Task Learning Object

Paper
Code

Progressive Bilateral-Context Driven Model for Post-Processing Person Re-Identification

1 code implementation • 7 Sep 2020 • Min Cao, Chen Chen, Hao Dou, Xiyuan Hu, Silong Peng, Arjan Kuijper

Most existing person re-identification methods compute pairwise similarity by extracting robust visual features and learning the discriminative metric.

Large-Scale Person Re-Identification

Paper
Code

Embedding Gradient-based Optimization in Image Registration Networks

1 code implementation • 7 Dec 2021 • Huaqi Qiu, Kerstin Hammernik, Chen Qin, Chen Chen, Daniel Rueckert

Deep learning (DL) image registration methods amortize the costly pair-wise iterative optimization by training deep neural networks to predict the optimal transformation in one fast forward-pass.

Image Reconstruction Image Registration

Paper
Code

FeatER: An Efficient Network for Human Reconstruction via Feature Map-Based TransformER

1 code implementation • CVPR 2023 • Ce Zheng, Matias Mendieta, Taojiannan Yang, Guo-Jun Qi, Chen Chen

Recently, vision transformers have shown great success in a set of human reconstruction tasks such as 2D human pose estimation (2D HPE), 3D human pose estimation (3D HPE), and human mesh reconstruction (HMR) tasks.

Ranked #29 on 3D Human Pose Estimation on 3DPW

2D Human Pose Estimation 3D Human Pose Estimation

Paper
Code

Graph Few-shot Learning with Task-specific Structures

1 code implementation • 21 Oct 2022 • Song Wang, Chen Chen, Jundong Li

Therefore, to adaptively learn node representations across meta-tasks, we propose a novel framework that learns a task-specific structure for each meta-task.

Classification Few-Shot Learning +2

Paper
Code

Dynamic Ensemble of Low-fidelity Experts: Mitigating NAS "Cold-Start"

1 code implementation • 2 Feb 2023 • Junbo Zhao, Xuefei Ning, Enshu Liu, Binxin Ru, Zixuan Zhou, Tianchen Zhao, Chen Chen, Jiajin Zhang, Qingmin Liao, Yu Wang

In the first step, we train different sub-predictors on different types of available low-fidelity information to extract beneficial knowledge as low-fidelity experts.

Neural Architecture Search

Paper
Code

Combating Unknown Bias with Effective Bias-Conflicting Scoring and Gradient Alignment

1 code implementation • 25 Nov 2021 • Bowen Zhao, Chen Chen, Qian-Wei Wang, Anfeng He, Shu-Tao Xia

For challenge B, we point out that the gradient contribution statistics can be a reliable indicator to inspect whether the optimization is dominated by bias-aligned samples.

Fairness

Paper
Code

CalFAT: Calibrated Federated Adversarial Training with Label Skewness

1 code implementation • 30 May 2022 • Chen Chen, Yuchen Liu, Xingjun Ma, Lingjuan Lyu

In this paper, we study the problem of FAT under label skewness, and reveal one root cause of the training instability and natural accuracy degradation issues: skewed labels lead to non-identical class probabilities and heterogeneous local models.

Adversarial Robustness Federated Learning

Paper
Code

TimeBalance: Temporally-Invariant and Temporally-Distinctive Video Representations for Semi-Supervised Action Recognition

1 code implementation • CVPR 2023 • Ishan Rajendrakumar Dave, Mamshad Nayeem Rizve, Chen Chen, Mubarak Shah

We observe that these representations complement each other depending on the nature of the action.

Action Recognition Optical Flow Estimation +1

Paper
Code

GCNext: Towards the Unity of Graph Convolutions for Human Motion Prediction

1 code implementation • 19 Dec 2023 • Xinshun Wang, Qiongjie Cui, Chen Chen, Mengyuan Liu

The past few years has witnessed the dominance of Graph Convolutional Networks (GCNs) over human motion prediction. Various styles of graph convolutions have been proposed, with each one meticulously designed and incorporated into a carefully-crafted network architecture.

Human motion prediction motion prediction +1

Paper
Code

Towards Improved Proxy-based Deep Metric Learning via Data-Augmented Domain Adaptation

1 code implementation • 1 Jan 2024 • Li Ren, Chen Chen, Liqiang Wang, Kien Hua

Our experiments on benchmarks, including the popular CUB-200-2011, CARS196, Stanford Online Products, and In-Shop Clothes Retrieval, show that our learning algorithm significantly improves the existing proxy losses and achieves superior results compared to the existing methods.

Domain Adaptation Metric Learning +1

Paper
Code

CodaMal: Contrastive Domain Adaptation for Malaria Detection in Low-Cost Microscopes

1 code implementation • 16 Feb 2024 • Ishan Rajendrakumar Dave, Tristan de Blegiers, Chen Chen, Mubarak Shah

Annotating images from LCM significantly increases the burden on medical experts compared to annotating images from high-cost microscopes (HCM).

Domain Adaptation object-detection +1

Paper
Code

Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction

1 code implementation • 3 Mar 2021 • Hongyao Tang, Jianye Hao, Guangyong Chen, Pengfei Chen, Chen Chen, Yaodong Yang, Luo Zhang, Wulong Liu, Zhaopeng Meng

Value function is the central notion of Reinforcement Learning (RL).

Continuous Control Future prediction +2

Paper
Code

Nonnegative Tensor Completion via Integer Optimization

1 code implementation • 8 Nov 2021 • Caleb Bugg, Chen Chen, Anil Aswani

Unlike matrix completion, tensor completion does not have an algorithm that is known to achieve the information-theoretic sample complexity rate.

Matrix Completion

Paper
Code

Sampo: Unsupervised Knowledge Base Construction for Opinions and Implications

1 code implementation • AKBC 2020 • Nikita Bhutani, Aaron Traylor, Chen Chen, Xiaolan Wang, Behzad Golshan, Wang-Chiew Tan

Since it can be expensive to obtain training data to learn to extract implications for each new domain of reviews, we propose an unsupervised KBC system, Sampo, Specifically, Sampo is tailored to build KBs for domains where many reviews on the same domain are available.

Paper
Code

Accelerated Nonnegative Tensor Completion via Integer Programming

1 code implementation • 28 Nov 2022 • Wenhao Pan, Anil Aswani, Chen Chen

A recent approach, based on integer programming, resolves this tension for nonnegative tensor completion.

Paper
Code

Few-shot Node Classification with Extremely Weak Supervision

1 code implementation • 6 Jan 2023 • Song Wang, Yushun Dong, Kaize Ding, Chen Chen, Jundong Li

Recent few-shot node classification methods typically learn from classes with abundant labeled nodes (i. e., meta-training classes) and then generalize to classes with limited labeled nodes (i. e., meta-test classes).

Classification Meta-Learning +1

Paper
Code

Learning Semantic Proxies from Visual Prompts for Parameter-Efficient Fine-Tuning in Deep Metric Learning

1 code implementation • 4 Feb 2024 • Li Ren, Chen Chen, Liqiang Wang, Kien Hua

As a result of the success of recent pre-trained models trained from larger-scale datasets, it is challenging to adapt the model to the DML tasks in the local data domain while retaining the previously gained knowledge.

Ranked #2 on Image Retrieval on iNaturalist

Image Retrieval Metric Learning

Paper
Code

Improved post-hoc probability calibration for out-of-domain MRI segmentation

1 code implementation • 4 Aug 2022 • Cheng Ouyang, Shuo Wang, Chen Chen, Zeju Li, Wenjia Bai, Bernhard Kainz, Daniel Rueckert

In image segmentation, well-calibrated probabilities allow radiologists to identify regions where model-predicted segmentations are unreliable.

Image Segmentation MRI segmentation +2

Paper
Code

Getting the Most from Eye-Tracking: User-Interaction Based Reading Region Estimation Dataset and Models

1 code implementation • 12 Jun 2023 • Ruoyan Kong, Ruixuan Sun, Charles Chuankai Zhang, Chen Chen, Sneha Patri, Gayathri Gajjela, Joseph A. Konstan

A single digital newsletter usually contains many messages (regions).

Paper
Code

Semi-Dynamic Load Balancing: Efficient Distributed Learning in Non-Dedicated Environments

no code implementations • 7 Jun 2018 • Chen Chen, Qizhen Weng, Wei Wang, Baochun Li, Bo Li

Efficient model training requires eliminating such stragglers, yet for modern ML workloads, existing load balancing strategies are inefficient and even infeasible.

Paper
Add Code

Key Person Aided Re-identification in Partially Ordered Pedestrian Set

no code implementations • 25 May 2018 • Chen Chen, Min Cao, Xiyuan Hu, Silong Peng

Ideally person re-identification seeks for perfect feature representation and metric model that re-identify all various pedestrians well in non-overlapping views at different locations with different camera configurations, which is very challenging.

Person Re-Identification

Paper
Add Code

One-Two-One Networks for Compression Artifacts Reduction in Remote Sensing

no code implementations • 1 Apr 2018 • Baochang Zhang, Jiaxin Gu, Chen Chen, Jungong Han, Xiangbo Su, Xian-Bin Cao, Jianzhuang Liu

Compression artifacts reduction (CAR) is a challenging problem in the field of remote sensing.

Blocking Image Compression +1

Paper
Add Code

Gabor Convolutional Networks

no code implementations • 3 May 2017 • Shangzhen Luan, Baochang Zhang, Chen Chen, Xian-Bin Cao, Jungong Han, Jianzhuang Liu

Steerable properties dominate the design of traditional filters, e. g., Gabor filters, and endow features the capability of dealing with spatial transformations.

Paper
Add Code

Robust 3D Action Recognition through Sampling Local Appearances and Global Distributions

no code implementations • 4 Dec 2017 • Mengyuan Liu, Hong Liu, Chen Chen

Then, motion and shape cues are jointly used to generate robust and distinctive spatial-temporal interest points (STIPs): motion-based STIPs and shape-based STIPs.

3D Action Recognition

Paper
Add Code

An End-to-end 3D Convolutional Neural Network for Action Detection and Segmentation in Videos

no code implementations • 30 Nov 2017 • Rui Hou, Chen Chen, Mubarak Shah

A video is first divided into equal length clips and next for each clip a set of tube proposals are generated based on 3D CNN features.

Action Detection Action Segmentation +4

Paper
Add Code

Latent Constrained Correlation Filter

no code implementations • 11 Nov 2017 • Baochang Zhang, Shangzhen Luan, Chen Chen, Jungong Han, Wei Wang, Alessandro Perina, Ling Shao

In this paper, we introduce an intermediate step -- solution sampling -- after the data sampling step to form a subspace, in which an optimal solution can be estimated.

Object Recognition Object Tracking

Paper
Add Code

Block building programming for symbolic regression

no code implementations • 22 May 2017 • Chen Chen, Changtong Luo, Zonglin Jiang

Symbolic regression that aims to detect underlying data-driven models has become increasingly important for industrial data analysis.

Computational Efficiency regression +1

Paper
Add Code

Multi-modal Aggregation for Video Classification

no code implementations • 27 Oct 2017 • Chen Chen, Xiaowei Zhao, Yang Liu

In this paper, we present a solution to Large-Scale Video Classification Challenge (LSVC2017) [1] that ranked the 1st place.

Classification General Classification +1

Paper
Add Code

Manifold Constrained Low-Rank Decomposition

no code implementations • 6 Aug 2017 • Chen Chen, Baochang Zhang, Alessio Del Bue, Vittorio Murino

Low-rank decomposition (LRD) is a state-of-the-art method for visual data reconstruction and modelling.

Paper
Add Code

Fast Modeling Methods for Complex System with Separable Features

no code implementations • 5 Aug 2017 • Chen Chen, Changtong Luo, Zonglin Jiang

In this paper, we analyze different types of separability of some real-world engineering equations and establish a mathematical model of generalized separable system (GS system).

Paper
Add Code

A divide and conquer method for symbolic regression

no code implementations • 23 May 2017 • Changtong Luo, Chen Chen, Zonglin Jiang

This feature motivated us to develop a new method, divide and conquer (D&C), for symbolic regression, in which the target function is divided into a number of sub-functions and the sub-functions are then determined by any of a GP algorithm.

regression Symbolic Regression

Paper
Add Code

Latent Constrained Correlation Filters for Object Localization

no code implementations • 7 Jun 2016 • Shangzhen Luan, Baochang Zhang, Jungong Han, Chen Chen, Ling Shao, Alessandro Perina, Linlin Shen

There is a neglected fact in the traditional machine learning methods that the data sampling can actually lead to the solution sampling.

Object Object Localization

Paper
Add Code

Elite Bases Regression: A Real-time Algorithm for Symbolic Regression

no code implementations • 24 Apr 2017 • Chen Chen, Changtong Luo, Zonglin Jiang

In this paper, a new non-evolutionary real-time algorithm for symbolic regression, Elite Bases Regression (EBR), is proposed.

regression Symbolic Regression

Paper
Add Code

Deep Spatio-temporal Manifold Network for Action Recognition

no code implementations • 9 May 2017 • Ce Li, Chen Chen, Baochang Zhang, Qixiang Ye, Jungong Han, Rongrong Ji

Visual data such as videos are often sampled from complex manifold.

Action Recognition Temporal Action Localization

Paper
Add Code

Measuring and Predicting Tag Importance for Image Retrieval

no code implementations • 28 Feb 2016 • Shang-Wen Li, Sanjay Purushotham, Chen Chen, Yuzhuo Ren, C. -C. Jay Kuo

Textual data such as tags, sentence descriptions are combined with visual cues to reduce the semantic gap for image retrieval applications in today's Multimodal Image Retrieval (MIR) systems.

Image Retrieval Retrieval +2

Paper
Add Code

Output Constraint Transfer for Kernelized Correlation Filter in Tracking

no code implementations • 16 Dec 2016 • Baochang Zhang, Zhigang Li, Xian-Bin Cao, Qixiang Ye, Chen Chen, Linlin Shen, Alessandro Perina, Rongrong Ji

Kernelized Correlation Filter (KCF) is one of the state-of-the-art object trackers.

Bayesian Optimization

Paper
Add Code

GAL: A Global-Attributes Assisted Labeling System for Outdoor Scenes

no code implementations • 3 Apr 2016 • Yuzhuo Ren, Chen Chen, Shang-Wen Li, C. -C. Jay Kuo

The proposed Global-attributes Assisted Labeling (GAL) system exploits both local features and global attributes.

Paper
Add Code

Fast Iteratively Reweighted Least Squares Algorithms for Analysis-Based Sparsity Reconstruction

no code implementations • 18 Nov 2014 • Chen Chen, Junzhou Huang, Lei He, Hongsheng Li

The convergence rate of the proposed algorithm is almost the same as that of the traditional IRLS algorithms, that is, exponentially fast.

Compressive Sensing

Paper
Add Code

SIRF: Simultaneous Image Registration and Fusion in A Unified Framework

no code implementations • 18 Nov 2014 • Chen Chen, Yeqing Li, Wei Liu, Junzhou Huang

In this paper, we propose a novel method for image fusion with a high-resolution panchromatic image and a low-resolution multispectral image at the same geographical location.

Image Registration

Paper
Add Code

Forest Sparsity for Multi-channel Compressive Sensing

no code implementations • 20 Nov 2012 • Chen Chen, Yeqing Li, Junzhou Huang

In this paper, we investigate a new compressive sensing model for multi-channel sparse data where each channel can be represented as a hierarchical tree and different channels are highly correlated.

Compressive Sensing

Paper
Add Code

Rain Removal By Image Quasi-Sparsity Priors

no code implementations • 20 Dec 2018 • Yinglong Wang, Shuaicheng Liu, Chen Chen, Dehua Xie, Bing Zeng

We present a novel rain removal method in this paper, which consists of two steps, i. e., detection of rain streaks and reconstruction of the rain-removed image.

Rain Removal

Paper
Add Code

Chinese Zero Pronoun Resolution with Deep Neural Networks

no code implementations • ACL 2016 • Chen Chen, Vincent Ng

Chinese Zero Pronoun Resolution Feature Engineering +1

Paper
Add Code

Compressive Sensing MRI with Wavelet Tree Sparsity

no code implementations • NeurIPS 2012 • Chen Chen, Junzhou Huang

On the other side, some algorithms have been proposed for tree sparsity regularization, but few of them has validated the benefit of tree structure in CS-MRI.

Compressive Sensing

Paper
Add Code

Dense Semantic and Topological Correspondence of 3D Faces without Landmarks

no code implementations • ECCV 2018 • Zhenfeng Fan, Xiyuan Hu, Chen Chen, Silong Peng

The dense correspondence goal is revisited in two perspectives: semantic and topological correspondence.

Semantic correspondence

Paper
Add Code

Preconditioning for Accelerated Iteratively Reweighted Least Squares in Structured Sparsity Reconstruction

no code implementations • CVPR 2014 • Chen Chen, Junzhou Huang, Lei He, Hongsheng Li

In this paper, we propose a novel algorithm for structured sparsity reconstruction.

Compressive Sensing

Paper
Add Code

Image Fusion with Local Spectral Consistency and Dynamic Gradient Sparsity

no code implementations • CVPR 2014 • Chen Chen, Yeqing Li, Wei Liu, Junzhou Huang

In this paper, we propose a novel method for image fusion from a high resolution panchromatic image and a low resolution multispectral image at the same geographical location.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.