Search Results for author: Xin Li

Found 368 papers, 157 papers with code

Knowledge Distillation for Detection Transformer with Consistent Distillation Points Sampling

2 code implementations • 15 Nov 2022 • Yu Wang, Xin Li, Shengzhao Wen, Fukui Yang, Wanping Zhang, Gang Zhang, Haocheng Feng, Junyu Han, Errui Ding

In this paper, we focus on the compression of DETR with knowledge distillation.

General Knowledge Knowledge Distillation

12,041

Vector-quantized Image Modeling with Improved VQGAN

5 code implementations • ICLR 2022 • Jiahui Yu, Xin Li, Jing Yu Koh, Han Zhang, Ruoming Pang, James Qin, Alexander Ku, Yuanzhong Xu, Jason Baldridge, Yonghui Wu

Motivated by this success, we explore a Vector-quantized Image Modeling (VIM) approach that involves pretraining a Transformer to predict rasterized image tokens autoregressively.

Image Generation Representation Learning +1

10,817

Local Patch AutoAugment with Multi-Agent Collaboration

2 code implementations • 20 Mar 2021 • Shiqi Lin, Tao Yu, Ruoyu Feng, Xin Li, Xin Jin, Zhibo Chen

We formulate it as a multi-agent reinforcement learning (MARL) problem, where each agent learns an augmentation policy for each patch based on its content together with the semantics of the whole image.

Data Augmentation Fine-Grained Image Recognition +2

9,366

Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

2 code implementations • 28 Nov 2023 • Sicong Leng, Hang Zhang, Guanzheng Chen, Xin Li, Shijian Lu, Chunyan Miao, Lidong Bing

Large Vision-Language Models (LVLMs) have advanced considerably, intertwining visual recognition and language understanding to generate content that is not only coherent but also contextually attuned.

Hallucination Object

8,890

Drafting and Revision: Laplacian Pyramid Network for Fast High-Quality Artistic Style Transfer

2 code implementations • CVPR 2021 • Tianwei Lin, Zhuoqi Ma, Fu Li, Dongliang He, Xin Li, Errui Ding, Nannan Wang, Jie Li, Xinbo Gao

Inspired by the common painting process of drawing a draft and revising the details, we introduce a novel feed-forward method named Laplacian Pyramid Network (LapStyle).

Style Transfer

7,669

Deep Concept-wise Temporal Convolutional Networks for Action Localization

2 code implementations • 26 Aug 2019 • Xin Li, Tianwei Lin, Xiao Liu, Chuang Gan, WangMeng Zuo, Chao Li, Xiang Long, Dongliang He, Fu Li, Shilei Wen

In this paper, we empirically find that stacking more conventional temporal convolution layers actually deteriorates action classification performance, possibly because all channels of the 1D feature map, which are generally highly abstract and can be regarded as latent concepts, are excessively recombined in temporal convolution.

Action Classification Action Localization

6,866

BMN: Boundary-Matching Network for Temporal Action Proposal Generation

15 code implementations • ICCV 2019 • Tianwei Lin, Xiao Liu, Xin Li, Errui Ding, Shilei Wen

To address these difficulties, we introduce the Boundary-Matching (BM) mechanism to evaluate confidence scores of densely distributed proposals, which denote a proposal as a matching pair of starting and ending boundaries and combine all densely distributed BM pairs into the BM confidence map.

Ranked #1 on Action Recognition on THUMOS’14

Action Detection Action Recognition +1

6,703

Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

1 code implementation • 5 Jun 2023 • Hang Zhang, Xin Li, Lidong Bing

We present Video-LLaMA, a multi-modal framework that empowers Large Language Models (LLMs) with the capability of understanding both visual and auditory content in the video.

Ranked #7 on Video Question Answering on MVBench

Language Modelling Text Generation +7

2,407

AIM 2022 Challenge on Super-Resolution of Compressed Image and Video: Dataset, Methods and Results

3 code implementations • 23 Aug 2022 • Ren Yang, Radu Timofte, Qi Zhang, Lin Zhang, Fanglong Liu, Dongliang He, Fu Li, He Zheng, Weihang Yuan, Pavel Ostyakov, Dmitry Vyal, Magauiya Zhussip, Xueyi Zou, Youliang Yan, Lei LI, Jingzhu Tang, Ming Chen, Shijie Zhao, Yu Zhu, Xiaoran Qin, Chenghua Li, Cong Leng, Jian Cheng, Claudio Rota, Marco Buzzelli, Simone Bianco, Raimondo Schettini, Dafeng Zhang, Feiyu Huang, Shizhuo Liu, Xiaobing Wang, Zhezhu Jin, Bingchen Li, Xin Li, Mingxi Li, Ding Liu, Wenbin Zou, Peijie Dong, Tian Ye, Yunchen Zhang, Ming Tan, Xin Niu, Mustafa Ayazoglu, Marcos Conde, Ui-Jin Choi, Zhuang Jia, Tianyu Xu, Yijian Zhang, Mao Ye, Dengyan Luo, Xiaofeng Pan, Liuhan Peng

The homepage of this challenge is at https://github.com/RenYang-home/AIM22_CompressSR.

Super-Resolution

525

Scaling Autoregressive Models for Content-Rich Text-to-Image Generation

2 code implementations • 22 Jun 2022 • Jiahui Yu, Yuanzhong Xu, Jing Yu Koh, Thang Luong, Gunjan Baid, ZiRui Wang, Vijay Vasudevan, Alexander Ku, Yinfei Yang, Burcu Karagol Ayan, Ben Hutchinson, Wei Han, Zarana Parekh, Xin Li, Han Zhang, Jason Baldridge, Yonghui Wu

We present the Pathways Autoregressive Text-to-Image (Parti) model, which generates high-fidelity photorealistic images and supports content-rich synthesis involving complex compositions and world knowledge.

Ranked #1 on Text-to-Image Generation on LAION COCO

Machine Translation Text-to-Image Generation +1

506

Paint Transformer: Feed Forward Neural Painting with Stroke Prediction

2 code implementations • ICCV 2021 • Songhua Liu, Tianwei Lin, Dongliang He, Fu Li, Ruifeng Deng, Xin Li, Errui Ding, Hao Wang

Neural painting refers to the procedure of producing a series of strokes for a given image and non-photo-realistically recreating it using neural networks.

Ranked #1 on Object Detection on A2D

Object Detection Reinforcement Learning (RL) +1

478

Diffusion Models for Image Restoration and Enhancement -- A Comprehensive Survey

1 code implementation • 18 Aug 2023 • Xin Li, Yulin Ren, Xin Jin, Cuiling Lan, Xingrui Wang, Wenjun Zeng, Xinchao Wang, Zhibo Chen

Image restoration (IR) has been an indispensable and challenging task in the low-level vision field, which strives to improve the subjective quality of images distorted by various forms of degradation.

Deblurring Image Restoration +2

436

Exploiting BERT for End-to-End Aspect-based Sentiment Analysis

1 code implementation • WS 2019 • Xin Li, Lidong Bing, Wenxuan Zhang, Wai Lam

In this paper, we investigate the modeling power of contextualized embeddings from pre-trained language models, e.g., BERT, on the E2E-ABSA task.

Ranked #5 on Aspect-Based Sentiment Analysis (ABSA) on SemEval 2014 Task 4 Laptop

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +1

385

Drive Like a Human: Rethinking Autonomous Driving with Large Language Models

1 code implementation • 14 Jul 2023 • Daocheng Fu, Xin Li, Licheng Wen, Min Dou, Pinlong Cai, Botian Shi, Yu Qiao

In this paper, we explore the potential of using a large language model (LLM) to understand the driving environment in a human-like manner and analyze its ability to reason, interpret, and memorize when facing complex scenarios.

Autonomous Driving Common Sense Reasoning +3

312

DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models

2 code implementations • 28 Sep 2023 • Licheng Wen, Daocheng Fu, Xin Li, Xinyu Cai, Tao Ma, Pinlong Cai, Min Dou, Botian Shi, Liang He, Yu Qiao

Recent advancements in autonomous driving have relied on data-driven approaches, which are widely adopted but face challenges including dataset bias, overfitting, and uninterpretability.

Autonomous Driving Common Sense Reasoning +1

312

Towards Knowledge-driven Autonomous Driving

1 code implementation • 7 Dec 2023 • Xin Li, Yeqi Bai, Pinlong Cai, Licheng Wen, Daocheng Fu, Bo Zhang, Xuemeng Yang, Xinyu Cai, Tao Ma, Jianfei Guo, Xing Gao, Min Dou, Yikang Li, Botian Shi, Yong Liu, Liang He, Yu Qiao

This paper explores the emerging knowledge-driven autonomous driving technologies.

Autonomous Driving Neural Rendering

304

UniSeg: A Unified Multi-Modal LiDAR Segmentation Network and the OpenPCSeg Codebase

1 code implementation • ICCV 2023 • Youquan Liu, Runnan Chen, Xin Li, Lingdong Kong, Yuchen Yang, Zhaoyang Xia, Yeqi Bai, Xinge Zhu, Yuexin Ma, Yikang Li, Yu Qiao, Yuenan Hou

Besides, we construct the OpenPCSeg codebase, which is the largest and most comprehensive outdoor LiDAR segmentation codebase.

Ranked #2 on 3D Semantic Segmentation on SemanticKITTI (using extra training data)

3D Semantic Segmentation LIDAR Semantic Segmentation +2

296

DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds

1 code implementation • ICCV 2023 • Tao Ma, Xuemeng Yang, Hongbin Zhou, Xin Li, Botian Shi, Junjie Liu, Yuchen Yang, Zhizheng Liu, Liang He, Yu Qiao, Yikang Li, Hongsheng Li

Extensive experiments on Waymo Open Dataset show our DetZero outperforms all state-of-the-art onboard and offboard 3D detection methods.

3D Object Detection Object +1

273

Robo3D: Towards Robust and Reliable 3D Perception against Corruptions

1 code implementation • ICCV 2023 • Lingdong Kong, Youquan Liu, Xin Li, Runnan Chen, Wenwei Zhang, Jiawei Ren, Liang Pan, Kai Chen, Ziwei Liu

The robustness of 3D perception systems under natural corruptions from environments and sensors is pivotal for safety-critical applications.

Robust 3D Object Detection Robust 3D Semantic Segmentation

272

A Unified Model for Opinion Target Extraction and Target Sentiment Prediction

1 code implementation • 13 Nov 2018 • Xin Li, Lidong Bing, Piji Li, Wai Lam

Target-based sentiment analysis involves opinion target extraction and target sentiment classification.

Ranked #8 on Aspect-Based Sentiment Analysis (ABSA) on SemEval 2014 Task 4 Laptop

Aspect-Based Sentiment Analysis (ABSA) Sentiment Classification

268

On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving

1 code implementation • 9 Nov 2023 • Licheng Wen, Xuemeng Yang, Daocheng Fu, XiaoFeng Wang, Pinlong Cai, Xin Li, Tao Ma, Yingxuan Li, Linran Xu, Dengke Shang, Zheng Zhu, Shaoyan Sun, Yeqi Bai, Xinyu Cai, Min Dou, Shuanglu Hu, Botian Shi, Yu Qiao

This has been a significant bottleneck, particularly in the development of common sense reasoning and nuanced scene understanding necessary for safe and reliable autonomous driving.

Autonomous Driving Common Sense Reasoning +4

263

Virtual Sparse Convolution for Multimodal 3D Object Detection

1 code implementation • CVPR 2023 • Hai Wu, Chenglu Wen, Shaoshuai Shi, Xin Li, Cheng Wang

Finally, we develop a semi-supervised pipeline VirConv-S based on a pseudo-label framework.

3D Object Detection Depth Completion +3

236

AdaAttN: Revisit Attention Mechanism in Arbitrary Neural Style Transfer

3 code implementations • ICCV 2021 • Songhua Liu, Tianwei Lin, Dongliang He, Fu Li, Meiling Wang, Xin Li, Zhengxing Sun, Qian Li, Errui Ding

Finally, the content feature is normalized so that they demonstrate the same local feature statistics as the calculated per-point weighted style feature statistics.

Style Transfer Video Style Transfer

197

GRIP++: Enhanced Graph-based Interaction-aware Trajectory Prediction for Autonomous Driving

5 code implementations • arXiv preprint 2020 • Xin Li, Xiaowen Ying, Mooi Choo Chuah

Despite the advancement in the technology of autonomous driving cars, the safety of a self-driving car is still a challenging problem that has not been well studied.

Autonomous Driving motion prediction +1

155

GRIP: Graph-based Interaction-aware Trajectory Prediction

1 code implementation • IEEE Intelligent Transportation Systems Conference (ITSC) 2019 • Xin Li, Xiaowen Ying, Mooi Choo Chuah

The prediction error of GRIP is one meter shorter than existing schemes.

Autonomous Driving motion prediction +1

155

Multi-View Partial (MVP) Point Cloud Challenge 2021 on Completion and Registration: Methods and Results

2 code implementations • 22 Dec 2021 • Liang Pan, Tong Wu, Zhongang Cai, Ziwei Liu, Xumin Yu, Yongming Rao, Jiwen Lu, Jie Zhou, Mingye Xu, Xiaoyuan Luo, Kexue Fu, Peng Gao, Manning Wang, Yali Wang, Yu Qiao, Junsheng Zhou, Xin Wen, Peng Xiang, Yu-Shen Liu, Zhizhong Han, Yuanjie Yan, Junyi An, Lifa Zhu, Changwei Lin, Dongrui Liu, Xin Li, Francisco Gómez-Fernández, Qinlong Wang, Yang Yang

Based on the MVP dataset, this paper reports methods and results in the Multi-View Partial Point Cloud Challenge 2021 on Completion and Registration.

3D Reconstruction Point Cloud Completion +2

153

SMILEtrack: SiMIlarity LEarning for Occlusion-Aware Multiple Object Tracking

2 code implementations • 16 Nov 2022 • Yu-Hsiang Wang, Jun-Wei Hsieh, Ping-Yang Chen, Ming-Ching Chang, Hung Hin So, Xin Li

Second, we develop a Similarity Matching Cascade (SMC) module with a novel GATE function for robust object matching across consecutive video frames, further enhancing MOT performance.

Ranked #1 on Multi-Object Tracking on MOT20 (using extra training data)

Multi-Object Tracking Multiple Object Tracking +3

152

Partial Order Pruning: for Best Speed/Accuracy Trade-off in Neural Architecture Search

2 code implementations • CVPR 2019 • Xin Li, Yiming Zhou, Zheng Pan, Jiashi Feng

It prunes the architecture search space with a partial order assumption to automatically search for the architectures with the best speed and accuracy trade-off.

Neural Architecture Search

149

Transformation Networks for Target-Oriented Sentiment Classification

2 code implementations • ACL 2018 • Xin Li, Lidong Bing, Wai Lam, Bei Shi

Between the two layers, we propose a component to generate target-specific representations of words in the sentence, meanwhile incorporate a mechanism for preserving the original contextual information from the RNN layer.

Ranked #19 on Aspect-Based Sentiment Analysis (ABSA) on SemEval-2014 Task-4 (Laptop (Acc) metric)

Aspect-Based Sentiment Analysis (ABSA) Classification +3

141

scCDCG: Efficient Deep Structural Clustering for single-cell RNA-seq via Deep Cut-informed Graph Embedding

2 code implementations • 9 Apr 2024 • Ping Xu, Zhiyuan Ning, Meng Xiao, Guihai Feng, Xin Li, Yuanchun Zhou, Pengfei Wang

Addressing these limitations, we introduce scCDCG (single-cell RNA-seq Clustering via Deep Cut-informed Graph), a novel framework designed for efficient and accurate clustering of scRNA-seq data that simultaneously utilizes intercellular high-order structural information.

Clustering Dimensionality Reduction +2

140

RF-Net: An End-to-End Image Matching Network based on Receptive Field

1 code implementation • CVPR 2019 • Xuelun Shen, Cheng Wang, Xin Li, Zenglei Yu, Jonathan Li, Chenglu Wen, Ming Cheng, Zijian He

This paper proposes a new end-to-end trainable matching network based on receptive field, RF-Net, to compute sparse correspondence between images.

Keypoint Detection

128

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

2 code implementations • 11 May 2022 • Yawei Li, Kai Zhang, Radu Timofte, Luc van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang, Peiran Ren, Xuansong Xie, Xian-Sheng Hua, Yanbo Wang, Xiaozhong Ji, Chuming Lin, Donghao Luo, Ying Tai, Chengjie Wang, Zhizhong Zhang, Yuan Xie, Shen Cheng, Ziwei Luo, Lei Yu, Zhihong Wen, Qi Wu, Youwei Li, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Yuanfei Huang, Meiguang Jin, Hua Huang, Jing Liu, Xinjian Zhang, Yan Wang, Lingshun Long, Gen Li, Yuanfan Zhang, Zuowei Cao, Lei Sun, Panaetov Alexander, Yucong Wang, Minjie Cai, Li Wang, Lu Tian, Zheyuan Wang, Hongbing Ma, Jie Liu, Chao Chen, Yidong Cai, Jie Tang, Gangshan Wu, Weiran Wang, Shirui Huang, Honglei Lu, Huan Liu, Keyan Wang, Jun Chen, Shi Chen, Yuchun Miao, Zimo Huang, Lefei Zhang, Mustafa Ayazoğlu, Wei Xiong, Chengyi Xiong, Fei Wang, Hao Li, Ruimian Wen, Zhijing Yang, Wenbin Zou, Weixin Zheng, Tian Ye, Yuncheng Zhang, Xiangzhen Kong, Aditya Arora, Syed Waqas Zamir, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Dandan Gao, Dengwen Zhou, Qian Ning, Jingzhu Tang, Han Huang, YuFei Wang, Zhangheng Peng, Haobo Li, Wenxue Guan, Shenghua Gong, Xin Li, Jun Liu, Wanjun Wang, Dengwen Zhou, Kun Zeng, Hanjiang Lin, Xinyu Chen, Jinsheng Fang

The aim was to design a network for single image super-resolution that achieved improvement of efficiency measured according to several metrics including runtime, parameters, FLOPs, activations, and memory consumption while at least maintaining the PSNR of 29.00 dB on DIV2K validation set.

Image Super-Resolution

117

LSOTB-TIR: A Large-Scale High-Diversity Thermal Infrared Object Tracking Benchmark

1 code implementation • 3 Aug 2020 • Qiao Liu, Xin Li, Zhenyu He, Chenglong Li, Jun Li, Zikun Zhou, Di Yuan, Jing Li, Kai Yang, Nana Fan, Feng Zheng

We evaluate and analyze more than 30 trackers on LSOTB-TIR to provide a series of baselines, and the results show that deep trackers achieve promising performance.

Thermal Infrared Object Tracking Vocal Bursts Intensity Prediction

113

nnMobileNet: Rethinking CNN for Retinopathy Research

2 code implementations • 2 Jun 2023 • Wenhui Zhu, Peijie Qiu, Xiwen Chen, Xin Li, Natasha Lepore, Oana M. Dumitrascu, Yalin Wang

Over the past few decades, convolutional neural networks (CNNs) have been at the forefront of the detection and tracking of various retinal diseases (RD).

Diabetic Retinopathy Grading

110

SeaLLMs -- Large Language Models for Southeast Asia

1 code implementation • 1 Dec 2023 • Xuan-Phi Nguyen, Wenxuan Zhang, Xin Li, Mahani Aljunied, Qingyu Tan, Liying Cheng, Guanzheng Chen, Yue Deng, Sen Yang, Chaoqun Liu, Hang Zhang, Lidong Bing

Despite the remarkable achievements of large language models (LLMs) in various tasks, there remains a linguistic bias that favors high-resource languages, such as English, often at the expense of low-resource and regional languages.

Instruction Following

102

VisEvent: Reliable Object Tracking via Collaboration of Frame and Event Flows

2 code implementations • 11 Aug 2021 • Xiao Wang, Jianing Li, Lin Zhu, Zhipeng Zhang, Zhe Chen, Xin Li, YaoWei Wang, Yonghong Tian, Feng Wu

Different from visible cameras which record intensity images frame by frame, the biologically inspired event camera produces a stream of asynchronous and sparse events with much lower latency.

Ranked #1 on Object Tracking on VisEvent

Object Tracking

101

Micron-BERT: BERT-based Facial Micro-Expression Recognition

1 code implementation • CVPR 2023 • Xuan-Bac Nguyen, Chi Nhan Duong, Xin Li, Susan Gauch, Han-Seok Seo, Khoa Luu

By incorporating these components into an end-to-end deep network, the proposed $\mu$-BERT significantly outperforms all previous work in various micro-expression tasks.

Ranked #1 on Micro Expression Recognition on SMIC

Micro Expression Recognition Micro-Expression Recognition +1

100

A Survey on Aspect-Based Sentiment Analysis: Tasks, Methods, and Challenges

1 code implementation • 2 Mar 2022 • Wenxuan Zhang, Xin Li, Yang Deng, Lidong Bing, Wai Lam

More specifically, we provide a new taxonomy for ABSA which organizes existing studies from the axes of concerned sentiment elements, with an emphasis on recent advances of compound ABSA tasks.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA)

Video Object Segmentation with Adaptive Feature Bank and Uncertain-Region Refinement

1 code implementation • NeurIPS 2020 • Yongqing Liang, Xin Li, Navid Jafari, Qin Chen

We also design a new confidence loss and a fine-grained segmentation module to enhance the segmentation accuracy in uncertain regions.

Ranked #2 on Semi-Supervised Video Object Segmentation on Long Video Dataset (3X) (using extra training data)

Segmentation Semantic Segmentation +2

Towards Generative Aspect-Based Sentiment Analysis

1 code implementation • ACL 2021 • Wenxuan Zhang, Xin Li, Yang Deng, Lidong Bing, Wai Lam

Aspect-based sentiment analysis (ABSA) has received increasing attention recently.

Ranked #4 on Aspect Sentiment Triplet Extraction on ASTE-Data-V2

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +2

SCPNet: Semantic Scene Completion on Point Cloud

1 code implementation • CVPR 2023 • Zhaoyang Xia, Youquan Liu, Xin Li, Xinge Zhu, Yuexin Ma, Yikang Li, Yuenan Hou, Yu Qiao

We propose a simple yet effective label rectification strategy, which uses off-the-shelf panoptic segmentation labels to remove the traces of dynamic objects in completion labels, greatly improving the performance of deep models especially for those moving objects.

Ranked #1 on 3D Semantic Scene Completion on SemanticKITTI

3D Semantic Scene Completion Knowledge Distillation +3

Aspect Sentiment Quad Prediction as Paraphrase Generation

1 code implementation • EMNLP 2021 • Wenxuan Zhang, Yang Deng, Xin Li, Yifei Yuan, Lidong Bing, Wai Lam

Aspect-based sentiment analysis (ABSA) has been extensively studied in recent years, which typically involves four fundamental sentiment elements, including the aspect category, aspect term, opinion term, and sentiment polarity.

Ranked #3 on Aspect-Based Sentiment Analysis (ABSA) on TASD

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +2

NTIRE 2021 Challenge on Quality Enhancement of Compressed Video: Methods and Results

1 code implementation • 21 Apr 2021 • Ren Yang, Radu Timofte, Jing Liu, Yi Xu, Xinjian Zhang, Minyi Zhao, Shuigeng Zhou, Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy, Xin Li, Fanglong Liu, He Zheng, Lielin Jiang, Qi Zhang, Dongliang He, Fu Li, Qingqing Dang, Yibin Huang, Matteo Maggioni, Zhongqian Fu, Shuai Xiao, Cheng Li, Thomas Tanay, Fenglong Song, Wentao Chao, Qiang Guo, Yan Liu, Jiang Li, Xiaochao Qu, Dewang Hou, Jiayu Yang, Lyn Jiang, Di You, Zhenyu Zhang, Chong Mou, Iaroslav Koshelev, Pavel Ostyakov, Andrey Somov, Jia Hao, Xueyi Zou, Shijie Zhao, Xiaopeng Sun, Yiting Liao, Yuanzhi Zhang, Qing Wang, Gen Zhan, Mengxi Guo, Junlin Li, Ming Lu, Zhan Ma, Pablo Navarrete Michelini, Hai Wang, Yiyun Chen, Jingyu Guo, Liliang Zhang, Wenming Yang, Sijung Kim, Syehoon Oh, Yucong Wang, Minjie Cai, Wei Hao, Kangdi Shi, Liangyan Li, Jun Chen, Wei Gao, Wang Liu, XiaoYu Zhang, Linjie Zhou, Sixin Lin, Ru Wang

This paper reviews the first NTIRE challenge on quality enhancement of compressed video, with a focus on the proposed methods and results.

PTB-TIR: A Thermal Infrared Pedestrian Tracking Benchmark

1 code implementation • 18 Jan 2018 • Qiao Liu, Zhenyu He, Xin Li, Yuan Zheng

The ability to evaluate the TIR pedestrian tracker fairly, on a benchmark dataset, is significant for the development of this field.

Attribute Thermal Infrared Object Tracking

Transferable End-to-End Aspect-based Sentiment Analysis with Selective Adversarial Learning

1 code implementation • IJCNLP 2019 • Zheng Li, Xin Li, Ying WEI, Lidong Bing, Yu Zhang, Qiang Yang

Joint extraction of aspects and sentiments can be effectively formulated as a sequence labeling problem.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +1

CLEX: Continuous Length Extrapolation for Large Language Models

1 code implementation • 25 Oct 2023 • Guanzheng Chen, Xin Li, Zaiqiao Meng, Shangsong Liang, Lidong Bing

We generalise the PE scaling approaches to model the continuous dynamics by ordinary differential equations over the length scaling factor, thereby overcoming the constraints of current PE scaling methods designed for specific lengths.

4k Position

Deep Models with Fusion Strategies for MVP Point Cloud Registration

1 code implementation • 18 Oct 2021 • Lifa Zhu, Changwei Lin, Dongrui Liu, Xin Li, Francisco Gómez-Fernández

The main goal of point cloud registration in Multi-View Partial (MVP) Challenge 2021 is to estimate a rigid transformation to align a point cloud pair.

Point Cloud Registration

Constraining Depth Map Geometry for Multi-View Stereo: A Dual-Depth Approach with Saddle-shaped Depth Cells

1 code implementation • ICCV 2023 • Xinyi Ye, Weiyue Zhao, Tianqi Liu, Zihao Huang, Zhiguo Cao, Xin Li

Learning-based multi-view stereo (MVS) methods deal with predicting accurate depth maps to achieve an accurate and complete 3D representation.

Depth Estimation Depth Prediction

GraphAdapter: Tuning Vision-Language Models With Dual Knowledge Graph

1 code implementation • NeurIPS 2023 • Xin Li, Dongze Lian, Zhihe Lu, Jiawang Bai, Zhibo Chen, Xinchao Wang

To mitigate that, we propose an effective adapter-style tuning strategy, dubbed GraphAdapter, which performs the textual adapter by explicitly modeling the dual-modality structure knowledge (i.e., the correlation of different semantics/classes in textual and visual modalities) with a dual knowledge graph.

Transfer Learning

Aspect Term Extraction with History Attention and Selective Transformation

1 code implementation • 2 May 2018 • Xin Li, Lidong Bing, Piji Li, Wai Lam, Zhimou Yang

Aspect Term Extraction (ATE), a key sub-task in Aspect-Based Sentiment Analysis, aims to extract explicit aspect expressions from online user reviews.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +2

Neural Color Operators for Sequential Image Retouching

2 code implementations • 17 Jul 2022 • Yili Wang, Xin Li, Kun Xu, Dongliang He, Qi Zhang, Fu Li, Errui Ding

The neural color operator mimics the behavior of traditional color operators and learns pixelwise color transformation while its strength is controlled by a scalar.

Image Enhancement Image Retouching

Image Inpainting by End-to-End Cascaded Refinement with Mask Awareness

1 code implementation • 28 Apr 2021 • Manyu Zhu, Dongliang He, Xin Li, Chao Li, Fu Li, Xiao Liu, Errui Ding, Zhaoxiang Zhang

Inpainting arbitrary missing regions is challenging because learning valid features for various masked regions is nontrivial.

Ranked #4 on Image Inpainting on CelebA-HQ

Image Inpainting valid

Learning Semantic Person Image Generation by Region-Adaptive Normalization

1 code implementation • CVPR 2021 • Zhengyao Lv, Xiaoming Li, Xin Li, Fu Li, Tianwei Lin, Dongliang He, WangMeng Zuo

In the first stage, we predict the target semantic parsing maps to eliminate the difficulties of pose transfer and further benefit the latter translation of per-region appearance style.

Pose Transfer Semantic Parsing +1

JigsawNet: Shredded Image Reassembly using Convolutional Neural Network and Loop-based Composition

3 code implementations • 11 Sep 2018 • Canyu Le, Xin Li

Existing reassembly pipelines commonly consist of a local matching stage and a global compositions stage.

Pyramid Mask Text Detector

1 code implementation • 28 Mar 2019 • Jingchao Liu, Xuebo Liu, Jie Sheng, Ding Liang, Xin Li, Qingjie Liu

Scene text detection, an essential step of scene text recognition system, is to locate text instances in natural scene images automatically.

Ranked #1 on Scene Text Detection on ICDAR 2017 MLT

Clustering Instance Segmentation +4

NM-Net: Mining Reliable Neighbors for Robust Feature Correspondences

1 code implementation • CVPR 2019 • Chen Zhao, Zhiguo Cao, Chi Li, Xin Li, Jiaqi Yang

Feature correspondence selection is pivotal to many feature-matching based tasks in computer vision.

Image-to-Image Translation with Deep Reinforcement Learning

1 code implementation • 24 Sep 2023 • Xin Wang, Ziwei Luo, Jing Hu, Chengming Feng, Shu Hu, Bin Zhu, Xi Wu, Xin Li, Siwei Lyu

The key feature in the RL-I2IT framework is to decompose a monolithic learning process into small steps with a lightweight model to progressively transform a source image successively to a target image.

Auxiliary Learning Decision Making +3

Exploiting Coarse-to-Fine Task Transfer for Aspect-level Sentiment Classification

1 code implementation • AAAI 2019 • Zheng Li, Ying WEI, Yu Zhang, Xiang Zhang, Xin Li, Qiang Yang

Aspect-level sentiment classification (ASC) aims at identifying sentiment polarities towards aspects in a sentence, where the aspect can behave as a general Aspect Category (AC) or a specific Aspect Term (AT).

Ranked #19 on Aspect-Based Sentiment Analysis (ABSA) on SemEval-2014 Task-4

General Classification Sentence +2

MGeo: Multi-Modal Geographic Pre-Training Method

1 code implementation • 11 Jan 2023 • Ruixue Ding, Boli Chen, Pengjun Xie, Fei Huang, Xin Li, Qiang Zhang, Yao Xu

Single-modal PTMs can barely make use of the important GC and therefore have limited performance.

Language Modelling

Cascade Graph Neural Networks for RGB-D Salient Object Detection

1 code implementation • ECCV 2020 • Ao Luo, Xin Li, Fan Yang, Zhicheng Jiao, Hong Cheng, Siwei Lyu

Current works either simply distill prior knowledge from the corresponding depth map for handling the RGB-image or blindly fuse color and geometric information to generate the coarse depth-aware representations, hindering the performance of RGB-D saliency detectors. In this work, we introduce Cascade Graph Neural Networks (Cas-Gnn), a unified framework which is capable of comprehensively distilling and reasoning the mutual benefits between these two data sources through a set of cascade graphs, to learn powerful representations for RGB-D salient object detection.

Ranked #5 on RGB-D Salient Object Detection on NJU2K

Object object-detection +3

Mutual Graph Learning for Camouflaged Object Detection

1 code implementation • CVPR 2021 • Qiang Zhai, Xin Li, Fan Yang, Chenglizhao Chen, Hong Cheng, Deng-Ping Fan

Automatically detecting/segmenting object(s) that blend in with their surroundings is difficult for current models.

Graph Learning Object +2

MELM: Data Augmentation with Masked Entity Language Modeling for Low-Resource NER

1 code implementation • ACL 2022 • Ran Zhou, Xin Li, Ruidan He, Lidong Bing, Erik Cambria, Luo Si, Chunyan Miao

Data augmentation is an effective solution to data scarcity in low-resource scenarios.

Cross-Lingual NER Data Augmentation +4

Learning Distortion Invariant Representation for Image Restoration from A Causality Perspective

2 code implementations • CVPR 2023 • Xin Li, Bingchen Li, Xin Jin, Cuiling Lan, Zhibo Chen

In this paper, we are the first to propose a novel training strategy for image restoration from the causality perspective, to improve the generalization ability of DNNs for unknown degradations.

counterfactual Image Restoration +2

MixNet: Toward Accurate Detection of Challenging Scene Text in the Wild

1 code implementation • 23 Aug 2023 • Yu-Xiang Zeng, Jun-Wei Hsieh, Xin Li, Ming-Ching Chang

Detecting small scene text instances in the wild is particularly challenging, where the influence of irregular positions and nonideal lighting often leads to detection errors.

Ranked #1 on Scene Text Detection on SCUT-CTW1500

Scene Text Detection Text Detection

SeD: Semantic-Aware Discriminator for Image Super-Resolution

1 code implementation • 29 Feb 2024 • Bingchen Li, Xin Li, Hanxin Zhu, Yeying Jin, Ruoyu Feng, Zhizheng Zhang, Zhibo Chen

In particular, one discriminator is utilized to enable the SR network to learn the distribution of real-world high-quality images in an adversarial training manner.

Image Super-Resolution

Saliency-Associated Object Tracking

1 code implementation • ICCV 2021 • Zikun Zhou, Wenjie Pei, Xin Li, Hongpeng Wang, Feng Zheng, Zhenyu He

A potential limitation of such trackers is that not all patches are equally informative for tracking.

Object Object Tracking

Learning Optical Flow with Adaptive Graph Reasoning

1 code implementation • 8 Feb 2022 • Ao Luo, Fan Yang, Kunming Luo, Xin Li, Haoqiang Fan, Shuaicheng Liu

Our key idea is to decouple the context reasoning from the matching procedure, and exploit scene information to effectively assist motion estimation by learning to reason over the adaptive graph.

Motion Estimation Optical Flow Estimation +1

Learning Optical Flow With Kernel Patch Attention

1 code implementation • CVPR 2022 • Ao Luo, Fan Yang, Xin Li, Shuaicheng Liu

Optical flow is a fundamental method used for quantitative motion estimation on the image plane.

Motion Estimation Optical Flow Estimation

Multi-Task Driven Feature Models for Thermal Infrared Tracking

1 code implementation • 26 Nov 2019 • Qiao Liu, Xin Li, Zhenyu He, Nana Fan, Di Yuan, Wei Liu, Yonsheng Liang

These two feature models are learned using a multi-task matching framework and are jointly optimized on the TIR tracking task.

Thermal Infrared Object Tracking

GAFlow: Incorporating Gaussian Attention into Optical Flow

1 code implementation • ICCV 2023 • Ao Luo, Fan Yang, Xin Li, Lang Nie, Chunyu Lin, Haoqiang Fan, Shuaicheng Liu

Moreover, for reliable motion analysis, we provide a new Gaussian-Guided Attention Module (GGAM) which not only inherits properties from Gaussian distribution to instinctively revolve around the neighbor fields of each point but also is empowered to put the emphasis on contextually related regions during matching.

Optical Flow Estimation Representation Learning

Detecting Multimedia Generated by Large AI Models: A Survey

1 code implementation • 22 Jan 2024 • Li Lin, Neeraj Gupta, Yue Zhang, Hainan Ren, Chun-Hao Liu, Feng Ding, Xin Wang, Xin Li, Luisa Verdoliva, Shu Hu

The rapid advancement of Large AI Models (LAIMs), particularly diffusion models and large language models, has marked a new era where AI-generated multimedia is increasingly integrated into various aspects of daily life.

LMR: A Large-Scale Multi-Reference Dataset for Reference-based Super-Resolution

1 code implementation • ICCV 2023 • Lin Zhang, Xin Li, Dongliang He, Errui Ding, Zhaoxiang Zhang

To this end, we construct a large-scale, multi-reference super-resolution dataset, named LMR.

feature selection Image Super-Resolution +1

Horizontally Fused Training Array: An Effective Hardware Utilization Squeezer for Training Novel Deep Learning Models

2 code implementations • 3 Feb 2021 • Shang Wang, Peiming Yang, Yuxuan Zheng, Xin Li, Gennady Pekhimenko

Driven by the tremendous effort in researching novel deep learning (DL) algorithms, the training cost of developing new models has increased staggeringly in recent years.

DDGCN: A Dynamic Directed Graph Convolutional Network for Action Recognition

1 code implementation • ECCV 2020 • Matthew Korban, Xin Li

We propose a Dynamic Directed Graph Convolutional Network (DDGCN) to model spatial and temporal features of human actions from their skeletal representations.

Action Recognition

CiteTracker: Correlating Image and Text for Visual Tracking

1 code implementation • ICCV 2023 • Xin Li, Yuqing Huang, Zhenyu He, YaoWei Wang, Huchuan Lu, Ming-Hsuan Yang

Existing visual tracking methods typically take an image patch as the reference of the target to perform tracking.

Attribute Descriptive +2

Zero-1-to-3: Domain-level Zero-shot Cognitive Diagnosis via One Batch of Early-bird Students towards Three Diagnostic Objectives

2 code implementations • 20 Dec 2023 • Weibo Gao, Qi Liu, Hao Wang, Linan Yue, Haoyang Bi, Yin Gu, Fangzhou Yao, Zheng Zhang, Xin Li, Yuanjing He

Consequently, we refine the cognitive states of cold-start students as diagnostic outcomes via virtual data, aligning with the diagnosis-oriented goal.

cognitive diagnosis Domain Adaptation +1

Uncertainty-Guided Transformer Reasoning for Camouflaged Object Detection

1 code implementation • ICCV 2021 • Fan Yang, Qiang Zhai, Xin Li, Rui Huang, Ao Luo, Hong Cheng, Deng-Ping Fan

Spotting objects that are visually adapted to their surroundings is challenging for both humans and AI.

Object object-detection +2

KVQ: Kwai Video Quality Assessment for Short-form Videos

1 code implementation • 11 Feb 2024 • Yiting Lu, Xin Li, Yajing Pei, Kun Yuan, Qizhi Xie, Yunpeng Qu, Ming Sun, Chao Zhou, Zhibo Chen

Short-form UGC video platforms, like Kwai and TikTok, have become an emerging and irreplaceable mainstream media form, thriving on user-friendly engagement and kaleidoscopic creation.

Video Quality Assessment Visual Question Answering (VQA)

NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods and Results

1 code implementation • 17 Apr 2024 • Xin Li, Kun Yuan, Yajing Pei, Yiting Lu, Ming Sun, Chao Zhou, Zhibo Chen, Radu Timofte, Wei Sun, HaoNing Wu, ZiCheng Zhang, Jun Jia, Zhichao Zhang, Linhan Cao, Qiubo Chen, Xiongkuo Min, Weisi Lin, Guangtao Zhai, Jianhui Sun, Tianyi Wang, Lei LI, Han Kong, Wenxuan Wang, Bing Li, Cheng Luo, Haiqiang Wang, Xiangguang Chen, Wenhui Meng, Xiang Pan, Huiying Shi, Han Zhu, Xiaozhong Xu, Lei Sun, Zhenzhong Chen, Shan Liu, Fangyuan Kong, Haotian Fan, Yifang Xu, Haoran Xu, Mengduo Yang, Jie zhou, Jiaze Li, Shijie Wen, Mai Xu, Da Li, Shunyu Yao, Jiazhi Du, WangMeng Zuo, Zhibo Li, Shuai He, Anlong Ming, Huiyuan Fu, Huadong Ma, Yong Wu, Fie Xue, Guozhi Zhao, Lina Du, Jie Guo, Yu Zhang, huimin zheng, JunHao Chen, Yue Liu, Dulan Zhou, Kele Xu, Qisheng Xu, Tao Sun, Zhixiang Ding, Yuhang Hu

This paper reviews the NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment (S-UGC VQA), where various excellent solutions were submitted and evaluated on KVQ, a dataset collected from a popular short-form video platform, i.e., the Kuaishou/Kwai platform.

valid Video Quality Assessment +1

NomMer: Nominate Synergistic Context in Vision Transformer for Visual Recognition

1 code implementation • CVPR 2022 • Hao liu, Xinghua Jiang, Xin Li, Zhimin Bao, Deqiang Jiang, Bo Ren

For the sake of trade-off between efficiency and performance, a group of works merely perform SA operation within local patches, whereas the global contextual information is abandoned, which would be indispensable for visual recognition tasks.

object-detection Object Detection +1

CoIn: Contrastive Instance Feature Mining for Outdoor 3D Object Detection with Very Limited Annotations

1 code implementation • ICCV 2023 • Qiming Xia, Jinhao Deng, Chenglu Wen, Hai Wu, Shaoshuai Shi, Xin Li, Cheng Wang

Combining CoIn with an iterative training strategy, we propose a CoIn++ pipeline, which requires only 2% annotations in the KITTI dataset to achieve performance comparable to the fully supervised methods.

3D Object Detection Contrastive Learning +2

DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition

1 code implementation • CVPR 2022 • Thanh-Dat Truong, Quoc-Huy Bui, Chi Nhan Duong, Han-Seok Seo, Son Lam Phung, Xin Li, Khoa Luu

Various 3D-CNN based methods have been presented to tackle both the spatial and temporal dimensions in the task of video action recognition with competitive results.

Ranked #1 on Action Recognition on Jester (Gesture Recognition)

Action Classification Action Recognition In Videos +2

Low-Light Image Enhancement with Multi-Stage Residue Quantization and Brightness-Aware Attention

1 code implementation • ICCV 2023 • Yunlong Liu, Tao Huang, Weisheng Dong, Fangfang Wu, Xin Li, Guangming Shi

Deep learning-based LLIE methods focus on learning a mapping function between low-light images and normal-light images that outperforms conventional LLIE methods.

Low-Light Image Enhancement Quantization

Contour Knowledge Transfer for Salient Object Detection

1 code implementation • ECCV 2018 • Xin Li, Fan Yang, Hong Cheng, Wei Liu, Dinggang Shen

Our goal is to overcome this limitation by automatically converting an existing deep contour detection model into a salient object detection model without using any manual salient object masks.

Contour Detection Object +4

COVID-MobileXpert: On-Device COVID-19 Patient Triage and Follow-up using Chest X-rays

1 code implementation • 6 Apr 2020 • Xin Li, Chengyin Li, Dongxiao Zhu

We design and implement a novel three-player knowledge transfer and distillation (KTD) framework including a pre-trained attending physician (AP) network that extracts CXR imaging features from a large scale of lung disease CXR images, a fine-tuned resident fellow (RF) network that learns the essential CXR imaging features to discriminate COVID-19 from pneumonia and/or normal cases with a small amount of COVID-19 cases, and a trained lightweight medical student (MS) network to perform on-device COVID-19 patient triage and follow-up.

Computed Tomography (CT) Trajectory Prediction +1

Fast Full-frame Video Stabilization with Iterative Optimization

1 code implementation • ICCV 2023 • Weiyue Zhao, Xin Li, Zhan Peng, Xianrui Luo, Xinyi Ye, Hao Lu, Zhiguo Cao

Video stabilization refers to the problem of transforming a shaky video into a visually pleasing one.

Video Stabilization

Unsupervised Learning of Accurate Siamese Tracking

1 code implementation • CVPR 2022 • Qiuhong Shen, Lei Qiao, Jinyang Guo, Peixia Li, Xin Li, Bo Li, Weitao Feng, Weihao Gan, Wei Wu, Wanli Ouyang

As unlimited self-supervision signals can be obtained by tracking a video along a cycle in time, we investigate evolving a Siamese tracker by tracking videos forward-backward.

Visual Object Tracking

SiamCorners: Siamese Corner Networks for Visual Tracking

1 code implementation • 15 Apr 2021 • Kai Yang, Zhenyu He, Wenjie Pei, Zikun Zhou, Xin Li, Di Yuan, Haijun Zhang

By tracking a target as a pair of corners, we avoid the need to design the anchor boxes.

Region Proposal Visual Tracking

Improving Fine-grained Entity Typing with Entity Linking

1 code implementation • IJCNLP 2019 • Hongliang Dai, Donghong Du, Xin Li, Yangqiu Song

Fine-grained entity typing is a challenging problem since it usually involves a relatively large tag set and may require understanding the context of the entity mention.

Entity Linking Entity Typing +1

Face Beautification: Beyond Makeup Transfer

1 code implementation • 8 Dec 2019 • Xudong Liu, Ruizhe Wang, Chih-Fan Chen, Minglei Yin, Hao Peng, Shukhan Ng, Xin Li

Inspired by the latest advances in style-based synthesis and face beauty prediction, we propose a novel framework of face beautification.

Translation

A Chinese Corpus for Fine-grained Entity Typing

1 code implementation • LREC 2020 • Chin Lee, Hongliang Dai, Yangqiu Song, Xin Li

In this paper, we introduce a corpus for Chinese fine-grained entity typing that contains 4,800 mentions manually labeled through crowdsourcing.

Cross-Lingual Transfer Entity Typing +1

Model Attribution of Face-swap Deepfake Videos

1 code implementation • 25 Feb 2022 • Shan Jia, Xin Li, Siwei Lyu

Then we take Deepfakes model attribution as a multiclass classification task and propose a spatial and temporal attention based method to explore the differences among Deepfakes in the new dataset.

Attribute Face Swapping

Hierarchical Spatial-aware Siamese Network for Thermal Infrared Object Tracking

1 code implementation • 27 Nov 2017 • Xin Li, Qiao Liu, Nana Fan, Zhenyu He, Hongzhi Wang

In this paper, we cast the TIR tracking problem as a similarity verification task, which is coupled well to the objective of the tracking task.

General Classification Thermal Infrared Object Tracking

An Informative Tracking Benchmark

1 code implementation • 13 Dec 2021 • Xin Li, Qiao Liu, Wenjie Pei, Qiuhong Shen, YaoWei Wang, Huchuan Lu, Ming-Hsuan Yang

Along with the rapid progress of visual tracking, existing benchmarks become less informative due to redundancy of samples and weak discrimination between current trackers, making evaluations on all datasets extremely time-consuming.

Visual Tracking

Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs

1 code implementation • 16 Nov 2023 • Sen yang, Xin Li, Leyang Cui, Lidong Bing, Wai Lam

Though prompting LLMs with various reasoning structures produces reasoning proofs along with answers, these proofs are not ensured to be causal and reliable due to the inherent defects of LLMs.

GSM8K

MHSA-Net: Multi-Head Self-Attention Network for Occluded Person Re-Identification

1 code implementation • 10 Aug 2020 • Hongchen Tan, Xiuping Liu, BaoCai Yin, Xin Li

This paper presents a novel person re-identification model, named Multi-Head Self-Attention Network (MHSA-Net), to prune unimportant information and capture key local information from person images.

Person Re-Identification

Towards Improving Embedding Based Models of Social Network Alignment via Pseudo Anchors

1 code implementation • 22 Nov 2021 • Zihan Yan, Li Liu, Xin Li, William K. Cheung, Youmin Zhang, Qun Liu, Guoyin Wang

Social network alignment aims at aligning person identities across social networks.

Meta-Learning

Beyond Sole Strength: Customized Ensembles for Generalized Vision-Language Models

1 code implementation • 28 Nov 2023 • Zhihe Lu, Jiawang Bai, Xin Li, Zeyu Xiao, Xinchao Wang

However, performance advancements are limited when relying solely on intricate algorithmic designs for a single model, even one exhibiting strong performance, e.g., CLIP-ViT-B/16.

Ranked #2 on Prompt Engineering on ImageNet

Prompt Engineering

Once Upon a $\textit{Time}$ in $\textit{Graph}$: Relative-Time Pretraining for Complex Temporal Reasoning

1 code implementation • 23 Oct 2023 • Sen yang, Xin Li, Lidong Bing, Wai Lam

However, the knowledge-time association is usually insufficient for the downstream tasks that require reasoning over temporal dependencies between knowledge.

Question Answering

Learning Deep Multi-Level Similarity for Thermal Infrared Object Tracking

1 code implementation • 9 Jun 2019 • Qiao Liu, Xin Li, Zhenyu He, Nana Fan, Di Yuan, Hongpeng Wang

These two similarities complement each other and hence enhance the discriminative capacity of the network for handling distractors.

DeepReduce: A Sparse-tensor Communication Framework for Distributed Deep Learning

1 code implementation • NeurIPS 2021 • Kelly Kostopoulou, Hang Xu, Aritra Dutta, Xin Li, Alexandros Ntoulas, Panos Kalnis

This paper introduces DeepReduce, a versatile framework for the compressed communication of sparse tensors, tailored for distributed deep learning.

DeepReduce: A Sparse-tensor Communication Framework for Federated Deep Learning

1 code implementation • NeurIPS 2021 • Hang Xu, Kelly Kostopoulou, Aritra Dutta, Xin Li, Alexandros Ntoulas, Panos Kalnis

DeepReduce is orthogonal to existing gradient sparsifiers and can be applied in conjunction with them, transparently to the end-user, to significantly lower the communication overhead.

ConNER: Consistency Training for Cross-lingual Named Entity Recognition

1 code implementation • 17 Nov 2022 • Ran Zhou, Xin Li, Lidong Bing, Erik Cambria, Luo Si, Chunyan Miao

We propose ConNER as a novel consistency training framework for cross-lingual NER, which comprises: (1) translation-based consistency training on unlabeled target-language data, and (2) dropout-based consistency training on labeled source-language data.

Cross-Lingual NER Knowledge Distillation +3

No trends in spring and autumn phenology during the global warming hiatus

1 code implementation • Nature Communications 2019 • Xufeng Wang, Jingfeng Xiao, Xin Li, Guodong Cheng, Mingguo Ma, Gaofeng Zhu, M. Altaf Arain, T. Andrew Black, Rachhpal S. Jassal

Phenology plays a fundamental role in regulating photosynthesis, evapotranspiration, and surface energy fluxes and is sensitive to climate change.

Rotation Invariant Point Cloud Classification: Where Local Geometry Meets Global Topology

1 code implementation • 1 Nov 2019 • Chen Zhao, Jiaqi Yang, Xin Xiong, Angfan Zhu, Zhiguo Cao, Xin Li

To the best of our knowledge, this work is the first principled approach toward adaptively combining global and local information under the context of RI point cloud analysis.

General Classification Point Cloud Classification

A Detector-oblivious Multi-arm Network for Keypoint Matching

1 code implementation • 2 Apr 2021 • Xuelun Shen, Cheng Wang, Xin Li, Qian Hu, Jingyi Zhang

This paper presents a matching network to establish point correspondence between images.

Multilingual AMR Parsing with Noisy Knowledge Distillation

1 code implementation • Findings (EMNLP) 2021 • Deng Cai, Xin Li, Jackie Chun-Sing Ho, Lidong Bing, Wai Lam

We study multilingual AMR parsing from the perspective of knowledge distillation, where the aim is to learn and improve a multilingual AMR parser by using an existing English parser as its teacher.

AMR Parsing Knowledge Distillation

SwinIQA: Learned Swin Distance for Compressed Image Quality Assessment

1 code implementation • 9 May 2022 • Jianzhao Liu, Xin Li, Yanding Peng, Tao Yu, Zhibo Chen

In this paper, we design a full-reference image quality assessment metric SwinIQA to measure the perceptual quality of compressed images in a learned Swin distance space.

Ranked #1 on Compressed Image Quality Assessment on CLIC2021Test-subset

Compressed Image Quality Assessment Image Compression +1

Manifold Learning of Four-dimensional Scanning Transmission Electron Microscopy

1 code implementation • 18 Oct 2018 • Xin Li, Ondrej E. Dyck, Mark P. Oxley, Andrew R. Lupini, Leland McInnes, John Healy, Stephen Jesse, Sergei V. Kalinin

Four-dimensional scanning transmission electron microscopy (4D-STEM) of local atomic diffraction patterns is emerging as a powerful technique for probing intricate details of atomic structure and atomic electric fields.

Probabilistic Model Distillation for Semantic Correspondence

1 code implementation • CVPR 2021 • Xin Li, Deng-Ping Fan, Fan Yang, Ao Luo, Hong Cheng, Zicheng Liu

We address this problem with the use of a novel Probabilistic Model Distillation (PMD) approach which transfers knowledge learned by a probabilistic teacher model on synthetic data to a static student model with the use of unlabeled real image pairs.

Representation Learning Semantic correspondence

Enhancing Multilingual Language Model with Massive Multilingual Knowledge Triples

1 code implementation • 22 Nov 2021 • Linlin Liu, Xin Li, Ruidan He, Lidong Bing, Shafiq Joty, Luo Si

In this work, we explore methods to make better use of the multilingual annotation and language agnostic property of KG triples, and present novel knowledge based multilingual language models (KMLMs) trained directly on the knowledge triples.

Knowledge Graphs Language Modelling +9

HST: Hierarchical Swin Transformer for Compressed Image Super-resolution

3 code implementations • 21 Aug 2022 • Bingchen Li, Xin Li, Yiting Lu, Sen Liu, Ruoyu Feng, Zhibo Chen

Compressed Image Super-resolution has achieved great attention in recent years, where images are degraded with compression artifacts and low-resolution artifacts.

Ranked #1 on Compressed Image Super-resolution on DIV2K-q40-x4

Compressed Image Super-resolution Image Super-Resolution

Towards Robust Low-Resource Fine-Tuning with Multi-View Compressed Representations

1 code implementation • 16 Nov 2022 • Linlin Liu, Xingxuan Li, Megh Thakkar, Xin Li, Shafiq Joty, Luo Si, Lidong Bing

Due to the huge number of parameters, fine-tuning of pretrained language models (PLMs) is prone to overfitting in low-resource scenarios.

WL-Align: Weisfeiler-Lehman Relabeling for Aligning Users across Networks via Regularized Representation Learning

1 code implementation • 29 Dec 2022 • Li Liu, Penggang Chen, Xin Li, William K. Cheung, Youmin Zhang, Qun Liu, Guoyin Wang

Aligning users across networks using graph representation learning has been found effective where the alignment is accomplished in a low-dimensional embedding space.

Graph Representation Learning

Domain-adversarial Network Alignment

1 code implementation • 15 Aug 2019 • Huiting Hong, Xin Li, Yuangang Pan, Ivor Tsang

Network alignment is a critical task to a wide variety of fields.

Network Embedding

Toward Tag-free Aspect Based Sentiment Analysis: A Multiple Attention Network Approach

3 code implementations • 22 Mar 2020 • Yao Qiang, Xin Li, Dongxiao Zhu

Existing aspect based sentiment analysis (ABSA) approaches leverage various neural network models to extract the aspect sentiments via learning aspect-specific feature representations.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +1

Improving Adversarial Robustness via Probabilistically Compact Loss with Logit Constraints

1 code implementation • 14 Dec 2020 • Xin Li, Xiangrui Li, Deng Pan, Dongxiao Zhu

This inspires us to propose a new Probabilistically Compact (PC) loss with logit constraints which can be used as a drop-in replacement for cross-entropy (CE) loss to improve CNN's adversarial robustness.

Adversarial Robustness

SimSR: Simple Distance-based State Representation for Deep Reinforcement Learning

2 code implementations • 31 Dec 2021 • Hongyu Zang, Xin Li, Mingzhong Wang

This work explores how to learn robust and generalizable state representation from image-based observations with deep reinforcement learning methods.

reinforcement-learning Reinforcement Learning (RL)

Semantic-aware Message Broadcasting for Efficient Unsupervised Domain Adaptation

1 code implementation • 6 Dec 2022 • Xin Li, Cuiling Lan, Guoqiang Wei, Zhibo Chen

In this way, our message broadcasting encourages the group tokens to learn more informative and diverse information for effective domain alignment.

Ranked #1 on Unsupervised Domain Adaptation on VisDA2017

Pseudo Label Unsupervised Domain Adaptation

LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global Cross-Modal Fusion

1 code implementation • CVPR 2023 • Xin Li, Tao Ma, Yuenan Hou, Botian Shi, Yuchen Yang, Youquan Liu, Xingjiao Wu, Qin Chen, Yikang Li, Yu Qiao, Liang He

Notably, LoGoNet ranks 1st on the Waymo 3D object detection leaderboard and obtains 81.02 mAPH (L2) detection performance.

3D Object Detection object-detection +1

SPARTAN: Self-supervised Spatiotemporal Transformers Approach to Group Activity Recognition

1 code implementation • 6 Mar 2023 • Naga VS Raviteja Chappa, Pha Nguyen, Alexander H Nelson, Han-Seok Seo, Xin Li, Page Daniel Dobbs, Khoa Luu

In this paper, we propose a new, simple, and effective Self-supervised Spatio-temporal Transformers (SPARTAN) approach to Group Activity Recognition (GAR) using unlabeled video data.

Group Activity Recognition

Improving Self-training for Cross-lingual Named Entity Recognition with Contrastive and Prototype Learning

1 code implementation • 23 May 2023 • Ran Zhou, Xin Li, Lidong Bing, Erik Cambria, Chunyan Miao

In cross-lingual named entity recognition (NER), self-training is commonly used to bridge the linguistic gap by training on pseudo-labeled target-language data.

Cross-Lingual NER named-entity-recognition +4

Task-driven Semantic Coding via Reinforcement Learning

1 code implementation • 7 Jun 2021 • Xin Li, Jun Shi, Zhibo Chen

However, the traditional hybrid coding framework cannot be optimized in an end-to-end manner, which prevents task-driven semantic fidelity metrics from being automatically integrated into the rate-distortion optimization process.

Face Detection License Plate Detection +4

Retrofitting Multilingual Sentence Embeddings with Abstract Meaning Representation

1 code implementation • 18 Oct 2022 • Deng Cai, Xin Li, Jackie Chun-Sing Ho, Lidong Bing, Wai Lam

Unlike most prior work that only evaluates the ability to measure semantic similarity, we present a thorough evaluation of existing multilingual sentence embeddings and our improved versions, which include a collection of five transfer tasks in different downstream applications.

RTracker: Recoverable Tracking via PN Tree Structured Memory

1 code implementation • 28 Mar 2024 • Yuqing Huang, Xin Li, Zikun Zhou, YaoWei Wang, Zhenyu He, Ming-Hsuan Yang

Upon the PN tree memory, we develop corresponding walking rules for determining the state of the target and define a set of control flows to unite the tracker and the detector in different tracking scenarios.

Compressed Sensing of Scanning Transmission Electron Microscopy (STEM) on Non-Rectangular Scans

1 code implementation • 13 May 2018 • Xin Li, Ondrej Dyck, Sergei V. Kalinin, Stephen Jesse

Scanning Transmission Electron Microscopy (STEM) has become the mainstay for materials characterization at the atomic level, with applications ranging from visualization of localized and extended defects to mapping order parameter fields.

DAC: Data-free Automatic Acceleration of Convolutional Networks

1 code implementation • 20 Dec 2018 • Xin Li, Shuai Zhang, Bolan Jiang, Yingyong Qi, Mooi Choo Chuah, Ning Bi

A complex deep learning model with high accuracy runs slowly on resource-limited devices, while a light-weight model that runs much faster loses accuracy.

Image Classification Multi-Person Pose Estimation +2

Probabilistic prediction of the heave motions of a semi-submersible by a deep learning problem model

1 code implementation • 9 Oct 2021 • Xiaoxian Guo, Xiantao Zhang, Xinliang Tian, Wenyue Lu, Xin Li

In this study, we extend a deep learning (DL) model, which could predict the heave and surge motions of a floating semi-submersible 20 to 50 seconds ahead with good accuracy, to quantify the uncertainty of its predicted time series with the help of the dropout technique.

Motion Compensation motion prediction +2

Enhancing Cross-lingual Prompting with Dual Prompt Augmentation

1 code implementation • 15 Feb 2022 • Meng Zhou, Xin Li, Yue Jiang, Lidong Bing

Prompting shows promising results in few-shot scenarios.

Cross-Lingual Transfer

Behavior Prior Representation learning for Offline Reinforcement Learning

1 code implementation • 2 Nov 2022 • Hongyu Zang, Xin Li, Jie Yu, Chen Liu, Riashat Islam, Remi Tachet des Combes, Romain Laroche

Our method, Behavior Prior Representation (BPR), learns state representations with an easy-to-integrate objective based on behavior cloning of the dataset: we first learn a state representation by mimicking actions from the dataset, and then train a policy on top of the fixed representation, using any off-the-shelf Offline RL algorithm.

Offline RL reinforcement-learning +2

From Cloze to Comprehension: Retrofitting Pre-trained Masked Language Model to Pre-trained Machine Reader

1 code implementation • 9 Dec 2022 • Weiwen Xu, Xin Li, Wenxuan Zhang, Meng Zhou, Wai Lam, Luo Si, Lidong Bing

We present Pre-trained Machine Reader (PMR), a novel method for retrofitting pre-trained masked language models (MLMs) to pre-trained machine reading comprehension (MRC) models without acquiring labeled data.

Classification Extractive Question-Answering +6

mPMR: A Multilingual Pre-trained Machine Reader at Scale

1 code implementation • 23 May 2023 • Weiwen Xu, Xin Li, Wai Lam, Lidong Bing

mPMR aims to guide multilingual pre-trained language models (mPLMs) to perform natural language understanding (NLU) including both sequence classification and span extraction in multiple languages.

Classification Machine Reading Comprehension +3

Dual-view Correlation Hybrid Attention Network for Robust Holistic Mammogram Classification

1 code implementation • 19 Jun 2023 • Zhiwei Wang, Junlin Xian, Kangyi Liu, Xin Li, Qiang Li, Xin Yang

Mammogram images are important for breast cancer screening and are typically obtained in a dual-view form, i.e., cranio-caudal (CC) and mediolateral oblique (MLO), to provide complementary information.

Clinical Knowledge

The Algonauts Project 2023 Challenge: UARK-UAlbany Team Solution

1 code implementation • 1 Aug 2023 • Xuan-Bac Nguyen, Xudong Liu, Xin Li, Khoa Luu

The goal is to predict brain responses across the entire visual brain, as it is the region where the most reliable responses to images have been observed.

Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics

1 code implementation • 21 Mar 2024 • Shan Jia, Reilin Lyu, Kangran Zhao, Yize Chen, Zhiyuan Yan, Yan Ju, Chuanbo Hu, Xin Li, Baoyuan Wu, Siwei Lyu

DeepFakes, which refer to AI-generated media content, have become an increasing concern due to their use as a means for disinformation.

DeepFake Detection Experimental Design +2

SBGAR: Semantics Based Group Activity Recognition

1 code implementation • ICCV 2017 • Xin Li, Mooi Choo Chuah

Activity recognition has become an important function in many emerging computer vision applications, e.g., automatic video surveillance systems, human-computer interaction applications, and video recommendation systems.

Group Activity Recognition

On the Learning Property of Logistic and Softmax Losses for Deep Neural Networks

1 code implementation • 4 Mar 2020 • Xiangrui Li, Xin Li, Deng Pan, Dongxiao Zhu

Deep convolutional neural networks (CNNs) trained with logistic and softmax losses have made significant advancement in visual recognition tasks in computer vision.

Binary Classification Classification +2

Aspect-based Sentiment Analysis in Question Answering Forums

1 code implementation • Findings (EMNLP) 2021 • Wenxuan Zhang, Yang Deng, Xin Li, Lidong Bing, Wai Lam

This motivates us to investigate the task of ABSA on QA forums (ABSA-QA), aiming to jointly detect the discussed aspects and their sentiment polarities for a given QA pair.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +2

PeerDA: Data Augmentation via Modeling Peer Relation for Span Identification Tasks

1 code implementation • 17 Oct 2022 • Weiwen Xu, Xin Li, Yang Deng, Wai Lam, Lidong Bing

Specifically, a novel Peer Data Augmentation (PeerDA) approach is proposed which employs span pairs with the PR relation as the augmentation data for training.

Data Augmentation Relation

AQE: Argument Quadruplet Extraction via a Quad-Tagging Augmented Generative Approach

1 code implementation • 31 May 2023 • Jia Guo, Liying Cheng, Wenxuan Zhang, Stanley Kok, Xin Li, Lidong Bing

In this work, we propose, for the first time, a challenging argument quadruplet extraction task (AQE), which can provide an all-in-one extraction of four argumentative components, i.e., claims, evidence, evidence types, and stances.

Argument Mining Stance Classification +1

A Real-Time Deep Network for Crowd Counting

1 code implementation • 16 Feb 2020 • Xiaowen Shi, Xin Li, Caili Wu, Shuchen Kong, Jing Yang, Liang He

Automatic analysis of highly crowded people has attracted extensive attention from computer vision research.

Crowd Counting

A streamlined Approach to Multimodal Few-Shot Class Incremental Learning for Fine-Grained Datasets

2 code implementations • 10 Mar 2024 • Thang Doan, Sima Behpour, Xin Li, Wenbin He, Liang Gou, Liu Ren

Few-shot Class-Incremental Learning (FSCIL) poses the challenge of retaining prior knowledge while learning from limited new data streams, all without overfitting.

Few-Shot Class-Incremental Learning Incremental Learning

Paper
Code

On Improving Deep Reinforcement Learning for POMDPs

1 code implementation • 26 Apr 2017 • Pengfei Zhu, Xin Li, Pascal Poupart, Guanghui Miao

Deep Reinforcement Learning (RL) recently emerged as one of the most competitive approaches for learning in sequential decision making problems with fully observable environments, e.g., computer Go.

Atari Games Decision Making +4

Paper
Code

A Saliency-Guided Street View Image Inpainting Framework for Efficient Last-Meters Wayfinding

1 code implementation • 14 May 2022 • Chuanbo Hu, Shan Jia, Fan Zhang, Xin Li

However, due to the large diversity of geographic context and acquisition conditions, the captured SVI always contains various distracting objects (e.g., pedestrians and vehicles), which will distract human visual attention from efficiently finding the destination in the last few meters.

Image Inpainting object-detection +2

Paper
Code

Fusion-based Few-Shot Morphing Attack Detection and Fingerprinting

1 code implementation • 27 Oct 2022 • Na Zhang, Shan Jia, Siwei Lyu, Xin Li

Our technical contributions include: 1) We propose a fusion-based few-shot learning (FSL) method to learn discriminative features that can generalize to unseen morphing attack types from predefined presentation attacks; 2) The proposed FSL based on the fusion of the PRNU model and Noiseprint network is extended from binary MAD to multiclass morphing attack fingerprinting (MAF).

Face Recognition Few-Shot Learning

Paper
Code

Negative Flux Aggregation to Estimate Feature Attributions

1 code implementation • 17 Jan 2023 • Xin Li, Deng Pan, Chengyin Li, Yao Qiang, Dongxiao Zhu

There are increasing demands for understanding deep neural networks' (DNNs) behavior spurred by growing security and/or transparency concerns.

Paper
Code

Federated Learning for Clinical Structured Data: A Benchmark Comparison of Engineering and Statistical Approaches

1 code implementation • 6 Nov 2023 • Siqi Li, Di Miao, Qiming Wu, Chuan Hong, Danny D'Agostino, Xin Li, Yilin Ning, Yuqing Shang, Huazhu Fu, Marcus Eng Hock Ong, Hamed Haddadi, Nan Liu

Our goal was to bridge the gap by presenting the first comprehensive comparison of FL frameworks from both engineering and statistical domains.

Federated Learning Privacy Preserving

Paper
Code

On the K-theory of crossed products by automorphic semigroup actions

1 code implementation • 24 May 2012 • Joachim Cuntz, Siegfried Echterhoff, Xin Li

Let $P$ be a semigroup that admits an embedding into a group $G$. Assume that the embedding satisfies a certain Toeplitz condition and that the Baum-Connes conjecture holds for $G$. We prove a formula describing the K-theory of the reduced crossed product $A \rtimes_{\alpha,r} P$ by any automorphic action of $P$. This formula is obtained as a consequence of a result on the K-theory of crossed products for special actions of $G$ on totally disconnected spaces.

Operator Algebras Dynamical Systems K-Theory and Homology 46L05, 46L80 (Primary) 20Mxx, 11R04 (Secondary)

Paper
Code

TEAC: Intergrating Trust Region and Max Entropy Actor Critic for Continuous Control

1 code implementation • 1 Jan 2021 • Hongyu Zang, Xin Li, Li Zhang, Peiyao Zhao, Mingzhong Wang

Trust region methods and maximum entropy methods are two state-of-the-art branches used in reinforcement learning (RL) for the benefits of stability and exploration in continuous environments, respectively.

Continuous Control Reinforcement Learning (RL)

Paper
Code

Muti-view Mouse Social Behaviour Recognition with Deep Graphical Model

1 code implementation • 4 Nov 2020 • Zheheng Jiang, Feixiang Zhou, Aite Zhao, Xin Li, Ling Li, DaCheng Tao, Xuelong Li, Huiyu Zhou

To address this problem, we here propose a novel multiview latent-attention and dynamic discriminative model that jointly learns view-specific and view-shared sub-structures, where the former captures unique dynamics of each view whilst the latter encodes the interaction between the views.

Paper
Code

Exploiting Semantic Relations for Fine-grained Entity Typing

1 code implementation • AKBC 2020 • Hongliang Dai, Yangqiu Song, Xin Li

We find that, in some cases, existing neural fine-grained entity typing models may ignore the semantic information in the context that is important for typing.

Entity Typing Relation +2

Paper
Code

DR-GAN: Distribution Regularization for Text-to-Image Generation

1 code implementation • 17 Apr 2022 • Hongchen Tan, Xiuping Liu, BaoCai Yin, Xin Li

This paper presents a new Text-to-Image generation model, named Distribution Regularization Generative Adversarial Network (DR-GAN), to generate images from text descriptions from improved distribution learning.

Generative Adversarial Network Text-to-Image Generation

Paper
Code

Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information

1 code implementation • 31 Oct 2022 • Riashat Islam, Manan Tomar, Alex Lamb, Yonathan Efroni, Hongyu Zang, Aniket Didolkar, Dipendra Misra, Xin Li, Harm van Seijen, Remi Tachet des Combes, John Langford

We find that contemporary representation learning techniques can fail on datasets where the noise is a complex and time dependent process, which is prevalent in practical applications.

Offline RL Reinforcement Learning (RL) +1

Paper
Code

Hyp-OW: Exploiting Hierarchical Structure Learning with Hyperbolic Distance Enhances Open World Object Detection

2 code implementations • 25 Jun 2023 • Thang Doan, Xin Li, Sima Behpour, Wenbin He, Liang Gou, Liu Ren

We argue that this contextual information should already be embedded within the known classes.

object-detection Open World Object Detection

Paper
Code

WebCiteS: Attributed Query-Focused Summarization on Chinese Web Search Results with Citations

1 code implementation • 4 Mar 2024 • Haolin Deng, Chang Wang, Xin Li, Dezhang Yuan, Junlang Zhan, Tianhua Zhou, Jin Ma, Jun Gao, Ruifeng Xu

Enhancing the attribution in large language models (LLMs) is a crucial task.

Query-focused Summarization

Paper
Code

COMQ: A Backpropagation-Free Algorithm for Post-Training Quantization

1 code implementation • 11 Mar 2024 • Aozhong Zhang, Zi Yang, Naigang Wang, Yingyong Qin, Jack Xin, Xin Li, Penghang Yin

Within a fixed layer, COMQ treats all the scaling factor(s) and bit-codes as the variables of the reconstruction error.

Quantization

Paper
Code

GANE: A Generative Adversarial Network Embedding

no code implementations • 18 May 2018 • Huiting Hong, Xin Li, Mingzhong Wang

Network embedding has become a hot research topic recently which can provide low-dimensional feature representations for many machine learning applications.

Clustering Generative Adversarial Network +2

Paper
Add Code

On Improving Deep Reinforcement Learning for POMDPs

no code implementations • 17 Apr 2018 • Pengfei Zhu, Xin Li, Pascal Poupart, Guanghui Miao

Deep Reinforcement Learning (RL) recently emerged as one of the most competitive approaches for learning in sequential decision making problems with fully observable environments, e.g., computer Go.

Atari Games Decision Making +4

Paper
Add Code

Perceptually Optimized Generative Adversarial Network for Single Image Dehazing

no code implementations • 3 May 2018 • Yixin Du, Xin Li

To overcome this weakness, we propose a direct deep learning approach toward image dehazing bypassing the step of transmission map estimation and facilitating end-to-end perceptual optimization.

Denoising Generative Adversarial Network +2

Paper
Add Code

Weighted Low-Rank Approximation of Matrices and Background Modeling

no code implementations • 15 Apr 2018 • Aritra Dutta, Xin Li, Peter Richtárik

We primarily study a special weighted low-rank approximation of matrices and then apply it to solve the background modeling problem.

Paper
Add Code

ReHAR: Robust and Efficient Human Activity Recognition

no code implementations • 27 Feb 2018 • Xin Li, Mooi Choo Chuah

The whole model is trained end-to-end to allow meaningful representations to be generated for the final activity recognition.

Human Activity Recognition Optical Flow Estimation

Paper
Add Code

Joint Demosaicing and Denoising with Perceptual Optimization on a Generative Adversarial Network

no code implementations • 13 Feb 2018 • Weisheng Dong, Ming Yuan, Xin Li, Guangming Shi

Image demosaicing - one of the most important early stages in digital camera pipelines - addresses the problem of reconstructing a full-resolution image from so-called color-filter-arrays.

Demosaicking Denoising +2

Paper
Add Code

Adversarial Examples Detection in Deep Networks with Convolutional Filter Statistics

no code implementations • ICCV 2017 • Xin Li, Fuxin Li

A cascade classifier was designed to efficiently detect adversarials.

Paper
Add Code

Two-Level Structural Sparsity Regularization for Identifying Lattices and Defects in Noisy Images

no code implementations • 24 Nov 2016 • Xin Li, Alex Belianinov, Ondrej Dyck, Stephen Jesse, Chiwoo Park

We propose to formulate the identification of the lattice groups as a sparse group selection problem.

regression

Paper
Add Code

Learning with Rethinking: Recurrently Improving Convolutional Neural Networks through Feedback

no code implementations • 15 Aug 2017 • Xin Li, Zequn Jie, Jiashi Feng, Changsong Liu, Shuicheng Yan

However, most of the existing CNN models only learn features through a feedforward structure and no feedback information from top to bottom layers is exploited to enable the networks to refine themselves.

Paper
Add Code

Prune the Convolutional Neural Networks with Sparse Shrink

no code implementations • 8 Aug 2017 • Xin Li, Changsong Liu

These results have demonstrated the effectiveness of our "Sparse Shrink" algorithm.

Paper
Add Code

FoveaNet: Perspective-aware Urban Scene Parsing

no code implementations • ICCV 2017 • Xin Li, Zequn Jie, Wei Wang, Changsong Liu, Jimei Yang, Xiaohui Shen, Zhe Lin, Qiang Chen, Shuicheng Yan, Jiashi Feng

Thus, they suffer from heterogeneous object scales caused by perspective projection of cameras on actual scenes and inevitably encounter parsing failures on distant objects as well as other boundary and recognition errors.

Scene Parsing

Paper
Add Code

Weighted Low Rank Approximation for Background Estimation Problems

no code implementations • 4 Jul 2017 • Aritra Dutta, Xin Li

Classical principal component analysis (PCA) is not robust to the presence of sparse outliers in the data.

Paper
Add Code

A Batch-Incremental Video Background Estimation Model using Weighted Low-Rank Approximation of Matrices

no code implementations • 2 Jul 2017 • Aritra Dutta, Xin Li, Peter Richtárik

Principal component pursuit (PCP) is a state-of-the-art approach for background estimation problems.

Paper
Add Code

ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA

no code implementations • 1 Dec 2016 • Song Han, Junlong Kang, Huizi Mao, Yiming Hu, Xin Li, Yubin Li, Dongliang Xie, Hong Luo, Song Yao, Yu Wang, Huazhong Yang, William J. Dally

Evaluated on the LSTM for speech recognition benchmark, ESE is 43x and 3x faster than Core i7 5930k CPU and Pascal Titan X GPU implementations.

Quantization speech-recognition +1

Paper
Add Code

Cross-scale predictive dictionaries

no code implementations • 16 Nov 2015 • Vishwanath Saragadam, Xin Li, Aswin Sankaranarayanan

Sparse representations using data dictionaries provide an efficient model particularly for signals that do not enjoy alternate analytic sparsifying transformations.

Paper
Add Code

Video Scene Parsing with Predictive Feature Learning

no code implementations • ICCV 2017 • Xiaojie Jin, Xin Li, Huaxin Xiao, Xiaohui Shen, Zhe Lin, Jimei Yang, Yunpeng Chen, Jian Dong, Luoqi Liu, Zequn Jie, Jiashi Feng, Shuicheng Yan

In this way, the network can effectively learn to capture video dynamics and temporal context, which are critical clues for video scene parsing, without requiring extra manual annotations.

Representation Learning Scene Parsing

Paper
Add Code

Detecting Suicidal Ideation in Chinese Microblogs with Psychological Lexicons

no code implementations • 4 Nov 2014 • Xiaolei Huang, Lei Zhang, Tianli Liu, David Chiu, Tingshao Zhu, Xin Li

Currently, we have identified 53 known suicidal cases who posted suicide notes on Weibo prior to their deaths. We explore linguistic features of these known cases using a psychological lexicon dictionary, and train an effective suicidal Weibo post detection model.

BIG-bench Machine Learning

Paper
Add Code

Learning Hybrid Sparsity Prior for Image Restoration: Where Deep Learning Meets Sparse Coding

no code implementations • 18 Jul 2018 • Fangfang Wu, Weisheng Dong, Guangming Shi, Xin Li

State-of-the-art approaches toward image restoration can be classified into model-based and learning-based.

Image Restoration

Paper
Add Code

Superimposition-guided Facial Reconstruction from Skull

no code implementations • 28 Sep 2018 • Celong Liu, Xin Li

We develop a new algorithm to perform facial reconstruction from a given skull.

Facial Inpainting

Paper
Add Code

Deep Multi-Task Learning for Aspect Term Extraction with Memory Interaction

no code implementations • EMNLP 2017 • Xin Li, Wai Lam

We propose a novel LSTM-based deep multi-task learning framework for aspect term extraction from user review sentences.

Aspect-Based Sentiment Analysis (ABSA) Multi-Task Learning +2

Paper
Add Code

Learning Parametric Sparse Models for Image Super-Resolution

no code implementations • NeurIPS 2016 • Yongbo Li, Weisheng Dong, Xuemei Xie, Guangming Shi, Xin Li, Donglai Xu

More specifically, the parametric sparse prior of the desirable high-resolution (HR) image patches is learned from both the input low-resolution (LR) image and a training image dataset.

Image Super-Resolution

Paper
Add Code

CONet: A Cognitive Ocean Network

no code implementations • 9 Jan 2019 • Huimin Lu, Dong Wang, Yujie Li, Jianru Li, Xin Li, Hyoungseop Kim, Seiichi Serikawa, Iztok Humar

The Cognitive Ocean Network (CONet) will become the mainstream of future ocean science and engineering developments.

Paper
Add Code

Adaptive Active Learning for Image Classification

no code implementations • CVPR 2013 • Xin Li, Yuhong Guo

Recently, active learning has attracted a lot of attention in the computer vision field, as it is time-consuming and costly to prepare a good set of labeled images for vision data analysis.

Active Learning Classification +4

Paper
Add Code

Simplified Mirror-Based Camera Pose Computation via Rotation Averaging

no code implementations • CVPR 2015 • Gucan Long, Laurent Kneip, Xin Li, Xiaohu Zhang, Qifeng Yu

Our theoretical contribution extends the applicability of rotation averaging to a more general case, and enables mirror-based pose estimation in closed-form under the chordal L2-metric, or in an outlier-robust way by employing iterative L1-norm averaging.

Camera Calibration Pose Estimation

Paper
Add Code

Object-Aware Dense Semantic Correspondence

no code implementations • CVPR 2017 • Fan Yang, Xin Li, Hong Cheng, Jianping Li, Leiting Chen

To address these problems, this paper proposes an object-aware method to estimate per-pixel correspondences from semantic to low-level by learning a classifier for each selected discriminative grid cell and guiding the localization of every pixel under the semantic constraint.

Object Semantic correspondence

Paper
Add Code

Low-Rank Tensor Approximation With Laplacian Scale Mixture Modeling for Multiframe Image Denoising

no code implementations • ICCV 2015 • Weisheng Dong, Guangyu Li, Guangming Shi, Xin Li, Yi Ma

Patch-based low-rank models have been shown to be effective in exploiting spatial redundancy of natural images, especially for the application of image denoising.

Dictionary Learning Image Denoising

Paper
Add Code

3D Fragment Reassembly Using Integrated Template Guidance and Fracture-Region Matching

no code implementations • ICCV 2015 • Kang Zhang, Wuyi Yu, Mary Manhein, Warren Waggenspack, Xin Li

This paper studies matching of fragmented objects to recompose their original geometry.

Paper
Add Code

Semi-Supervised Zero-Shot Classification With Label Representation Learning

no code implementations • ICCV 2015 • Xin Li, Yuhong Guo, Dale Schuurmans

Most existing zero-shot learning methods require a user to first provide a set of semantic visual attributes for each class as side information before applying a two-step prediction procedure that introduces an intermediate attribute prediction problem.

Attribute Classification +4

Paper
Add Code

Topic Model for Identifying Suicidal Ideation in Chinese Microblog

no code implementations • PACLIC 2015 • Xiaolei Huang, Xin Li, Tianli Liu, David Chiu, Tingshao Zhu, Lei Zhang

Paper
Add Code

Iris R-CNN: Accurate Iris Segmentation in Non-cooperative Environment

no code implementations • 25 Mar 2019 • Chunyang Feng, Yufeng Sun, Xin Li

Despite the significant advances in iris segmentation, accomplishing accurate iris segmentation in non-cooperative environments remains a grand challenge.

Iris Segmentation Region Proposal +1

Paper
Add Code

Aligning Users Across Social Networks Using Network Embedding

no code implementations • IJCAI 2016 • Li Liu, William K. Cheung, Xin Li, Lejian Liao

Network Embedding

Paper
Add Code

Target-Aware Deep Tracking

no code implementations • CVPR 2019 • Xin Li, Chao Ma, Baoyuan Wu, Zhenyu He, Ming-Hsuan Yang

Despite demonstrated successes for numerous vision tasks, the contributions of using pre-trained deep features for visual tracking are not as significant as that for object recognition.

Object Object Recognition +1

Paper
Add Code

LO-Net: Deep Real-time Lidar Odometry

no code implementations • CVPR 2019 • Qing Li, Shaoyang Chen, Cheng Wang, Xin Li, Chenglu Wen, Ming Cheng, Jonathan Li

We present a novel deep convolutional network pipeline, LO-Net, for real-time lidar odometry estimation.

feature selection Pose Estimation

Paper
Add Code

STN-Homography: estimate homography parameters directly

no code implementations • 6 Jun 2019 • Qiang Zhou, Xin Li

In this paper, we introduce the STN-Homography model to directly estimate the homography matrix between an image pair.

Homography Estimation

Paper
Add Code

Vispi: Automatic Visual Perception and Interpretation of Chest X-rays

no code implementations • MIDL 2019 • Xin Li, Rui Cao, Dongxiao Zhu

Medical imaging contains the essential information for rendering diagnostic and treatment decisions.

Image Captioning

Paper
Add Code

Reconstructing Perceived Images from Brain Activity by Visually-guided Cognitive Representation and Adversarial Learning

no code implementations • 27 Jun 2019 • Ziqi Ren, Jie Li, Xuetong Xue, Xin Li, Fan Yang, Zhicheng Jiao, Xinbo Gao

In addition, we introduce a novel three-stage learning approach which enables the (cognitive) encoder to gradually distill useful knowledge from the paired (visual) encoder during the learning process.

Generative Adversarial Network Image Reconstruction +2

Paper
Add Code

Small and Practical BERT Models for Sequence Labeling

no code implementations • IJCNLP 2019 • Henry Tsai, Jason Riesa, Melvin Johnson, Naveen Arivazhagan, Xin Li, Amelia Archer

We propose a practical scheme to train a single multilingual sequence labeling model that yields state of the art results and is small and fast enough to run on a single CPU.

Part-Of-Speech Tagging

Paper
Add Code

Iterative Clustering with Game-Theoretic Matching for Robust Multi-consistency Correspondence

no code implementations • 3 Sep 2019 • Chen Zhao, Jiaqi Yang, Ke Xian, Zhiguo Cao, Xin Li

Matching corresponding features between two images is a fundamental task in computer vision with numerous applications in object recognition, robotics, and 3D reconstruction.

3D Reconstruction Clustering +2

Paper
Add Code

Spoofing and Anti-Spoofing with Wax Figure Faces

no code implementations • 12 Oct 2019 • Shan Jia, Xin Li, Chuanbo Hu, Zhengquan Xu

In this work, we introduce a wax figure face database (WFFD) as a novel and super-realistic 3D face presentation attack.

Face Detection Face Recognition +1

Paper
Add Code

Automatic Lumbar Spinal CT Image Segmentation with a Dual Densely Connected U-Net

no code implementations • 21 Oct 2019 • He Tang, Xiaobing Pei, Shilong Huang, Xin Li, Chao Liu

The clinical treatment of degenerative and developmental lumbar spinal stenosis (LSS) is different.

Computed Tomography (CT) Denoising +3

Paper
Add Code

Joint Demosaicing and Super-Resolution (JDSR): Network Design and Perceptual Optimization

no code implementations • 8 Nov 2019 • Xuan Xu, Yanfang Ye, Xin Li

Image demosaicing and super-resolution are two important tasks in the color imaging pipeline.

Demosaicking Generative Adversarial Network +3

Paper
Add Code

Sparse estimation via $\ell_q$ optimization method in high-dimensional linear regression

no code implementations • 12 Nov 2019 • Xin Li, Yaohua Hu, Chong Li, Xiaoqi Yang, Tianzi Jiang

In this paper, we discuss the statistical properties of the $\ell_q$ optimization methods $(0<q\leq 1)$, including the $\ell_q$ minimization method and the $\ell_q$ regularization method, for estimating a sparse parameter from noisy observations in high-dimensional linear regression with either a deterministic or random design.

regression Vocal Bursts Intensity Prediction

Paper
Add Code

Relevance-Promoting Language Model for Short-Text Conversation

no code implementations • 26 Nov 2019 • Xin Li, Piji Li, Wei Bi, Xiaojiang Liu, Wai Lam

In this paper, we propose to formulate the STC task as a language modeling problem and tailor-make a training strategy to adapt a language model for response generation.

Language Modelling Response Generation +1

Paper
Add Code

Digital Twin: Acquiring High-Fidelity 3D Avatar from a Single Image

no code implementations • 7 Dec 2019 • Ruizhe Wang, Chih-Fan Chen, Hao Peng, Xudong Liu, Oliver Liu, Xin Li

We present an approach to generate a high-fidelity 3D face avatar with a high-resolution UV texture map from a single image.

Face Model Vocal Bursts Intensity Prediction

Paper
Add Code
