Search Results for author: Xin Li

Found 371 papers, 160 papers with code

GANE: A Generative Adversarial Network Embedding

no code implementations • 18 May 2018 • Huiting Hong, Xin Li, Mingzhong Wang

Network embedding has become a hot research topic recently which can provide low-dimensional feature representations for many machine learning applications.

Clustering Generative Adversarial Network +2

Paper
Add Code

On Improving Deep Reinforcement Learning for POMDPs

no code implementations • 17 Apr 2018 • Pengfei Zhu, Xin Li, Pascal Poupart, Guanghui Miao

Deep Reinforcement Learning (RL) recently emerged as one of the most competitive approaches for learning in sequential decision making problems with fully observable environments, e. g., computer Go.

Atari Games Decision Making +4

Paper
Add Code

Perceptually Optimized Generative Adversarial Network for Single Image Dehazing

no code implementations • 3 May 2018 • Yixin Du, Xin Li

To overcome this weakness, we propose a direct deep learning approach toward image dehazing bypassing the step of transmission map estimation and facilitating end-to-end perceptual optimization.

Denoising Generative Adversarial Network +2

Paper
Add Code

Weighted Low-Rank Approximation of Matrices and Background Modeling

no code implementations • 15 Apr 2018 • Aritra Dutta, Xin Li, Peter Richtarik

We primarily study a special a weighted low-rank approximation of matrices and then apply it to solve the background modeling problem.

Paper
Add Code

ReHAR: Robust and Efficient Human Activity Recognition

no code implementations • 27 Feb 2018 • Xin Li, Mooi Choo Chuah

The whole model is trained end-to-end to allow meaningful representations to be generated for the final activity recognition.

Human Activity Recognition Optical Flow Estimation

Paper
Add Code

Joint Demosaicing and Denoising with Perceptual Optimization on a Generative Adversarial Network

no code implementations • 13 Feb 2018 • Weishong Dong, Ming Yuan, Xin Li, Guangming Shi

Image demosaicing - one of the most important early stages in digital camera pipelines - addressed the problem of reconstructing a full-resolution image from so-called color-filter-arrays.

Demosaicking Denoising +2

Paper
Add Code

Adversarial Examples Detection in Deep Networks with Convolutional Filter Statistics

no code implementations • ICCV 2017 • Xin Li, Fuxin Li

A cascade classifier was designed to efficiently detect adversarials.

Paper
Add Code

Two-Level Structural Sparsity Regularization for Identifying Lattices and Defects in Noisy Images

no code implementations • 24 Nov 2016 • Xin Li, Alex Belianinov, Ondrej Dyck, Stephen Jesse, Chiwoo Park

We propose to formulate the identification of the lattice groups as a sparse group selection problem.

regression

Paper
Add Code

Learning with Rethinking: Recurrently Improving Convolutional Neural Networks through Feedback

no code implementations • 15 Aug 2017 • Xin Li, Zequn Jie, Jiashi Feng, Changsong Liu, Shuicheng Yan

However, most of the existing CNN models only learn features through a feedforward structure and no feedback information from top to bottom layers is exploited to enable the networks to refine themselves.

Paper
Add Code

Prune the Convolutional Neural Networks with Sparse Shrink

no code implementations • 8 Aug 2017 • Xin Li, Changsong Liu

These results have demonstrated the effectiveness of our "Sparse Shrink" algorithm.

Paper
Add Code

FoveaNet: Perspective-aware Urban Scene Parsing

no code implementations • ICCV 2017 • Xin Li, Zequn Jie, Wei Wang, Changsong Liu, Jimei Yang, Xiaohui Shen, Zhe Lin, Qiang Chen, Shuicheng Yan, Jiashi Feng

Thus, they suffer from heterogeneous object scales caused by perspective projection of cameras on actual scenes and inevitably encounter parsing failures on distant objects as well as other boundary and recognition errors.

Scene Parsing

Paper
Add Code

Weighted Low Rank Approximation for Background Estimation Problems

no code implementations • 4 Jul 2017 • Aritra Dutta, Xin Li

Classical principal component analysis (PCA) is not robust to the presence of sparse outliers in the data.

Paper
Add Code

A Batch-Incremental Video Background Estimation Model using Weighted Low-Rank Approximation of Matrices

no code implementations • 2 Jul 2017 • Aritra Dutta, Xin Li, Peter Richtárik

Principal component pursuit (PCP) is a state-of-the-art approach for background estimation problems.

Paper
Add Code

ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA

no code implementations • 1 Dec 2016 • Song Han, Junlong Kang, Huizi Mao, Yiming Hu, Xin Li, Yubin Li, Dongliang Xie, Hong Luo, Song Yao, Yu Wang, Huazhong Yang, William J. Dally

Evaluated on the LSTM for speech recognition benchmark, ESE is 43x and 3x faster than Core i7 5930k CPU and Pascal Titan X GPU implementations.

Quantization speech-recognition +1

Paper
Add Code

Cross-scale predictive dictionaries

no code implementations • 16 Nov 2015 • Vishwanath Saragadam, Xin Li, Aswin Sankaranarayanan

Sparse representations using data dictionaries provide an efficient model particularly for signals that do not enjoy alternate analytic sparsifying transformations.

Paper
Add Code

Video Scene Parsing with Predictive Feature Learning

no code implementations • ICCV 2017 • Xiaojie Jin, Xin Li, Huaxin Xiao, Xiaohui Shen, Zhe Lin, Jimei Yang, Yunpeng Chen, Jian Dong, Luoqi Liu, Zequn Jie, Jiashi Feng, Shuicheng Yan

In this way, the network can effectively learn to capture video dynamics and temporal context, which are critical clues for video scene parsing, without requiring extra manual annotations.

Representation Learning Scene Parsing

Paper
Add Code

Detecting Suicidal Ideation in Chinese Microblogs with Psychological Lexicons

no code implementations • 4 Nov 2014 • Xiaolei Huang, Lei Zhang, Tianli Liu, David Chiu, Tingshao Zhu, Xin Li

Currently, we have identified 53 known suicidal cases who posted suicide notes on Weibo prior to their deaths. We explore linguistic features of these known cases using a psychological lexicon dictionary, and train an effective suicidal Weibo post detection model.

BIG-bench Machine Learning

Paper
Add Code

Learning Hybrid Sparsity Prior for Image Restoration: Where Deep Learning Meets Sparse Coding

no code implementations • 18 Jul 2018 • Fangfang Wu, Weisheng Dong, Guangming Shi, Xin Li

State-of-the-art approaches toward image restoration can be classified into model-based and learning-based.

Image Restoration

Paper
Add Code

Superimposition-guided Facial Reconstruction from Skull

no code implementations • 28 Sep 2018 • Celong Liu, Xin Li

We develop a new algorithm to perform facial reconstruction from a given skull.

Facial Inpainting

Paper
Add Code

Deep Multi-Task Learning for Aspect Term Extraction with Memory Interaction

no code implementations • EMNLP 2017 • Xin Li, Wai Lam

We propose a novel LSTM-based deep multi-task learning framework for aspect term extraction from user review sentences.

Aspect-Based Sentiment Analysis (ABSA) Multi-Task Learning +2

Paper
Add Code

Learning Parametric Sparse Models for Image Super-Resolution

no code implementations • NeurIPS 2016 • Yongbo Li, Weisheng Dong, Xuemei Xie, Guangming Shi, Xin Li, Donglai Xu

More specifically, the parametric sparse prior of the desirable high-resolution (HR) image patches are learned from both the input low-resolution (LR) image and a training image dataset.

Image Super-Resolution

Paper
Add Code

CONet: A Cognitive Ocean Network

no code implementations • 9 Jan 2019 • Huimin Lu, Dong Wang, Yujie Li, Jianru Li, Xin Li, Hyoungseop Kim, Seiichi Serikawa, Iztok Humar

The Cognitive Ocean Network (CONet) will become the mainstream of future ocean science and engineering developments.

Paper
Add Code

Adaptive Active Learning for Image Classification

no code implementations • CVPR 2013 • Xin Li, Yuhong Guo

Recently active learning has attracted a lot of attention in computer vision field, as it is time and cost consuming to prepare a good set of labeled images for vision data analysis.

Active Learning Classification +4

Paper
Add Code

Simplified Mirror-Based Camera Pose Computation via Rotation Averaging

no code implementations • CVPR 2015 • Gucan Long, Laurent Kneip, Xin Li, Xiaohu Zhang, Qifeng Yu

Our theoretical contribution extends the applicability of rotation averaging to a more general case, and enables mirror-based pose estimation in closed-form under the chordal L2-metric, or in an outlier-robust way by employing iterative L1-norm averaging.

Camera Calibration Pose Estimation

Paper
Add Code

Object-Aware Dense Semantic Correspondence

no code implementations • CVPR 2017 • Fan Yang, Xin Li, Hong Cheng, Jianping Li, Leiting Chen

To address these problems, this paper proposes an object-aware method to estimate per-pixel correspondences from semantic to low-level by learning a classifier for each selected discriminative grid cell and guiding the localization of every pixel under the semantic constraint.

Object Semantic correspondence

Paper
Add Code

Low-Rank Tensor Approximation With Laplacian Scale Mixture Modeling for Multiframe Image Denoising

no code implementations • ICCV 2015 • Weisheng Dong, Guangyu Li, Guangming Shi, Xin Li, Yi Ma

Patch-based low-rank models have shown effective in exploiting spatial redundancy of natural images especially for the application of image denoising.

Dictionary Learning Image Denoising

Paper
Add Code

3D Fragment Reassembly Using Integrated Template Guidance and Fracture-Region Matching

no code implementations • ICCV 2015 • Kang Zhang, Wuyi Yu, Mary Manhein, Warren Waggenspack, Xin Li

This paper studies matching of fragmented objects to recompose their original geometry.

Paper
Add Code

Semi-Supervised Zero-Shot Classification With Label Representation Learning

no code implementations • ICCV 2015 • Xin Li, Yuhong Guo, Dale Schuurmans

Most existing zero-shot learning methods require a user to first provide a set of semantic visual attributes for each class as side information before applying a two-step prediction procedure that introduces an intermediate attribute prediction problem.

Attribute Classification +4

Paper
Add Code

Topic Model for Identifying Suicidal Ideation in Chinese Microblog

no code implementations • PACLIC 2015 • Xiaolei Huang, Xin Li, Tianli Liu, David Chiu, Tingshao Zhu, Lei Zhang

Paper
Add Code

Iris R-CNN: Accurate Iris Segmentation in Non-cooperative Environment

no code implementations • 25 Mar 2019 • Chunyang Feng, Yufeng Sun, Xin Li

Despite the significant advances in iris segmentation, accomplishing accurate iris segmentation in non-cooperative environment remains a grand challenge.

Iris Segmentation Region Proposal +1

Paper
Add Code

Aligning Users Across Social Networks Using Network Embedding

no code implementations • IJCAI 2016 • Li Liu, William K. Cheung, Xin Li, Lejian Liao

Li Liu, 1 William K. Cheung, 2 Xin Li, 1⇤ and Lejian Liao1

Network Embedding

Paper
Add Code

Target-Aware Deep Tracking

no code implementations • CVPR 2019 • Xin Li, Chao Ma, Baoyuan Wu, Zhenyu He, Ming-Hsuan Yang

Despite demonstrated successes for numerous vision tasks, the contributions of using pre-trained deep features for visual tracking are not as significant as that for object recognition.

Object Object Recognition +1

Paper
Add Code

LO-Net: Deep Real-time Lidar Odometry

no code implementations • CVPR 2019 • Qing Li, Shaoyang Chen, Cheng Wang, Xin Li, Chenglu Wen, Ming Cheng, Jonathan Li

We present a novel deep convolutional network pipeline, LO-Net, for real-time lidar odometry estimation.

feature selection Pose Estimation

Paper
Add Code

STN-Homography: estimate homography parameters directly

no code implementations • 6 Jun 2019 • Qiang Zhou, Xin Li

In this paper, we introduce the STN-Homography model to directly estimate the homography matrix between image pair.

Homography Estimation

Paper
Add Code

Vispi: Automatic Visual Perception and Interpretation of Chest X-rays

no code implementations • MIDL 2019 • Xin Li, Rui Cao, Dongxiao Zhu

Medical imaging contains the essential information for rendering diagnostic and treatment decisions.

Image Captioning

Paper
Add Code

Reconstructing Perceived Images from Brain Activity by Visually-guided Cognitive Representation and Adversarial Learning

no code implementations • 27 Jun 2019 • Ziqi Ren, Jie Li, Xuetong Xue, Xin Li, Fan Yang, Zhicheng Jiao, Xinbo Gao

In addition, we introduce a novel three-stage learning approach which enables the (cognitive) encoder to gradually distill useful knowledge from the paired (visual) encoder during the learning process.

Generative Adversarial Network Image Reconstruction +2

Paper
Add Code

Small and Practical BERT Models for Sequence Labeling

no code implementations • IJCNLP 2019 • Henry Tsai, Jason Riesa, Melvin Johnson, Naveen Arivazhagan, Xin Li, Amelia Archer

We propose a practical scheme to train a single multilingual sequence labeling model that yields state of the art results and is small and fast enough to run on a single CPU.

Part-Of-Speech Tagging

Paper
Add Code

Iterative Clustering with Game-Theoretic Matching for Robust Multi-consistency Correspondence

no code implementations • 3 Sep 2019 • Chen Zhao, Jiaqi Yang, Ke Xian, Zhiguo Cao, Xin Li

Matching corresponding features between two images is a fundamental task to computer vision with numerous applications in object recognition, robotics, and 3D reconstruction.

3D Reconstruction Clustering +2

Paper
Add Code

Spoofing and Anti-Spoofing with Wax Figure Faces

no code implementations • 12 Oct 2019 • Shan Jia, Xin Li, Chuanbo Hu, Zhengquan Xu

In this work, we introduce a wax figure face database (WFFD) as a novel and super-realistic 3D face presentation attack.

Face Detection Face Recognition +1

Paper
Add Code

Automatic Lumbar Spinal CT Image Segmentation with a Dual Densely Connected U-Net

no code implementations • 21 Oct 2019 • He Tang, Xiaobing Pei, Shilong Huang, Xin Li, Chao Liu

The clinical treatment of degenerative and developmental lumbar spinal stenosis (LSS) is different.

Computed Tomography (CT) Denoising +3

Paper
Add Code

Joint Demosaicing and Super-Resolution (JDSR): Network Design and Perceptual Optimization

no code implementations • 8 Nov 2019 • Xuan Xu, Yanfang Ye, Xin Li

Image demosaicing and super-resolution are two important tasks in color imaging pipeline.

Demosaicking Generative Adversarial Network +3

Paper
Add Code

Sparse estimation via $\ell_q$ optimization method in high-dimensional linear regression

no code implementations • 12 Nov 2019 • Xin Li, Yaohua Hu, Chong Li, Xiaoqi Yang, Tianzi Jiang

In this paper, we discuss the statistical properties of the $\ell_q$ optimization methods $(0<q\leq 1)$, including the $\ell_q$ minimization method and the $\ell_q$ regularization method, for estimating a sparse parameter from noisy observations in high-dimensional linear regression with either a deterministic or random design.

regression Vocal Bursts Intensity Prediction

Paper
Add Code

Relevance-Promoting Language Model for Short-Text Conversation

no code implementations • 26 Nov 2019 • Xin Li, Piji Li, Wei Bi, Xiaojiang Liu, Wai Lam

In this paper, we propose to formulate the STC task as a language modeling problem and tailor-make a training strategy to adapt a language model for response generation.

Language Modelling Response Generation +1

Paper
Add Code

Digital Twin: Acquiring High-Fidelity 3D Avatar from a Single Image

no code implementations • 7 Dec 2019 • Ruizhe Wang, Chih-Fan Chen, Hao Peng, Xudong Liu, Oliver Liu, Xin Li

We present an approach to generate high fidelity 3D face avatar with a high-resolution UV texture map from a single image.

Face Model Vocal Bursts Intensity Prediction

Paper
Add Code

Point2Node: Correlation Learning of Dynamic-Node for Point Cloud Feature Modeling

no code implementations • 23 Dec 2019 • Wenkai Han, Chenglu Wen, Cheng Wang, Xin Li, Qing Li

Point2Node can dynamically explore correlation among all graph nodes from different levels, and adaptively aggregate the learned features.

Paper
Add Code

Hybrid Graph Neural Networks for Crowd Counting

no code implementations • 31 Jan 2020 • Ao Luo, Fan Yang, Xin Li, Dong Nie, Zhicheng Jiao, Shangchen Zhou, Hong Cheng

In this paper, we present a novel network structure called Hybrid Graph Neural Network (HyGnn) which targets to relieve the problem by interweaving the multi-scale features for crowd density as well as its auxiliary task (localization) together and performing joint reasoning over a graph.

Crowd Counting

Paper
Add Code

Improve SGD Training via Aligning Mini-batches

no code implementations • 23 Feb 2020 • Xiangrui Li, Deng Pan, Xin Li, Dongxiao Zhu

In each iteration of SGD, a mini-batch from the training data is sampled and the true gradient of the loss function is estimated as the noisy gradient calculated on this mini-batch.

Paper
Add Code

Contextual-Bandit Based Personalized Recommendation with Time-Varying User Interests

no code implementations • 29 Feb 2020 • Xiao Xu, Fang Dong, Yanghua Li, Shaojian He, Xin Li

A contextual bandit problem is studied in a highly non-stationary environment, which is ubiquitous in various recommender systems due to the time-varying interests of users.

Recommendation Systems

Paper
Add Code

Towards Evaluating the Robustness of Chinese BERT Classifiers

no code implementations • 7 Apr 2020 • Boxin Wang, Boyuan Pan, Xin Li, Bo Li

Recent advances in large-scale language representation models such as BERT have improved the state-of-the-art performances in many NLP tasks.

Paper
Add Code

Leveraging Planar Regularities for Point Line Visual-Inertial Odometry

no code implementations • 16 Apr 2020 • Xin Li, Yijia He, Jinlong Lin, Xiao Liu

To improve the accuracy of 3D mesh generation and localization, we propose a tightly-coupled monocular VIO system, PLP-VIO, which exploits point features and line features as well as plane regularities.

Paper
Add Code

Context-aware Helpfulness Prediction for Online Product Reviews

no code implementations • 27 Apr 2020 • Iyiola E. Olatunji, Xin Li, Wai Lam

In this paper, we propose a neural deep learning model that predicts the helpfulness score of a review.

Paper
Add Code

3D Face Anti-spoofing with Factorized Bilinear Coding

no code implementations • 12 May 2020 • Shan Jia, Xin Li, Chuanbo Hu, Guodong Guo, Zhengquan Xu

We have witnessed rapid advances in both face presentation attack models and presentation attack detection (PAD) in recent years.

Face Anti-Spoofing Face Presentation Attack Detection +1

Paper
Add Code

Multi-scale Grouped Dense Network for VVC Intra Coding

no code implementations • 16 May 2020 • Xin Li, Simeng Sun, Zhizheng Zhang, Zhibo Chen

Versatile Video Coding (H. 266/VVC) standard achieves better image quality when keeping the same bits than any other conventional image codec, such as BPG, JPEG, and etc.

Generative Adversarial Network

Paper
Add Code

Robust Trajectory Forecasting for Multiple Intelligent Agents in Dynamic Scene

no code implementations • 27 May 2020 • Yanliang Zhu, Dongchun Ren, Mingyu Fan, Deheng Qian, Xin Li, Huaxia Xia

Trajectory forecasting, or trajectory prediction, of multiple interacting agents in dynamic scenes, is an important problem for many applications, such as robotic systems and autonomous driving.

Autonomous Driving Trajectory Forecasting

Paper
Add Code

Defending against adversarial attacks on medical imaging AI system, classification or detection?

1 code implementation • 24 Jun 2020 • Xin Li, Deng Pan, Dongxiao Zhu

Medical imaging AI systems such as disease classification and segmentation are increasingly inspired and transformed from computer vision based AI systems.

Adversarial Defense General Classification

Paper
Code

Explainable Recommendation via Interpretable Feature Mapping and Evaluation of Explainability

no code implementations • 12 Jul 2020 • Deng Pan, Xiangrui Li, Xin Li, Dongxiao Zhu

Latent factor collaborative filtering (CF) has been a widely used technique for recommender system by learning the semantic representations of users and items.

Collaborative Filtering Explainable Recommendation +1

Paper
Add Code

Learning Disentangled Feature Representation for Hybrid-distorted Image Restoration

no code implementations • ECCV 2020 • Xin Li, Xin Jin, Jianxin Lin, Tao Yu, Sen Liu, Yaojun Wu, Wei Zhou, Zhibo Chen

Hybrid-distorted image restoration (HD-IR) is dedicated to restore real distorted image that is degraded by multiple distortions.

Disentanglement Image Restoration

Paper
Add Code

Predicting heave and surge motions of a semi-submersible with neural networks

no code implementations • 31 Jul 2020 • Xiaoxian Guo, Xiantao Zhang, Xinliang Tian, Xin Li, Wenyue Lu

With the help of measured waves, the prediction extended 46. 5 s into future with an average accuracy close to 90%.

BIG-bench Machine Learning Motion Compensation +1

Paper
Add Code

Multi-node Bert-pretraining: Cost-efficient Approach

no code implementations • 1 Aug 2020 • Jiahuang Lin, Xin Li, Gennady Pekhimenko

As a result, to train these models within a reasonable time, machine learning (ML) programmers often require advanced hardware setups such as the premium GPU-enabled NVIDIA DGX workstations or specialized accelerators such as Google's TPU Pods.

Paper
Add Code

LIRA: Lifelong Image Restoration from Unknown Blended Distortions

no code implementations • ECCV 2020 • Jianzhao Liu, Jianxin Lin, Xin Li, Wei Zhou, Sen Liu, Zhibo Chen

Most existing image restoration networks are designed in a disposable way and catastrophically forget previously learned distortions when trained on a new distortion removal task.

Image Restoration SSIM

Paper
Add Code

Learning to Collaborate in Multi-Module Recommendation via Multi-Agent Reinforcement Learning without Communication

no code implementations • 21 Aug 2020 • Xu He, Bo An, Yanghua Li, Haikai Chen, Rundong Wang, Xinrun Wang, Runsheng Yu, Xin Li, Zhirong Wang

Thus, the global policy of the whole page could be sub-optimal.

Multi-agent Reinforcement Learning Reinforcement Learning (RL)

Paper
Add Code

Contextual User Browsing Bandits for Large-Scale Online Mobile Recommendation

no code implementations • 21 Aug 2020 • Xu He, Bo An, Yanghua Li, Haikai Chen, Qingyu Guo, Xin Li, Zhirong Wang

First, since we concern the reward of a set of recommended items, we model the online recommendation as a contextual combinatorial bandit problem and define the reward of a recommended set.

Paper
Add Code

Detection of Genuine and Posed Facial Expressions of Emotion: A Review

no code implementations • 26 Aug 2020 • Shan Jia, Shuo Wang, Chuanbo Hu, Paula Webster, Xin Li

Facial expressions of emotion play an important role in human social interactions.

Paper
Add Code

Training Recurrent Neural Networks Online by Learning Explicit State Variables

no code implementations • ICLR 2020 • Somjit Nath, Vincent Liu, Alan Chan, Xin Li, Adam White, Martha White

Recurrent neural networks (RNNs) allow an agent to construct a state-representation from a stream of experience, which is essential in partially observable problems.

Paper
Add Code

Efficiency in Real-time Webcam Gaze Tracking

no code implementations • 2 Sep 2020 • Amogh Gudi, Xin Li, Jan van Gemert

To do so, we evaluate the computational speed/accuracy trade-off for the CNN and the calibration effort/accuracy trade-off for screen calibration.

Computational Efficiency regression

Paper
Add Code

Sparse-to-Dense Depth Completion Revisited: Sampling Strategy and Graph Construction

no code implementations • ECCV 2020 • Xin Xiong, Haipeng Xiong, Ke Xian, Chen Zhao, Zhiguo Cao, Xin Li

Depth completion is a widely studied problem of predicting a dense depth map from a sparse set of measurements and a single RGB image.

Depth Completion graph construction

Paper
Add Code

Accurate and Lightweight Image Super-Resolution with Model-Guided Deep Unfolding Network

no code implementations • 14 Sep 2020 • Qian Ning, Weisheng Dong, Guangming Shi, Leida Li, Xin Li

Deep neural networks (DNNs) based methods have achieved great success in single image super-resolution (SISR).

Denoising Image Super-Resolution

Paper
Add Code

AIM 2020 Challenge on Video Extreme Super-Resolution: Methods and Results

no code implementations • 14 Sep 2020 • Dario Fuoli, Zhiwu Huang, Shuhang Gu, Radu Timofte, Arnau Raventos, Aryan Esfandiari, Salah Karout, Xuan Xu, Xin Li, Xin Xiong, Jinge Wang, Pablo Navarrete Michelini, Wen-Hao Zhang, Dongyang Zhang, Hanwei Zhu, Dan Xia, Haoyu Chen, Jinjin Gu, Zhi Zhang, Tongtong Zhao, Shanshan Zhao, Kazutoshi Akita, Norimichi Ukita, Hrishikesh P. S, Densen Puthussery, Jiji C. V

Missing information can be restored well in this region, especially in HR videos, where the high-frequency content mostly consists of texture details.

Image Super-Resolution SSIM +1

Paper
Add Code

Enhancing Dialogue Generation via Multi-Level Contrastive Learning

no code implementations • 19 Sep 2020 • Xin Li, Piji Li, Yan Wang, Xiaojiang Liu, Wai Lam

Most of the existing works for dialogue generation are data-driven models trained directly on corpora crawled from websites.

Contrastive Learning Dialogue Generation +1

Paper
Add Code

AIM 2020 Challenge on Real Image Super-Resolution: Methods and Results

no code implementations • 25 Sep 2020 • Pengxu Wei, Hannan Lu, Radu Timofte, Liang Lin, WangMeng Zuo, Zhihong Pan, Baopu Li, Teng Xi, Yanwen Fan, Gang Zhang, Jingtuo Liu, Junyu Han, Errui Ding, Tangxin Xie, Liang Cao, Yan Zou, Yi Shen, Jialiang Zhang, Yu Jia, Kaihua Cheng, Chenhuan Wu, Yue Lin, Cen Liu, Yunbo Peng, Xueyi Zou, Zhipeng Luo, Yuehan Yao, Zhenyu Xu, Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Tongtong Zhao, Shanshan Zhao, Yoseob Han, Byung-Hoon Kim, JaeHyun Baek, HaoNing Wu, Dejia Xu, Bo Zhou, Wei Guan, Xiaobo Li, Chen Ye, Hao Li, Yukai Shi, Zhijing Yang, Xiaojun Yang, Haoyu Zhong, Xin Li, Xin Jin, Yaojun Wu, Yingxue Pang, Sen Liu, Zhi-Song Liu, Li-Wen Wang, Chu-Tak Li, Marie-Paule Cani, Wan-Chi Siu, Yuanbo Zhou, Rao Muhammad Umer, Christian Micheloni, Xiaofeng Cong, Rajat Gupta, Keon-Hee Ahn, Jun-Hyuk Kim, Jun-Ho Choi, Jong-Seok Lee, Feras Almasri, Thomas Vandamme, Olivier Debeir

This paper introduces the real image Super-Resolution (SR) challenge that was part of the Advances in Image Manipulation (AIM) workshop, held in conjunction with ECCV 2020.

Image Manipulation Image Super-Resolution +1

Paper
Add Code

FAN: Frequency Aggregation Network for Real Image Super-resolution

no code implementations • 30 Sep 2020 • Yingxue Pang, Xin Li, Xin Jin, Yaojun Wu, Jianzhao Liu, Sen Liu, Zhibo Chen

Specifically, we extract different frequencies of the LR image and pass them to a channel attention-grouped residual dense network (CA-GRDB) individually to output corresponding feature maps.

Image Super-Resolution SSIM

Paper
Add Code

Deformable Kernel Convolutional Network for Video Extreme Super-Resolution

no code implementations • 1 Oct 2020 • Xuan Xu, Xin Xiong, Jinge Wang, Xin Li

Thanks to newly designed Deformable Kernel Convolution Alignment (DKC_Align) and Deformable Kernel Spatial Attention (DKSA) modules, DKSAN can better exploit both spatial and temporal redundancies to facilitate the information propagation across different layers.

Video Super-Resolution

Paper
Add Code

Anion charge-lattice volume dependent Li ion migration in compounds with the face-centered cubic anion frameworks

no code implementations • 25 Oct 2019 • Zhenming Xu, Xin Chen, Ronghan Chen, Xin Li, Hong Zhu

In this work, the face-centered cubic (fcc) anion frameworks were creatively constructed to study the effects of anion charge and lattice volume on the stability of lithium ion occupation and lithium ion migration.

Applied Physics

Paper
Add Code

The similarity metric

no code implementations • 20 Nov 2001 • Ming Li, Xin Chen, Xin Li, Bin Ma, Paul Vitanyi

A new class of distances appropriate for measuring similarity relations between sequences, say one type of similarity per distance, is studied.

Paper
Add Code

Limitations of Autoregressive Models and Their Alternatives

no code implementations • NAACL 2021 • Chu-Cheng Lin, Aaron Jaech, Xin Li, Matthew R. Gormley, Jason Eisner

Standard autoregressive language models perform only polynomial-time computation to compute the probability of the next symbol.

Language Modelling

Paper
Add Code

Unsupervised Cross-lingual Adaptation for Sequence Tagging and Beyond

no code implementations • 23 Oct 2020 • Xin Li, Lidong Bing, Wenxuan Zhang, Zheng Li, Wai Lam

Cross-lingual adaptation with multilingual pre-trained language models (mPTLMs) mainly consists of two lines of works: zero-shot approach and translation-based approach, which have been studied extensively on the sequence-level tasks.

Cross-Lingual Transfer Translation

Paper
Add Code

Magnetoelectric coupling and decoupling in multiferroic hexagonal YbFeO3 thin films

no code implementations • 13 Nov 2020 • Yu Yun, Xin Li, Arashdeep Singh Thind, Yuewei Yin, Hao liu, Qiang Li, Wenbin Wang, Alpha T. N Diaye, Corbyn Mellinger, Xuanyuan Jiang, Rohan Mishra, Xiaoshan Xu

The coupling between ferroelectric and magnetic orders in multiferroic materials and the nature of magnetoelectric (ME) effects are enduring experimental challenges.

Materials Science Other Condensed Matter

Paper
Add Code

A New Action Recognition Framework for Video Highlights Summarization in Sporting Events

no code implementations • 1 Dec 2020 • Cheng Yan, Xin Li, Guoqiang Li

To date, machine learning for human action recognition in video has been widely implemented in sports activities.

Action Recognition Temporal Action Localization +1

Paper
Add Code

Impact of Temperature and Relative Humidity on the Transmission of COVID-19: A Modeling Study in China and the United States

no code implementations • 9 Mar 2020 • Jingyuan Wang, Ke Tang, Kai Feng, Xin Li, Weifeng Lv, Kun Chen, Fei Wang

Primary outcome measures: Regression analysis of the impact of temperature and relative humidity on the effective reproductive number (R value).

regression

Paper
Add Code

Statistical Issues and Recommendations for Clinical Trials Conducted During the COVID-19 Pandemic

no code implementations • 21 May 2020 • R. Daniel Meyer, Bohdana Ratitch, Marcel Wolbers, Olga Marchenko, Hui Quan, Daniel Li, Chrissie Fletcher, Xin Li, David Wright, Yue Shentu, Stefan Englert, Wei Shen, Jyotirmoy Dey, Thomas Liu, Ming Zhou, Norman Bohidar, Peng-Liang Zhao, Michael Hale

The COVID-19 pandemic has had and continues to have major impacts on planned and ongoing clinical trials.

Paper
Add Code

Learning Omni-frequency Region-adaptive Representations for Real Image Super-Resolution

no code implementations • 11 Dec 2020 • Xin Li, Xin Jin, Tao Yu, Yingxue Pang, Simeng Sun, Zhizheng Zhang, Zhibo Chen

Traditional single image super-resolution (SISR) methods that focus on solving single and uniform degradation (i. e., bicubic down-sampling), typically suffer from poor performance when applied into real-world low-resolution (LR) images due to the complicated realistic degradations.

Image Super-Resolution

Paper
Add Code

Learned Block-based Hybrid Image Compression

no code implementations • 17 Dec 2020 • Yaojun Wu, Xin Li, Zhizheng Zhang, Xin Jin, Zhibo Chen

Recent works on learned image compression perform encoding and decoding processes in a full-resolution manner, resulting in two problems when deployed for practical applications.

Blocking Image Compression +2

Paper
Add Code

Automatic Opioid User Detection From Twitter: Transductive Ensemble Built On Different Meta-graph Based Similarities Over Heterogeneous Information Network

no code implementations • 1 Jul 2018 • Yujie Fan, Yiming Zhang, Yanfang Y e∗, Xin Li

Opioid (e. g., heroin and morphine) addiction has become one of the largest and deadliest epidemics in the United States.

Paper
Add Code

Learning Inter- and Intraframe Representations for Non-Lambertian Photometric Stereo

no code implementations • 26 Dec 2020 • Yanlong Cao, Binjie Ding, Zewei He, Jiangxin Yang, Jingxi Chen, Yanpeng Cao, Xin Li

Photometric stereo provides an important method for high-fidelity 3D reconstruction based on multiple intensity images captured under different illumination directions.

3D Reconstruction

Paper
Add Code

Understanding Team Collaboration in Artificial Intelligence from the perspective of Geographic Distance

no code implementations • 25 Dec 2020 • Xuli Tang, Xin Li, Ying Ding, Feicheng Ma

This paper analyzes team collaboration in the field of Artificial Intelligence (AI) from the perspective of geographic distance.

Paper
Add Code

Recent Advances of Generic Object Detection with Deep Learning: A Review

no code implementations • 19 Dec 2020 • Xin Li, YingYing Li, Shushu Li

Object detection is an important and challenging problem in computer vision.

Action Recognition Data Augmentation +6

Paper
Add Code

Graph-based Facial Affect Analysis: A Review

no code implementations • 29 Mar 2021 • Yang Liu, Xingming Zhang, Yante Li, Jinzhao Zhou, Xin Li, Guoying Zhao

As far as we know, this is the first survey of graph-based FAA methods.

graph construction Relational Reasoning

Paper
Add Code

Searching Efficient Model-guided Deep Network for Image Denoising

no code implementations • 6 Apr 2021 • Qian Ning, Weisheng Dong, Xin Li, Jinjian Wu, Leida Li, Guangming Shi

Similar to the success of NAS in high-level vision tasks, it is possible to find a memory and computationally efficient solution via NAS with highly competent denoising performance.

Image Denoising Neural Architecture Search

Paper
Add Code

Self-Supervised Tracking via Target-Aware Data Synthesis

no code implementations • 21 Jun 2021 • Xin Li, Wenjie Pei, YaoWei Wang, Zhenyu He, Huchuan Lu, Ming-Hsuan Yang

While deep-learning based tracking methods have achieved substantial progress, they entail large-scale and high-quality annotated data for sufficient training.

Representation Learning Self-Supervised Learning +1

Paper
Add Code

Self-Contrastive Learning with Hard Negative Sampling for Self-supervised Point Cloud Learning

no code implementations • 5 Jul 2021 • Bi'an Du, Xiang Gao, Wei Hu, Xin Li

Point clouds have attracted increasing attention.

Contrastive Learning Point Cloud Segmentation +3

Paper
Add Code

Metasurface-Enabled On-Chip Multiplexed Diffractive Neural Networks in the Visible

no code implementations • 13 Jul 2021 • Xuhao Luo, Yueqiang Hu, Xin Li, Xiangnian Ou, Jiajie Lai, Na Liu, Huigao Duan

Replacing electrons with photons is a compelling route towards light-speed, highly parallel, and low-power artificial intelligence computing.

Autonomous Driving

Paper
Add Code

Human-In-The-Loop Document Layout Analysis

no code implementations • 4 Aug 2021 • Xingjiao Wu, Tianlong Ma, Xin Li, Qin Chen, Liang He

The HITL select key samples by using confidence.

Document Layout Analysis Semantic Segmentation

Paper
Add Code

Identifying Illicit Drug Dealers on Instagram with Large-scale Multimodal Data Fusion

no code implementations • 18 Aug 2021 • Chuanbo Hu, Minglei Yin, Bin Liu, Xin Li, Yanfang Ye

Unlike existing methods that focus on posting-based detection, we propose to tackle the problem of illicit drug dealer identification by constructing a large-scale multimodal dataset named Identifying Drug Dealers on Instagram (IDDIG).

Community Detection

Paper
Add Code

Detection of Illicit Drug Trafficking Events on Instagram: A Deep Multimodal Multilabel Learning Approach

no code implementations • 19 Aug 2021 • Chuanbo Hu, Minglei Yin, Bin Liu, Xin Li, Yanfang Ye

Accordingly, accurate detection of illicit drug trafficking events (IDTEs) from social media has become even more challenging.

Marketing

Paper
Add Code

Characterizing interdisciplinarity in drug research: a translational science perspective

no code implementations • 4 Sep 2021 • Xin Li, Xuli Tang

Despite the significant advances in life science, it still takes decades to translate a basic drug discovery into a cure for human disease.

Drug Discovery

Paper
Add Code

Uncertainty-Driven Loss for Single Image Super-Resolution

no code implementations • NeurIPS 2021 • Qian Ning, Weisheng Dong, Xin Li, Jinjian Wu, Guangming Shi

Specifically, we introduce variance estimation characterizing the uncertainty on a pixel-by-pixel basis into SISR solutions so the targeted pixels in a high-resolution image (mean) and their corresponding uncertainty (variance) can be learned simultaneously.

Image Super-Resolution

Paper
Add Code

Confounder Identification-free Causal Visual Feature Learning

no code implementations • 26 Nov 2021 • Xin Li, Zhizheng Zhang, Guoqiang Wei, Cuiling Lan, Wenjun Zeng, Xin Jin, Zhibo Chen

In this paper, we propose a novel Confounder Identification-free Causal Visual Feature Learning (CICF) method, which obviates the need for identifying confounders.

Domain Generalization Meta-Learning

Paper
Add Code

Neural Collaborative Graph Machines for Table Structure Recognition

no code implementations • CVPR 2022 • Hao liu, Xin Li, Bing Liu, Deqiang Jiang, Yinsong Liu, Bo Ren

We also show that the proposed NCGM can modulate collaborative pattern of different modalities conditioned on the context of intra-modality cues, which is vital for diversified table cases.

Ranked #6 on Table Recognition on PubTabNet

Table Recognition

Paper
Add Code

A Close Look at Few-shot Real Image Super-resolution from the Distortion Relation Perspective

no code implementations • 25 Nov 2021 • Xin Li, Xin Jin, Jun Fu, Xiaoyuan Yu, Bei Tong, Zhibo Chen

Under this brand-new scenario, we propose Distortion Relation guided Transfer Learning (DRTL) for the few-shot RealSR by transferring the rich restoration knowledge from auxiliary distortions (i. e., synthetic distortions) to the target RealSR under the guidance of distortion relation.

Image Restoration Image Super-Resolution +4

Paper
Add Code

Simple Contrastive Representation Adversarial Learning for NLP Tasks

no code implementations • 26 Nov 2021 • Deshui Miao, JiaQi Zhang, WenBo Xie, Jian Song, Xin Li, Lijuan Jia, Ning Guo

In this paper, adversarial training is performed to generate challenging and harder learning adversarial examples over the embedding space of NLP as learning pairs.

Contrastive Learning Natural Language Understanding +4

Paper
Add Code

Document Layout Analysis with Aesthetic-Guided Image Augmentation

no code implementations • 27 Nov 2021 • Tianlong Ma, Xingjiao Wu, Xin Li, Xiangcheng Du, Zhao Zhou, Liang Xue, Cheng Jin

To measure the proposed image layer modeling method, we propose a manually-labeled non-Manhattan layout fine-grained segmentation dataset named FPD.

Document Layout Analysis document understanding +2

Paper
Add Code

Interactive Model with Structural Loss for Language-based Abductive Reasoning

no code implementations • 1 Dec 2021 • Linhao Li, Ming Xu, Yongfeng Dong, Xin Li, Ao Wang

Therefore, we propose to group instead of ranking the hypotheses and design a structural loss called ``joint softmax focal loss'' in this paper.

Language Modelling Natural Language Inference

Paper
Add Code

Internationalizing AI: Evolution and Impact of Distance Factors

no code implementations • 10 Nov 2021 • Xuli Tang, Xin Li, Feicheng Ma

A framework including 13 indicators to quantify the distance factors between countries from 5 perspectives (i. e., geographic distance, economic distance, cultural distance, academic distance, and industrial distance) is proposed.

Descriptive

Paper
Add Code

SelectAugment: Hierarchical Deterministic Sample Selection for Data Augmentation

no code implementations • 6 Dec 2021 • Shiqi Lin, Zhizheng Zhang, Xin Li, Wenjun Zeng, Zhibo Chen

Data augmentation (DA) has been widely investigated to facilitate model optimization in many tasks.

Data Augmentation Fine-Grained Image Recognition +3

Paper
Add Code

Robust Depth Completion with Uncertainty-Driven Loss Functions

no code implementations • 15 Dec 2021 • Yufan Zhu, Weisheng Dong, Leida Li, Jinjian Wu, Xin Li, Guangming Shi

In this work, we introduce uncertainty-driven loss functions to improve the robustness of depth completion and handle the uncertainty in depth completion.

Depth Completion

Paper
Add Code

A Survey on Applications of Digital Human Avatars toward Virtual Co-presence

no code implementations • 11 Jan 2022 • Matthew Korban, Xin Li

This paper investigates different approaches to build and use digital human avatars toward interactive Virtual Co-presence (VCP) environments.

Paper
Add Code

Machine learning prediction for mean motion resonance behaviour -- The planar case

no code implementations • 18 Jan 2022 • Xin Li, Jian Li, Zhihong Jeff Xia, Nikolaos Georgakarakos

Most recently, machine learning has been used to study the dynamics of integrable Hamiltonian systems and the chaotic 3-body problem.

BIG-bench Machine Learning Numerical Integration

Paper
Add Code

Cross-Domain Document Layout Analysis via Unsupervised Document Style Guide

no code implementations • 24 Jan 2022 • Xingjiao Wu, Luwei Xiao, Xiangcheng Du, Yingbin Zheng, Xin Li, Tianlong Ma, Liang He

Our framework is an unsupervised document layout analysis framework.

Contrastive Learning Document Layout Analysis

Paper
Add Code

A multi-domain virtual network embedding algorithm with delay prediction

no code implementations • 3 Feb 2022 • Peiying Zhang, Xue Pang, Yongjing Ni, Haipeng Yao, Xin Li

Virtual network embedding (VNE) is an crucial part of network virtualization (NV), which aims to map the virtual networks (VNs) to a shared substrate network (SN).

Network Embedding

Paper
Add Code

Multi-modal Sensor Fusion for Auto Driving Perception: A Survey

no code implementations • 6 Feb 2022 • Keli Huang, Botian Shi, Xiang Li, Xin Li, Siyuan Huang, Yikang Li

Multi-modal fusion is a fundamental task for the perception of an autonomous driving system, which has recently intrigued many researchers.

Autonomous Driving object-detection +3

Paper
Add Code

Low-Rank Phase Retrieval with Structured Tensor Models

no code implementations • 15 Feb 2022 • Soo Min Kwon, Xin Li, Anand D. Sarwate

We study the low-rank phase retrieval problem, where the objective is to recover a sequence of signals (typically images) given the magnitude of linear measurements of those signals.

Retrieval

Paper
Add Code

Towards Bidirectional Arbitrary Image Rescaling: Joint Optimization and Cycle Idempotence

no code implementations • CVPR 2022 • Zhihong Pan, Baopu Li, Dongliang He, Mingde Yao, Wenhao Wu, Tianwei Lin, Xin Li, Errui Ding

Deep learning based single image super-resolution models have been widely studied and superb results are achieved in upscaling low-resolution images with fixed scale factor and downscaling degradation kernel.

Image Super-Resolution

Paper
Add Code

Aggregate effects of advertising decisions: a complex systems look at search engine advertising via an experimental study

no code implementations • 4 Mar 2022 • Yanwu Yang, Xin Li, Bernard J. Jansen, Daniel Zeng

Originality: This is one of the first research works to explore collective group decisions and resulting phenomena in the complex context of search engine advertising via developing and validating a simulation framework that supports assessments of various advertising strategies and estimations of the impact of mechanisms on the search market.

Paper
Add Code

Context-aware Visual Tracking with Joint Meta-updating

no code implementations • 4 Apr 2022 • Qiuhong Shen, Xin Li, Fanyang Meng, Yongsheng Liang

These deep trackers usually do not perform online update or update single sub-branch of the tracking model, for which they cannot adapt to the appearance variation of objects.

Meta-Learning Visual Object Tracking +1

Paper
Add Code

The Devil is in the Frequency: Geminated Gestalt Autoencoder for Self-Supervised Visual Pre-Training

no code implementations • 18 Apr 2022 • Hao liu, Xinghua Jiang, Xin Li, Antai Guo, Deqiang Jiang, Bo Ren

The self-supervised Masked Image Modeling (MIM) schema, following "mask-and-reconstruct" pipeline of recovering contents from masked image, has recently captured the increasing interest in the multimedia community, owing to the excellent ability of learning visual representation from unlabeled data.

Paper
Add Code

Global Mapping of Gene/Protein Interactions in PubMed Abstracts: A Framework and an Experiment with P53 Interactions

no code implementations • 22 Apr 2022 • Xin Li, Hsinchun Chen, Zan Huang, Hua Su, Jesse D. Martinez

In this paper, we propose a comprehensive framework for constructing and analyzing large-scale gene functional networks based on the gene/protein interactions extracted from biomedical literature repositories using text mining tools.

Paper
Add Code

Gene Function Prediction with Gene Interaction Networks: A Context Graph Kernel Approach

no code implementations • 22 Apr 2022 • Xin Li, Hsinchun Chen, Jiexun Li, Zhu Zhang

Predicting gene functions is a challenge for biologists in the post genomic era.

Paper
Add Code

Relational Representation Learning in Visually-Rich Documents

no code implementations • 5 May 2022 • Xin Li, Yan Zheng, Yiqing Hu, Haoyu Cao, Yunfei Wu, Deqiang Jiang, Yinsong Liu, Bo Ren

To deal with the unpredictable definition of relations, we propose a novel contrastive learning task named Relational Consistency Modeling (RCM), which harnesses the fact that existing relations should be consistent in differently augmented positive views.

Contrastive Learning Key Information Extraction +3

Paper
Add Code

Multi-Object Tracking Meets Moving UAV

no code implementations • CVPR 2022 • Shuai Liu, Xin Li, Huchuan Lu, You He

Multi-object tracking in unmanned aerial vehicle (UAV) videos is an important vision task and can be applied in a wide range of applications.

Multi-Object Tracking Object

Paper
Add Code

Accurate Scoliosis Vertebral Landmark Localization on X-ray Images via Shape-constrained Multi-stage Cascaded CNNs

no code implementations • 5 Jun 2022 • Zhiwei Wang, Jinxin Lv, Yunqiao Yang, Yuanhuai Liang, Yi Lin, Qiang Li, Xin Li, Xin Yang

Vertebral landmark localization is a crucial step for variant spine-related clinical applications, which requires detecting the corner points of 17 vertebrae.

Paper
Add Code

Formation Tracking for a Multi-Auv System Based on an Adaptive Sliding Mode Method in the Water Flow Environment

no code implementations • 9 Jun 2022 • Xin Li, Daqi Zhu, Bing Sun, Qi Chen, Wenyang Gan, Zhigang Li

At last, a robust sliding mode controller with continuous model predictive control strategy for the multi-AUV system is developed to achieve leader-follower formation tracking under the presence of bounded flow disturbances, and simulations are implemented to confirm the effectiveness of the proposed method.

Model Predictive Control

Paper
Add Code

RTN: Reinforced Transformer Network for Coronary CT Angiography Vessel-level Image Quality Assessment

no code implementations • 13 Jul 2022 • Yiting Lu, Jun Fu, Xin Li, Wei Zhou, Sen Liu, Xinxin Zhang, Congfu Jia, Ying Liu, Zhibo Chen

Therefore, we propose a Progressive Reinforcement learning based Instance Discarding module (termed as PRID) to progressively remove quality-irrelevant/negative instances for CCTA VIQA.

Image Quality Assessment Multiple Instance Learning

Paper
Add Code

Stroke-Based Autoencoders: Self-Supervised Learners for Efficient Zero-Shot Chinese Character Recognition

no code implementations • 17 Jul 2022 • Zongze Chen, Wenxia Yang, Xin Li

Following its canonical writing order, we first represent a Chinese character as a series of stroke images with a fixed writing order, and then our SAE model is trained to reconstruct this stroke image sequence.

Word Embeddings Zero-Shot Learning

Paper
Add Code

Source-free Unsupervised Domain Adaptation for Blind Image Quality Assessment

no code implementations • 17 Jul 2022 • Jianzhao Liu, Xin Li, Shukun An, Zhibo Chen

Thanks to the development of unsupervised domain adaptation (UDA), some works attempt to transfer the knowledge from a label-sufficient source domain to a label-free target domain under domain shift with UDA.

Blind Image Quality Assessment Unsupervised Domain Adaptation

Paper
Add Code

Point Cloud Attacks in Graph Spectral Domain: When 3D Geometry Meets Graph Signal Processing

no code implementations • 27 Jul 2022 • Daizong Liu, Wei Hu, Xin Li

Instead, we propose point cloud attacks from a new perspective -- the graph spectral domain attack, aiming to perturb graph transform coefficients in the spectral domain that corresponds to varying certain geometric structure.

Paper
Add Code

StyleAM: Perception-Oriented Unsupervised Domain Adaption for Non-reference Image Quality Assessment

no code implementations • 29 Jul 2022 • Yiting Lu, Xin Li, Jianzhao Liu, Zhibo Chen

Specifically, we find a more compact and reliable space i. e., feature style space for perception-oriented UDA based on an interesting/amazing observation, that the feature style (i. e., the mean and variance) of the deep layer in DNNs is exactly associated with the quality score in NR-IQA.

Image Quality Assessment NR-IQA +1

Paper
Add Code

Learned Lossless JPEG Transcoding via Joint Lossy and Residual Compression

no code implementations • 24 Aug 2022 • Xiaoshuai Fan, Xin Li, Zhibo Chen

Our proposed transcoding architecture shows significant superiority in the compression of JPEG images thanks to the collaboration of learned lossy transform coding and residual entropy coding.

Image Compression

Paper
Add Code

Hierarchical Reinforcement Learning Based Video Semantic Coding for Segmentation

no code implementations • 24 Aug 2022 • Guangqi Xie, Xin Li, Shiqi Lin, Li Zhang, Kai Zhang, Yue Li, Zhibo Chen

In this paper, we take a step forward to video semantic compression and propose the Hierarchical Reinforcement Learning based task-driven Video Semantic Coding, named as HRLVSC.

Hierarchical Reinforcement Learning reinforcement-learning +3

Paper
Add Code

Large-step neural network for learning the symplectic evolution from partitioned data

no code implementations • 30 Aug 2022 • Xin Li, Jian Li, Zhihong Jeff Xia, Nikolaos Georgakarakos

Based on Chen & Tao (2021), the symplectic mapping is represented by a generating function.

Time Series Time Series Analysis

Paper
Add Code

Saliency Guided Adversarial Training for Learning Generalizable Features with Applications to Medical Imaging Classification System

no code implementations • 9 Sep 2022 • Xin Li, Yao Qiang, Chengyin Li, Sijia Liu, Dongxiao Zhu

We hypothesize that adversarial training can eliminate shortcut features whereas saliency guided training can filter out non-relevant features; both are nuisance features accounting for the performance degradation on OOD test sets.

Paper
Add Code

Uncertainty Aware Multitask Pyramid Vision Transformer For UAV-Based Object Re-Identification

no code implementations • 19 Sep 2022 • Syeda Nyma Ferdous, Xin Li, Siwei Lyu

Learning a robust and discriminative feature representation is a crucial challenge for object ReID.

Object

Paper
Add Code

How Image Generation Helps Visible-to-Infrared Person Re-Identification?

no code implementations • 4 Oct 2022 • Honghu Pan, Yongyong Chen, Yunqi He, Xin Li, Zhenyu He

To this end, we propose Flow2Flow, a unified framework that could jointly achieve training sample expansion and cross-modality image generation for V2I person ReID.

Image Generation Person Re-Identification

Paper
Add Code

Toward an Over-parameterized Direct-Fit Model of Visual Perception

no code implementations • 7 Oct 2022 • Xin Li

In this paper, we revisit the problem of computational modeling of simple and complex cells for an over-parameterized and direct-fit model of visual perception.

Paper
Add Code

Predicting the clinical citation count of biomedical papers using multilayer perceptron neural network

no code implementations • 7 Sep 2022 • Xin Li, Xuli Tang, Qikai Cheng

We extracted ninety-one paper features from three dimensions as the input of the model, including twenty-one features in the paper dimension, thirty-five in the reference dimension, and thirty-five in the citing paper dimension.

Translation

Paper
Add Code

Cutting-Splicing data augmentation: A novel technology for medical image segmentation

no code implementations • 17 Oct 2022 • Lianting Hu, Huiying Liang, Jiajie Tang, Xin Li, Li Huang, Long Lu

Background: Medical images are more difficult to acquire and annotate than natural images, which results in data augmentation technologies often being used in medical image segmentation tasks.

Data Augmentation Image Segmentation +4

Paper
Add Code

Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection

no code implementations • 18 Oct 2022 • Xin Li, Botian Shi, Yuenan Hou, Xingjiao Wu, Tianlong Ma, Yikang Li, Liang He

To address these problems, we construct the homogeneous structure between the point cloud and images to avoid projective information loss by transforming the camera features into the LiDAR 3D space.

3D Object Detection Autonomous Driving +1

Paper
Add Code

Joint Rigid Motion Correction and Sparse-View CT via Self-Calibrating Neural Field

no code implementations • 23 Oct 2022 • Qing Wu, Xin Li, Hongjiang Wei, Jingyi Yu, Yuyao Zhang

NeRF-based SVCT methods represent the desired CT image as a continuous function of spatial coordinates and train a Multi-Layer Perceptron (MLP) to learn the function by minimizing loss on the SV sinogram.

Paper
Add Code

Development of a Hybrid Simulation and Experiment Test Platform for Dynamic Positioning Vessels

no code implementations • 23 Oct 2022 • Changjun Hu, Quan Shi, Xin Li, Xiaoxian Guo

The test platform can test the performance of DP system and determine the operational time window.

Paper
Add Code

Deep Learning-Based Channel Estimation for Double-RIS Aided Massive MIMO System

no code implementations • 22 Oct 2022 • Mengbing Liu, Xin Li, Boyu Ning, Chongwen Huang, Sumei Sun, Chau Yuen

Reconfigurable Intelligent Surface (RIS) is considered as an energy-efficient solution for future wireless communication networks due to its fast and low-cost configuration.

Paper
Add Code

Multi-view Representation Learning from Malware to Defend Against Adversarial Variants

no code implementations • 25 Oct 2022 • James Lee Hu, MohammadReza Ebrahimi, Weifeng Li, Xin Li, Hsinchun Chen

This provides an opportunity for the defenders (i. e., malware detectors) to detect the adversarial variants by utilizing more than one view of a malware file (e. g., source code view in addition to the binary view).

Adversarial Robustness MULTI-VIEW LEARNING +1

Paper
Add Code

Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance

no code implementations • 27 Oct 2022 • Yuanzhe Chen, Ming Tu, Tang Li, Xin Li, Qiuqiang Kong, Jiaxin Li, Zhichao Wang, Qiao Tian, Yuping Wang, Yuxuan Wang

In this paper, we propose to use intermediate bottleneck features (IBFs) to replace PPGs.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Discrete Factorial Representations as an Abstraction for Goal Conditioned Reinforcement Learning

no code implementations • 1 Nov 2022 • Riashat Islam, Hongyu Zang, Anirudh Goyal, Alex Lamb, Kenji Kawaguchi, Xin Li, Romain Laroche, Yoshua Bengio, Remi Tachet des Combes

Goal-conditioned reinforcement learning (RL) is a promising direction for training agents that are capable of solving multiple tasks and reach a diverse set of objectives.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

RRSR:Reciprocal Reference-based Image Super-Resolution with Progressive Feature Alignment and Selection

no code implementations • 8 Nov 2022 • Lin Zhang, Xin Li, Dongliang He, Fu Li, Yili Wang, Zhaoxiang Zhang

While previous state-of-the-art RefSR methods mainly focus on improving the efficacy and robustness of reference feature transfer, it is generally overlooked that a well reconstructed SR image should enable better SR reconstruction for its similar LR images when it is referred to as.

feature selection Image Super-Resolution

Paper
Add Code

Batch-based Model Registration for Fast 3D Sherd Reconstruction

no code implementations • ICCV 2023 • Jiepeng Wang, Congyi Zhang, Peng Wang, Xin Li, Peter J. Cobb, Christian Theobalt, Wenping Wang

In this work, we aim to develop a portable, high-throughput, and accurate reconstruction system for efficient digitization of fragments excavated in archaeological sites.

3D Reconstruction

Paper
Add Code

Transformation-Equivariant 3D Object Detection for Autonomous Driving

no code implementations • 22 Nov 2022 • Hai Wu, Chenglu Wen, Wei Li, Xin Li, Ruigang Yang, Cheng Wang

However, it is difficult to apply such networks to 3D object detection in autonomous driving due to its large computation cost and slow reasoning speed.

3D Object Detection Autonomous Driving +3

Paper
Add Code

Learning Compact Features via In-Training Representation Alignment

no code implementations • 23 Nov 2022 • Xin Li, Xiangrui Li, Deng Pan, Yao Qiang, Dongxiao Zhu

Deep neural networks (DNNs) for supervised learning can be viewed as a pipeline of the feature extractor (i. e., last hidden layer) and a linear classifier (i. e., output layer) that are trained jointly with stochastic gradient descent (SGD) on the loss function (e. g., cross-entropy).

Representation Learning

Paper
Add Code

AdaCM: Adaptive ColorMLP for Real-Time Universal Photo-realistic Style Transfer

no code implementations • 3 Dec 2022 • Tianwei Lin, Honglin Lin, Fu Li, Dongliang He, Wenhao Wu, Meiling Wang, Xin Li, Yong liu

Then, in \textbf{AdaCM}, we adopt a CNN encoder to adaptively predict all parameters for the ColorMLP conditioned on each input content and style image pair.

4k Style Transfer

Paper
Add Code

Coarse-to-Fine Contrastive Learning on Graphs

no code implementations • 13 Dec 2022 • Peiyao Zhao, Yuangang Pan, Xin Li, Xu Chen, Ivor W. Tsang, Lejian Liao

Inspired by the impressive success of contrastive learning (CL), a variety of graph augmentation strategies have been employed to learn node representations in a self-supervised manner.

Contrastive Learning Learning-To-Rank

Paper
Add Code

Representation Learning in Deep RL via Discrete Information Bottleneck

no code implementations • 28 Dec 2022 • Riashat Islam, Hongyu Zang, Manan Tomar, Aniket Didolkar, Md Mofijul Islam, Samin Yeasar Arnob, Tariq Iqbal, Xin Li, Anirudh Goyal, Nicolas Heess, Alex Lamb

Several self-supervised representation learning methods have been proposed for reinforcement learning (RL) with rich observations.

Offline RL Reinforcement Learning (RL) +1

Paper
Add Code

Joint Beamforming Design for Dual-Functional MIMO Radar and Communication Systems Guaranteeing Physical Layer Security

no code implementations • 1 Jan 2023 • Fuwang Dong, Wei Wang, Xin Li, Fan Liu, Sheng Chen, Lajos Hanzo

The dual-functional radar and communication (DFRC) technique constitutes a promising next-generation wireless solution, due to its benefits in terms of power consumption, physical hardware, and spectrum exploitation.

Paper
Add Code

Multi-Constraint Molecular Generation using Sparsely Labelled Training Data for Localized High-Concentration Electrolyte Diluent Screening

no code implementations • 12 Jan 2023 • Jonathan P. Mailoa, Xin Li, Jiezhong Qiu, Shengyu Zhang

Recently, machine learning methods have been used to propose molecules with desired properties, which is especially useful for exploring large chemical spaces efficiently.

Paper
Add Code

PointSmile: Point Self-supervised Learning via Curriculum Mutual Information

no code implementations • 30 Jan 2023 • Xin Li, Mingqiang Wei, Songcan Chen

From the perspective of how-and-what-to-learn, PointSmile is designed to imitate human curriculum learning, i. e., starting with an easy curriculum and gradually increasing the difficulty of that curriculum.

Data Augmentation Self-Supervised Learning

Paper
Add Code

Analysis of Biomass Sustainability Indicators from a Machine Learning Perspective

no code implementations • 2 Feb 2023 • Syeda Nyma Ferdous, Xin Li, Kamalakanta Sahoo, Richard Bergman

This study proposes a robust model for biomass sustainability prediction by analyzing sustainability indicators using machine learning models.

Ensemble Learning Management +1

Paper
Add Code

MorphGANFormer: Transformer-based Face Morphing and De-Morphing

no code implementations • 18 Feb 2023 • Na Zhang, Xudong Liu, Xin Li, Guo-Jun Qi

Semantic face image manipulation has received increasing attention in recent years.

Image Manipulation

Paper
Add Code

Toward a Geometric Theory of Manifold Untangling

no code implementations • 7 Mar 2023 • Xin Li, Shuo Wang

It has been hypothesized that the ventral stream processing for object recognition is based on a mechanism called cortically local subspace untangling.

Object Object Recognition

Paper
Add Code

Toward NeuroDM: Where Computational Neuroscience Meets Data Mining

no code implementations • 7 Mar 2023 • Xin Li, Bin Liu, Shuo Wang

At the intersection of computational neuroscience (CN) and data mining (DM), we advocate a holistic view toward their rich connections.

Paper
Add Code

Emotional Reaction Intensity Estimation Based on Multimodal Data

no code implementations • 16 Mar 2023 • Shangfei Wang, Jiaqiang Wu, Feiyi Zheng, Xin Li, XueWei Li, Suwen Wang, Yi Wu, Yanan Chang, Xiangyu Miao

In this paper, 1. better features are extracted with the SOTA pretrained models.

Paper
Add Code

Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation

no code implementations • 16 Mar 2023 • Hao liu, Xin Li, Mingming Gong, Bing Liu, Yunfei Wu, Deqiang Jiang, Yinsong Liu, Xing Sun

Recently, Table Structure Recognition (TSR) task, aiming at identifying table structure into machine readable formats, has received increasing interest in the community.

Paper
Add Code

MobileInst: Video Instance Segmentation on the Mobile

no code implementations • 30 Mar 2023 • Renhong Zhang, Tianheng Cheng, Shusheng Yang, Haoyi Jiang, Shuai Zhang, Jiancheng Lyu, Xin Li, Xiaowen Ying, Dashan Gao, Wenyu Liu, Xinggang Wang

To address those issues, we present MobileInst, a lightweight and mobile-friendly framework for video instance segmentation on mobile devices.

Instance Segmentation Segmentation +2

Paper
Add Code

Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA

no code implementations • 4 Apr 2023 • Yongxin Zhu, Zhen Liu, Yukang Liang, Xin Li, Hao liu, Changcun Bao, Linli Xu

Different to conventional STVQA models which take the linguistic semantics and visual semantics in scene text as two separate features, in this paper, we propose a paradigm of "Locate Then Generate" (LTG), which explicitly unifies this two semantics with the spatial bounding box as a bridge connecting them.

Answer Generation Language Modelling +3

Paper
Add Code

ALR-GAN: Adaptive Layout Refinement for Text-to-Image Synthesis

no code implementations • 13 Apr 2023 • Hongchen Tan, BaoCai Yin, Kun Wei, Xiuping Liu, Xin Li

The ALR-GAN includes an Adaptive Layout Refinement (ALR) module and a Layout Visual Refinement (LVR) loss.

Generative Adversarial Network Text-to-Image Generation

Paper
Add Code

CoMaL: Conditional Maximum Likelihood Approach to Self-supervised Domain Adaptation in Long-tail Semantic Segmentation

no code implementations • 14 Apr 2023 • Thanh-Dat Truong, Chi Nhan Duong, Pierce Helton, Ashley Dowling, Xin Li, Khoa Luu

They are insufficient to model both global and local structures of a given image, especially in small regions of tail classes.

Domain Adaptation Segmentation +1

Paper
Add Code

Video-based Contrastive Learning on Decision Trees: from Action Recognition to Autism Diagnosis

no code implementations • 20 Apr 2023 • Mindi Ruan, Xiangxu Yu, Na Zhang, Chuanbo Hu, Shuo Wang, Xin Li

How can we teach a computer to recognize 10, 000 different actions?

Action Recognition Binary Classification +4

Paper
Add Code

MEDIC: A Multimodal Empathy Dataset in Counseling

no code implementations • 4 May 2023 • Zhou'an_Zhu, Xin Li, Jicai Pan, Yufei Xiao, Yanan Chang, Feiyi Zheng, Shangfei Wang

We also propose three labels (i. e., expression of experience, emotional reaction, and cognitive reaction) to describe the degree of empathy between counselors and their clients.

Paper
Add Code

UPDExplainer: an Interpretable Transformer-based Framework for Urban Physical Disorder Detection Using Street View Imagery

no code implementations • 4 May 2023 • Chuanbo Hu, Shan Jia, Fan Zhang, Changjiang Xiao, Mindi Ruan, Jacob Thrasher, Xin Li

Experimental results on the re-annotated Place Pulse 2. 0 dataset demonstrate promising detection performance of the proposed method, with an accuracy of 79. 9%.

Semantic Segmentation

Paper
Add Code

SoGAR: Self-supervised Spatiotemporal Attention-based Social Group Activity Recognition

no code implementations • 27 Apr 2023 • Naga VS Raviteja Chappa, Pha Nguyen, Alexander H Nelson, Han-Seok Seo, Xin Li, Page Daniel Dobbs, Khoa Luu

This paper introduces a novel approach to Social Group Activity Recognition (SoGAR) using Self-supervised Transformers network that can effectively utilize unlabeled video data.

Group Activity Recognition

Paper
Add Code

GeoGLUE: A GeoGraphic Language Understanding Evaluation Benchmark

no code implementations • 11 May 2023 • Dongyang Li, Ruixue Ding, Qiang Zhang, Zheng Li, Boli Chen, Pengjun Xie, Yao Xu, Xin Li, Ning Guo, Fei Huang, Xiaofeng He

With a fast developing pace of geographic applications, automatable and intelligent models are essential to be designed to handle the large volume of information.

Entity Alignment Natural Language Understanding

Paper
Add Code

Vector Quantization With Self-Attention for Quality-Independent Representation Learning

no code implementations • CVPR 2023 • Zhou Yang, Weisheng Dong, Xin Li, Mengluan Huang, Yulin Sun, Guangming Shi

During training, we enforce the quantization of features from clean and corrupted images in the same discrete embedding space so that an invariant quality-independent feature representation can be learned to improve the recognition robustness of low-quality images.

Data Augmentation Image Restoration +2

Paper
Add Code

Self-Supervised Non-Uniform Kernel Estimation With Flow-Based Motion Prior for Blind Image Deblurring

no code implementations • CVPR 2023 • Zhenxuan Fang, Fangfang Wu, Weisheng Dong, Xin Li, Jinjian Wu, Guangming Shi

To address these issues, we propose to represent the field of motion blur kernels in a latent space by normalizing flows, and design CNNs to predict the latent codes instead of motion kernels.

Blind Image Deblurring Image Deblurring

Paper
Add Code

Two-Stream Regression Network for Dental Implant Position Prediction

no code implementations • 17 May 2023 • Xinquan Yang, Xuguang Li, Xuechen Li, WenTing Chen, Linlin Shen, Xin Li, Yongqiang Deng

In this paper, we develop a two-stream implant position regression framework (TSIPR), which consists of an implant region detector (IRD) and a multi-scale patch embedding regression network (MSPENet), to address this issue.

Position Position regression +1

Paper
Add Code

Cross-supervised Dual Classifiers for Semi-supervised Medical Image Segmentation

no code implementations • 25 May 2023 • Zhenxi Zhang, Ran Ran, Chunna Tian, Heng Zhou, Fan Yang, Xin Li, Zhicheng Jiao

This paper proposes a cross-supervised learning framework based on dual classifiers (DC-Net), including an evidential classifier and a vanilla classifier.

Image Segmentation Segmentation +2

Paper
Add Code

Self-aware and Cross-sample Prototypical Learning for Semi-supervised Medical Image Segmentation

no code implementations • 25 May 2023 • Zhenxi Zhang, Ran Ran, Chunna Tian, Heng Zhou, Xin Li, Fan Yang, Zhicheng Jiao

To address these issues, we propose a self-aware and cross-sample prototypical learning method (SCP-Net) to enhance the diversity of prediction in consistency learning by utilizing a broader range of semantic information derived from multiple inputs.

Image Segmentation Semantic Segmentation +1

Paper
Add Code

A2B: Anchor to Barycentric Coordinate for Robust Correspondence

no code implementations • 5 Jun 2023 • Weiyue Zhao, Hao Lu, Zhiguo Cao, Xin Li

This approach offers a new perspective to alleviate the problem of repeated patterns and emphasizes the importance of choosing coordinate representations for feature correspondences.

Paper
Add Code

Learning Probabilistic Coordinate Fields for Robust Correspondences

no code implementations • 7 Jun 2023 • Weiyue Zhao, Hao Lu, Xinyi Ye, Zhiguo Cao, Xin Li

We introduce Probabilistic Coordinate Fields (PCFs), a novel geometric-invariant coordinate representation for image correspondence problems.

Image Registration Pose Estimation

Paper
Add Code

Deep learning radiomics for assessment of gastroesophageal varices in people with compensated advanced chronic liver disease

no code implementations • 13 Jun 2023 • Lan Wang, Ruiling He, Lili Zhao, Jia Wang, Zhengzi Geng, Tao Ren, Guo Zhang, Peng Zhang, Kaiqiang Tang, Chaofei Gao, Fei Chen, Liting Zhang, Yonghe Zhou, Xin Li, Fanbin He, Hui Huan, Wenjuan Wang, Yunxiao Liang, Juan Tang, Fang Ai, Tingyu Wang, Liyun Zheng, Zhongwei Zhao, Jiansong Ji, Wei Liu, Jiaojiao Xu, Bo Liu, Xuemei Wang, Yao Zhang, Qiong Yan, Muhan Lv, Xiaomei Chen, Shuhua Zhang, Yihua Wang, Yang Liu, Li Yin, Yanni Liu, Yanqing Huang, Yunfang Liu, Kun Wang, Meiqin Su, Li Bian, Ping An, Xin Zhang, Linxue Qian, Shao Li, Xiaolong Qi

Validation analysis revealed that the AUCs of DLRP were 0. 91 for GEV (95% CI 0. 90 to 0. 93, p < 0. 05) and 0. 88 for HRV (95% CI 0. 86 to 0. 89, p < 0. 01), which were significantly and robustly better than canonical risk indicators, including the value of LSM and SSM.

Paper
Add Code

Securing Visually-Aware Recommender Systems: An Adversarial Image Reconstruction and Detection Framework

no code implementations • 11 Jun 2023 • Minglei Yin, Bin Liu, Neil Zhenqiang Gong, Xin Li

Our proposed method can simultaneously (1) secure VARS from adversarial attacks characterized by local perturbations by image reconstruction based on global vision transformers; and (2) accurately detect adversarial examples using a novel contrastive learning approach.

Contrastive Learning Image Reconstruction +1

Paper
Add Code

TCEIP: Text Condition Embedded Regression Network for Dental Implant Position Prediction

no code implementations • 26 Jun 2023 • Xinquan Yang, Jinheng Xie, Xuguang Li, Xuechen Li, Xin Li, Linlin Shen, Yongqiang Deng

When deep neural network has been proposed to assist the dentist in designing the location of dental implant, most of them are targeting simple cases where only one missing tooth is available.

Position Position regression +1

Paper
Add Code

Unveiling the Potential of Knowledge-Prompted ChatGPT for Enhancing Drug Trafficking Detection on Social Media

no code implementations • 7 Jul 2023 • Chuanbo Hu, Bin Liu, Xin Li, Yanfang Ye

By integrating prior knowledge and the proposed prompts, ChatGPT can effectively identify and label drug trafficking activities on social networks, even in the presence of deceptive language and euphemisms used by drug dealers to evade detection.

Marketing

Paper
Add Code

Adaptive Control of Resource Flow to Optimize Construction Work and Cash Flow via Online Deep Reinforcement Learning

no code implementations • 20 Jul 2023 • Can Jiang, Xin Li, Jia-Rui Lin, Ming Liu, Zhiliang Ma

Therefore, this paper introducess a model and method to adaptive control the resource flows to optimize the work and cash flows of construction projects.

Management

Paper
Add Code

Bi-Modality Medical Image Synthesis Using Semi-Supervised Sequential Generative Adversarial Networks

no code implementations • 27 Aug 2023 • Xin Yang, Yi Lin, Zhiwei Wang, Xin Li, Kwang-Ting Cheng

A method for measuring the synthesis complexity is proposed to automatically determine the synthesis order in our sequential GAN.

Generative Adversarial Network Image Generation

Paper
Add Code

A Note on Randomized Kaczmarz Algorithm for Solving Doubly-Noisy Linear Systems

no code implementations • 31 Aug 2023 • El Houcine Bergou, Soumia Boucherouite, Aritra Dutta, Xin Li, Anna Ma

In this paper, we analyze the convergence of RK for noisy linear systems when the coefficient matrix, $A$, is corrupted with both additive and multiplicative noise, along with the noisy vector, $b$.

Paper
Add Code

VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation

no code implementations • 1 Sep 2023 • Xin Li, Wenqing Chu, Ye Wu, Weihang Yuan, Fanglong Liu, Qi Zhang, Fu Li, Haocheng Feng, Errui Ding, Jingdong Wang

In this paper, we present VideoGen, a text-to-video generation approach, which can generate a high-definition video with high frame fidelity and strong temporal consistency using reference-guided latent diffusion.

Text-to-Image Generation Text-to-Video Generation +1

Paper
Add Code

3D Multiple Object Tracking on Autonomous Driving: A Literature Review

no code implementations • 27 Sep 2023 • Peng Zhang, Xin Li, Liang He, Xin Lin

This paper undertakes a comprehensive examination, assessment, and synthesis of the research landscape in this domain, remaining attuned to the latest developments in 3D MOT while suggesting prospective avenues for future investigation.

3D Multi-Object Tracking Autonomous Driving +1

Paper
Add Code

FreqAlign: Excavating Perception-oriented Transferability for Blind Image Quality Assessment from A Frequency Perspective

no code implementations • 29 Sep 2023 • Xin Li, Yiting Lu, Zhibo Chen

Based on this, we propose to improve the perception-oriented transferability of BIQA by performing feature frequency decomposition and selecting the frequency components that contained the most transferable perception knowledge for alignment.

Blind Image Quality Assessment Unsupervised Domain Adaptation

Paper
Add Code

Demystifying the Myths and Legends of Nonconvex Convergence of SGD

no code implementations • 19 Oct 2023 • Aritra Dutta, El Houcine Bergou, Soumia Boucherouite, Nicklas Werge, Melih Kandemir, Xin Li

Additionally, our analyses allow us to measure the density of the $\epsilon$-stationary points in the final iterates of SGD, and we recover the classical $O(\frac{1}{\sqrt{T}})$ asymptotic rate under various existing assumptions on the objective function and the bounds on the stochastic gradient.

Paper
Add Code

Diagnosis-oriented Medical Image Compression with Efficient Transfer Learning

no code implementations • 20 Oct 2023 • Guangqi Xie, Xin Li, Xiaohan Pan, Zhibo Chen

Remote medical diagnosis has emerged as a critical and indispensable technique in practical medical systems, where medical data are required to be efficiently compressed and transmitted for diagnosis by either professional doctors or intelligent diagnosis devices.

Coronary Artery Segmentation Image Compression +2

Paper
Add Code

Good Better Best: Self-Motivated Imitation Learning for noisy Demonstrations

no code implementations • 24 Oct 2023 • Ye Yuan, Xin Li, Yong Heng, Leiji Zhang, Mingzhong Wang

Imitation Learning (IL) aims to discover a policy by minimizing the discrepancy between the agent's behavior and expert demonstrations.

Imitation Learning

Paper
Add Code

Towards Control-Centric Representations in Reinforcement Learning from Images

no code implementations • 25 Oct 2023 • Chen Liu, Hongyu Zang, Xin Li, Yong Heng, Yifei Wang, Zhen Fang, Yisen Wang, Mingzhong Wang

Image-based Reinforcement Learning is a practical yet challenging task.

Atari Games reinforcement-learning

Paper
Add Code

Overhead Line Defect Recognition Based on Unsupervised Semantic Segmentation

no code implementations • 2 Nov 2023 • Weixi Wang, Xichen Zhong, Xin Li, Sizhe Li, Xun Ma

Overhead line inspection greatly benefits from defect recognition using visible light imagery.

Segmentation Unsupervised Semantic Segmentation

Paper
Add Code

Insect-Foundation: A Foundation Model and Large-scale 1M Dataset for Visual Insect Understanding

no code implementations • 26 Nov 2023 • Hoang-Quan Nguyen, Thanh-Dat Truong, Xuan Bac Nguyen, Ashley Dowling, Xin Li, Khoa Luu

In precision agriculture, the detection and recognition of insects play an essential role in the ability of crops to grow healthy and produce a high-quality yield.

Self-Supervised Learning

Paper
Add Code

Surf-D: Generating High-Quality Surfaces of Arbitrary Topologies Using Diffusion Models

no code implementations • 28 Nov 2023 • Zhengming Yu, Zhiyang Dou, Xiaoxiao Long, Cheng Lin, Zekun Li, YuAn Liu, Norman Müller, Taku Komura, Marc Habermann, Christian Theobalt, Xin Li, Wenping Wang

The experiments demonstrate the superior performance of Surf-D in shape generation across multiple modalities as conditions.

3D Reconstruction

Paper
Add Code

Brainformer: Modeling MRI Brain Functions to Machine Vision

no code implementations • 30 Nov 2023 • Xuan-Bac Nguyen, Xin Li, Samee U. Khan, Khoa Luu

In this work, we first present a simple yet effective Brainformer approach, a novel Transformer-based framework, to analyze the patterns of fMRI in the human perception system from the machine learning perspective.

Paper
Add Code

Cross-BERT for Point Cloud Pretraining

no code implementations • 8 Dec 2023 • Xin Li, Peng Li, Zeyong Wei, Zhe Zhu, Mingqiang Wei, Junhui Hou, Liangliang Nan, Jing Qin, Haoran Xie, Fu Lee Wang

By performing cross-modal interaction, Cross-BERT can smoothly reconstruct the masked tokens during pretraining, leading to notable performance enhancements for downstream tasks.

Self-Supervised Learning

Paper
Add Code

Disentangled Clothed Avatar Generation from Text Descriptions

no code implementations • 8 Dec 2023 • Jionghao Wang, YuAn Liu, Zhiyang Dou, Zhengming Yu, Yongqing Liang, Xin Li, Wenping Wang, Rong Xie, Li Song

In this paper, we introduced a novel text-to-avatar generation method that separately generates the human body and the clothes and allows high-quality animation on the generated avatar.

Virtual Try-on

Paper
Add Code

Video Quality Assessment Based on Swin TransformerV2 and Coarse to Fine Strategy

no code implementations • 16 Jan 2024 • Zihao Yu, Fengbin Guan, Yiting Lu, Xin Li, Zhibo Chen

Furthermore, a temporal transformer is utilized for spatiotemporal feature fusion across the video.

Image Quality Assessment Video Quality Assessment +1

Paper
Add Code

Spectrum-guided Feature Enhancement Network for Event Person Re-Identification

no code implementations • 2 Feb 2024 • Hongchen Tan, Yi Zhang, Xiuping Liu, BaoCai Yin, Nan Ma, Xin Li, Huchuan Lu

This network consists of two innovative components: the Multi-grain Spectrum Attention Mechanism (MSAM) and the Consecutive Patch Dropout Module (CPDM).

Person Re-Identification

Paper
Add Code

Evading Deep Learning-Based Malware Detectors via Obfuscation: A Deep Reinforcement Learning Approach

no code implementations • 4 Feb 2024 • Brian Etter, James Lee Hu, Mohammedreza Ebrahimi, Weifeng Li, Xin Li, Hsinchun Chen

Adversarial Malware Generation (AMG), the gen- eration of adversarial malware variants to strengthen Deep Learning (DL)-based malware detectors has emerged as a crucial tool in the development of proactive cyberdefense.

Malware Detection reinforcement-learning +1

Paper
Add Code

scInterpreter: Training Large Language Models to Interpret scRNA-seq Data for Cell Type Annotation

no code implementations • 18 Feb 2024 • Cong Li, Meng Xiao, Pengfei Wang, Guihai Feng, Xin Li, Yuanchun Zhou

Despite the inherent limitations of existing Large Language Models in directly reading and interpreting single-cell omics data, they demonstrate significant potential and flexibility as the Foundation Model.

Language Modelling Large Language Model

Paper
Add Code

On Organizational Principles of Neural Systems

no code implementations • 22 Feb 2024 • Xin Li

Inspired by classical embodied cognition and the emerging multimodal interaction, we study the organizational principles of neural systems at three levels (device/implementation, circuit/algorithm, and system/computational) in this survey paper.

Paper
Add Code

Neural Radiance Fields in Medical Imaging: Challenges and Next Steps

no code implementations • 26 Feb 2024 • Xin Wang, Shu Hu, Heng Fan, Hongtu Zhu, Xin Li

Neural Radiance Fields (NeRF), as a pioneering technique in computer vision, offer great potential to revolutionize medical imaging by synthesizing three-dimensional representations from the projected two-dimensional image data.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.