Search Results for author: Xin Li

Found 367 papers, 157 papers with code

The similarity metric

no code implementations 20 Nov 2001 Ming Li, Xin Chen, Xin Li, Bin Ma, Paul Vitanyi

A new class of distances appropriate for measuring similarity relations between sequences, say one type of similarity per distance, is studied.

On the K-theory of crossed products by automorphic semigroup actions

1 code implementation 24 May 2012 Joachim Cuntz, Siegfried Echterhoff, Xin Li

Let P be a semigroup that admits an embedding into a group G. Assume that the embedding satisfies a certain Toeplitz condition and that the Baum-Connes conjecture holds for G. We prove a formula describing the K-theory of the reduced crossed product $A \rtimes_{\alpha, r} P$ by any automorphic action of P. This formula is obtained as a consequence of a result on the K-theory of crossed products for special actions of G on totally disconnected spaces.

Operator Algebras Dynamical Systems K-Theory and Homology 46L05, 46L80 (Primary) 20Mxx, 11R04 (Secondary)

Adaptive Active Learning for Image Classification

no code implementations CVPR 2013 Xin Li, Yuhong Guo

Recently, active learning has attracted a lot of attention in the computer vision field, as it is time-consuming and costly to prepare a good set of labeled images for vision data analysis.

Active Learning Classification +4

Detecting Suicidal Ideation in Chinese Microblogs with Psychological Lexicons

no code implementations 4 Nov 2014 Xiaolei Huang, Lei Zhang, Tianli Liu, David Chiu, Tingshao Zhu, Xin Li

Currently, we have identified 53 known suicidal cases who posted suicide notes on Weibo prior to their deaths. We explore linguistic features of these known cases using a psychological lexicon dictionary, and train an effective suicidal Weibo post detection model.

BIG-bench Machine Learning

Simplified Mirror-Based Camera Pose Computation via Rotation Averaging

no code implementations CVPR 2015 Gucan Long, Laurent Kneip, Xin Li, Xiaohu Zhang, Qifeng Yu

Our theoretical contribution extends the applicability of rotation averaging to a more general case, and enables mirror-based pose estimation in closed-form under the chordal L2-metric, or in an outlier-robust way by employing iterative L1-norm averaging.

Camera Calibration Pose Estimation

Cross-scale predictive dictionaries

no code implementations 16 Nov 2015 Vishwanath Saragadam, Xin Li, Aswin Sankaranarayanan

Sparse representations using data dictionaries provide an efficient model particularly for signals that do not enjoy alternate analytic sparsifying transformations.

Low-Rank Tensor Approximation With Laplacian Scale Mixture Modeling for Multiframe Image Denoising

no code implementations ICCV 2015 Weisheng Dong, Guangyu Li, Guangming Shi, Xin Li, Yi Ma

Patch-based low-rank models have been shown to be effective in exploiting the spatial redundancy of natural images, especially for image denoising.

Dictionary Learning Image Denoising

Semi-Supervised Zero-Shot Classification With Label Representation Learning

no code implementations ICCV 2015 Xin Li, Yuhong Guo, Dale Schuurmans

Most existing zero-shot learning methods require a user to first provide a set of semantic visual attributes for each class as side information before applying a two-step prediction procedure that introduces an intermediate attribute prediction problem.

Attribute Classification +4

Learning Parametric Sparse Models for Image Super-Resolution

no code implementations NeurIPS 2016 Yongbo Li, Weisheng Dong, Xuemei Xie, Guangming Shi, Xin Li, Donglai Xu

More specifically, the parametric sparse priors of the desired high-resolution (HR) image patches are learned from both the input low-resolution (LR) image and a training image dataset.

Image Super-Resolution

ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA

no code implementations 1 Dec 2016 Song Han, Junlong Kang, Huizi Mao, Yiming Hu, Xin Li, Yubin Li, Dongliang Xie, Hong Luo, Song Yao, Yu Wang, Huazhong Yang, William J. Dally

Evaluated on the LSTM for speech recognition benchmark, ESE is 43x and 3x faster than Core i7 5930k CPU and Pascal Titan X GPU implementations.

Quantization speech-recognition +1

Video Scene Parsing with Predictive Feature Learning

no code implementations ICCV 2017 Xiaojie Jin, Xin Li, Huaxin Xiao, Xiaohui Shen, Zhe Lin, Jimei Yang, Yunpeng Chen, Jian Dong, Luoqi Liu, Zequn Jie, Jiashi Feng, Shuicheng Yan

In this way, the network can effectively learn to capture video dynamics and temporal context, which are critical clues for video scene parsing, without requiring extra manual annotations.

Representation Learning Scene Parsing

On Improving Deep Reinforcement Learning for POMDPs

1 code implementation 26 Apr 2017 Pengfei Zhu, Xin Li, Pascal Poupart, Guanghui Miao

Deep Reinforcement Learning (RL) recently emerged as one of the most competitive approaches for learning in sequential decision making problems with fully observable environments, e.g., computer Go.

Atari Games Decision Making +4

Object-Aware Dense Semantic Correspondence

no code implementations CVPR 2017 Fan Yang, Xin Li, Hong Cheng, Jianping Li, Leiting Chen

To address these problems, this paper proposes an object-aware method to estimate per-pixel correspondences from semantic to low-level by learning a classifier for each selected discriminative grid cell and guiding the localization of every pixel under the semantic constraint.

Object Semantic correspondence

A Batch-Incremental Video Background Estimation Model using Weighted Low-Rank Approximation of Matrices

no code implementations 2 Jul 2017 Aritra Dutta, Xin Li, Peter Richtárik

Principal component pursuit (PCP) is a state-of-the-art approach for background estimation problems.

Weighted Low Rank Approximation for Background Estimation Problems

no code implementations 4 Jul 2017 Aritra Dutta, Xin Li

Classical principal component analysis (PCA) is not robust to the presence of sparse outliers in the data.

FoveaNet: Perspective-aware Urban Scene Parsing

no code implementations ICCV 2017 Xin Li, Zequn Jie, Wei Wang, Changsong Liu, Jimei Yang, Xiaohui Shen, Zhe Lin, Qiang Chen, Shuicheng Yan, Jiashi Feng

Thus, they suffer from heterogeneous object scales caused by perspective projection of cameras on actual scenes and inevitably encounter parsing failures on distant objects as well as other boundary and recognition errors.

Scene Parsing

Prune the Convolutional Neural Networks with Sparse Shrink

no code implementations 8 Aug 2017 Xin Li, Changsong Liu

These results have demonstrated the effectiveness of our "Sparse Shrink" algorithm.

Learning with Rethinking: Recurrently Improving Convolutional Neural Networks through Feedback

no code implementations 15 Aug 2017 Xin Li, Zequn Jie, Jiashi Feng, Changsong Liu, Shuicheng Yan

However, most of the existing CNN models only learn features through a feedforward structure and no feedback information from top to bottom layers is exploited to enable the networks to refine themselves.

SBGAR: Semantics Based Group Activity Recognition

1 code implementation ICCV 2017 Xin Li, Mooi Choo Chuah

Activity recognition has become an important function in many emerging computer vision applications, e.g., automatic video surveillance systems, human-computer interaction applications, and video recommendation systems.

Group Activity Recognition

Hierarchical Spatial-aware Siamese Network for Thermal Infrared Object Tracking

1 code implementation 27 Nov 2017 Xin Li, Qiao Liu, Nana Fan, Zhenyu He, Hongzhi Wang

In this paper, we cast the TIR tracking problem as a similarity verification task, which is coupled well to the objective of the tracking task.

General Classification Thermal Infrared Object Tracking

PTB-TIR: A Thermal Infrared Pedestrian Tracking Benchmark

1 code implementation 18 Jan 2018 Qiao Liu, Zhenyu He, Xin Li, Yuan Zheng

The ability to evaluate the TIR pedestrian tracker fairly, on a benchmark dataset, is significant for the development of this field.

Attribute Thermal Infrared Object Tracking

Joint Demosaicing and Denoising with Perceptual Optimization on a Generative Adversarial Network

no code implementations 13 Feb 2018 Weishong Dong, Ming Yuan, Xin Li, Guangming Shi

Image demosaicing, one of the most important early stages in digital camera pipelines, addresses the problem of reconstructing a full-resolution image from so-called color-filter-arrays.

Demosaicking Denoising +2

ReHAR: Robust and Efficient Human Activity Recognition

no code implementations 27 Feb 2018 Xin Li, Mooi Choo Chuah

The whole model is trained end-to-end to allow meaningful representations to be generated for the final activity recognition.

Human Activity Recognition Optical Flow Estimation

Weighted Low-Rank Approximation of Matrices and Background Modeling

no code implementations 15 Apr 2018 Aritra Dutta, Xin Li, Peter Richtarik

We primarily study a special weighted low-rank approximation of matrices and then apply it to solve the background modeling problem.

On Improving Deep Reinforcement Learning for POMDPs

no code implementations 17 Apr 2018 Pengfei Zhu, Xin Li, Pascal Poupart, Guanghui Miao

Deep Reinforcement Learning (RL) recently emerged as one of the most competitive approaches for learning in sequential decision making problems with fully observable environments, e.g., computer Go.

Atari Games Decision Making +4

Aspect Term Extraction with History Attention and Selective Transformation

1 code implementation 2 May 2018 Xin Li, Lidong Bing, Piji Li, Wai Lam, Zhimou Yang

Aspect Term Extraction (ATE), a key sub-task in Aspect-Based Sentiment Analysis, aims to extract explicit aspect expressions from online user reviews.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +2

Perceptually Optimized Generative Adversarial Network for Single Image Dehazing

no code implementations 3 May 2018 Yixin Du, Xin Li

To overcome this weakness, we propose a direct deep learning approach toward image dehazing bypassing the step of transmission map estimation and facilitating end-to-end perceptual optimization.

Denoising Generative Adversarial Network +2

Transformation Networks for Target-Oriented Sentiment Classification

2 code implementations ACL 2018 Xin Li, Lidong Bing, Wai Lam, Bei Shi

Between the two layers, we propose a component to generate target-specific representations of words in the sentence, while incorporating a mechanism for preserving the original contextual information from the RNN layer.

Aspect-Based Sentiment Analysis (ABSA) Classification +3

Compressed Sensing of Scanning Transmission Electron Microscopy (STEM) on Non-Rectangular Scans

1 code implementation 13 May 2018 Xin Li, Ondrej Dyck, Sergei V. Kalinin, Stephen Jesse

Scanning Transmission Electron Microscopy (STEM) has become the mainstay for materials characterization at the atomic level, with applications ranging from visualization of localized and extended defects to mapping order parameter fields.

GANE: A Generative Adversarial Network Embedding

no code implementations 18 May 2018 Huiting Hong, Xin Li, Mingzhong Wang

Network embedding, which can provide low-dimensional feature representations for many machine learning applications, has recently become a hot research topic.

Clustering Generative Adversarial Network +2

Learning Hybrid Sparsity Prior for Image Restoration: Where Deep Learning Meets Sparse Coding

no code implementations 18 Jul 2018 Fangfang Wu, Weisheng Dong, Guangming Shi, Xin Li

State-of-the-art approaches toward image restoration can be classified into model-based and learning-based.

Image Restoration

Contour Knowledge Transfer for Salient Object Detection

1 code implementation ECCV 2018 Xin Li, Fan Yang, Hong Cheng, Wei Liu, Dinggang Shen

Our goal is to overcome this limitation by automatically converting an existing deep contour detection model into a salient object detection model without using any manual salient object masks.

Contour Detection Object +4

JigsawNet: Shredded Image Reassembly using Convolutional Neural Network and Loop-based Composition

3 code implementations 11 Sep 2018 Canyu Le, Xin Li

Existing reassembly pipelines commonly consist of a local matching stage and a global composition stage.

Superimposition-guided Facial Reconstruction from Skull

no code implementations 28 Sep 2018 Celong Liu, Xin Li

We develop a new algorithm to perform facial reconstruction from a given skull.

Facial Inpainting

Manifold Learning of Four-dimensional Scanning Transmission Electron Microscopy

1 code implementation 18 Oct 2018 Xin Li, Ondrej E. Dyck, Mark P. Oxley, Andrew R. Lupini, Leland McInnes, John Healy, Stephen Jesse, Sergei V. Kalinin

Four-dimensional scanning transmission electron microscopy (4D-STEM) of local atomic diffraction patterns is emerging as a powerful technique for probing intricate details of atomic structure and atomic electric fields.

Exploiting Coarse-to-Fine Task Transfer for Aspect-level Sentiment Classification

1 code implementation AAAI 2019 2018 Zheng Li, Ying WEI, Yu Zhang, Xiang Zhang, Xin Li, Qiang Yang

Aspect-level sentiment classification (ASC) aims at identifying sentiment polarities towards aspects in a sentence, where the aspect can behave as a general Aspect Category (AC) or a specific Aspect Term (AT).

General Classification Sentence +2

DAC: Data-free Automatic Acceleration of Convolutional Networks

1 code implementation 20 Dec 2018 Xin Li, Shuai Zhang, Bolan Jiang, Yingyong Qi, Mooi Choo Chuah, Ning Bi

A complex deep learning model with high accuracy runs slowly on resource-limited devices, while a light-weight model that runs much faster loses accuracy.

Image Classification Multi-Person Pose Estimation +2

CONet: A Cognitive Ocean Network

no code implementations 9 Jan 2019 Huimin Lu, Dong Wang, Yujie Li, Jianru Li, Xin Li, Hyoungseop Kim, Seiichi Serikawa, Iztok Humar

The Cognitive Ocean Network (CONet) will become the mainstream of future ocean science and engineering developments.

Partial Order Pruning: for Best Speed/Accuracy Trade-off in Neural Architecture Search

2 code implementations CVPR 2019 Xin Li, Yiming Zhou, Zheng Pan, Jiashi Feng

It prunes the architecture search space with a partial order assumption to automatically search for the architectures with the best speed and accuracy trade-off.

Neural Architecture Search

Iris R-CNN: Accurate Iris Segmentation in Non-cooperative Environment

no code implementations 25 Mar 2019 Chunyang Feng, Yufeng Sun, Xin Li

Despite the significant advances in iris segmentation, accomplishing accurate iris segmentation in non-cooperative environments remains a grand challenge.

Iris Segmentation Region Proposal +1

Pyramid Mask Text Detector

1 code implementation 28 Mar 2019 Jingchao Liu, Xuebo Liu, Jie Sheng, Ding Liang, Xin Li, Qingjie Liu

Scene text detection, an essential step in scene text recognition systems, aims to locate text instances in natural scene images automatically.

Clustering Instance Segmentation +4

NM-Net: Mining Reliable Neighbors for Robust Feature Correspondences

1 code implementation CVPR 2019 Chen Zhao, Zhiguo Cao, Chi Li, Xin Li, Jiaqi Yang

Feature correspondence selection is pivotal to many feature-matching based tasks in computer vision.

Target-Aware Deep Tracking

no code implementations CVPR 2019 Xin Li, Chao Ma, Baoyuan Wu, Zhenyu He, Ming-Hsuan Yang

Despite demonstrated successes for numerous vision tasks, the contributions of using pre-trained deep features for visual tracking are not as significant as those for object recognition.

Object Object Recognition +1

No trends in spring and autumn phenology during the global warming hiatus

1 code implementation Nature Communications 2019 Xufeng Wang, Jingfeng Xiao, Xin Li, Guodong Cheng, Mingguo Ma, Gaofeng Zhu, M. Altaf Arain, T. Andrew Black & Rachhpal S. Jassal

Phenology plays a fundamental role in regulating photosynthesis, evapotranspiration, and surface energy fluxes and is sensitive to climate change.

RF-Net: An End-to-End Image Matching Network based on Receptive Field

1 code implementation CVPR 2019 Xuelun Shen, Cheng Wang, Xin Li, Zenglei Yu, Jonathan Li, Chenglu Wen, Ming Cheng, Zijian He

This paper proposes a new end-to-end trainable matching network based on receptive field, RF-Net, to compute sparse correspondence between images.

Keypoint Detection

STN-Homography: estimate homography parameters directly

no code implementations 6 Jun 2019 Qiang Zhou, Xin Li

In this paper, we introduce the STN-Homography model to directly estimate the homography matrix between an image pair.

Homography Estimation

Learning Deep Multi-Level Similarity for Thermal Infrared Object Tracking

1 code implementation 9 Jun 2019 Qiao Liu, Xin Li, Zhenyu He, Nana Fan, Di Yuan, Hongpeng Wang

These two similarities complement each other and hence enhance the discriminative capacity of the network for handling distractors.

Semantic Similarity Thermal Infrared Object Tracking

Vispi: Automatic Visual Perception and Interpretation of Chest X-rays

no code implementations MIDL 2019 Xin Li, Rui Cao, Dongxiao Zhu

Medical imaging contains the essential information for rendering diagnostic and treatment decisions.

Image Captioning

Reconstructing Perceived Images from Brain Activity by Visually-guided Cognitive Representation and Adversarial Learning

no code implementations 27 Jun 2019 Ziqi Ren, Jie Li, Xuetong Xue, Xin Li, Fan Yang, Zhicheng Jiao, Xinbo Gao

In addition, we introduce a novel three-stage learning approach which enables the (cognitive) encoder to gradually distill useful knowledge from the paired (visual) encoder during the learning process.

Generative Adversarial Network Image Reconstruction +2

GRIP++: Enhanced Graph-based Interaction-aware Trajectory Prediction for Autonomous Driving

5 code implementations arXiv preprint 2020 Xin Li, Xiaowen Ying, Mooi Choo Chuah

Despite the advancement in the technology of autonomous driving cars, the safety of a self-driving car is still a challenging problem that has not been well studied.

Autonomous Driving motion prediction +1

BMN: Boundary-Matching Network for Temporal Action Proposal Generation

15 code implementations ICCV 2019 Tianwei Lin, Xiao Liu, Xin Li, Errui Ding, Shilei Wen

To address these difficulties, we introduce the Boundary-Matching (BM) mechanism to evaluate confidence scores of densely distributed proposals, which denote a proposal as a matching pair of starting and ending boundaries and combine all densely distributed BM pairs into the BM confidence map.

Action Detection Action Recognition +1

Domain-adversarial Network Alignment

1 code implementation 15 Aug 2019 Huiting Hong, Xin Li, Yuangang Pan, Ivor Tsang

Network alignment is a critical task to a wide variety of fields.

Network Embedding

Deep Concept-wise Temporal Convolutional Networks for Action Localization

2 code implementations 26 Aug 2019 Xin Li, Tianwei Lin, Xiao Liu, Chuang Gan, WangMeng Zuo, Chao Li, Xiang Long, Dongliang He, Fu Li, Shilei Wen

In this paper, we empirically find that stacking more conventional temporal convolution layers actually deteriorates action classification performance, possibly because all channels of the 1D feature map, which are generally highly abstract and can be regarded as latent concepts, are excessively recombined in temporal convolution.

Action Classification Action Localization

Small and Practical BERT Models for Sequence Labeling

no code implementations IJCNLP 2019 Henry Tsai, Jason Riesa, Melvin Johnson, Naveen Arivazhagan, Xin Li, Amelia Archer

We propose a practical scheme to train a single multilingual sequence labeling model that yields state of the art results and is small and fast enough to run on a single CPU.

Part-Of-Speech Tagging

Iterative Clustering with Game-Theoretic Matching for Robust Multi-consistency Correspondence

no code implementations 3 Sep 2019 Chen Zhao, Jiaqi Yang, Ke Xian, Zhiguo Cao, Xin Li

Matching corresponding features between two images is a fundamental task in computer vision with numerous applications in object recognition, robotics, and 3D reconstruction.

3D Reconstruction Clustering +2

Improving Fine-grained Entity Typing with Entity Linking

1 code implementation IJCNLP 2019 Hongliang Dai, Donghong Du, Xin Li, Yangqiu Song

Fine-grained entity typing is a challenging problem since it usually involves a relatively large tag set and may require understanding the context of the entity mention.

Entity Linking Entity Typing +1

Spoofing and Anti-Spoofing with Wax Figure Faces

no code implementations 12 Oct 2019 Shan Jia, Xin Li, Chuanbo Hu, Zhengquan Xu

In this work, we introduce a wax figure face database (WFFD) as a novel and super-realistic 3D face presentation attack.

Face Detection Face Recognition +1

Anion charge-lattice volume dependent Li ion migration in compounds with the face-centered cubic anion frameworks

no code implementations 25 Oct 2019 Zhenming Xu, Xin Chen, Ronghan Chen, Xin Li, Hong Zhu

In this work, the face-centered cubic (fcc) anion frameworks were creatively constructed to study the effects of anion charge and lattice volume on the stability of lithium ion occupation and lithium ion migration.

Applied Physics

Rotation Invariant Point Cloud Classification: Where Local Geometry Meets Global Topology

1 code implementation 1 Nov 2019 Chen Zhao, Jiaqi Yang, Xin Xiong, Angfan Zhu, Zhiguo Cao, Xin Li

To the best of our knowledge, this work is the first principled approach toward adaptively combining global and local information under the context of RI point cloud analysis.

General Classification Point Cloud Classification

Sparse estimation via $\ell_q$ optimization method in high-dimensional linear regression

no code implementations 12 Nov 2019 Xin Li, Yaohua Hu, Chong Li, Xiaoqi Yang, Tianzi Jiang

In this paper, we discuss the statistical properties of the $\ell_q$ optimization methods $(0<q\leq 1)$, including the $\ell_q$ minimization method and the $\ell_q$ regularization method, for estimating a sparse parameter from noisy observations in high-dimensional linear regression with either a deterministic or random design.

regression Vocal Bursts Intensity Prediction

Relevance-Promoting Language Model for Short-Text Conversation

no code implementations 26 Nov 2019 Xin Li, Piji Li, Wei Bi, Xiaojiang Liu, Wai Lam

In this paper, we propose to formulate the STC task as a language modeling problem and tailor-make a training strategy to adapt a language model for response generation.

Language Modelling Response Generation +1

Multi-Task Driven Feature Models for Thermal Infrared Tracking

1 code implementation 26 Nov 2019 Qiao Liu, Xin Li, Zhenyu He, Nana Fan, Di Yuan, Wei Liu, Yonsheng Liang

These two feature models are learned using a multi-task matching framework and are jointly optimized on the TIR tracking task.

Thermal Infrared Object Tracking

Digital Twin: Acquiring High-Fidelity 3D Avatar from a Single Image

no code implementations 7 Dec 2019 Ruizhe Wang, Chih-Fan Chen, Hao Peng, Xudong Liu, Oliver Liu, Xin Li

We present an approach to generate a high-fidelity 3D face avatar with a high-resolution UV texture map from a single image.

Face Model Vocal Bursts Intensity Prediction

Face Beautification: Beyond Makeup Transfer

1 code implementation 8 Dec 2019 Xudong Liu, Ruizhe Wang, Chih-Fan Chen, Minglei Yin, Hao Peng, Shukhan Ng, Xin Li

Inspired by the latest advances in style-based synthesis and face beauty prediction, we propose a novel framework of face beautification.

Translation

Point2Node: Correlation Learning of Dynamic-Node for Point Cloud Feature Modeling

no code implementations 23 Dec 2019 Wenkai Han, Chenglu Wen, Cheng Wang, Xin Li, Qing Li

Point2Node can dynamically explore correlation among all graph nodes from different levels, and adaptively aggregate the learned features.

Hybrid Graph Neural Networks for Crowd Counting

no code implementations 31 Jan 2020 Ao Luo, Fan Yang, Xin Li, Dong Nie, Zhicheng Jiao, Shangchen Zhou, Hong Cheng

In this paper, we present a novel network structure called Hybrid Graph Neural Network (HyGnn), which aims to relieve the problem by interweaving the multi-scale features for crowd density and its auxiliary task (localization) and performing joint reasoning over a graph.

Crowd Counting

Exploiting Semantic Relations for Fine-grained Entity Typing

1 code implementation AKBC 2020 Hongliang Dai, Yangqiu Song, Xin Li

We find that, in some cases, existing neural fine-grained entity typing models may ignore the semantic information in the context that is important for typing.

Entity Typing Relation +2

A Real-Time Deep Network for Crowd Counting

1 code implementation 16 Feb 2020 Xiaowen Shi, Xin Li, Caili Wu, Shuchen Kong, Jing Yang, Liang He

Automatic analysis of highly crowded people has attracted extensive attention from computer vision research.

Crowd Counting

Improve SGD Training via Aligning Mini-batches

no code implementations 23 Feb 2020 Xiangrui Li, Deng Pan, Xin Li, Dongxiao Zhu

In each iteration of SGD, a mini-batch from the training data is sampled and the true gradient of the loss function is estimated as the noisy gradient calculated on this mini-batch.

Contextual-Bandit Based Personalized Recommendation with Time-Varying User Interests

no code implementations 29 Feb 2020 Xiao Xu, Fang Dong, Yanghua Li, Shaojian He, Xin Li

A contextual bandit problem is studied in a highly non-stationary environment, which is ubiquitous in various recommender systems due to the time-varying interests of users.

Recommendation Systems

On the Learning Property of Logistic and Softmax Losses for Deep Neural Networks

1 code implementation 4 Mar 2020 Xiangrui Li, Xin Li, Deng Pan, Dongxiao Zhu

Deep convolutional neural networks (CNNs) trained with logistic and softmax losses have made significant advancement in visual recognition tasks in computer vision.

Binary Classification Classification +2

Impact of Temperature and Relative Humidity on the Transmission of COVID-19: A Modeling Study in China and the United States

no code implementations 9 Mar 2020 Jingyuan Wang, Ke Tang, Kai Feng, Xin Li, Weifeng Lv, Kun Chen, Fei Wang

Primary outcome measures: Regression analysis of the impact of temperature and relative humidity on the effective reproductive number (R value).

regression

Toward Tag-free Aspect Based Sentiment Analysis: A Multiple Attention Network Approach

3 code implementations 22 Mar 2020 Yao Qiang, Xin Li, Dongxiao Zhu

Existing aspect based sentiment analysis (ABSA) approaches leverage various neural network models to extract the aspect sentiments via learning aspect-specific feature representations.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +1

COVID-MobileXpert: On-Device COVID-19 Patient Triage and Follow-up using Chest X-rays

1 code implementation 6 Apr 2020 Xin Li, Chengyin Li, Dongxiao Zhu

We design and implement a novel three-player knowledge transfer and distillation (KTD) framework including a pre-trained attending physician (AP) network that extracts CXR imaging features from a large scale of lung disease CXR images, a fine-tuned resident fellow (RF) network that learns the essential CXR imaging features to discriminate COVID-19 from pneumonia and/or normal cases with a small amount of COVID-19 cases, and a trained lightweight medical student (MS) network to perform on-device COVID-19 patient triage and follow-up.

Computed Tomography (CT) Trajectory Prediction +1

Towards Evaluating the Robustness of Chinese BERT Classifiers

no code implementations 7 Apr 2020 Boxin Wang, Boyuan Pan, Xin Li, Bo Li

Recent advances in large-scale language representation models such as BERT have improved the state-of-the-art performances in many NLP tasks.

Leveraging Planar Regularities for Point Line Visual-Inertial Odometry

no code implementations 16 Apr 2020 Xin Li, Yijia He, Jinlong Lin, Xiao Liu

To improve the accuracy of 3D mesh generation and localization, we propose a tightly-coupled monocular VIO system, PLP-VIO, which exploits point features and line features as well as plane regularities.

A Chinese Corpus for Fine-grained Entity Typing

1 code implementation LREC 2020 Chin Lee, Hongliang Dai, Yangqiu Song, Xin Li

In this paper, we introduce a corpus for Chinese fine-grained entity typing that contains 4,800 mentions manually labeled through crowdsourcing.

Cross-Lingual Transfer Entity Typing +1

Context-aware Helpfulness Prediction for Online Product Reviews

no code implementations 27 Apr 2020 Iyiola E. Olatunji, Xin Li, Wai Lam

In this paper, we propose a neural deep learning model that predicts the helpfulness score of a review.

Training Recurrent Neural Networks Online by Learning Explicit State Variables

no code implementations ICLR 2020 Somjit Nath, Vincent Liu, Alan Chan, Xin Li, Adam White, Martha White

Recurrent neural networks (RNNs) allow an agent to construct a state-representation from a stream of experience, which is essential in partially observable problems.

3D Face Anti-spoofing with Factorized Bilinear Coding

no code implementations 12 May 2020 Shan Jia, Xin Li, Chuanbo Hu, Guodong Guo, Zhengquan Xu

We have witnessed rapid advances in both face presentation attack models and presentation attack detection (PAD) in recent years.

Face Anti-Spoofing Face Presentation Attack Detection +1

Multi-scale Grouped Dense Network for VVC Intra Coding

no code implementations 16 May 2020 Xin Li, Simeng Sun, Zhizheng Zhang, Zhibo Chen

The Versatile Video Coding (H.266/VVC) standard achieves better image quality at the same bit budget than any other conventional image codec, such as BPG and JPEG.

Generative Adversarial Network

Robust Trajectory Forecasting for Multiple Intelligent Agents in Dynamic Scene

no code implementations 27 May 2020 Yanliang Zhu, Dongchun Ren, Mingyu Fan, Deheng Qian, Xin Li, Huaxia Xia

Trajectory forecasting, or trajectory prediction, of multiple interacting agents in dynamic scenes, is an important problem for many applications, such as robotic systems and autonomous driving.

Autonomous Driving Trajectory Forecasting

Defending against adversarial attacks on medical imaging AI system, classification or detection?

1 code implementation 24 Jun 2020 Xin Li, Deng Pan, Dongxiao Zhu

Medical imaging AI systems such as disease classification and segmentation are increasingly inspired and transformed from computer vision based AI systems.

Adversarial Defense General Classification

Explainable Recommendation via Interpretable Feature Mapping and Evaluation of Explainability

no code implementations 12 Jul 2020 Deng Pan, Xiangrui Li, Xin Li, Dongxiao Zhu

Latent factor collaborative filtering (CF) has been a widely used technique for recommender systems by learning the semantic representations of users and items.

Collaborative Filtering Explainable Recommendation +1

Multi-node Bert-pretraining: Cost-efficient Approach

no code implementations 1 Aug 2020 Jiahuang Lin, Xin Li, Gennady Pekhimenko

As a result, to train these models within a reasonable time, machine learning (ML) programmers often require advanced hardware setups such as the premium GPU-enabled NVIDIA DGX workstations or specialized accelerators such as Google's TPU Pods.

LSOTB-TIR: A Large-Scale High-Diversity Thermal Infrared Object Tracking Benchmark

1 code implementation 3 Aug 2020 Qiao Liu, Xin Li, Zhenyu He, Chenglong Li, Jun Li, Zikun Zhou, Di Yuan, Jing Li, Kai Yang, Nana Fan, Feng Zheng

We evaluate and analyze more than 30 trackers on LSOTB-TIR to provide a series of baselines, and the results show that deep trackers achieve promising performance.

Thermal Infrared Object Tracking Vocal Bursts Intensity Prediction

Cascade Graph Neural Networks for RGB-D Salient Object Detection

1 code implementation ECCV 2020 Ao Luo, Xin Li, Fan Yang, Zhicheng Jiao, Hong Cheng, Siwei Lyu

Current works either simply distill prior knowledge from the corresponding depth map for handling the RGB-image or blindly fuse color and geometric information to generate the coarse depth-aware representations, hindering the performance of RGB-D saliency detectors. In this work, we introduce Cascade Graph Neural Networks (Cas-Gnn), a unified framework which is capable of comprehensively distilling and reasoning the mutual benefits between these two data sources through a set of cascade graphs, to learn powerful representations for RGB-D salient object detection.

Object object-detection +3

MHSA-Net: Multi-Head Self-Attention Network for Occluded Person Re-Identification

1 code implementation10 Aug 2020 Hongchen Tan, Xiuping Liu, BaoCai Yin, Xin Li

This paper presents a novel person re-identification model, named Multi-Head Self-Attention Network (MHSA-Net), to prune unimportant information and capture key local information from person images.

Person Re-Identification

LIRA: Lifelong Image Restoration from Unknown Blended Distortions

no code implementations ECCV 2020 Jianzhao Liu, Jianxin Lin, Xin Li, Wei Zhou, Sen Liu, Zhibo Chen

Most existing image restoration networks are designed in a disposable way and catastrophically forget previously learned distortions when trained on a new distortion removal task.

Image Restoration SSIM

Contextual User Browsing Bandits for Large-Scale Online Mobile Recommendation

no code implementations21 Aug 2020 Xu He, Bo An, Yanghua Li, Haikai Chen, Qingyu Guo, Xin Li, Zhirong Wang

First, since we concern the reward of a set of recommended items, we model the online recommendation as a contextual combinatorial bandit problem and define the reward of a recommended set.

Detection of Genuine and Posed Facial Expressions of Emotion: A Review

no code implementations26 Aug 2020 Shan Jia, Shuo Wang, Chuanbo Hu, Paula Webster, Xin Li

Facial expressions of emotion play an important role in human social interactions.

Efficiency in Real-time Webcam Gaze Tracking

no code implementations2 Sep 2020 Amogh Gudi, Xin Li, Jan van Gemert

To do so, we evaluate the computational speed/accuracy trade-off for the CNN and the calibration effort/accuracy trade-off for screen calibration.

Computational Efficiency regression

Enhancing Dialogue Generation via Multi-Level Contrastive Learning

no code implementations19 Sep 2020 Xin Li, Piji Li, Yan Wang, Xiaojiang Liu, Wai Lam

Most of the existing works for dialogue generation are data-driven models trained directly on corpora crawled from websites.

Contrastive Learning Dialogue Generation +1

FAN: Frequency Aggregation Network for Real Image Super-resolution

no code implementations30 Sep 2020 Yingxue Pang, Xin Li, Xin Jin, Yaojun Wu, Jianzhao Liu, Sen Liu, Zhibo Chen

Specifically, we extract different frequencies of the LR image and pass them to a channel attention-grouped residual dense network (CA-GRDB) individually to output corresponding feature maps.

Image Super-Resolution SSIM

Deformable Kernel Convolutional Network for Video Extreme Super-Resolution

no code implementations1 Oct 2020 Xuan Xu, Xin Xiong, Jinge Wang, Xin Li

Thanks to newly designed Deformable Kernel Convolution Alignment (DKC_Align) and Deformable Kernel Spatial Attention (DKSA) modules, DKSAN can better exploit both spatial and temporal redundancies to facilitate the information propagation across different layers.

Video Super-Resolution

Limitations of Autoregressive Models and Their Alternatives

no code implementations NAACL 2021 Chu-Cheng Lin, Aaron Jaech, Xin Li, Matthew R. Gormley, Jason Eisner

Standard autoregressive language models perform only polynomial-time computation to compute the probability of the next symbol.

Language Modelling

Unsupervised Cross-lingual Adaptation for Sequence Tagging and Beyond

no code implementations23 Oct 2020 Xin Li, Lidong Bing, Wenxuan Zhang, Zheng Li, Wai Lam

Cross-lingual adaptation with multilingual pre-trained language models (mPTLMs) mainly consists of two lines of works: zero-shot approach and translation-based approach, which have been studied extensively on the sequence-level tasks.

Cross-Lingual Transfer Translation

Muti-view Mouse Social Behaviour Recognition with Deep Graphical Model

1 code implementation4 Nov 2020 Zheheng Jiang, Feixiang Zhou, Aite Zhao, Xin Li, Ling Li, DaCheng Tao, Xuelong Li, Huiyu Zhou

To address this problem, we here propose a novel multiview latent-attention and dynamic discriminative model that jointly learns view-specific and view-shared sub-structures, where the former captures unique dynamics of each view whilst the latter encodes the interaction between the views.

Magnetoelectric coupling and decoupling in multiferroic hexagonal YbFeO3 thin films

no code implementations13 Nov 2020 Yu Yun, Xin Li, Arashdeep Singh Thind, Yuewei Yin, Hao liu, Qiang Li, Wenbin Wang, Alpha T. N'Diaye, Corbyn Mellinger, Xuanyuan Jiang, Rohan Mishra, Xiaoshan Xu

The coupling between ferroelectric and magnetic orders in multiferroic materials and the nature of magnetoelectric (ME) effects are enduring experimental challenges.

Materials Science Other Condensed Matter

Learning Omni-frequency Region-adaptive Representations for Real Image Super-Resolution

no code implementations11 Dec 2020 Xin Li, Xin Jin, Tao Yu, Yingxue Pang, Simeng Sun, Zhizheng Zhang, Zhibo Chen

Traditional single image super-resolution (SISR) methods that focus on solving single and uniform degradation (i.e., bicubic down-sampling), typically suffer from poor performance when applied into real-world low-resolution (LR) images due to the complicated realistic degradations.

Image Super-Resolution

Improving Adversarial Robustness via Probabilistically Compact Loss with Logit Constraints

1 code implementation14 Dec 2020 Xin Li, Xiangrui Li, Deng Pan, Dongxiao Zhu

This inspires us to propose a new Probabilistically Compact (PC) loss with logit constraints which can be used as a drop-in replacement for cross-entropy (CE) loss to improve CNN's adversarial robustness.

Adversarial Robustness

Learned Block-based Hybrid Image Compression

no code implementations17 Dec 2020 Yaojun Wu, Xin Li, Zhizheng Zhang, Xin Jin, Zhibo Chen

Recent works on learned image compression perform encoding and decoding processes in a full-resolution manner, resulting in two problems when deployed for practical applications.

Blocking Image Compression +2

Understanding Team Collaboration in Artificial Intelligence from the perspective of Geographic Distance

no code implementations25 Dec 2020 Xuli Tang, Xin Li, Ying Ding, Feicheng Ma

This paper analyzes team collaboration in the field of Artificial Intelligence (AI) from the perspective of geographic distance.

Learning Inter- and Intraframe Representations for Non-Lambertian Photometric Stereo

no code implementations26 Dec 2020 Yanlong Cao, Binjie Ding, Zewei He, Jiangxin Yang, Jingxi Chen, Yanpeng Cao, Xin Li

Photometric stereo provides an important method for high-fidelity 3D reconstruction based on multiple intensity images captured under different illumination directions.

3D Reconstruction

TEAC: Intergrating Trust Region and Max Entropy Actor Critic for Continuous Control

1 code implementation1 Jan 2021 Hongyu Zang, Xin Li, Li Zhang, Peiyao Zhao, Mingzhong Wang

Trust region methods and maximum entropy methods are two state-of-the-art branches used in reinforcement learning (RL) for the benefits of stability and exploration in continuous environments, respectively.

Continuous Control Reinforcement Learning (RL)

Horizontally Fused Training Array: An Effective Hardware Utilization Squeezer for Training Novel Deep Learning Models

2 code implementations3 Feb 2021 Shang Wang, Peiming Yang, Yuxuan Zheng, Xin Li, Gennady Pekhimenko

Driven by the tremendous effort in researching novel deep learning (DL) algorithms, the training cost of developing new models increases staggeringly in recent years.

DeepReduce: A Sparse-tensor Communication Framework for Distributed Deep Learning

1 code implementation NeurIPS 2021 Kelly Kostopoulou, Hang Xu, Aritra Dutta, Xin Li, Alexandros Ntoulas, Panos Kalnis

This paper introduces DeepReduce, a versatile framework for the compressed communication of sparse tensors, tailored for distributed deep learning.

Local Patch AutoAugment with Multi-Agent Collaboration

2 code implementations20 Mar 2021 Shiqi Lin, Tao Yu, Ruoyu Feng, Xin Li, Xin Jin, Zhibo Chen

We formulate it as a multi-agent reinforcement learning (MARL) problem, where each agent learns an augmentation policy for each patch based on its content together with the semantics of the whole image.

Data Augmentation Fine-Grained Image Recognition +2

A Detector-oblivious Multi-arm Network for Keypoint Matching

1 code implementation2 Apr 2021 Xuelun Shen, Cheng Wang, Xin Li, Qian Hu, Jingyi Zhang

This paper presents a matching network to establish point correspondence between images.

Mutual Graph Learning for Camouflaged Object Detection

1 code implementation CVPR 2021 Qiang Zhai, Xin Li, Fan Yang, Chenglizhao Chen, Hong Cheng, Deng-Ping Fan

Automatically detecting/segmenting object(s) that blend in with their surroundings is difficult for current models.

Graph Learning Object +2

Searching Efficient Model-guided Deep Network for Image Denoising

no code implementations6 Apr 2021 Qian Ning, Weisheng Dong, Xin Li, Jinjian Wu, Leida Li, Guangming Shi

Similar to the success of NAS in high-level vision tasks, it is possible to find a memory and computationally efficient solution via NAS with highly competent denoising performance.

Image Denoising Neural Architecture Search

Drafting and Revision: Laplacian Pyramid Network for Fast High-Quality Artistic Style Transfer

2 code implementations CVPR 2021 Tianwei Lin, Zhuoqi Ma, Fu Li, Dongliang He, Xin Li, Errui Ding, Nannan Wang, Jie Li, Xinbo Gao

Inspired by the common painting process of drawing a draft and revising the details, we introduce a novel feed-forward method named Laplacian Pyramid Network (LapStyle).

Style Transfer

Learning Semantic Person Image Generation by Region-Adaptive Normalization

1 code implementation CVPR 2021 Zhengyao Lv, Xiaoming Li, Xin Li, Fu Li, Tianwei Lin, Dongliang He, WangMeng Zuo

In the first stage, we predict the target semantic parsing maps to eliminate the difficulties of pose transfer and further benefit the latter translation of per-region appearance style.

Pose Transfer Semantic Parsing +1

Image Inpainting by End-to-End Cascaded Refinement with Mask Awareness

1 code implementation28 Apr 2021 Manyu Zhu, Dongliang He, Xin Li, Chao Li, Fu Li, Xiao Liu, Errui Ding, Zhaoxiang Zhang

Inpainting arbitrary missing regions is challenging because learning valid features for various masked regions is nontrivial.

Image Inpainting valid

DeepReduce: A Sparse-tensor Communication Framework for Federated Deep Learning

1 code implementation NeurIPS 2021 Hang Xu, Kelly Kostopoulou, Aritra Dutta, Xin Li, Alexandros Ntoulas, Panos Kalnis

DeepReduce is orthogonal to existing gradient sparsifiers and can be applied in conjunction with them, transparently to the end-user, to significantly lower the communication overhead.

Task-driven Semantic Coding via Reinforcement Learning

1 code implementation7 Jun 2021 Xin Li, Jun Shi, Zhibo Chen

However, the traditional hybrid coding framework cannot be optimized in an end-to-end manner, which makes task-driven semantic fidelity metric unable to be automatically integrated into the rate-distortion optimization process.

Face Detection License Plate Detection +4

Probabilistic Model Distillation for Semantic Correspondence

1 code implementation CVPR 2021 Xin Li, Deng-Ping Fan, Fan Yang, Ao Luo, Hong Cheng, Zicheng Liu

We address this problem with the use of a novel Probabilistic Model Distillation (PMD) approach which transfers knowledge learned by a probabilistic teacher model on synthetic data to a static student model with the use of unlabeled real image pairs.

Representation Learning Semantic correspondence

Self-Supervised Tracking via Target-Aware Data Synthesis

no code implementations21 Jun 2021 Xin Li, Wenjie Pei, YaoWei Wang, Zhenyu He, Huchuan Lu, Ming-Hsuan Yang

While deep-learning based tracking methods have achieved substantial progress, they entail large-scale and high-quality annotated data for sufficient training.

Representation Learning Self-Supervised Learning +1

Metasurface-Enabled On-Chip Multiplexed Diffractive Neural Networks in the Visible

no code implementations13 Jul 2021 Xuhao Luo, Yueqiang Hu, Xin Li, Xiangnian Ou, Jiajie Lai, Na Liu, Huigao Duan

Replacing electrons with photons is a compelling route towards light-speed, highly parallel, and low-power artificial intelligence computing.

Autonomous Driving

AdaAttN: Revisit Attention Mechanism in Arbitrary Neural Style Transfer

3 code implementations ICCV 2021 Songhua Liu, Tianwei Lin, Dongliang He, Fu Li, Meiling Wang, Xin Li, Zhengxing Sun, Qian Li, Errui Ding

Finally, the content feature is normalized so that they demonstrate the same local feature statistics as the calculated per-point weighted style feature statistics.

Style Transfer Video Style Transfer

Saliency-Associated Object Tracking

1 code implementation ICCV 2021 Zikun Zhou, Wenjie Pei, Xin Li, Hongpeng Wang, Feng Zheng, Zhenyu He

A potential limitation of such trackers is that not all patches are equally informative for tracking.

Object Object Tracking

Paint Transformer: Feed Forward Neural Painting with Stroke Prediction

2 code implementations ICCV 2021 Songhua Liu, Tianwei Lin, Dongliang He, Fu Li, Ruifeng Deng, Xin Li, Errui Ding, Hao Wang

Neural painting refers to the procedure of producing a series of strokes for a given image and non-photo-realistically recreating it using neural networks.

Object Detection Reinforcement Learning (RL) +1

VisEvent: Reliable Object Tracking via Collaboration of Frame and Event Flows

2 code implementations11 Aug 2021 Xiao Wang, Jianing Li, Lin Zhu, Zhipeng Zhang, Zhe Chen, Xin Li, YaoWei Wang, Yonghong Tian, Feng Wu

Different from visible cameras which record intensity images frame by frame, the biologically inspired event camera produces a stream of asynchronous and sparse events with much lower latency.

Object Tracking

Identifying Illicit Drug Dealers on Instagram with Large-scale Multimodal Data Fusion

no code implementations18 Aug 2021 Chuanbo Hu, Minglei Yin, Bin Liu, Xin Li, Yanfang Ye

Unlike existing methods that focus on posting-based detection, we propose to tackle the problem of illicit drug dealer identification by constructing a large-scale multimodal dataset named Identifying Drug Dealers on Instagram (IDDIG).

Community Detection

Detection of Illicit Drug Trafficking Events on Instagram: A Deep Multimodal Multilabel Learning Approach

no code implementations19 Aug 2021 Chuanbo Hu, Minglei Yin, Bin Liu, Xin Li, Yanfang Ye

Accordingly, accurate detection of illicit drug trafficking events (IDTEs) from social media has become even more challenging.

Marketing

Characterizing interdisciplinarity in drug research: a translational science perspective

no code implementations4 Sep 2021 Xin Li, Xuli Tang

Despite the significant advances in life science, it still takes decades to translate a basic drug discovery into a cure for human disease.

Drug Discovery

Multilingual AMR Parsing with Noisy Knowledge Distillation

1 code implementation Findings (EMNLP) 2021 Deng Cai, Xin Li, Jackie Chun-Sing Ho, Lidong Bing, Wai Lam

We study multilingual AMR parsing from the perspective of knowledge distillation, where the aim is to learn and improve a multilingual AMR parser by using an existing English parser as its teacher.

AMR Parsing Knowledge Distillation

Aspect Sentiment Quad Prediction as Paraphrase Generation

1 code implementation EMNLP 2021 Wenxuan Zhang, Yang Deng, Xin Li, Yifei Yuan, Lidong Bing, Wai Lam

Aspect-based sentiment analysis (ABSA) has been extensively studied in recent years, which typically involves four fundamental sentiment elements, including the aspect category, aspect term, opinion term, and sentiment polarity.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +2

Vector-quantized Image Modeling with Improved VQGAN

5 code implementations ICLR 2022 Jiahui Yu, Xin Li, Jing Yu Koh, Han Zhang, Ruoming Pang, James Qin, Alexander Ku, Yuanzhong Xu, Jason Baldridge, Yonghui Wu

Motivated by this success, we explore a Vector-quantized Image Modeling (VIM) approach that involves pretraining a Transformer to predict rasterized image tokens autoregressively.

Image Generation Representation Learning +1

Probabilistic prediction of the heave motions of a semi-submersible by a deep learning problem model

1 code implementation9 Oct 2021 Xiaoxian Guo, Xiantao Zhang, Xinliang Tian, Wenyue Lu, Xin Li

In this study, we extend a deep learning (DL) model, which could predict the heave and surge motions of a floating semi-submersible 20 to 50 seconds ahead with good accuracy, to quantify its uncertainty of the predictive time series with the help of the dropout technique.

Motion Compensation motion prediction +2

Deep Models with Fusion Strategies for MVP Point Cloud Registration

1 code implementation18 Oct 2021 Lifa Zhu, Changwei Lin, Dongrui Liu, Xin Li, Francisco Gómez-Fernández

The main goal of point cloud registration in Multi-View Partial (MVP) Challenge 2021 is to estimate a rigid transformation to align a point cloud pair.

Point Cloud Registration

Internationalizing AI: Evolution and Impact of Distance Factors

no code implementations10 Nov 2021 Xuli Tang, Xin Li, Feicheng Ma

A framework including 13 indicators to quantify the distance factors between countries from 5 perspectives (i.e., geographic distance, economic distance, cultural distance, academic distance, and industrial distance) is proposed.

Descriptive

Enhancing Multilingual Language Model with Massive Multilingual Knowledge Triples

1 code implementation22 Nov 2021 Linlin Liu, Xin Li, Ruidan He, Lidong Bing, Shafiq Joty, Luo Si

In this work, we explore methods to make better use of the multilingual annotation and language agnostic property of KG triples, and present novel knowledge based multilingual language models (KMLMs) trained directly on the knowledge triples.

Knowledge Graphs Language Modelling +9

NomMer: Nominate Synergistic Context in Vision Transformer for Visual Recognition

1 code implementation CVPR 2022 Hao liu, Xinghua Jiang, Xin Li, Zhimin Bao, Deqiang Jiang, Bo Ren

For the sake of trade-off between efficiency and performance, a group of works merely perform SA operation within local patches, whereas the global contextual information is abandoned, which would be indispensable for visual recognition tasks.

object-detection Object Detection +1

A Close Look at Few-shot Real Image Super-resolution from the Distortion Relation Perspective

no code implementations25 Nov 2021 Xin Li, Xin Jin, Jun Fu, Xiaoyuan Yu, Bei Tong, Zhibo Chen

Under this brand-new scenario, we propose Distortion Relation guided Transfer Learning (DRTL) for the few-shot RealSR by transferring the rich restoration knowledge from auxiliary distortions (i.e., synthetic distortions) to the target RealSR under the guidance of distortion relation.

Image Restoration Image Super-Resolution +4

Confounder Identification-free Causal Visual Feature Learning

no code implementations26 Nov 2021 Xin Li, Zhizheng Zhang, Guoqiang Wei, Cuiling Lan, Wenjun Zeng, Xin Jin, Zhibo Chen

In this paper, we propose a novel Confounder Identification-free Causal Visual Feature Learning (CICF) method, which obviates the need for identifying confounders.

Domain Generalization Meta-Learning

Neural Collaborative Graph Machines for Table Structure Recognition

no code implementations CVPR 2022 Hao liu, Xin Li, Bing Liu, Deqiang Jiang, Yinsong Liu, Bo Ren

We also show that the proposed NCGM can modulate collaborative pattern of different modalities conditioned on the context of intra-modality cues, which is vital for diversified table cases.

Table Recognition

Simple Contrastive Representation Adversarial Learning for NLP Tasks

no code implementations26 Nov 2021 Deshui Miao, JiaQi Zhang, WenBo Xie, Jian Song, Xin Li, Lijuan Jia, Ning Guo

In this paper, adversarial training is performed to generate challenging and harder learning adversarial examples over the embedding space of NLP as learning pairs.

Contrastive Learning Natural Language Understanding +4

Document Layout Analysis with Aesthetic-Guided Image Augmentation

no code implementations27 Nov 2021 Tianlong Ma, Xingjiao Wu, Xin Li, Xiangcheng Du, Zhao Zhou, Liang Xue, Cheng Jin

To measure the proposed image layer modeling method, we propose a manually-labeled non-Manhattan layout fine-grained segmentation dataset named FPD.

Document Layout Analysis document understanding +2

Uncertainty-Driven Loss for Single Image Super-Resolution

no code implementations NeurIPS 2021 Qian Ning, Weisheng Dong, Xin Li, Jinjian Wu, Guangming Shi

Specifically, we introduce variance estimation characterizing the uncertainty on a pixel-by-pixel basis into SISR solutions so the targeted pixels in a high-resolution image (mean) and their corresponding uncertainty (variance) can be learned simultaneously.

Image Super-Resolution

Interactive Model with Structural Loss for Language-based Abductive Reasoning

no code implementations1 Dec 2021 Linhao Li, Ming Xu, Yongfeng Dong, Xin Li, Ao Wang

Therefore, we propose to group instead of ranking the hypotheses and design a structural loss called "joint softmax focal loss" in this paper.

Language Modelling Natural Language Inference

An Informative Tracking Benchmark

1 code implementation13 Dec 2021 Xin Li, Qiao Liu, Wenjie Pei, Qiuhong Shen, YaoWei Wang, Huchuan Lu, Ming-Hsuan Yang

Along with the rapid progress of visual tracking, existing benchmarks become less informative due to redundancy of samples and weak discrimination between current trackers, making evaluations on all datasets extremely time-consuming.

Visual Tracking

Robust Depth Completion with Uncertainty-Driven Loss Functions

no code implementations15 Dec 2021 Yufan Zhu, Weisheng Dong, Leida Li, Jinjian Wu, Xin Li, Guangming Shi

In this work, we introduce uncertainty-driven loss functions to improve the robustness of depth completion and handle the uncertainty in depth completion.

Depth Completion

SimSR: Simple Distance-based State Representation for Deep Reinforcement Learning

2 code implementations31 Dec 2021 Hongyu Zang, Xin Li, Mingzhong Wang

This work explores how to learn robust and generalizable state representation from image-based observations with deep reinforcement learning methods.

reinforcement-learning Reinforcement Learning (RL)

Multi-Object Tracking Meets Moving UAV

no code implementations CVPR 2022 Shuai Liu, Xin Li, Huchuan Lu, You He

Multi-object tracking in unmanned aerial vehicle (UAV) videos is an important vision task and can be applied in a wide range of applications.

Multi-Object Tracking Object

A Survey on Applications of Digital Human Avatars toward Virtual Co-presence

no code implementations11 Jan 2022 Matthew Korban, Xin Li

This paper investigates different approaches to build and use digital human avatars toward interactive Virtual Co-presence (VCP) environments.

Machine learning prediction for mean motion resonance behaviour -- The planar case

no code implementations18 Jan 2022 Xin Li, Jian Li, Zhihong Jeff Xia, Nikolaos Georgakarakos

Most recently, machine learning has been used to study the dynamics of integrable Hamiltonian systems and the chaotic 3-body problem.

BIG-bench Machine Learning Numerical Integration

A multi-domain virtual network embedding algorithm with delay prediction

no code implementations3 Feb 2022 Peiying Zhang, Xue Pang, Yongjing Ni, Haipeng Yao, Xin Li

Virtual network embedding (VNE) is a crucial part of network virtualization (NV), which aims to map the virtual networks (VNs) to a shared substrate network (SN).

Network Embedding

Multi-modal Sensor Fusion for Auto Driving Perception: A Survey

no code implementations6 Feb 2022 Keli Huang, Botian Shi, Xiang Li, Xin Li, Siyuan Huang, Yikang Li

Multi-modal fusion is a fundamental task for the perception of an autonomous driving system, which has recently intrigued many researchers.

Autonomous Driving object-detection +3

Learning Optical Flow with Adaptive Graph Reasoning

1 code implementation8 Feb 2022 Ao Luo, Fan Yang, Kunming Luo, Xin Li, Haoqiang Fan, Shuaicheng Liu

Our key idea is to decouple the context reasoning from the matching procedure, and exploit scene information to effectively assist motion estimation by learning to reason over the adaptive graph.

Motion Estimation Optical Flow Estimation +1

Low-Rank Phase Retrieval with Structured Tensor Models

no code implementations15 Feb 2022 Soo Min Kwon, Xin Li, Anand D. Sarwate

We study the low-rank phase retrieval problem, where the objective is to recover a sequence of signals (typically images) given the magnitude of linear measurements of those signals.

Retrieval

Model Attribution of Face-swap Deepfake Videos

1 code implementation25 Feb 2022 Shan Jia, Xin Li, Siwei Lyu

Then we take Deepfakes model attribution as a multiclass classification task and propose a spatial and temporal attention based method to explore the differences among Deepfakes in the new dataset.

Attribute Face Swapping

Towards Bidirectional Arbitrary Image Rescaling: Joint Optimization and Cycle Idempotence

no code implementations CVPR 2022 Zhihong Pan, Baopu Li, Dongliang He, Mingde Yao, Wenhao Wu, Tianwei Lin, Xin Li, Errui Ding

Deep learning based single image super-resolution models have been widely studied and superb results are achieved in upscaling low-resolution images with fixed scale factor and downscaling degradation kernel.

Image Super-Resolution

A Survey on Aspect-Based Sentiment Analysis: Tasks, Methods, and Challenges

1 code implementation2 Mar 2022 Wenxuan Zhang, Xin Li, Yang Deng, Lidong Bing, Wai Lam

More specifically, we provide a new taxonomy for ABSA which organizes existing studies from the axes of concerned sentiment elements, with an emphasis on recent advances of compound ABSA tasks.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA)

Aggregate effects of advertising decisions: a complex systems look at search engine advertising via an experimental study

no code implementations4 Mar 2022 Yanwu Yang, Xin Li, Bernard J. Jansen, Daniel Zeng

Originality: This is one of the first research works to explore collective group decisions and resulting phenomena in the complex context of search engine advertising via developing and validating a simulation framework that supports assessments of various advertising strategies and estimations of the impact of mechanisms on the search market.

Context-aware Visual Tracking with Joint Meta-updating

no code implementations4 Apr 2022 Qiuhong Shen, Xin Li, Fanyang Meng, Yongsheng Liang

These deep trackers usually do not perform online update or update single sub-branch of the tracking model, for which they cannot adapt to the appearance variation of objects.

Meta-Learning Visual Object Tracking +1

Unsupervised Learning of Accurate Siamese Tracking

1 code implementation CVPR 2022 Qiuhong Shen, Lei Qiao, Jinyang Guo, Peixia Li, Xin Li, Bo Li, Weitao Feng, Weihao Gan, Wei Wu, Wanli Ouyang

As unlimited self-supervision signals can be obtained by tracking a video along a cycle in time, we investigate evolving a Siamese tracker by tracking videos forward-backward.

Visual Object Tracking

DR-GAN: Distribution Regularization for Text-to-Image Generation

1 code implementation17 Apr 2022 Hongchen Tan, Xiuping Liu, BaoCai Yin, Xin Li

This paper presents a new Text-to-Image generation model, named Distribution Regularization Generative Adversarial Network (DR-GAN), to generate images from text descriptions from improved distribution learning.

Generative Adversarial Network Text-to-Image Generation

The Devil is in the Frequency: Geminated Gestalt Autoencoder for Self-Supervised Visual Pre-Training

no code implementations18 Apr 2022 Hao liu, Xinghua Jiang, Xin Li, Antai Guo, Deqiang Jiang, Bo Ren

The self-supervised Masked Image Modeling (MIM) schema, following "mask-and-reconstruct" pipeline of recovering contents from masked image, has recently captured the increasing interest in the multimedia community, owing to the excellent ability of learning visual representation from unlabeled data.

Gene Function Prediction with Gene Interaction Networks: A Context Graph Kernel Approach

no code implementations22 Apr 2022 Xin Li, Hsinchun Chen, Jiexun Li, Zhu Zhang

Predicting gene functions is a challenge for biologists in the post genomic era.

Global Mapping of Gene/Protein Interactions in PubMed Abstracts: A Framework and an Experiment with P53 Interactions

no code implementations22 Apr 2022 Xin Li, Hsinchun Chen, Zan Huang, Hua Su, Jesse D. Martinez

In this paper, we propose a comprehensive framework for constructing and analyzing large-scale gene functional networks based on the gene/protein interactions extracted from biomedical literature repositories using text mining tools.
