Search Results for author: Xin Li

Found 371 papers, 160 papers with code

GANE: A Generative Adversarial Network Embedding

no code implementations18 May 2018 Huiting Hong, Xin Li, Mingzhong Wang

Network embedding has become a hot research topic recently which can provide low-dimensional feature representations for many machine learning applications.

Clustering Generative Adversarial Network +2

On Improving Deep Reinforcement Learning for POMDPs

no code implementations17 Apr 2018 Pengfei Zhu, Xin Li, Pascal Poupart, Guanghui Miao

Deep Reinforcement Learning (RL) recently emerged as one of the most competitive approaches for learning in sequential decision making problems with fully observable environments, e. g., computer Go.

Atari Games Decision Making +4

Perceptually Optimized Generative Adversarial Network for Single Image Dehazing

no code implementations3 May 2018 Yixin Du, Xin Li

To overcome this weakness, we propose a direct deep learning approach toward image dehazing bypassing the step of transmission map estimation and facilitating end-to-end perceptual optimization.

Denoising Generative Adversarial Network +2

Weighted Low-Rank Approximation of Matrices and Background Modeling

no code implementations15 Apr 2018 Aritra Dutta, Xin Li, Peter Richtarik

We primarily study a special a weighted low-rank approximation of matrices and then apply it to solve the background modeling problem.

ReHAR: Robust and Efficient Human Activity Recognition

no code implementations27 Feb 2018 Xin Li, Mooi Choo Chuah

The whole model is trained end-to-end to allow meaningful representations to be generated for the final activity recognition.

Human Activity Recognition Optical Flow Estimation

Joint Demosaicing and Denoising with Perceptual Optimization on a Generative Adversarial Network

no code implementations13 Feb 2018 Weishong Dong, Ming Yuan, Xin Li, Guangming Shi

Image demosaicing - one of the most important early stages in digital camera pipelines - addressed the problem of reconstructing a full-resolution image from so-called color-filter-arrays.

Demosaicking Denoising +2

Learning with Rethinking: Recurrently Improving Convolutional Neural Networks through Feedback

no code implementations15 Aug 2017 Xin Li, Zequn Jie, Jiashi Feng, Changsong Liu, Shuicheng Yan

However, most of the existing CNN models only learn features through a feedforward structure and no feedback information from top to bottom layers is exploited to enable the networks to refine themselves.

Prune the Convolutional Neural Networks with Sparse Shrink

no code implementations8 Aug 2017 Xin Li, Changsong Liu

These results have demonstrated the effectiveness of our "Sparse Shrink" algorithm.

FoveaNet: Perspective-aware Urban Scene Parsing

no code implementations ICCV 2017 Xin Li, Zequn Jie, Wei Wang, Changsong Liu, Jimei Yang, Xiaohui Shen, Zhe Lin, Qiang Chen, Shuicheng Yan, Jiashi Feng

Thus, they suffer from heterogeneous object scales caused by perspective projection of cameras on actual scenes and inevitably encounter parsing failures on distant objects as well as other boundary and recognition errors.

Scene Parsing

Weighted Low Rank Approximation for Background Estimation Problems

no code implementations4 Jul 2017 Aritra Dutta, Xin Li

Classical principal component analysis (PCA) is not robust to the presence of sparse outliers in the data.

A Batch-Incremental Video Background Estimation Model using Weighted Low-Rank Approximation of Matrices

no code implementations2 Jul 2017 Aritra Dutta, Xin Li, Peter Richtárik

Principal component pursuit (PCP) is a state-of-the-art approach for background estimation problems.

ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA

no code implementations1 Dec 2016 Song Han, Junlong Kang, Huizi Mao, Yiming Hu, Xin Li, Yubin Li, Dongliang Xie, Hong Luo, Song Yao, Yu Wang, Huazhong Yang, William J. Dally

Evaluated on the LSTM for speech recognition benchmark, ESE is 43x and 3x faster than Core i7 5930k CPU and Pascal Titan X GPU implementations.

Quantization speech-recognition +1

Cross-scale predictive dictionaries

no code implementations16 Nov 2015 Vishwanath Saragadam, Xin Li, Aswin Sankaranarayanan

Sparse representations using data dictionaries provide an efficient model particularly for signals that do not enjoy alternate analytic sparsifying transformations.

Video Scene Parsing with Predictive Feature Learning

no code implementations ICCV 2017 Xiaojie Jin, Xin Li, Huaxin Xiao, Xiaohui Shen, Zhe Lin, Jimei Yang, Yunpeng Chen, Jian Dong, Luoqi Liu, Zequn Jie, Jiashi Feng, Shuicheng Yan

In this way, the network can effectively learn to capture video dynamics and temporal context, which are critical clues for video scene parsing, without requiring extra manual annotations.

Representation Learning Scene Parsing

Detecting Suicidal Ideation in Chinese Microblogs with Psychological Lexicons

no code implementations4 Nov 2014 Xiaolei Huang, Lei Zhang, Tianli Liu, David Chiu, Tingshao Zhu, Xin Li

Currently, we have identified 53 known suicidal cases who posted suicide notes on Weibo prior to their deaths. We explore linguistic features of these known cases using a psychological lexicon dictionary, and train an effective suicidal Weibo post detection model.

BIG-bench Machine Learning

Learning Hybrid Sparsity Prior for Image Restoration: Where Deep Learning Meets Sparse Coding

no code implementations18 Jul 2018 Fangfang Wu, Weisheng Dong, Guangming Shi, Xin Li

State-of-the-art approaches toward image restoration can be classified into model-based and learning-based.

Image Restoration

Superimposition-guided Facial Reconstruction from Skull

no code implementations28 Sep 2018 Celong Liu, Xin Li

We develop a new algorithm to perform facial reconstruction from a given skull.

Facial Inpainting

Learning Parametric Sparse Models for Image Super-Resolution

no code implementations NeurIPS 2016 Yongbo Li, Weisheng Dong, Xuemei Xie, Guangming Shi, Xin Li, Donglai Xu

More specifically, the parametric sparse prior of the desirable high-resolution (HR) image patches are learned from both the input low-resolution (LR) image and a training image dataset.

Image Super-Resolution

CONet: A Cognitive Ocean Network

no code implementations9 Jan 2019 Huimin Lu, Dong Wang, Yujie Li, Jianru Li, Xin Li, Hyoungseop Kim, Seiichi Serikawa, Iztok Humar

The Cognitive Ocean Network (CONet) will become the mainstream of future ocean science and engineering developments.

Adaptive Active Learning for Image Classification

no code implementations CVPR 2013 Xin Li, Yuhong Guo

Recently active learning has attracted a lot of attention in computer vision field, as it is time and cost consuming to prepare a good set of labeled images for vision data analysis.

Active Learning Classification +4

Simplified Mirror-Based Camera Pose Computation via Rotation Averaging

no code implementations CVPR 2015 Gucan Long, Laurent Kneip, Xin Li, Xiaohu Zhang, Qifeng Yu

Our theoretical contribution extends the applicability of rotation averaging to a more general case, and enables mirror-based pose estimation in closed-form under the chordal L2-metric, or in an outlier-robust way by employing iterative L1-norm averaging.

Camera Calibration Pose Estimation

Object-Aware Dense Semantic Correspondence

no code implementations CVPR 2017 Fan Yang, Xin Li, Hong Cheng, Jianping Li, Leiting Chen

To address these problems, this paper proposes an object-aware method to estimate per-pixel correspondences from semantic to low-level by learning a classifier for each selected discriminative grid cell and guiding the localization of every pixel under the semantic constraint.

Object Semantic correspondence

Low-Rank Tensor Approximation With Laplacian Scale Mixture Modeling for Multiframe Image Denoising

no code implementations ICCV 2015 Weisheng Dong, Guangyu Li, Guangming Shi, Xin Li, Yi Ma

Patch-based low-rank models have shown effective in exploiting spatial redundancy of natural images especially for the application of image denoising.

Dictionary Learning Image Denoising

Semi-Supervised Zero-Shot Classification With Label Representation Learning

no code implementations ICCV 2015 Xin Li, Yuhong Guo, Dale Schuurmans

Most existing zero-shot learning methods require a user to first provide a set of semantic visual attributes for each class as side information before applying a two-step prediction procedure that introduces an intermediate attribute prediction problem.

Attribute Classification +4

Iris R-CNN: Accurate Iris Segmentation in Non-cooperative Environment

no code implementations25 Mar 2019 Chunyang Feng, Yufeng Sun, Xin Li

Despite the significant advances in iris segmentation, accomplishing accurate iris segmentation in non-cooperative environment remains a grand challenge.

Iris Segmentation Region Proposal +1

Target-Aware Deep Tracking

no code implementations CVPR 2019 Xin Li, Chao Ma, Baoyuan Wu, Zhenyu He, Ming-Hsuan Yang

Despite demonstrated successes for numerous vision tasks, the contributions of using pre-trained deep features for visual tracking are not as significant as that for object recognition.

Object Object Recognition +1

STN-Homography: estimate homography parameters directly

no code implementations6 Jun 2019 Qiang Zhou, Xin Li

In this paper, we introduce the STN-Homography model to directly estimate the homography matrix between image pair.

Homography Estimation

Vispi: Automatic Visual Perception and Interpretation of Chest X-rays

no code implementations MIDL 2019 Xin Li, Rui Cao, Dongxiao Zhu

Medical imaging contains the essential information for rendering diagnostic and treatment decisions.

Image Captioning

Reconstructing Perceived Images from Brain Activity by Visually-guided Cognitive Representation and Adversarial Learning

no code implementations27 Jun 2019 Ziqi Ren, Jie Li, Xuetong Xue, Xin Li, Fan Yang, Zhicheng Jiao, Xinbo Gao

In addition, we introduce a novel three-stage learning approach which enables the (cognitive) encoder to gradually distill useful knowledge from the paired (visual) encoder during the learning process.

Generative Adversarial Network Image Reconstruction +2

Small and Practical BERT Models for Sequence Labeling

no code implementations IJCNLP 2019 Henry Tsai, Jason Riesa, Melvin Johnson, Naveen Arivazhagan, Xin Li, Amelia Archer

We propose a practical scheme to train a single multilingual sequence labeling model that yields state of the art results and is small and fast enough to run on a single CPU.

Part-Of-Speech Tagging

Iterative Clustering with Game-Theoretic Matching for Robust Multi-consistency Correspondence

no code implementations3 Sep 2019 Chen Zhao, Jiaqi Yang, Ke Xian, Zhiguo Cao, Xin Li

Matching corresponding features between two images is a fundamental task to computer vision with numerous applications in object recognition, robotics, and 3D reconstruction.

3D Reconstruction Clustering +2

Spoofing and Anti-Spoofing with Wax Figure Faces

no code implementations12 Oct 2019 Shan Jia, Xin Li, Chuanbo Hu, Zhengquan Xu

In this work, we introduce a wax figure face database (WFFD) as a novel and super-realistic 3D face presentation attack.

Face Detection Face Recognition +1

Sparse estimation via $\ell_q$ optimization method in high-dimensional linear regression

no code implementations12 Nov 2019 Xin Li, Yaohua Hu, Chong Li, Xiaoqi Yang, Tianzi Jiang

In this paper, we discuss the statistical properties of the $\ell_q$ optimization methods $(0<q\leq 1)$, including the $\ell_q$ minimization method and the $\ell_q$ regularization method, for estimating a sparse parameter from noisy observations in high-dimensional linear regression with either a deterministic or random design.

regression Vocal Bursts Intensity Prediction

Relevance-Promoting Language Model for Short-Text Conversation

no code implementations26 Nov 2019 Xin Li, Piji Li, Wei Bi, Xiaojiang Liu, Wai Lam

In this paper, we propose to formulate the STC task as a language modeling problem and tailor-make a training strategy to adapt a language model for response generation.

Language Modelling Response Generation +1

Digital Twin: Acquiring High-Fidelity 3D Avatar from a Single Image

no code implementations7 Dec 2019 Ruizhe Wang, Chih-Fan Chen, Hao Peng, Xudong Liu, Oliver Liu, Xin Li

We present an approach to generate high fidelity 3D face avatar with a high-resolution UV texture map from a single image.

Face Model Vocal Bursts Intensity Prediction

Point2Node: Correlation Learning of Dynamic-Node for Point Cloud Feature Modeling

no code implementations23 Dec 2019 Wenkai Han, Chenglu Wen, Cheng Wang, Xin Li, Qing Li

Point2Node can dynamically explore correlation among all graph nodes from different levels, and adaptively aggregate the learned features.

Hybrid Graph Neural Networks for Crowd Counting

no code implementations31 Jan 2020 Ao Luo, Fan Yang, Xin Li, Dong Nie, Zhicheng Jiao, Shangchen Zhou, Hong Cheng

In this paper, we present a novel network structure called Hybrid Graph Neural Network (HyGnn) which targets to relieve the problem by interweaving the multi-scale features for crowd density as well as its auxiliary task (localization) together and performing joint reasoning over a graph.

Crowd Counting

Improve SGD Training via Aligning Mini-batches

no code implementations23 Feb 2020 Xiangrui Li, Deng Pan, Xin Li, Dongxiao Zhu

In each iteration of SGD, a mini-batch from the training data is sampled and the true gradient of the loss function is estimated as the noisy gradient calculated on this mini-batch.

Contextual-Bandit Based Personalized Recommendation with Time-Varying User Interests

no code implementations29 Feb 2020 Xiao Xu, Fang Dong, Yanghua Li, Shaojian He, Xin Li

A contextual bandit problem is studied in a highly non-stationary environment, which is ubiquitous in various recommender systems due to the time-varying interests of users.

Recommendation Systems

Towards Evaluating the Robustness of Chinese BERT Classifiers

no code implementations7 Apr 2020 Boxin Wang, Boyuan Pan, Xin Li, Bo Li

Recent advances in large-scale language representation models such as BERT have improved the state-of-the-art performances in many NLP tasks.

Leveraging Planar Regularities for Point Line Visual-Inertial Odometry

no code implementations16 Apr 2020 Xin Li, Yijia He, Jinlong Lin, Xiao Liu

To improve the accuracy of 3D mesh generation and localization, we propose a tightly-coupled monocular VIO system, PLP-VIO, which exploits point features and line features as well as plane regularities.

Context-aware Helpfulness Prediction for Online Product Reviews

no code implementations27 Apr 2020 Iyiola E. Olatunji, Xin Li, Wai Lam

In this paper, we propose a neural deep learning model that predicts the helpfulness score of a review.

3D Face Anti-spoofing with Factorized Bilinear Coding

no code implementations12 May 2020 Shan Jia, Xin Li, Chuanbo Hu, Guodong Guo, Zhengquan Xu

We have witnessed rapid advances in both face presentation attack models and presentation attack detection (PAD) in recent years.

Face Anti-Spoofing Face Presentation Attack Detection +1

Multi-scale Grouped Dense Network for VVC Intra Coding

no code implementations16 May 2020 Xin Li, Simeng Sun, Zhizheng Zhang, Zhibo Chen

Versatile Video Coding (H. 266/VVC) standard achieves better image quality when keeping the same bits than any other conventional image codec, such as BPG, JPEG, and etc.

Generative Adversarial Network

Robust Trajectory Forecasting for Multiple Intelligent Agents in Dynamic Scene

no code implementations27 May 2020 Yanliang Zhu, Dongchun Ren, Mingyu Fan, Deheng Qian, Xin Li, Huaxia Xia

Trajectory forecasting, or trajectory prediction, of multiple interacting agents in dynamic scenes, is an important problem for many applications, such as robotic systems and autonomous driving.

Autonomous Driving Trajectory Forecasting

Defending against adversarial attacks on medical imaging AI system, classification or detection?

1 code implementation24 Jun 2020 Xin Li, Deng Pan, Dongxiao Zhu

Medical imaging AI systems such as disease classification and segmentation are increasingly inspired and transformed from computer vision based AI systems.

Adversarial Defense General Classification

Explainable Recommendation via Interpretable Feature Mapping and Evaluation of Explainability

no code implementations12 Jul 2020 Deng Pan, Xiangrui Li, Xin Li, Dongxiao Zhu

Latent factor collaborative filtering (CF) has been a widely used technique for recommender system by learning the semantic representations of users and items.

Collaborative Filtering Explainable Recommendation +1

Multi-node Bert-pretraining: Cost-efficient Approach

no code implementations1 Aug 2020 Jiahuang Lin, Xin Li, Gennady Pekhimenko

As a result, to train these models within a reasonable time, machine learning (ML) programmers often require advanced hardware setups such as the premium GPU-enabled NVIDIA DGX workstations or specialized accelerators such as Google's TPU Pods.

LIRA: Lifelong Image Restoration from Unknown Blended Distortions

no code implementations ECCV 2020 Jianzhao Liu, Jianxin Lin, Xin Li, Wei Zhou, Sen Liu, Zhibo Chen

Most existing image restoration networks are designed in a disposable way and catastrophically forget previously learned distortions when trained on a new distortion removal task.

Image Restoration SSIM

Contextual User Browsing Bandits for Large-Scale Online Mobile Recommendation

no code implementations21 Aug 2020 Xu He, Bo An, Yanghua Li, Haikai Chen, Qingyu Guo, Xin Li, Zhirong Wang

First, since we concern the reward of a set of recommended items, we model the online recommendation as a contextual combinatorial bandit problem and define the reward of a recommended set.

Detection of Genuine and Posed Facial Expressions of Emotion: A Review

no code implementations26 Aug 2020 Shan Jia, Shuo Wang, Chuanbo Hu, Paula Webster, Xin Li

Facial expressions of emotion play an important role in human social interactions.

Training Recurrent Neural Networks Online by Learning Explicit State Variables

no code implementations ICLR 2020 Somjit Nath, Vincent Liu, Alan Chan, Xin Li, Adam White, Martha White

Recurrent neural networks (RNNs) allow an agent to construct a state-representation from a stream of experience, which is essential in partially observable problems.

Efficiency in Real-time Webcam Gaze Tracking

no code implementations2 Sep 2020 Amogh Gudi, Xin Li, Jan van Gemert

To do so, we evaluate the computational speed/accuracy trade-off for the CNN and the calibration effort/accuracy trade-off for screen calibration.

Computational Efficiency regression

Sparse-to-Dense Depth Completion Revisited: Sampling Strategy and Graph Construction

no code implementations ECCV 2020 Xin Xiong, Haipeng Xiong, Ke Xian, Chen Zhao, Zhiguo Cao, Xin Li

Depth completion is a widely studied problem of predicting a dense depth map from a sparse set of measurements and a single RGB image.

Depth Completion graph construction

Enhancing Dialogue Generation via Multi-Level Contrastive Learning

no code implementations19 Sep 2020 Xin Li, Piji Li, Yan Wang, Xiaojiang Liu, Wai Lam

Most of the existing works for dialogue generation are data-driven models trained directly on corpora crawled from websites.

Contrastive Learning Dialogue Generation +1

FAN: Frequency Aggregation Network for Real Image Super-resolution

no code implementations30 Sep 2020 Yingxue Pang, Xin Li, Xin Jin, Yaojun Wu, Jianzhao Liu, Sen Liu, Zhibo Chen

Specifically, we extract different frequencies of the LR image and pass them to a channel attention-grouped residual dense network (CA-GRDB) individually to output corresponding feature maps.

Image Super-Resolution SSIM

Deformable Kernel Convolutional Network for Video Extreme Super-Resolution

no code implementations1 Oct 2020 Xuan Xu, Xin Xiong, Jinge Wang, Xin Li

Thanks to newly designed Deformable Kernel Convolution Alignment (DKC_Align) and Deformable Kernel Spatial Attention (DKSA) modules, DKSAN can better exploit both spatial and temporal redundancies to facilitate the information propagation across different layers.

Video Super-Resolution

Anion charge-lattice volume dependent Li ion migration in compounds with the face-centered cubic anion frameworks

no code implementations25 Oct 2019 Zhenming Xu, Xin Chen, Ronghan Chen, Xin Li, Hong Zhu

In this work, the face-centered cubic (fcc) anion frameworks were creatively constructed to study the effects of anion charge and lattice volume on the stability of lithium ion occupation and lithium ion migration.

Applied Physics

The similarity metric

no code implementations20 Nov 2001 Ming Li, Xin Chen, Xin Li, Bin Ma, Paul Vitanyi

A new class of distances appropriate for measuring similarity relations between sequences, say one type of similarity per distance, is studied.

Limitations of Autoregressive Models and Their Alternatives

no code implementations NAACL 2021 Chu-Cheng Lin, Aaron Jaech, Xin Li, Matthew R. Gormley, Jason Eisner

Standard autoregressive language models perform only polynomial-time computation to compute the probability of the next symbol.

Language Modelling

Unsupervised Cross-lingual Adaptation for Sequence Tagging and Beyond

no code implementations23 Oct 2020 Xin Li, Lidong Bing, Wenxuan Zhang, Zheng Li, Wai Lam

Cross-lingual adaptation with multilingual pre-trained language models (mPTLMs) mainly consists of two lines of works: zero-shot approach and translation-based approach, which have been studied extensively on the sequence-level tasks.

Cross-Lingual Transfer Translation

Magnetoelectric coupling and decoupling in multiferroic hexagonal YbFeO3 thin films

no code implementations13 Nov 2020 Yu Yun, Xin Li, Arashdeep Singh Thind, Yuewei Yin, Hao liu, Qiang Li, Wenbin Wang, Alpha T. N Diaye, Corbyn Mellinger, Xuanyuan Jiang, Rohan Mishra, Xiaoshan Xu

The coupling between ferroelectric and magnetic orders in multiferroic materials and the nature of magnetoelectric (ME) effects are enduring experimental challenges.

Materials Science Other Condensed Matter

Impact of Temperature and Relative Humidity on the Transmission of COVID-19: A Modeling Study in China and the United States

no code implementations9 Mar 2020 Jingyuan Wang, Ke Tang, Kai Feng, Xin Li, Weifeng Lv, Kun Chen, Fei Wang

Primary outcome measures: Regression analysis of the impact of temperature and relative humidity on the effective reproductive number (R value).

regression

Learning Omni-frequency Region-adaptive Representations for Real Image Super-Resolution

no code implementations11 Dec 2020 Xin Li, Xin Jin, Tao Yu, Yingxue Pang, Simeng Sun, Zhizheng Zhang, Zhibo Chen

Traditional single image super-resolution (SISR) methods that focus on solving single and uniform degradation (i. e., bicubic down-sampling), typically suffer from poor performance when applied into real-world low-resolution (LR) images due to the complicated realistic degradations.

Image Super-Resolution

Learned Block-based Hybrid Image Compression

no code implementations17 Dec 2020 Yaojun Wu, Xin Li, Zhizheng Zhang, Xin Jin, Zhibo Chen

Recent works on learned image compression perform encoding and decoding processes in a full-resolution manner, resulting in two problems when deployed for practical applications.

Blocking Image Compression +2

Learning Inter- and Intraframe Representations for Non-Lambertian Photometric Stereo

no code implementations26 Dec 2020 Yanlong Cao, Binjie Ding, Zewei He, Jiangxin Yang, Jingxi Chen, Yanpeng Cao, Xin Li

Photometric stereo provides an important method for high-fidelity 3D reconstruction based on multiple intensity images captured under different illumination directions.

3D Reconstruction

Understanding Team Collaboration in Artificial Intelligence from the perspective of Geographic Distance

no code implementations25 Dec 2020 Xuli Tang, Xin Li, Ying Ding, Feicheng Ma

This paper analyzes team collaboration in the field of Artificial Intelligence (AI) from the perspective of geographic distance.

Searching Efficient Model-guided Deep Network for Image Denoising

no code implementations6 Apr 2021 Qian Ning, Weisheng Dong, Xin Li, Jinjian Wu, Leida Li, Guangming Shi

Similar to the success of NAS in high-level vision tasks, it is possible to find a memory and computationally efficient solution via NAS with highly competent denoising performance.

Image Denoising Neural Architecture Search

Self-Supervised Tracking via Target-Aware Data Synthesis

no code implementations21 Jun 2021 Xin Li, Wenjie Pei, YaoWei Wang, Zhenyu He, Huchuan Lu, Ming-Hsuan Yang

While deep-learning based tracking methods have achieved substantial progress, they entail large-scale and high-quality annotated data for sufficient training.

Representation Learning Self-Supervised Learning +1

Metasurface-Enabled On-Chip Multiplexed Diffractive Neural Networks in the Visible

no code implementations13 Jul 2021 Xuhao Luo, Yueqiang Hu, Xin Li, Xiangnian Ou, Jiajie Lai, Na Liu, Huigao Duan

Replacing electrons with photons is a compelling route towards light-speed, highly parallel, and low-power artificial intelligence computing.

Autonomous Driving

Identifying Illicit Drug Dealers on Instagram with Large-scale Multimodal Data Fusion

no code implementations18 Aug 2021 Chuanbo Hu, Minglei Yin, Bin Liu, Xin Li, Yanfang Ye

Unlike existing methods that focus on posting-based detection, we propose to tackle the problem of illicit drug dealer identification by constructing a large-scale multimodal dataset named Identifying Drug Dealers on Instagram (IDDIG).

Community Detection

Detection of Illicit Drug Trafficking Events on Instagram: A Deep Multimodal Multilabel Learning Approach

no code implementations19 Aug 2021 Chuanbo Hu, Minglei Yin, Bin Liu, Xin Li, Yanfang Ye

Accordingly, accurate detection of illicit drug trafficking events (IDTEs) from social media has become even more challenging.

Marketing

Characterizing interdisciplinarity in drug research: a translational science perspective

no code implementations4 Sep 2021 Xin Li, Xuli Tang

Despite the significant advances in life science, it still takes decades to translate a basic drug discovery into a cure for human disease.

Drug Discovery

Uncertainty-Driven Loss for Single Image Super-Resolution

no code implementations NeurIPS 2021 Qian Ning, Weisheng Dong, Xin Li, Jinjian Wu, Guangming Shi

Specifically, we introduce variance estimation characterizing the uncertainty on a pixel-by-pixel basis into SISR solutions so the targeted pixels in a high-resolution image (mean) and their corresponding uncertainty (variance) can be learned simultaneously.

Image Super-Resolution

Confounder Identification-free Causal Visual Feature Learning

no code implementations26 Nov 2021 Xin Li, Zhizheng Zhang, Guoqiang Wei, Cuiling Lan, Wenjun Zeng, Xin Jin, Zhibo Chen

In this paper, we propose a novel Confounder Identification-free Causal Visual Feature Learning (CICF) method, which obviates the need for identifying confounders.

Domain Generalization Meta-Learning

Neural Collaborative Graph Machines for Table Structure Recognition

no code implementations CVPR 2022 Hao liu, Xin Li, Bing Liu, Deqiang Jiang, Yinsong Liu, Bo Ren

We also show that the proposed NCGM can modulate collaborative pattern of different modalities conditioned on the context of intra-modality cues, which is vital for diversified table cases.

Table Recognition

A Close Look at Few-shot Real Image Super-resolution from the Distortion Relation Perspective

no code implementations25 Nov 2021 Xin Li, Xin Jin, Jun Fu, Xiaoyuan Yu, Bei Tong, Zhibo Chen

Under this brand-new scenario, we propose Distortion Relation guided Transfer Learning (DRTL) for the few-shot RealSR by transferring the rich restoration knowledge from auxiliary distortions (i. e., synthetic distortions) to the target RealSR under the guidance of distortion relation.

Image Restoration Image Super-Resolution +4

Simple Contrastive Representation Adversarial Learning for NLP Tasks

no code implementations26 Nov 2021 Deshui Miao, JiaQi Zhang, WenBo Xie, Jian Song, Xin Li, Lijuan Jia, Ning Guo

In this paper, adversarial training is performed to generate challenging and harder learning adversarial examples over the embedding space of NLP as learning pairs.

Contrastive Learning Natural Language Understanding +4

Document Layout Analysis with Aesthetic-Guided Image Augmentation

no code implementations27 Nov 2021 Tianlong Ma, Xingjiao Wu, Xin Li, Xiangcheng Du, Zhao Zhou, Liang Xue, Cheng Jin

To measure the proposed image layer modeling method, we propose a manually-labeled non-Manhattan layout fine-grained segmentation dataset named FPD.

Document Layout Analysis document understanding +2

Interactive Model with Structural Loss for Language-based Abductive Reasoning

no code implementations1 Dec 2021 Linhao Li, Ming Xu, Yongfeng Dong, Xin Li, Ao Wang

Therefore, we propose to group instead of ranking the hypotheses and design a structural loss called ``joint softmax focal loss'' in this paper.

Language Modelling Natural Language Inference

Internationalizing AI: Evolution and Impact of Distance Factors

no code implementations10 Nov 2021 Xuli Tang, Xin Li, Feicheng Ma

A framework including 13 indicators to quantify the distance factors between countries from 5 perspectives (i. e., geographic distance, economic distance, cultural distance, academic distance, and industrial distance) is proposed.

Descriptive

Robust Depth Completion with Uncertainty-Driven Loss Functions

no code implementations15 Dec 2021 Yufan Zhu, Weisheng Dong, Leida Li, Jinjian Wu, Xin Li, Guangming Shi

In this work, we introduce uncertainty-driven loss functions to improve the robustness of depth completion and handle the uncertainty in depth completion.

Depth Completion

A Survey on Applications of Digital Human Avatars toward Virtual Co-presence

no code implementations11 Jan 2022 Matthew Korban, Xin Li

This paper investigates different approaches to build and use digital human avatars toward interactive Virtual Co-presence (VCP) environments.

Machine learning prediction for mean motion resonance behaviour -- The planar case

no code implementations18 Jan 2022 Xin Li, Jian Li, Zhihong Jeff Xia, Nikolaos Georgakarakos

Most recently, machine learning has been used to study the dynamics of integrable Hamiltonian systems and the chaotic 3-body problem.

BIG-bench Machine Learning Numerical Integration

A multi-domain virtual network embedding algorithm with delay prediction

no code implementations3 Feb 2022 Peiying Zhang, Xue Pang, Yongjing Ni, Haipeng Yao, Xin Li

Virtual network embedding (VNE) is an crucial part of network virtualization (NV), which aims to map the virtual networks (VNs) to a shared substrate network (SN).

Network Embedding

Multi-modal Sensor Fusion for Auto Driving Perception: A Survey

no code implementations6 Feb 2022 Keli Huang, Botian Shi, Xiang Li, Xin Li, Siyuan Huang, Yikang Li

Multi-modal fusion is a fundamental task for the perception of an autonomous driving system, which has recently intrigued many researchers.

Autonomous Driving object-detection +3

Low-Rank Phase Retrieval with Structured Tensor Models

no code implementations15 Feb 2022 Soo Min Kwon, Xin Li, Anand D. Sarwate

We study the low-rank phase retrieval problem, where the objective is to recover a sequence of signals (typically images) given the magnitude of linear measurements of those signals.

Retrieval

Towards Bidirectional Arbitrary Image Rescaling: Joint Optimization and Cycle Idempotence

no code implementations CVPR 2022 Zhihong Pan, Baopu Li, Dongliang He, Mingde Yao, Wenhao Wu, Tianwei Lin, Xin Li, Errui Ding

Deep learning based single image super-resolution models have been widely studied and superb results are achieved in upscaling low-resolution images with fixed scale factor and downscaling degradation kernel.

Image Super-Resolution

Aggregate effects of advertising decisions: a complex systems look at search engine advertising via an experimental study

no code implementations4 Mar 2022 Yanwu Yang, Xin Li, Bernard J. Jansen, Daniel Zeng

Originality: This is one of the first research works to explore collective group decisions and resulting phenomena in the complex context of search engine advertising via developing and validating a simulation framework that supports assessments of various advertising strategies and estimations of the impact of mechanisms on the search market.

Context-aware Visual Tracking with Joint Meta-updating

no code implementations4 Apr 2022 Qiuhong Shen, Xin Li, Fanyang Meng, Yongsheng Liang

These deep trackers usually do not perform online update or update single sub-branch of the tracking model, for which they cannot adapt to the appearance variation of objects.

Meta-Learning Visual Object Tracking +1

The Devil is in the Frequency: Geminated Gestalt Autoencoder for Self-Supervised Visual Pre-Training

no code implementations18 Apr 2022 Hao liu, Xinghua Jiang, Xin Li, Antai Guo, Deqiang Jiang, Bo Ren

The self-supervised Masked Image Modeling (MIM) schema, following "mask-and-reconstruct" pipeline of recovering contents from masked image, has recently captured the increasing interest in the multimedia community, owing to the excellent ability of learning visual representation from unlabeled data.

Global Mapping of Gene/Protein Interactions in PubMed Abstracts: A Framework and an Experiment with P53 Interactions

no code implementations22 Apr 2022 Xin Li, Hsinchun Chen, Zan Huang, Hua Su, Jesse D. Martinez

In this paper, we propose a comprehensive framework for constructing and analyzing large-scale gene functional networks based on the gene/protein interactions extracted from biomedical literature repositories using text mining tools.

Gene Function Prediction with Gene Interaction Networks: A Context Graph Kernel Approach

no code implementations22 Apr 2022 Xin Li, Hsinchun Chen, Jiexun Li, Zhu Zhang

Predicting gene functions is a challenge for biologists in the post genomic era.

Relational Representation Learning in Visually-Rich Documents

no code implementations5 May 2022 Xin Li, Yan Zheng, Yiqing Hu, Haoyu Cao, Yunfei Wu, Deqiang Jiang, Yinsong Liu, Bo Ren

To deal with the unpredictable definition of relations, we propose a novel contrastive learning task named Relational Consistency Modeling (RCM), which harnesses the fact that existing relations should be consistent in differently augmented positive views.

Contrastive Learning Key Information Extraction +3

Multi-Object Tracking Meets Moving UAV

no code implementations CVPR 2022 Shuai Liu, Xin Li, Huchuan Lu, You He

Multi-object tracking in unmanned aerial vehicle (UAV) videos is an important vision task and can be applied in a wide range of applications.

Multi-Object Tracking Object

Accurate Scoliosis Vertebral Landmark Localization on X-ray Images via Shape-constrained Multi-stage Cascaded CNNs

no code implementations5 Jun 2022 Zhiwei Wang, Jinxin Lv, Yunqiao Yang, Yuanhuai Liang, Yi Lin, Qiang Li, Xin Li, Xin Yang

Vertebral landmark localization is a crucial step for variant spine-related clinical applications, which requires detecting the corner points of 17 vertebrae.

Formation Tracking for a Multi-Auv System Based on an Adaptive Sliding Mode Method in the Water Flow Environment

no code implementations9 Jun 2022 Xin Li, Daqi Zhu, Bing Sun, Qi Chen, Wenyang Gan, Zhigang Li

At last, a robust sliding mode controller with continuous model predictive control strategy for the multi-AUV system is developed to achieve leader-follower formation tracking under the presence of bounded flow disturbances, and simulations are implemented to confirm the effectiveness of the proposed method.

Model Predictive Control

RTN: Reinforced Transformer Network for Coronary CT Angiography Vessel-level Image Quality Assessment

no code implementations13 Jul 2022 Yiting Lu, Jun Fu, Xin Li, Wei Zhou, Sen Liu, Xinxin Zhang, Congfu Jia, Ying Liu, Zhibo Chen

Therefore, we propose a Progressive Reinforcement learning based Instance Discarding module (termed as PRID) to progressively remove quality-irrelevant/negative instances for CCTA VIQA.

Image Quality Assessment Multiple Instance Learning

Stroke-Based Autoencoders: Self-Supervised Learners for Efficient Zero-Shot Chinese Character Recognition

no code implementations17 Jul 2022 Zongze Chen, Wenxia Yang, Xin Li

Following its canonical writing order, we first represent a Chinese character as a series of stroke images with a fixed writing order, and then our SAE model is trained to reconstruct this stroke image sequence.

Word Embeddings Zero-Shot Learning

Source-free Unsupervised Domain Adaptation for Blind Image Quality Assessment

no code implementations17 Jul 2022 Jianzhao Liu, Xin Li, Shukun An, Zhibo Chen

Thanks to the development of unsupervised domain adaptation (UDA), some works attempt to transfer the knowledge from a label-sufficient source domain to a label-free target domain under domain shift with UDA.

Blind Image Quality Assessment Unsupervised Domain Adaptation

Point Cloud Attacks in Graph Spectral Domain: When 3D Geometry Meets Graph Signal Processing

no code implementations27 Jul 2022 Daizong Liu, Wei Hu, Xin Li

Instead, we propose point cloud attacks from a new perspective -- the graph spectral domain attack, aiming to perturb graph transform coefficients in the spectral domain that corresponds to varying certain geometric structure.

StyleAM: Perception-Oriented Unsupervised Domain Adaption for Non-reference Image Quality Assessment

no code implementations29 Jul 2022 Yiting Lu, Xin Li, Jianzhao Liu, Zhibo Chen

Specifically, we find a more compact and reliable space i. e., feature style space for perception-oriented UDA based on an interesting/amazing observation, that the feature style (i. e., the mean and variance) of the deep layer in DNNs is exactly associated with the quality score in NR-IQA.

Image Quality Assessment NR-IQA +1

Learned Lossless JPEG Transcoding via Joint Lossy and Residual Compression

no code implementations24 Aug 2022 Xiaoshuai Fan, Xin Li, Zhibo Chen

Our proposed transcoding architecture shows significant superiority in the compression of JPEG images thanks to the collaboration of learned lossy transform coding and residual entropy coding.

Image Compression

Hierarchical Reinforcement Learning Based Video Semantic Coding for Segmentation

no code implementations24 Aug 2022 Guangqi Xie, Xin Li, Shiqi Lin, Li Zhang, Kai Zhang, Yue Li, Zhibo Chen

In this paper, we take a step forward to video semantic compression and propose the Hierarchical Reinforcement Learning based task-driven Video Semantic Coding, named as HRLVSC.

Hierarchical Reinforcement Learning reinforcement-learning +3

Saliency Guided Adversarial Training for Learning Generalizable Features with Applications to Medical Imaging Classification System

no code implementations9 Sep 2022 Xin Li, Yao Qiang, Chengyin Li, Sijia Liu, Dongxiao Zhu

We hypothesize that adversarial training can eliminate shortcut features whereas saliency guided training can filter out non-relevant features; both are nuisance features accounting for the performance degradation on OOD test sets.

Uncertainty Aware Multitask Pyramid Vision Transformer For UAV-Based Object Re-Identification

no code implementations19 Sep 2022 Syeda Nyma Ferdous, Xin Li, Siwei Lyu

Learning a robust and discriminative feature representation is a crucial challenge for object ReID.

Object

How Image Generation Helps Visible-to-Infrared Person Re-Identification?

no code implementations4 Oct 2022 Honghu Pan, Yongyong Chen, Yunqi He, Xin Li, Zhenyu He

To this end, we propose Flow2Flow, a unified framework that could jointly achieve training sample expansion and cross-modality image generation for V2I person ReID.

Image Generation Person Re-Identification

Toward an Over-parameterized Direct-Fit Model of Visual Perception

no code implementations7 Oct 2022 Xin Li

In this paper, we revisit the problem of computational modeling of simple and complex cells for an over-parameterized and direct-fit model of visual perception.

Predicting the clinical citation count of biomedical papers using multilayer perceptron neural network

no code implementations7 Sep 2022 Xin Li, Xuli Tang, Qikai Cheng

We extracted ninety-one paper features from three dimensions as the input of the model, including twenty-one features in the paper dimension, thirty-five in the reference dimension, and thirty-five in the citing paper dimension.

Translation

Cutting-Splicing data augmentation: A novel technology for medical image segmentation

no code implementations17 Oct 2022 Lianting Hu, Huiying Liang, Jiajie Tang, Xin Li, Li Huang, Long Lu

Background: Medical images are more difficult to acquire and annotate than natural images, which results in data augmentation technologies often being used in medical image segmentation tasks.

Data Augmentation Image Segmentation +4

Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection

no code implementations18 Oct 2022 Xin Li, Botian Shi, Yuenan Hou, Xingjiao Wu, Tianlong Ma, Yikang Li, Liang He

To address these problems, we construct the homogeneous structure between the point cloud and images to avoid projective information loss by transforming the camera features into the LiDAR 3D space.

3D Object Detection Autonomous Driving +1

Joint Rigid Motion Correction and Sparse-View CT via Self-Calibrating Neural Field

no code implementations23 Oct 2022 Qing Wu, Xin Li, Hongjiang Wei, Jingyi Yu, Yuyao Zhang

NeRF-based SVCT methods represent the desired CT image as a continuous function of spatial coordinates and train a Multi-Layer Perceptron (MLP) to learn the function by minimizing loss on the SV sinogram.

Development of a Hybrid Simulation and Experiment Test Platform for Dynamic Positioning Vessels

no code implementations23 Oct 2022 Changjun Hu, Quan Shi, Xin Li, Xiaoxian Guo

The test platform can test the performance of DP system and determine the operational time window.

Deep Learning-Based Channel Estimation for Double-RIS Aided Massive MIMO System

no code implementations22 Oct 2022 Mengbing Liu, Xin Li, Boyu Ning, Chongwen Huang, Sumei Sun, Chau Yuen

Reconfigurable Intelligent Surface (RIS) is considered as an energy-efficient solution for future wireless communication networks due to its fast and low-cost configuration.

Multi-view Representation Learning from Malware to Defend Against Adversarial Variants

no code implementations25 Oct 2022 James Lee Hu, MohammadReza Ebrahimi, Weifeng Li, Xin Li, Hsinchun Chen

This provides an opportunity for the defenders (i. e., malware detectors) to detect the adversarial variants by utilizing more than one view of a malware file (e. g., source code view in addition to the binary view).

Adversarial Robustness MULTI-VIEW LEARNING +1

Discrete Factorial Representations as an Abstraction for Goal Conditioned Reinforcement Learning

no code implementations1 Nov 2022 Riashat Islam, Hongyu Zang, Anirudh Goyal, Alex Lamb, Kenji Kawaguchi, Xin Li, Romain Laroche, Yoshua Bengio, Remi Tachet des Combes

Goal-conditioned reinforcement learning (RL) is a promising direction for training agents that are capable of solving multiple tasks and reach a diverse set of objectives.

reinforcement-learning Reinforcement Learning (RL)

RRSR:Reciprocal Reference-based Image Super-Resolution with Progressive Feature Alignment and Selection

no code implementations8 Nov 2022 Lin Zhang, Xin Li, Dongliang He, Fu Li, Yili Wang, Zhaoxiang Zhang

While previous state-of-the-art RefSR methods mainly focus on improving the efficacy and robustness of reference feature transfer, it is generally overlooked that a well reconstructed SR image should enable better SR reconstruction for its similar LR images when it is referred to as.

feature selection Image Super-Resolution

Batch-based Model Registration for Fast 3D Sherd Reconstruction

no code implementations ICCV 2023 Jiepeng Wang, Congyi Zhang, Peng Wang, Xin Li, Peter J. Cobb, Christian Theobalt, Wenping Wang

In this work, we aim to develop a portable, high-throughput, and accurate reconstruction system for efficient digitization of fragments excavated in archaeological sites.

3D Reconstruction

Transformation-Equivariant 3D Object Detection for Autonomous Driving

no code implementations22 Nov 2022 Hai Wu, Chenglu Wen, Wei Li, Xin Li, Ruigang Yang, Cheng Wang

However, it is difficult to apply such networks to 3D object detection in autonomous driving due to its large computation cost and slow reasoning speed.

3D Object Detection Autonomous Driving +3

Learning Compact Features via In-Training Representation Alignment

no code implementations23 Nov 2022 Xin Li, Xiangrui Li, Deng Pan, Yao Qiang, Dongxiao Zhu

Deep neural networks (DNNs) for supervised learning can be viewed as a pipeline of the feature extractor (i. e., last hidden layer) and a linear classifier (i. e., output layer) that are trained jointly with stochastic gradient descent (SGD) on the loss function (e. g., cross-entropy).

Representation Learning

AdaCM: Adaptive ColorMLP for Real-Time Universal Photo-realistic Style Transfer

no code implementations3 Dec 2022 Tianwei Lin, Honglin Lin, Fu Li, Dongliang He, Wenhao Wu, Meiling Wang, Xin Li, Yong liu

Then, in \textbf{AdaCM}, we adopt a CNN encoder to adaptively predict all parameters for the ColorMLP conditioned on each input content and style image pair.

4k Style Transfer

Coarse-to-Fine Contrastive Learning on Graphs

no code implementations13 Dec 2022 Peiyao Zhao, Yuangang Pan, Xin Li, Xu Chen, Ivor W. Tsang, Lejian Liao

Inspired by the impressive success of contrastive learning (CL), a variety of graph augmentation strategies have been employed to learn node representations in a self-supervised manner.

Contrastive Learning Learning-To-Rank

Joint Beamforming Design for Dual-Functional MIMO Radar and Communication Systems Guaranteeing Physical Layer Security

no code implementations1 Jan 2023 Fuwang Dong, Wei Wang, Xin Li, Fan Liu, Sheng Chen, Lajos Hanzo

The dual-functional radar and communication (DFRC) technique constitutes a promising next-generation wireless solution, due to its benefits in terms of power consumption, physical hardware, and spectrum exploitation.

Multi-Constraint Molecular Generation using Sparsely Labelled Training Data for Localized High-Concentration Electrolyte Diluent Screening

no code implementations12 Jan 2023 Jonathan P. Mailoa, Xin Li, Jiezhong Qiu, Shengyu Zhang

Recently, machine learning methods have been used to propose molecules with desired properties, which is especially useful for exploring large chemical spaces efficiently.

PointSmile: Point Self-supervised Learning via Curriculum Mutual Information

no code implementations30 Jan 2023 Xin Li, Mingqiang Wei, Songcan Chen

From the perspective of how-and-what-to-learn, PointSmile is designed to imitate human curriculum learning, i. e., starting with an easy curriculum and gradually increasing the difficulty of that curriculum.

Data Augmentation Self-Supervised Learning

Analysis of Biomass Sustainability Indicators from a Machine Learning Perspective

no code implementations2 Feb 2023 Syeda Nyma Ferdous, Xin Li, Kamalakanta Sahoo, Richard Bergman

This study proposes a robust model for biomass sustainability prediction by analyzing sustainability indicators using machine learning models.

Ensemble Learning Management +1

MorphGANFormer: Transformer-based Face Morphing and De-Morphing

no code implementations18 Feb 2023 Na Zhang, Xudong Liu, Xin Li, Guo-Jun Qi

Semantic face image manipulation has received increasing attention in recent years.

Image Manipulation

Toward a Geometric Theory of Manifold Untangling

no code implementations7 Mar 2023 Xin Li, Shuo Wang

It has been hypothesized that the ventral stream processing for object recognition is based on a mechanism called cortically local subspace untangling.

Object Object Recognition

Toward NeuroDM: Where Computational Neuroscience Meets Data Mining

no code implementations7 Mar 2023 Xin Li, Bin Liu, Shuo Wang

At the intersection of computational neuroscience (CN) and data mining (DM), we advocate a holistic view toward their rich connections.

Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation

no code implementations16 Mar 2023 Hao liu, Xin Li, Mingming Gong, Bing Liu, Yunfei Wu, Deqiang Jiang, Yinsong Liu, Xing Sun

Recently, Table Structure Recognition (TSR) task, aiming at identifying table structure into machine readable formats, has received increasing interest in the community.

MobileInst: Video Instance Segmentation on the Mobile

no code implementations30 Mar 2023 Renhong Zhang, Tianheng Cheng, Shusheng Yang, Haoyi Jiang, Shuai Zhang, Jiancheng Lyu, Xin Li, Xiaowen Ying, Dashan Gao, Wenyu Liu, Xinggang Wang

To address those issues, we present MobileInst, a lightweight and mobile-friendly framework for video instance segmentation on mobile devices.

Instance Segmentation Segmentation +2

Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA

no code implementations4 Apr 2023 Yongxin Zhu, Zhen Liu, Yukang Liang, Xin Li, Hao liu, Changcun Bao, Linli Xu

Different to conventional STVQA models which take the linguistic semantics and visual semantics in scene text as two separate features, in this paper, we propose a paradigm of "Locate Then Generate" (LTG), which explicitly unifies this two semantics with the spatial bounding box as a bridge connecting them.

Answer Generation Language Modelling +3

MEDIC: A Multimodal Empathy Dataset in Counseling

no code implementations4 May 2023 Zhou'an_Zhu, Xin Li, Jicai Pan, Yufei Xiao, Yanan Chang, Feiyi Zheng, Shangfei Wang

We also propose three labels (i. e., expression of experience, emotional reaction, and cognitive reaction) to describe the degree of empathy between counselors and their clients.

UPDExplainer: an Interpretable Transformer-based Framework for Urban Physical Disorder Detection Using Street View Imagery

no code implementations4 May 2023 Chuanbo Hu, Shan Jia, Fan Zhang, Changjiang Xiao, Mindi Ruan, Jacob Thrasher, Xin Li

Experimental results on the re-annotated Place Pulse 2. 0 dataset demonstrate promising detection performance of the proposed method, with an accuracy of 79. 9%.

Semantic Segmentation

SoGAR: Self-supervised Spatiotemporal Attention-based Social Group Activity Recognition

no code implementations27 Apr 2023 Naga VS Raviteja Chappa, Pha Nguyen, Alexander H Nelson, Han-Seok Seo, Xin Li, Page Daniel Dobbs, Khoa Luu

This paper introduces a novel approach to Social Group Activity Recognition (SoGAR) using Self-supervised Transformers network that can effectively utilize unlabeled video data.

Group Activity Recognition

GeoGLUE: A GeoGraphic Language Understanding Evaluation Benchmark

no code implementations11 May 2023 Dongyang Li, Ruixue Ding, Qiang Zhang, Zheng Li, Boli Chen, Pengjun Xie, Yao Xu, Xin Li, Ning Guo, Fei Huang, Xiaofeng He

With a fast developing pace of geographic applications, automatable and intelligent models are essential to be designed to handle the large volume of information.

Entity Alignment Natural Language Understanding

Vector Quantization With Self-Attention for Quality-Independent Representation Learning

no code implementations CVPR 2023 Zhou Yang, Weisheng Dong, Xin Li, Mengluan Huang, Yulin Sun, Guangming Shi

During training, we enforce the quantization of features from clean and corrupted images in the same discrete embedding space so that an invariant quality-independent feature representation can be learned to improve the recognition robustness of low-quality images.

Data Augmentation Image Restoration +2

Self-Supervised Non-Uniform Kernel Estimation With Flow-Based Motion Prior for Blind Image Deblurring

no code implementations CVPR 2023 Zhenxuan Fang, Fangfang Wu, Weisheng Dong, Xin Li, Jinjian Wu, Guangming Shi

To address these issues, we propose to represent the field of motion blur kernels in a latent space by normalizing flows, and design CNNs to predict the latent codes instead of motion kernels.

Blind Image Deblurring Image Deblurring

Two-Stream Regression Network for Dental Implant Position Prediction

no code implementations17 May 2023 Xinquan Yang, Xuguang Li, Xuechen Li, WenTing Chen, Linlin Shen, Xin Li, Yongqiang Deng

In this paper, we develop a two-stream implant position regression framework (TSIPR), which consists of an implant region detector (IRD) and a multi-scale patch embedding regression network (MSPENet), to address this issue.

Position Position regression +1

Cross-supervised Dual Classifiers for Semi-supervised Medical Image Segmentation

no code implementations25 May 2023 Zhenxi Zhang, Ran Ran, Chunna Tian, Heng Zhou, Fan Yang, Xin Li, Zhicheng Jiao

This paper proposes a cross-supervised learning framework based on dual classifiers (DC-Net), including an evidential classifier and a vanilla classifier.

Image Segmentation Segmentation +2

Self-aware and Cross-sample Prototypical Learning for Semi-supervised Medical Image Segmentation

no code implementations25 May 2023 Zhenxi Zhang, Ran Ran, Chunna Tian, Heng Zhou, Xin Li, Fan Yang, Zhicheng Jiao

To address these issues, we propose a self-aware and cross-sample prototypical learning method (SCP-Net) to enhance the diversity of prediction in consistency learning by utilizing a broader range of semantic information derived from multiple inputs.

Image Segmentation Semantic Segmentation +1

A2B: Anchor to Barycentric Coordinate for Robust Correspondence

no code implementations5 Jun 2023 Weiyue Zhao, Hao Lu, Zhiguo Cao, Xin Li

This approach offers a new perspective to alleviate the problem of repeated patterns and emphasizes the importance of choosing coordinate representations for feature correspondences.

Learning Probabilistic Coordinate Fields for Robust Correspondences

no code implementations7 Jun 2023 Weiyue Zhao, Hao Lu, Xinyi Ye, Zhiguo Cao, Xin Li

We introduce Probabilistic Coordinate Fields (PCFs), a novel geometric-invariant coordinate representation for image correspondence problems.

Image Registration Pose Estimation

Securing Visually-Aware Recommender Systems: An Adversarial Image Reconstruction and Detection Framework

no code implementations11 Jun 2023 Minglei Yin, Bin Liu, Neil Zhenqiang Gong, Xin Li

Our proposed method can simultaneously (1) secure VARS from adversarial attacks characterized by local perturbations by image reconstruction based on global vision transformers; and (2) accurately detect adversarial examples using a novel contrastive learning approach.

Contrastive Learning Image Reconstruction +1

TCEIP: Text Condition Embedded Regression Network for Dental Implant Position Prediction

no code implementations26 Jun 2023 Xinquan Yang, Jinheng Xie, Xuguang Li, Xuechen Li, Xin Li, Linlin Shen, Yongqiang Deng

When deep neural network has been proposed to assist the dentist in designing the location of dental implant, most of them are targeting simple cases where only one missing tooth is available.

Position Position regression +1

Unveiling the Potential of Knowledge-Prompted ChatGPT for Enhancing Drug Trafficking Detection on Social Media

no code implementations7 Jul 2023 Chuanbo Hu, Bin Liu, Xin Li, Yanfang Ye

By integrating prior knowledge and the proposed prompts, ChatGPT can effectively identify and label drug trafficking activities on social networks, even in the presence of deceptive language and euphemisms used by drug dealers to evade detection.

Marketing

Adaptive Control of Resource Flow to Optimize Construction Work and Cash Flow via Online Deep Reinforcement Learning

no code implementations20 Jul 2023 Can Jiang, Xin Li, Jia-Rui Lin, Ming Liu, Zhiliang Ma

Therefore, this paper introducess a model and method to adaptive control the resource flows to optimize the work and cash flows of construction projects.

Management

Bi-Modality Medical Image Synthesis Using Semi-Supervised Sequential Generative Adversarial Networks

no code implementations27 Aug 2023 Xin Yang, Yi Lin, Zhiwei Wang, Xin Li, Kwang-Ting Cheng

A method for measuring the synthesis complexity is proposed to automatically determine the synthesis order in our sequential GAN.

Generative Adversarial Network Image Generation

A Note on Randomized Kaczmarz Algorithm for Solving Doubly-Noisy Linear Systems

no code implementations31 Aug 2023 El Houcine Bergou, Soumia Boucherouite, Aritra Dutta, Xin Li, Anna Ma

In this paper, we analyze the convergence of RK for noisy linear systems when the coefficient matrix, $A$, is corrupted with both additive and multiplicative noise, along with the noisy vector, $b$.

VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation

no code implementations1 Sep 2023 Xin Li, Wenqing Chu, Ye Wu, Weihang Yuan, Fanglong Liu, Qi Zhang, Fu Li, Haocheng Feng, Errui Ding, Jingdong Wang

In this paper, we present VideoGen, a text-to-video generation approach, which can generate a high-definition video with high frame fidelity and strong temporal consistency using reference-guided latent diffusion.

Text-to-Image Generation Text-to-Video Generation +1

3D Multiple Object Tracking on Autonomous Driving: A Literature Review

no code implementations27 Sep 2023 Peng Zhang, Xin Li, Liang He, Xin Lin

This paper undertakes a comprehensive examination, assessment, and synthesis of the research landscape in this domain, remaining attuned to the latest developments in 3D MOT while suggesting prospective avenues for future investigation.

3D Multi-Object Tracking Autonomous Driving +1

FreqAlign: Excavating Perception-oriented Transferability for Blind Image Quality Assessment from A Frequency Perspective

no code implementations29 Sep 2023 Xin Li, Yiting Lu, Zhibo Chen

Based on this, we propose to improve the perception-oriented transferability of BIQA by performing feature frequency decomposition and selecting the frequency components that contained the most transferable perception knowledge for alignment.

Blind Image Quality Assessment Unsupervised Domain Adaptation

Demystifying the Myths and Legends of Nonconvex Convergence of SGD

no code implementations19 Oct 2023 Aritra Dutta, El Houcine Bergou, Soumia Boucherouite, Nicklas Werge, Melih Kandemir, Xin Li

Additionally, our analyses allow us to measure the density of the $\epsilon$-stationary points in the final iterates of SGD, and we recover the classical $O(\frac{1}{\sqrt{T}})$ asymptotic rate under various existing assumptions on the objective function and the bounds on the stochastic gradient.

Diagnosis-oriented Medical Image Compression with Efficient Transfer Learning

no code implementations20 Oct 2023 Guangqi Xie, Xin Li, Xiaohan Pan, Zhibo Chen

Remote medical diagnosis has emerged as a critical and indispensable technique in practical medical systems, where medical data are required to be efficiently compressed and transmitted for diagnosis by either professional doctors or intelligent diagnosis devices.

Coronary Artery Segmentation Image Compression +2

Good Better Best: Self-Motivated Imitation Learning for noisy Demonstrations

no code implementations24 Oct 2023 Ye Yuan, Xin Li, Yong Heng, Leiji Zhang, Mingzhong Wang

Imitation Learning (IL) aims to discover a policy by minimizing the discrepancy between the agent's behavior and expert demonstrations.

Imitation Learning

Insect-Foundation: A Foundation Model and Large-scale 1M Dataset for Visual Insect Understanding

no code implementations26 Nov 2023 Hoang-Quan Nguyen, Thanh-Dat Truong, Xuan Bac Nguyen, Ashley Dowling, Xin Li, Khoa Luu

In precision agriculture, the detection and recognition of insects play an essential role in the ability of crops to grow healthy and produce a high-quality yield.

Self-Supervised Learning

Brainformer: Modeling MRI Brain Functions to Machine Vision

no code implementations30 Nov 2023 Xuan-Bac Nguyen, Xin Li, Samee U. Khan, Khoa Luu

In this work, we first present a simple yet effective Brainformer approach, a novel Transformer-based framework, to analyze the patterns of fMRI in the human perception system from the machine learning perspective.

Cross-BERT for Point Cloud Pretraining

no code implementations8 Dec 2023 Xin Li, Peng Li, Zeyong Wei, Zhe Zhu, Mingqiang Wei, Junhui Hou, Liangliang Nan, Jing Qin, Haoran Xie, Fu Lee Wang

By performing cross-modal interaction, Cross-BERT can smoothly reconstruct the masked tokens during pretraining, leading to notable performance enhancements for downstream tasks.

Self-Supervised Learning

Disentangled Clothed Avatar Generation from Text Descriptions

no code implementations8 Dec 2023 Jionghao Wang, YuAn Liu, Zhiyang Dou, Zhengming Yu, Yongqing Liang, Xin Li, Wenping Wang, Rong Xie, Li Song

In this paper, we introduced a novel text-to-avatar generation method that separately generates the human body and the clothes and allows high-quality animation on the generated avatar.

Virtual Try-on

Spectrum-guided Feature Enhancement Network for Event Person Re-Identification

no code implementations2 Feb 2024 Hongchen Tan, Yi Zhang, Xiuping Liu, BaoCai Yin, Nan Ma, Xin Li, Huchuan Lu

This network consists of two innovative components: the Multi-grain Spectrum Attention Mechanism (MSAM) and the Consecutive Patch Dropout Module (CPDM).

Person Re-Identification

Evading Deep Learning-Based Malware Detectors via Obfuscation: A Deep Reinforcement Learning Approach

no code implementations4 Feb 2024 Brian Etter, James Lee Hu, Mohammedreza Ebrahimi, Weifeng Li, Xin Li, Hsinchun Chen

Adversarial Malware Generation (AMG), the gen- eration of adversarial malware variants to strengthen Deep Learning (DL)-based malware detectors has emerged as a crucial tool in the development of proactive cyberdefense.

Malware Detection reinforcement-learning +1

scInterpreter: Training Large Language Models to Interpret scRNA-seq Data for Cell Type Annotation

no code implementations18 Feb 2024 Cong Li, Meng Xiao, Pengfei Wang, Guihai Feng, Xin Li, Yuanchun Zhou

Despite the inherent limitations of existing Large Language Models in directly reading and interpreting single-cell omics data, they demonstrate significant potential and flexibility as the Foundation Model.

Language Modelling Large Language Model

On Organizational Principles of Neural Systems

no code implementations22 Feb 2024 Xin Li

Inspired by classical embodied cognition and the emerging multimodal interaction, we study the organizational principles of neural systems at three levels (device/implementation, circuit/algorithm, and system/computational) in this survey paper.

Neural Radiance Fields in Medical Imaging: Challenges and Next Steps

no code implementations26 Feb 2024 Xin Wang, Shu Hu, Heng Fan, Hongtu Zhu, Xin Li

Neural Radiance Fields (NeRF), as a pioneering technique in computer vision, offer great potential to revolutionize medical imaging by synthesizing three-dimensional representations from the projected two-dimensional image data.

Cannot find the paper you are looking for? You can Submit a new open access paper.