Search Results for author: Xiang Li

Found 460 papers, 178 papers with code

Modeling 4D fMRI Data via Spatio-Temporal Convolutional Neural Networks (ST-CNN)

no code implementations31 May 2018 Yu Zhao, Xiang Li, Wei zhang, Shijie Zhao, Milad Makkie, Mo Zhang, Quanzheng Li, Tianming Liu

Simultaneous modeling of the spatio-temporal variation patterns of brain functional network from 4D fMRI data has been an important yet challenging problem for the field of cognitive neuroscience and medical image analysis.

Brain Decoding

Perceive Your Users in Depth: Learning Universal User Representations from Multiple E-commerce Tasks

no code implementations28 May 2018 Yabo Ni, Dan Ou, Shichen Liu, Xiang Li, Wenwu Ou, An-Xiang Zeng, Luo Si

In this work, we propose to learn universal user representations across multiple tasks for more e ective personalization.

Probabilistic Embedding of Knowledge Graphs with Box Lattice Measures

no code implementations ACL 2018 Luke Vilnis, Xiang Li, Shikhar Murty, Andrew McCallum

Embedding methods which enforce a partial order or lattice structure over the concept space, such as Order Embeddings (OE) (Vendrov et al., 2016), are a natural way to model transitive relational data (e. g. entailment graphs).

Inductive Bias Knowledge Graphs +1

Adversarial Metric Learning

no code implementations9 Feb 2018 Shuo Chen, Chen Gong, Jian Yang, Xiang Li, Yang Wei, Jun Li

In distinguishment stage, a metric is exhaustively learned to try its best to distinguish both the adversarial pairs and the original training pairs.

Metric Learning

Image Segmentation and Classification for Sickle Cell Disease using Deformable U-Net

no code implementations23 Oct 2017 Mo Zhang, Xiang Li, Mengjia Xu, Quanzheng Li

Reliable cell segmentation and classification from biomedical images is a crucial step for both scientific research and clinical practice.

Cell Segmentation Classification +4

Improved Representation Learning for Predicting Commonsense Ontologies

no code implementations1 Aug 2017 Xiang Li, Luke Vilnis, Andrew McCallum

Recent work in learning ontologies (hierarchical and partially-ordered structures) has leveraged the intrinsic geometry of spaces of learned representations to make predictions that automatically obey complex structural constraints.

Representation Learning

Dictionary Learning and Sparse Coding-based Denoising for High-Resolution Task Functional Connectivity MRI Analysis

no code implementations21 Jul 2017 Seongah Jeong, Xiang Li, Jiarui Yang, Quanzheng Li, Vahid Tarokh

In order to address the limitations of the unsupervised DLSC-based fMRI studies, we utilize the prior knowledge of task paradigm in the learning step to train a data-driven dictionary and to model the sparse representation.

Denoising Dictionary Learning

Self-paced Convolutional Neural Network for Computer Aided Detection in Medical Imaging Analysis

no code implementations19 Jul 2017 Xiang Li, Aoxiao Zhong, Ming Lin, Ning Guo, Mu Sun, Arkadiusz Sitek, Jieping Ye, James Thrall, Quanzheng Li

However, the development of a robust and reliable deep learning model for computer-aided diagnosis is still highly challenging due to the combination of the high heterogeneity in the medical images and the relative lack of training samples.

Computed Tomography (CT) Lesion Detection

Learning the Sparse and Low Rank PARAFAC Decomposition via the Elastic Net

no code implementations29 May 2017 Songting Shi, Xiang Li, Arkadiusz Sitek, Quanzheng Li

In this article, we derive a Bayesian model to learning the sparse and low rank PARAFAC decomposition for the observed tensor with missing values via the elastic net, with property to find the true rank and sparse factor matrix which is robust to the noise.

LightRNN: Memory and Computation-Efficient Recurrent Neural Networks

no code implementations NeurIPS 2016 Xiang Li, Tao Qin, Jian Yang, Tie-Yan Liu

Based on the 2-Component shared embedding, we design a new RNN algorithm and evaluate it using the language modeling task on several benchmark datasets.

Language Modelling Machine Translation

Statistical Properties of the Single Linkage Hierarchical Clustering Estimator

no code implementations24 Nov 2015 Dekang Zhu, Dan P. Guralnik, Xuezhi Wang, Xiang Li, Bill Moran

Distance-based hierarchical clustering (HC) methods are widely used in unsupervised data analysis but few authors take account of uncertainty in the distance data.

Clustering

Top-push Video-based Person Re-identification

no code implementations CVPR 2016 Jin-Jie You, An-Cong Wu, Xiang Li, Wei-Shi Zheng

Since only limited information can be exploited from still images, it is hard (if not impossible) to overcome the occlusion, pose and camera-view change, and lighting variation problems.

Video-Based Person Re-Identification

An Enhanced Deep Feature Representation for Person Re-identification

no code implementations26 Apr 2016 Shangxuan Wu, Ying-Cong Chen, Xiang Li, An-Cong Wu, Jin-Jie You, Wei-Shi Zheng

In this paper, we focus on the feature representation and claim that hand-crafted histogram features can be complementary to Convolutional Neural Network (CNN) features.

Metric Learning Person Re-Identification

StalemateBreaker: A Proactive Content-Introducing Approach to Automatic Human-Computer Conversation

no code implementations15 Apr 2016 Xiang Li, Lili Mou, Rui Yan, Ming Zhang

In this paper, we propose StalemateBreaker, a conversation system that can proactively introduce new content when appropriate.

Maximum Likelihood Estimation for Single Linkage Hierarchical Clustering

no code implementations25 Nov 2015 Dekang Zhu, Dan P. Guralnik, Xuezhi Wang, Xiang Li, Bill Moran

We derive a statistical model for estimation of a dendrogram from single linkage hierarchical clustering (SLHC) that takes account of uncertainty through noise or corruption in the measurements of separation of data.

Clustering Small Data Image Classification

Task-group Relatedness and Generalization Bounds for Regularized Multi-task Learning

no code implementations28 Aug 2014 Chao Zhang, DaCheng Tao, Tao Hu, Xiang Li

We are mainly concerned with two theoretical questions: 1) under what conditions does RMTL perform better with a smaller task sample size than STL?

Generalization Bounds Multi-Task Learning

Adversarial Open-World Person Re-Identification

no code implementations ECCV 2018 Xiang Li, An-Cong Wu, Wei-Shi Zheng

The main idea is learning to attack feature extractor on the target people by using GAN to generate very target-like images (imposters), and in the meantime the model will make the feature extractor learn to tolerate the attack by discriminative learning so as to realize group-based verification.

Person Re-Identification

Multi-Estimator Full Left Ventricle Quantification through Ensemble Learning

no code implementations6 Aug 2018 Jiasha Liu, Xiang Li, Hui Ren, Quanzheng Li

The framework combines two 1st-level modules: direct estimation module and a segmentation module.

Ensemble Learning Management

Network Modeling and Pathway Inference from Incomplete Data ("PathInf")

no code implementations1 Oct 2018 Xiang Li, Qitian Chen, Xing Wang, Ning Guo, Nan Wu, Quanzheng Li

In this work, we developed a network inference method from incomplete data ("PathInf") , as massive and non-uniformly distributed missing values is a common challenge in practical problems.

Data Summarization

Triple Attention Mixed Link Network for Single Image Super Resolution

no code implementations8 Oct 2018 Xi Cheng, Xiang Li, Jian Yang

Single image super resolution is of great importance as a low-level computer vision task.

Image Super-Resolution

Improving the Robustness of Speech Translation

no code implementations2 Nov 2018 Xiang Li, Haiyang Xue, Wei Chen, Yang Liu, Yang Feng, Qun Liu

Although neural machine translation (NMT) has achieved impressive progress recently, it is usually trained on the clean parallel data set and hence cannot work well when the input sentence is the production of the automatic speech recognition (ASR) system due to the enormous errors in the source.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Group-Attention Single-Shot Detector (GA-SSD): Finding Pulmonary Nodules in Large-Scale CT Images

no code implementations18 Dec 2018 Jiechao Ma, Xiang Li, Hongwei Li, Bjoern H. Menze, Sen Liang, Rongguo Zhang, Wei-Shi Zheng

In this paper, we propose a novel and effective abnormality detector implementing the attention mechanism and group convolution on 3D single-shot detector (SSD) called group-attention SSD (GA-SSD).

Computed Tomography (CT) Finding Pulmonary Nodules In Large-Scale Ct Images

Joint Task-Recursive Learning for Semantic Segmentation and Depth Estimation

no code implementations ECCV 2018 Zhen-Yu Zhang, Zhen Cui, Chunyan Xu, Zequn Jie, Xiang Li, Jian Yang

In this paper, we propose a novel joint Task-Recursive Learning (TRL) framework for the closing-loop semantic segmentation and monocular depth estimation tasks.

Monocular Depth Estimation Segmentation +1

Smoothing the Geometry of Probabilistic Box Embeddings

no code implementations ICLR 2019 Xiang Li, Luke Vilnis, Dongxu Zhang, Michael Boratko, Andrew McCallum

However, the hard edges of the boxes present difficulties for standard gradient based optimization; that work employed a special surrogate function for the disjoint case, but we find this method to be fragile.

Inductive Bias

Joint Intensity and Spatial Metric Learning for Robust Gait Recognition

no code implementations CVPR 2017 Yasushi Makihara, Atsuyuki Suzuki, Daigo Muramatsu, Xiang Li, Yasushi Yagi

This paper describes a joint intensity metric learning method to improve the robustness of gait recognition with silhouette-based descriptors such as gait energy images.

Gait Recognition Metric Learning

Multi-Scale Learning for Low-Resolution Person Re-Identification

no code implementations ICCV 2015 Xiang Li, Wei-Shi Zheng, Xiaojuan Wang, Tao Xiang, Shaogang Gong

In real world person re-identification (re-id), images of people captured at very different resolutions from different locations need be matched.

Person Re-Identification

Partial Person Re-Identification

no code implementations ICCV 2015 Wei-Shi Zheng, Xiang Li, Tao Xiang, Shengcai Liao, Jian-Huang Lai, Shaogang Gong

We address a new partial person re-identification (re-id) problem, where only a partial observation of a person is available for matching across different non-overlapping camera views.

Person Re-Identification

Do Subsampled Newton Methods Work for High-Dimensional Data?

no code implementations13 Feb 2019 Xiang Li, Shusen Wang, Zhihua Zhang

Subsampled Newton methods approximate Hessian matrices through subsampling techniques, alleviating the cost of forming Hessian matrices but using sufficient curvature information.

Distributed Optimization Vocal Bursts Intensity Prediction

A Regularized Approach to Sparse Optimal Policy in Reinforcement Learning

no code implementations NeurIPS 2019 Xiang Li, Wenhao Yang, Zhihua Zhang

We propose and study a general framework for regularized Markov decision processes (MDPs) where the goal is to find an optimal policy that maximizes the expected discounted total reward plus a policy regularization term.

reinforcement-learning Reinforcement Learning (RL)

Inpatient2Vec: Medical Representation Learning for Inpatients

no code implementations18 Apr 2019 Ying Wang, Xiao Xu, Tao Jin, Xiang Li, Guotong Xie, Jian-Min Wang

In addition, for unordered medical activity set, existing medical RL methods utilize a simple pooling strategy, which would result in indistinguishable contributions among the activities for learning.

Representation Learning Semantic Similarity +1

Towards Photo-Realistic Visible Watermark Removal with Conditional Generative Adversarial Networks

no code implementations30 May 2019 Xiang Li, Chan Lu, Danni Cheng, Wei-Hong Li, Mei Cao, Bo Liu, Jiechao Ma, Wei-Shi Zheng

Visible watermark plays an important role in image copyright protection and the robustness of a visible watermark to an attack is shown to be essential.

Image-to-Image Translation

Cross-view Relation Networks for Mammogram Mass Detection

no code implementations1 Jul 2019 Jiechao Ma, Sen Liang, Xiang Li, Hongwei Li, Bjoern H. Menze, Rongguo Zhang, Wei-Shi Zheng

Mammogram is the most effective imaging modality for the mass lesion detection of breast cancer at the early stage.

Lesion Detection Relation

ASCNet: Adaptive-Scale Convolutional Neural Networks for Multi-Scale Feature Learning

no code implementations7 Jul 2019 Mo Zhang, Jie Zhao, Xiang Li, Li Zhang, Quanzheng Li

Such pixel-level dilation rates produce optimal receptive fields so that the information of objects with different sizes can be extracted at the corresponding scale.

Semantic Segmentation

Scalable Semi-Supervised SVM via Triply Stochastic Gradients

no code implementations26 Jul 2019 Xiang Geng, Bin Gu, Xiang Li, Wanli Shi, Guansheng Zheng, Heng Huang

Specifically, to handle two types of data instances involved in S$^3$VM, TSGS$^3$VM samples a labeled instance and an unlabeled instance as well with the random features in each iteration to compute a triply stochastic gradient.

Quadruply Stochastic Gradients for Large Scale Nonlinear Semi-Supervised AUC Optimization

no code implementations29 Jul 2019 Wanli Shi, Bin Gu, Xiang Li, Xiang Geng, Heng Huang

To address this problem, in this paper, we propose a novel scalable quadruply stochastic gradient algorithm (QSG-S2AUC) for nonlinear semi-supervised AUC optimization.

Stochastic Optimization

Invasiveness Prediction of Pulmonary Adenocarcinomas Using Deep Feature Fusion Networks

no code implementations21 Sep 2019 Xiang Li, Jiechao Ma, Hongwei Li

In this study, we explore the fusion of the two kinds of features and claim that radiomics features can be complementary to deep-learning features.

Computed Tomography (CT)

Predicting Alzheimer's Disease by Hierarchical Graph Convolution from Positron Emission Tomography Imaging

no code implementations1 Oct 2019 Jiaming Guo, Wei Qiu, Xiang Li, Xuandong Zhao, Ning Guo, Quanzheng Li

Imaging-based early diagnosis of Alzheimer Disease (AD) has become an effective approach, especially by using nuclear medicine imaging techniques such as Positron Emission Topography (PET).

Clustering Graph Clustering

Multi-label Detection and Classification of Red Blood Cells in Microscopic Images

no code implementations7 Oct 2019 Wei Qiu, Jiaming Guo, Xiang Li, Mengjia Xu, Mo Zhang, Ning Guo, Quanzheng Li

As the six networks are trained with image patches consisting of both individual cells and touching/overlapping cells, they can effectively recognize cell types that are presented in multi-instance image samples.

Cell Detection Classification +2

Improving One-shot NAS by Suppressing the Posterior Fading

no code implementations CVPR 2020 Xiang Li, Chen Lin, Chuming Li, Ming Sun, Wei Wu, Junjie Yan, Wanli Ouyang

In this paper, we analyse existing weight sharing one-shot NAS approaches from a Bayesian point of view and identify the posterior fading problem, which compromises the effectiveness of shared weights.

Neural Architecture Search object-detection +2

Density-Aware Convolutional Networks with Context Encoding for Airborne LiDAR Point Cloud Classification

no code implementations14 Oct 2019 Xiang Li, Mingyang Wang, Congcong Wen, Lingjing Wang, Nan Zhou, Yi Fang

Based on this convolution module, we further developed a multi-scale fully convolutional neural network with downsampling and upsampling blocks to enable hierarchical point feature learning.

3D Point Cloud Classification General Classification +1

Communication-Efficient Local Decentralized SGD Methods

no code implementations21 Oct 2019 Xiang Li, Wenhao Yang, Shusen Wang, Zhihua Zhang

Recently, the technique of local updates is a powerful tool in centralized settings to improve communication efficiency via periodical communication.

Distributed Computing

Prediction stability as a criterion in active learning

no code implementations27 Oct 2019 Junyu Liu, Xiang Li, Jin Wang, Jiqiang Zhou, Jianxiong Shen

Recent breakthroughs made by deep learning rely heavily on large number of annotated samples.

Active Learning

A General Early-Stopping Module for Crowdsourced Ranking

no code implementations4 Nov 2019 Caihua Shan, Leong Hou U, Nikos Mamoulis, Reynold Cheng, Xiang Li

The number of microtasks depends on the budget allocated for the problem.

An End-to-End Deep RL Framework for Task Arrangement in Crowdsourcing Platforms

no code implementations4 Nov 2019 Caihua Shan, Nikos Mamoulis, Reynold Cheng, Guoliang Li, Xiang Li, Yuqiu Qian

In this paper, we propose a Deep Reinforcement Learning (RL) framework for task arrangement, which is a critical problem for the success of crowdsourcing platforms.

Reinforcement Learning (RL)

Generative adversarial networks (GAN) based efficient sampling of chemical space for inverse design of inorganic materials

no code implementations12 Nov 2019 Yabo Dan, Yong Zhao, Xiang Li, Shaobo Li, Ming Hu, Jianjun Hu

The percentage of chemically valid (charge neutral and electronegativity balanced) samples out of all generated ones reaches 84. 5% by our GAN when trained with materials from ICSD even though no such chemical rules are explicitly enforced in our GAN model, indicating its capability to learn implicit chemical composition rules.

Generative Adversarial Network valid

Safe Sample Screening for Robust Support Vector Machine

no code implementations24 Dec 2019 Zhou Zhai, Bin Gu, Xiang Li, Heng Huang

To address this challenge, in this paper, we propose two safe sample screening rules for RSVM based on the framework of concave-convex procedure (CCCP).

Time-constrained Adaptive Influence Maximization

no code implementations6 Jan 2020 Guangmo Tong, Ruiqi Wang, Chen Ling, Zheng Dong, Xiang Li

The well-known influence maximization problem aims at maximizing the influence of one information cascade in a social network by selecting appropriate seed users prior to the diffusion process.

Social and Information Networks

Communication-Efficient Distributed SVD via Local Power Iterations

1 code implementation19 Feb 2020 Xiang Li, Shusen Wang, Kun Chen, Zhihua Zhang

As a practical surrogate of OPT, sign-fixing, which uses a diagonal matrix with $\pm 1$ entries as weights, has better computation complexity and stability in experiments.

Distributed Computing

TEDL: A Text Encryption Method Based on Deep Learning

1 code implementation9 Mar 2020 Xiang Li, Peng Wang

Firstly, both communication parties establish a word vector table by training a deep learning model according to specified hyperparameters.

Adversarial Multimodal Representation Learning for Click-Through Rate Prediction

no code implementations7 Mar 2020 Xiang Li, Chao Wang, Jiwei Tan, Xiaoyi Zeng, Dan Ou, Bo Zheng

Finally, we achieve the multimodal item representations by combining both modality-specific and modality-invariant representations.

Click-Through Rate Prediction Representation Learning

Airborne LiDAR Point Cloud Classification with Graph Attention Convolution Neural Network

no code implementations20 Apr 2020 Congcong Wen, Xiang Li, Xiaojing Yao, Ling Peng, Tianhe Chi

To achieve point cloud classification, previous studies proposed point cloud deep learning models that can directly process raw point clouds based on PointNet-like architectures.

General Classification Graph Attention +2

Height estimation from single aerial images using a deep ordinal regression network

no code implementations4 Jun 2020 Xiang Li, Mingyang Wang, Yi Fang

Previous researches have extensively studied the problem of height estimation from aerial images based on stereo or multi-view image matching.

Change Detection Management +1

Gait Recognition via Semi-supervised Disentangled Representation Learning to Identity and Covariate Features

no code implementations CVPR 2020 Xiang Li, Yasushi Makihara, Chi Xu, Yasushi Yagi, Mingwu Ren

Existing gait recognition approaches typically focus on learning identity features that are invariant to covariates (e. g., the carrying status, clothing, walking speed, and viewing angle) and seldom involve learning features from the covariate aspect, which may lead to failure modes when variations due to the covariate overwhelm those due to the identity.

Disentanglement Gait Recognition

CAST: A Correlation-based Adaptive Spectral Clustering Algorithm on Multi-scale Data

1 code implementation8 Jun 2020 Xiang Li, Ben Kao, Caihua Shan, Dawei Yin, Martin Ester

We study the problem of applying spectral clustering to cluster multi-scale data, which is data whose clusters are of various sizes and densities.

Clustering

Geometry-Aware Segmentation of Remote Sensing Images via Implicit Height Estimation

no code implementations10 Jun 2020 Xiang Li, Lingjing Wang, Yi Fang

Recent studies have shown the benefits of using additional elevation data (e. g., DSM) for enhancing the performance of the semantic segmentation of aerial images.

Segmentation Semantic Segmentation

Unsupervised Learning of 3D Point Set Registration

no code implementations11 Jun 2020 Lingjing Wang, Xiang Li, Yi Fang

Moreover, for a pair of source and target point sets, existing deep learning mechanisms require explicitly designed encoders to extract both deep spatial features from unstructured point clouds and their spatial correlation representation, which is further fed to a decoder to regress the desired geometric transformation for point set alignment.

Point Cloud Registration

Few-shot Object Detection on Remote Sensing Images

no code implementations14 Jun 2020 Jingyu Deng, Xiang Li, Yi Fang

In this paper, we introduce a few-shot learning-based method for object detection on remote sensing images where only a few annotated samples are provided for the unseen object categories.

Few-Shot Learning Few-Shot Object Detection +2

Xiaomi's Submissions for IWSLT 2020 Open Domain Translation Task

no code implementations WS 2020 Yuhui Sun, Mengxue Guo, Xiang Li, Jianwei Cui, Bin Wang

This paper describes the Xiaomi{'}s submissions to the IWSLT20 shared open domain translation task for Chinese{\textless}-{\textgreater}Japanese language pair.

Domain Adaptation Knowledge Distillation +2

DeepTracking-Net: 3D Tracking with Unsupervised Learning of Continuous Flow

no code implementations24 Jun 2020 Shuaihang Yuan, Xiang Li, Yi Fang

In this paper, we aim at handling the problem of 3D tracking, which provides the tracking of the consecutive frames of 3D shapes.

3DMotion-Net: Learning Continuous Flow Function for 3D Motion Prediction

no code implementations24 Jun 2020 Shuaihang Yuan, Xiang Li, Anthony Tzes, Yi Fang

To approach this problem, we propose a self-supervised approach that leverages the power of the deep neural network to learn a continuous flow function of 3D point clouds that can predict temporally consistent future motions and naturally bring out the correspondences among consecutive point clouds at the same time.

motion prediction

Unsupervised Learning of Global Registration of Temporal Sequence of Point Clouds

no code implementations17 Jun 2020 Lingjing Wang, Yi Shi, Xiang Li, Yi Fang

Global registration of point clouds aims to find an optimal alignment of a sequence of 2D or 3D point sets.

Simulating multi-exit evacuation using deep reinforcement learning

no code implementations11 Jul 2020 Dong Xu, Xiao Huang, Joseph Mango, Xiang Li, Zhenlong Li

We propose a multi-exit evacuation simulation based on Deep Reinforcement Learning (DRL), referred to as the MultiExit-DRL, which involves in a Deep Neural Network (DNN) framework to facilitate state-to-action mapping.

reinforcement-learning Reinforcement Learning (RL)

GP-Aligner: Unsupervised Non-rigid Groupwise Point Set Registration Based On Optimized Group Latent Descriptor

no code implementations25 Jul 2020 Lingjing Wang, Xiang Li, Yi Fang

More specifically, for a given group we first define an optimizable Group Latent Descriptor (GLD) to characterize the gruopwise relationship among a group of point sets.

Computational Efficiency

Robust Image Matching By Dynamic Feature Selection

no code implementations13 Aug 2020 Hao Huang, Jianchun Chen, Xiang Li, Lingjing Wang, Yi Fang

Recent works introduce convolutional neural networks (CNNs) to extract high-level feature maps and find correspondences through feature matching.

Decision Making feature selection +1

Object Detection in the Context of Mobile Augmented Reality

no code implementations15 Aug 2020 Xiang Li, Yuan Tian, Fuyao Zhang, Shuxue Quan, Yi Xu

Ordinary object detection approaches process information from the images only, and they are oblivious to the camera pose with regard to the environment and the scale of the environment.

Object object-detection +1

Federated Doubly Stochastic Kernel Learning for Vertically Partitioned Data

no code implementations14 Aug 2020 Bin Gu, Zhiyuan Dang, Xiang Li, Heng Huang

In this paper, we focus on nonlinear learning with kernels, and propose a federated doubly stochastic kernel learning (FDSKL) algorithm for vertically partitioned data.

BIG-bench Machine Learning Federated Learning

Gait Recognition from a Single Image using a Phase-Aware Gait Cycle Reconstruction Network

no code implementations ECCV 2020 Chi Xu, Yasushi Makihara, Xiang Li, Yasushi Yagi, Jianfeng Lu

Specifically, a phase estimation network is introduced for the input single image, and the gait cycle reconstruction network exploits the estimated phase to mitigate the dependence of an encoded feature on the phase of that single image.

Gait Recognition

Unsupervised Partial Point Set Registration via Joint Shape Completion and Registration

no code implementations11 Sep 2020 Xiang Li, Lingjing Wang, Yi Fang

To bridge the performance gaps between partial point set registration with full point set registration, we proposed to incorporate a shape completion network to benefit the registration process.

Looking Beyond Sentence-Level Natural Language Inference for Downstream Tasks

no code implementations18 Sep 2020 Anshuman Mishra, Dhruvesh Patel, Aparna Vijayakumar, Xiang Li, Pavan Kapanipathi, Kartik Talamadupula

In recent years, the Natural Language Inference (NLI) task has garnered significant attention, with new datasets and models achieving near human-level performance on it.

Natural Language Inference Question Answering +2

Deep-3DAligner: Unsupervised 3D Point Set Registration Network With Optimizable Latent Vector

no code implementations29 Sep 2020 Lingjing Wang, Xiang Li, Yi Fang

Point cloud registration is the process of aligning a pair of point sets via searching for a geometric transformation.

Point Cloud Registration

Box-To-Box Transformation for Modeling Joint Hierarchies

no code implementations1 Jan 2021 Shib Sankar Dasgupta, Xiang Li, Michael Boratko, Dongxu Zhang, Andrew McCallum

In Patel et al. (2020), the authors demonstrate that only the transitive reduction is required, and further extend box embeddings to capture joint hierarchies by augmenting the graph with new nodes.

Knowledge Graphs

Reading Comprehension as Natural Language Inference: A Semantic Analysis

no code implementations4 Oct 2020 Anshuman Mishra, Dhruvesh Patel, Aparna Vijayakumar, Xiang Li, Pavan Kapanipathi, Kartik Talamadupula

We transform the one of the largest available MRC dataset (RACE) to an NLI form, and compare the performances of a state-of-the-art model (RoBERTa) on both these forms.

Natural Language Inference Question Answering +1

3D Meta Point Signature: Learning to Learn 3D Point Signature for 3D Dense Shape Correspondence

no code implementations21 Oct 2020 Hao Huang, Lingjing Wang, Xiang Li, Yi Fang

In this paper, we propose a novel meta-learning based 3D point signature model, named 3Dmetapointsignature (MEPS) network, that is capable of learning robust point signatures in 3D shapes.

3D Dense Shape Correspondence Meta-Learning

3D Meta-Registration: Learning to Learn Registration of 3D Point Clouds

no code implementations22 Oct 2020 Lingjing Wang, Yu Hao, Xiang Li, Yi Fang

In this paper, we propose a meta-learning based 3D registration model, named 3D Meta-Registration, that is capable of rapidly adapting and well generalizing to new 3D registration tasks for unseen 3D point clouds.

Meta-Learning Point Cloud Registration

Retrieving and ranking short medical questions with two stages neural matching model

no code implementations16 Nov 2020 Xiang Li, Xinyu Fu, Zheng Lu, Ruibin Bai, Uwe Aickelin, Peiming Ge, Gong Liu

Internet hospital is a rising business thanks to recent advances in mobile web technology and high demand of health care services.

Information Retrieval Retrieval

Convolutional Neural Network for Behavioral Modeling and Predistortion of Wideband Power Amplifiers

no code implementations20 May 2020 Xin Hu, Zhijun Liu, Xiaofei Yu, Yulong Zhao, WenHua Chen, Biao Hu, Xuekun Du, Xiang Li, Mohamed Helaoui, Weidong Wang, Fadhel M. Ghannouchi

We design a pre-designed filter using the convolutional layer to extract the basis functions required for the PA forward or reverse modeling.

Deep Metric Learning-based Image Retrieval System for Chest Radiograph and its Clinical Applications in COVID-19

no code implementations26 Nov 2020 Aoxiao Zhong, Xiang Li, Dufan Wu, Hui Ren, Kyungsang Kim, YoungGon Kim, Varun Buch, Nir Neumark, Bernardo Bizzo, Won Young Tak, Soo Young Park, Yu Rim Lee, Min Kyu Kang, Jung Gil Park, Byung Seok Kim, Woo Jin Chung, Ning Guo, Ittai Dayan, Mannudeep K. Kalra, Quanzheng Li

These results demonstrate our deep metric learning based image retrieval model is highly efficient in the CXR retrieval, diagnosis and prognosis, and thus has great clinical value for the treatment and management of COVID-19 patients.

Image Retrieval Management +2

Physics Guided Machine Learning Methods for Hydrology

no code implementations2 Dec 2020 Ankush Khandelwal, Shaoming Xu, Xiang Li, Xiaowei Jia, Michael Stienbach, Christopher Duffy, John Nieber, Vipin Kumar

The goal of this work is to incorporate our understanding of physical processes and constraints in hydrology into machine learning algorithms, and thus bridge the performance gap while reducing the need for large amounts of data compared to traditional data-driven approaches.

BIG-bench Machine Learning

Cost-Effective Federated Learning Design

no code implementations15 Dec 2020 Bing Luo, Xiang Li, Shiqiang Wang, Jianwei Huang, Leandros Tassiulas

In this paper, we analyze how to design adaptive FL that optimally chooses these essential control variables to minimize the total cost while ensuring convergence.

Federated Learning

Leveraging Meta-path Contexts for Classification in Heterogeneous Information Networks

no code implementations18 Dec 2020 Xiang Li, Danhao Ding, Ben Kao, Yizhou Sun, Nikos Mamoulis

A heterogeneous information network (HIN) has as vertices objects of different types and as edges the relations between objects, which are also of various types.

Classification General Classification +2

Towards Cross-Modal Forgery Detection and Localization on Live Surveillance Videos

no code implementations4 Jan 2021 Yong Huang, Xiang Li, Wei Wang, Tao Jiang, Qian Zhang

Traditional video forensics approaches can detect and localize forgery traces in each video frame using computationally-expensive spatial-temporal analysis, while falling short in real-time verification of live video feeds.

Time Series Analysis Video Forensics Cryptography and Security

Delayed Projection Techniques for Linearly Constrained Problems: Convergence Rates, Acceleration, and Applications

no code implementations5 Jan 2021 Xiang Li, Zhihua Zhang

In this work, we study a novel class of projection-based algorithms for linearly constrained problems (LCPs) which have a lot of applications in statistics, optimization, and machine learning.

Distributed Optimization Privacy Preserving

SceneRec: Scene-Based Graph Neural Networks for Recommender Systems

no code implementations12 Feb 2021 Gang Wang, Ziyi Guo, Xiang Li, Dawei Yin, Shuai Ma

Collaborative filtering has been largely used to advance modern recommender systems to predict user preference.

Collaborative Filtering Recommendation Systems +1

vrCAPTCHA: Exploring CAPTCHA Designs in Virtual Reality

no code implementations24 Feb 2021 Xiang Li, Yuzheng Chen, Rakesh Patibanda, Florian 'Floyd' Mueller

With the popularity of online access in virtual reality (VR) devices, it will become important to investigate exclusive and interactive CAPTCHA (Completely Automated Public Turing test to tell Computers and Humans Apart) designs for VR devices.

Human-Computer Interaction

FedPower: Privacy-Preserving Distributed Eigenspace Estimation

no code implementations1 Mar 2021 Xiao Guo, Xiang Li, Xiangyu Chang, Shusen Wang, Zhihua Zhang

The low communication power and the possible privacy breaches of data make the computation of eigenspace challenging.

BIG-bench Machine Learning Dimensionality Reduction +2

Development and Validation of a Deep Learning Model for Prediction of Severe Outcomes in Suspected COVID-19 Infection

no code implementations21 Mar 2021 Varun Buch, Aoxiao Zhong, Xiang Li, Marcio Aloisio Bezerra Cavalcanti Rockenbach, Dufan Wu, Hui Ren, Jiahui Guan, Andrew Liteplo, Sayon Dutta, Ittai Dayan, Quanzheng Li

Predictive risk scores for COVID-19 severe outcomes ("CO-RISK" score) were derived from model output and evaluated on the testing dataset, as well as compared to human performance.

Management

Towards Multi-Scale Style Control for Expressive Speech Synthesis

no code implementations8 Apr 2021 Xiang Li, Changhe Song, Jingbei Li, Zhiyong Wu, Jia Jia, Helen Meng

This paper introduces a multi-scale speech style modeling method for end-to-end expressive speech synthesis.

Expressive Speech Synthesis Style Transfer

Deep Attributed Network Representation Learning via Attribute Enhanced Neighborhood

no code implementations12 Apr 2021 Cong Li, Min Shi, Bo Qu, Xiang Li

In this paper, we propose a deep attributed network representation learning via attribute enhanced neighborhood (DANRL-ANE) model to improve the robustness and effectiveness of node representations.

Attribute Link Prediction +2

Improving Tree-Structured Decoder Training for Code Generation via Mutual Learning

no code implementations31 May 2021 Binbin Xie, Jinsong Su, Yubin Ge, Xiang Li, Jianwei Cui, Junfeng Yao, Bin Wang

However, such a decoder only exploits the preorder traversal based preceding actions, which are insufficient to ensure correct action predictions.

Code Generation

Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual Context in Multimodal Machine Translation

no code implementations ACL 2021 Zhiyong Wu, Lingpeng Kong, Wei Bi, Xiang Li, Ben Kao

A neural multimodal machine translation (MMT) system is one that aims to perform better translation by extending conventional text-only translation models with multimodal information.

Multimodal Machine Translation Translation

Augmented 2D-TAN: A Two-stage Approach for Human-centric Spatio-Temporal Video Grounding

no code implementations20 Jun 2021 Chaolei Tan, Zihang Lin, Jian-Fang Hu, Xiang Li, Wei-Shi Zheng

We propose an effective two-stage approach to tackle the problem of language-based Human-centric Spatio-Temporal Video Grounding (HC-STVG) task.

Spatio-Temporal Video Grounding Video Grounding

A More Compact Object Detector Head Network with Feature Enhancement and Relational Reasoning

no code implementations28 Jun 2021 Wenchao Zhang, Chong Fu, Xiangshi Chang, Tengfei Zhao, Xiang Li, Chiu-Wing Sham

Without losing generality, we can also build a more lighter head network for other multi-stage detectors by assembling our method.

object-detection Object Detection +1

Learn to Learn Metric Space for Few-Shot Segmentation of 3D Shapes

no code implementations7 Jul 2021 Xiang Li, Lingjing Wang, Yi Fang

To achieve this, we treat the shape segmentation as a point labeling problem in the metric space.

Meta-Learning Segmentation

RigNet: Repetitive Image Guided Network for Depth Completion

no code implementations29 Jul 2021 Zhiqiang Yan, Kun Wang, Xiang Li, Zhenyu Zhang, Jun Li, Jian Yang

However, blurry guidance in the image and unclear structure in the depth still impede the performance of the image guided frameworks.

Depth Completion Depth Estimation +1

Fine-Grained Few Shot Learning with Foreground Object Transformation

no code implementations13 Sep 2021 Chaofei Wang, Shiji Song, Qisen Yang, Xiang Li, Gao Huang

As a data augmentation method, FOT can be conveniently applied to any existing few shot learning algorithm and greatly improve its performance on FG-FSL tasks.

Data Augmentation Few-Shot Learning +2

Cost-Effective Federated Learning in Mobile Edge Networks

no code implementations12 Sep 2021 Bing Luo, Xiang Li, Shiqiang Wang, Jianwei Huang, Leandros Tassiulas

Federated learning (FL) is a distributed learning paradigm that enables a large number of mobile devices to collaboratively learn a model under the coordination of a central server without sharing their raw data.

Federated Learning

Robust Inverse Framework using Knowledge-guided Self-Supervised Learning: An application to Hydrology

no code implementations14 Sep 2021 Rahul Ghosh, Arvind Renganathan, Kshitij Tayal, Xiang Li, Ankush Khandelwal, Xiaowei Jia, Chris Duffy, John Neiber, Vipin Kumar

Furthermore, we show that KGSSL is relatively more robust to distortion than baseline methods, and outperforms the baseline model by 35\% when plugging in KGSSL inferred characteristics.

Self-Supervised Learning

Student Helping Teacher: Teacher Evolution via Self-Knowledge Distillation

no code implementations1 Oct 2021 Zheng Li, Xiang Li, Lingfeng Yang, Jian Yang, Zhigeng Pan

Knowledge distillation usually transfers the knowledge from a pre-trained cumbersome teacher network to a compact student network, which follows the classical teacher-teaching-student paradigm.

Self-Knowledge Distillation

Feasible Architecture for Quantum Fully Convolutional Networks

no code implementations5 Oct 2021 Yusui Chen, Wenhao Hu, Xiang Li

Fully convolutional networks are robust in performing semantic segmentation, with many applications from signal processing to computer vision.

Semantic Segmentation

Video Instance Segmentation by Instance Flow Assembly

no code implementations20 Oct 2021 Xiang Li, Jinglu Wang, Xiao Li, Yan Lu

Instance segmentation is a challenging task aiming at classifying and segmenting all object instances of specific classes.

Instance Segmentation Object +3

Improved Loss Function-Based Prediction Method of Extreme Temperatures in Greenhouses

no code implementations2 Nov 2021 Liao Qu, Shuaiqi Huang, Yunsong Jia, Xiang Li

By increasing the weight of extreme temperature samples and reducing the possibility of misjudging extreme temperature as normal, the proposed loss function can enhance the prediction results in extreme situations.

融合情感分析的隐式反问句识别模型(Implicit Rhetorical Questions Recognition Model Combined with Sentiment Analysis)

no code implementations CCL 2021 Xiang Li, Chengwei Liu, Xiaoxu Zhu

“反问是现代汉语中一种常用的修辞手法, 根据是否含有反问标记可分为显式反问句与隐式反问句。其中隐式反问句表达的情感更为丰富, 表现形式也十分复杂, 对隐式反问句的识别更具挑战性。本文首先扩充了汉语反问句语料库, 语料库规模达到10000余句, 接着针对隐式反问句的特点, 提出了一种融合情感分析的隐式反问句识别模型。模型考虑了句子的语义信息, 上下文信息, 并借助情感分析任务辅助识别隐式反问句。实验结果表明, 本文提出的模型在隐式反问句识别任务上取得了良好的性能。”

Sentiment Analysis

Automated Pulmonary Embolism Detection from CTPA Images Using an End-to-End Convolutional Neural Network

no code implementations10 Nov 2021 Yi Lin, Jianchao Su, Xiang Wang, Xiang Li, Jingen Liu, Kwang-Ting Cheng, Xin Yang

We have evaluated our approach using the 20 CTPA test dataset from the PE challenge, achieving a sensitivity of 78. 9%, 80. 7% and 80. 7% at 2 false positives per volume at 0mm, 2mm and 5mm localization error, which is superior to the state-of-the-art methods.

Pulmonary Embolism Detection

Reinforcement Learning Enhanced Explainer for Graph Neural Networks

no code implementations NeurIPS 2021 Caihua Shan, Yifei Shen, Yao Zhang, Xiang Li, Dongsheng Li

To address these issues, we propose a RL-enhanced GNN explainer, RG-Explainer, which consists of three main components: starting point selection, iterative graph generation and stopping criteria learning.

Combinatorial Optimization Graph Generation +2

Hybrid Instance-aware Temporal Fusion for Online Video Instance Segmentation

no code implementations3 Dec 2021 Xiang Li, Jinglu Wang, Xiao Li, Yan Lu

Based on this representation, we introduce a cropping-free temporal fusion approach to model the temporal consistency between video frames.

Image Segmentation Instance Segmentation +2

Knowledge Distillation for Object Detection via Rank Mimicking and Prediction-guided Feature Imitation

no code implementations9 Dec 2021 Gang Li, Xiang Li, Yujie Wang, Shanshan Zhang, Yichao Wu, Ding Liang

Based on the two observations, we propose Rank Mimicking (RM) and Prediction-guided Feature Imitation (PFI) for distilling one-stage detectors, respectively.

Image Classification Knowledge Distillation +3

Scalable Deep Graph Clustering with Random-walk based Self-supervised Learning

no code implementations31 Dec 2021 Xiang Li, Dong Li, Ruoming Jin, Gagan Agrawal, Rajiv Ramnath

Though other methods (particularly those based on Laplacian Smoothing) have reported better accuracy, a fundamental limitation of all the work is a lack of scalability.

Clustering Deep Clustering +3

Forgery Attack Detection in Surveillance Video Streams Using Wi-Fi Channel State Information

no code implementations24 Jan 2022 Yong Huang, Xiang Li, Wei Wang, Tao Jiang, Qian Zhang

The cybersecurity breaches expose surveillance video streams to forgery attacks, under which authentic streams are falsified to hide unauthorized activities.

Time Series Time Series Analysis +1

Multi-modal Sensor Fusion for Auto Driving Perception: A Survey

no code implementations6 Feb 2022 Keli Huang, Botian Shi, Xiang Li, Xin Li, Siyuan Huang, Yikang Li

Multi-modal fusion is a fundamental task for the perception of an autonomous driving system, which has recently intrigued many researchers.

Autonomous Driving object-detection +3

WSLRec: Weakly Supervised Learning for Neural Sequential Recommendation Models

no code implementations28 Feb 2022 Jingwei Zhuo, Bin Liu, Xiang Li, Han Zhu, Xiaoqiang Zhu

Motivated by the observation that model-free methods like behavioral retargeting (BR) and item-based collaborative filtering (ItemCF) hit different parts of the user-item relevance compared to neural sequential recommendation models, we propose a novel model-agnostic training approach called WSLRec, which adopts a three-stage framework: pre-training, top-$k$ mining, and fine-tuning.

Collaborative Filtering Sequential Recommendation +1

A density peaks clustering algorithm with sparse search and K-d tree

no code implementations2 Mar 2022 Yunxiao Shan, Shu Li, Fuxiang Li, Yuxin Cui, Shuai Li, Ming Zhou, Xiang Li

It is proved that the algorithm can effectively reduce the computational complexity of the original DPC from $O(n^2K)$ to $O(n(n^{1-1/K}+k))$.

2k Clustering

Multi-Modal Masked Pre-Training for Monocular Panoramic Depth Completion

no code implementations18 Mar 2022 Zhiqiang Yan, Xiang Li, Kun Wang, Zhenyu Zhang, Jun Li, Jian Yang

To deal with the PDC task, we train a deep network that takes both depth and image as inputs for the dense panoramic depth recovery.

Depth Completion Transfer Learning

A Learning Convolutional Neural Network Approach for Network Robustness Prediction

no code implementations20 Mar 2022 Yang Lou, Ruizi Wu, Junli Li, Lin Wang, Xiang Li, Guanrong Chen

Extensive experimental studies on both synthetic and real-world networks, both directed and undirected, demonstrate that 1) the proposed LFR-CNN performs better than other two state-of-the-art prediction methods, with significantly lower prediction errors; 2) LFR-CNN is insensitive to the variation of the network size, which significantly extends its applicability; 3) although LFR-CNN needs more time to perform feature learning, it can achieve accurate prediction faster than attack simulations; 4) LFR-CNN not only can accurately predict network robustness, but also provides a good indicator for connectivity robustness, better than the classical spectral measures.

EEG based Emotion Recognition: A Tutorial and Review

no code implementations16 Mar 2022 Xiang Li, Yazhou Zhang, Prayag Tiwari, Dawei Song, Bin Hu, Meihong Yang, Zhigang Zhao, Neeraj Kumar, Pekka Marttinen

Hence, in this paper, we review from the perspective of researchers who try to take the first step on this topic.

EEG Emotion Recognition

AutoFAS: Automatic Feature and Architecture Selection for Pre-Ranking System

no code implementations19 May 2022 Xiang Li, Xiaojiang Zhou, Yao Xiao, Peihao Huang, Dayao Chen, Sheng Chen, Yunsen Xian

Industrial search and recommendation systems mostly follow the classic multi-stage information retrieval paradigm: matching, pre-ranking, ranking, and re-ranking stages.

Information Retrieval Neural Architecture Search +3

Nest Your Adaptive Algorithm for Parameter-Agnostic Nonconvex Minimax Optimization

no code implementations1 Jun 2022 Junchi Yang, Xiang Li, Niao He

Adaptive algorithms like AdaGrad and AMSGrad are successful in nonconvex optimization owing to their parameter-agnostic ability -- requiring no a priori knowledge about problem-specific parameters nor tuning of learning rates.

Bear the Query in Mind: Visual Grounding with Query-conditioned Convolution

no code implementations18 Jun 2022 Chonghan Chen, Qi Jiang, Chih-Hao Wang, Noel Chen, Haohan Wang, Xiang Li, Bhiksha Raj

With our proposed QCM, the downstream fusion module receives visual features that are more discriminative and focused on the desired object described in the expression, leading to more accurate predictions.

Visual Grounding

Dual Power Spectrum Manifold and Toeplitz HPD Manifold: Enhancement and Analysis for Matrix CFAR Detection

no code implementations24 Jun 2022 Hao Wu, Yongqiang Cheng, Xixi Chen, Zheng Yang, Xiang Li, Hongqiang Wang

These advantages benefit from the geometry of the Toeplitz Hermitian positive definite (HPD) manifold $\mathcal{M}_{\mathcal{T}H_{++}}$, but the sophisticated geometry also results in some challenges for geometric detectors, such as the implementation of the enhanced detector to improve the SCR (signal-to-clutter ratio) and the analysis of the detection performance.

Neural Neural Textures Make Sim2Real Consistent

no code implementations27 Jun 2022 Ryan Burgert, Jinghuan Shang, Xiang Li, Michael Ryoo

Unpaired image translation algorithms can be used for sim2real tasks, but many fail to generate temporally consistent results.

Translation

RAW-GNN: RAndom Walk Aggregation based Graph Neural Network

no code implementations28 Jun 2022 Di Jin, Rui Wang, Meng Ge, Dongxiao He, Xiang Li, Wei Lin, Weixiong Zhang

Due to the homophily assumption of Graph Convolutional Networks (GCNs) that these methods use, they are not suitable for heterophily graphs where nodes with different labels or dissimilar attributes tend to be adjacent.

Representation Learning

BIT-Xiaomi’s System for AutoSimTrans 2022

no code implementations NAACL (AutoSimTrans) 2022 Mengge Liu, Xiang Li, Bao Chen, Yanzhi Tian, Tianwei Lan, Silin Li, Yuhang Guo, Jian Luan, Bin Wang

This system paper describes the BIT-Xiaomi simultaneous translation system for Autosimtrans 2022 simultaneous translation challenge.

Chunking Data Augmentation +1

Online Video Instance Segmentation via Robust Context Fusion

no code implementations12 Jul 2022 Xiang Li, Jinglu Wang, Xiaohao Xu, Bhiksha Raj, Yan Lu

We propose a robust context fusion network to tackle VIS in an online fashion, which predicts instance segmentation frame-by-frame with a few preceding frames.

Instance Segmentation Segmentation +2

Towards Cross-speaker Reading Style Transfer on Audiobook Dataset

no code implementations10 Aug 2022 Xiang Li, Changhe Song, Xianhao Wei, Zhiyong Wu, Jia Jia, Helen Meng

This paper aims to introduce a chunk-wise multi-scale cross-speaker style model to capture both the global genre and the local prosody in audiobook speeches.

Style Transfer

Conv-Adapter: Exploring Parameter Efficient Transfer Learning for ConvNets

no code implementations15 Aug 2022 Hao Chen, Ran Tao, Han Zhang, Yidong Wang, Xiang Li, Wei Ye, Jindong Wang, Guosheng Hu, Marios Savvides

Beyond classification, Conv-Adapter can generalize to detection and segmentation tasks with more than 50% reduction of parameters but comparable performance to the traditional full fine-tuning.

Transfer Learning

Learning a General Clause-to-Clause Relationships for Enhancing Emotion-Cause Pair Extraction

no code implementations29 Aug 2022 Hang Chen, Xinyu Yang, Xiang Li

To learn it applicably, we propose a general clause-level encoding model named EA-GAT comprising E-GAT and Activation Sort.

Emotion-Cause Pair Extraction

Probabilistic Inverse Modeling: An Application in Hydrology

no code implementations12 Oct 2022 Somya Sharma, Rahul Ghosh, Arvind Renganathan, Xiang Li, Snigdhansu Chatterjee, John Nieber, Christopher Duffy, Vipin Kumar

We propose uncertainty based learning method that offers 6\% improvement in $R^2$ for streamflow prediction (forward modeling) from inverse model inferred basin characteristic estimates, 17\% reduction in uncertainty (40\% in presence of noise) and 4\% higher coverage rate for basin characteristics.

Contact2Grasp: 3D Grasp Synthesis via Hand-Object Contact Constraint

no code implementations17 Oct 2022 Haoming Li, Xinzhuo Lin, Yang Zhou, Xiang Li, Yuchi Huo, Jiming Chen, Qi Ye

To tackle the challenge, we introduce an intermediate variable for grasp contact areas to constrain the grasp generation; in other words, we factorize the mapping into two sequential stages by assuming that grasping poses are fully constrained given contact maps: 1) we first learn contact map distributions to generate the potential contact maps for grasps; 2) then learn a mapping from the contact maps to the grasping poses.

Grasp Generation Object +2

Mini-Batch Learning Strategies for modeling long term temporal dependencies: A study in environmental applications

1 code implementation15 Oct 2022 Shaoming Xu, Ankush Khandelwal, Xiang Li, Xiaowei Jia, Licheng Liu, Jared Willard, Rahul Ghosh, Kelly Cutler, Michael Steinbach, Christopher Duffy, John Nieber, Vipin Kumar

To address this issue, we further propose a new strategy which augments a training segment with an initial value of the target variable from the timestep right before the starting of the training segment.

Pixel-Aligned Non-parametric Hand Mesh Reconstruction

no code implementations17 Oct 2022 Shijian Jiang, Guwen Han, Danhang Tang, Yang Zhou, Xiang Li, Jiming Chen, Qi Ye

The decoder aggregate both local image features in pixels and geometric features in vertices.

Disentangled and Robust Representation Learning for Bragging Classification in Social Media

no code implementations27 Oct 2022 Xiang Li, Yucheng Zhou

Researching bragging behavior on social media arouses interest of computational (socio) linguists.

Representation Learning

TiAda: A Time-scale Adaptive Algorithm for Nonconvex Minimax Optimization

no code implementations31 Oct 2022 Xiang Li, Junchi Yang, Niao He

Adaptive gradient methods have shown their ability to adjust the stepsizes on the fly in a parameter-agnostic manner, and empirically achieve faster convergence for solving minimization problems.

DesNet: Decomposed Scale-Consistent Network for Unsupervised Depth Completion

no code implementations20 Nov 2022 Zhiqiang Yan, Kun Wang, Xiang Li, Zhenyu Zhang, Jun Li, Jian Yang

Unsupervised depth completion aims to recover dense depth from the sparse one without using the ground-truth annotation.

Depth Completion Depth Estimation +2

Panoramic Video Salient Object Detection with Ambisonic Audio Guidance

no code implementations26 Nov 2022 Xiang Li, Haoyuan Cao, Shijie Zhao, Junlin Li, Li Zhang, Bhiksha Raj

In this paper, we aim to tackle the video salient object detection problem for panoramic videos, with their corresponding ambisonic audios.

Object object-detection +2

Near-optimal Policy Identification in Active Reinforcement Learning

no code implementations19 Dec 2022 Xiang Li, Viraj Mehta, Johannes Kirschner, Ian Char, Willie Neiswanger, Jeff Schneider, Andreas Krause, Ilija Bogunovic

Many real-world reinforcement learning tasks require control of complex dynamical systems that involve both costly data acquisition processes and large state spaces.

Bayesian Optimization reinforcement-learning +1

Recovering Surveillance Video Using RF Cues

no code implementations27 Dec 2022 Xiang Li, Rabih Younes

We make use of an auto-encoder-based structure to extract pose features from WiFi frames.

Video Generation

SeeGera: Self-supervised Semi-implicit Graph Variational Auto-encoders with Masking

no code implementations29 Jan 2023 Xiang Li, Tiandi Ye, Caihua Shan, Dongsheng Li, Ming Gao

In this paper, to comprehensively enhance the performance of generative graph SSL against other GCL models on both unsupervised and supervised learning tasks, we propose the SeeGera model, which is based on the family of self-supervised variational graph auto-encoder (VGAE).

Contrastive Learning Self-Supervised Learning +1

Recurrent Structure Attention Guidance for Depth Super-Resolution

no code implementations31 Jan 2023 Jiayi Yuan, Haobo Jiang, Xiang Li, Jianjun Qian, Jun Li, Jian Yang

Second, instead of the coarse concatenation guidance, we propose a recurrent structure attention block, which iteratively utilizes the latest depth estimation and the image features to jointly select clear patterns and boundaries, aiming at providing refined guidance for accurate depth recovery.

Depth Estimation Super-Resolution

Structure Flow-Guided Network for Real Depth Super-Resolution

no code implementations31 Jan 2023 Jiayi Yuan, Haobo Jiang, Xiang Li, Jianjun Qian, Jun Li, Jian Yang

Specifically, our framework consists of a cross-modality flow-guided upsampling network (CFUNet) and a flow-enhanced pyramid edge attention network (PEANet).

Depth Estimation Depth Prediction +1

Online Statistical Inference for Nonlinear Stochastic Approximation with Markovian Data

no code implementations15 Feb 2023 Xiang Li, Jiadong Liang, Zhihua Zhang

We study the statistical inference of nonlinear stochastic approximation algorithms utilizing a single trajectory of Markovian data.

Q-Learning valid

Statistical Analysis of Karcher Means for Random Restricted PSD Matrices

no code implementations24 Feb 2023 Hengchao Chen, Xiang Li, Qiang Sun

Non-asymptotic statistical analysis is often missing for modern geometry-aware machine learning algorithms due to the possibly intricate non-linear manifold structure.

HopFIR: Hop-wise GraphFormer with Intragroup Joint Refinement for 3D Human Pose Estimation

no code implementations ICCV 2023 Kai Zhai, Qiang Nie, Bo Ouyang, Xiang Li, Shanlin Yang

The HGF module groups the joints by k-hop neighbors and applies a hopwise transformer-like attention mechanism to these groups to discover latent joint synergies.

3D Human Pose Estimation

Pushing One Pair of Labels Apart Each Time in Multi-Label Learning: From Single Positive to Full Labels

no code implementations28 Feb 2023 Xiang Li, Xinrui Wang, Songcan Chen

In Multi-Label Learning (MLL), it is extremely challenging to accurately annotate every appearing object due to expensive costs and limited knowledge.

Multi-Label Learning

Non-aligned supervision for Real Image Dehazing

no code implementations8 Mar 2023 Junkai Fan, Fei Guo, Jianjun Qian, Xiang Li, Jun Li, Jian Yang

In particular, we explore a non-alignment scenario that a clear reference image, unaligned with the input hazy image, is utilized to supervise the dehazing network.

Image Dehazing

Variance-aware robust reinforcement learning with linear function approximation under heavy-tailed rewards

no code implementations9 Mar 2023 Xiang Li, Qiang Sun

Building upon AdaOFUL, we propose VARA for linear MDPs, which achieves a tighter variance-aware regret bound of $\widetilde{O}(d\sqrt{HG^*K})$.

Decision Making regression +2

Sufficient Control of Complex Networks

no code implementations10 Mar 2023 Xiang Li, Guoqi Li, Leitao Gao, Beibei Li, Gaoxi Xiao

In this paper, we propose to study on sufficient control of complex networks which is to control a sufficiently large portion of the network, where only the quantity of controllable nodes matters.

Digital staining in optical microscopy using deep learning -- a review

no code implementations14 Mar 2023 Lucas Kreiss, Shaowei Jiang, Xiang Li, Shiqi Xu, Kevin C. Zhou, Alexander Mühlberg, Kyung Chul Lee, Kanghyun Kim, Amey Chaware, Michael Ando, Laura Barisoni, Seung Ah Lee, Guoan Zheng, Kyle Lafata, Oliver Friedrich, Roarke Horstmeyer

Until recently, conventional biochemical staining had the undisputed status as well-established benchmark for most biomedical problems related to clinical diagnostics, fundamental research and biotechnology.

Specificity

When Brain-inspired AI Meets AGI

no code implementations28 Mar 2023 Lin Zhao, Lu Zhang, Zihao Wu, Yuzhong Chen, Haixing Dai, Xiaowei Yu, Zhengliang Liu, Tuo Zhang, Xintao Hu, Xi Jiang, Xiang Li, Dajiang Zhu, Dinggang Shen, Tianming Liu

Artificial General Intelligence (AGI) has been a long-standing goal of humanity, with the aim of creating machines capable of performing any intellectual task that humans can do.

In-Context Learning

Summary of ChatGPT-Related Research and Perspective Towards the Future of Large Language Models

no code implementations4 Apr 2023 Yiheng Liu, Tianle Han, Siyuan Ma, Jiayue Zhang, Yuanyuan Yang, Jiaming Tian, Hao He, Antong Li, Mengshen He, Zhengliang Liu, Zihao Wu, Lin Zhao, Dajiang Zhu, Xiang Li, Ning Qiang, Dingang Shen, Tianming Liu, Bao Ge

This paper presents a comprehensive survey of ChatGPT-related (GPT-3. 5 and GPT-4) research, state-of-the-art large language models (LLM) from the GPT series, and their prospective applications across diverse domains.

Exploring the Trade-Offs: Unified Large Language Models vs Local Fine-Tuned Models for Highly-Specific Radiology NLI Task

no code implementations18 Apr 2023 Zihao Wu, Lu Zhang, Chao Cao, Xiaowei Yu, Haixing Dai, Chong Ma, Zhengliang Liu, Lin Zhao, Gang Li, Wei Liu, Quanzheng Li, Dinggang Shen, Xiang Li, Dajiang Zhu, Tianming Liu

To this end, in this study, we evaluate the performance of ChatGPT/GPT-4 on a radiology NLI task and compare it to other models fine-tuned specifically on task-related data samples.

Specificity Task 2

ChatABL: Abductive Learning via Natural Language Interaction with ChatGPT

no code implementations21 Apr 2023 Tianyang Zhong, Yaonai Wei, Li Yang, Zihao Wu, Zhengliang Liu, Xiaozheng Wei, Wenjun Li, Junjie Yao, Chong Ma, Xiang Li, Dajiang Zhu, Xi Jiang, Junwei Han, Dinggang Shen, Tianming Liu, Tuo Zhang

The proposed method uses the strengths of LLMs' understanding and logical reasoning to correct the incomplete logical facts for optimizing the performance of perceptual module, by summarizing and reorganizing reasoning rules represented in natural language format.

Decipherment Logical Reasoning

Differentiate ChatGPT-generated and Human-written Medical Texts

no code implementations23 Apr 2023 Wenxiong Liao, Zhengliang Liu, Haixing Dai, Shaochen Xu, Zihao Wu, Yiyang Zhang, Xiaoke Huang, Dajiang Zhu, Hongmin Cai, Tianming Liu, Xiang Li

We focus on analyzing the differences between medical texts written by human experts and generated by ChatGPT, and designing machine learning workflows to effectively detect and differentiate medical texts generated by ChatGPT.

Asymptotic Behaviors and Phase Transitions in Projected Stochastic Approximation: A Jump Diffusion Approach

no code implementations25 Apr 2023 Jiadong Liang, Yuze Han, Xiang Li, Zhihua Zhang

Additionally, we propose the Debiased LPSA (DLPSA) as a practical application of our jump diffusion approximation result.

Prompt Engineering for Healthcare: Methodologies and Applications

no code implementations28 Apr 2023 Jiaqi Wang, Enze Shi, Sigang Yu, Zihao Wu, Chong Ma, Haixing Dai, Qiushi Yang, Yanqing Kang, Jinru Wu, Huawen Hu, Chenxi Yue, Haiyang Zhang, Yiheng Liu, Yi Pan, Zhengliang Liu, Lichao Sun, Xiang Li, Bao Ge, Xi Jiang, Dajiang Zhu, Yixuan Yuan, Dinggang Shen, Tianming Liu, Shu Zhang

Prompt engineering is a critical technique in the field of natural language processing that involves designing and optimizing the prompts used to input information into models, aiming to enhance their performance on specific tasks.

Machine Translation Prompt Engineering +3

Instruction-ViT: Multi-Modal Prompts for Instruction Learning in ViT

no code implementations29 Apr 2023 Zhenxiang Xiao, Yuzhong Chen, Lu Zhang, Junjie Yao, Zihao Wu, Xiaowei Yu, Yi Pan, Lin Zhao, Chong Ma, Xinyu Liu, Wei Liu, Xiang Li, Yixuan Yuan, Dinggang Shen, Dajiang Zhu, Tianming Liu, Xi Jiang

Prompts have been proven to play a crucial role in large language models, and in recent years, vision models have also been using prompts to improve scalability for multiple downstream tasks.

Image Classification

FreeLM: Fine-Tuning-Free Language Model

no code implementations2 May 2023 Xiang Li, Xin Jiang, Xuying Meng, Aixin Sun, Yequan Wang

FreeLM outperforms large models e. g., GPT-3 and InstructGPT, on a range of language understanding tasks in experiments.

Language Modelling

SPP-CNN: An Efficient Framework for Network Robustness Prediction

no code implementations13 May 2023 Chengpei Wu, Yang Lou, Lin Wang, Junli Li, Xiang Li, Guanrong Chen

This paper addresses the robustness of a network to sustain its connectivity and controllability against malicious attacks.

Variable Radiance Field for Real-Life Category-Specifc Reconstruction from Single Image

no code implementations8 Jun 2023 Kun Wang, Zhiqiang Yan, Zhenyu Zhang, Xiang Li, Jun Li, Jian Yang

Our key contributions are: (1) We parameterize the geometry and appearance of the object using a multi-scale global feature extractor, which avoids frequent point-wise feature retrieval and camera dependency.

Contrastive Learning Object +1

Artificial General Intelligence for Medical Imaging

no code implementations8 Jun 2023 Xiang Li, Lu Zhang, Zihao Wu, Zhengliang Liu, Lin Zhao, Yixuan Yuan, Jun Liu, Gang Li, Dajiang Zhu, Pingkun Yan, Quanzheng Li, Wei Liu, Tianming Liu, Dinggang Shen

In this review, we explore the potential applications of Artificial General Intelligence (AGI) models in healthcare, focusing on foundational Large Language Models (LLMs), Large Vision Models, and Large Multimodal Models.

A Graph Transformer-Driven Approach for Network Robustness Learning

no code implementations12 Jun 2023 Yu Zhang, Jia Li, Jie Ding, Xiang Li

Learning and analysis of network robustness, including controllability robustness and connectivity robustness, is critical for various networked systems against attacks.

Shapley Value on Probabilistic Classifiers

no code implementations12 Jun 2023 Xiang Li, Haocheng Xia, Jinfei Liu

Data valuation has become an increasingly significant discipline in data science due to the economic value of data.

Data Valuation

Boosting Language Models Reasoning with Chain-of-Knowledge Prompting

no code implementations10 Jun 2023 Jianing Wang, Qiushi Sun, Nuo Chen, Xiang Li, Ming Gao

To mitigate this brittleness, we propose a novel Chain-of-Knowledge (CoK) prompting, where we aim at eliciting LLMs to generate explicit pieces of knowledge evidence in the form of structure triple.

Arithmetic Reasoning

TransCoder: Towards Unified Transferable Code Representation Learning Inspired by Human Skills

no code implementations23 May 2023 Qiushi Sun, Nuo Chen, Jianing Wang, Xiang Li, Ming Gao

To tackle the issue, in this paper, we present TransCoder, a unified Transferable fine-tuning strategy for Code representation learning.

Clone Detection Code Summarization +2

AD-AutoGPT: An Autonomous GPT for Alzheimer's Disease Infodemiology

no code implementations16 Jun 2023 Haixing Dai, Yiwei Li, Zhengliang Liu, Lin Zhao, Zihao Wu, Suhang Song, Ye Shen, Dajiang Zhu, Xiang Li, Sheng Li, Xiaobai Yao, Lu Shi, Quanzheng Li, Zhuo Chen, Donglan Zhang, Gengchen Mai, Tianming Liu

In this pioneering study, inspired by AutoGPT, the state-of-the-art open-source application based on the GPT-4 large language model, we develop a novel tool called AD-AutoGPT which can conduct data collection, processing, and analysis about complex health narratives of Alzheimer's Disease in an autonomous manner via users' textual prompts.

Language Modelling Large Language Model

Segment Anything Model (SAM) for Radiation Oncology

no code implementations20 Jun 2023 Lian Zhang, Zhengliang Liu, Lu Zhang, Zihao Wu, Xiaowei Yu, Jason Holmes, Hongying Feng, Haixing Dai, Xiang Li, Quanzheng Li, Dajiang Zhu, Tianming Liu, Wei Liu

Given that SAM, a model pre-trained purely on natural images, can handle the delineation of OARs from medical images with clinically acceptable accuracy, these results highlight SAM's robust generalization capabilities with consistent accuracy in automatic segmentation for radiotherapy.

Segmentation

Higher-order Graph Attention Network for Stock Selection with Joint Analysis

no code implementations27 Jun 2023 Yang Qiao, Yiping Xia, Xiang Li, Zheng Li, Yan Ge

H-GAT is able to capture higher-order structures and jointly incorporate factors of fundamental analysis with factors of technical analysis.

Graph Attention Relation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.