Search Results for author: Yan Li

Found 104 papers, 26 papers with code

Sparse Additive Model using Symmetric Nonnegative Definite Smoothers

no code implementations 8 Sep 2014 Yan Li

We introduce a new algorithm, called the adaptive sparse backfitting algorithm, for solving the high-dimensional Sparse Additive Model (SpAM) using symmetric, non-negative definite smoothers.

Variable Selection

Comment on "Clustering by fast search and find of density peaks"

no code implementations 18 Jan 2015 Shuliang Wang, Dakui Wang, Caoyuan Li, Yan Li

For any data set to be clustered, the most reasonable value of d_c can be objectively calculated from the data set by using our proposed method.

Clustering

The Knowledge Gradient Policy Using A Sparse Additive Belief Model

no code implementations 18 Mar 2015 Yan Li, Han Liu, Warren Powell

We propose a sequential learning policy for noisy discrete global optimization and ranking and selection (R&S) problems with high dimensional sparse belief functions, where there are hundreds or even thousands of features, but only a small portion of these features contain explanatory power.

Face Video Retrieval With Image Query via Hashing Across Euclidean Space and Riemannian Manifold

no code implementations CVPR 2015 Yan Li, Ruiping Wang, Zhiwu Huang, Shiguang Shan, Xilin Chen

Retrieving videos of a specific person given his/her face image as query becomes more and more appealing for applications like smart movie fast-forwards and suspect searching.

Retrieval Video Retrieval

A Knowledge Gradient Policy for Sequencing Experiments to Identify the Structure of RNA Molecules Using a Sparse Additive Belief Model

no code implementations 6 Aug 2015 Yan Li, Kristofer G. Reyes, Jorge Vazquez-Anderson, Yingfei Wang, Lydia M. Contreras, Warren B. Powell

We present a sparse knowledge gradient (SpKG) algorithm for adaptively selecting the targeted regions within a large RNA molecule to identify which regions are most amenable to interactions with other molecules.

Two Birds, One Stone: Jointly Learning Binary Code for Large-Scale Face Image Retrieval and Attributes Prediction

no code implementations ICCV 2015 Yan Li, Ruiping Wang, Haomiao Liu, Huajie Jiang, Shiguang Shan, Xilin Chen

In this way, the learned binary codes can be applied to not only fine-grained face image retrieval, but also facial attributes prediction, which is the very innovation of this work, just like killing two birds with one stone.

Face Image Retrieval Retrieval

Audio Recording Device Identification Based on Deep Learning

no code implementations 18 Feb 2016 Simeng Qi, Zheng Huang, Yan Li, Shaopei Shi

The identification result shows that extracting a feature vector from the noise of each device and identifying devices with deep learning techniques is viable and performs well.

Speech Enhancement

M$^2$S-Net: Multi-Modal Similarity Metric Learning based Deep Convolutional Network for Answer Selection

1 code implementation 19 Apr 2016 Lingxun Meng, Yan Li

Recent works using artificial neural networks based on distributed word representation greatly boost performance on various natural language processing tasks, especially the answer selection problem.

Answer Selection Metric Learning +1

Skipping Word: A Character-Sequential Representation based Framework for Question Answering

no code implementations 2 Sep 2016 Lingxun Meng, Yan Li, Mengyi Liu, Peng Shu

Recent works using artificial neural networks based on word distributed representation greatly boost the performance of various natural language learning tasks, especially question answering.

Answer Selection

Linear Convergence of SVRG in Statistical Estimation

no code implementations 7 Nov 2016 Chao Qu, Yan Li, Huan Xu

SVRG and its variants are among the state-of-the-art optimization algorithms for large-scale machine learning problems.

SAGA and Restricted Strong Convexity

no code implementations 19 Feb 2017 Chao Qu, Yan Li, Huan Xu

SAGA is a fast incremental gradient method for the finite-sum problem, and its effectiveness has been tested on a vast number of applications.

regression

Solving Multi-Objective MDP with Lexicographic Preference: An application to stochastic planning with multiple quantile objective

no code implementations 10 May 2017 Yan Li, Zhaohan Sun

In the most common settings of a Markov Decision Process (MDP), an agent evaluates a policy based on the expectation of the (discounted) sum of rewards.

Autonomous Driving

Machine Learning for Survival Analysis: A Survey

no code implementations 15 Aug 2017 Ping Wang, Yan Li, Chandan K. Reddy

We hope that this paper will provide a more thorough understanding of the recent advances in survival analysis and offer some guidelines on applying these approaches to solve new problems that arise in applications with censored data.

BIG-bench Machine Learning Survival Analysis

Fast Global Convergence via Landscape of Empirical Loss

no code implementations 13 Feb 2018 Chao Qu, Yan Li, Huan Xu

While optimizing convex objective (loss) functions has been a powerhouse for machine learning for at least two decades, non-convex loss functions have recently attracted fast-growing interest, due to many desirable properties such as superior robustness and classification accuracy compared with their convex counterparts.

General Classification

A Framework in CRM Customer Lifecycle: Identify Downward Trend and Potential Issues Detection

no code implementations 25 Feb 2018 Kun Hu, Zhe Li, Ying Liu, Luyin Cheng, Qi Yang, Yan Li

In the first prediction part, we focus on predicting the downward trend, which is an earlier stage of the customer lifecycle compared to churn.

Causal Inference Management +1

Mixed Supervised Object Detection with Robust Objectness Transfer

no code implementations 27 Feb 2018 Yan Li, Junge Zhang, Kaiqi Huang, Jian-Guo Zhang

Different from previous MSD methods that directly transfer the pre-trained object detectors from existing categories to new categories, we propose a more reasonable and robust objectness transfer approach for MSD.

Multiple Instance Learning Object +2

Discriminative Learning of Latent Features for Zero-Shot Recognition

1 code implementation CVPR 2018 Yan Li, Junge Zhang, Jian-Guo Zhang, Kaiqi Huang

In this work, we review existing methods and demonstrate the necessity of learning discriminative representations for both visual and semantic instances of ZSL.

Zero-Shot Learning

Gated Recurrent Unit Based Acoustic Modeling with Future Context

no code implementations 18 May 2018 Jie Li, Xiaorui Wang, Yuan-Yuan Zhao, Yan Li

The use of future contextual information is typically shown to be helpful for acoustic modeling.

Communication-Efficient Projection-Free Algorithm for Distributed Optimization

no code implementations 20 May 2018 Yan Li, Chao Qu, Huan Xu

We demonstrate this advantage and show that the linear oracle complexity can be reduced to almost the same order of magnitude as the communication complexity, when the feasible set is polyhedral.

Distributed Optimization Matrix Completion

Projection-Free Algorithms in Statistical Estimation

no code implementations 20 May 2018 Yan Li, Chao Qu, Huan Xu

Recent work has reduced the gradient evaluation complexity of the FW algorithm to $\log(\frac{1}{\epsilon})$ for smooth and strongly convex objectives.

Object detection and tracking benchmark in industry based on improved correlation filter

no code implementations 11 Jun 2018 Shangzhen Luan, Yan Li, Xiaodi Wang, Baochang Zhang

Real-time object detection and tracking have been shown to be the basis of intelligent production for Industry 4.0 applications.

object-detection Real-Time Object Detection

Improving Gated Recurrent Unit Based Acoustic Modeling with Batch Normalization and Enlarged Context

no code implementations 26 Nov 2018 Jie Li, Yahui Shan, Xiaorui Wang, Yan Li

The use of future contextual information is typically shown to be helpful for acoustic modeling.

Transductive Zero-Shot Learning with Visual Structure Constraint

1 code implementation NeurIPS 2019 Zi-Yu Wan, Dong-Dong Chen, Yan Li, Xingguang Yan, Junge Zhang, Yizhou Yu, Jing Liao

Based on the observation that visual features of test instances can be separated into different clusters, we propose a new visual structure constraint on class centers for transductive ZSL, to improve the generality of the projection function (i.e., alleviate the above domain shift problem).

Zero-Shot Learning

Imitating Targets from all sides: An Unsupervised Transfer Learning method for Person Re-identification

no code implementations 10 Apr 2019 Jiajie Tian, Zhu Teng, Rui Li, Yan Li, Baopeng Zhang, Jianping Fan

Person re-identification (Re-ID) models usually show limited performance when they are trained on one dataset and tested on another, due to the inter-dataset bias (e.g., completely different identities and backgrounds) and the intra-dataset difference (e.g., camera invariance).

Person Re-Identification Transfer Learning

Strain engineering of epitaxial oxide heterostructures beyond substrate limitations

no code implementations 3 May 2019 Xiong Deng, Chao Chen, Deyang Chen, Xiangbin Cai, Xiaozhe Yin, Chao Xu, Fei Sun, Caiwen Li, Yan Li, Han Xu, Mao Ye, Guo Tian, Zhen Fan, Zhipeng Hou, Minghui Qin, Yu Chen, Zhenlin Luo, Xubing Lu, Guofu Zhou, Lang Chen, Ning Wang, Ye Zhu, Xingsen Gao, Jun-Ming Liu

The limitation of commercially available single-crystal substrates and the lack of continuous strain tunability preclude the ability to take full advantage of strain engineering for further exploring novel properties and exhaustively studying fundamental physics in complex oxides.

Materials Science

Similarity Grouping-Guided Neural Network Modeling for Maritime Time Series Prediction

no code implementations 13 May 2019 Yan Li, Ryan Wen Liu, Zhao Liu, Jingxian Liu

Reliable and accurate prediction of time series plays a crucial role in the maritime industry, for example in economic investment, transportation planning, and port planning and design.

Time Series Time Series Prediction

Inductive Bias of Gradient Descent based Adversarial Training on Separable Data

no code implementations 7 Jun 2019 Yan Li, Ethan X. Fang, Huan Xu, Tuo Zhao

Specifically, we show that when the adversarial perturbation during training has bounded $\ell_2$-norm, the classifier learned by gradient descent based adversarial training converges in direction to the maximum $\ell_2$-norm margin classifier at the rate of $\tilde{\mathcal{O}}(1/\sqrt{T})$, significantly faster than the rate $\mathcal{O}(1/\log T)$ of training with clean data.

Binary Classification Inductive Bias

Towards Understanding the Importance of Noise in Training Neural Networks

no code implementations 7 Sep 2019 Mo Zhou, Tianyi Liu, Yan Li, Dachao Lin, Enlu Zhou, Tuo Zhao

Ample empirical evidence has corroborated that noise plays a crucial role in the effective and efficient training of neural networks.

LightGCN: Simplifying and Powering Graph Convolution Network for Recommendation

16 code implementations 6 Feb 2020 Xiangnan He, Kuan Deng, Xiang Wang, Yan Li, Yongdong Zhang, Meng Wang

We propose a new model named LightGCN, including only the most essential component in GCN -- neighborhood aggregation -- for collaborative filtering.

Collaborative Filtering Graph Classification +1

Bilinear Graph Neural Network with Neighbor Interactions

1 code implementation 10 Feb 2020 Hongmin Zhu, Fuli Feng, Xiangnan He, Xiang Wang, Yan Li, Kai Zheng, Yongdong Zhang

We term this framework as Bilinear Graph Neural Network (BGNN), which improves GNN representation ability with bilinear interactions between neighbor nodes.

General Classification Node Classification

Deep Learning-based End-to-end Diagnosis System for Avascular Necrosis of Femoral Head

no code implementations 12 Feb 2020 Yang Li, Yan Li, Hua Tian

To the best of our knowledge, this study is the first research on the prospective use of a deep learning-based diagnosis system for AVNFH by conducting two pilot studies representing real-world application scenarios.

Decision Making Head Detection

Pursuing Sources of Heterogeneity in Modeling Clustered Population

no code implementations 10 Mar 2020 Yan Li, Chun Yu, Yize Zhao, Robert H. Aseltine, Weixin Yao, Kun Chen

We clarify the concepts of the source of heterogeneity that account for potential scale differences of the clusters and propose a regularized finite mixture effects regression to achieve heterogeneity pursuit and feature selection simultaneously.

feature selection regression

Implicit Bias of Gradient Descent based Adversarial Training on Separable Data

no code implementations ICLR 2020 Yan Li, Ethan X. Fang, Huan Xu, Tuo Zhao

Specifically, we show that for any fixed iteration $T$, when the adversarial perturbation during training has proper bounded L2 norm, the classifier learned by gradient descent based adversarial training converges in direction to the maximum L2 norm margin classifier at the rate of $O(1/\sqrt{T})$, significantly faster than the rate $O(1/\log T)$ of training with clean data.

Binary Classification

How to Retrain Recommender System? A Sequential Meta-Learning Method

1 code implementation 27 May 2020 Yang Zhang, Fuli Feng, Chenxu Wang, Xiangnan He, Meng Wang, Yan Li, Yongdong Zhang

Nevertheless, normal training on new data alone may easily cause overfitting and forgetting issues, since the new data is of a smaller scale and contains less information on long-term user preference.

Meta-Learning Recommendation Systems

A Physics Model-Guided Online Bayesian Framework for Energy Management of Extended Range Electric Delivery Vehicles

no code implementations 1 Jun 2020 Pengyue Wang, Yan Li, Shashi Shekhar, William F. Northrop

A physics model-guided online Bayesian framework is described and validated on a large number of in-use driving samples of EREVs used for last-mile package delivery.

energy management Management

Adversarial Attacks on Reinforcement Learning based Energy Management Systems of Extended Range Electric Delivery Vehicles

no code implementations 1 Jun 2020 Pengyue Wang, Yan Li, Shashi Shekhar, William F. Northrop

Adversarial examples were first investigated in the area of computer vision: by adding some carefully designed "noise" to the original input image, a perturbed image that a human cannot distinguish from the original can easily fool a well-trained classifier.

energy management Management

Residual Network Based Direct Synthesis of EM Structures: A Study on One-to-One Transformers

no code implementations 25 Aug 2020 David Munzer, Siawpeng Er, Minshuo Chen, Yan Li, Naga S. Mannem, Tuo Zhao, Hua Wang

We propose using machine learning models for the direct synthesis of on-chip electromagnetic (EM) passive structures to enable rapid or even automated designs and optimizations of RF/mm-Wave circuits.

BIG-bench Machine Learning

MA-Net: A Multi-Scale Attention Network for Liver and Tumor Segmentation

3 code implementations IEEE Access 2020 Tongle Fan, Guanglei Wang, Yan Li, Hongrui Wang

In recent years, a large number of U-Net variants based on multi-scale feature fusion have been proposed to improve performance in medical image segmentation.

Image Segmentation Medical Image Segmentation +2

Estimates of the early EM emission from compact binary mergers

no code implementations 9 Feb 2021 Yan Li, Rong-Feng Shen

We estimate their luminosities and time scales as functions of the chirp mass which is the most readily constrained parameter from the gravitational wave detections of these events.

High Energy Astrophysical Phenomena

IFoodCloud: A Platform for Real-time Sentiment Analysis of Public Opinion about Food Safety in China

no code implementations 17 Feb 2021 Dachuan Zhang, Haoyang Zhang, Zhisheng Wei, Yan Li, Zhiheng Mao, Chunmeng He, Haorui Ma, Xin Zeng, Xiaoling Xie, Xingran Kou, Bingwen Zhang

The Internet contains a wealth of public opinion on food safety, including views on food adulteration, food-borne diseases, agricultural pollution, irregular food distribution, and food production issues.

Sentiment Analysis Sentiment Classification

Noisy Gradient Descent Converges to Flat Minima for Nonconvex Matrix Factorization

no code implementations 24 Feb 2021 Tianyi Liu, Yan Li, Song Wei, Enlu Zhou, Tuo Zhao

Ample empirical evidence has corroborated the importance of noise in nonconvex optimization problems.

Statistically-Robust Clustering Techniques for Mapping Spatial Hotspots: A Survey

2 code implementations 22 Mar 2021 Yiqun Xie, Shashi Shekhar, Yan Li

Mapping of spatial hotspots, i.e., regions with significantly higher rates of generating cases of certain events (e.g., disease or crime cases), is an important task in diverse societal domains, including public health, public safety, transportation, agriculture, and environmental science.

Clustering

Vehicle Emissions Prediction with Physics-Aware AI Models: Preliminary Results

no code implementations 2 May 2021 Harish Panneer Selvam, Yan Li, Pengyue Wang, William F. Northrop, Shashi Shekhar

Given an on-board diagnostics (OBD) dataset and a physics-based emissions prediction model, this paper aims to develop an accurate and computationally efficient AI (Artificial Intelligence) method that predicts vehicle emissions.

Permutation Invariant Policy Optimization for Mean-Field Multi-Agent Reinforcement Learning: A Principled Approach

no code implementations 18 May 2021 Yan Li, Lingxiao Wang, Jiachen Yang, Ethan Wang, Zhaoran Wang, Tuo Zhao, Hongyuan Zha

To exploit the permutation invariance therein, we propose the mean-field proximal policy optimization (MF-PPO) algorithm, at the core of which is a permutation-invariant actor-critic neural architecture.

Inductive Bias Multi-agent Reinforcement Learning

Implicit Regularization of Bregman Proximal Point Algorithm and Mirror Descent on Separable Data

no code implementations 15 Aug 2021 Yan Li, Caleb Ju, Ethan X. Fang, Tuo Zhao

For any BPPA instantiated with a fixed Bregman divergence, we provide a lower bound of the margin obtained by BPPA with respect to an arbitrarily chosen norm.

MutualGraphNet: A novel model for motor imagery classification

no code implementations 2 Sep 2021 Yan Li, Ning Zhong, David Taniar, Haolan Zhang

Experiments are conducted on a motor imagery EEG data set, comparing our model with current state-of-the-art approaches; the results suggest that MutualGraphNet is robust enough to learn interpretable features and outperforms these methods.

Classification EEG +1

Single Image Dehazing with An Independent Detail-Recovery Network

no code implementations 22 Sep 2021 Yan Li, De Cheng, Jiande Sun, Dingwen Zhang, Nannan Wang, Xinbo Gao

In this paper, we propose a single image dehazing method with an independent Detail Recovery Network (DRN), which considers capturing the details from the input image over a separate network and then integrates them into a coarse dehazed image.

Image Dehazing Single Image Dehazing

A Principled Permutation Invariant Approach to Mean-Field Multi-Agent Reinforcement Learning

no code implementations 29 Sep 2021 Yan Li, Lingxiao Wang, Jiachen Yang, Ethan Wang, Zhaoran Wang, Tuo Zhao, Hongyuan Zha

To exploit the permutation invariance therein, we propose the mean-field proximal policy optimization (MF-PPO) algorithm, at the core of which is a permutation-invariant actor-critic neural architecture.

Inductive Bias Multi-agent Reinforcement Learning +2

Frequency-aware SGD for Efficient Embedding Learning with Provable Benefits

no code implementations ICLR 2022 Yan Li, Dhruv Choudhary, Xiaohan Wei, Baichuan Yuan, Bhargav Bhushanam, Tuo Zhao, Guanghui Lan

We show that incorporating frequency information of tokens in the embedding learning problems leads to provably efficient algorithms, and demonstrate that common adaptive algorithms implicitly exploit the frequency information to a large extent.

Language Modelling Recommendation Systems

Pessimism Meets Invariance: Provably Efficient Offline Mean-Field Multi-Agent RL

1 code implementation NeurIPS 2021 Minshuo Chen, Yan Li, Ethan Wang, Zhuoran Yang, Zhaoran Wang, Tuo Zhao

Theoretically, under a weak coverage assumption that the experience dataset contains enough information about the optimal policy, we prove that for an episodic mean-field MDP with a horizon $H$ and $N$ training trajectories, SAFARI attains a sub-optimality gap of $\mathcal{O}(H^2d_{\rm eff} /\sqrt{N})$, where $d_{\rm eff}$ is the effective dimension of the function class for parameterizing the value function, but independent of the number of agents.

Multi-agent Reinforcement Learning

DBC-Forest: Deep forest with binning confidence screening

no code implementations 25 Dec 2021 Pengfei Ma, Youxi Wu, Yan Li, Lei Guo, Zhao Li

As a deep learning model, deep confidence screening forest (gcForestcs) has achieved great success in various applications.

Deep Learning Assisted End-to-End Synthesis of mm-Wave Passive Networks with 3D EM Structures: A Study on A Transformer-Based Matching Network

no code implementations 6 Jan 2022 Siawpeng Er, Edward Liu, Minshuo Chen, Yan Li, Yuqi Liu, Tuo Zhao, Hua Wang

This paper presents a deep learning assisted synthesis approach for direct end-to-end generation of RF/mm-wave passive matching networks with 3D EM structures.

OPP-Miner: Order-preserving sequential pattern mining

no code implementations 9 Jan 2022 Youxi Wu, Qian Hu, Yan Li, Lei Guo, Xingquan Zhu, Xindong Wu

To discover patterns, existing methods often convert time series data into another form, such as a nominal/symbolic format, to reduce dimensionality, which inevitably distorts the data values.

Sequential Pattern Mining Time Series +1

Block Policy Mirror Descent

no code implementations 15 Jan 2022 Guanghui Lan, Yan Li, Tuo Zhao

Despite the nonconvex nature of the problem and a partial update rule, we provide a unified analysis for several sampling schemes, and show that BPMD achieves fast linear convergence to global optimality.

reinforcement-learning Reinforcement Learning (RL)

Homotopic Policy Mirror Descent: Policy Convergence, Implicit Regularization, and Improved Sample Complexity

no code implementations 24 Jan 2022 Yan Li, Guanghui Lan, Tuo Zhao

We first establish the global linear convergence of HPMD instantiated with Kullback-Leibler divergence, for both the optimality gap, and a weighted distance to the set of optimal policies.

Policy Gradient Methods

Noise Regularizes Over-parameterized Rank One Matrix Recovery, Provably

no code implementations 7 Feb 2022 Tianyi Liu, Yan Li, Enlu Zhou, Tuo Zhao

We investigate the role of noise in optimization algorithms for learning over-parameterized models.

DSRRTracker: Dynamic Search Region Refinement for Attention-based Siamese Multi-Object Tracking

no code implementations 21 Mar 2022 JiaXu Wan, Hong Zhang, Jin Zhang, Yuan Ding, Yifan Yang, Yan Li, Xuliang Li

Many multi-object tracking (MOT) methods follow the framework of "tracking by detection", which associates the target objects-of-interest based on the detection results.

Multi-Object Tracking

Robust Single Image Dehazing Based on Consistent and Contrast-Assisted Reconstruction

no code implementations 29 Mar 2022 De Cheng, Yan Li, Dingwen Zhang, Nannan Wang, Xinbo Gao, Jiande Sun

To properly address this problem, we propose a novel density-variational learning framework that improves the robustness of the image dehazing model with the help of a variety of negative hazy images, so that it can better deal with various complex hazy scenarios.

Image Dehazing Single Image Dehazing

PIDGeuN: Graph Neural Network-Enabled Transient Dynamics Prediction of Networked Microgrids Through Full-Field Measurement

no code implementations 18 Apr 2022 Yin Yu, Xinyuan Jiang, Daning Huang, Yan Li

A Physics-Informed Dynamic Graph Neural Network (PIDGeuN) is presented to accurately, efficiently and robustly predict the nonlinear transient dynamics of microgrids in the presence of disturbances.

Benchmarking Domain Generalization on EEG-based Emotion Recognition

no code implementations 18 Apr 2022 Yan Li, Hao Chen, Jake Zhao, Haolan Zhang, Jinpeng Li

Specifically, numerous domain adaptation (DA) algorithms have been exploited in the past five years to enhance the generalization of emotion recognition models across subjects.

Benchmarking Domain Generalization +2

Modularized Bilinear Koopman Operator for Modeling and Predicting Transients of Microgrids

no code implementations 6 May 2022 Xinyuan Jiang, Yan Li, Daning Huang

Modularized Koopman Bilinear Form (M-KBF) is presented to model and predict the transient dynamics of microgrids in the presence of disturbances.

Deep Forest with Hashing Screening and Window Screening

no code implementations 25 Jul 2022 Pengfei Ma, Youxi Wu, Yan Li, Lei Guo, He Jiang, Xingquan Zhu, Xindong Wu

To screen out redundant feature vectors, we introduce a hashing screening mechanism for multi-grained scanning and propose a model called HW-Forest which adopts two strategies, hashing screening and window screening.

First-order Policy Optimization for Robust Markov Decision Process

no code implementations 21 Sep 2022 Yan Li, Guanghui Lan, Tuo Zhao

We consider the problem of solving robust Markov decision process (MDP), which involves a set of discounted, finite state, finite action space MDPs with uncertain transition kernels.

Towards Long-Term Time-Series Forecasting: Feature, Pattern, and Distribution

1 code implementation 5 Jan 2023 Yan Li, Xinjiang Lu, Haoyi Xiong, Jian Tang, Jiantao Su, Bo Jin, Dejing Dou

Long-term time-series forecasting (LTTF) has become a pressing demand in many applications, such as wind power supply planning.

Time Series Time Series Forecasting +1

Generative Time Series Forecasting with Diffusion, Denoise, and Disentanglement

1 code implementation 8 Jan 2023 Yan Li, Xinjiang Lu, Yaqing Wang, Dejing Dou

In this work, we propose to address the time series forecasting problem with generative modeling and propose a bidirectional variational auto-encoder (BVAE) equipped with diffusion, denoise, and disentanglement, namely D3VAE.

Denoising Disentanglement +2

Eco-PiNN: A Physics-informed Neural Network for Eco-toll Estimation

1 code implementation 13 Jan 2023 Yan Li, Mingzhou Yang, Matthew Eagon, Majid Farhadloo, Yiqun Xie, William F. Northrop, Shashi Shekhar

The eco-toll estimation problem quantifies the expected environmental cost (e.g., energy consumption, exhaust emissions) for a vehicle to travel along a path.

Chaos to Order: A Label Propagation Perspective on Source-Free Domain Adaptation

no code implementations 20 Jan 2023 Chunwei Wu, Guitao Cao, Yan Li, Xidong Xi, Wenming Cao, Hong Wang

Inspired by this insight, we present Chaos to Order (CtO), a novel approach for SFDA that strives to constrain semantic credibility and propagate label information among target subpopulations.

Clustering Source-Free Domain Adaptation

A Review of the Trends and Challenges in Adopting Natural Language Processing Methods for Education Feedback Analysis

no code implementations 20 Jan 2023 Thanveer Shaik, Xiaohui Tao, Yan Li, Christopher Dann, Jacquie Mcdonald, Petrea Redmond, Linda Galligan

Approaches from the research community for extracting the semantic meaning of emoticons and special characters in feedback, which convey user opinion, are explored, along with the challenges of adopting NLP in education.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +2

Sentiment analysis and opinion mining on educational data: A survey

no code implementations 8 Feb 2023 Thanveer Shaik, Xiaohui Tao, Christopher Dann, Haoran Xie, Yan Li, Linda Galligan

In the education sector, opinion mining is used to listen to student opinions and enhance their learning-teaching practices pedagogically.

Decision Making Negation +4

Exploring the Limits of ChatGPT for Query or Aspect-based Text Summarization

no code implementations 16 Feb 2023 Xianjun Yang, Yan Li, Xinlu Zhang, Haifeng Chen, Wei Cheng

Text summarization has been a crucial problem in natural language processing (NLP) for several decades.

Abstractive Text Summarization

Policy Mirror Descent Inherently Explores Action Space

no code implementations 8 Mar 2023 Yan Li, Guanghui Lan

SPMD with the second evaluation operator, namely truncated on-policy Monte Carlo (TOMC), attains an $\tilde{\mathcal{O}}(\mathcal{H}_{\mathcal{D}}/\epsilon^2)$ sample complexity, where $\mathcal{H}_{\mathcal{D}}$ mildly depends on the effective horizon and the size of the action space with properly chosen Bregman divergence (e.g., Tsallis divergence).

Efficient Exploration General Reinforcement Learning +1

Sequential three-way decisions with a single hidden layer feedforward neural network

1 code implementation 14 Mar 2023 Youxi Wu, Shuhui Cheng, Yan Li, Rongjie Lv, Fan Min

The experimental results verify that STWD-SFNN has a more compact network on structured datasets than other SFNN models, and has better generalization performance than the competitive models.

Anchor Free remote sensing detector based on solving discrete polar coordinate equation

no code implementations 21 Mar 2023 Linfeng Shi, Yan Li, Xi Zhu

Finally, referring to the calculation idea of horizontal IoU, we design a rotating IoU based on the split polar coordinate plane, namely JIoU, which is expressed as the intersection ratio following discretization of the inner ellipse of the rotating bounding box, to solve the correlation between angle and side length in the regression process of the rotating bounding box.

Object object-detection +2

One for Multiple: Physics-informed Synthetic Data Boosts Generalizable Deep Learning for Fast MRI Reconstruction

1 code implementation 25 Jul 2023 Zi Wang, Xiaotong Yu, Chengyan Wang, Weibo Chen, Jiazheng Wang, Ying-Hua Chu, Hongwei Sun, Rushuai Li, Peiyong Li, Fan Yang, Haiwei Han, Taishan Kang, Jianzhong Lin, Chen Yang, Shufu Chang, Zhang Shi, Sha Hua, Yan Li, Juan Hu, Liuhong Zhu, Jianjun Zhou, Meijing Lin, Jiefeng Guo, Congbo Cai, Zhong Chen, Di Guo, Guang Yang, Xiaobo Qu

We demonstrate that training DL models on synthetic data, coupled with enhanced learning techniques, yields in vivo MRI reconstructions comparable to or surpassing those of models trained on matched realistic datasets, reducing the reliance on real-world MRI data by up to 96%.

Medical Diagnosis MRI Reconstruction

First-order Policy Optimization for Robust Policy Evaluation

no code implementations 29 Jul 2023 Yan Li, Guanghui Lan

We adopt a policy optimization viewpoint towards policy evaluation for robust Markov decision process with $\mathrm{s}$-rectangular ambiguity sets.

Cross-view Semantic Alignment for Livestreaming Product Recognition

1 code implementation ICCV 2023 Wenjie Yang, Yiyi Chen, Yan Li, Yanhua Cheng, Xudong Liu, Quan Chen, Han Li

Moreover, a cRoss-vIew semantiC alignmEnt (RICE) model is proposed to learn discriminative instance features from the image and video views of the products.

Contrastive Learning

Cross-Domain Product Representation Learning for Rich-Content E-Commerce

1 code implementation ICCV 2023 Xuehan Bai, Yan Li, Yanhua Cheng, Wenjie Yang, Quan Chen, Han Li

It is the first dataset to cover product pages, short videos, and live streams simultaneously, providing the basis for establishing a unified product representation across different media domains.

Representation Learning

Efficient Pyramid Channel Attention Network for Pathological Myopia Recognition

1 code implementation 17 Sep 2023 Xiaoqing Zhang, Jilu Zhao, Yan Li, Hao Wu, Xiangtian Zhou, Jiang Liu

Moreover, motivated by the recent pretraining-and-finetuning paradigm, we attempt to adapt pre-trained natural image models for PM recognition by freezing them and treating the EPCA and other attention modules as adapters.

KwaiYiiMath: Technical Report

no code implementations 11 Oct 2023 Jiayi Fu, Lei Lin, Xiaoyang Gao, Pengli Liu, Zhengzong Chen, Zhirui Yang, ShengNan Zhang, Xue Zheng, Yan Li, Yuliang Liu, Xucheng Ye, Yiqiao Liao, Chao Liao, Bin Chen, Chengru Song, Junchen Wan, Zijia Lin, Fuzheng Zhang, Zhongyuan Wang, Di Zhang, Kun Gai

Recent advancements in large language models (LLMs) have demonstrated remarkable abilities in handling a variety of natural language processing (NLP) downstream tasks, even on mathematical tasks requiring multi-step reasoning.

Ranked #87 on Arithmetic Reasoning on GSM8K (using extra training data)

Arithmetic Reasoning GSM8K +1

Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher Learning

no code implementations 20 Nov 2023 Yan Li, Weiwei Guo, Xue Yang, Ning Liao, Dunyun He, Jiaqi Zhou, Wenxian Yu

In this paper, we aim to develop an open-vocabulary object detection (OVD) technique for aerial images that scales up the object vocabulary size beyond the training data.

Object object-detection +3

Paragraph-to-Image Generation with Information-Enriched Diffusion Model

1 code implementation 24 Nov 2023 Weijia Wu, Zhuang Li, Yefei He, Mike Zheng Shou, Chunhua Shen, Lele Cheng, Yan Li, Tingting Gao, Di Zhang, Zhongyuan Wang

In this paper, we introduce an information-enriched diffusion model for the paragraph-to-image generation task, termed ParaDiffusion, which transfers the extensive semantic comprehension capabilities of large language models to the task of image generation.

Image Generation Language Modelling +1

DisControlFace: Disentangled Control for Personalized Facial Image Editing

no code implementations 11 Dec 2023 Haozhe Jia, Yan Li, Hengfei Cui, Di Xu, Changpeng Yang, Yuwang Wang, Tao Yu

Our DisControlNet can perform robust editing on any facial image through training on large-scale 2D in-the-wild portraits and also supports low-cost fine-tuning with few additional images to further learn diverse personalized priors of a specific person.

The Method of Detecting Flying Birds in Surveillance Video Based on Their Characteristics

1 code implementation 8 Jan 2024 Ziwei Sun, Zexi Hua, Hengchao Li, Yan Li

To address the characteristics of flying bird objects in surveillance video, namely that single-frame image features are not obvious and that the objects are small in most cases and asymmetric, this paper proposes a Flying Bird Object Detection method in Surveillance Video (FBOD-SV).

Object object-detection +1

Interpretable Short-Term Load Forecasting via Multi-Scale Temporal Decomposition

no code implementations 18 Feb 2024 Yuqi Jiang, Yan Li, Yize Chen

Although typical deep learning models for electricity load forecasting learn the non-linearity of load patterns well and achieve high prediction accuracy, their interpretability is less studied.

Load Forecasting

DragAnything: Motion Control for Anything using Entity Representation

2 code implementations 12 Mar 2024 Weijia Wu, Zhuang Li, YuChao Gu, Rui Zhao, Yefei He, David Junhao Zhang, Mike Zheng Shou, Yan Li, Tingting Gao, Di Zhang

We introduce DragAnything, which utilizes an entity representation to achieve motion control for any object in controllable video generation.

Object Video Generation

Continual All-in-One Adverse Weather Removal with Knowledge Replay on a Unified Network Structure

1 code implementation 12 Mar 2024 De Cheng, Yanling Ji, Dong Gong, Yan Li, Nannan Wang, Junwei Han, Dingwen Zhang

It considers the characteristics of the image restoration task with multiple degenerations in continual learning, and the knowledge for different degenerations can be shared and accumulated in the unified network structure.

Continual Learning Image Restoration +2

SELECTOR: Heterogeneous graph network with convolutional masked autoencoder for multimodal robust prediction of cancer survival

1 code implementation 14 Mar 2024 Liangrui Pan, Yijun Peng, Yan Li, Xiang Wang, Wenjuan Liu, Liwen Xu, Qingchun Liang, Shaoliang Peng

To mitigate the impact of missing features within the modality on prediction accuracy, we devised a convolutional masked autoencoder (CMAE) to process the heterogeneous graph post-feature reconstruction.

Survival Prediction

A Unified and General Framework for Continual Learning

1 code implementation 20 Mar 2024 Zhenyi Wang, Yan Li, Li Shen, Heng Huang

Extensive experiments on CL benchmarks and theoretical analysis demonstrate the effectiveness of the proposed refresh learning.

Continual Learning

A Novel Loss Function-based Support Vector Machine for Binary Classification

no code implementations 25 Mar 2024 Yan Li, Liping Zhang

Previous support vector machines (SVMs), including the $0/1$ loss SVM, hinge loss SVM, ramp loss SVM, truncated pinball loss SVM, and others, overlooked the degree of penalty for correctly classified samples within the margin.

Binary Classification

Stochastic Constrained Decentralized Optimization for Machine Learning with Fewer Data Oracles: a Gradient Sliding Approach

no code implementations 3 Apr 2024 Hoang Huy Nguyen, Yan Li, Tuo Zhao

In modern decentralized applications, communication efficiency and user privacy are the key challenges.

Deep Reinforcement Learning with Smooth Policy

no code implementations ICML 2020 Qianli Shen, Yan Li, Haoming Jiang, Zhaoran Wang, Tuo Zhao

In contrast to policies parameterized by linear or reproducing kernel functions, where simple regularization techniques suffice to control smoothness, neural-network-based reinforcement learning algorithms have no readily available solution for learning a smooth policy.

reinforcement-learning Reinforcement Learning (RL)

Adaptive Feature Discrimination and Denoising for Asymmetric Text Matching

no code implementations COLING 2022 Yan Li, Chenliang Li, Junjun Guo

Asymmetric text matching has become increasingly indispensable for many downstream tasks (e.g., IR and NLP).

Denoising Text Matching
