Search Results for author: Yan Li

Found 123 papers, 32 papers with code

Adaptive Feature Discrimination and Denoising for Asymmetric Text Matching

no code implementations COLING 2022 Yan Li, Chenliang Li, Junjun Guo

Asymmetric text matching has become increasingly indispensable for many downstream tasks (e.g., IR and NLP).

Denoising Text Matching

Deep Reinforcement Learning with Smooth Policy

no code implementations ICML 2020 Qianli Shen, Yan Li, Haoming Jiang, Zhaoran Wang, Tuo Zhao

In contrast to policy parameterized by linear/reproducing kernel functions, where simple regularization techniques suffice to control smoothness, for neural network based reinforcement learning algorithms, there is no readily available solution to learn a smooth policy.

reinforcement-learning Reinforcement Learning +1

A comprehensive study on Blood Cancer detection and classification using Convolutional Neural Network

no code implementations 10 Sep 2024 Md Taimur Ahad, Sajib Bin Mamun, Sumaya Mustofa, Bo Song, Yan Li

The high accuracy in detecting and categorizing blood cancer using CNN suggests that the CNN model is promising for blood cancer detection.

object-detection Object Detection +1

Improving the Precision of CNNs for Magnetic Resonance Spectral Modeling

no code implementations 10 Sep 2024 John LaMaster, Dhritiman Das, Florian Kofler, Jason Crane, Yan Li, Tobias Lasser, Bjoern H Menze

Magnetic resonance spectroscopic imaging is a widely available imaging modality that can non-invasively provide a metabolic profile of the tissue of interest, yet is challenging to integrate clinically.

A Noncontact Technique for Wave Measurement Based on Thermal Stereography and Deep Learning

no code implementations 20 Aug 2024 Deyu Li, Longfei Xiao, Handi Wei, Yan Li, Binghua Zhang

This study proposed a novel technique that combined thermal stereography and deep learning to achieve fully noncontact wave measurements.

Domain Adaptation Stereo Matching

MIDAS: Multi-level Intent, Domain, And Slot Knowledge Distillation for Multi-turn NLU

no code implementations 15 Aug 2024 Yan Li, So-Eon Kim, Seong-Bae Park, Soyeon Caren Han

This paper introduces a novel approach, MIDAS, leveraging a multi-level intent, domain, and slot knowledge distillation for multi-turn NLU.

domain classification Intent Detection +5

ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval

no code implementations 6 Aug 2024 Ruixiang Zhao, Jian Jia, Yan Li, Xuehan Bai, Quan Chen, Han Li, Peng Jiang, Xirong Li

While Automatic Speech Recognition (ASR) text derived from the short or live-stream videos is readily accessible, how to de-noise the excessively noisy text for multimodal representation learning is mostly untouched.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

A reference frame-based microgrid primary control for ensuring global convergence to a periodic orbit

no code implementations 1 Aug 2024 Xinyuan Jiang, Constantino M. Lagoa, Daning Huang, Yan Li

Without the simplifying assumption, however, the steady state being studied is basically a limit cycle with the convergence of its orbit in question.

Spatiotemporal Graph Guided Multi-modal Network for Livestreaming Product Retrieval

1 code implementation 23 Jul 2024 Xiaowan Hu, Yiyi Chen, Yan Li, Minquan Wang, Haoqian Wang, Quan Chen, Han Li, Peng Jiang

The LPR task encompasses three primary dilemmas in real-world scenarios: 1) the recognition of intended products from distractor products present in the background; 2) the video-image heterogeneity that the appearance of products showcased in live streams often deviates substantially from standardized product images in stores; 3) there are numerous confusing products with subtle visual nuances in the shop.

Retrieval

Golden-Retriever: High-Fidelity Agentic Retrieval Augmented Generation for Industrial Knowledge Base

no code implementations 20 Jul 2024 Zhiyu An, Xianzhong Ding, Yen-Chun Fu, Cheng-Chung Chu, Yan Li, Wan Du

This paper introduces Golden-Retriever, designed to efficiently navigate vast industrial knowledge bases, overcoming challenges in traditional LLM fine-tuning and RAG frameworks with domain-specific jargon and context interpretation.

Navigate RAG +1

An Empirical Study on the Fairness of Foundation Models for Multi-Organ Image Segmentation

no code implementations 18 Jun 2024 Qin Li, Yizhe Zhang, Yan Li, Jun Lyu, Meng Liu, Longyu Sun, Mengting Sun, Qirong Li, Wenyue Mao, Xinran Wu, Yajing Zhang, Yinghua Chu, Shuo Wang, Chengyan Wang

We test state-of-the-art foundation models for medical image segmentation, including the original SAM, medical SAM and SAT models, to evaluate segmentation efficacy across different demographic groups and identify disparities.

Fairness Image Segmentation +3

Center-Sensitive Kernel Optimization for Efficient On-Device Incremental Learning

no code implementations 13 Jun 2024 Dingwen Zhang, Yan Li, De Cheng, Nannan Wang, Junwei Han

Based on an empirical study on the knowledge intensity of the kernel elements of the neural network, we find that the center kernel is the key for maximizing the knowledge intensity for learning new data, while freezing the other kernel elements would get a good balance on the model's capacity for overcoming catastrophic forgetting.

Incremental Learning

Multi-scale Quaternion CNN and BiGRU with Cross Self-attention Feature Fusion for Fault Diagnosis of Bearing

1 code implementation 25 May 2024 Huanbai Liu, Fanlong Zhang, Yin Tan, Lian Huang, Yan Li, Guoheng Huang, Shenghong Luo, An Zeng

In this work, we propose a novel FD model by integrating multi-scale quaternion convolutional neural network (MQCNN), bidirectional gated recurrent unit (BiGRU), and cross self-attention feature fusion (CSAFF).

Domain Adaptation Fault Detection

Learning Multi-dimensional Human Preference for Text-to-Image Generation

no code implementations CVPR 2024 Sixian Zhang, Bohan Wang, Junqiang Wu, Yan Li, Tingting Gao, Di Zhang, Zhongyuan Wang

Current metrics for text-to-image models typically rely on statistical metrics which inadequately represent the real preference of humans.

Text-to-Image Generation

Learning Coarse-Grained Dynamics on Graph

no code implementations 15 May 2024 Yin Yu, John Harlim, Daning Huang, Yan Li

We consider a Graph Neural Network (GNN) non-Markovian modeling framework to identify coarse-grained dynamical systems on graphs.

2k Graph Neural Network

Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application

no code implementations 7 May 2024 Jian Jia, Yipei Wang, Yan Li, Honggang Chen, Xuehan Bai, Zhaocheng Liu, Jian Liang, Quan Chen, Han Li, Peng Jiang, Kun Gai

Contemporary recommender systems predominantly rely on collaborative filtering techniques, employing ID-embedding to capture latent associations among users and items.

Collaborative Filtering Language Modelling +3

Stochastic Constrained Decentralized Optimization for Machine Learning with Fewer Data Oracles: a Gradient Sliding Approach

no code implementations 3 Apr 2024 Hoang Huy Nguyen, Yan Li, Tuo Zhao

In modern decentralized applications, ensuring communication efficiency and privacy for the users are the key challenges.

A Novel Loss Function-based Support Vector Machine for Binary Classification

no code implementations 25 Mar 2024 Yan Li, Liping Zhang

Previous support vector machines (SVMs), including $0/1$ loss SVM, hinge loss SVM, ramp loss SVM, truncated pinball loss SVM, and others, overlooked the degree of penalty for the correctly classified samples within the margin.

Binary Classification

A Unified and General Framework for Continual Learning

1 code implementation 20 Mar 2024 Zhenyi Wang, Yan Li, Li Shen, Heng Huang

Extensive experiments on CL benchmarks and theoretical analysis demonstrate the effectiveness of the proposed refresh learning.

Continual Learning

SELECTOR: Heterogeneous graph network with convolutional masked autoencoder for multimodal robust prediction of cancer survival

no code implementations 14 Mar 2024 Liangrui Pan, Yijun Peng, Yan Li, Xiang Wang, Wenjuan Liu, Liwen Xu, Qingchun Liang, Shaoliang Peng

To mitigate the impact of missing features within the modality on prediction accuracy, we devised a convolutional masked autoencoder (CMAE) to process the heterogeneous graph post-feature reconstruction.

Survival Prediction

DragAnything: Motion Control for Anything using Entity Representation

1 code implementation 12 Mar 2024 Weijia Wu, Zhuang Li, YuChao Gu, Rui Zhao, Yefei He, David Junhao Zhang, Mike Zheng Shou, Yan Li, Tingting Gao, Di Zhang

We introduce DragAnything, which utilizes an entity representation to achieve motion control for any object in controllable video generation.

Object Video Generation

Continual All-in-One Adverse Weather Removal with Knowledge Replay on a Unified Network Structure

1 code implementation 12 Mar 2024 De Cheng, Yanling Ji, Dong Gong, Yan Li, Nannan Wang, Junwei Han, Dingwen Zhang

It considers the characteristics of the image restoration task with multiple degenerations in continual learning, and the knowledge for different degenerations can be shared and accumulated in the unified network structure.

Continual Learning Image Restoration +2

Divide-Conquer-and-Merge: Memory- and Time-Efficient Holographic Displays

no code implementations 25 Feb 2024 Zhenxing Dong, Jidong Jia, Yan Li, Yuye Ling

Both simulations and experiments were conducted to demonstrate the capabilities of the proposed framework.

16k 8k

Interpretable Short-Term Load Forecasting via Multi-Scale Temporal Decomposition

no code implementations 18 Feb 2024 Yuqi Jiang, Yan Li, Yize Chen

Although deep learning models have achieved strong capabilities in learning the non-linearity of load patterns and high prediction accuracy, the interpretability of typical deep learning models for electricity load forecasting is less studied.

Load Forecasting

A Flying Bird Object Detection Method for Surveillance Video

1 code implementation 8 Jan 2024 Ziwei Sun, Zexi Hua, Hengchao Li, Yan Li

Aiming at the specific characteristics of flying bird objects in surveillance video, such as the typically non-obvious features in single-frame images, small size in most instances, and asymmetric shapes, this paper proposes a Flying Bird Object Detection method for Surveillance Video (FBOD-SV).

Object object-detection +1

DisControlFace: Adding Disentangled Control to Diffusion Autoencoder for One-shot Explicit Facial Image Editing

no code implementations 11 Dec 2023 Haozhe Jia, Yan Li, Hengfei Cui, Di Xu, Yuwang Wang, Tao Yu

We identify the key challenge as the exploration of disentangled conditional control between high-level semantics and explicit parameters (e.g., 3DMM) in the generation process, and accordingly propose a novel diffusion-based editing framework, named DisControlFace.

Paragraph-to-Image Generation with Information-Enriched Diffusion Model

1 code implementation 24 Nov 2023 Weijia Wu, Zhuang Li, Yefei He, Mike Zheng Shou, Chunhua Shen, Lele Cheng, Yan Li, Tingting Gao, Di Zhang, Zhongyuan Wang

In this paper, we introduce an information-enriched diffusion model for the paragraph-to-image generation task, termed ParaDiffusion, which delves into the transference of the extensive semantic comprehension capabilities of large language models to the task of image generation.

Image Generation Language Modelling +1

Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher Learning

1 code implementation 20 Nov 2023 Yan Li, Weiwei Guo, Xue Yang, Ning Liao, Dunyun He, Jiaqi Zhou, Wenxian Yu

The performance of OVD greatly relies on the quality of class-agnostic region proposals and pseudo-labels for novel object categories.

Object object-detection +3

KwaiYiiMath: Technical Report

no code implementations 11 Oct 2023 Jiayi Fu, Lei Lin, Xiaoyang Gao, Pengli Liu, Zhengzong Chen, Zhirui Yang, ShengNan Zhang, Xue Zheng, Yan Li, Yuliang Liu, Xucheng Ye, Yiqiao Liao, Chao Liao, Bin Chen, Chengru Song, Junchen Wan, Zijia Lin, Fuzheng Zhang, Zhongyuan Wang, Di Zhang, Kun Gai

Recent advancements in large language models (LLMs) have demonstrated remarkable abilities in handling a variety of natural language processing (NLP) downstream tasks, even on mathematical tasks requiring multi-step reasoning.

Ranked #91 on Arithmetic Reasoning on GSM8K (using extra training data)

Arithmetic Reasoning GSM8K +1

Efficient Pyramid Channel Attention Network for Pathological Myopia Recognition

1 code implementation 17 Sep 2023 Xiaoqing Zhang, Jilu Zhao, Yan Li, Hao Wu, Xiangtian Zhou, Jiang Liu

Moreover, motivated by the recent pretraining-and-finetuning paradigm, we attempt to adapt pre-trained natural image models for PM recognition by freezing them and treating the EPCA and other attention modules as adapters.

Cross-Domain Product Representation Learning for Rich-Content E-Commerce

1 code implementation ICCV 2023 Xuehan Bai, Yan Li, Yanhua Cheng, Wenjie Yang, Quan Chen, Han Li

It is the first dataset to cover product pages, short videos, and live streams simultaneously, providing the basis for establishing a unified product representation across different media domains.

Representation Learning

Cross-view Semantic Alignment for Livestreaming Product Recognition

1 code implementation ICCV 2023 Wenjie Yang, Yiyi Chen, Yan Li, Yanhua Cheng, Xudong Liu, Quan Chen, Han Li

Moreover, a cRoss-vIew semantiC alignmEnt (RICE) model is proposed to learn discriminative instance features from the image and video views of the products.

Contrastive Learning Diversity

First-order Policy Optimization for Robust Policy Evaluation

no code implementations 29 Jul 2023 Yan Li, Guanghui Lan

We adopt a policy optimization viewpoint towards policy evaluation for robust Markov decision processes with $\mathrm{s}$-rectangular ambiguity sets.

One for Multiple: Physics-informed Synthetic Data Boosts Generalizable Deep Learning for Fast MRI Reconstruction

1 code implementation 25 Jul 2023 Zi Wang, Xiaotong Yu, Chengyan Wang, Weibo Chen, Jiazheng Wang, Ying-Hua Chu, Hongwei Sun, Rushuai Li, Peiyong Li, Fan Yang, Haiwei Han, Taishan Kang, Jianzhong Lin, Chen Yang, Shufu Chang, Zhang Shi, Sha Hua, Yan Li, Juan Hu, Liuhong Zhu, Jianjun Zhou, Meijing Lin, Jiefeng Guo, Congbo Cai, Zhong Chen, Di Guo, Guang Yang, Xiaobo Qu

We demonstrate that training DL models on synthetic data, coupled with enhanced learning techniques, yields in vivo MRI reconstructions comparable to or surpassing those of models trained on matched realistic datasets, reducing the reliance on real-world MRI data by up to 96%.

Medical Diagnosis MRI Reconstruction

Anchor Free remote sensing detector based on solving discrete polar coordinate equation

no code implementations 21 Mar 2023 Linfeng Shi, Yan Li, Xi Zhu

Finally, referring to the calculation idea of horizontal IoU, we design a rotating IoU based on the split polar coordinate plane, namely JIoU, which is expressed as the intersection ratio following discretization of the inner ellipse of the rotating bounding box, to solve the correlation between angle and side length in the regression process of the rotating bounding box.

Object object-detection +2

Sequential three-way decisions with a single hidden layer feedforward neural network

1 code implementation 14 Mar 2023 Youxi Wu, Shuhui Cheng, Yan Li, Rongjie Lv, Fan Min

The experimental results verify that STWD-SFNN has a more compact network on structured datasets than other SFNN models, and has better generalization performance than the competitive models.

Policy Mirror Descent Inherently Explores Action Space

no code implementations 8 Mar 2023 Yan Li, Guanghui Lan

SPMD with the second evaluation operator, namely truncated on-policy Monte Carlo (TOMC), attains an $\tilde{\mathcal{O}}(\mathcal{H}_{\mathcal{D}}/\epsilon^2)$ sample complexity, where $\mathcal{H}_{\mathcal{D}}$ mildly depends on the effective horizon and the size of the action space with a properly chosen Bregman divergence (e.g., Tsallis divergence).

Efficient Exploration General Reinforcement Learning +1

Exploring the Limits of ChatGPT for Query or Aspect-based Text Summarization

no code implementations 16 Feb 2023 Xianjun Yang, Yan Li, Xinlu Zhang, Haifeng Chen, Wei Cheng

Text summarization has been a crucial problem in natural language processing (NLP) for several decades.

Abstractive Text Summarization

Sentiment analysis and opinion mining on educational data: A survey

no code implementations 8 Feb 2023 Thanveer Shaik, Xiaohui Tao, Christopher Dann, Haoran Xie, Yan Li, Linda Galligan

In the education sector, opinion mining is used to listen to student opinions and enhance their learning-teaching practices pedagogically.

Decision Making Negation +4

A Review of the Trends and Challenges in Adopting Natural Language Processing Methods for Education Feedback Analysis

no code implementations 20 Jan 2023 Thanveer Shaik, Xiaohui Tao, Yan Li, Christopher Dann, Jacquie Mcdonald, Petrea Redmond, Linda Galligan

Research community approaches to extract the semantic meaning of emoticons and special characters in feedback which conveys user opinion and challenges in adopting NLP in education are explored.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +2

Chaos to Order: A Label Propagation Perspective on Source-Free Domain Adaptation

no code implementations 20 Jan 2023 Chunwei Wu, Guitao Cao, Yan Li, Xidong Xi, Wenming Cao, Hong Wang

Inspired by this insight, we present Chaos to Order (CtO), a novel approach for SFDA that strives to constrain semantic credibility and propagate label information among target subpopulations.

Clustering Source-Free Domain Adaptation

Eco-PiNN: A Physics-informed Neural Network for Eco-toll Estimation

1 code implementation 13 Jan 2023 Yan Li, Mingzhou Yang, Matthew Eagon, Majid Farhadloo, Yiqun Xie, William F. Northrop, Shashi Shekhar

The eco-toll estimation problem quantifies the expected environmental cost (e.g., energy consumption, exhaust emissions) for a vehicle to travel along a path.

Decoder

Generative Time Series Forecasting with Diffusion, Denoise, and Disentanglement

1 code implementation 8 Jan 2023 Yan Li, Xinjiang Lu, Yaqing Wang, Dejing Dou

In this work, we propose to address the time series forecasting problem with generative modeling and propose a bidirectional variational auto-encoder (BVAE) equipped with diffusion, denoise, and disentanglement, namely D3VAE.

Denoising Disentanglement +2

Towards Long-Term Time-Series Forecasting: Feature, Pattern, and Distribution

1 code implementation 5 Jan 2023 Yan Li, Xinjiang Lu, Haoyi Xiong, Jian Tang, Jiantao Su, Bo Jin, Dejing Dou

Long-term time-series forecasting (LTTF) has become a pressing demand in many applications, such as wind power supply planning.

Decoder Time Series +2

First-order Policy Optimization for Robust Markov Decision Process

no code implementations 21 Sep 2022 Yan Li, Guanghui Lan, Tuo Zhao

We consider the problem of solving a robust Markov decision process (MDP), which involves a set of discounted, finite state, finite action space MDPs with uncertain transition kernels.

Deep Forest with Hashing Screening and Window Screening

no code implementations 25 Jul 2022 Pengfei Ma, Youxi Wu, Yan Li, Lei Guo, He Jiang, Xingquan Zhu, Xindong Wu

To screen out redundant feature vectors, we introduce a hashing screening mechanism for multi-grained scanning and propose a model called HW-Forest which adopts two strategies, hashing screening and window screening.

Modularized Bilinear Koopman Operator for Modeling and Predicting Transients of Microgrids

no code implementations 6 May 2022 Xinyuan Jiang, Yan Li, Daning Huang

Modularized Koopman Bilinear Form (M-KBF) is presented to model and predict the transient dynamics of microgrids in the presence of disturbances.

PIDGeuN: Graph Neural Network-Enabled Transient Dynamics Prediction of Networked Microgrids Through Full-Field Measurement

no code implementations 18 Apr 2022 Yin Yu, Xinyuan Jiang, Daning Huang, Yan Li

A Physics-Informed Dynamic Graph Neural Network (PIDGeuN) is presented to accurately, efficiently and robustly predict the nonlinear transient dynamics of microgrids in the presence of disturbances.

Graph Neural Network

Benchmarking Domain Generalization on EEG-based Emotion Recognition

no code implementations 18 Apr 2022 Yan Li, Hao Chen, Jake Zhao, Haolan Zhang, Jinpeng Li

Specifically, numerous domain adaptation (DA) algorithms have been exploited in the past five years to enhance the generalization of emotion recognition models across subjects.

Benchmarking Domain Generalization +2

Demand-driven train timetabling for air and intercity high-speed rail synchronization service

no code implementations Transportation Letters 2022 Yangsheng Jiang, Shuiwang Chen, Wenyao An, Lu Hu, Yan Li, Jun Liu

We also extend the train timetabling model by considering the synchronization events of flight-train and the train circulation plan.

Robust Single Image Dehazing Based on Consistent and Contrast-Assisted Reconstruction

no code implementations 29 Mar 2022 De Cheng, Yan Li, Dingwen Zhang, Nannan Wang, Xinbo Gao, Jiande Sun

To properly address this problem, we propose a novel density-variational learning framework to improve the robustness of the image dehazing model assisted by a variety of negative hazy images, to better deal with various complex hazy scenarios.

Image Dehazing Single Image Dehazing

DSRRTracker: Dynamic Search Region Refinement for Attention-based Siamese Multi-Object Tracking

no code implementations 21 Mar 2022 JiaXu Wan, Hong Zhang, Jin Zhang, Yuan Ding, Yifan Yang, Yan Li, Xuliang Li

Many multi-object tracking (MOT) methods follow the framework of "tracking by detection", which associates the target objects-of-interest based on the detection results.

Multi-Object Tracking

Noise Regularizes Over-parameterized Rank One Matrix Recovery, Provably

no code implementations 7 Feb 2022 Tianyi Liu, Yan Li, Enlu Zhou, Tuo Zhao

We investigate the role of noise in optimization algorithms for learning over-parameterized models.

Homotopic Policy Mirror Descent: Policy Convergence, Implicit Regularization, and Improved Sample Complexity

no code implementations 24 Jan 2022 Yan Li, Guanghui Lan, Tuo Zhao

We first establish the global linear convergence of HPMD instantiated with Kullback-Leibler divergence, for both the optimality gap, and a weighted distance to the set of optimal policies.

Policy Gradient Methods

Block Policy Mirror Descent

no code implementations 15 Jan 2022 Guanghui Lan, Yan Li, Tuo Zhao

Despite the nonconvex nature of the problem and a partial update rule, we provide a unified analysis for several sampling schemes, and show that BPMD achieves fast linear convergence to the global optimality.

reinforcement-learning Reinforcement Learning (RL)

OPP-Miner: Order-preserving sequential pattern mining

no code implementations 9 Jan 2022 Youxi Wu, Qian Hu, Yan Li, Lei Guo, Xingquan Zhu, Xindong Wu

To discover patterns, existing methods often convert time series data into another form, such as nominal/symbolic format, to reduce dimensionality, which inevitably deviates the data values.

Sequential Pattern Mining Time Series +1

Deep Learning Assisted End-to-End Synthesis of mm-Wave Passive Networks with 3D EM Structures: A Study on A Transformer-Based Matching Network

no code implementations 6 Jan 2022 Siawpeng Er, Edward Liu, Minshuo Chen, Yan Li, Yuqi Liu, Tuo Zhao, Hua Wang

This paper presents a deep learning assisted synthesis approach for direct end-to-end generation of RF/mm-wave passive matching network with 3D EM structures.

DBC-Forest: Deep forest with binning confidence screening

no code implementations 25 Dec 2021 Pengfei Ma, Youxi Wu, Yan Li, Lei Guo, Zhao Li

As a deep learning model, deep confidence screening forest (gcForestcs) has achieved great success in various applications.

Pessimism Meets Invariance: Provably Efficient Offline Mean-Field Multi-Agent RL

1 code implementation NeurIPS 2021 Minshuo Chen, Yan Li, Ethan Wang, Zhuoran Yang, Zhaoran Wang, Tuo Zhao

Theoretically, under a weak coverage assumption that the experience dataset contains enough information about the optimal policy, we prove that for an episodic mean-field MDP with a horizon $H$ and $N$ training trajectories, SAFARI attains a sub-optimality gap of $\mathcal{O}(H^2d_{\rm eff} /\sqrt{N})$, where $d_{\rm eff}$ is the effective dimension of the function class for parameterizing the value function, but independent of the number of agents.

Multi-agent Reinforcement Learning

Frequency-aware SGD for Efficient Embedding Learning with Provable Benefits

no code implementations ICLR 2022 Yan Li, Dhruv Choudhary, Xiaohan Wei, Baichuan Yuan, Bhargav Bhushanam, Tuo Zhao, Guanghui Lan

We show that incorporating frequency information of tokens in the embedding learning problems leads to provably efficient algorithms, and demonstrate that common adaptive algorithms implicitly exploit the frequency information to a large extent.

Language Modelling Recommendation Systems

A Principled Permutation Invariant Approach to Mean-Field Multi-Agent Reinforcement Learning

no code implementations 29 Sep 2021 Yan Li, Lingxiao Wang, Jiachen Yang, Ethan Wang, Zhaoran Wang, Tuo Zhao, Hongyuan Zha

To exploit the permutation invariance therein, we propose the mean-field proximal policy optimization (MF-PPO) algorithm, at the core of which is a permutation-invariant actor-critic neural architecture.

Inductive Bias Multi-agent Reinforcement Learning +2

Single Image Dehazing with An Independent Detail-Recovery Network

1 code implementation 22 Sep 2021 Yan Li, De Cheng, Jiande Sun, Dingwen Zhang, Nannan Wang, Xinbo Gao

In this paper, we propose a single image dehazing method with an independent Detail Recovery Network (DRN), which considers capturing the details from the input image over a separate network and then integrates them into a coarse dehazed image.

Image Dehazing Single Image Dehazing

MutualGraphNet: A novel model for motor imagery classification

no code implementations 2 Sep 2021 Yan Li, Ning Zhong, David Taniar, Haolan Zhang

Experiments conducted on a motor imagery EEG dataset compare our model with the current state-of-the-art approaches; the results suggest that MutualGraphNet is robust enough to learn interpretable features and outperforms the current state-of-the-art methods.

Classification EEG +2

Implicit Regularization of Bregman Proximal Point Algorithm and Mirror Descent on Separable Data

no code implementations 15 Aug 2021 Yan Li, Caleb Ju, Ethan X. Fang, Tuo Zhao

For any BPPA instantiated with a fixed Bregman divergence, we provide a lower bound of the margin obtained by BPPA with respect to an arbitrarily chosen norm.

Permutation Invariant Policy Optimization for Mean-Field Multi-Agent Reinforcement Learning: A Principled Approach

no code implementations 18 May 2021 Yan Li, Lingxiao Wang, Jiachen Yang, Ethan Wang, Zhaoran Wang, Tuo Zhao, Hongyuan Zha

To exploit the permutation invariance therein, we propose the mean-field proximal policy optimization (MF-PPO) algorithm, at the core of which is a permutation-invariant actor-critic neural architecture.

Inductive Bias Multi-agent Reinforcement Learning

Vehicle Emissions Prediction with Physics-Aware AI Models: Preliminary Results

no code implementations 2 May 2021 Harish Panneer Selvam, Yan Li, Pengyue Wang, William F. Northrop, Shashi Shekhar

Given an on-board diagnostics (OBD) dataset and a physics-based emissions prediction model, this paper aims to develop an accurate and computationally efficient AI (Artificial Intelligence) method that predicts vehicle emissions.

Statistically-Robust Clustering Techniques for Mapping Spatial Hotspots: A Survey

2 code implementations 22 Mar 2021 Yiqun Xie, Shashi Shekhar, Yan Li

Mapping of spatial hotspots, i.e., regions with significantly higher rates of generating cases of certain events (e.g., disease or crime cases), is an important task in diverse societal domains, including public health, public safety, transportation, agriculture, environmental science, etc.

Clustering

Noisy Gradient Descent Converges to Flat Minima for Nonconvex Matrix Factorization

no code implementations 24 Feb 2021 Tianyi Liu, Yan Li, Song Wei, Enlu Zhou, Tuo Zhao

Considerable empirical evidence has corroborated the importance of noise in nonconvex optimization problems.

IFoodCloud: A Platform for Real-time Sentiment Analysis of Public Opinion about Food Safety in China

no code implementations 17 Feb 2021 Dachuan Zhang, Haoyang Zhang, Zhisheng Wei, Yan Li, Zhiheng Mao, Chunmeng He, Haorui Ma, Xin Zeng, Xiaoling Xie, Xingran Kou, Bingwen Zhang

The Internet contains a wealth of public opinion on food safety, including views on food adulteration, food-borne diseases, agricultural pollution, irregular food distribution, and food production issues.

Sentiment Analysis Sentiment Classification

Estimates of the early EM emission from compact binary mergers

no code implementations 9 Feb 2021 Yan Li, Rong-Feng Shen

We estimate their luminosities and time scales as functions of the chirp mass which is the most readily constrained parameter from the gravitational wave detections of these events.

High Energy Astrophysical Phenomena

MA-Net: A Multi-Scale Attention Network for Liver and Tumor Segmentation

3 code implementations IEEE Access 2020 Tongle Fan, Guanglei Wang, Yan Li, Hongrui Wang

In recent years, a large number of variants of U-Net based on multi-scale feature fusion have been proposed to improve the segmentation performance for medical image segmentation.

Image Segmentation Medical Image Segmentation +2

Residual Network Based Direct Synthesis of EM Structures: A Study on One-to-One Transformers

no code implementations 25 Aug 2020 David Munzer, Siawpeng Er, Minshuo Chen, Yan Li, Naga S. Mannem, Tuo Zhao, Hua Wang

We propose using machine learning models for the direct synthesis of on-chip electromagnetic (EM) passive structures to enable rapid or even automated designs and optimizations of RF/mm-Wave circuits.

BIG-bench Machine Learning

A Physics Model-Guided Online Bayesian Framework for Energy Management of Extended Range Electric Delivery Vehicles

no code implementations 1 Jun 2020 Pengyue Wang, Yan Li, Shashi Shekhar, William F. Northrop

A physics model-guided online Bayesian framework is described and validated on a large number of in-use driving samples of EREVs used for last-mile package delivery.

energy management Management

Adversarial Attacks on Reinforcement Learning based Energy Management Systems of Extended Range Electric Delivery Vehicles

no code implementations 1 Jun 2020 Pengyue Wang, Yan Li, Shashi Shekhar, William F. Northrop

Adversarial examples were first investigated in the area of computer vision: by adding some carefully designed "noise" to the original input image, the perturbed image, which cannot be distinguished from the original one by humans, can easily fool a well-trained classifier.

energy management Management

How to Retrain Recommender System? A Sequential Meta-Learning Method

1 code implementation 27 May 2020 Yang Zhang, Fuli Feng, Chenxu Wang, Xiangnan He, Meng Wang, Yan Li, Yongdong Zhang

Nevertheless, normal training on new data only may easily cause overfitting and forgetting issues, since the new data is of a smaller scale and contains less information on long-term user preference.

Meta-Learning Recommendation Systems

Implicit Bias of Gradient Descent based Adversarial Training on Separable Data

no code implementations ICLR 2020 Yan Li, Ethan X. Fang, Huan Xu, Tuo Zhao

Specifically, we show that for any fixed iteration $T$, when the adversarial perturbation during training has proper bounded L2 norm, the classifier learned by gradient descent based adversarial training converges in direction to the maximum L2 norm margin classifier at the rate of $O(1/\sqrt{T})$, significantly faster than the rate $O(1/\log T)$ of training with clean data.

Binary Classification

Pursuing Sources of Heterogeneity in Modeling Clustered Population

no code implementations10 Mar 2020 Yan Li, Chun Yu, Yize Zhao, Robert H. Aseltine, Weixin Yao, Kun Chen

We clarify the concepts of the source of heterogeneity that account for potential scale differences of the clusters and propose a regularized finite mixture effects regression to achieve heterogeneity pursuit and feature selection simultaneously.

feature selection regression

Deep Learning-based End-to-end Diagnosis System for Avascular Necrosis of Femoral Head

no code implementations12 Feb 2020 Yang Li, Yan Li, Hua Tian

To the best of our knowledge, this study is the first research on the prospective use of a deep learning-based diagnosis system for AVNFH by conducting two pilot studies representing real-world application scenarios.

Decision Making Head Detection

Bilinear Graph Neural Network with Neighbor Interactions

1 code implementation10 Feb 2020 Hongmin Zhu, Fuli Feng, Xiangnan He, Xiang Wang, Yan Li, Kai Zheng, Yongdong Zhang

We term this framework Bilinear Graph Neural Network (BGNN), which improves GNN representation ability with bilinear interactions between neighbor nodes.

General Classification Graph Neural Network +1

LightGCN: Simplifying and Powering Graph Convolution Network for Recommendation

17 code implementations6 Feb 2020 Xiangnan He, Kuan Deng, Xiang Wang, Yan Li, Yongdong Zhang, Meng Wang

We propose a new model named LightGCN, including only the most essential component in GCN -- neighborhood aggregation -- for collaborative filtering.

Collaborative Filtering Graph Classification +1

Towards Understanding the Importance of Noise in Training Neural Networks

no code implementations7 Sep 2019 Mo Zhou, Tianyi Liu, Yan Li, Dachao Lin, Enlu Zhou, Tuo Zhao

Ample empirical evidence has corroborated that noise plays a crucial role in effective and efficient training of neural networks.

Inductive Bias of Gradient Descent based Adversarial Training on Separable Data

no code implementations7 Jun 2019 Yan Li, Ethan X. Fang, Huan Xu, Tuo Zhao

Specifically, we show that when the adversarial perturbation during training has bounded $\ell_2$-norm, the classifier learned by gradient descent based adversarial training converges in direction to the maximum $\ell_2$-norm margin classifier at the rate of $\tilde{\mathcal{O}}(1/\sqrt{T})$, significantly faster than the rate $\mathcal{O}(1/\log T)$ of training with clean data.

Binary Classification Inductive Bias

Similarity Grouping-Guided Neural Network Modeling for Maritime Time Series Prediction

no code implementations13 May 2019 Yan Li, Ryan Wen Liu, Zhao Liu, Jingxian Liu

Reliable and accurate prediction of time series plays a crucial role in the maritime industry, in areas such as economic investment, transportation planning, and port planning and design.

Time Series Time Series Prediction

Strain engineering of epitaxial oxide heterostructures beyond substrate limitations

no code implementations3 May 2019 Xiong Deng, Chao Chen, Deyang Chen, Xiangbin Cai, Xiaozhe Yin, Chao Xu, Fei Sun, Caiwen Li, Yan Li, Han Xu, Mao Ye, Guo Tian, Zhen Fan, Zhipeng Hou, Minghui Qin, Yu Chen, Zhenlin Luo, Xubing Lu, Guofu Zhou, Lang Chen, Ning Wang, Ye Zhu, Xingsen Gao, Jun-Ming Liu

The limitation of commercially available single-crystal substrates and the lack of continuous strain tunability preclude the ability to take full advantage of strain engineering for further exploring novel properties and exhaustively studying fundamental physics in complex oxides.

Materials Science

Imitating Targets from all sides: An Unsupervised Transfer Learning method for Person Re-identification

no code implementations10 Apr 2019 Jiajie Tian, Zhu Teng, Rui Li, Yan Li, Baopeng Zhang, Jianping Fan

Person re-identification (Re-ID) models usually show limited performance when they are trained on one dataset and tested on another, due to the inter-dataset bias (e.g., completely different identities and backgrounds) and the intra-dataset difference (e.g., camera invariance).

Person Re-Identification Transfer Learning

Transductive Zero-Shot Learning with Visual Structure Constraint

1 code implementation NeurIPS 2019 Zi-Yu Wan, Dong-Dong Chen, Yan Li, Xingguang Yan, Junge Zhang, Yizhou Yu, Jing Liao

Based on the observation that visual features of test instances can be separated into different clusters, we propose a new visual structure constraint on class centers for transductive ZSL, to improve the generality of the projection function (i.e., alleviate the above domain shift problem).

Zero-Shot Learning

Improving Gated Recurrent Unit Based Acoustic Modeling with Batch Normalization and Enlarged Context

no code implementations26 Nov 2018 Jie Li, Yahui Shan, Xiaorui Wang, Yan Li

The use of future contextual information is typically shown to be helpful for acoustic modeling.

Object detection and tracking benchmark in industry based on improved correlation filter

no code implementations11 Jun 2018 Shangzhen Luan, Yan Li, Xiaodi Wang, Baochang Zhang

Real-time object detection and tracking have been shown to be the basis of intelligent production for Industry 4.0 applications.

Diversity object-detection +1

Communication-Efficient Projection-Free Algorithm for Distributed Optimization

no code implementations20 May 2018 Yan Li, Chao Qu, Huan Xu

We demonstrate this advantage and show that the linear oracle complexity can be reduced to almost the same order of magnitude as the communication complexity, when the feasible set is polyhedral.

Distributed Optimization Matrix Completion

Projection-Free Algorithms in Statistical Estimation

no code implementations20 May 2018 Yan Li, Chao Qu, Huan Xu

Recent work has reduced the gradient evaluation complexity of the FW algorithm to $\log(\frac{1}{\epsilon})$ for smooth and strongly convex objectives.

Gated Recurrent Unit Based Acoustic Modeling with Future Context

no code implementations18 May 2018 Jie Li, Xiaorui Wang, Yuan-Yuan Zhao, Yan Li

The use of future contextual information is typically shown to be helpful for acoustic modeling.

Discriminative Learning of Latent Features for Zero-Shot Recognition

1 code implementation CVPR 2018 Yan Li, Junge Zhang, Jian-Guo Zhang, Kaiqi Huang

In this work, we review existing methods and demonstrate the necessity of learning discriminative representations for both visual and semantic instances of ZSL.

Zero-Shot Learning

Mixed Supervised Object Detection with Robust Objectness Transfer

no code implementations27 Feb 2018 Yan Li, Junge Zhang, Kaiqi Huang, Jian-Guo Zhang

Different from previous MSD methods that directly transfer the pre-trained object detectors from existing categories to new categories, we propose a more reasonable and robust objectness transfer approach for MSD.

Multiple Instance Learning Object +2

A Framework in CRM Customer Lifecycle: Identify Downward Trend and Potential Issues Detection

no code implementations25 Feb 2018 Kun Hu, Zhe Li, Ying Liu, Luyin Cheng, Qi Yang, Yan Li

In the first prediction part, we focus on predicting the downward trend, which is an earlier stage of the customer lifecycle compared to churn.

Causal Inference Management +1

Fast Global Convergence via Landscape of Empirical Loss

no code implementations13 Feb 2018 Chao Qu, Yan Li, Huan Xu

While optimizing convex objective (loss) functions has been a powerhouse for machine learning for at least two decades, non-convex loss functions have recently attracted fast-growing interest, due to many desirable properties such as superior robustness and classification accuracy, compared with their convex counterparts.

General Classification

Machine Learning for Survival Analysis: A Survey

no code implementations15 Aug 2017 Ping Wang, Yan Li, Chandan K. Reddy

We hope that this paper will provide a more thorough understanding of the recent advances in survival analysis and offer some guidelines on applying these approaches to solve new problems that arise in applications with censored data.

BIG-bench Machine Learning Survival Analysis

Solving Multi-Objective MDP with Lexicographic Preference: An application to stochastic planning with multiple quantile objective

no code implementations10 May 2017 Yan Li, Zhaohan Sun

In most common settings of a Markov Decision Process (MDP), an agent evaluates a policy based on the expectation of the (discounted) sum of rewards.

Autonomous Driving

SAGA and Restricted Strong Convexity

no code implementations19 Feb 2017 Chao Qu, Yan Li, Huan Xu

SAGA is a fast incremental gradient method for the finite sum problem, and its effectiveness has been tested on a vast number of applications.

regression

Linear Convergence of SVRG in Statistical Estimation

no code implementations7 Nov 2016 Chao Qu, Yan Li, Huan Xu

SVRG and its variants are among the state-of-the-art optimization algorithms for large-scale machine learning problems.

Skipping Word: A Character-Sequential Representation based Framework for Question Answering

no code implementations2 Sep 2016 Lingxun Meng, Yan Li, Mengyi Liu, Peng Shu

Recent works using artificial neural networks based on word distributed representation greatly boost the performance of various natural language learning tasks, especially question answering.

Answer Selection

M$^2$S-Net: Multi-Modal Similarity Metric Learning based Deep Convolutional Network for Answer Selection

1 code implementation19 Apr 2016 Lingxun Meng, Yan Li

Recent works using artificial neural networks based on distributed word representation greatly boost performance on various natural language processing tasks, especially the answer selection problem.

Answer Selection Metric Learning +1

Audio Recording Device Identification Based on Deep Learning

no code implementations18 Feb 2016 Simeng Qi, Zheng Huang, Yan Li, Shaopei Shi

The identification results show that extracting a feature vector from the noise of each device and identifying the devices with deep learning techniques is viable and performs well.

Speech Enhancement

Two Birds, One Stone: Jointly Learning Binary Code for Large-Scale Face Image Retrieval and Attributes Prediction

no code implementations ICCV 2015 Yan Li, Ruiping Wang, Haomiao Liu, Huajie Jiang, Shiguang Shan, Xilin Chen

In this way, the learned binary codes can be applied to not only fine-grained face image retrieval, but also facial attributes prediction, which is the very innovation of this work, just like killing two birds with one stone.

Face Image Retrieval Retrieval

A Knowledge Gradient Policy for Sequencing Experiments to Identify the Structure of RNA Molecules Using a Sparse Additive Belief Model

no code implementations6 Aug 2015 Yan Li, Kristofer G. Reyes, Jorge Vazquez-Anderson, Yingfei Wang, Lydia M. Contreras, Warren B. Powell

We present a sparse knowledge gradient (SpKG) algorithm for adaptively selecting the targeted regions within a large RNA molecule to identify which regions are most amenable to interactions with other molecules.

Face Video Retrieval With Image Query via Hashing Across Euclidean Space and Riemannian Manifold

no code implementations CVPR 2015 Yan Li, Ruiping Wang, Zhiwu Huang, Shiguang Shan, Xilin Chen

Retrieving videos of a specific person given his/her face image as query becomes more and more appealing for applications like smart movie fast-forwards and suspect searching.

Retrieval Video Retrieval

The Knowledge Gradient Policy Using A Sparse Additive Belief Model

no code implementations18 Mar 2015 Yan Li, Han Liu, Warren Powell

We propose a sequential learning policy for noisy discrete global optimization and ranking and selection (R&S) problems with high dimensional sparse belief functions, where there are hundreds or even thousands of features, but only a small portion of these features contain explanatory power.

Comment on "Clustering by fast search and find of density peaks"

no code implementations18 Jan 2015 Shuliang Wang, Dakui Wang, Caoyuan Li, Yan Li

For any data set to be clustered, the most reasonable value of d_c can be objectively calculated from the data set by using our proposed method.

Clustering

Sparse Additive Model using Symmetric Nonnegative Definite Smoothers

no code implementations8 Sep 2014 Yan Li

We introduce a new algorithm, called adaptive sparse backfitting algorithm, for solving high dimensional Sparse Additive Model (SpAM) utilizing symmetric, non-negative definite smoothers.

Variable Selection
