Search Results for author: Yang Li

Found 436 papers, 124 papers with code

Emotion Inference in Multi-Turn Conversations with Addressee-Aware Module and Ensemble Strategy

no code implementations EMNLP 2021 Dayu Li, Xiaodan Zhu, Yang Li, Suge Wang, Deyu Li, Jian Liao, Jianxing Zheng

Emotion inference in multi-turn conversations aims to predict the participant’s emotion in the next upcoming turn without knowing the participant’s response yet, and is a necessary step for applications such as dialogue planning.

ACFlow: Flow Models for Arbitrary Conditional Likelihoods

1 code implementation ICML 2020 Yang Li, Shoaib Akbar, Junier Oliva

However, a majority of generative modeling approaches are focused solely on the joint distribution $p(x)$ and utilize models where it is intractable to obtain the conditional distribution of some arbitrary subset of features $x_u$ given the rest of the observed covariates $x_o$: $p(x_u \mid x_o)$.

Imputation

LLM-Enhanced Causal Discovery in Temporal Domain from Interventional Data

no code implementations23 Apr 2024 Peiwen Li, Xin Wang, Zeyang Zhang, Yuan Meng, Fang Shen, Yue Li, Jialong Wang, Yang Li, Wenweu Zhu

In the field of Artificial Intelligence for Information Technology Operations, causal discovery is pivotal for operation and maintenance of graph construction, facilitating downstream industrial tasks such as root cause analysis.

Causal Discovery graph construction

TextSquare: Scaling up Text-Centric Visual Instruction Tuning

no code implementations19 Apr 2024 Jingqun Tang, Chunhui Lin, Zhen Zhao, Shu Wei, Binghong Wu, Qi Liu, Hao Feng, Yang Li, Siqi Wang, Lei Liao, Wei Shi, Yuliang Liu, Hao liu, Yuan Xie, Xiang Bai, Can Huang

Text-centric visual question answering (VQA) has made great strides with the development of Multimodal Large Language Models (MLLMs), yet open-source models still fall short of leading models like GPT4V and Gemini, partly due to a lack of extensive, high-quality instruction tuning data.

Hallucination Hallucination Evaluation +2

Counterfactual Explanations for Face Forgery Detection via Adversarial Removal of Artifacts

2 code implementations12 Apr 2024 Yang Li, Songlin Yang, Wei Wang, Ziwen He, Bo Peng, Jing Dong

We verify the effectiveness of the proposed explanations from two aspects: (1) Counterfactual Trace Visualization: the enhanced forgery images are useful to reveal artifacts by visually contrasting the original images and two different visualization methods; (2) Transferable Adversarial Attacks: the adversarial forgery images generated by attacking the detection model are able to mislead other detection models, implying the removed artifacts are general.

Adversarial Attack counterfactual

PagPassGPT: Pattern Guided Password Guessing via Generative Pretrained Transformer

1 code implementation7 Apr 2024 Xingyu Su, Xiaojie Zhu, Yang Li, Yong Li, Chi Chen, Paulo Esteves-Veríssimo

Amidst the surge in deep learning-based password guessing models, challenges of generating high-quality passwords and reducing duplicate passwords persist.

TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios

1 code implementation28 Mar 2024 Xiaokang Zhang, Jing Zhang, Zeyao Ma, Yang Li, Bohan Zhang, Guanlin Li, Zijun Yao, Kangli Xu, Jinchang Zhou, Daniel Zhang-li, Jifan Yu, Shu Zhao, Juanzi Li, Jie Tang

We introduce TableLLM, a robust large language model (LLM) with 13 billion parameters, purpose-built for proficiently handling tabular data manipulation tasks, whether they are embedded within documents or spreadsheets, catering to real-world office scenarios.

Language Modelling Large Language Model

Frankenstein: Generating Semantic-Compositional 3D Scenes in One Tri-Plane

no code implementations24 Mar 2024 Han Yan, Yang Li, Zhennan Wu, Shenzhou Chen, Weixuan Sun, Taizhang Shang, Weizhe Liu, Tian Chen, Xiaqiang Dai, Chao Ma, Hongdong Li, Pan Ji

We present Frankenstein, a diffusion-based framework that can generate semantic-compositional 3D scenes in a single pass.

Denoising

FusionINN: Invertible Image Fusion for Brain Tumor Monitoring

1 code implementation23 Mar 2024 Nishant Kumar, Ziyan Tao, Jaikirat Singh, Yang Li, Peiwen Sun, Binghui Zhao, Stefan Gumhold

Image fusion typically employs non-invertible neural networks to merge multiple source images into a single fused image.

Denoising Multi-Exposure Image Fusion

OUCopula: Bi-Channel Multi-Label Copula-Enhanced Adapter-Based CNN for Myopia Screening Based on OU-UWF Images

no code implementations18 Mar 2024 Yang Li, Qiuyi Huang, Chong Zhong, Danjuan Yang, Meiyan Li, A. H. Welsh, Aiyi Liu, Bo Fu, Catherien C. Liu, Xingtao Zhou

Inspired by the complex relationships between OU and the high correlation between the (continuous) outcome labels (Spherical Equivalent and Axial Length), we propose a framework of copula-enhanced adapter convolutional neural network (CNN) learning with OU UWF fundus images (OUCopula) for joint prediction of multiple clinical scores.

Matrix-Transformation Based Low-Rank Adaptation (MTLoRA): A Brain-Inspired Method for Parameter-Efficient Fine-Tuning

no code implementations12 Mar 2024 Yao Liang, Yuwei Wang, Yang Li, Yi Zeng

In response to this, inspired by the idea that the functions of the brain are shaped by its geometric structure, this paper integrates this idea into LoRA technology and proposes a new matrix transformation-based reparameterization method for efficient fine-tuning, named Matrix-Transformation based Low-Rank Adaptation (MTLoRA).

Natural Language Understanding Text Generation

Contrastive Continual Learning with Importance Sampling and Prototype-Instance Relation Distillation

1 code implementation7 Mar 2024 Jiyong Li, Dilshod Azizov, Yang Li, Shangsong Liang

Recently, because of the high-quality representations of contrastive learning methods, rehearsal-based contrastive continual learning has been proposed to explore how to continually learn transferable representation embeddings to avoid the catastrophic forgetting issue in traditional continual settings.

Continual Learning Contrastive Learning +2

Electrical Load Forecasting Model Using Hybrid LSTM Neural Networks with Online Correction

no code implementations6 Mar 2024 Nan Lu, Quan Ouyang, Yang Li, Changfu Zou

Accurate electrical load forecasting is of great importance for the efficient operation and control of modern power systems.

Load Forecasting Time Series

A Unified Model for Active Battery Equalization Systems

no code implementations6 Mar 2024 Quan Ouyang, Nourallah Ghaeminezhad, Yang Li, Torsten Wik, Changfu Zou

Lithium-ion battery packs demand effective active equalization systems to enhance their usable capacity and lifetime.

Pyramid Feature Attention Network for Monocular Depth Prediction

no code implementations3 Mar 2024 Yifang Xu, Chenglei Peng, Ming Li, Yang Li, Sidan Du

Deep convolutional neural networks (DCNNs) have achieved great success in monocular depth estimation (MDE).

Depth Prediction Monocular Depth Estimation

Enhancing Continuous Domain Adaptation with Multi-Path Transfer Curriculum

no code implementations26 Feb 2024 Hanbing Liu, Jingge Wang, Xuan Zhang, Ye Guo, Yang Li

Specifically, we construct a transfer curriculum over the source and intermediate domains based on Wasserstein distance, motivated by theoretical analysis of CDA.

Capacity Estimation Domain Adaptation +3

ChunkAttention: Efficient Self-Attention with Prefix-Aware KV Cache and Two-Phase Partition

1 code implementation23 Feb 2024 Lu Ye, Ze Tao, Yong Huang, Yang Li

In this paper, we introduce ChunkAttention, a prefix-aware self-attention module that can detect matching prompt prefixes across multiple requests and share their key/value tensors in memory at runtime to improve the memory utilization of KV cache.

Open Ad Hoc Teamwork with Cooperative Game Theory

no code implementations23 Feb 2024 Jianhong Wang, Yang Li, Yuan Zhang, Wei Pan, Samuel Kaski

Open ad hoc teamwork further complicates this challenge by considering environments with a changing number of teammates, referred to as open teams.

Flexible Physical Camouflage Generation Based on a Differential Approach

no code implementations21 Feb 2024 Yang Li, Wenyi Tan, Chenxing Zhao, Shuangju Zhou, Xinkai Liang, Quan Pan

This involves incorporating a specially designed adversarial loss and covert constraint loss to guarantee the adversarial and covert nature of the camouflage in the physical world.

Neural Rendering

Unsupervised Text Style Transfer via LLMs and Attention Masking with Multi-way Interactions

no code implementations21 Feb 2024 Lei Pan, Yunshi Lan, Yang Li, Weining Qian

Among existing methods for UTST tasks, attention masking approach and Large Language Models (LLMs) are deemed as two pioneering methods.

In-Context Learning Knowledge Distillation +4

Aligning Individual and Collective Objectives in Multi-Agent Cooperation

no code implementations19 Feb 2024 Yang Li, WenHao Zhang, Jianhong Wang, Shao Zhang, Yali Du, Ying Wen, Wei Pan

The visualization of learning dynamics effectively demonstrates that AgA successfully achieves alignment between individual and collective objectives.

SMAC+

FGeo-HyperGNet: Geometric Problem Solving Integrating Formal Symbolic System and Hypergraph Neural Network

1 code implementation18 Feb 2024 Xiaokai Zhang, Na Zhu, Cheng Qin, Yang Li, Zhenbing Zeng, Tuo Leng

The symbolic part is a formal system built on FormalGeo, which can automatically perform geomertic relational reasoning and algebraic calculations and organize the solving process into a solution hypertree with conditions as hypernodes and theorems as hyperedges.

Geometry Problem Solving Relational Reasoning

BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data

no code implementations12 Feb 2024 Mateusz Łajszczak, Guillermo Cámbara, Yang Li, Fatih Beyhan, Arent van Korlaar, Fan Yang, Arnaud Joly, Álvaro Martín-Cortinas, Ammar Abbas, Adam Michalski, Alexis Moinet, Sri Karlapati, Ewa Muszyńska, Haohan Guo, Bartosz Putrycz, Soledad López Gambino, Kayeon Yoo, Elena Sokolova, Thomas Drugman

Echoing the widely-reported "emergent abilities" of large language models when trained on increasing volume of data, we show that BASE TTS variants built with 10K+ hours and 500M+ parameters begin to demonstrate natural prosody on textually complex sentences.

Disentanglement

Beyond Inserting: Learning Identity Embedding for Semantic-Fidelity Personalized Diffusion Generation

no code implementations31 Jan 2024 Yang Li, Songlin Yang, Wei Wang, Jing Dong

The previous methods either failed to accurately fit the face region or lost the interactive generative ability with other existing concepts in T2I models.

Image Generation

BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation

no code implementations30 Jan 2024 Zhennan Wu, Yang Li, Han Yan, Taizhang Shang, Weixuan Sun, Senbo Wang, Ruikai Cui, Weizhe Liu, Hiroyuki Sato, Hongdong Li, Pan Ji

A variational auto-encoder is employed to compress the tri-planes into the latent tri-plane space, on which the denoising diffusion process is performed.

Denoising Scene Generation

Localization of Dummy Data Injection Attacks in Power Systems Considering Incomplete Topological Information: A Spatio-Temporal Graph Wavelet Convolutional Neural Network Approach

no code implementations27 Jan 2024 Zhaoyang Qu, Yunchang Dong, Yang Li, Siqi Song, Tao Jiang, Min Li, Qiming Wang, Lei Wang, Xiaoyong Bo, Jiye Zang, Qi Xu

Unfortunately, this approach tends to overlook the inherent topological correlations within the non-Euclidean spatial attributes of power grid data, consequently leading to diminished accuracy in attack localization.

A New Method for Vehicle Logo Recognition Based on Swin Transformer

no code implementations27 Jan 2024 Yang Li, Doudou Zhang, Jianli Xiao

Additionally, the use of a transfer learning strategy enables our method to be on par with state-of-the-art VLR methods.

Logo Recognition Transfer Learning

Label-free detection of exosomes from different cellular sources based on surface-enhanced Raman spectroscopy combined with machine learning models

no code implementations25 Jan 2024 Yang Li, Xiaoming Lyu, Kuo Zhan, Haoyu Ji, Lei Qin, JianAn Huang

In comparison to other machine learning analysis, our method used small amount of SERS data to allow a simple and rapid exosome detection, which enables a timely subsequent study of cell-cell interactions, communication mechanisms, and disease mechanisms in life sciences.

SCNet: Sparse Compression Network for Music Source Separation

no code implementations24 Jan 2024 Weinan Tong, Jiaxu Zhu, Jun Chen, Shiyin Kang, Tao Jiang, Yang Li, Zhiyong Wu, Helen Meng

We use a higher compression ratio on subbands with less information to improve the information density and focus on modeling subbands with more information.

Music Source Separation

Computation Rate Maximization for Wireless Powered Edge Computing With Multi-User Cooperation

1 code implementation22 Jan 2024 Yang Li, Xing Zhang, Bo Lei, Qianying Zhao, Min Wei, Zheyan Qu, Wenbo Wang

Simulation results show that the performance of the proposed algorithms is comparable to that of the exhaustive search method, and the deep learning-based algorithm significantly reduces the execution time of the algorithm.

Edge-computing

Motion-Zero: Zero-Shot Moving Object Control Framework for Diffusion-Based Video Generation

no code implementations18 Jan 2024 Changgu Chen, Junwei Shu, Lianggangxu Chen, Gaoqi He, Changbo Wang, Yang Li

However, exerting control over the motion of objects in videos generated by any video diffusion model is a challenging problem.

Denoising Position +1

Machine Learning Insides OptVerse AI Solver: Design Principles and Applications

no code implementations11 Jan 2024 Xijun Li, Fangzhou Zhu, Hui-Ling Zhen, Weilin Luo, Meng Lu, Yimin Huang, Zhenan Fan, Zirui Zhou, Yufei Kuang, Zhihai Wang, Zijie Geng, Yang Li, Haoyang Liu, Zhiwu An, Muming Yang, Jianshu Li, Jie Wang, Junchi Yan, Defeng Sun, Tao Zhong, Yong Zhang, Jia Zeng, Mingxuan Yuan, Jianye Hao, Jun Yao, Kun Mao

To this end, we present a comprehensive study on the integration of machine learning (ML) techniques into Huawei Cloud's OptVerse AI Solver, which aims to mitigate the scarcity of real-world mathematical programming instances, and to surpass the capabilities of traditional optimization techniques.

Decision Making Management

GloTSFormer: Global Video Text Spotting Transformer

1 code implementation8 Jan 2024 Han Wang, Yanjie Wang, Yang Li, Can Huang

In this paper, we propose a novel Global Video Text Spotting Transformer GloTSFormer to model the tracking problem as global associations and utilize the Gaussian Wasserstein distance to guide the morphological correlation between frames.

Text Spotting

Learning Persistent Community Structures in Dynamic Networks via Topological Data Analysis

1 code implementation6 Jan 2024 Dexu Kong, Anping Zhang, Yang Li

Dynamic community detection methods often lack effective mechanisms to ensure temporal consistency, hindering the analysis of network evolution.

Clustering Community Detection +3

PPBFL: A Privacy Protected Blockchain-based Federated Learning Model

no code implementations2 Jan 2024 Yang Li, Chunhe Xia, Wanshuang Lin, Tianbo Wang

Therefore, we propose A Privacy Protected Blockchain-based Federated Learning Model (PPBFL) to enhance the security of federated learning and encourage active participation of nodes in model training.

Federated Learning

DOEPatch: Dynamically Optimized Ensemble Model for Adversarial Patches Generation

no code implementations28 Dec 2023 Wenyi Tan, Yang Li, Chenxing Zhao, ZhunGa Liu, Quan Pan

While ensemble models have proven effective, current research in the field of object detection typically focuses on the simple fusion of the outputs of all models, with limited attention being given to developing general adversarial patches that can function effectively in the physical world.

Autonomous Driving Object +3

Semantic Draw Engineering for Text-to-Image Creation

no code implementations23 Dec 2023 Yang Li, Huaqiang Jiang, Yangkai Wu

Text-to-image generation is conducted through Generative Adversarial Networks (GANs) or transformer models.

Computational Efficiency Text-to-Image Generation

Relation-Aware Question Answering for Heterogeneous Knowledge Graphs

1 code implementation19 Dec 2023 Haowei Du, Quzhe Huang, Chen Li, Chen Zhang, Yang Li, Dongyan Zhao

To address this issue, we construct a \textbf{dual relation graph} where each node denotes a relation in the original KG (\textbf{primal entity graph}) and edges are constructed between relations sharing same head or tail entities.

Knowledge Base Question Answering Knowledge Graphs +1

Multi-Granularity Information Interaction Framework for Incomplete Utterance Rewriting

no code implementations19 Dec 2023 Haowei Du, Dinghao Zhang, Chen Li, Yang Li, Dongyan Zhao

Recent approaches in Incomplete Utterance Rewriting (IUR) fail to capture the source of important words, which is crucial to edit the incomplete utterance, and introduce words from irrelevant utterances.

H-ensemble: An Information Theoretic Approach to Reliable Few-Shot Multi-Source-Free Transfer

no code implementations19 Dec 2023 Yanru Wu, Jianning Wang, Weida Wang, Yang Li

In this work, we adopt an information theoretic perspective on it and propose a framework named H-ensemble, which dynamically learns the optimal linear combination, or ensemble, of source models for the target task, using a generalization of maximal correlation regression.

Transfer Learning

Hypergrah-Enhanced Dual Convolutional Network for Bundle Recommendation

1 code implementation18 Dec 2023 Kangbo Liu, Yang Li, Yaoxin Wu, Zhaoxuan Wang, Xiaoxu Wang

While previous approaches have demonstrated notable performance, we argue that they may compromise the ternary relationship among users, items, and bundles.

Rich Human Feedback for Text-to-Image Generation

1 code implementation15 Dec 2023 Youwei Liang, Junfeng He, Gang Li, Peizhao Li, Arseniy Klimovskiy, Nicholas Carolan, Jiao Sun, Jordi Pont-Tuset, Sarah Young, Feng Yang, Junjie Ke, Krishnamurthy Dj Dvijotham, Katie Collins, Yiwen Luo, Yang Li, Kai J Kohlhoff, Deepak Ramachandran, Vidhya Navalpakkam

We show that the predicted rich human feedback can be leveraged to improve image generation, for example, by selecting high-quality training data to finetune and improve the generative models, or by creating masks with predicted heatmaps to inpaint the problematic regions.

Text-to-Image Generation

UniAR: Unifying Human Attention and Response Prediction on Visual Content

no code implementations15 Dec 2023 Peizhao Li, Junfeng He, Gang Li, Rachit Bhargava, Shaolei Shen, Nachiappan Valliappan, Youwei Liang, Hongxiang Gu, Venky Ramachandran, Golnaz Farhadi, Yang Li, Kai J Kohlhoff, Vidhya Navalpakkam

Such a model would enable predicting subjective feedback such as overall satisfaction or aesthetic quality ratings, along with the underlying human attention or interaction heatmaps and viewing order, enabling designers and content-creation models to optimize their creation for human-centric improvements.

Learning a Low-Rank Feature Representation: Achieving Better Trade-Off between Stability and Plasticity in Continual Learning

1 code implementation14 Dec 2023 Zhenrong Liu, Yang Li, Yi Gong, Yik-Chung Wu

This approach optimizes network parameters in the null space of the past tasks' feature representation matrix to guarantee the stability.

Continual Learning

Explainable Trajectory Representation through Dictionary Learning

no code implementations13 Dec 2023 Yuanbo Tang, Zhiyuan Peng, Yang Li

A hierarchical dictionary learning scheme is also proposed to ensure the algorithm's scalability on large networks, leading to a multi-scale trajectory representation.

Data Compression Dictionary Learning +1

Astrocyte-Enabled Advancements in Spiking Neural Networks for Large Language Modeling

no code implementations12 Dec 2023 Guobin Shen, Dongcheng Zhao, Yiting Dong, Yang Li, Jindong Li, Kang Sun, Yi Zeng

Within the complex neuroarchitecture of the brain, astrocytes play crucial roles in development, structure, and metabolism.

Language Modelling Text Generation

Fine-Grained Extraction of Road Networks via Joint Learning of Connectivity and Segmentation

1 code implementation7 Dec 2023 Yijia Xu, Liqiang Zhang, Wuming Zhang, Suhong Liu, Jingwen Li, Xingang Li, Yuebin Wang, Yang Li

Road network extraction from satellite images is widely applicated in intelligent traffic management and autonomous driving fields.

Autonomous Driving Management +2

Factor-Assisted Federated Learning for Personalized Optimization with Heterogeneous Data

no code implementations7 Dec 2023 Feifei Wang, Huiyun Tang, Yang Li

To address this issue, we develop a novel personalized federated learning framework for heterogeneous data, which we refer to as FedSplit.

Personalized Federated Learning

Resource Allocation for Semantic Communication under Physical-layer Security

no code implementations7 Dec 2023 Yang Li, Xinyu Zhou, Jun Zhao

The secrecy rate is the communication rate at which no information is disclosed to an eavesdropper.

Open-sourced Data Ecosystem in Autonomous Driving: the Present and Future

2 code implementations6 Dec 2023 Hongyang Li, Yang Li, Huijie Wang, Jia Zeng, Huilin Xu, Pinlong Cai, Li Chen, Junchi Yan, Feng Xu, Lu Xiong, Jingdong Wang, Futang Zhu, Chunjing Xu, Tiancai Wang, Fei Xia, Beipeng Mu, Zhihui Peng, Dahua Lin, Yu Qiao

With the continuous maturation and application of autonomous driving technology, a systematic examination of open-source autonomous driving datasets becomes instrumental in fostering the robust evolution of the industry ecosystem.

Autonomous Driving

Perceptual Group Tokenizer: Building Perception with Iterative Grouping

no code implementations30 Nov 2023 Zhiwei Deng, Ting Chen, Yang Li

In this paper, we propose the Perceptual Group Tokenizer, a model that entirely relies on grouping operations to extract visual features and perform self-supervised representation learning, where a series of grouping operations are used to iteratively hypothesize the context for pixels or superpixels to refine feature representations.

Representation Learning Self-Supervised Image Classification +2

Deep Reinforcement Learning Based Optimal Energy Management of Multi-energy Microgrids with Uncertainties

no code implementations30 Nov 2023 Yang Cui, Yang Xu, Yang Li, Yijian Wang, Xinpeng Zou

To help EMS formulate optimal dispatching schemes, a deep reinforcement learning (DRL)-based MEMG energy management scheme with renewable energy source (RES) uncertainty is proposed in this paper.

energy management Management +1

Advancing Attack-Resilient Scheduling of Integrated Energy Systems with Demand Response via Deep Reinforcement Learning

no code implementations28 Nov 2023 Yang Li, Wenjie Ma, Yuanzheng Li, Sen Li, Zhe Chen

Simulation results demonstrate that our method is capable of adequately addressing the uncertainties resulting from RES and loads, mitigating the impact of cyber-attacks on the scheduling strategy, and ensuring a stable demand supply for various energy sources.

Scheduling

Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic Approach

no code implementations23 Nov 2023 Bin Zhang, Hangyu Mao, Jingqing Ruan, Ying Wen, Yang Li, Shao Zhang, Zhiwei Xu, Dapeng Li, Ziyue Li, Rui Zhao, Lijuan Li, Guoliang Fan

The remarkable progress in Large Language Models (LLMs) opens up new avenues for addressing planning and decision-making problems in Multi-Agent Systems (MAS).

Decision Making Hallucination +3

GENET: Unleashing the Power of Side Information for Recommendation via Hypergraph Pre-training

no code implementations22 Nov 2023 Yang Li, Qi'ao Zhao, Chen Lin, Zhenjie Zhang, Xiaomin Zhu

(2) The diverse semantics of side information that describes items and users from multi-level in a context different from recommendation systems.

Sequential Recommendation

FBChain: A Blockchain-based Federated Learning Model with Efficiency and Secure Communication

no code implementations21 Nov 2023 Yang Li, Chunhe Xia, Wei Liu, Weidong Zhou, Chen Chen, Tianbo Wang

This article proposes Blockchain-based Federated Learning (FBChain) model for federated learning parameter communication to overcome the above two problems.

Federated Learning

Rethinking and Benchmarking Predict-then-Optimize Paradigm for Combinatorial Optimization Problems

no code implementations13 Nov 2023 Haoyu Geng, Hang Ruan, Runzhong Wang, Yang Li, Yang Wang, Lei Chen, Junchi Yan

Numerous web applications rely on solving combinatorial optimization problems, such as energy cost-aware scheduling, budget allocation on web advertising, and graph matching on social networks.

Benchmarking Combinatorial Optimization +3

CeCNN: Copula-enhanced convolutional neural networks in joint prediction of refraction error and axial length based on ultra-widefield fundus images

no code implementations7 Nov 2023 Chong Zhong, Yang Li, Danjuan Yang, Meiyan Li, Xingyao Zhou, Bo Fu, Catherine C. Liu, A. H. Welsh

Inspired by the spirit that information extracted from the data by statistical methods can improve the prediction accuracy of deep learning models, we formulate a class of multivariate response regression models with a higher-order tensor biomarker, for the bivariate tasks of regression-classification and regression-regression.

regression

In-Context Prompt Editing For Conditional Audio Generation

no code implementations1 Nov 2023 Ernie Chang, Pin-Jie Lin, Yang Li, Sidd Srinivasan, Gael Le Lan, David Kant, Yangyang Shi, Forrest Iandola, Vikas Chandra

We show that the framework enhanced the audio quality across the set of collected user prompts, which were edited with reference to the training captions as exemplars.

Audio Generation Retrieval

AdaptMVSNet: Efficient Multi-View Stereo with adaptive convolution and attention fusion

1 code implementation journal 2023 Pengfei Jiang, Xiaoyan Yang, Yuanjie Chen, Wenjie Song, Yang Li

To this end, adaptive convolution is introduced to significantly improve the efficiency in speed and metrics compared to current methods.

R$^3$ Prompting: Review, Rephrase and Resolve for Chain-of-Thought Reasoning in Large Language Models under Noisy Context

no code implementations25 Oct 2023 Qingyuan Tian, Hanlun Zhu, Lei Wang, Yang Li, Yunshi Lan

More analyses and ablation studies show the robustness and generalization of R$^3$ prompting method in solving reasoning tasks in LLMs under noisy context.

Sentence

BRFL: A Blockchain-based Byzantine-Robust Federated Learning Model

no code implementations20 Oct 2023 Yang Li, Chunhe Xia, Chang Li, Tianbo Wang

With the increasing importance of machine learning, the privacy and security of training data have become critical.

Federated Learning

Kernel Learning in Ridge Regression "Automatically" Yields Exact Low Rank Solution

1 code implementation18 Oct 2023 Yunlu Chen, Yang Li, Keli Liu, Feng Ruan

Assuming that the covariates have nonzero explanatory power for the response only through a low dimensional subspace (central mean subspace), we find that the global minimizer of the finite sample kernel learning objective is also low rank with high probability.

regression

MM-BigBench: Evaluating Multimodal Models on Multimodal Content Comprehension Tasks

1 code implementation13 Oct 2023 Xiaocui Yang, Wenfang Wu, Shi Feng, Ming Wang, Daling Wang, Yang Li, Qi Sun, Yifei Zhang, XiaoMing Fu, Soujanya Poria

Consequently, our work complements research on the performance of MLLMs in multimodal comprehension tasks, achieving a more comprehensive and holistic evaluation of MLLMs.

Multimodal Reasoning

A Zero-Shot Language Agent for Computer Control with Structured Reflection

no code implementations12 Oct 2023 Tao Li, Gang Li, Zhiwei Deng, Bryan Wang, Yang Li

To perform a task, recent works often require a model to learn from trace examples of the task via either supervised learning or few/many-shot prompting.

Management

Language Models As Semantic Indexers

no code implementations11 Oct 2023 Bowen Jin, Hansi Zeng, Guoyin Wang, Xiusi Chen, Tianxin Wei, Ruirui Li, Zhengyang Wang, Zheng Li, Yang Li, Hanqing Lu, Suhang Wang, Jiawei Han, Xianfeng Tang

Semantic identifier (ID) is an important concept in information retrieval that aims to preserve the semantics of objects such as documents and items inside their IDs.

Contrastive Learning Information Retrieval +2

Automatic Macro Mining from Interaction Traces at Scale

1 code implementation10 Oct 2023 Forrest Huang, Gang Li, Tao Li, Yang Li

Macros are building block tasks of our everyday smartphone activity (e. g., "login", or "booking a flight").

Secondary frequency control of islanded microgrid considering wind and solar stochastics

no code implementations8 Oct 2023 Cheng Zhong, Zhifu Jiang, Xiangyu Zhang, Jikai Chen, Yang Li

Finally, a microgrid simulation model including multiple PV and wind DGs is built and performed in various scenarios compared to the traditional secondary frequency control method.

Model Predictive Control

Distribution-free risk assessment of regression-based machine learning algorithms

no code implementations5 Oct 2023 Sukrita Singh, Neeraj Sarna, Yuanyuan Li, Yang Li, Agni Orfanoudaki, Michael Berger

We solve the risk-assessment problem using the conformal prediction approach, which provides prediction intervals that are guaranteed to contain the true label with a given probability.

Conformal Prediction Prediction Intervals +1

Low-carbon optimal dispatch of integrated energy system considering demand response under the tiered carbon trading mechanism

no code implementations4 Oct 2023 Limeng Wang, Xuemeng Liu, Yang Li, Duo Chang, Xing Ren

The example results show that considering the carbon trading cost and demand response under the tiered carbon trading mechanism, the total operating cost of IES is reduced by 5. 69% and the carbon emission is reduced by 17. 06%, which significantly improves the reliability, economy and low carbon performance of IES.

Scheduling

A Demand-Supply Cooperative Responding Strategy in Power System with High Renewable Energy Penetration

no code implementations26 Sep 2023 Yuanzheng Li, Xinxin Long, Yang Li, Yizhou Ding, Tao Yang, Zhigang Zeng

In this context, unreasonable profit distributions on the demand-supply side would lead to the conflict of interests and diminish the effectiveness of cooperative responses.

Bad Actor, Good Advisor: Exploring the Role of Large Language Models in Fake News Detection

1 code implementation21 Sep 2023 Beizhe Hu, Qiang Sheng, Juan Cao, Yuhui Shi, Yang Li, Danding Wang, Peng Qi

To instantiate this proposal, we design an adaptive rationale guidance network for fake news detection (ARG), in which SLMs selectively acquire insights on news analysis from the LLMs' rationales.

Fake News Detection

Learning Point-wise Abstaining Penalty for Point Cloud Anomaly Detection

1 code implementation19 Sep 2023 Shaocong Xu, Pengfei Li, Xinyu Liu, Qianpu Sun, Yang Li, Shihui Guo, Zhen Wang, Bo Jiang, Rui Wang, Kehua Sheng, Bo Zhang, Hao Zhao

We demonstrate that learning different abstaining penalties, apart from point-wise penalty, for different types of (synthesized) outliers can further improve the performance.

Anomaly Detection Autonomous Driving +1

For A More Comprehensive Evaluation of 6DoF Object Pose Tracking

no code implementations14 Sep 2023 Yang Li, Fan Zhong, Xin Wang, Shuangbing Song, Jiachen Li, Xueying Qin, Changhe Tu

The limitations of previous scoring methods and error metrics are analyzed, based on which we introduce our improved evaluation methods.

Pose Tracking

Folding Attention: Memory and Power Optimization for On-Device Transformer-based Streaming Speech Recognition

no code implementations14 Sep 2023 Yang Li, Liangzhen Lai, Yuan Shangguan, Forrest N. Iandola, Zhaoheng Ni, Ernie Chang, Yangyang Shi, Vikas Chandra

Instead, the bottleneck lies in the linear projection layers of multi-head attention and feedforward networks, constituting a substantial portion of the model size and contributing significantly to computation, memory, and power usage.

speech-recognition Speech Recognition

Multi-Modal Automatic Prosody Annotation with Contrastive Pretraining of SSWP

1 code implementation11 Sep 2023 Jinzuomu Zhong, Yang Li, Hui Huang, Jie Liu, Zhiba Su, Jing Guo, Benlai Tang, Fengjie Zhu

While human prosody annotation contributes a lot to the performance, it is a labor-intensive and time-consuming process, often resulting in inconsistent outcomes.

Maximizing the performance for microcomb based microwave photonic transversal signal processors

no code implementations10 Sep 2023 Yang Sun, Jiayang Wu, Yang Li, Xingyuan Xu, Guanghui Ren, Mengxi Tan, Sai Tak Chu, Brent E. Little, Roberto Morandotti, Arnan Mitchell, David J. Moss

Microwave photonic (MWP) transversal signal processors offer a compelling solution for realizing versatile high-speed information processing by combining the advantages of reconfigurable electrical digital signal processing and high-bandwidth photonic processing.

Cross-Utterance Conditioned VAE for Speech Generation

no code implementations8 Sep 2023 Yang Li, Cheng Yu, Guangzhi Sun, Weiqin Zu, Zheng Tian, Ying Wen, Wei Pan, Chao Zhang, Jun Wang, Yang Yang, Fanglei Sun

Experimental results on the LibriTTS datasets demonstrate that our proposed models significantly enhance speech synthesis and editing, producing more natural and expressive speech.

Speech Synthesis

TradingGPT: Multi-Agent System with Layered Memory and Distinct Characters for Enhanced Financial Trading Performance

no code implementations7 Sep 2023 Yang Li, Yangyang Yu, Haohang Li, Zhi Chen, Khaldoun Khashanah

In financial trading contexts, LLMs serve as the decision core for trading agents, leveraging their layered memory system to integrate multi-source historical actions and market insights.

Navigate

Towards General and Efficient Online Tuning for Spark

no code implementations5 Sep 2023 Yang Li, Huaijun Jiang, Yu Shen, Yide Fang, Xiaofeng Yang, Danqing Huang, Xinyi Zhang, Wentao Zhang, Ce Zhang, Peng Chen, Bin Cui

The distributed data analytic system -- Spark is a common choice for processing massive volumes of heterogeneous data, while it is challenging to tune its parameters to achieve high performance.

Bayesian Optimization Meta-Learning

Short-term power load forecasting method based on CNN-SAEDN-Res

no code implementations2 Sep 2023 Yang Cui, Han Zhu, Yijian Wang, Lu Zhang, Yang Li

In this method, feature extraction module is composed of a two-dimensional convolutional neural network, which is used to mine the local correlation between data and obtain high-dimensional data features.

Load Forecasting Time Series

Model predictive control strategy in waked wind farms for optimal fatigue loads

no code implementations25 Aug 2023 Cheng Zhong, Yicheng Ding, Husai Wang, Jikai Chen, Jian Wang, Yang Li

In this paper, a closed-loop model predictive controller is developed that minimizes the wind farm tracking errors, the dynamical fatigue load, and and the load equalization.

Model Predictive Control

Deep Reinforcement Learning-driven Cross-Community Energy Interaction Optimal Scheduling

no code implementations24 Aug 2023 Yang Li, Wenjie Ma, Fanjin Bu, Zhen Yang, Bin Wang, Meng Han

In order to coordinate energy interactions among various communities and energy conversions among multi-energy subsystems within the multi-community integrated energy system under uncertain conditions, and achieve overall optimization and scheduling of the comprehensive energy system, this paper proposes a comprehensive scheduling model that utilizes a multi-agent deep reinforcement learning algorithm to learn load characteristics of different communities and make decisions based on this knowledge.

reinforcement-learning Scheduling

Learning the Plasticity: Plasticity-Driven Learning Framework in Spiking Neural Networks

no code implementations23 Aug 2023 Guobin Shen, Dongcheng Zhao, Yiting Dong, Yang Li, Feifei Zhao, Yi Zeng

This shift in focus from weight adjustment to mastering the intricacies of synaptic change offers a more flexible and dynamic pathway for neural networks to evolve and adapt.

Phoneme Hallucinator: One-shot Voice Conversion via Set Expansion

1 code implementation11 Aug 2023 Siyuan Shan, Yang Li, Amartya Banerjee, Junier B. Oliva

Objective and subjective evaluations show that \textit{Phoneme Hallucinator} outperforms existing VC methods for both intelligibility and speaker similarity.

Voice Conversion

JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games

no code implementations9 Aug 2023 Yang Li, Kun Xiong, Yingping Zhang, Jiangcheng Zhu, Stephen Mcaleer, Wei Pan, Jun Wang, Zonghong Dai, Yaodong Yang

This paper presents an empirical exploration of non-transitivity in perfect-information games, specifically focusing on Xiangqi, a traditional Chinese board game comparable in game-tree complexity to chess and shogi.

EEG-based Emotion Style Transfer Network for Cross-dataset Emotion Recognition

no code implementations9 Aug 2023 Yijin Zhou, Fu Li, Yang Li, Youshuo Ji, Lijian Zhang, Yuanfang Chen, Wenming Zheng, Guangming Shi

The transfer module encodes the domain-specific information of source and target domains and then re-constructs the source domain's emotional pattern and the target domain's statistical characteristics into the new stylized EEG representations.

EEG EEG Emotion Recognition +1

PMU measurements based short-term voltage stability assessment of power systems via deep transfer learning

no code implementations7 Aug 2023 Yang Li, Shitu Zhang, Yuanzheng Li, Jiting Cao, Shuyue Jia

Deep learning has emerged as an effective solution for addressing the challenges of short-term voltage stability assessment (STVSA) in power systems.

Data Augmentation Transfer Learning

Contact-conditioned hand-held object reconstruction from single-view images

1 code implementation journal 2023 Xiaoyuan Wang, Yang Li, Adnane Boukhayma, Changbo Wang, Marc Christie

Reconstructing the shape of hand-held objects from single-view color images is a long-standing problem in computer vision and computer graphics.

Object Object Reconstruction

InvVis: Large-Scale Data Embedding for Invertible Visualization

1 code implementation30 Jul 2023 Huayuan Ye, Chenhui Li, Yang Li, Changbo Wang

We propose a new method to efficiently express chart data in the form of images, enabling large-capacity data embedding.

A large calcium-imaging dataset reveals a systematic V4 organization for natural scenes

no code implementations3 Jul 2023 Tianye Wang, Haoxuan Yao, Tai Sing Lee, Jiayi Hong, Yang Li, Hongfei Jiang, Ian Max Andolina, Shiming Tang

To gain deeper insights into visual processing of natural scenes, we utilized widefield calcium-imaging of primate V4 in response to many natural images, generating a large dataset of columnar-scale responses.

InferTurbo: A Scalable System for Boosting Full-graph Inference of Graph Neural Network over Huge Graphs

no code implementations1 Jul 2023 Dalong Zhang, Xianzheng Song, Zhiyang Hu, Yang Li, Miao Tao, Binbin Hu, Lin Wang, Zhiqiang Zhang, Jun Zhou

Inspired by the philosophy of ``think-like-a-vertex", a GAS-like (Gather-Apply-Scatter) schema is proposed to describe the computation paradigm and data flow of GNN inference.

Philosophy

Farthest Streamline Sampling for the Uniform Distribution of Forearm Muscle Fiber Tracts from Diffusion Tensor Imaging

no code implementations24 Jun 2023 Yang Li, Shihan Ma, Jiamin Zhao, Qing Li, Xinjun Sheng

FSS reduced the sampling of long tracts (10% reduction in fiber length, P<0. 05), and the architectural parameters were within physiological ranges (two parameters with P<0. 05).

Recent Developments in Recommender Systems: A Survey

no code implementations22 Jun 2023 Yang Li, Kangbo Liu, Ranjan Satapathy, Suhang Wang, Erik Cambria

The objective of this study is to provide an overview of the current state-of-the-art in the field and highlight the latest trends in the development of recommender systems.

Fairness Recommendation Systems

Computing large deviation prefactors of stochastic dynamical systems based on machine learning

no code implementations20 Jun 2023 Yang Li, Shenglan Yuan, Linghongzhi Lu, Xianbin Liu

In this paper, we present large deviation theory that characterizes the exponential estimate for rare events of stochastic dynamical systems in the limit of weak noise.

Collaborative Optimization of Multi-microgrids System with Shared Energy Storage Based on Multi-agent Stochastic Game and Reinforcement Learning

no code implementations19 Jun 2023 Yijian Wang, Yang Cui, Yang Li, Yang Xu

The proposed MMG system framework can reduce energy fluctuations in the main grid by 1746. 5kW in 24 hours and achieve a cost reduction of 16. 21% in the test.

FewSAR: A Few-shot SAR Image Classification Benchmark

1 code implementation16 Jun 2023 Rui Zhang, Ziqi Wang, Yang Li, Jiabao Wang, Zhiteng Wang

Motivated by this observation, we propose a novel few-shot SAR image classification benchmark (FewSAR) to address this issue.

Classification Few-Shot Learning +2

Output Voltage Response Improvement and Ripple Reduction Control for Input-parallel Output-parallel High-Power DC Supply

no code implementations16 Jun 2023 Jianhui Meng, Xiaolong Wu, Tairan Ye, Jingsen Yu, Likang Gu, Zili Zhang, Yang Li

A three-phase isolated AC-DC-DC power supply is widely used in the industrial field due to its attractive features such as high-power density, modularity for easy expansion and electrical isolation.

Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination

1 code implementation5 Jun 2023 Yang Li, Shao Zhang, Jichen Sun, WenHao Zhang, Yali Du, Ying Wen, Xinbing Wang, Wei Pan

In order to solve cooperative incompatibility in learning and effectively address the problem in the context of ZSC, we introduce the Cooperative Open-ended LEarning (COLE) framework, which formulates open-ended objectives in cooperative games with two players using perspectives of graph theory to evaluate and pinpoint the cooperative capacity of each strategy.

Image encryption for Offshore wind power based on 2D-LCLM and Zhou Yi Eight Trigrams

no code implementations2 Jun 2023 Lei Kou, Jinbo Wu, Fangfang Zhang, Peng Ji, Wende Ke, Junhe Wan, Hailin Liu, Yang Li, Quande Yuan

In this paper, we propose a new encryption algorithm for offshore wind power based on two-dimensional lagged complex logistic mapping (2D-LCLM) and Zhou Yi Eight Trigrams.

Improving Stability and Performance of Spiking Neural Networks through Enhancing Temporal Consistency

no code implementations23 May 2023 Dongcheng Zhao, Guobin Shen, Yiting Dong, Yang Li, Yi Zeng

Notably, our algorithm has achieved state-of-the-art performance on neuromorphic datasets DVS-CIFAR10 and N-Caltech101, and can achieve superior performance in the test phase with timestep T=1.

Dive into the Power of Neuronal Heterogeneity

no code implementations19 May 2023 Guobin Shen, Dongcheng Zhao, Yiting Dong, Yang Li, Yi Zeng

The biological neural network is a vast and diverse structure with high neural heterogeneity.

Continuous Control

Feature Chirality in Deep Learning Models

no code implementations6 May 2023 Shipeng Ji, Yang Li, Ruizhi Fu, Jiabao Wang, Zhuang Miao

As deep learning applications extensively increase by leaps and bounds, their interpretability has become increasingly prominent.

MH-DETR: Video Moment and Highlight Detection with Cross-modal Transformer

no code implementations29 Apr 2023 Yifang Xu, Yunzhuo Sun, Yang Li, Yilei Shi, Xiaoxiang Zhu, Sidan Du

With the increasing demand for video understanding, video moment and highlight detection (MHD) has emerged as a critical research topic.

Highlight Detection Video Understanding

MINN: Learning the dynamics of differential-algebraic equations and application to battery modeling

no code implementations27 Apr 2023 Yicun Huang, Changfu Zou, Yang Li, Torsten Wik

The concept of integrating physics-based and data-driven approaches has become popular for modeling sustainable energy systems.

OpenBox: A Python Toolkit for Generalized Black-box Optimization

1 code implementation26 Apr 2023 Huaijun Jiang, Yu Shen, Yang Li, Beicheng Xu, Sixian Du, Wentao Zhang, Ce Zhang, Bin Cui

Black-box optimization (BBO) has a broad range of applications, including automatic machine learning, experimental design, and database knob tuning.

Experimental Design

Binary stochasticity enabled highly efficient neuromorphic deep learning achieves better-than-software accuracy

no code implementations25 Apr 2023 Yang Li, Wei Wang, Ming Wang, Chunmeng Dou, Zhengyu Ma, Huihui Zhou, Peng Zhang, Nicola Lepri, Xumeng Zhang, Qing Luo, Xiaoxin Xu, Guanhua Yang, Feng Zhang, Ling Li, Daniele Ielmini, Ming Liu

We propose a binary stochastic learning algorithm that modifies all elementary neural network operations, by introducing (i) stochastic binarization of both the forwarding signals and the activation function derivatives, (ii) signed binarization of the backpropagating errors, and (iii) step-wised weight updates.

Binarization

Multi-Microgrid Collaborative Optimization Scheduling Using an Improved Multi-Agent Soft Actor-Critic Algorithm

no code implementations1 Apr 2023 Jiankai Gao, Yang Li, Bin Wang, Haibo Wu

The implementation of a multi-microgrid (MMG) system with multiple renewable energy sources enables the facilitation of electricity trading.

AutoML energy management +3

CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Evaluations on HumanEval-X

2 code implementations30 Mar 2023 Qinkai Zheng, Xiao Xia, Xu Zou, Yuxiao Dong, Shan Wang, Yufei Xue, Zihan Wang, Lei Shen, Andi Wang, Yang Li, Teng Su, Zhilin Yang, Jie Tang

Large pre-trained code generation models, such as OpenAI Codex, can generate syntax- and function-correct code, making the coding of programmers more productive and our pursuit of artificial general intelligence closer.

Code Generation

MSAT: Biologically Inspired Multi-Stage Adaptive Threshold for Conversion of Spiking Neural Networks

no code implementations23 Mar 2023 Xiang He, Yang Li, Dongcheng Zhao, Qingqun Kong, Yi Zeng

The self-adaptation to membrane potential and input allows a timely adjustment of the threshold to fire spike faster and transmit more information.

Sentiment Analysis Sentiment Classification +2

An Efficient Knowledge Transfer Strategy for Spiking Neural Networks from Static to Event Domain

1 code implementation23 Mar 2023 Xiang He, Dongcheng Zhao, Yang Li, Guobin Shen, Qingqun Kong, Yi Zeng

In order to improve the generalization ability of SNNs on event-based datasets, we use static images to assist SNN training on event data.

Transfer Learning

Ensemble Nonlinear Model Predictive Control for Residential Solar-Battery Energy Management

no code implementations18 Mar 2023 Yang Li, D. Mahinda Vilathgamuwa, Daniel E. Quevedo, Chih Feng Lee, Changfu Zou

In a dynamic distribution market environment, residential prosumers with solar power generation and battery energy storage devices can flexibly interact with the power grid via power exchange.

energy management Management +1

Mpox-AISM: AI-Mediated Super Monitoring for Mpox and Like-Mpox

no code implementations17 Mar 2023 Yubiao Yue, Minghua Jiang, Xinyue Zhang, Jialong Xu, Huacong Ye, Fan Zhang, Zhenzhang Li, Yang Li

With the help of the Internet and communication terminal, Mpox-AISM can perform a real-time, low-cost, and convenient diagnosis for earlier-stage mpox in various real-world settings, thereby effectively curbing the spread of mpox virus.

Data Augmentation Decision Making +2

A Unified and Efficient Coordinating Framework for Autonomous DBMS Tuning

no code implementations10 Mar 2023 Xinyi Zhang, Zhuo Chang, Hong Wu, Yang Li, Jia Chen, Jian Tan, Feifei Li, Bin Cui

To tune different components for DBMS, a coordinating mechanism is needed to make the multiple agents cognizant of each other.

Thompson Sampling

Attention-based Graph Convolution Fusing Latent Structures and Multiple Features for Graph Neural Networks

1 code implementation2 Mar 2023 Yang Li, Yuichi Tanaka

Instead, we propose two methods to improve the representational power of AGCs by utilizing 1) structural information in a high-dimensional space and 2) multiple attention functions when calculating their weights.

Transfer Learning for Bayesian Optimization: A Survey

no code implementations12 Feb 2023 Tianyi Bai, Yang Li, Yu Shen, Xinyi Zhang, Wentao Zhang, Bin Cui

A wide spectrum of design and decision problems, including parameter tuning, A/B testing and drug design, intrinsically are instances of black-box optimization.

Bayesian Optimization Transfer Learning

Cooperative Open-ended Learning Framework for Zero-shot Coordination

1 code implementation9 Feb 2023 Yang Li, Shao Zhang, Jichen Sun, Yali Du, Ying Wen, Xinbing Wang, Wei Pan

However, these approaches can result in a loss of learning and an inability to cooperate with certain strategies within the population, known as cooperative incompatibility.

Rover: An online Spark SQL tuning service via generalized transfer learning

no code implementations8 Feb 2023 Yu Shen, Xinyuyang Ren, Yupeng Lu, Huaijun Jiang, Huanyong Xu, Di Peng, Yang Li, Wentao Zhang, Bin Cui

When applying transfer learning to accelerate the tuning process, we notice two domain-specific challenges: 1) most previous work focus on transferring tuning history, while expert knowledge from Spark engineers is of great potential to improve the tuning performance but is not well studied so far; 2) history tasks should be carefully utilized, where using dissimilar ones lead to a deteriorated performance in production.

Bayesian Optimization Transfer Learning

DivBO: Diversity-aware CASH for Ensemble Learning

no code implementations7 Feb 2023 Yu Shen, Yupeng Lu, Yang Li, Yaofeng Tu, Wentao Zhang, Bin Cui

To tackle this issue and further enhance the ensemble performance, we propose DivBO, a diversity-aware framework to inject explicit search of diversity into the CASH problems.

AutoML Bayesian Optimization +1

HardSATGEN: Understanding the Difficulty of Hard SAT Formula Generation and A Strong Structure-Hardness-Aware Baseline

1 code implementation4 Feb 2023 Yang Li, Xinyan Chen, Wenxuan Guo, Xijun Li, Wanqian Luo, Junhua Huang, Hui-Ling Zhen, Mingxuan Yuan, Junchi Yan

On top of the observations that industrial formulae exhibit clear community structure and oversplit substructures lead to the difficulty in semantic formation of logical structures, we propose HardSATGEN, which introduces a fine-grained control mechanism to the neural split-merge paradigm for SAT formula generation to better recover the structural and computational properties of the industrial benchmarks.

PLay: Parametrically Conditioned Layout Generation using Latent Diffusion

no code implementations27 Jan 2023 Chin-Yi Cheng, Forrest Huang, Gang Li, Yang Li

Layout design is an important task in various design fields, including user interface, document, and graphic design.

Layout Design

GPU-based Private Information Retrieval for On-Device Machine Learning Inference

1 code implementation26 Jan 2023 Maximilian Lam, Jeff Johnson, Wenjie Xiong, Kiwan Maeng, Udit Gupta, Yang Li, Liangzhen Lai, Ilias Leontiadis, Minsoo Rhu, Hsien-Hsin S. Lee, Vijay Janapa Reddi, Gu-Yeon Wei, David Brooks, G. Edward Suh

Together, for various on-device ML applications such as recommendation and language modeling, our system on a single V100 GPU can serve up to $100, 000$ queries per second -- a $>100 \times$ throughput improvement over a CPU-based baseline -- while maintaining model accuracy.

Information Retrieval Language Modelling +1

HTTE: A Hybrid Technique For Travel Time Estimation In Sparse Data Environments

no code implementations12 Jan 2023 Nikolaos Zygouras, Nikolaos Panagiotou, Yang Li, Dimitrios Gunopulos, Leonidas Guibas

Travel time estimation is a critical task, useful to many urban applications at the individual citizen and the stakeholder level.

Travel Time Estimation

Finding the Most Transferable Tasks for Brain Image Segmentation

no code implementations3 Jan 2023 Yicong Li, Yang Tan, Jingyun Yang, Yang Li, Xiao-Ping Zhang

Furthermore, within the same modality, transferring from the source task that has stronger RoI shape similarity with the target task can significantly improve the final transfer performance.

Brain Image Segmentation Image Segmentation +3

SparseMAE: Sparse Training Meets Masked Autoencoders

no code implementations ICCV 2023 Aojun Zhou, Yang Li, Zipeng Qin, Jianbo Liu, Junting Pan, Renrui Zhang, Rui Zhao, Peng Gao, Hongsheng Li

In this paper, we aim to reduce model complexity from large vision transformers pretrained by MAE with assistant of sparse training.

Federated Multi-Agent Deep Reinforcement Learning Approach via Physics-Informed Reward for Multi-Microgrid Energy Management

no code implementations29 Dec 2022 Yuanzheng Li, Shangyang He, Yang Li, Yang Shi, Zhigang Zeng

Then, these local models are periodically uploaded to a server and their parameters are aggregated to build a global agent, which will be broadcasted to MGs and replace their local agents.

energy management Federated Learning +4

An Information-Theoretic Approach to Transferability in Task Transfer Learning

no code implementations20 Dec 2022 Yajie Bao, Yang Li, Shao-Lun Huang, Lin Zhang, Lizhong Zheng, Amir Zamir, Leonidas Guibas

Task transfer learning is a popular technique in image processing applications that uses pre-trained models to reduce the supervision cost of related tasks.

Model Selection Transfer Learning

ENGNN: A General Edge-Update Empowered GNN Architecture for Radio Resource Management in Wireless Networks

no code implementations14 Dec 2022 Yunqi Wang, Yang Li, Qingjiang Shi, Yik-Chung Wu

In order to achieve high data rate and ubiquitous connectivity in future wireless networks, a key task is to efficiently manage the radio resource by judicious beamforming and power allocation.

Management

Super-resolution Probabilistic Rain Prediction from Satellite Data Using 3D U-Nets and EarthFormers

1 code implementation6 Dec 2022 Yang Li, Haiyu Dong, Zuliang Fang, Jonathan Weyn, Pete Luferenko

To further improve the model performance, multi-model ensemble and threshold optimization were used to produce the final probabilistic rain prediction.

Decision Making Super-Resolution

A Dual-scale Lead-seperated Transformer With Lead-orthogonal Attention And Meta-information For Ecg Classification

no code implementations23 Nov 2022 Yang Li, Guijin Wang, Zhourui Xia, Wenming Yang, Li Sun

Auxiliary diagnosis of cardiac electrophysiological status can be obtained through the analysis of 12-lead electrocardiograms (ECGs).

ECG Classification

Learning Cooperative Beamforming with Edge-Update Empowered Graph Neural Networks

no code implementations23 Nov 2022 Yunqi Wang, Yang Li, Qingjiang Shi, Yik-Chung Wu

However, the current GNNs are only equipped with the node-update mechanism, which restricts it from modeling more complicated problems such as the cooperative beamforming design, where the beamformers are on the graph edges of wireless networks.

Learn from Yesterday: A Semi-Supervised Continual Learning Method for Supervision-Limited Text-to-SQL Task Streams

1 code implementation21 Nov 2022 Yongrui Chen, Xinnan Guo, Tongtong Wu, Guilin Qi, Yang Li, Yang Dong

The first solution Vanilla is to perform self-training, augmenting the supervised training data with predicted pseudo-labeled instances of the current task, while replacing the full volume retraining with episodic memory replay to balance the training efficiency with the performance of previous tasks.

Continual Learning Text-To-SQL

Hierarchical Estimation for Effective and Efficient Sampling Graph Neural Network

no code implementations16 Nov 2022 Yang Li, Bingbing Xu, Qi Cao, Yige Yuan, HuaWei Shen

On account that previous studies either lacks variance analysis or only focus on a particular sampling paradigm, we firstly propose an unified node sampling variance analysis framework and analyze the core challenge "circular dependency" for deriving the minimum variance sampler, i. e., sampling probability depends on node embeddings while node embeddings can not be calculated until sampling is finished.

Time Series Time Series Analysis

Accounting for Temporal Variability in Functional Magnetic Resonance Imaging Improves Prediction of Intelligence

1 code implementation11 Nov 2022 Yang Li, Xin Ma, Raj Sunderraman, Shihao Ji, Suprateek Kundu

We compare the prediction performance for different intelligence measures based on static FC, dynamic FC, and region level time series acquired from the Adolescent Brain Cognitive Development (ABCD) study involving close to 7000 individuals.

feature selection Time Series +1

CCPrefix: Counterfactual Contrastive Prefix-Tuning for Many-Class Classification

no code implementations11 Nov 2022 Yang Li, Canran Xu, Guodong Long, Tao Shen, Chongyang Tao, Jing Jiang

Basically, an instance-dependent soft prefix, derived from fact-counterfactual pairs in the label space, is leveraged to complement the language verbalizers in many-class classification.

Classification counterfactual +7

A Random Forest and Current Fault Texture Feature-Based Method for Current Sensor Fault Diagnosis in Three-Phase PWM VSR

no code implementations8 Nov 2022 Lei Kou, Xiao-dong Gong, Yi Zheng, Xiu-hui Ni, Yang Li, Quan-de Yuan, Ya-nan Dong

The current sensor faults may bring hidden danger or damage to the whole system; therefore, this paper proposed a random forest (RF) and current fault texture feature-based method for current sensor fault diagnosis in three-phase PWM VSR systems.

Fault Detection

Wind Power Forecasting Considering Data Privacy Protection: A Federated Deep Reinforcement Learning Approach

no code implementations2 Nov 2022 Yang Li, Ruinong Wang, Yuanzheng Li, Meng Zhang, Chao Long

To handle the data privacy and openness, we propose a forecasting scheme that combines federated learning and deep reinforcement learning (DRL) for ultra-short-term wind power forecasting, called federated deep reinforcement learning (FedDRL).

Federated Learning Privacy Preserving +2

Factorized Blank Thresholding for Improved Runtime Efficiency of Neural Transducers

no code implementations2 Nov 2022 Duc Le, Frank Seide, Yuhao Wang, Yang Li, Kjell Schubert, Ozlem Kalinli, Michael L. Seltzer

We show how factoring the RNN-T's output distribution can significantly reduce the computation cost and power consumption for on-device ASR inference with no loss in accuracy.

Review on Monitoring, Operation and Maintenance of Smart Offshore Wind Farms

no code implementations1 Nov 2022 Lei Kou, Yang Li, Fangfang Zhang, Xiaodong Gong, Yinghong Hu, Quande Yuan, Wende Ke

In recent years, with the development of wind energy, the number and scale of wind farms are developing rapidly.

Self-supervised Graph-based Point-of-interest Recommendation

no code implementations22 Oct 2022 Yang Li, Tong Chen, Peng-Fei Zhang, Zi Huang, Hongzhi Yin

In order to counteract the scarcity and incompleteness of POI check-ins, we propose a novel self-supervised learning paradigm in \ssgrec, where the trajectory representations are contrastively learned from two augmented views on geolocations and temporal transitions.

Self-Supervised Learning

Understanding Embodied Reference with Touch-Line Transformer

1 code implementation11 Oct 2022 Yang Li, Xiaoxue Chen, Hao Zhao, Jiangtao Gong, Guyue Zhou, Federico Rossano, Yixin Zhu

Human studies have revealed that objects referred to or pointed to do not lie on the elbow-wrist line, a common misconception; instead, they lie on the so-called virtual touch line.

Contrastive Bayesian Analysis for Deep Metric Learning

1 code implementation10 Oct 2022 Shichao Kan, Zhiquan He, Yigang Cen, Yang Li, Vladimir Mladenovic, Zhihai He

Recent methods for deep metric learning have been focusing on designing different contrastive loss functions between positive and negative pairs of samples so that the learned feature embedding is able to pull positive samples of the same class closer and push negative samples from different classes away from each other.

Contrastive Learning Metric Learning

Spotlight: Mobile UI Understanding using Vision-Language Models with a Focus

no code implementations29 Sep 2022 Gang Li, Yang Li

Specifically, we enhance a vision-language model that only takes the screenshot of the UI and a region of interest on the screen -- the focus -- as the input.

Language Modelling Multi-Task Learning

MUG: Interactive Multimodal Grounding on User Interfaces

no code implementations29 Sep 2022 Tao Li, Gang Li, Jingjie Zheng, Purple Wang, Yang Li

To investigate the problem, we create a new dataset that consists of 77, 820 sequences of human user-agent interaction on mobile interfaces in which 20% involves multiple rounds of interactions.

Controlling mean exit time of stochastic dynamical systems based on quasipotential and machine learning

no code implementations27 Sep 2022 Yang Li, Shenglan Yuan, Shengyuan Xu

The mean exit time escaping basin of attraction in the presence of white noise is of practical importance in various scientific fields.

Optimal dispatch of low-carbon integrated energy system considering nuclear heating and carbon trading

no code implementations24 Sep 2022 Yang Li, Fanjin Bu, Jiankai Gao, Guoqing Lia

The development of miniaturized nuclear power (NP) units and the improvement of the carbon trading market provide a new way to realize the low-carbon operation of integrated energy systems (IES).

Scheduling

Enabling Conversational Interaction with Mobile UI using Large Language Models

1 code implementation18 Sep 2022 Bryan Wang, Gang Li, Yang Li

This paper investigates the feasibility of enabling versatile conversational interactions with mobile UIs using a single LLM.

SDFE-LV: A Large-Scale, Multi-Source, and Unconstrained Database for Spotting Dynamic Facial Expressions in Long Videos

no code implementations18 Sep 2022 Xiaolin Xu, Yuan Zong, Wenming Zheng, Yang Li, Chuangao Tang, Xingxun Jiang, Haolin Jiang

In this paper, we present a large-scale, multi-source, and unconstrained database called SDFE-LV for spotting the onset and offset frames of a complete dynamic facial expression from long videos, which is known as the topic of dynamic facial expression spotting (DFES) and a vital prior step for lots of facial expression analysis tasks.

Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe

2 code implementations12 Sep 2022 Hongyang Li, Chonghao Sima, Jifeng Dai, Wenhai Wang, Lewei Lu, Huijie Wang, Jia Zeng, Zhiqi Li, Jiazhi Yang, Hanming Deng, Hao Tian, Enze Xie, Jiangwei Xie, Li Chen, Tianyu Li, Yang Li, Yulu Gao, Xiaosong Jia, Si Liu, Jianping Shi, Dahua Lin, Yu Qiao

As sensor configurations get more complex, integrating multi-source information from different sensors and representing features in a unified view come of vital importance.

Autonomous Driving

Deep Learning-Based Automatic Diagnosis System for Developmental Dysplasia of the Hip

no code implementations7 Sep 2022 Yang Li, Leo Yan Li-Han, Hua Tian

To the best of our knowledge, this is the first study for objective DDH diagnosis by leveraging deep learning keypoint detection and integrating different anatomical measurements, which can provide reliable and explainable support for clinical decision-making.

Decision Making Keypoint Detection

Video-based Cross-modal Auxiliary Network for Multimodal Sentiment Analysis

1 code implementation30 Aug 2022 Rongfei Chen, Wenju Zhou, Yang Li, Huiyu Zhou

Multimodal sentiment analysis has a wide range of applications due to its information complementarity in multimodal interactions.

Classification Image Classification +1

Distance-Aware Occlusion Detection with Focused Attention

1 code implementation23 Aug 2022 Yang Li, Yucheng Tu, Xiaoxue Chen, Hao Zhao, Guyue Zhou

In this work, (1) we propose a novel three-decoder architecture as the infrastructure for focused attention; 2) we use the generalized intersection box prediction task to effectively guide our model to focus on occlusion-specific regions; 3) our model achieves a new state-of-the-art performance on distance-aware relationship detection.

Human-Object Interaction Detection Relationship Detection +1

Resilience assessment and improvement for electric power transmission systems against typhoon disasters: A data-model hybrid driven approach

no code implementations19 Aug 2022 Rui Yang, Yang Li

In response to the damage to electric power transmission systems caused by typhoon disasters in coastal areas, a planning-targeted resilience assessment framework that considers the impact of multiple factors is established to accurately find the weak links of the transmission system and improve the system resilience.

GraphTTA: Test Time Adaptation on Graph Neural Networks

no code implementations19 Aug 2022 Guanzi Chen, Jiying Zhang, Xi Xiao, Yang Li

In this paper, we present a novel test time adaptation strategy named Graph Adversarial Pseudo Group Contrast (GAPGC), for graph neural networks TTA, to better adapt to the Out Of Distribution (OOD) test data.

Contrastive Learning Test-time Adaptation

Human Decision Makings on Curriculum Reinforcement Learning with Difficulty Adjustment

no code implementations4 Aug 2022 Yilei Zeng, Jiali Duan, Yang Li, Emilio Ferrara, Lerrel Pinto, C. -C. Jay Kuo, Stefanos Nikolaidis

In this work, we guide the curriculum reinforcement learning results towards a preferred performance level that is neither too hard nor too easy via learning from the human decision process.

reinforcement-learning Reinforcement Learning (RL)

A Real-time Edge-AI System for Reef Surveys

no code implementations1 Aug 2022 Yang Li, Jiajun Liu, Brano Kusy, Ross Marchant, Brendan Do, Torsten Merz, Joey Crosswell, Andy Steven, Lachlan Tychsen-Smith, David Ahmedt-Aristizabal, Jeremy Oorloff, Peyman Moghadam, Russ Babcock, Megha Malpani, Ard Oerlemans

Crown-of-Thorn Starfish (COTS) outbreaks are a major cause of coral loss on the Great Barrier Reef (GBR) and substantial surveillance and control programs are ongoing to manage COTS populations to ecologically sustainable levels.

Computational Efficiency object-detection +1

A Universal PINNs Method for Solving Partial Differential Equations with a Point Source

1 code implementation Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence 2022 Xiang Huang, Hongsheng Liu, Beiji Shi, Zidong Wang, Kang Yang, Yang Li, Min Wang, Haotian Chu, Jing Zhou, Fan Yu, Bei Hua, Bin Dong, Lei Chen

In recent years, deep learning technology has been used to solve partial differential equations (PDEs), among which the physics-informed neural networks (PINNs)method emerges to be a promising method for solving both forward and inverse PDE problems.

BrainCog: A Spiking Neural Network based Brain-inspired Cognitive Intelligence Engine for Brain-inspired AI and Brain Simulation

no code implementations18 Jul 2022 Yi Zeng, Dongcheng Zhao, Feifei Zhao, Guobin Shen, Yiting Dong, Enmeng Lu, Qian Zhang, Yinqian Sun, Qian Liang, Yuxuan Zhao, Zhuoya Zhao, Hongjian Fang, Yuwei Wang, Yang Li, Xin Liu, Chengcheng Du, Qingqun Kong, Zizhe Ruan, Weida Bi

These brain-inspired AI models have been effectively validated on various supervised, unsupervised, and reinforcement learning tasks, and they can be used to enable AI models to be with multiple brain-inspired cognitive functions.

Decision Making

Transferability-Guided Cross-Domain Cross-Task Transfer Learning

no code implementations12 Jul 2022 Yang Tan, Enming Zhang, Yang Li, Shao-Lun Huang, Xiao-Ping Zhang

We propose two novel transferability metrics F-OTCE (Fast Optimal Transport based Conditional Entropy) and JC-OTCE (Joint Correspondence OTCE) to evaluate how much the source model (task) can benefit the learning of the target task and to learn more transferable representations for cross-domain cross-task transfer learning.

Transfer Learning

Generalizing to Unseen Domains with Wasserstein Distributional Robustness under Limited Source Knowledge

no code implementations11 Jul 2022 Jingge Wang, Liyan Xie, Yao Xie, Shao-Lun Huang, Yang Li

Domain generalization aims at learning a universal model that performs well on unseen target domains, incorporating knowledge from multiple source domains.

Domain Generalization Rotated MNIST +1

An Unsupervised STDP-based Spiking Neural Network Inspired By Biologically Plausible Learning Rules and Connections

no code implementations6 Jul 2022 Yiting Dong, Dongcheng Zhao, Yang Li, Yi Zeng

By integrating the above three adaptive mechanisms and STB-STDP, our model greatly accelerates the training of unsupervised spiking neural networks and improves the performance of unsupervised SNNs on complex tasks.

Spike Calibration: Fast and Accurate Conversion of Spiking Neural Network for Object Detection and Segmentation

no code implementations6 Jul 2022 Yang Li, Xiang He, Yiting Dong, Qingqun Kong, Yi Zeng

Spiking neural network (SNN) has been attached to great importance due to the properties of high biological plausibility and low energy consumption on neuromorphic hardware.

Bayesian Optimization object-detection +1

Auto-Encoder-Extreme Learning Machine Model for Boiler NOx Emission Concentration Prediction

no code implementations29 Jun 2022 Zhenhao Tang, Shikui Wang, Xiangying Chai, Shengxian Cao, Tinghui Ouyang, Yang Li

An automatic encoder (AE) extreme learning machine (ELM)-AE-ELM model is proposed to predict the NOx emission concentration based on the combination of mutual information algorithm (MI), AE, and ELM.

Warped Convolutional Networks: Bridge Homography to sl(3) algebra by Group Convolution

no code implementations23 Jun 2022 Xinrui Zhan, Yang Li, Wenyu Liu, Jianke Zhu

In this paper, we propose Warped Convolution Networks (WCN) to effectively learn and represent the homography by SL(3) group and sl(3) algebra with group convolution.

Homography Estimation Object Tracking

Efficient End-to-End AutoML via Scalable Search Space Decomposition

1 code implementation19 Jun 2022 Yang Li, Yu Shen, Wentao Zhang, Ce Zhang, Bin Cui

End-to-end AutoML has attracted intensive interests from both academia and industry which automatically searches for ML pipelines in a space induced by feature engineering, algorithm/model selection, and hyper-parameter tuning.

AutoML Feature Engineering +1

NAFS: A Simple yet Tough-to-beat Baseline for Graph Representation Learning

2 code implementations17 Jun 2022 Wentao Zhang, Zeang Sheng, Mingyu Yang, Yang Li, Yu Shen, Zhi Yang, Bin Cui

First, GNNs can learn higher-order structural information by stacking more layers but can not deal with large depth due to the over-smoothing issue.

Graph Representation Learning Link Prediction +1

Level 2 Autonomous Driving on a Single Device: Diving into the Devils of Openpilot

no code implementations16 Jun 2022 Li Chen, Tutian Tang, Zhitian Cai, Yang Li, Penghao Wu, Hongyang Li, Jianping Shi, Junchi Yan, Yu Qiao

Equipped with a wide span of sensors, predominant autonomous driving solutions are becoming more modular-oriented for safe system design.

Autonomous Driving

Transfer Learning based Search Space Design for Hyperparameter Tuning

no code implementations6 Jun 2022 Yang Li, Yu Shen, Huaijun Jiang, Tianyi Bai, Wentao Zhang, Ce Zhang, Bin Cui

The extensive experiments show that our approach considerably boosts BO by designing a promising and compact search space instead of using the entire space, and outperforms the state-of-the-arts on a wide range of benchmarks, including machine learning and deep learning tuning tasks, and neural architecture search.

Bayesian Optimization BIG-bench Machine Learning +2

TransBO: Hyperparameter Optimization via Two-Phase Transfer Learning

no code implementations6 Jun 2022 Yang Li, Yu Shen, Huaijun Jiang, Wentao Zhang, Zhi Yang, Ce Zhang, Bin Cui

With the extensive applications of machine learning models, automatic hyperparameter optimization (HPO) has become increasingly important.

Hyperparameter Optimization Neural Architecture Search +2

Cannot find the paper you are looking for? You can Submit a new open access paper.