Search Results for author: Yuqing Yang

Found 42 papers, 21 papers with code

Uncertain Local-to-Global Networks for Document-Level Event Factuality Identification

1 code implementation • EMNLP 2021 • Pengfei Cao, Yubo Chen, Yuqing Yang, Kang Liu, Jun Zhao

Moreover, we propose an Uncertain Information Aggregation module to leverage the global structure for integrating the local information.

Sentence

Position Engineering: Boosting Large Language Models through Positional Information Manipulation

no code implementations • 17 Apr 2024 • Zhiyuan He, Huiqiang Jiang, Zilong Wang, Yuqing Yang, Luna Qiu, Lili Qiu

Position engineering thus represents a promising new strategy for exploiting the capabilities of large language models.

In-Context Learning Position +2

LLM-ABR: Designing Adaptive Bitrate Algorithms via Large Language Models

no code implementations • 2 Apr 2024 • Zhiyuan He, Aashish Gottipati, Lili Qiu, Francis Y. Yan, Xufang Luo, Kenuo Xu, Yuqing Yang

We present LLM-ABR, the first system that utilizes the generative capabilities of large language models (LLMs) to autonomously design adaptive bitrate (ABR) algorithms tailored for diverse network characteristics.

reinforcement-learning

Automated Contrastive Learning Strategy Search for Time Series

no code implementations • 19 Mar 2024 • Baoyu Jing, Yansen Wang, Guoxin Sui, Jing Hong, Jingrui He, Yuqing Yang, Dongsheng Li, Kan Ren

In recent years, Contrastive Learning (CL) has become a predominant representation learning paradigm for time series.

AutoML Contrastive Learning +3

LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression

1 code implementation • 19 Mar 2024 • Zhuoshi Pan, Qianhui Wu, Huiqiang Jiang, Menglin Xia, Xufang Luo, Jue Zhang, Qingwei Lin, Victor Rühle, Yuqing Yang, Chin-Yew Lin, H. Vicky Zhao, Lili Qiu, Dongmei Zhang

The challenge is that information entropy may be a suboptimal compression metric: (i) it only leverages unidirectional context and may fail to capture all essential information needed for prompt compression; (ii) it is not aligned with the prompt compression objective.

GSM8K Language Modelling +3

3,795

Benchmarking Data Science Agents

1 code implementation • 27 Feb 2024 • Yuge Zhang, Qiyang Jiang, Xingyu Han, Nan Chen, Yuqing Yang, Kan Ren

In this paper, we introduce DSEval, a novel evaluation paradigm, as well as a series of innovative benchmarks tailored for assessing the performance of these agents throughout the entire data science lifecycle.

Benchmarking Decision Making

On the Out-Of-Distribution Generalization of Multimodal Large Language Models

no code implementations • 9 Feb 2024 • Xingxuan Zhang, Jiansheng Li, Wenjing Chu, Junjia Hai, Renzhe Xu, Yuqing Yang, Shikai Guan, Jiazheng Xu, Peng Cui

We investigate the generalization boundaries of current Multimodal Large Language Models (MLLMs) via comprehensive evaluation under out-of-distribution scenarios and domain-specific tasks.

In-Context Learning Out-of-Distribution Generalization +1

Alignment for Honesty

1 code implementation • 12 Dec 2023 • Yuqing Yang, Ethan Chern, Xipeng Qiu, Graham Neubig, Pengfei Liu

Recent research has made significant strides in applying alignment techniques to enhance the helpfulness and harmlessness of large language models (LLMs) in accordance with human intentions.

CoLLiE: Collaborative Training of Large Language Models in an Efficient Way

1 code implementation • 1 Dec 2023 • Kai Lv, Shuo Zhang, Tianle Gu, Shuhao Xing, Jiawei Hong, Keyu Chen, Xiaoran Liu, Yuqing Yang, Honglin Guo, Tengxiao Liu, Yu Sun, Qipeng Guo, Hang Yan, Xipeng Qiu

This paper introduces CoLLiE, an efficient library that facilitates collaborative training of large language models using 3D parallelism, parameter-efficient fine-tuning (PEFT) methods, and optimizers such as Lion, Adan, Sophia, LOMO and AdaLomo.

381

Unified Medical Image Pre-training in Language-Guided Common Semantic Space

no code implementations • 24 Nov 2023 • Xiaoxuan He, Yifan Yang, Xinyang Jiang, Xufang Luo, Haoji Hu, Siyun Zhao, Dongsheng Li, Yuqing Yang, Lili Qiu

To overcome the aforementioned challenges, we propose a Unified Medical Image Pre-training framework, namely UniMedI, which utilizes diagnostic reports as a common semantic space to create unified representations for diverse modalities of medical images (especially for 2D and 3D images).

Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts

1 code implementation • 23 Oct 2023 • Tengxiao Liu, Qipeng Guo, Yuqing Yang, Xiangkun Hu, Yue Zhang, Xipeng Qiu, Zheng Zhang

As large language models (LLMs) have shown effectiveness with different prompting methods, such as Chain of Thought and Program of Thought, we find that these methods complement each other well on math reasoning tasks.

Logical Reasoning Math

LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression

1 code implementation • 10 Oct 2023 • Huiqiang Jiang, Qianhui Wu, Xufang Luo, Dongsheng Li, Chin-Yew Lin, Yuqing Yang, Lili Qiu

Inspired by these findings, we propose LongLLMLingua for prompt compression towards improving LLMs' perception of the key information to simultaneously address the three challenges.

Code Completion Few-Shot Learning

3,795

LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models

1 code implementation • 9 Oct 2023 • Huiqiang Jiang, Qianhui Wu, Chin-Yew Lin, Yuqing Yang, Lili Qiu

Large language models (LLMs) have been applied in various applications due to their astonishing capabilities.

GSM8K In-Context Learning

3,795

Enabling Real-time Neural Recovery for Cloud Gaming on Mobile Devices

no code implementations • 15 Jul 2023 • Zhaoyuan He, Yifan Yang, Shuozhe Li, Diyuan Dai, Lili Qiu, Yuqing Yang

Our approach is extensively evaluated using iPhone 12 and laptop implementations, and we demonstrate the utility of game states in the game video recovery and the effectiveness of our overall design.

Constraint-aware and Ranking-distilled Token Pruning for Efficient Transformer Inference

1 code implementation • 26 Jun 2023 • Junyan Li, Li Lyna Zhang, Jiahang Xu, Yujing Wang, Shaoguang Yan, Yunqing Xia, Yuqing Yang, Ting Cao, Hao Sun, Weiwei Deng, Qi Zhang, Mao Yang

Deploying pre-trained transformer models like BERT on downstream tasks in resource-constrained scenarios is challenging due to their high inference cost, which grows rapidly with input sequence length.

Model Compression

Full Parameter Fine-tuning for Large Language Models with Limited Resources

1 code implementation • 16 Jun 2023 • Kai Lv, Yuqing Yang, Tengxiao Liu, Qinghui Gao, Qipeng Guo, Xipeng Qiu

Large Language Models (LLMs) have revolutionized Natural Language Processing (NLP) but demand massive GPU resources for training.

925

End-to-End Word-Level Pronunciation Assessment with MASK Pre-training

no code implementations • 5 Jun 2023 • Yukang Liang, Kaitao Song, Shaoguang Mao, Huiqiang Jiang, Luna Qiu, Yuqing Yang, Dongsheng Li, Linli Xu, Lili Qiu

Pronunciation assessment is a major challenge in the computer-aided pronunciation training system, especially at the word (phoneme)-level.

Accurate and Structured Pruning for Efficient Automatic Speech Recognition

no code implementations • 31 May 2023 • Huiqiang Jiang, Li Lyna Zhang, Yuang Li, Yu Wu, Shijie Cao, Ting Cao, Yuqing Yang, Jinyu Li, Mao Yang, Lili Qiu

In this paper, we propose a novel compression strategy that leverages structured pruning and knowledge distillation to reduce the model size and inference cost of the Conformer model while preserving high recognition performance.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

An AMR-based Link Prediction Approach for Document-level Event Argument Extraction

1 code implementation • 30 May 2023 • Yuqing Yang, Qipeng Guo, Xiangkun Hu, Yue Zhang, Xipeng Qiu, Zheng Zhang

Motivated by the fact that all event structures can be inferred from AMR, this work reformulates EAE as a link prediction problem on AMR graphs.

Event Argument Extraction Link Prediction +1

EfficientViT: Memory Efficient Vision Transformer with Cascaded Group Attention

3 code implementations • CVPR 2023 • Xinyu Liu, Houwen Peng, Ningxin Zheng, Yuqing Yang, Han Hu, Yixuan Yuan

Comprehensive experiments demonstrate EfficientViT outperforms existing efficient models, striking a good trade-off between speed and accuracy.

29,735

Learned Focused Plenoptic Image Compression with Microimage Preprocessing and Global Attention

1 code implementation • 30 Apr 2023 • Kedeng Tong, Xin Jin, Yuqing Yang, Chen Wang, Jinshi Kang, Fan Jiang

Also, it achieves 18.73% bitrate saving and generates perceptually pleasant reconstructions compared to the state-of-the-art end-to-end image compression methods, which benefits the applications of focused plenoptic cameras greatly.

Image Compression

MLCopilot: Unleashing the Power of Large Language Models in Solving Machine Learning Tasks

1 code implementation • 28 Apr 2023 • Lei Zhang, Yuge Zhang, Kan Ren, Dongsheng Li, Yuqing Yang

In contrast, though human engineers have the incredible ability to understand tasks and reason about solutions, their experience and knowledge are often sparse and difficult to utilize by quantitative approaches.

AutoML

ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices

1 code implementation • ICCV 2023 • Chen Tang, Li Lyna Zhang, Huiqiang Jiang, Jiahang Xu, Ting Cao, Quanlu Zhang, Yuqing Yang, Zhi Wang, Mao Yang

However, prior supernet training methods that rely on uniform sampling suffer from the gradient conflict issue: the sampled subnets can have vastly different model sizes (e.g., 50M vs. 2G FLOPs), leading to different optimization directions and inferior performance.

Neural Architecture Search

SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference

1 code implementation • ICCV 2023 • Li Lyna Zhang, Xudong Wang, Jiahang Xu, Quanlu Zhang, Yujing Wang, Yuqing Yang, Ningxin Zheng, Ting Cao, Mao Yang

The combination of Neural Architecture Search (NAS) and quantization has proven successful in automatically designing low-FLOPs INT8 quantized neural networks (QNN).

Neural Architecture Search Quantization

Online Streaming Video Super-Resolution with Convolutional Look-Up Table

no code implementations • 1 Mar 2023 • Guanghao Yin, Zefan Qu, Xinyang Jiang, Shan Jiang, Zhenhua Han, Ningxin Zheng, Xiaohong Liu, Huan Yang, Yuqing Yang, Dongsheng Li, Lili Qiu

To facilitate the research on this problem, a new benchmark dataset named LDV-WebRTC is constructed based on a real-world online streaming system.

Video Super-Resolution

Unsupervised Video Anomaly Detection for Stereotypical Behaviours in Autism

no code implementations • 27 Feb 2023 • Jiaqi Gao, Xinyang Jiang, Yuqing Yang, Dongsheng Li, Lili Qiu

Correspondingly, we propose a Dual Stream deep model for Stereotypical Behaviours Detection, DS-SBD, based on the temporal trajectory of human poses and the repetition patterns of human actions.

Activity Recognition Anomaly Detection +1

Towards Inference Efficient Deep Ensemble Learning

no code implementations • 29 Jan 2023 • Ziyue Li, Kan Ren, Yifan Yang, Xinyang Jiang, Yuqing Yang, Dongsheng Li

Ensemble methods can deliver surprising performance gains but also bring significantly higher computational costs, e.g., up to 2048X in large-scale ensemble tasks.

Ensemble Learning

PIT: Optimization of Dynamic Sparse Deep Learning Models via Permutation Invariant Transformation

no code implementations • 26 Jan 2023 • Ningxin Zheng, Huiqiang Jiang, Quanlu Zhang, Zhenhua Han, Yuqing Yang, Lingxiao Ma, Fan Yang, Chengruidong Zhang, Lili Qiu, Mao Yang, Lidong Zhou

Dynamic sparsity, where the sparsity patterns are unknown until runtime, poses a significant challenge to deep learning.

Attentive Mask CLIP

1 code implementation • ICCV 2023 • Yifan Yang, Weiquan Huang, Yixuan Wei, Houwen Peng, Xinyang Jiang, Huiqiang Jiang, Fangyun Wei, Yin Wang, Han Hu, Lili Qiu, Yuqing Yang

To address this issue, we propose an attentive token removal approach for CLIP training, which retains tokens with a high semantic correlation to the text description.

Contrastive Learning Retrieval +1

DORE: Document Ordered Relation Extraction based on Generative Framework

1 code implementation • 28 Oct 2022 • Qipeng Guo, Yuqing Yang, Hang Yan, Xipeng Qiu, Zheng Zhang

In this paper, we investigate the root cause of the underwhelming performance of the existing generative DocRE models and discover that the culprit is the inadequacy of the training paradigm, instead of the capacities of the models.

Document-level Relation Extraction Relation

Reinforcement Learning with Automated Auxiliary Loss Search

no code implementations • 12 Oct 2022 • Tairan He, Yuge Zhang, Kan Ren, Minghuan Liu, Che Wang, Weinan Zhang, Yuqing Yang, Dongsheng Li

A good state representation is crucial to solving complicated reinforcement learning (RL) challenges.

reinforcement-learning Reinforcement Learning (RL)

Improving Hypernasality Estimation with Automatic Speech Recognition in Cleft Palate Speech

no code implementations • 10 Aug 2022 • Kaitao Song, Teng Wan, Bixia Wang, Huiqiang Jiang, Luna Qiu, Jiahang Xu, Liping Jiang, Qun Lou, Yuqing Yang, Dongsheng Li, Xudong Wang, Lili Qiu

Specifically, we first pre-train an encoder-decoder framework with an automatic speech recognition (ASR) objective using a speech-to-text dataset, and then fine-tune the ASR encoder on the cleft palate dataset for hypernasality estimation.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Online Video Super-Resolution with Convolutional Kernel Bypass Graft

no code implementations • 4 Aug 2022 • Jun Xiao, Xinyang Jiang, Ningxin Zheng, Huan Yang, Yifan Yang, Yuqing Yang, Dongsheng Li, Kin-Man Lam

Then, our proposed CKBG method enhances this lightweight base model by bypassing the original network with "kernel grafts", which are extra convolutional kernels containing the prior knowledge of external pretrained image SR models.

Transfer Learning Video Super-Resolution

Privacy-preserving Online AutoML for Domain-Specific Face Detection

no code implementations • CVPR 2022 • Chenqian Yan, Yuge Zhang, Quanlu Zhang, Yaming Yang, Xinyang Jiang, Yuqing Yang, Baoyuan Wang

Thanks to HyperFD, each local task (client) is able to effectively leverage the learning "experience" of previous tasks without uploading raw images to the platform; meanwhile, the meta-feature extractor is continuously learned to better trade off the bias and variance.

AutoML Face Detection +1

Game of Privacy: Towards Better Federated Platform Collaboration under Privacy Restriction

no code implementations • 10 Feb 2022 • Chuhan Wu, Fangzhao Wu, Tao Qi, Yanlin Wang, Yuqing Yang, Yongfeng Huang, Xing Xie

To solve the game, we propose a platform negotiation method that simulates the bargaining among platforms and locally optimizes their policies via gradient descent.

Vertical Federated Learning

Towards Generating Real-World Time Series Data

1 code implementation • 16 Nov 2021 • Hengzhi Pei, Kan Ren, Yuqing Yang, Chang Liu, Tao Qin, Dongsheng Li

In this paper, we propose a novel generative framework for RTS data, RTSGAN, to tackle the aforementioned challenges.

Generative Adversarial Network Time Series +1

AARL: Automated Auxiliary Loss for Reinforcement Learning

no code implementations • 29 Sep 2021 • Tairan He, Yuge Zhang, Kan Ren, Che Wang, Weinan Zhang, Dongsheng Li, Yuqing Yang

A good state representation is crucial to reinforcement learning (RL) while an ideal representation is hard to learn only with signals from the RL objective.

reinforcement-learning Reinforcement Learning (RL)

Variational oracle guiding for reinforcement learning

no code implementations • ICLR 2022 • Dongqi Han, Tadashi Kozuno, Xufang Luo, Zhao-Yun Chen, Kenji Doya, Yuqing Yang, Dongsheng Li

How to make intelligent decisions is a central problem in machine learning and cognitive science.

Decision Making Offline RL +2

Full-Cycle Energy Consumption Benchmark for Low-Carbon Computer Vision

no code implementations • 30 Aug 2021 • Bo Li, Xinyang Jiang, Donglin Bai, Yuge Zhang, Ningxin Zheng, Xuanyi Dong, Lu Liu, Yuqing Yang, Dongsheng Li

The energy consumption of deep learning models is increasing at a breathtaking rate, which raises concerns due to potential negative effects on carbon neutrality in the context of global warming and climate change.

Model Compression

Learning to Rank Ace Neural Architectures via Normalized Discounted Cumulative Gain

1 code implementation • 6 Aug 2021 • Yuge Zhang, Quanlu Zhang, Li Lyna Zhang, Yaming Yang, Chenqian Yan, Xiaotian Gao, Yuqing Yang

One of the key challenges in Neural Architecture Search (NAS) is to efficiently rank the performances of architectures.

Learning-To-Rank Neural Architecture Search

SPOC learner's final grade prediction based on a novel sampling batch normalization embedded neural network method

no code implementations • 15 Dec 2020 • Zhuonan Liang, Ziheng Liu, Huaze Shi, Yunlong Chen, Yanbin Cai, Yating Liang, Yafan Feng, Yuqing Yang, Jing Zhang, Peng Fu

To solve this problem, a sampling batch normalization embedded deep neural network (SBNEDNN) method is developed in this paper.

Fast Hardware-Aware Neural Architecture Search

1 code implementation • 25 Oct 2019 • Li Lyna Zhang, Yuqing Yang, Yuhang Jiang, Wenwu Zhu, Yunxin Liu

Unlike previous approaches that apply search algorithms on a small, human-designed search space without considering hardware diversity, we propose HURRICANE that explores the automatic hardware-aware search over a much larger search space and a two-stage search algorithm, to efficiently generate tailored models for different types of hardware.

Hardware Aware Neural Architecture Search Neural Architecture Search
