Search Results for author: Yu Yu

Found 17 papers, 6 papers with code

Can Data Diversity Enhance Learning Generalization?

no code implementations COLING 2022 Yu Yu, Shahram Khadivi, Jia Xu

This paper introduces our Diversity Advanced Actor-Critic reinforcement learning (A2C) framework (DAAC) to improve the generalization and accuracy of Natural Language Processing (NLP).

Diversity Domain Adaptation +8

Measuring Robustness for NLP

no code implementations COLING 2022 Yu Yu, Abdul Rafae Khan, Jia Xu

The quality of Natural Language Processing (NLP) models is typically measured by the accuracy or error rate of a predefined test set.

Machine Translation Sentiment Analysis

Nimbus: Secure and Efficient Two-Party Inference for Transformers

1 code implementation24 Nov 2024 Zhengyi Li, Kang Yang, Jin Tan, Wen-jie Lu, Haoqi Wu, Xiao Wang, Yu Yu, Derun Zhao, Yancheng Zheng, Minyi Guo, Jingwen Leng

For the linear layer, we propose a new 2PC paradigm along with an encoding approach to securely compute matrix multiplications based on an outer-product insight, which achieves $2. 9\times \sim 12. 5\times$ performance improvements compared to the state-of-the-art (SOTA) protocol.

Learning Code Preference via Synthetic Evolution

1 code implementation4 Oct 2024 Jiawei Liu, Thanh Nguyen, Mingyue Shang, Hantian Ding, Xiaopeng Li, Yu Yu, Varun Kumar, Zijian Wang

and (ii) How do human and LLM preferences align with verifiable code properties and developer code tastes?

Code Generation

Xinyu: An Efficient LLM-based System for Commentary Generation

no code implementations21 Aug 2024 Yiquan Wu, Bo Tang, Chenyang Xi, Yu Yu, Pengyu Wang, Yifei Liu, Kun Kuang, Haiying Deng, Zhiyu Li, Feiyu Xiong, Jie Hu, Peng Cheng, Zhonghao Wang, Yi Wang, Yi Luo, MingChuan Yang

To address the advanced requirements, we present an argument ranking model for arguments and establish a comprehensive evidence database that includes up-to-date events and classic books, thereby strengthening the substantiation of the evidence with retrieval augmented generation (RAG) technology.

RAG Text Generation

$\text{Memory}^3$: Language Modeling with Explicit Memory

no code implementations1 Jul 2024 Hongkang Yang, Zehao Lin, Wenjin Wang, Hao Wu, Zhiyu Li, Bo Tang, Wenqiang Wei, Jinbo Wang, Zeyun Tang, Shichao Song, Chenyang Xi, Yu Yu, Kai Chen, Feiyu Xiong, Linpeng Tang, Weinan E

The model is named $\text{Memory}^3$, since explicit memory is the third form of memory in LLMs after implicit memory (model parameters) and working memory (context key-values).

Language Modelling RAG +1

FastMem: Fast Memorization of Prompt Improves Context Awareness of Large Language Models

1 code implementation23 Jun 2024 Junyi Zhu, Shuochen Liu, Yu Yu, Bo Tang, Yibo Yan, Zhiyu Li, Feiyu Xiong, Tong Xu, Matthew B. Blaschko

Large language models (LLMs) excel in generating coherent text, but they often struggle with context awareness, leading to inaccuracies in tasks requiring faithful adherence to provided information.

Memorization Reading Comprehension +1

Knowledge Graph Pruning for Recommendation

no code implementations19 May 2024 Fake Lin, Xi Zhu, Ziwei Zhao, Deqiang Huang, Yu Yu, Xueying Li, Zhi Zheng, Tong Xu, Enhong Chen

Recent years have witnessed the prosperity of knowledge graph based recommendation system (KGRS), which enriches the representation of users, items, and entities by structural knowledge with striking improvement.

Graph Neural Network

Investigating Training Strategies and Model Robustness of Low-Rank Adaptation for Language Modeling in Speech Recognition

no code implementations19 Jan 2024 Yu Yu, Chao-Han Huck Yang, Tuan Dinh, Sungho Ryu, Jari Kolehmainen, Roger Ren, Denis Filimonov, Prashanth G. Shivakumar, Ankur Gandhe, Ariya Rastow, Jia Xu, Ivan Bulyko, Andreas Stolcke

The use of low-rank adaptation (LoRA) with frozen pretrained language models (PLMs) has become increasing popular as a mainstream, resource-efficient modeling approach for memory-constrained hardware.

Language Modelling speech-recognition +1

HuRef: HUman-REadable Fingerprint for Large Language Models

1 code implementation8 Dec 2023 Boyi Zeng, Lizheng Wang, Yuncong Hu, Yi Xu, Chenghu Zhou, Xinbing Wang, Yu Yu, Zhouhan Lin

In this study, we introduce HuRef, a human-readable fingerprint for LLMs that uniquely identifies the base model without interfering with training or exposing model parameters to the public.

Type-Aware Decomposed Framework for Few-Shot Named Entity Recognition

2 code implementations13 Feb 2023 Yongqi Li, Yu Yu, Tieyun Qian

Despite the recent success achieved by several two-stage prototypical networks in few-shot named entity recognition (NER) task, the overdetected false spans at the span detection stage and the inaccurate and unstable prototypes at the type classification stage remain to be challenging problems.

Contrastive Learning Few-shot NER +3

Using AntiPatterns to avoid MLOps Mistakes

no code implementations30 Jun 2021 Nikhil Muralidhar, Sathappah Muthiah, Patrick Butler, Manish Jain, Yu Yu, Katy Burne, Weipeng Li, David Jones, Prakash Arunachalam, Hays 'Skip' McCormick, Naren Ramakrishnan

We describe lessons learned from developing and deploying machine learning models at scale across the enterprise in a range of financial analytics applications.

The Weak Lensing Peak Statistics in the Mocks by the inverse-Gaussianization Method

1 code implementation29 Jan 2020 Zhao Chen, Yu Yu, Xiangkun Liu, Zuhui Fan

We apply the inverse-Gaussianization method proposed in \citealt{arXiv:1607. 05007} to fast produce weak lensing convergence maps and investigate the peak statistics, including the peak height counts and peak steepness counts, in these mocks.

Cosmology and Nongalactic Astrophysics

Unsupervised Representation Learning for Gaze Estimation

no code implementations CVPR 2020 Yu Yu, Jean-Marc Odobez

Although automatic gaze estimation is very important to a large variety of application areas, it is difficult to train accurate and robust gaze models, in great part due to the difficulty in collecting large and diverse data (annotating 3D gaze is expensive and existing datasets use different setups).

Gaze Estimation gaze redirection +2

Improving Few-Shot User-Specific Gaze Adaptation via Gaze Redirection Synthesis

no code implementations CVPR 2019 Yu Yu, Gang Liu, Jean-Marc Odobez

In this work, we address the problem of person-specific gaze model adaptation from only a few reference training samples.

Domain Adaptation Gaze Estimation +1

A Differential Approach for Gaze Estimation

no code implementations20 Apr 2019 Gang Liu, Yu Yu, Kenneth A. Funes Mora, Jean-Marc Odobez

Non-invasive gaze estimation methods usually regress gaze directions directly from a single face or eye image.

Gaze Estimation

Cannot find the paper you are looking for? You can Submit a new open access paper.