Search Results for author: Yi Lu

Found 50 papers, 13 papers with code

VisualLens: Personalization through Visual History

no code implementations25 Nov 2024 Wang Bill Zhu, Deqing Fu, Kai Sun, Yi Lu, Zhaojiang Lin, Seungwhan Moon, Kanika Narang, Mustafa Canim, Yue Liu, Anuj Kumar, Xin Luna Dong

We hypothesize that a user's visual history with images reflecting their daily life, offers valuable insights into their interests and preferences, and can be leveraged for personalization.

Diversity Recommendation Systems

APDDv2: Aesthetics of Paintings and Drawings Dataset with Artist Labeled Scores and Comments

1 code implementation13 Nov 2024 Xin Jin, Qianqian Qiao, Yi Lu, Huaye Wang, Heng Huang, Shan Gao, Jianfei Liu, Rui Li

Datasets play a pivotal role in training visual models, facilitating the development of abstract understandings of visual features through diverse image samples and multidimensional attributes.

A Controlled Study on Long Context Extension and Generalization in LLMs

1 code implementation18 Sep 2024 Yi Lu, Jing Nathan Yan, Songlin Yang, Justin T. Chiu, Siyu Ren, Fei Yuan, Wenting Zhao, Zhiyong Wu, Alexander M. Rush

Broad textual understanding and in-context learning require language models that utilize full document contexts.

In-Context Learning

ASRRL-TTS: Agile Speaker Representation Reinforcement Learning for Text-to-Speech Speaker Adaptation

no code implementations7 Jul 2024 Ruibo Fu, Xin Qi, Zhengqi Wen, JianHua Tao, Tao Wang, Chunyu Qiang, Zhiyong Wang, Yi Lu, Xiaopeng Wang, Shuchen Shi, Yukun Liu, Xuefei Liu, Shuai Zhang

The results indicate that the ASRRL method significantly outperforms traditional fine-tuning approaches, achieving higher speaker similarity and better overall speech quality with limited reference speeches.

Sentence Text to Speech

Zero-Shot Long-Form Video Understanding through Screenplay

no code implementations25 Jun 2024 Yongliang Wu, Bozheng Li, Jiawang Cao, Wenbo Zhu, Yi Lu, Weiheng Chi, Chuyun Xie, Haolin Zheng, Ziyue Su, Jay Wu, Xu Yang

The Long-form Video Question-Answering task requires the comprehension and analysis of extended video content to respond accurately to questions by utilizing both temporal and contextual information.

Question Answering Video Question Answering +1

A multi-speaker multi-lingual voice cloning system based on vits2 for limmits 2024 challenge

no code implementations22 Jun 2024 Xiaopeng Wang, Yi Lu, Xin Qi, Zhiyong Wang, Yuankun Xie, Shuchen Shi, Ruibo Fu

The objective of the challenge is to establish a multi-speaker, multi-lingual Indic Text-to-Speech system with voice cloning capabilities, covering seven Indian languages with both male and female speakers.

Speech Synthesis Text to Speech +1

MINT: a Multi-modal Image and Narrative Text Dubbing Dataset for Foley Audio Content Planning and Generation

1 code implementation15 Jun 2024 Ruibo Fu, Shuchen Shi, Hongming Guo, Tao Wang, Chunyu Qiang, Zhengqi Wen, JianHua Tao, Xin Qi, Yi Lu, Xiaopeng Wang, Zhiyong Wang, Yukun Liu, Xuefei Liu, Shuai Zhang, Guanjun Li

Despite advancements in AIGC technologies for text and image generation, the foley audio dubbing remains rudimentary due to difficulties in cross-modal scene matching and content correlation.

AudioCaps Image Generation

Codecfake: An Initial Dataset for Detecting LLM-based Deepfake Audio

no code implementations12 Jun 2024 Yi Lu, Yuankun Xie, Ruibo Fu, Zhengqi Wen, JianHua Tao, Zhiyong Wang, Xin Qi, Xuefei Liu, Yongwei Li, Yukun Liu, Xiaopeng Wang, Shuchen Shi

To effectively detect LLM-based deepfake audio, we focus on the core of the generation process, the conversion from neural codec to waveform.

Audio Deepfake Detection Audio Generation +4

Identifying Causal Effects under Kink Setting: Theory and Evidence

no code implementations14 Apr 2024 Yi Lu, Jianguo Wang, Huihua Xie

This paper develops a generalized framework for identifying causal impacts in a reduced-form manner under kinked settings when agents can manipulate their choices around the threshold.

Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models

1 code implementation1 Apr 2024 wei he, Shichun Liu, Jun Zhao, Yiwen Ding, Yi Lu, Zhiheng Xi, Tao Gui, Qi Zhang, Xuanjing Huang

The generated demos strategically interpolate between existing demos and the given query, transforming the query from OOD to ID.

In-Context Learning Math

LongAgent: Scaling Language Models to 128k Context through Multi-Agent Collaboration

1 code implementation18 Feb 2024 Jun Zhao, Can Zu, Hao Xu, Yi Lu, wei he, Yiwen Ding, Tao Gui, Qi Zhang, Xuanjing Huang

Large language models (LLMs) have demonstrated impressive performance in understanding language and executing complex reasoning tasks.

Multi-hop Question Answering Question Answering +1

LongHeads: Multi-Head Attention is Secretly a Long Context Processor

1 code implementation16 Feb 2024 Yi Lu, Xin Zhou, wei he, Jun Zhao, Tao Ji, Tao Gui, Qi Zhang, Xuanjing Huang

Instead of allowing each head to attend to the full sentence, which struggles with generalizing to longer sequences due to out-of-distribution (OOD) issues, we allow each head to process in-distribution length by selecting and attending to important context chunks.

Sentence

Interpreting and Improving Attention From the Perspective of Large Kernel Convolution

no code implementations11 Jan 2024 Chenghao Li, Chaoning Zhang, Boheng Zeng, Yi Lu, Pengbo Shi, Qingzi Chen, Jirui Liu, Lingyun Zhu, Yang Yang, Heng Tao Shen

These findings highlight the effectiveness of LKCA in bridging local and global feature modeling, offering a practical and robust solution for real-world applications with limited data and resources.

Image Classification

Making Harmful Behaviors Unlearnable for Large Language Models

no code implementations2 Nov 2023 Xin Zhou, Yi Lu, Ruotian Ma, Tao Gui, Qi Zhang, Xuanjing Huang

Specifically, we introduce ``security vectors'', a few new parameters that can be separated from the LLM, to ensure LLM's responses are consistent with the harmful behavior.

Improved Knowledge Distillation for Pre-trained Language Models via Knowledge Selection

no code implementations1 Feb 2023 Chenglong Wang, Yi Lu, Yongyu Mu, Yimin Hu, Tong Xiao, Jingbo Zhu

Knowledge distillation addresses the problem of transferring knowledge from a teacher model to a student model.

Knowledge Distillation

Joint RIS Calibration and Multi-User Positioning

no code implementations8 Dec 2022 Yi Lu, Hui Chen, Jukka Talvitie, Henk Wymeersch, Mikko Valkama

Reconfigurable intelligent surfaces (RISs) are expected to be a key component enabling the mobile network evolution towards a flexible and intelligent 6G wireless platform.

Nonparametric Decoding for Generative Retrieval

1 code implementation5 Oct 2022 Hyunji Lee, Jaeyoung Kim, Hoyeon Chang, Hanseok Oh, Sohee Yang, Vlad Karpukhin, Yi Lu, Minjoon Seo

The generative retrieval model depends solely on the information encoded in its model parameters without external memory, its information capacity is limited and fixed.

Decoder Language Modelling +1

The Short-term Impact of Congestion Taxes on Ridesourcing Demand and Traffic Congestion: Evidence from Chicago

no code implementations5 Jul 2022 Yuan Liang, Bingjie Yu, Xiaojian Zhang, Yi Lu, Linchuan Yang

To this end, this study applies difference-in-differences (i. e., a regression-based causal inference approach) to empirically evaluate the effects of the congestion tax policy on ridesourcing demand and traffic congestion in Chicago.

Causal Inference regression

DePS: An improved deep learning model for de novo peptide sequencing

no code implementations16 Mar 2022 Cheng Ge, Yi Lu, Jia Qu, Liangxu Xie, Feng Wang, Hong Zhang, Ren Kong, Shan Chang

De novo peptide sequencing from mass spectrometry data is an important method for protein identification.

de novo peptide sequencing

C+1 Loss: Learn to Classify C Classes of Interest and the Background Class Differentially

no code implementations29 Sep 2021 Changhuai Chen, Xile Shen, Mengyu Ye, Yi Lu, Jun Che, ShiLiang Pu

We figure out that the background class should be treated differently from the classes of interest during training.

Classification Human Parsing +3

Joint Positioning and Tracking via NR Sidelink in 5G-Empowered Industrial IoT: Releasing the Potential of V2X Technology

no code implementations15 Jan 2021 Yi Lu, Mike Koivisto, Jukka Talvitie, Elizaveta Rastorgueva-Foi, Toni Levanen, Elena Simona Lohan, Mikko Valkama

The fifth generation (5G) mobile networks with enhanced connectivity and positioning capabilities play an increasingly important role in the development of automated vehicle-to-everything (V2X) and other advanced industrial Internet of Things (IoT) systems.

On the Transferability of Minimal Prediction Preserving Inputs in Question Answering

no code implementations NAACL 2021 Shayne Longpre, Yi Lu, Christopher DuBois

In the context of question answering, we investigate competing hypotheses for the existence of MPPIs, including poor posterior calibration of neural models, lack of pretraining, and "dataset bias" (where a model learns to attend to spurious, non-generalizable cues in the training data).

Adversarial Robustness Question Answering

Prob2Vec: Mathematical Semantic Embedding for Problem Retrieval in Adaptive Tutoring

no code implementations ICLR 2019 Du Su, Ali Yekkehkhany, Yi Lu, Wenmiao Lu

We propose a hierarchical problem embedding algorithm, called Prob2Vec, that consists of abstraction and embedding steps.

Retrieval Sentence +2

Artificial Intelligence Distinguishes COVID-19 from Community Acquired Pneumonia on Chest CT

1 code implementation Radiology 2020 Lin Li, Lixin Qin, Zeguo Xu, Youbing Yin, Xin Wang, Bin Kong, Junjie Bai, Yi Lu, Zhenghan Fang, Qi Song, Kunlin Cao, Daliang Liu, Guisheng Wang, Qizhong Xu, Xisheng Fang, Shiqin Zhang, Juan Xia, Jun Xia

Materials and Methods In this retrospective and multi-center study, a deep learning model, COVID-19 detection neural network (COVNet), was developed to extract visual features from volumetric chest CT exams for the detection of COVID-19.

COVID-19 Image Segmentation Specificity

Graph-FCN for image semantic segmentation

no code implementations2 Jan 2020 Yi Lu, Yaran Chen, Dongbin Zhao, Jianxin Chen

Then we apply graph convolutional network to solve this graph node classification problem.

Deep Learning General Classification +3

An Exploration of Data Augmentation and Sampling Techniques for Domain-Agnostic Question Answering

no code implementations WS 2019 Shayne Longpre, Yi Lu, Zhucheng Tu, Chris DuBois

To produce a domain-agnostic question answering model for the Machine Reading Question Answering (MRQA) 2019 Shared Task, we investigate the relative benefits of large pre-trained language models, various data sampling strategies, as well as query and context paraphrases generated by back-translation.

Data Augmentation Question Answering +2

DeepCenterline: a Multi-task Fully Convolutional Network for Centerline Extraction

no code implementations25 Mar 2019 Zhihui Guo, Junjie Bai, Yi Lu, Xin Wang, Kunlin Cao, Qi Song, Milan Sonka, Youbing Yin

The proposed method generates well-positioned centerlines, exhibiting lower number of missing branches and is more robust in the presence of minor imperfections of the object segmentation mask.

Object Semantic Segmentation

Attention-driven Tree-structured Convolutional LSTM for High Dimensional Data Understanding

no code implementations29 Jan 2019 Bin Kong, Xin Wang, Junjie Bai, Yi Lu, Feng Gao, Kunlin Cao, Qi Song, Shaoting Zhang, Siwei Lyu, Youbing Yin

In order to address these limitations, we present tree-structured ConvLSTM models for tree-structured image analysis tasks which can be trained end-to-end.

Vocal Bursts Intensity Prediction

Residual Attention based Network for Hand Bone Age Assessment

no code implementations21 Dec 2018 Eric Wu, Bin Kong, Xin Wang, Junjie Bai, Yi Lu, Feng Gao, Shaoting Zhang, Kunlin Cao, Qi Song, Siwei Lyu, Youbing Yin

The hierarchical attention components of the residual attention subnet force our network to focus on the key components of the X-ray images and generate the final predictions as well as the associated visual supports, which is similar to the assessment procedure of clinicians.

Hand Segmentation

Cannot find the paper you are looking for? You can Submit a new open access paper.