Search Results for author: Suyu Ge

Found 15 papers, 3 papers with code

MART: Improving LLM Safety with Multi-round Automatic Red-Teaming

no code implementations13 Nov 2023 Suyu Ge, Chunting Zhou, Rui Hou, Madian Khabsa, Yi-Chia Wang, Qifan Wang, Jiawei Han, Yuning Mao

Specifically, an adversarial LLM and a target LLM interplay with each other in an iterative manner, where the adversarial LLM aims to generate challenging prompts that elicit unsafe responses from the target LLM, while the target LLM is fine-tuned with safety aligned data on these adversarial prompts.

Instruction Following Response Generation

Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs

no code implementations3 Oct 2023 Suyu Ge, Yunan Zhang, Liyuan Liu, Minjia Zhang, Jiawei Han, Jianfeng Gao

In this study, we introduce adaptive KV cache compression, a plug-and-play method that reduces the memory footprint of generative inference for Large Language Models (LLMs).

Augmenting Zero-Shot Dense Retrievers with Plug-in Mixture-of-Memories

no code implementations7 Feb 2023 Suyu Ge, Chenyan Xiong, Corby Rosset, Arnold Overwijk, Jiawei Han, Paul Bennett

In this paper we improve the zero-shot generalization ability of language models via Mixture-Of-Memory Augmentation (MoMA), a mechanism that retrieves augmentation documents from multiple information corpora ("external memories"), with the option to "plug in" new memory at inference time.

Retrieval Zero-shot Generalization

Toward Understanding Bias Correlations for Mitigation in NLP

no code implementations24 May 2022 Lu Cheng, Suyu Ge, Huan Liu

In particular, we examine bias mitigation in two common NLP tasks -- toxicity detection and word embeddings -- on three social identities, i. e., race, gender, and religion.

Fairness Word Embeddings

Improving Cyberbully Detection with User Interaction

1 code implementation1 Nov 2020 Suyu Ge, Lu Cheng, Huan Liu

Cyberbullying, identified as intended and repeated online bullying behavior, has become increasingly prevalent in the past few decades.

Graph Enhanced Representation Learning for News Recommendation

no code implementations31 Mar 2020 Suyu Ge, Chuhan Wu, Fangzhao Wu, Tao Qi, Yongfeng Huang

Existing news recommendation methods achieve personalization by building accurate news representations from news content and user representations from their direct interactions with news (e. g., click), while ignoring the high-order relatedness between users and news.

Graph Attention News Recommendation +1

FedNER: Privacy-preserving Medical Named Entity Recognition with Federated Learning

no code implementations20 Mar 2020 Suyu Ge, Fangzhao Wu, Chuhan Wu, Tao Qi, Yongfeng Huang, Xing Xie

Since the labeled data in different platforms usually has some differences in entity type and annotation criteria, instead of constraining different platforms to share the same model, we decompose the medical NER model in each platform into a shared module and a private module.

Federated Learning Medical Named Entity Recognition +4

Reviews Meet Graphs: Enhancing User and Item Representations for Recommendation with Hierarchical Attentive Graph Neural Network

no code implementations IJCNLP 2019 Chuhan Wu, Fangzhao Wu, Tao Qi, Suyu Ge, Yongfeng Huang, Xing Xie

In the review content-view, we propose to use a hierarchical model to first learn sentence representations from words, then learn review representations from sentences, and finally learn user/item representations from reviews.

MULTI-VIEW LEARNING Representation Learning +1

Detecting and Extracting of Adverse Drug Reaction Mentioning Tweets with Multi-Head Self Attention

no code implementations WS 2019 Suyu Ge, Tao Qi, Chuhan Wu, Yongfeng Huang

This paper describes our system for the first and second shared tasks of the fourth Social Media Mining for Health Applications (SMM4H) workshop.

Language Modelling Task 2 +1

THU\_NGN at SemEval-2019 Task 3: Dialog Emotion Classification using Attentional LSTM-CNN

no code implementations SEMEVAL 2019 Suyu Ge, Tao Qi, Chuhan Wu, Yongfeng Huang

With the development of the Internet, dialog systems are widely used in online platforms to provide personalized services for their users.

Emotion Classification Emotion Recognition +2

THU\_NGN at SemEval-2019 Task 12: Toponym Detection and Disambiguation on Scientific Papers

no code implementations SEMEVAL 2019 Tao Qi, Suyu Ge, Chuhan Wu, Yubo Chen, Yongfeng Huang

First name: Tao Last name: Qi Email: taoqi. qt@gmail. com Affiliation: Department of Electronic Engineering, Tsinghua University First name: Suyu Last name: Ge Email: gesy17@mails. tsinghua. edu. cn Affiliation: Department of Electronic Engineering, Tsinghua University First name: Chuhan Last name: Wu Email: wuch15@mails. tsinghua. edu. cn Affiliation: Department of Electronic Engineering, Tsinghua University First name: Yubo Last name: Chen Email: chen-yb18@mails. tsinghua. edu. cn Affiliation: Department of Electronic Engineering, Tsinghua University First name: Yongfeng Last name: Huang Email: yfhuang@mail. tsinghua. edu. cn Affiliation: Department of Electronic Engineering, Tsinghua University Toponym resolution is an important and challenging task in the neural language processing field, and has wide applications such as emergency response and social media geographical event analysis.

POS Toponym Resolution +1

Cannot find the paper you are looking for? You can Submit a new open access paper.