Search Results for author: Sen Su

Found 18 papers, 5 papers with code

Treasures Outside Contexts: Improving Event Detection via Global Statistics

1 code implementation EMNLP 2021 Rui Li, Wenlin Zhao, Cheng Yang, Sen Su

Event detection (ED) aims at identifying event instances of specified types in given texts, which has been formalized as a sequence labeling task.

Event Detection

DemonAgent: Dynamically Encrypted Multi-Backdoor Implantation Attack on LLM-based Agent

1 code implementation18 Feb 2025 Pengyu Zhu, Zhenhong Zhou, Yuanhe Zhang, Shilinlu Yan, Kun Wang, Sen Su

As LLM-based agents become increasingly prevalent, backdoors can be implanted into agents through user queries or environment feedback, raising critical concerns regarding safety vulnerabilities.

Crabs: Consuming Resource via Auto-generation for LLM-DoS Attack under Black-box Settings

1 code implementation18 Dec 2024 Yuanhe Zhang, Zhenhong Zhou, Wei zhang, Xinyue Wang, Xiaojun Jia, Yang Liu, Sen Su

Large Language Models (LLMs) have demonstrated remarkable performance across diverse tasks yet still are vulnerable to external threats, particularly LLM Denial-of-Service (LLM-DoS) attacks.

Smaller Language Models Are Better Instruction Evolvers

1 code implementation15 Dec 2024 Tingfeng Hui, Lulu Zhao, Guanting Dong, Yaqi Zhang, Hua Zhou, Sen Su

In this study, we question this prevalent assumption and conduct an in-depth exploration into the potential of smaller language models (SLMs) in the context of instruction evolution.

Upcycling Instruction Tuning from Dense to Mixture-of-Experts via Parameter Merging

no code implementations2 Oct 2024 Tingfeng Hui, Zhenyu Zhang, Shuohuan Wang, Yu Sun, Hua Wu, Sen Su

To ensure that each specialized expert in the MoE model works as expected, we select a small amount of seed data that each expert excels to pre-optimize the router.

Diversity

Alignment-Enhanced Decoding:Defending via Token-Level Adaptive Refining of Probability Distributions

1 code implementation14 Aug 2024 Quan Liu, Zhenhong Zhou, Longzhu He, Yi Liu, Wei zhang, Sen Su

Large language models are susceptible to jailbreak attacks, which can result in the generation of harmful content.

Safety Alignment

Speak Out of Turn: Safety Vulnerability of Large Language Models in Multi-turn Dialogue

no code implementations27 Feb 2024 Zhenhong Zhou, Jiuyang Xiang, Haopeng Chen, Quan Liu, Zherui Li, Sen Su

Large Language Models (LLMs) have been demonstrated to generate illegal or unethical responses, particularly when subjected to "jailbreak."

Marginal Debiased Network for Fair Visual Recognition

no code implementations4 Jan 2024 Mei Wang, Weihong Deng, Jiani Hu, Sen Su

Deep neural networks (DNNs) are often prone to learn the spurious correlations between target classes and bias attributes, like gender and race, inherent in a major portion of training data (bias-aligned samples), thus showing unfair behavior and arising controversy in the modern pluralistic and egalitarian society.

Fairness Meta-Learning

BitCoin: Bidirectional Tagging and Supervised Contrastive Learning based Joint Relational Triple Extraction Framework

no code implementations21 Sep 2023 Luyao He, Zhongbao Zhang, Sen Su, Yuxin Chen

To address these issues, we propose BitCoin, an innovative Bidirectional tagging and supervised Contrastive learning based joint relational triple extraction framework.

Contrastive Learning graph construction +5

A Self-supervised Mixed-curvature Graph Neural Network

no code implementations10 Dec 2021 Li Sun, Zhongbao Zhang, Junda Ye, Hao Peng, Jiawei Zhang, Sen Su, Philip S. Yu

Instead of working on one single constant-curvature space, we construct a mixed-curvature space via the Cartesian product of multiple Riemannian component spaces and design hierarchical attention mechanisms for learning and fusing the representations across these component spaces.

Contrastive Learning Graph Neural Network +1

Dense Contrastive Visual-Linguistic Pretraining

no code implementations24 Sep 2021 Lei Shi, Kai Shuang, Shijie Geng, Peng Gao, Zuohui Fu, Gerard de Melo, Yunpeng Chen, Sen Su

To overcome these issues, we propose unbiased Dense Contrastive Visual-Linguistic Pretraining (DCVLP), which replaces the region regression and classification with cross-modality region contrastive learning that requires no annotations.

Contrastive Learning Data Augmentation +2

Hyperbolic Variational Graph Neural Network for Modeling Dynamic Graphs

no code implementations6 Apr 2021 Li Sun, Zhongbao Zhang, Jiawei Zhang, Feiyang Wang, Hao Peng, Sen Su, Philip S. Yu

To model the uncertainty, we devise a hyperbolic graph variational autoencoder built upon the proposed TGNN to generate stochastic node representations of hyperbolic normal distributions.

Graph Neural Network

Contrastive Visual-Linguistic Pretraining

no code implementations26 Jul 2020 Lei Shi, Kai Shuang, Shijie Geng, Peng Su, Zhengkai Jiang, Peng Gao, Zuohui Fu, Gerard de Melo, Sen Su

We evaluate CVLP on several down-stream tasks, including VQA, GQA and NLVR2 to validate the superiority of contrastive learning on multi-modality representation learning.

Contrastive Learning regression +2

Multi-Layer Content Interaction Through Quaternion Product For Visual Question Answering

no code implementations3 Jan 2020 Lei Shi, Shijie Geng, Kai Shuang, Chiori Hori, Songxiang Liu, Peng Gao, Sen Su

To solve the issue for the intermediate layers, we propose an efficient Quaternion Block Network (QBN) to learn interaction not only for the last layer but also for all intermediate layers simultaneously.

Question Answering Video Description +1

Adaptive Noise Injection: A Structure-Expanding Regularization for RNN

no code implementations25 Jul 2019 Rui Li, Kai Shuang, Mengyu Gu, Sen Su

Due to the adaptive noises can be improved as the training processes, its negative effects can be weakened and even transformed into a positive effect to further improve the expressiveness of the main-branch RNN.

Language Modeling Language Modelling

Cannot find the paper you are looking for? You can Submit a new open access paper.