Search Results for author: Sihao Hu

Found 23 papers, 18 papers with code

Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable

1 code implementation1 Mar 2025 Tiansheng Huang, Sihao Hu, Fatih Ilhan, Selim Furkan Tekin, Zachary Yahn, Yichang Xu, Ling Liu

While safety alignment has been extensively studied for LLM, there is still a large research gap for Large Reasoning Models (LRMs) that equip with improved reasoning capability.

Language Modeling Language Modelling +2

Multi-Agent Reinforcement Learning with Focal Diversity Optimization

1 code implementation6 Feb 2025 Selim Furkan Tekin, Fatih Ilhan, Tiansheng Huang, Sihao Hu, Zachary Yahn, Ling Liu

First, we develop an agent-fusion framework for encouraging multiple LLM based agents to collaborate in producing the final inference output for each LLM query.

Diversity Multi-agent Reinforcement Learning +3

Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation

1 code implementation29 Jan 2025 Tiansheng Huang, Sihao Hu, Fatih Ilhan, Selim Furkan Tekin, Ling Liu

By designing a new red-teaming method, we in this paper show that purely relying on the moderation guardrail for data filtration is not reliable.

Red Teaming Safety Alignment

$H^3$Fusion: Helpful, Harmless, Honest Fusion of Aligned LLMs

1 code implementation26 Nov 2024 Selim Furkan Tekin, Fatih Ilhan, Tiansheng Huang, Sihao Hu, Zachary Yahn, Ling Liu

The former penalizes the selection errors of the expert-router, and the latter mediates the expert weights drifting during fine-tuning and dynamically adjusts the fusion behavior of the resulting model by canalizing the activations on the experts.

Mixture-of-Experts

Collaborative Contrastive Network for Click-Through Rate Prediction

no code implementations18 Nov 2024 Chen Gao, Zixin Zhao, Sihao Hu, Lv Shao, Tong Liu

E-commerce platforms provide entrances for customers to enter mini-apps to meet their specific shopping needs.

Click-Through Rate Prediction Contrastive Learning +1

LLM-TOPLA: Efficient LLM Ensemble by Maximising Diversity

1 code implementation4 Oct 2024 Selim Furkan Tekin, Fatih Ilhan, Tiansheng Huang, Sihao Hu, Ling Liu

This paper presents LLM-TOPLA, a diversity-optimized LLM ensemble method with three unique properties: (i) We introduce the focal diversity metric to capture the diversity-performance correlation among component LLMs of an ensemble.

Diversity Ensemble Pruning +2

Harmful Fine-tuning Attacks and Defenses for Large Language Models: A Survey

2 code implementations26 Sep 2024 Tiansheng Huang, Sihao Hu, Fatih Ilhan, Selim Furkan Tekin, Ling Liu

To clear up concern, this paper provide a comprehensive overview to three aspects of harmful fine-tuning: attacks setting, defense design and evaluation methodology.

Safety Alignment

Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturbation

1 code implementation3 Sep 2024 Tiansheng Huang, Sihao Hu, Fatih Ilhan, Selim Furkan Tekin, Ling Liu

For the first time in the literature, we in this paper show that \textit{harmful perturbation} over the model weights should be the root cause of alignment-broken of harmful fine-tuning.

Joint-Motion Mutual Learning for Pose Estimation in Videos

no code implementations5 Aug 2024 Sifan Wu, Haipeng Chen, Yifang Yin, Sihao Hu, Runyang Feng, Yingying Jiao, Ziqi Yang, Zhenguang Liu

Given that local joint feature and global motion flow are complementary, we further propose a progressive joint-motion mutual learning that synergistically exchanges information and interactively learns between joint feature and motion flow to improve the capability of the model.

Pose Estimation

Personalized Privacy Protection Mask Against Unauthorized Facial Recognition

1 code implementation19 Jul 2024 Ka-Ho Chow, Sihao Hu, Tiansheng Huang, Ling Liu

Second, we incorporate a perceptibility optimization to preserve the visual quality of the protected facial images.

Diversity Ensemble Learning +1

Lisa: Lazy Safety Alignment for Large Language Models against Harmful Fine-tuning Attack

1 code implementation28 May 2024 Tiansheng Huang, Sihao Hu, Fatih Ilhan, Selim Furkan Tekin, Ling Liu

Recent studies show that Large Language Models (LLMs) with safety alignment can be jail-broken by fine-tuning on a dataset mixed with harmful data.

Safety Alignment

Robust Few-Shot Ensemble Learning with Focal Diversity-Based Pruning

no code implementations5 Apr 2024 Selim Furkan Tekin, Fatih Ilhan, Tiansheng Huang, Sihao Hu, Ka-Ho Chow, Margaret L. Loper, Ling Liu

This paper presents FusionShot, a focal diversity optimized few-shot ensemble learning approach for boosting the robustness and generalization performance of pre-trained few-shot models.

Diversity Ensemble Learning +2

Vaccine: Perturbation-aware Alignment for Large Language Models against Harmful Fine-tuning Attack

1 code implementation2 Feb 2024 Tiansheng Huang, Sihao Hu, Ling Liu

The new paradigm of finetuning-as-a-service introduces a new attack surface for Large Language Models (LLMs): a few harmful data uploaded by users can easily trick the finetuning to produce an alignment-broken model.

Language Modelling Large Language Model

PokeLLMon: A Human-Parity Agent for Pokemon Battles with Large Language Models

1 code implementation2 Feb 2024 Sihao Hu, Tiansheng Huang, Ling Liu

We introduce PokeLLMon, the first LLM-embodied agent that achieves human-parity performance in tactical battle games, as demonstrated in Pokemon battles.

Action Generation Decision Making +1

Resource-Efficient Transformer Pruning for Finetuning of Large Models

1 code implementation CVPR 2024 Fatih Ilhan, Gong Su, Selim Furkan Tekin, Tiansheng Huang, Sihao Hu, Ling Liu

With the recent advances in vision transformers and large language models (LLMs) finetuning costly large models on downstream learning tasks poses significant challenges under limited computational resources.

Natural Language Understanding

Large Language Model-Powered Smart Contract Vulnerability Detection: New Perspectives

1 code implementation2 Oct 2023 Sihao Hu, Tiansheng Huang, Fatih İlhan, Selim Furkan Tekin, Ling Liu

The goal of auditor is to yield a broad spectrum of vulnerabilities with the hope of encompassing the correct answer, whereas the goal of critic that evaluates the validity of identified vulnerabilities is to minimize the number of false positives.

Language Modeling Language Modelling +2

BERT4ETH: A Pre-trained Transformer for Ethereum Fraud Detection

1 code implementation29 Mar 2023 Sihao Hu, Zhen Zhang, Bingqiao Luo, Shengliang Lu, Bingsheng He, Ling Liu

As various forms of fraud proliferate on Ethereum, it is imperative to safeguard against these malicious activities to protect susceptible users from being victimized.

Fraud Detection

Adaptive Deep Neural Network Inference Optimization with EENet

1 code implementation15 Jan 2023 Fatih Ilhan, Ka-Ho Chow, Sihao Hu, Tiansheng Huang, Selim Tekin, Wenqi Wei, Yanzhao Wu, Myungjin Lee, Ramana Kompella, Hugo Latapie, Gaowen Liu, Ling Liu

Instead of having every sample go through all DNN layers during prediction, EENet learns an early exit scheduler, which can intelligently terminate the inference earlier for certain predictions, which the model has high confidence of early exit.

Inference Optimization Scheduling +1

Deep Intention-Aware Network for Click-Through Rate Prediction

no code implementations16 Nov 2022 Yaxian Xia, Yi Cao, Sihao Hu, Tong Liu, Lingling Lu

We identify that the key to TIRA is to extract customers' personalized entering intention and weigh the impact of triggers based on this intention.

Click-Through Rate Prediction Prediction

Sequence-Based Target Coin Prediction for Cryptocurrency Pump-and-Dump

1 code implementation21 Apr 2022 Sihao Hu, Zhen Zhang, Shengliang Lu, Bingsheng He, Zhao Li

With the proliferation of pump-and-dump schemes (P&Ds) in the cryptocurrency market, it becomes imperative to detect such fraudulent activities in advance to alert potentially susceptible investors.

GIFT: Graph-guIded Feature Transfer for Cold-Start Video Click-Through Rate Prediction

1 code implementation21 Feb 2022 Sihao Hu, Yi Cao, Yu Gong, Zhao Li, Yazheng Yang, Qingwen Liu, Shouling Ji

Specifically, we establish a heterogeneous graph that contains physical and semantic linkages to guide the feature transfer process from warmed-up video to cold-start videos.

Click-Through Rate Prediction

From Known to Unknown: Knowledge-guided Transformer for Time-Series Sales Forecasting in Alibaba

no code implementations17 Sep 2021 Xinyuan Qi, Kai Hou, Tong Liu, Zhongzhong Yu, Sihao Hu, Wenwu Ou

Except for introducing future knowledge for prediction, we propose Aliformer based on the bidirectional Transformer, which can utilize the historical information, current factor, and future knowledge to predict future sales.

Time Series Time Series Forecasting

Cannot find the paper you are looking for? You can Submit a new open access paper.