Search Results for author: Yang Liu

Found 1112 papers, 387 papers with code

Think Before You Speak: Learning to Generate Implicit Knowledge for Response Generation by Self-Talk

no code implementations • EMNLP (NLP4ConvAI) 2021 • Pei Zhou, Behnam Hedayatnia, Karthik Gopalakrishnan, Seokhwan Kim, Jay Pujara, Xiang Ren, Yang Liu, Dilek Hakkani-Tur

We further investigate can such models identify when to generate implicit background knowledge and when it is not necessary.

Common Sense Reasoning Response Generation

Paper
Add Code

Interpolation between CNNs and ResNets

no code implementations • ICML 2020 • Zonghan Yang, Yang Liu, Chenglong Bao, Zuoqiang Shi

Although ordinary differential equations (ODEs) provide insights for designing networks architectures, its relationship with the non-residual convolutional neural networks (CNNs) is still unclear.

Adversarial Attack Image Classification

Paper
Add Code

Overcoming Catastrophic Forgetting During Domain Adaptation of Seq2seq Language Generation

no code implementations • NAACL 2022 • Dingcheng Li, Zheng Chen, Eunah Cho, Jie Hao, Xiaohu Liu, Fan Xing, Chenlei Guo, Yang Liu

Seq2seq language generation models that are trained offline with multiple domains in a sequential fashion often suffer from catastrophic forgetting.

Domain Adaptation Response Generation +1

Paper
Add Code

Enhancing Knowledge Selection for Grounded Dialogues via Document Semantic Graphs

no code implementations • NAACL 2022 • Sha Li, Mahdi Namazifar, Di Jin, Mohit Bansal, Heng Ji, Yang Liu, Dilek Hakkani-Tur

In this work, we propose to automatically convert the background knowledge documents into document semantic graphs and then perform knowledge selection over such graphs.

Multi-Task Learning Response Generation +1

Paper
Add Code

Leveraging Seq2seq Language Generation for Multi-level Product Issue Identification

no code implementations • ECNLP (ACL) 2022 • Yang Liu, Varnith Chordia, Hua Li, Siavash Fazeli Dehkordy, Yifei Sun, Vincent Gao, Na Zhang

To harness such information to better serve customers, in this paper, we created a machine learning approach to automatically identify product issues and uncover root causes from the customer feedback text.

Multi-Label Classification Text Generation +1

Paper
Add Code

基于词信息嵌入的汉语构词结构识别研究(Chinese Word-Formation Prediction based on Representations of Word-Related Features)

no code implementations • CCL 2021 • Hua Zheng, Yaqi Yan, Yue Wang, Damai Dai, Yang Liu

“作为一种意合型语言, 汉语中的构词结构刻画了构词成分之间的组合关系, 是认知、理解词义的关键。在中文信息处理领域, 此前的构词结构识别工作大多沿用句法层面的粗粒度标签, 且主要基于上下文等词间信息建模, 忽略了语素义、词义等词内信息对构词结构识别的作用。本文采用语言学视域下的构词结构标签体系, 构建汉语构词结构及相关信息数据集, 提出了一种基于Bi-LSTM和Self-attention的模型, 以此来探究词内、词间等多方面信息对构词结构识别的潜在影响和能达到的性能。实验取得了良好的预测效果, 准确率77. 87%, F1值78. 36%;同时, 对比测试揭示, 词内的语素义信息对构词结构识别具有显著的贡献, 而词间的上下文信息贡献较弱且带有较强的不稳定性。该预测方法与数据集, 将为中文信息处理的多种任务, 如语素和词结构分析、词义识别与生成、语言文字研究与词典编纂等提供新的观点和方案。”

Paper
Add Code

中美学者学术英语写作中词汇难度特征比较研究——以计算语言学领域论文为例(A Comparative Study of the Features of Lexical Sophistication in Academic English Writing by Chinese and American)

no code implementations • CCL 2021 • Yonghui Xie, Yang Liu, Erhong Yang, Liner Yang

“学术英语写作在国际学术交流中的作用日益凸显, 然而对于英语非母语者, 学术英语写作是困难的, 为此本文对计算语言领域中美学者学术英语写作中词汇难度特征做比较研究。自构建1132篇中美论文全文语料库, 统计语料中484个词汇难度特征值。经过特征筛选与因子分析的降维处理得到表现较好的五个维度。最后计算中美学者论文的维度分从而比较差异, 发现美国学者的论文相较中国学者的论文中词汇单位更具常用性、二元词串更具稳固性、三元词串更具稳固性、虚词更具复杂性、词类更具关联性。主要原因在于统计特征值时借助的外部资源库与美国学者的论文更贴近, 且中国学者没有完全掌握该领域学术写作的习惯。因此, 中国学者可充分利用英语本族语者构建的资源库, 从而产出更为地道与流利的学术英语论文。”

Paper
Add Code

Leveraging Word-Formation Knowledge for Chinese Word Sense Disambiguation

1 code implementation • Findings (EMNLP) 2021 • Hua Zheng, Lei LI, Damai Dai, Deli Chen, Tianyu Liu, Xu sun, Yang Liu

In this paper, we propose to leverage word-formation knowledge to enhance Chinese WSD.

Word Sense Disambiguation

Paper
Code

Diversity and Consistency: Exploring Visual Question-Answer Pair Generation

no code implementations • Findings (EMNLP) 2021 • Sen yang, Qingyu Zhou, Dawei Feng, Yang Liu, Chao Li, Yunbo Cao, Dongsheng Li

Moreover, this task can be used to improve visual question generation and visual question answering.

Question Answering Question Generation +3

Paper
Add Code

Segment, Mask, and Predict: Augmenting Chinese Word Segmentation with Self-Supervision

no code implementations • EMNLP 2021 • Mieradilijiang Maimaiti, Yang Liu, Yuanhang Zheng, Gang Chen, Kaiyu Huang, Ji Zhang, Huanbo Luan, Maosong Sun

Besides, the robustness of the previous neural methods is limited by the large-scale annotated data.

Chinese Word Segmentation Language Modelling +1

Paper
Add Code

Self-Supervised Quality Estimation for Machine Translation

no code implementations • EMNLP 2021 • Yuanhang Zheng, Zhixing Tan, Meng Zhang, Mieradilijiang Maimaiti, Huanbo Luan, Maosong Sun, Qun Liu, Yang Liu

Quality estimation (QE) of machine translation (MT) aims to evaluate the quality of machine-translated sentences without references and is important in practical applications of MT.

Machine Translation Sentence +1

Paper
Add Code

Effective Convolutional Attention Network for Multi-label Clinical Document Classification

no code implementations • EMNLP 2021 • Yang Liu, Hua Cheng, Russell Klopfer, Matthew R. Gormley, Thomas Schaaf

Multi-label document classification (MLDC) problems can be challenging, especially for long documents with a large label set and a long-tail distribution over labels.

Ranked #2 on Medical Code Prediction on MIMIC-III

Classification Document Classification +1

Paper
Add Code

Policy-Driven Neural Response Generation for Knowledge-Grounded Dialog Systems

no code implementations • INLG (ACL) 2020 • Behnam Hedayatnia, Karthik Gopalakrishnan, Seokhwan Kim, Yang Liu, Mihail Eric, Dilek Hakkani-Tur

Open-domain dialog systems aim to generate relevant, informative and engaging responses.

Open-Domain Dialog Response Generation +1

Paper
Add Code

Modeling Entity Knowledge for Fact Verification

no code implementations • EMNLP (FEVER) 2021 • Yang Liu, Chenguang Zhu, Michael Zeng

Fact verification is a challenging task of identifying the truthfulness of given claims based on the retrieval of relevant evidence texts.

Descriptive Fact Verification +1

Paper
Add Code

Amplifying Key Cues for Human-Object-Interaction Detection

no code implementations • ECCV 2020 • Yang Liu, Qingchao Chen, Andrew Zisserman

In this paper we introduce two methods to amplify key cues in the image, and also a method to combine these and other cues when considering the interaction between a human and an object.

Human-Object Interaction Detection Object

Paper
Add Code

Latent Topic-aware Multi-Label Classification

no code implementations • ECCV 2020 • Jianghong Ma, Yang Liu

In real-world applications, data are often associated with different labels.

Classification General Classification +2

Paper
Add Code

Learn from Relation Information: Towards Prototype Representation Rectification for Few-Shot Relation Extraction

1 code implementation • Findings (NAACL) 2022 • Yang Liu, Jinpeng Hu, Xiang Wan, Tsung-Hui Chang

Few-shot Relation Extraction refers to fast adaptation to novel relation classes with few samples through training on the known relation classes.

Contrastive Learning Domain Adaptation +3

Paper
Code

A Hybrid System for NLPTEA-2020 CGED Shared Task

no code implementations • AACL (NLP-TEA) 2020 • Meiyuan Fang, Kai Fu, JiPing Wang, Yang Liu, Jin Huang, Yitao Duan

As a result, among the six tracks in the shared task, our system performs well in the correction tracks: measured in F1 score, we rank first, with the highest precision, in the TOP3 correction track and third in the TOP1 correction track, also with the highest precision.

Paper
Add Code

Exploring Word Segmentation and Medical Concept Recognition for Chinese Medical Texts

1 code implementation • NAACL (BioNLP) 2021 • Yang Liu, Yuanhe Tian, Tsung-Hui Chang, Song Wu, Xiang Wan, Yan Song

Chinese word segmentation (CWS) and medical concept recognition are two fundamental tasks to process Chinese electronic medical records (EMRs) and play important roles in downstream tasks for understanding Chinese EMRs.

Chinese Word Segmentation Model Selection +1

Paper
Code

Knowledge Representation Learning with Contrastive Completion Coding

no code implementations • Findings (EMNLP) 2021 • Bo Ouyang, Wenbing Huang, Runfa Chen, Zhixing Tan, Yang Liu, Maosong Sun, Jihong Zhu

Knowledge representation learning (KRL) has been used in plenty of knowledge-driven tasks.

Knowledge Graphs Representation Learning

Paper
Add Code

Rethinking Data Augmentation in Text-to-text Paradigm

no code implementations • COLING 2022 • Yanan Chen, Yang Liu

As manually labelling data can be costly, some recent studies tend to augment the training data for improving the generalization power of machine learning models, known as data augmentation (DA).

Data Augmentation

Paper
Add Code

Overview of AMALGUM – Large Silver Quality Annotations across English Genres

no code implementations • SCiL 2021 • Luke Gessler, Siyao Peng, Yang Liu, YIlun Zhu, Shabnam Behzad, Amir Zeldes

Paper
Add Code

Automatically Detecting Reduced-formed English Pronunciations by Using Deep Learning

no code implementations • NAACL (BEA) 2022 • Lei Chen, Chenglin Jiang, Yiwei Gu, Yang Liu, Jiahong Yuan

Reduced form pronunciations are widely used by native English speakers, especially in casual conversations.

Paper
Add Code

Task-Driven and Experience-Based Question Answering Corpus for In-Home Robot Application in the House3D Virtual Environment

1 code implementation • LREC 2022 • Zhuoqun Xu, Liubo Ouyang, Yang Liu

At present, more and more work has begun to pay attention to the long-term housekeeping robot scene.

General Knowledge Question Answering

Paper
Code

THUMT: An Open-Source Toolkit for Neural Machine Translation

no code implementations • AMTA 2020 • Zhixing Tan, Jiacheng Zhang, Xuancheng Huang, Gang Chen, Shuo Wang, Maosong Sun, Huanbo Luan, Yang Liu

Machine Translation Translation

Paper
Add Code

Personalized Entity Resolution with Dynamic Heterogeneous KnowledgeGraph Representations

no code implementations • ACL (ECNLP) 2021 • Ying Lin, Han Wang, Jiangning Chen, Tong Wang, Yue Liu, Heng Ji, Yang Liu, Premkumar Natarajan

We first build a cross-source heterogeneous knowledge graph from customer purchase history and product knowledge graph to jointly learn customer and product embeddings.

Entity Resolution

Paper
Add Code

DialogSum Challenge: Summarizing Real-Life Scenario Dialogues

no code implementations • INLG (ACL) 2021 • Yulong Chen, Yang Liu, Yue Zhang

We propose a shared task on summarizing real-life scenario dialogues, DialogSum Challenge, to encourage researchers to address challenges in dialogue summarization, which has been less studied by the summarization community.

Common Sense Reasoning Representation Learning

Paper
Add Code

A General Black-box Adversarial Attack on Graph-based Fake News Detectors

no code implementations • 24 Apr 2024 • Peican Zhu, Zechen Pan, Yang Liu, Jiwei Tian, Keke Tang, Zhen Wang

Specifically, as sharing is an important social interaction for GNN-based fake news detectors to construct the graph, we simulate sharing behaviors to fool the detectors.

Paper
Add Code

MAS-SAM: Segment Any Marine Animal with Aggregated Features

no code implementations • 24 Apr 2024 • Tianyu Yan, Zifu Wan, Xinhao Deng, Pingping Zhang, Yang Liu, Huchuan Lu

In underwater scenes, it exhibits substantial performance degradation due to the light scattering and absorption.

Paper
Add Code

FL-TAC: Enhanced Fine-Tuning in Federated Learning via Low-Rank, Task-Specific Adapter Clustering

no code implementations • 23 Apr 2024 • Siqi Ping, Yuzhu Mao, Yang Liu, Xiao-Ping Zhang, Wenbo Ding

Although large-scale pre-trained models hold great potential for adapting to downstream tasks through fine-tuning, the performance of such fine-tuned models is often limited by the difficulty of collecting sufficient high-quality, task-specific data.

Clustering Federated Learning

Paper
Add Code

Ultrasound Nodule Segmentation Using Asymmetric Learning with Simple Clinical Annotation

no code implementations • 23 Apr 2024 • Xingyue Zhao, Zhongyu Li, Xiangde Luo, Peiqi Li, Peng Huang, Jianwei Zhu, Yang Liu, Jihua Zhu, Meng Yang, Shi Chang, Jun Dong

Especially, an asymmetric learning framework is developed by extending the aspect ratio annotations with two types of pseudo labels, i. e., conservative labels and radical labels, to train two asymmetric segmentation networks simultaneously.

Anatomy Morphological Analysis +1

Paper
Add Code

Authentic Emotion Mapping: Benchmarking Facial Expressions in Real News

no code implementations • 21 Apr 2024 • Qixuan Zhang, Zhifeng Wang, Yang Liu, Zhenyue Qin, Kaihao Zhang, Sabrina Caldwell, Tom Gedeon

In this paper, we present a novel benchmark for Emotion Recognition using facial landmarks extracted from realistic news videos.

Benchmarking Emotion Recognition

Paper
Add Code

MLP: Motion Label Prior for Temporal Sentence Localization in Untrimmed 3D Human Motions

1 code implementation • 21 Apr 2024 • Sheng Yan, Mengyuan Liu, Yong Wang, Yang Liu, Chen Chen, Hong Liu

In this paper, we address the unexplored question of temporal sentence localization in human motions (TSLM), aiming to locate a target moment from a 3D human motion that semantically corresponds to a text query.

Moment Retrieval Sentence

Paper
Code

Using a Local Surrogate Model to Interpret Temporal Shifts in Global Annual Data

no code implementations • 18 Apr 2024 • Shou Nakano, Yang Liu

This paper focuses on explaining changes over time in globally-sourced, annual temporal data, with the specific objective of identifying pivotal factors that contribute to these temporal shifts.

Feature Importance feature selection +2

Paper
Add Code

Glitch Tokens in Large Language Models: Categorization Taxonomy and Effective Detection

no code implementations • 15 Apr 2024 • Yuxi Li, Yi Liu, Gelei Deng, Ying Zhang, Wenjia Song, Ling Shi, Kailong Wang, Yuekang Li, Yang Liu, Haoyu Wang

We present categorizations of the identified glitch tokens and symptoms exhibited by LLMs when interacting with glitch tokens.

Paper
Add Code

HSIDMamba: Exploring Bidirectional State-Space Models for Hyperspectral Denoising

no code implementations • 15 Apr 2024 • Yang Liu, Jiahua Xiao, Yu Guo, Peilin Jiang, Haiwei Yang, Fei Wang

Effectively discerning spatial-spectral dependencies in HSI denoising is crucial, but prevailing methods using convolution or transformers still face computational efficiency limitations.

Computational Efficiency Denoising

Paper
Add Code

Transfer Learning via Latent Dependency Factor for Estimating PM 2.5

no code implementations • 10 Apr 2024 • Shrey Gupta, Yongbee Park, Jianzhao Bi, Suyash Gupta, Andreas Züfle, Avani Wildani, Yang Liu

We recognize this transfer problem as spatial transfer learning and propose a new feature named Latent Dependency Factor (LDF) that captures spatial and semantic dependencies of both domains and is subsequently added to the datasets.

Transfer Learning

Paper
Add Code

Open-Source AI-based SE Tools: Opportunities and Challenges of Collaborative Software Learning

no code implementations • 9 Apr 2024 • ZhiHao Lin, Wei Ma, Tao Lin, Yaowen Zheng, Jingquan Ge, Jun Wang, Jacques Klein, Tegawende Bissyande, Yang Liu, Li Li

We introduce a governance framework centered on federated learning (FL), designed to foster the joint development and maintenance of open-source AI code models while safeguarding data privacy and security.

Federated Learning

Paper
Add Code

Exploring the Potential of Large Foundation Models for Open-Vocabulary HOI Detection

2 code implementations • 9 Apr 2024 • Ting Lei, Shaofeng Yin, Yang Liu

In addition, these detectors primarily rely on category names and overlook the rich contextual information that language can provide, which is essential for capturing open vocabulary concepts that are typically rare and not well-represented by category names alone.

Human-Object Interaction Detection World Knowledge

152

Paper
Code

Fantastic Animals and Where to Find Them: Segment Any Marine Animal with Dual SAM

1 code implementation • 7 Apr 2024 • Pingping Zhang, Tianyu Yan, Yang Liu, Huchuan Lu

To this end, we first introduce a dual structure with SAM's paradigm to enhance feature learning of marine images.

Paper
Code

Equilibrium in Style: A Modeling Framework on the Cash Flow and the Life Cycle of a Consumer Store

no code implementations • 3 Apr 2024 • Shanyu Han, Jian Lei, Yang Liu

The consumer store is ubiquitous and plays an important role in our everyday lives.

Paper
Add Code

Peer-aided Repairer: Empowering Large Language Models to Repair Advanced Student Assignments

no code implementations • 2 Apr 2024 • Qianhui Zhao, Fang Liu, Li Zhang, Yang Liu, Zhen Yan, Zhenghao Chen, Yufei Zhou, Jing Jiang, Ge Li

Automated generation of feedback on programming assignments holds significant benefits for programming education, especially when it comes to advanced assignments.

Language Modelling Large Language Model +1

Paper
Add Code

TryOn-Adapter: Efficient Fine-Grained Clothing Identity Adaptation for High-Fidelity Virtual Try-On

1 code implementation • 1 Apr 2024 • Jiazheng Xing, Chao Xu, Yijie Qian, Yang Liu, Guang Dai, Baigui Sun, Yong liu, Jingdong Wang

However, the clothing identity uncontrollability and training inefficiency of existing diffusion-based methods, which struggle to maintain the identity even with full parameter training, are significant limitations that hinder the widespread applications.

Virtual Try-on

Paper
Code

CityGaussian: Real-time High-quality Large-Scale Scene Rendering with Gaussians

no code implementations • 1 Apr 2024 • Yang Liu, He Guan, Chuanchen Luo, Lue Fan, Junran Peng, Zhaoxiang Zhang

The advancement of real-time 3D scene reconstruction and novel view synthesis has been significantly propelled by 3D Gaussian Splatting (3DGS).

3D Scene Reconstruction Novel View Synthesis

Paper
Add Code

Position-Aware Parameter Efficient Fine-Tuning Approach for Reducing Positional Bias in LLMs

no code implementations • 1 Apr 2024 • Zheng Zhang, Fan Yang, Ziyan Jiang, Zheng Chen, Zhengyang Zhao, Chengyuan Ma, Liang Zhao, Yang Liu

Recent advances in large language models (LLMs) have enhanced their ability to process long input contexts.

Data Augmentation Position

Paper
Add Code

Exploring and Evaluating Hallucinations in LLM-Powered Code Generation

no code implementations • 1 Apr 2024 • Fang Liu, Yang Liu, Lin Shi, Houkun Huang, Ruifeng Wang, Zhen Yang, Li Zhang

The rise of Large Language Models (LLMs) has significantly advanced many applications on software engineering tasks, particularly in code generation.

Code Generation Hallucination +2

Paper
Add Code

De-confounded Data-free Knowledge Distillation for Handling Distribution Shifts

no code implementations • 28 Mar 2024 • Yuzheng Wang, Dingkang Yang, Zhaoyu Chen, Yang Liu, Siao Liu, Wenqiang Zhang, Lihua Zhang, Lizhe Qi

Data-Free Knowledge Distillation (DFKD) is a promising task to train high-performance small models to enhance actual deployment without relying on the original training data.

Causal Inference Data-free Knowledge Distillation

Paper
Add Code

Beyond Talking -- Generating Holistic 3D Human Dyadic Motion for Communication

no code implementations • 28 Mar 2024 • Mingze Sun, Chao Xu, Xinyu Jiang, Yang Liu, Baigui Sun, Ruqi Huang

Furthermore, we introduce the HoCo holistic communication dataset, which is a valuable resource for future research.

Paper
Add Code

CosalPure: Learning Concept from Group Images for Robust Co-Saliency Detection

no code implementations • 27 Mar 2024 • JiaYi Zhu, Qing Guo, Felix Juefei-Xu, Yihao Huang, Yang Liu, Geguang Pu

In this paper, we propose a novel robustness enhancement framework by first learning the concept of the co-salient objects based on the input group images and then leveraging this concept to purify adversarial perturbations, which are subsequently fed to CoSODs for robustness enhancement.

Adversarial Attack Co-Salient Object Detection +2

Paper
Add Code

FedFixer: Mitigating Heterogeneous Label Noise in Federated Learning

no code implementations • 25 Mar 2024 • Xinyuan Ji, Zhaowei Zhu, Wei Xi, Olga Gadyatskaya, Zilong Song, Yong Cai, Yang Liu

The high loss incurred by client-specific samples in heterogeneous label noise poses challenges for distinguishing between client-specific and noisy label samples, impacting the effectiveness of existing label noise learning approaches.

Federated Learning

Paper
Add Code

MatchSeg: Towards Better Segmentation via Reference Image Matching

1 code implementation • 23 Mar 2024 • Ruiqiang Xiao, Jiayu Huo, Haotian Zheng, Yang Liu, Sebastien Ourselin, Rachel Sparks

Few-shot learning aims to overcome the need for annotated data by using a small labeled dataset, known as a support set, to guide predicting labels for new, unlabeled images, known as the query set.

Domain Generalization Few-Shot Learning +5

Paper
Code

An Upload-Efficient Scheme for Transferring Knowledge From a Server-Side Pre-trained Generator to Clients in Heterogeneous Federated Learning

1 code implementation • 23 Mar 2024 • Jianqing Zhang, Yang Liu, Yang Hua, Jian Cao

Heterogeneous Federated Learning (HtFL) enables collaborative learning on multiple clients with different model architectures while preserving privacy.

Federated Learning Transfer Learning

Paper
Code

Reasoning-Enhanced Object-Centric Learning for Videos

no code implementations • 22 Mar 2024 • Jian Li, Pu Ren, Yang Liu, Hao Sun

Object-centric learning aims to break down complex visual scenes into more manageable object representations, enhancing the understanding and reasoning abilities of machine learning systems toward the physical world.

Object Object Tracking

Paper
Add Code

RetiGen: A Framework for Generalized Retinal Diagnosis Using Multi-View Fundus Images

no code implementations • 22 Mar 2024 • Ze Chen, Gongyu Zhang, Jiayu Huo, Joan Nunez do Rio, Charalampos Komninos, Yang Liu, Rachel Sparks, Sebastien Ourselin, Christos Bergeles, Timothy Jackson

This study introduces a novel framework for enhancing domain generalization in medical imaging, specifically focusing on utilizing unlabelled multi-view colour fundus photographs.

Domain Generalization Test-time Adaptation

Paper
Add Code

ReAct Meets ActRe: When Language Agents Enjoy Training Data Autonomy

no code implementations • 21 Mar 2024 • Zonghan Yang, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu

In WebShop, the 1-shot performance of the A$^3$T agent matches human average, and 4 rounds of iterative refinement lead to the performance approaching human experts.

Policy Gradient Methods

Paper
Add Code

BadEdit: Backdooring large language models by model editing

no code implementations • 20 Mar 2024 • Yanzhou Li, Tianlin Li, Kangjie Chen, Jian Zhang, Shangqing Liu, Wenhan Wang, Tianwei Zhang, Yang Liu

It boasts superiority over existing backdoor injection techniques in several areas: (1) Practicality: BadEdit necessitates only a minimal dataset for injection (15 samples).

Backdoor Attack knowledge editing

Paper
Add Code

Progressive trajectory matching for medical dataset distillation

no code implementations • 20 Mar 2024 • Zhen Yu, Yang Liu, Qingchao Chen

To solve these barriers, we propose to design a novel progressive trajectory matching strategy to improve the training stability for medical image dataset distillation.

Transfer Learning

Paper
Add Code

DDSB: An Unsupervised and Training-free Method for Phase Detection in Echocardiography

1 code implementation • 19 Mar 2024 • Zhenyu Bu, Yang Liu, Jiayu Huo, Jingjing Peng, Kaini Wang, Guangquan Zhou, Rachel Sparks, Prokar Dasgupta, Alejandro Granados, Sebastien Ourselin

Accurate identification of End-Diastolic (ED) and End-Systolic (ES) frames is key for cardiac function assessment through echocardiography.

Segmentation

Paper
Code

EMIE-MAP: Large-Scale Road Surface Reconstruction Based on Explicit Mesh and Implicit Encoding

no code implementations • 18 Mar 2024 • Wenhua Wu, Qi Wang, Guangming Wang, JunPing Wang, Tiankun Zhao, Yang Liu, Dongchao Gao, Zhe Liu, Hesheng Wang

To address this, we propose EMIE-MAP, a novel method for large-scale road surface reconstruction based on explicit mesh and implicit encoding.

Autonomous Driving Surface Reconstruction

Paper
Add Code

NEDS-SLAM: A Novel Neural Explicit Dense Semantic SLAM Framework using 3D Gaussian Splatting

no code implementations • 18 Mar 2024 • Yiming Ji, Yang Liu, Guanghu Xie, Boyu Ma, Zongwu Xie

We propose NEDS-SLAM, an Explicit Dense semantic SLAM system based on 3D Gaussian representation, that enables robust 3D semantic mapping, accurate camera tracking, and high-quality rendering in real-time.

Semantic SLAM

Paper
Add Code

Rethinking Low-quality Optical Flow in Unsupervised Surgical Instrument Segmentation

1 code implementation • 15 Mar 2024 • Peiran Wu, Yang Liu, Jiayu Huo, Gongyu Zhang, Christos Bergeles, Rachel Sparks, Prokar Dasgupta, Alejandro Granados, Sebastien Ourselin

Video-based surgical instrument segmentation plays an important role in robot-assisted surgeries.

Optical Flow Estimation Segmentation

Paper
Code

Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-Identification

1 code implementation • 15 Mar 2024 • Pingping Zhang, Yuhao Wang, Yang Liu, Zhengzheng Tu, Huchuan Lu

To address above issues, we propose a novel learning framework named \textbf{EDITOR} to select diverse tokens from vision Transformers for multi-modal object ReID.

Object

Paper
Code

Learning to Watermark LLM-generated Text via Reinforcement Learning

1 code implementation • 13 Mar 2024 • Xiaojun Xu, Yuanshun Yao, Yang Liu

While prior works focus on token-level watermark that embeds signals into the output, we design a model-level watermark that embeds signals into the LLM weights, and such signals can be detected by a paired detector.

reinforcement-learning

Paper
Code

Steering LLMs Towards Unbiased Responses: A Causality-Guided Debiasing Framework

no code implementations • 13 Mar 2024 • Jingling Li, Zeyu Tang, Xiaoyu Liu, Peter Spirtes, Kun Zhang, Liu Leqi, Yang Liu

Large language models (LLMs) can easily generate biased and discriminative responses.

Decision Making

Paper
Add Code

Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards

no code implementations • 12 Mar 2024 • Wei Shen, Xiaoying Zhang, Yuanshun Yao, Rui Zheng, Hongyi Guo, Yang Liu

Reinforcement learning from human feedback (RLHF) is the mainstream paradigm used to align large language models (LLMs) with human preferences.

reinforcement-learning

Paper
Add Code

StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models

2 code implementations • 12 Mar 2024 • Zhicheng Guo, Sijie Cheng, Hao Wang, Shihao Liang, Yujia Qin, Peng Li, Zhiyuan Liu, Maosong Sun, Yang Liu

The virtual API server contains a caching system and API simulators which are complementary to alleviate the change in API status.

Benchmarking

4,404

Paper
Code

ToolRerank: Adaptive and Hierarchy-Aware Reranking for Tool Retrieval

no code implementations • 11 Mar 2024 • Yuanhang Zheng, Peng Li, Wei Liu, Yang Liu, Jian Luan, Bin Wang

Specifically, our proposed ToolRerank includes Adaptive Truncation, which truncates the retrieval results related to seen and unseen tools at different positions, and Hierarchy-Aware Reranking, which makes retrieval results more concentrated for single-tool queries and more diverse for multi-tool queries.

Retrieval

Paper
Add Code

SuPRA: Surgical Phase Recognition and Anticipation for Intra-Operative Planning

no code implementations • 10 Mar 2024 • Maxence Boels, Yang Liu, Prokar Dasgupta, Alejandro Granados, Sebastien Ourselin

In conclusion, SuPRA presents a new multi-task approach that paves the way for improved intra-operative assistance through surgical phase recognition and prediction of future events.

Surgical phase recognition

Paper
Add Code

A Concept-based Interpretable Model for the Diagnosis of Choroid Neoplasias using Multimodal Data

no code implementations • 8 Mar 2024 • Yifan Wu, Yang Liu, Yue Yang, Michael S. Yao, Wenli Yang, Xuehui Shi, Lihong Yang, Dongjun Li, Yueming Liu, James C. Gee, Xuan Yang, Wenbin Wei, Shi Gu

Diagnosing rare diseases presents a common challenge in clinical practice, necessitating the expertise of specialists for accurate identification.

Interpretable Machine Learning

Paper
Add Code

Towards Multimodal Sentiment Analysis Debiasing via Bias Purification

no code implementations • 8 Mar 2024 • Dingkang Yang, Mingcheng Li, Dongling Xiao, Yang Liu, Kun Yang, Zhaoyu Chen, Yuzheng Wang, Peng Zhai, Ke Li, Lihua Zhang

In the inference phase, given a factual multimodal input, MCIS imagines two counterfactual scenarios to purify and mitigate these biases.

counterfactual Counterfactual Inference +1

Paper
Add Code

Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation

no code implementations • 8 Mar 2024 • Xiaoying Zhang, Jean-Francois Ton, Wei Shen, Hongning Wang, Yang Liu

We introduce Adversarial Policy Optimization (AdvPO), a novel solution to the pervasive issue of reward over-optimization in Reinforcement Learning from Human Feedback (RLHF) for Large Language Models (LLMs).

Paper
Add Code

A&B BNN: Add&Bit-Operation-Only Hardware-Friendly Binary Neural Network

1 code implementation • 6 Mar 2024 • Ruichen Ma, Guanchao Qiao, Yian Liu, Liwei Meng, Ning Ning, Yang Liu, Shaogang Hu

A&B BNN is proposed to directly remove part of the multiplication operations in a traditional BNN and replace the rest with an equal number of bit operations, introducing the mask layer and the quantized RPReLU structure based on the normalizer-free network architecture.

Image Classification

Paper
Code

On the Effectiveness of Distillation in Mitigating Backdoors in Pre-trained Encoder

1 code implementation • 6 Mar 2024 • Tingxu Han, Shenghan Huang, Ziqi Ding, Weisong Sun, Yebo Feng, Chunrong Fang, Jun Li, Hanwei Qian, Cong Wu, Quanjun Zhang, Yang Liu, Zhenyu Chen

Distillation aims to distill knowledge from a given model (a. k. a the teacher net) and transfer it to another (a. k. a the student net).

Image Classification

Paper
Code

DomainVerse: A Benchmark Towards Real-World Distribution Shifts For Tuning-Free Adaptive Domain Generalization

no code implementations • 5 Mar 2024 • Feng Hou, Jin Yuan, Ying Yang, Yang Liu, Yang Zhang, Cheng Zhong, Zhongchao shi, Jianping Fan, Yong Rui, Zhiqiang He

With the recent advance of vision-language models (VLMs), viewed as natural source models, the cross-domain task changes to directly adapt the pre-trained source model to arbitrary target domains equipped with prior domain knowledge, and we name this task Adaptive Domain Generalization (ADG).

Domain Generalization

Paper
Add Code

FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio

1 code implementation • 4 Mar 2024 • Chao Xu, Yang Liu, Jiazheng Xing, Weida Wang, Mingze Sun, Jun Dan, Tianxin Huang, Siyuan Li, Zhi-Qi Cheng, Ying Tai, Baigui Sun

In this paper, we abstract the process of people hearing speech, extracting meaningful cues, and creating various dynamically audio-consistent talking faces, termed Listening and Imagining, into the task of high-fidelity diverse talking faces generation from a single audio.

Disentanglement

8,314

Paper
Code

PHAnToM: Personality Has An Effect on Theory-of-Mind Reasoning in Large Language Models

no code implementations • 4 Mar 2024 • Fiona Anting Tan, Gerard Christopher Yeo, Fanyou Wu, Weijie Xu, Vinija Jain, Aman Chadha, Kokil Jaidka, Yang Liu, See-Kiong Ng

Drawing inspiration from psychological research on the links between certain personality traits and Theory-of-Mind (ToM) reasoning, and from prompt engineering research on the hyper-sensitivity of prompts in affecting LLMs capabilities, this study investigates how inducing personalities in LLMs using prompts affects their ToM reasoning capabilities.

Prompt Engineering

Paper
Add Code

A Survey of Geometric Graph Neural Networks: Data Structures, Models and Applications

no code implementations • 1 Mar 2024 • Jiaqi Han, Jiacheng Cen, Liming Wu, Zongzhao Li, Xiangzhe Kong, Rui Jiao, Ziyang Yu, Tingyang Xu, Fandi Wu, Zihe Wang, Hongteng Xu, Zhewei Wei, Yang Liu, Yu Rong, Wenbing Huang

Geometric graph is a special kind of graph with geometric features, which is vital to model many scientific problems.

Paper
Add Code

LoLiSRFlow: Joint Single Image Low-light Enhancement and Super-resolution via Cross-scale Transformer-based Conditional Flow

no code implementations • 29 Feb 2024 • Ziyu Yue, Jiaxin Gao, Sihan Xie, Yang Liu, Zhixun Su

The visibility of real-world images is often limited by both low-light and low-resolution, however, these issues are only addressed in the literature through Low-Light Enhancement (LLE) and Super- Resolution (SR) methods.

Super-Resolution

Paper
Add Code

Deep Learning for 3D Human Pose Estimation and Mesh Recovery: A Survey

1 code implementation • 29 Feb 2024 • Yang Liu, Changzhen Qiu, Zhiyong Zhang

To the best of our knowledge, this survey is arguably the first to comprehensively cover deep learning methods for 3D human pose estimation, including both single-person and multi-person approaches, as well as human mesh recovery, encompassing methods based on explicit models and implicit representations.

3D Human Pose Estimation Autonomous Driving +1

Paper
Code

Datasets for Large Language Models: A Comprehensive Survey

1 code implementation • 28 Feb 2024 • Yang Liu, Jiahuan Cao, Chongyu Liu, Kai Ding, Lianwen Jin

Additionally, a comprehensive review of the existing available dataset resources is also provided, including statistics from 444 datasets, covering 8 language categories and spanning 32 domains.

Language Modelling Large Language Model

532

Paper
Code

PRCL: Probabilistic Representation Contrastive Learning for Semi-Supervised Semantic Segmentation

no code implementations • 28 Feb 2024 • Haoyu Xie, Changqi Wang, Jian Zhao, Yang Liu, Jun Dan, Chong Fu, Baigui Sun

To address this issue, we propose a robust contrastive-based S4 framework, termed the Probabilistic Representation Contrastive Learning (PRCL) framework to enhance the robustness of the unsupervised training process.

Contrastive Learning Semi-Supervised Semantic Segmentation

Paper
Add Code

ArcSin: Adaptive ranged cosine Similarity injected noise for Language-Driven Visual Tasks

no code implementations • 27 Feb 2024 • Yang Liu, Xiaomin Yu, Gongyu Zhang, Christos Bergeles, Prokar Dasgupta, Alejandro Granados, Sebastien Ourselin

We train models for these tasks in a zero-shot cross-modal transfer setting, a domain where the previous state-of-the-art method relied on the fixed scale noise injection, often compromising the semantic content of the original modality embedding.

Domain Generalization Image Captioning +3

Paper
Add Code

Dataset Fairness: Achievable Fairness on Your Data With Utility Guarantees

no code implementations • 27 Feb 2024 • Muhammad Faaiz Taufiq, Jean-Francois Ton, Yang Liu

In machine learning fairness, training models which minimize disparity across different sensitive groups often leads to diminished accuracy, a phenomenon known as the fairness-accuracy trade-off.

Fairness

Paper
Add Code

Reasoning in Conversation: Solving Subjective Tasks through Dialogue Simulation for Large Language Models

no code implementations • 27 Feb 2024 • Xiaolong Wang, Yile Wang, Yuanchi Zhang, Fuwen Luo, Peng Li, Maosong Sun, Yang Liu

Based on the characteristics of the tasks and the strong dialogue-generation capabilities of LLMs, we propose RiC (Reasoning in Conversation), a method that focuses on solving subjective tasks through dialogue simulation.

Dark Humor Detection Dialogue Generation +3

Paper
Add Code

Citation-Enhanced Generation for LLM-based Chatbots

no code implementations • 25 Feb 2024 • Weitao Li, Junkai Li, Weizhi Ma, Yang Liu

Note that our method is a training-free plug-and-play plugin that is capable of various LLMs.

Chatbot Citation Prediction +3

Paper
Add Code

Budget-Constrained Tool Learning with Planning

1 code implementation • 25 Feb 2024 • Yuanhang Zheng, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu

Despite intensive efforts devoted to tool learning, the problem of budget-constrained tool learning, which focuses on resolving user queries within a specific budget constraint, has been widely overlooked.

Paper
Code

Lightweight, error-tolerant edge detection using memristor-enabled stochastic logics

no code implementations • 25 Feb 2024 • Lekai Song, Pengyu Liu, Jingfang Pei, Yang Liu, Songwei Liu, Shengbo Wang, Leonard W. T. Ng, Tawfique Hasan, Kong-Pang Pun, Shuo Gao, Guohua Hu

The demand for efficient edge vision has spurred the interest in developing stochastic computing approaches for performing image processing tasks.

Autonomous Driving Edge Detection +1

Paper
Add Code

Bridging the Gap between 2D and 3D Visual Question Answering: A Fusion Approach for 3D VQA

1 code implementation • 24 Feb 2024 • Wentao Mo, Yang Liu

In 3D Visual Question Answering (3D VQA), the scarcity of fully annotated data and limited visual content diversity hampers the generalization to novel scenes and 3D concepts (e. g., only around 800 scenes are utilized in ScanQA and SQA dataset).

Ranked #1 on 3D Question Answering (3D-QA) on ScanQA Test w/ objects

3D Question Answering (3D-QA) Question Answering +1

Paper
Code

LLMs Can Defend Themselves Against Jailbreaking in a Practical Manner: A Vision Paper

no code implementations • 24 Feb 2024 • Daoyuan Wu, Shuai Wang, Yang Liu, Ning Liu

Our key insight is that regardless of the kind of jailbreak strategies employed, they eventually need to include a harmful prompt (e. g., "how to make a bomb") in the prompt sent to LLMs, and we found that existing LLMs can effectively recognize such harmful prompts that violate their safety policies.

Adversarial Attack

Paper
Add Code

DEEM: Dynamic Experienced Expert Modeling for Stance Detection

1 code implementation • 23 Feb 2024 • Xiaolong Wang, Yile Wang, Sijie Cheng, Peng Li, Yang Liu

Recent work has made a preliminary attempt to use large language models (LLMs) to solve the stance detection task, showing promising results.

Stance Detection

Paper
Code

MVD$^2$: Efficient Multiview 3D Reconstruction for Multiview Diffusion

no code implementations • 22 Feb 2024 • Xin-Yang Zheng, Hao Pan, Yu-Xiao Guo, Xin Tong, Yang Liu

By finetuning pretrained large image diffusion models with 3D data, the MVD methods first generate multiple views of a 3D object based on an image or text prompt and then reconstruct 3D shapes with multiview 3D reconstruction.

3D Generation 3D Reconstruction

Paper
Add Code

Swin3D++: Effective Multi-Source Pretraining for 3D Indoor Scene Understanding

1 code implementation • 22 Feb 2024 • Yu-Qi Yang, Yu-Xiao Guo, Yang Liu

Data diversity and abundance are essential for improving the performance and generalization of models in natural language processing and 2D vision.

Scene Understanding

170

Paper
Code

OMGEval: An Open Multilingual Generative Evaluation Benchmark for Large Language Models

1 code implementation • 21 Feb 2024 • Meng Xu, Shuo Wang, Liner Yang, Haoyu Wang, Zhenghao Liu, Cunliang Kong, Yun Chen, Yang Liu, Maosong Sun, Erhong Yang

We evaluate several representative multilingual LLMs on the proposed OMGEval, which we believe will provide a valuable reference for the community to further understand and improve the multilingual capability of LLMs.

General Knowledge Logical Reasoning

Paper
Code

Full-Atom Peptide Design with Geometric Latent Diffusion

no code implementations • 21 Feb 2024 • Xiangzhe Kong, Wenbing Huang, Yang Liu

Peptide design plays a pivotal role in therapeutics, allowing brand new possibility to leverage target binding sites that are previously undruggable.

Paper
Add Code

CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models

no code implementations • 21 Feb 2024 • Fuwen Luo, Chi Chen, Zihao Wan, Zhaolu Kang, Qidong Yan, Yingjie Li, Xiaolong Wang, Siyu Wang, Ziyue Wang, Xiaoyue Mi, Peng Li, Ning Ma, Maosong Sun, Yang Liu

Multimodal large language models (MLLMs) have demonstrated promising results in a variety of tasks that combine vision and language.

Benchmarking

Paper
Add Code

PANDA: Preference Adaptation for Enhancing Domain-Specific Abilities of LLMs

no code implementations • 20 Feb 2024 • An Liu, Zonghan Yang, Zhenhe Zhang, Qingyuan Hu, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu

While Large language models (LLMs) have demonstrated considerable capabilities across various natural language tasks, they often fall short of the performance achieved by domain-specific state-of-the-art models.

text-classification Text Classification

Paper
Add Code

Model Composition for Multimodal Large Language Models

no code implementations • 20 Feb 2024 • Chi Chen, Yiyang Du, Zheng Fang, Ziyue Wang, Fuwen Luo, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Maosong Sun, Yang Liu

In this paper, we propose a new paradigm through the model composition of existing MLLMs to create a new model that retains the modal understanding capabilities of each original model.

Paper
Add Code

Fair Classifiers Without Fair Training: An Influence-Guided Data Sampling Approach

no code implementations • 20 Feb 2024 • Jinlong Pang, Jialu Wang, Zhaowei Zhu, Yuanshun Yao, Chen Qian, Yang Liu

A fair classifier should ensure the benefit of people from different groups, while the group information is often sensitive and unsuitable for model training.

Attribute Fairness

Paper
Add Code

BMLP: Behavior-aware MLP for Heterogeneous Sequential Recommendation

no code implementations • 20 Feb 2024 • Weixin Li, Yuhao Wu, Yang Liu, Weike Pan, Zhong Ming

In real recommendation scenarios, users often have different types of behaviors, such as clicking and buying.

Sequential Recommendation

Paper
Add Code

Equivariant Pretrained Transformer for Unified Geometric Learning on Multi-Domain 3D Molecules

no code implementations • 20 Feb 2024 • Rui Jiao, Xiangzhe Kong, Ziyang Yu, Wenbing Huang, Yang Liu

Pretraining on a large number of unlabeled 3D molecules has showcased superiority in various scientific applications.

Molecular Property Prediction Property Prediction

Paper
Add Code

Browse and Concentrate: Comprehending Multimodal Content via prior-LLM Context Fusion

1 code implementation • 19 Feb 2024 • Ziyue Wang, Chi Chen, Yiqi Zhu, Fuwen Luo, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Maosong Sun, Yang Liu

With the bloom of Large Language Models (LLMs), Multimodal Large Language Models (MLLMs) that incorporate LLMs with pre-trained vision models have recently demonstrated impressive performance across diverse vision-language tasks.

Paper
Code

Enhancing Multilingual Capabilities of Large Language Models through Self-Distillation from Resource-Rich Languages

1 code implementation • 19 Feb 2024 • Yuanchi Zhang, Yile Wang, Zijun Liu, Shuo Wang, Xiaolong Wang, Peng Li, Maosong Sun, Yang Liu

While large language models (LLMs) have been pre-trained on multilingual corpora, their performance still lags behind in most languages compared to a few resource-rich languages.

Transfer Learning

19,575

Paper
Code

Meta Ranking: Less Capable Language Models are Capable for Single Response Judgement

1 code implementation • 19 Feb 2024 • Zijun Liu, Boqun Kou, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu

Although Large Language Models (LLMs) have demonstrated strong performance on a wide range of tasks, they still face reliability challenges such as hallucination.

Hallucination

Paper
Code

Purifying Large Language Models by Ensembling a Small Language Model

no code implementations • 19 Feb 2024 • Tianlin Li, Qian Liu, Tianyu Pang, Chao Du, Qing Guo, Yang Liu, Min Lin

The emerging success of large language models (LLMs) heavily relies on collecting abundant training data from external (untrusted) sources.

Data Poisoning Language Modelling

Paper
Add Code

Scaffolding Coordinates to Promote Vision-Language Coordination in Large Multi-Modal Models

1 code implementation • 19 Feb 2024 • Xuanyu Lei, Zonghan Yang, Xinrui Chen, Peng Li, Yang Liu

State-of-the-art Large Multi-Modal Models (LMMs) have demonstrated exceptional capabilities in vision-language tasks.

Visual Prompting

Paper
Code

Groot: Adversarial Testing for Generative Text-to-Image Models with Tree-based Semantic Transformation

no code implementations • 19 Feb 2024 • Yi Liu, Guowei Yang, Gelei Deng, Feiyue Chen, Yuqi Chen, Ling Shi, Tianwei Zhang, Yang Liu

With the prevalence of text-to-image generative models, their safety becomes a critical concern.

Paper
Add Code

Your Large Language Model is Secretly a Fairness Proponent and You Should Prompt it Like One

no code implementations • 19 Feb 2024 • Tianlin Li, XiaoYu Zhang, Chao Du, Tianyu Pang, Qian Liu, Qing Guo, Chao Shen, Yang Liu

Building on this insight and observation, we develop FairThinking, a pipeline designed to automatically generate roles that enable LLMs to articulate diverse perspectives for fair expressions.

Fairness Language Modelling +1

Paper
Add Code

Adversarial Curriculum Graph Contrastive Learning with Pair-wise Augmentation

no code implementations • 16 Feb 2024 • Xinjian Zhao, Liang Zhang, Yang Liu, Ruocheng Guo, Xiangyu Zhao

To address this challenge, we propose an innovative framework: Adversarial Curriculum Graph Contrastive Learning (ACGCL), which capitalizes on the merits of pair-wise augmentation to engender graph-level positive and negative samples with controllable similarity, alongside subgraph contrastive learning to discern effective graph patterns therein.

Contrastive Learning Graph Representation Learning

Paper
Add Code

Measuring and Reducing LLM Hallucination without Gold-Standard Answers via Expertise-Weighting

no code implementations • 16 Feb 2024 • Jiaheng Wei, Yuanshun Yao, Jean-Francois Ton, Hongyi Guo, Andrew Estornell, Yang Liu

In this work, we propose Factualness Evaluations via Weighting LLMs (FEWL), the first hallucination metric that is specifically designed for the scenario when gold-standard answers are absent.

Hallucination In-Context Learning

Paper
Add Code

Comment-aided Video-Language Alignment via Contrastive Pre-training for Short-form Video Humor Detection

1 code implementation • 14 Feb 2024 • Yang Liu, Tongfei Shen, Dong Zhang, Qingying Sun, Shoushan Li, Guodong Zhou

The growing importance of multi-modal humor detection within affective computing correlates with the expanding influence of short-form video sharing on social media platforms.

Humor Detection

Paper
Code

Play Guessing Game with LLM: Indirect Jailbreak Attack with Implicit Clues

no code implementations • 14 Feb 2024 • Zhiyuan Chang, Mingyang Li, Yi Liu, Junjie Wang, Qing Wang, Yang Liu

With the development of LLMs, the security threats of LLMs are getting more and more attention.

Paper
Add Code

Switch EMA: A Free Lunch for Better Flatness and Sharpness

2 code implementations • 14 Feb 2024 • Siyuan Li, Zicheng Liu, Juanxi Tian, Ge Wang, Zedong Wang, Weiyang Jin, Di wu, Cheng Tan, Tao Lin, Yang Liu, Baigui Sun, Stan Z. Li

Exponential Moving Average (EMA) is a widely used weight averaging (WA) regularization to learn flat optima for better generalizations without extra cost in deep neural network (DNN) optimization.

Attribute Image Classification +7

570

Paper
Code

Rethinking Machine Unlearning for Large Language Models

no code implementations • 13 Feb 2024 • Sijia Liu, Yuanshun Yao, Jinghan Jia, Stephen Casper, Nathalie Baracaldo, Peter Hase, Xiaojun Xu, Yuguang Yao, Hang Li, Kush R. Varshney, Mohit Bansal, Sanmi Koyejo, Yang Liu

We explore machine unlearning (MU) in the domain of large language models (LLMs), referred to as LLM unlearning.

Machine Unlearning Management +2

Paper
Add Code

Towards Unified Alignment Between Agents, Humans, and Environment

no code implementations • 12 Feb 2024 • Zonghan Yang, An Liu, Zijun Liu, Kaiming Liu, Fangzhou Xiong, Yile Wang, Zeyuan Yang, Qingyuan Hu, Xinrui Chen, Zhenhe Zhang, Fuwen Luo, Zhicheng Guo, Peng Li, Yang Liu

We also conduct proof-of-concept studies by introducing realistic features to WebShop, including user profiles to demonstrate intentions, personalized reranking for complex environmental dynamics, and runtime cost statistics to reflect self-constraints.

Decision Making

Paper
Add Code

Large Language Models as Agents in Two-Player Games

no code implementations • 12 Feb 2024 • Yang Liu, Peng Sun, Hang Li

By formally defining the training processes of large language models (LLMs), which usually encompasses pre-training, supervised fine-tuning, and reinforcement learning with human feedback, within a single and unified machine learning paradigm, we can glean pivotal insights for advancing LLM technologies.

Position reinforcement-learning

Paper
Add Code

Sparse Anatomical Prompt Semi-Supervised Learning with Masked Image Modeling for CBCT Tooth Segmentation

no code implementations • 7 Feb 2024 • Pengyu Dai, Yafei Ou, Yang Liu, Yue Zhao

To address these challenges, this study aims to propose a tasked-oriented Masked Auto-Encoder paradigm to effectively utilize large amounts of unlabeled data to achieve accurate tooth segmentation with limited labeled data.

Graph Attention Segmentation

Paper
Add Code

Space Group Constrained Crystal Generation

no code implementations • 6 Feb 2024 • Rui Jiao, Wenbing Huang, Yu Liu, Deli Zhao, Yang Liu

Crystals are the foundation of numerous scientific and industrial applications.

Paper
Add Code

Weakly Supervised Anomaly Detection via Knowledge-Data Alignment

no code implementations • 6 Feb 2024 • Haihong Zhao, Chenyi Zi, Yang Liu, Chen Zhang, Yan Zhou, Jia Li

In this paper, we introduce a novel framework Knowledge-Data Alignment (KDAlign) to integrate rule knowledge, typically summarized by human experts, to supplement the limited labeled data.

Malware Detection Supervised Anomaly Detection +1

Paper
Add Code

FoolSDEdit: Deceptively Steering Your Edits Towards Targeted Attribute-aware Distribution

no code implementations • 6 Feb 2024 • Qi Zhou, Dongxia Wang, Tianlin Li, Zhihong Xu, Yang Liu, Kui Ren, Wenhai Wang, Qing Guo

To expose this potential vulnerability, we aim to build an adversarial attack forcing SDEdit to generate a specific data distribution aligned with a specified attribute (e. g., female), without changing the input's attribute characteristics.

Adversarial Attack Attribute +1

Paper
Add Code

Improving Robustness of LiDAR-Camera Fusion Model against Weather Corruption from Fusion Strategy Perspective

no code implementations • 5 Feb 2024 • Yihao Huang, Kaiyuan Yu, Qing Guo, Felix Juefei-Xu, Xiaojun Jia, Tianlin Li, Geguang Pu, Yang Liu

In recent years, LiDAR-camera fusion models have markedly advanced 3D object detection tasks in autonomous driving.

3D Object Detection Autonomous Driving +1

Paper
Add Code

MQuinE: a cure for "Z-paradox" in knowledge graph embedding models

no code implementations • 5 Feb 2024 • Yang Liu, Huang Fang, Yunfeng Cai, Mingming Sun

Knowledge graph embedding (KGE) models achieved state-of-the-art results on many knowledge graph tasks including link prediction and information retrieval.

Information Retrieval Knowledge Graph Embedding +3

Paper
Add Code

RobustTSF: Towards Theory and Design of Robust Time Series Forecasting with Anomalies

1 code implementation • 3 Feb 2024 • Hao Cheng, Qingsong Wen, Yang Liu, Liang Sun

Time series forecasting is an important and forefront task in many real-world applications.

Time Series Time Series Forecasting

Paper
Code

Cheating Suffix: Targeted Attack to Text-To-Image Diffusion Models with Multi-Modal Priors

1 code implementation • 2 Feb 2024 • Dingcheng Yang, Yang Bai, Xiaojun Jia, Yang Liu, Xiaochun Cao, Wenjian Yu

The MMP-Attack shows a notable advantage over existing works with superior universality and transferability, which can effectively attack commercial text-to-image (T2I) models such as DALL-E 3.

Image Generation

Paper
Code

Graph Neural Networks in EEG-based Emotion Recognition: A Survey

no code implementations • 2 Feb 2024 • Chenyu Liu, Xinliang Zhou, Yihao Wu, Ruizhi Yang, Liming Zhai, Ziyu Jia, Yang Liu

Besides, there is neither a comprehensive review nor guidance for constructing GNNs in EEG-based emotion recognition.

EEG Emotion Recognition +2

Paper
Add Code

Multimodal Embodied Interactive Agent for Cafe Scene

no code implementations • 1 Feb 2024 • Yang Liu, Xinshuai Song, Kaixuan Jiang, Weixing Chen, Jingzhou Luo, Guanbin Li, Liang Lin

To overcome this limitation, we introduce the Multimodal Embodied Interactive Agent (MEIA), capable of translating high-level tasks expressed in natural language into a sequence of executable actions.

Zero-Shot Learning

Paper
Add Code

A Proactive and Dual Prevention Mechanism against Illegal Song Covers empowered by Singing Voice Conversion

no code implementations • 30 Jan 2024 • Guangke Chen, Yedi Zhang, Fu Song, Ting Wang, Xiaoning Du, Yang Liu

To improve the imperceptibility of perturbations, we refine a psychoacoustic model-based loss with the backing track as an additional masker, a unique accompanying element for singing voices compared to ordinary speech voices.

Voice Conversion

Paper
Add Code

A Cross-Language Investigation into Jailbreak Attacks in Large Language Models

no code implementations • 30 Jan 2024 • Jie Li, Yi Liu, Chongyang Liu, Ling Shi, Xiaoning Ren, Yaowen Zheng, Yang Liu, Yinxing Xue

To address this research gap, we conducted an extensive empirical study on Multilingual Jailbreak attacks.

Text Generation

Paper
Add Code

Node Flux-Linkage Synchronizing Control of Power Systems with 100% Wind Power Generation Based on Capacitor Voltage Balancing Scheme

no code implementations • 30 Jan 2024 • Yang Liu, Yanshan Chen, Yuexi Yang, Xiangyu Pei, Feng Ji

In order to limit the short-circuit current of inverters, a logic-based bang-bang funnel control (LBFC) is designed to control the switches of inverter bridges when over-current is detected.

Paper
Add Code

LLM4Vuln: A Unified Evaluation Framework for Decoupling and Enhancing LLMs' Vulnerability Reasoning

no code implementations • 29 Jan 2024 • Yuqiang Sun, Daoyuan Wu, Yue Xue, Han Liu, Wei Ma, Lyuye Zhang, Miaolei Shi, Yang Liu

Large language models (LLMs) have demonstrated significant poten- tial for many downstream tasks, including those requiring human- level intelligence, such as vulnerability detection.

Vulnerability Detection

Paper
Add Code

GarchingSim: An Autonomous Driving Simulator with Photorealistic Scenes and Minimalist Workflow

1 code implementation • 28 Jan 2024 • Liguo Zhou, Yinglei Song, Yichao Gao, Zhou Yu, Michael Sodamin, Hongshen Liu, Liang Ma, Lian Liu, Hao liu, Yang Liu, Haichuan Li, Guang Chen, Alois Knoll

However, the availability of free and open-source simulators is limited, and the installation and configuration process can be daunting for beginners and interdisciplinary researchers.

Autonomous Driving

Paper
Code

Quantifying Stereotypes in Language

1 code implementation • 28 Jan 2024 • Yang Liu

It is often potentially encoded in human language, which is more common in texts on social issues.

Sentence

Paper
Code

SkipViT: Speeding Up Vision Transformers with a Token-Level Skip Connection

no code implementations • 27 Jan 2024 • Foozhan Ataiefard, Walid Ahmed, Habib Hajimolahoseini, Saina Asani, Farnoosh Javadi, Mohammad Hassanpour, Omar Mohamed Awad, Austin Wen, Kangling Liu, Yang Liu

Our method does not add any parameters to the ViT model and aims to find the best trade-off between training throughput and achieving a 0% loss in the Top-1 accuracy of the final model.

Paper
Add Code

VJT: A Video Transformer on Joint Tasks of Deblurring, Low-light Enhancement and Denoising

no code implementations • 26 Jan 2024 • Yuxiang Hui, Yang Liu, Yaofang Liu, Fan Jia, Jinshan Pan, Raymond Chan, Tieyong Zeng

Video restoration task aims to recover high-quality videos from low-quality observations.

Deblurring Denoising +2

Paper
Add Code

Generative Video Diffusion for Unseen Cross-Domain Video Moment Retrieval

no code implementations • 24 Jan 2024 • Dezhao Luo, Shaogang Gong, Jiabo Huang, Hailin Jin, Yang Liu

We address two problems in video editing for optimising unseen domain VMR: (1) generation of high-quality simulation videos of different moments with subtle distinctions, (2) selection of simulation videos that complement existing source training videos without introducing harmful noise or unnecessary repetitions.

Moment Retrieval Retrieval +2

Paper
Add Code

UniHDA: A Unified and Versatile Framework for Multi-Modal Hybrid Domain Adaptation

no code implementations • 23 Jan 2024 • Hengjia Li, Yang Liu, Yuqi Lin, Zhanwei Zhang, Yibo Zhao, weihang Pan, Tu Zheng, Zheng Yang, Yuchun Jiang, Boxi Wu, Deng Cai

In this paper, we propose UniHDA, a \textbf{unified} and \textbf{versatile} framework for generative hybrid domain adaptation with multi-modal references from multiple domains.

Attribute Domain Adaptation

Paper
Add Code

Speak It Out: Solving Symbol-Related Problems with Symbol-to-Language Conversion for Language Models

1 code implementation • 22 Jan 2024 • Yile Wang, Sijie Cheng, Zixin Sun, Peng Li, Yang Liu

We propose symbol-to-language (S2L), a tuning-free method that enables large language models to solve symbol-related problems with information expressed in natural language.

Property Prediction Question Answering +1

Paper
Code

Robust Evaluation Measures for Evaluating Social Biases in Masked Language Models

1 code implementation • 21 Jan 2024 • Yang Liu

Many evaluation measures are used to evaluate social biases in masked language models (MLMs).

Paper
Code

CBVS: A Large-Scale Chinese Image-Text Benchmark for Real-World Short Video Search Scenarios

1 code implementation • 19 Jan 2024 • Xiangshuo Qiao, Xianxin Li, Xiaozhe Qu, Jie Zhang, Yang Liu, Yu Luo, Cihang Jin, Jin Ma

Differently, video covers in short video search scenarios are presented as user-originated contents that provide important visual summaries of videos.

Ranked #1 on Image Retrieval on CBVS

Common Sense Reasoning Image Retrieval

Paper
Code

LLMs for Relational Reasoning: How Far are We?

no code implementations • 17 Jan 2024 • Zhiming Li, Yushi Cao, Xiufeng Xu, Junzhe Jiang, Xu Liu, Yon Shin Teo, Shang-Wei Lin, Yang Liu

Large language models (LLMs) have revolutionized many areas (e. g. natural language processing, software engineering, etc.)

Common Sense Reasoning Decision Making +3

Paper
Add Code

Rigid Protein-Protein Docking via Equivariant Elliptic-Paraboloid Interface Prediction

1 code implementation • 17 Jan 2024 • Ziyang Yu, Wenbing Huang, Yang Liu

The study of rigid protein-protein docking plays an essential role in a variety of tasks such as drug design and protein engineering.

Paper
Code

Short-Form Videos and Mental Health: A Knowledge-Guided Neural Topic Model

no code implementations • 11 Jan 2024 • Jiaheng Xie, Ruicheng Liang, Yidong Chai, Yang Liu, Daniel Zeng

To prevent widespread consequences, platforms are eager to predict these videos' impact on viewers' mental health.

Topic Models Video Classification

Paper
Add Code

Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security

2 code implementations • 10 Jan 2024 • Yuanchun Li, Hao Wen, Weijun Wang, Xiangyu Li, Yizhen Yuan, Guohong Liu, Jiacheng Liu, Wenxing Xu, Xiang Wang, Yi Sun, Rui Kong, Yile Wang, Hanfei Geng, Jian Luan, Xuefeng Jin, Zilong Ye, Guanjing Xiong, Fan Zhang, Xiang Li, Mengwei Xu, Zhijun Li, Peng Li, Yang Liu, Ya-Qin Zhang, Yunxin Liu

Next, we discuss several key challenges to achieve intelligent, efficient and secure Personal LLM Agents, followed by a comprehensive survey of representative solutions to address these challenges.

223

Paper
Code

FADI-AEC: Fast Score Based Diffusion Model Guided by Far-end Signal for Acoustic Echo Cancellation

no code implementations • 8 Jan 2024 • Yang Liu, Li Wan, Yun Li, Yiteng Huang, Ming Sun, James Luan, Yangyang Shi, Xin Lei

Despite the potential of diffusion models in speech enhancement, their deployment in Acoustic Echo Cancellation (AEC) has been restricted.

Acoustic echo cancellation Speech Enhancement

Paper
Add Code

FedTGP: Trainable Global Prototypes with Adaptive-Margin-Enhanced Contrastive Learning for Data and Model Heterogeneity in Federated Learning

1 code implementation • 6 Jan 2024 • Jianqing Zhang, Yang Liu, Yang Hua, Jian Cao

To reduce the high communication cost of transmitting model parameters, a major challenge in HtFL, prototype-based HtFL methods are proposed to solely share class representatives, a. k. a, prototypes, among heterogeneous clients while maintaining the privacy of clients' models.

Contrastive Learning Federated Learning

Paper
Code

Human-Instruction-Free LLM Self-Alignment with Limited Samples

no code implementations • 6 Jan 2024 • Hongyi Guo, Yuanshun Yao, Wei Shen, Jiaheng Wei, Xiaoying Zhang, Zhaoran Wang, Yang Liu

The key idea is to first retrieve high-quality samples related to the target domain and use them as In-context Learning examples to generate more samples.

In-Context Learning Instruction Following

Paper
Add Code

Digger: Detecting Copyright Content Mis-usage in Large Language Model Training

no code implementations • 1 Jan 2024 • Haodong Li, Gelei Deng, Yi Liu, Kailong Wang, Yuekang Li, Tianwei Zhang, Yang Liu, Guoai Xu, Guosheng Xu, Haoyu Wang

In this paper, we introduce a detailed framework designed to detect and assess the presence of content from potentially copyrighted books within the training datasets of LLMs.

Language Modelling Large Language Model +1

Paper
Add Code

Masked Modeling for Self-supervised Representation Learning on Vision and Beyond

1 code implementation • 31 Dec 2023 • Siyuan Li, Luyuan Zhang, Zedong Wang, Di wu, Lirong Wu, Zicheng Liu, Jun Xia, Cheng Tan, Yang Liu, Baigui Sun, Stan Z. Li

As the deep learning revolution marches on, self-supervised learning has garnered increasing attention in recent years thanks to its remarkable representation learning ability and the low dependence on labeled data.

Representation Learning Self-Supervised Learning

234

Paper
Code

SAR-RARP50: Segmentation of surgical instrumentation and Action Recognition on Robot-Assisted Radical Prostatectomy Challenge

2 code implementations • 31 Dec 2023 • Dimitrios Psychogyios, Emanuele Colleoni, Beatrice van Amsterdam, Chih-Yang Li, Shu-Yu Huang, Yuchong Li, Fucang Jia, Baosheng Zou, Guotai Wang, Yang Liu, Maxence Boels, Jiayu Huo, Rachel Sparks, Prokar Dasgupta, Alejandro Granados, Sebastien Ourselin, Mengya Xu, An Wang, Yanan Wu, Long Bai, Hongliang Ren, Atsushi Yamada, Yuriko Harai, Yuto Ishikawa, Kazuyuki Hayashi, Jente Simoens, Pieter DeBacker, Francesco Cisternino, Gabriele Furnari, Alex Mottrie, Federica Ferraguti, Satoshi Kondo, Satoshi Kasai, Kousuke Hirasawa, Soohee Kim, Seung Hyun Lee, Kyu Eun Lee, Hyoun-Joong Kong, Kui Fu, Chao Li, Shan An, Stefanie Krell, Sebastian Bodenstedt, Nicolas Ayobi, Alejandra Perez, Santiago Rodriguez, Juanita Puentes, Pablo Arbelaez, Omid Mohareri, Danail Stoyanov

Surgical tool segmentation and action recognition are fundamental building blocks in many computer-assisted intervention applications, ranging from surgical skills assessment to decision support systems.

Action Recognition Segmentation +1

Paper
Code

A Prompt Learning Framework for Source Code Summarization

1 code implementation • 26 Dec 2023 • Weisong Sun, Chunrong Fang, Yudu You, Yuchen Chen, Yi Liu, Chong Wang, Jian Zhang, Quanjun Zhang, Hanwei Qian, Wei Zhao, Yang Liu, Zhenyu Chen

PromptCS trains a prompt agent that can generate continuous prompts to unleash the potential for LLMs in code summarization.

Code Summarization Few-Shot Learning +2

Paper
Code

A Split-and-Privatize Framework for Large Language Model Fine-Tuning

no code implementations • 25 Dec 2023 • Xicong Shen, Yang Liu, Huiqi Liu, Jue Hong, Bing Duan, Zirui Huang, Yunlong Mao, Ye Wu, Di wu

Fine-tuning is a prominent technique to adapt a pre-trained language model to downstream scenarios.

Language Modelling Large Language Model

Paper
Add Code

Exploiting Multipath Information for Integrated Localization and Sensing via PHD Filtering

no code implementations • 24 Dec 2023 • Yinuo Du, Hanying Zhao, Yang Liu, Xinlei Yu, Yuan Shen

Accurate localization and perception are pivotal for enhancing the safety and reliability of vehicles.

Paper
Add Code

Multimodal Federated Learning with Missing Modality via Prototype Mask and Contrast

no code implementations • 21 Dec 2023 • Guangyin Bao, Qi Zhang, Duoqian Miao, Zixuan Gong, Liang Hu, Ke Liu, Yang Liu, Chongyang Shi

In real-world scenarios, multimodal federated learning often faces the practical challenge of intricate modality missing, which poses constraints on building federated frameworks and significantly degrades model inference accuracy.

Federated Learning

Paper
Add Code

Knowledge Graph Error Detection with Contrastive Confidence Adaption

1 code implementation • 19 Dec 2023 • Xiangyu Liu, Yang Liu, Wei Hu

Knowledge graphs (KGs) often contain various errors.

Contrastive Learning Knowledge Graphs

Paper
Code

A Semi-Analytical Approach for State-Space Electromagnetic Transient Simulation Using the Differential Transformation

no code implementations • 19 Dec 2023 • Min Xiong, Kaiyang Huang, Yang Liu, Rui Yao, Kai Sun, Feng Qiu

Case studies are conducted on EMT models of the IEEE 39-bus system and a synthetic 390-bus system to demonstrate the merits of the new simulation approach against traditional methods.

Paper
Add Code

Probabilistic Prediction of Longitudinal Trajectory Considering Driving Heterogeneity with Interpretability

no code implementations • 19 Dec 2023 • Shuli Wang, Kun Gao, Lanfang Zhang, Yang Liu, Lei Chen

Specifically, based on a certain length of historical trajectory data, the situation-specific driving preferences of each driver are identified, where key driving behavior feature vectors are extracted to characterize heterogeneity in driving behavior among different drivers.

Navigate Trajectory Prediction

Paper
Add Code

Mutual Enhancement of Large and Small Language Models with Cross-Silo Knowledge Transfer

no code implementations • 10 Dec 2023 • Yongheng Deng, Ziqing Qiao, Ju Ren, Yang Liu, Yaoxue Zhang

While large language models (LLMs) are empowered with broad knowledge, their task-specific performance is often suboptimal.

Transfer Learning

Paper
Add Code

PFLlib: Personalized Federated Learning Algorithm Library

1 code implementation • 8 Dec 2023 • Jianqing Zhang, Yang Liu, Yang Hua, Hao Wang, Tao Song, Zhengui Xue, Ruhui Ma, Jian Cao

Amid the ongoing advancements in Federated Learning (FL), a machine learning paradigm that allows collaborative learning with data privacy protection, personalized FL (pFL) has gained significant prominence as a research direction within the FL domain.

Personalized Federated Learning

1,148

Paper
Code

SA-Attack: Improving Adversarial Transferability of Vision-Language Pre-training Models via Self-Augmentation

no code implementations • 8 Dec 2023 • Bangyan He, Xiaojun Jia, Siyuan Liang, Tianrui Lou, Yang Liu, Xiaochun Cao

Current Visual-Language Pre-training (VLP) models are vulnerable to adversarial examples.

Data Augmentation

Paper
Add Code

OT-Attack: Enhancing Adversarial Transferability of Vision-Language Models via Optimal Transport Optimization

no code implementations • 7 Dec 2023 • Dongchen Han, Xiaojun Jia, Yang Bai, Jindong Gu, Yang Liu, Xiaochun Cao

Investigating the generation of high-transferability adversarial examples is crucial for uncovering VLP models' vulnerabilities in practical scenarios.

Adversarial Attack Data Augmentation +2

Paper
Add Code

Detecting and Restoring Non-Standard Hands in Stable Diffusion Generated Images

no code implementations • 7 Dec 2023 • Yiqun Zhang, Zhenyue Qin, Yang Liu, Dylan Campbell

We introduce a pipeline to address anatomical inaccuracies in Stable Diffusion generated hand images.

Pose Estimation

Paper
Add Code

TranSegPGD: Improving Transferability of Adversarial Examples on Semantic Segmentation

no code implementations • 3 Dec 2023 • Xiaojun Jia, Jindong Gu, Yihao Huang, Simeng Qin, Qing Guo, Yang Liu, Xiaochun Cao

At the second stage, the pixels are divided into different branches based on their transferable property which is dependent on Kullback-Leibler divergence.

Adversarial Attack Image Classification +2

Paper
Add Code

Abstract Syntax Tree for Programming Language Understanding and Representation: How Far Are We?

1 code implementation • 1 Dec 2023 • Weisong Sun, Chunrong Fang, Yun Miao, Yudu You, Mengzhe Yuan, Yuchen Chen, Quanjun Zhang, An Guo, Xiang Chen, Yang Liu, Zhenyu Chen

To do so, we compare the performance of models trained with code token sequence (Token for short) based code representation and AST-based code representation on three popular types of code-related tasks.

Representation Learning

Paper
Code

CoDi-2: In-Context, Interleaved, and Interactive Any-to-Any Generation

no code implementations • 30 Nov 2023 • Zineng Tang, ZiYi Yang, Mahmoud Khademi, Yang Liu, Chenguang Zhu, Mohit Bansal

We present CoDi-2, a versatile and interactive Multimodal Large Language Model (MLLM) that can follow complex multimodal interleaved instructions, conduct in-context learning (ICL), reason, chat, edit, etc., in an any-to-any input-output modality paradigm.

Image Generation In-Context Learning +3

Paper
Add Code

StructRe: Rewriting for Structured Shape Modeling

no code implementations • 29 Nov 2023 • Jiepeng Wang, Hao Pan, Yang Liu, Xin Tong, Taku Komura, Wenping Wang

Such a localized rewriting process enables probabilistic modeling of ambiguous structures and robust generalization across object categories.

Object

Paper
Add Code

Topology-Preserving Adversarial Training

no code implementations • 29 Nov 2023 • Xiaoyue Mi, Fan Tang, Yepeng Weng, Danding Wang, Juan Cao, Sheng Tang, Peng Li, Yang Liu

Despite the effectiveness in improving the robustness of neural networks, adversarial training has suffered from the natural accuracy degradation problem, i. e., accuracy on natural samples has reduced significantly.

Paper
Add Code

SAMPro3D: Locating SAM Prompts in 3D for Zero-Shot Scene Segmentation

1 code implementation • 29 Nov 2023 • Mutian Xu, Xingyilang Yin, Lingteng Qiu, Yang Liu, Xin Tong, Xiaoguang Han

We introduce SAMPro3D for zero-shot 3D indoor scene segmentation.

Scene Segmentation Scene Understanding +1

Paper
Code

Adversarial Robust Memory-Based Continual Learner

no code implementations • 29 Nov 2023 • Xiaoyue Mi, Fan Tang, Zonghan Yang, Danding Wang, Juan Cao, Peng Li, Yang Liu

Despite the remarkable advances that have been made in continual learning, the adversarial vulnerability of such methods has not been fully discussed.

Adversarial Robustness Continual Learning

Paper
Add Code

CESAR: Automatic Induction of Compositional Instructions for Multi-turn Dialogs

no code implementations • 29 Nov 2023 • Taha Aksu, Devamanyu Hazarika, Shikib Mehri, Seokhwan Kim, Dilek Hakkani-Tür, Yang Liu, Mahdi Namazifar

We apply CESAR on InstructDial, a benchmark for instruction-based dialog tasks.

Paper
Add Code

Q-learning Based Optimal False Data Injection Attack on Probabilistic Boolean Control Networks

no code implementations • 29 Nov 2023 • Xianlun Peng, Yang Tang, Fangfei Li, Yang Liu

In this paper, we present a reinforcement learning (RL) method for solving optimal false data injection attack problems in probabilistic Boolean control networks (PBCNs) where the attacker lacks knowledge of the system model.

Q-Learning reinforcement-learning +1

Paper
Add Code

EgoThink: Evaluating First-Person Perspective Thinking Capability of Vision-Language Models

1 code implementation • 27 Nov 2023 • Sijie Cheng, Zhicheng Guo, Jingwen Wu, Kechen Fang, Peng Li, Huaping Liu, Yang Liu

However, the capability of VLMs to "think" from a first-person perspective, a crucial attribute for advancing autonomous agents and robotics, remains largely unexplored.

Attribute Question Answering +1

Paper
Code

Animatable 3D Gaussian: Fast and High-Quality Reconstruction of Multiple Human Avatars

1 code implementation • 27 Nov 2023 • Yang Liu, Xiang Huang, Minghan Qin, Qinwei Lin, Haoqian Wang

Neural radiance fields are capable of reconstructing high-quality drivable human avatars but are expensive to train and render.

Novel View Synthesis

100

Paper
Code

SwiftLearn: A Data-Efficient Training Method of Deep Learning Models using Importance Sampling

no code implementations • 25 Nov 2023 • Habib Hajimolahoseini, Omar Mohamed Awad, Walid Ahmed, Austin Wen, Saina Asani, Mohammad Hassanpour, Farnoosh Javadi, Mehdi Ahmadi, Foozhan Ataiefard, Kangling Liu, Yang Liu

In this paper, we present SwiftLearn, a data-efficient approach to accelerate training of deep learning models using a subset of data samples selected during the warm-up stages of training.

Paper
Add Code

Data-Efficient Alignment of Large Language Models with Human Feedback Through Natural Language

no code implementations • 24 Nov 2023 • Di Jin, Shikib Mehri, Devamanyu Hazarika, Aishwarya Padmakumar, Sungjin Lee, Yang Liu, Mahdi Namazifar

Learning from human feedback is a prominent technique to align the output of large language models (LLMs) with human expectations.

Paper
Add Code

AdapterFL: Adaptive Heterogeneous Federated Learning for Resource-constrained Mobile Computing Systems

no code implementations • 23 Nov 2023 • Ruixuan Liu, Ming Hu, Zeke Xia, Jun Xia, Pengyu Zhang, Yihao Huang, Yang Liu, Mingsong Chen

On the one hand, to achieve model training in all the diverse clients, mobile computing systems can only use small low-performance models for collaborative learning.

Federated Learning

Paper
Add Code

AdaptiveFL: Adaptive Heterogeneous Federated Learning for Resource-Constrained AIoT Systems

no code implementations • 22 Nov 2023 • Chentao Jia, Ming Hu, Zekai Chen, Yanxin Yang, Xiaofei Xie, Yang Liu, Mingsong Chen

Although Federated Learning (FL) is promising to enable collaborative learning among Artificial Intelligence of Things (AIoT) devices, it suffers from the problem of low classification performance due to various heterogeneity factors (e. g., computing capacity, memory size) of devices and uncertain operating environments.

Federated Learning

Paper
Add Code

Mobile-Seed: Joint Semantic Segmentation and Boundary Detection for Mobile Robots

1 code implementation • 21 Nov 2023 • Youqi Liao, Shuhao Kang, Jianping Li, Yang Liu, Yun Liu, Zhen Dong, Bisheng Yang, Xieyuanli Chen

Our framework features a two-stream encoder, an active fusion decoder (AFD) and a dual-task regularization approach.

Boundary Detection Edge-computing +2

103

Paper
Code

DMLR: Data-centric Machine Learning Research -- Past, Present and Future

no code implementations • 21 Nov 2023 • Luis Oala, Manil Maskey, Lilith Bat-Leah, Alicia Parrish, Nezihe Merve Gürel, Tzu-Sheng Kuo, Yang Liu, Rotem Dror, Danilo Brajovic, Xiaozhe Yao, Max Bartolo, William A Gaviria Rojas, Ryan Hileman, Rainier Aliment, Michael W. Mahoney, Meg Risdal, Matthew Lease, Wojciech Samek, Debojyoti Dutta, Curtis G Northcutt, Cody Coleman, Braden Hancock, Bernard Koch, Girmaw Abebe Tadesse, Bojan Karlaš, Ahmed Alaa, Adji Bousso Dieng, Natasha Noy, Vijay Janapa Reddi, James Zou, Praveen Paritosh, Mihaela van der Schaar, Kurt Bollacker, Lora Aroyo, Ce Zhang, Joaquin Vanschoren, Isabelle Guyon, Peter Mattson

Drawing from discussions at the inaugural DMLR workshop at ICML 2023 and meetings prior, in this report we outline the relevance of community engagement and infrastructure development for the creation of next-generation public datasets that will advance machine learning science.

Paper
Add Code

Filling the Image Information Gap for VQA: Prompting Large Language Models to Proactively Ask Questions

1 code implementation • 20 Nov 2023 • Ziyue Wang, Chi Chen, Peng Li, Yang Liu

Large Language Models (LLMs) demonstrate impressive reasoning ability and the maintenance of world knowledge not only in natural language tasks, but also in some vision-language tasks such as open-domain knowledge-based visual question answering (OK-VQA).

Question Answering Visual Question Answering +1

Paper
Code

A Universal Framework for Accurate and Efficient Geometric Deep Learning of Molecular Systems

1 code implementation • Scientific Reports 2023 • Shuo Zhang, Yang Liu, Lei Xie

Molecular sciences address a wide range of problems involving molecules of different types and sizes and their complexes.

Ranked #1 on Drug Discovery on QM9

Paper
Code

Unmasking and Improving Data Credibility: A Study with Datasets for Training Harmless Language Models

1 code implementation • 19 Nov 2023 • Zhaowei Zhu, Jialu Wang, Hao Cheng, Yang Liu

Given the cost and difficulty of cleaning these datasets by humans, we introduce a systematic framework for evaluating the credibility of datasets, identifying label errors, and evaluating the influence of noisy labels in the curated language data, specifically focusing on unsafe comments and conversation classification.

Language Modelling

2,953

Paper
Code

AI-accelerated Discovery of Altermagnetic Materials

1 code implementation • 8 Nov 2023 • Ze-Feng Gao, Shuai Qu, Bocheng Zeng, Yang Liu, Ji-Rong Wen, Hao Sun, Peng-Jie Guo, Zhong-Yi Lu

Altermagnetism, a new magnetic phase, has been theoretically proposed and experimentally verified to be distinct from ferromagnetism and antiferromagnetism.

Paper
Code

Lagrangian Modelling and Motion Stability of Synchronous Generator Power Systems

no code implementations • 7 Nov 2023 • Feng Ji, Lu Gao, Chang Lin, Yang Liu

This paper proposes to analyze the motion stability of synchro-nous generator power systems using a Lagrangian model derived in the configuration space of generalized position and speed.

Paper
Add Code

Benchmarking Deep Facial Expression Recognition: An Extensive Protocol with Balanced Dataset in the Wild

no code implementations • 6 Nov 2023 • Gianmarco Ipinze Tutuianu, Yang Liu, Ari Alamäki, Janne Kauttonen

Facial expression recognition (FER) is a crucial part of human-computer interaction.

Benchmarking Facial Expression Recognition +3

Paper
Add Code

GQKVA: Efficient Pre-training of Transformers by Grouping Queries, Keys, and Values

no code implementations • 6 Nov 2023 • Farnoosh Javadi, Walid Ahmed, Habib Hajimolahoseini, Foozhan Ataiefard, Mohammad Hassanpour, Saina Asani, Austin Wen, Omar Mohamed Awad, Kangling Liu, Yang Liu

We tested our method on ViT, which achieved an approximate 0. 3% increase in accuracy while reducing the model size by about 4% in the task of image classification.

Image Classification

Paper
Add Code

Procedural Fairness Through Decoupling Objectionable Data Generating Components

1 code implementation • 5 Nov 2023 • Zeyu Tang, Jialu Wang, Yang Liu, Peter Spirtes, Kun Zhang

We reveal and address the frequently overlooked yet important issue of disguised procedural unfairness, namely, the potentially inadvertent alterations on the behavior of neutral (i. e., not problematic) aspects of data generating process, and/or the lack of procedural assurance of the greatest benefit of the least advantaged individuals.

Decision Making Fairness

Paper
Code

Few-shot Hybrid Domain Adaptation of Image Generators

1 code implementation • 30 Oct 2023 • Hengjia Li, Yang Liu, Linxuan Xia, Yuqi Lin, Tu Zheng, Zheng Yang, Wenxiao Wang, Xiaohui Zhong, Xiaobo Ren, Xiaofei He

Concretely, the distance loss blends the attributes of all target domains by reducing the distances from generated images to all target subspaces.

Domain Adaptation Semantic Similarity +1

Paper
Code

Sentence Bag Graph Formulation for Biomedical Distant Supervision Relation Extraction

1 code implementation • 29 Oct 2023 • Hao Zhang, Yang Liu, Xiaoyan Liu, Tianming Liang, Gaurav Sharma, Liang Xue, Maozu Guo

We introduce a novel graph-based framework for alleviating key challenges in distantly-supervised relation extraction and demonstrate its effectiveness in the challenging and important domain of biomedical data.

Relation Relation Extraction +1

Paper
Code

Failures Pave the Way: Enhancing Large Language Models through Tuning-free Rule Accumulation

1 code implementation • 24 Oct 2023 • Zeyuan Yang, Peng Li, Yang Liu

Large Language Models (LLMs) have showcased impressive performance.

Paper
Code

The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions

1 code implementation • 19 Oct 2023 • Siru Ouyang, Shuohang Wang, Yang Liu, Ming Zhong, Yizhu Jiao, Dan Iter, Reid Pryzant, Chenguang Zhu, Heng Ji, Jiawei Han

Recent progress in Large Language Models (LLMs) has produced models that exhibit remarkable performance across a variety of NLP tasks.

Paper
Code

Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models

no code implementations • 19 Oct 2023 • Zhihan Zhang, Shuohang Wang, Wenhao Yu, Yichong Xu, Dan Iter, Qingkai Zeng, Yang Liu, Chenguang Zhu, Meng Jiang

Large language models (LLMs) can perform a wide range of tasks by following natural language instructions, without the necessity of task-specific fine-tuning.

Paper
Add Code

A Multi-Scale Decomposition MLP-Mixer for Time Series Analysis

1 code implementation • 18 Oct 2023 • Shuhan Zhong, Sizhe Song, Weipeng Zhuo, Guanyao Li, Yang Liu, S. -H. Gary Chan

To handle the multi-scale temporal patterns and multivariate dependencies, we propose a novel temporal patching approach to model the time series as multi-scale patches, and employ MLPs to capture intra- and inter-patch variations and channel-wise correlations.

Anomaly Detection Imputation +2

Paper
Code

IRAD: Implicit Representation-driven Image Resampling against Adversarial Attacks

1 code implementation • 18 Oct 2023 • Yue Cao, Tianlin Li, Xiaofeng Cao, Ivor Tsang, Yang Liu, Qing Guo

The underlying rationale behind our idea is that image resampling can alleviate the influence of adversarial perturbations while preserving essential semantic information, thereby conferring an inherent advantage in defending against adversarial attacks.

Adversarial Robustness

Paper
Code

EvalCrafter: Benchmarking and Evaluating Large Video Generation Models

1 code implementation • 17 Oct 2023 • Yaofang Liu, Xiaodong Cun, Xuebo Liu, Xintao Wang, Yong Zhang, Haoxin Chen, Yang Liu, Tieyong Zeng, Raymond Chan, Ying Shan

For video generation, various open-sourced models and public-available services have been developed to generate high-quality videos.

Benchmarking Language Modelling +4

Paper
Code

Co-Learning Semantic-aware Unsupervised Segmentation for Pathological Image Registration

no code implementations • 17 Oct 2023 • Yang Liu, Shi Gu

Our results show that our method can accurately achieve the registration of pathological images and identify lesions even in challenging imaging modalities.

Image Registration Segmentation

Paper
Add Code

VFLAIR: A Research Library and Benchmark for Vertical Federated Learning

1 code implementation • 15 Oct 2023 • Tianyuan Zou, Zixuan Gu, Yu He, Hideaki Takahashi, Yang Liu, Ya-Qin Zhang

Vertical Federated Learning (VFL) has emerged as a collaborative training paradigm that allows participants with different features of the same group of users to accomplish cooperative training without exposing their raw data or model parameters.

Vertical Federated Learning

Paper
Code

Large Language Model Unlearning

1 code implementation • 14 Oct 2023 • Yuanshun Yao, Xiaojun Xu, Yang Liu

To the best of our knowledge, our work is among the first to explore LLM unlearning.

Language Modelling Large Language Model

Paper
Code

Graph Condensation via Eigenbasis Matching

no code implementations • 13 Oct 2023 • Yang Liu, Deyu Bo, Chuan Shi

The increasing amount of graph data places requirements on the efficiency and scalability of graph neural networks (GNNs), despite their effectiveness in various graph-related applications.

Paper
Add Code

PST: Improving Quantitative Trading via Program Sketch-based Tuning

no code implementations • 9 Oct 2023 • Zhiming Li, Junzhe Jiang, Yushi Cao, Aixin Cui, Bozhi Wu, Bo Li, Yang Liu, Dongning Sun

Particularly, PST first proposes using a novel symbolic program sketch to embed the abstract human expert knowledge of market trends.

Program Synthesis reinforcement-learning

Paper
Add Code

Fair Classifiers that Abstain without Harm

no code implementations • 9 Oct 2023 • Tongxin Yin, Jean-François Ton, Ruocheng Guo, Yuanshun Yao, Mingyan Liu, Yang Liu

To generalize the abstaining decisions to test samples, we then train a surrogate model to learn the abstaining decisions based on the IP solutions in an end-to-end manner.

Decision Making Fairness

Paper
Add Code

CCAE: A Corpus of Chinese-based Asian Englishes

no code implementations • 9 Oct 2023 • Yang Liu, Melissa Xiaohui Qin, Long Wang, Chao Huang

The ontology of data would make the corpus a helpful resource with enormous research potential for Asian Englishes (especially for Chinese Englishes for which there has not been a publicly accessible corpus yet so far) and an ideal source for variety-specific language modeling and downstream tasks, thus setting the stage for NLP-based World Englishes studies.

Language Modelling

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.