Search Results for author: Jie zhou

Found 502 papers, 278 papers with code

Scheduled Dialog Policy Learning: An Automatic Curriculum Learning Framework for Task-oriented Dialog System

no code implementations • Findings (ACL) 2021 • Sihong Liu, Jinchao Zhang, Keqing He, Weiran Xu, Jie zhou

Paper
Add Code

Unsupervised Dependency Graph Network

1 code implementation • ACL 2022 • Yikang Shen, Shawn Tan, Alessandro Sordoni, Peng Li, Jie zhou, Aaron Courville

We introduce a new model, the Unsupervised Dependency Graph Network (UDGN), that can induce dependency structures from raw corpora and the masked language modeling task.

Language Modelling Masked Language Modeling +3

Paper
Code

Divide and Denoise: Learning from Noisy Labels in Fine-Grained Entity Typing with Cluster-Wise Loss Correction

no code implementations • ACL 2022 • Kunyuan Pang, Haoyu Zhang, Jie zhou, Ting Wang

In this work, we propose a clustering-based loss correction framework named Feature Cluster Loss Correction (FCLC), to address these two problems.

Entity Typing

Paper
Add Code

Spatial Geometric Reasoning for Room Layout Estimation via Deep Reinforcement Learning

no code implementations • ECCV 2020 • Liangliang Ren, Yangyang Song, Jiwen Lu, Jie zhou

Unlike most existing works that define room layout on a 2D image, we model the layout in 3D as a configuration of the camera and the room.

reinforcement-learning Reinforcement Learning (RL) +2

Paper
Add Code

RoCBert: Robust Chinese Bert with Multimodal Contrastive Pretraining

no code implementations • ACL 2022 • Hui Su, Weiwei Shi, Xiaoyu Shen, Zhou Xiao, Tuo ji, Jiarui Fang, Jie zhou

Large-scale pretrained language models have achieved SOTA results on NLP tasks.

Contrastive Learning

Paper
Add Code

Bridging the Gap between Prior and Posterior Knowledge Selection for Knowledge-Grounded Dialogue Generation

no code implementations • EMNLP 2020 • Xiuyi Chen, Fandong Meng, Peng Li, Feilong Chen, Shuang Xu, Bo Xu, Jie zhou

Here, we deal with these issues on two aspects: (1) We enhance the prior selection module with the necessary posterior information obtained from the specially designed Posterior Information Prediction Module (PIPM); (2) We propose a Knowledge Distillation Based Training Strategy (KDBTS) to train the decoder with the knowledge selected from the prior distribution, removing the exposure bias of knowledge selection.

Dialogue Generation Knowledge Distillation

Paper
Add Code

Constructing Emotional Consensus and Utilizing Unpaired Data for Empathetic Dialogue Generation

no code implementations • Findings (EMNLP) 2021 • Lei Shen, Jinchao Zhang, Jiao Ou, Xiaofang Zhao, Jie zhou

To address the above issues, we propose a dual-generative model, Dual-Emp, to simultaneously construct the emotional consensus and utilize some external unpaired data.

Dialogue Generation

Paper
Add Code

Deep Credible Metric Learning for Unsupervised Domain Adaptation Person Re-identification

no code implementations • ECCV 2020 • Guangyi Chen, Yuhao Lu, Jiwen Lu, Jie Zhou

Experimental results demonstrate that our DCML method explores credible and valuable training data and improves the performance of unsupervised domain adaptation.

Metric Learning Person Re-Identification +2

Paper
Add Code

CodRED: A Cross-Document Relation Extraction Dataset for Acquiring Knowledge in the Wild

1 code implementation • EMNLP 2021 • Yuan YAO, Jiaju Du, Yankai Lin, Peng Li, Zhiyuan Liu, Jie zhou, Maosong Sun

Existing relation extraction (RE) methods typically focus on extracting relational facts between entity pairs within single sentences or documents.

Relation Relation Extraction

Paper
Code

Deep Hashing with Active Pairwise Supervision

no code implementations • ECCV 2020 • Ziwei Wang, Quan Zheng, Jiwen Lu, Jie zhou

n this paper, we propose a Deep Hashing method with Active Pairwise Supervision(DH-APS).

Deep Hashing

Paper
Add Code

Structural Deep Metric Learning for Room Layout Estimation

no code implementations • ECCV 2020 • Wenzhao Zheng, Jiwen Lu, Jie zhou

We employ a metric model and a layout encoder to map the RGB images and the ground-truth layouts to the embedding space, respectively, and a layout decoder to map the embeddings to the corresponding layouts, where the whole framework is trained in an end-to-end manner.

Metric Learning Room Layout Estimation

Paper
Add Code

MovieChats: Chat like Humans in a Closed Domain

no code implementations • EMNLP 2020 • Hui Su, Xiaoyu Shen, Zhou Xiao, Zheng Zhang, Ernie Chang, Cheng Zhang, Cheng Niu, Jie zhou

In this work, we take a close look at the movie domain and present a large-scale high-quality corpus with fine-grained annotations in hope of pushing the limit of movie-domain chatbots.

Chatbot Retrieval

Paper
Add Code

Dual Context-Guided Continuous Prompt Tuning for Few-Shot Learning

no code implementations • Findings (ACL) 2022 • Le Tian, Houjin Yu, Zhou Xiao, Hui Su, Jie zhou

Prompt-based paradigm has shown its competitive performance in many NLP tasks.

Few-Shot Learning

Paper
Add Code

BMInf: An Efficient Toolkit for Big Model Inference and Tuning

1 code implementation • ACL 2022 • Xu Han, Guoyang Zeng, Weilin Zhao, Zhiyuan Liu, Zhengyan Zhang, Jie zhou, Jun Zhang, Jia Chao, Maosong Sun

In recent years, large-scale pre-trained language models (PLMs) containing billions of parameters have achieved promising results on various NLP tasks.

Quantization Scheduling

565

Paper
Code

ECNU_ICA at SemEval-2022 Task 10: A Simple and Unified Model for Monolingual and Crosslingual Structured Sentiment Analysis

no code implementations • SemEval (NAACL) 2022 • Qi Zhang, Jie zhou, Qin Chen, Qingchun Bai, Jun Xiao, Liang He

The task aims to extract the structured sentiment information (e. g., holder, target, expression and sentiment polarity) in a text.

Event Extraction Sentiment Analysis

Paper
Add Code

RSGT: Relational Structure Guided Temporal Relation Extraction

no code implementations • COLING 2022 • Jie zhou, Shenpo Dong, Hongkui Tu, Xiaodong Wang, Yong Dou

In this paper, we propose RSGT: Relational Structure Guided Temporal Relation Extraction to extract the relational structure features that can fit for both inter-sentence and intra-sentence relations.

Ranked #1 on Temporal Relation Classification on MATRES

Natural Language Understanding Relation +2

Paper
Add Code

Rotation-robust Intersection over Union for 3D Object Detection

no code implementations • ECCV 2020 • Yu Zheng, Danyang Zhang, Sinan Xie, Jiwen Lu, Jie zhou

In this paper, we propose a Rotation-robust Intersection over Union ($ extit{RIoU}$) for 3D object detection, which aims to jointly learn the overlap of rotated bounding boxes.

3D Object Detection Object +1

Paper
Add Code

Temporal Coherence or Temporal Motion: Which is More Critical for Video-based Person Re-identification?

no code implementations • ECCV 2020 • Guangyi Chen, Yongming Rao, Jiwen Lu, Jie zhou

Specifically, we disentangle the video representation into the temporal coherence and motion parts and randomly change the scale of the temporal motion features as the adversarial noise.

Video-Based Person Re-Identification

Paper
Add Code

欺骗类动词的句法语义研究(On the Syntax and Semantics of Verbs of Cheating)

no code implementations • CCL 2021 • Shan Wang, Jie zhou

“欺骗是一种常见的社会现象, 但对欺骗类动词的研究十分有限。本文筛选“欺骗”类动词的单句并对其进行大规模的句法依存和语义依存分析。研究显示,“欺骗”类动词在句中作为从属词时, 可作为不同的句法成分和语义角色, 同时此类动词在句法功能上表现出高度的相似性。作为支配词的“欺骗”类动词, 承担不同句法功能时, 表现出不同的句法共现模式。语义上, 本文详细描述、解释了该类动词在语义密度、主客体角色、情境角色和事件关系等维度的语义依存特点。“欺骗”类动词的句法语义虽具有多样性, 但主要的句型为主谓宾句式, 而该句式中最常用的语义搭配模式是施事对涉事进行欺骗行为, 并对涉事产生影响。本研究结合依存语法和框架语义学, 融合定量统计和定性分析探究欺骗类动词的句法语义, 深化了对欺骗行为言语线索以及言说动词的研究。”

Paper
Add Code

Unsupervised Knowledge Selection for Dialogue Generation

1 code implementation • Findings (ACL) 2021 • Xiuyi Chen, Feilong Chen, Fandong Meng, Peng Li, Jie zhou

Dialogue Generation

Paper
Code

Toward Fully Exploiting Heterogeneous Corpus:A Decoupled Named Entity Recognition Model with Two-stage Training

no code implementations • Findings (ACL) 2021 • Yun Hu, Yeshuang Zhu, Jinchao Zhang, Changwen Zheng, Jie zhou

named-entity-recognition Named Entity Recognition +1

Paper
Add Code

Attending via both Fine-tuning and Compressing

no code implementations • Findings (ACL) 2021 • Jie zhou, Yuanbin Wu, Qin Chen, Xuanjing Huang, Liang He

Paper
Add Code

Do Pre-trained Models Benefit Knowledge Graph Completion? A Reliable Evaluation and a Reasonable Approach

1 code implementation • Findings (ACL) 2022 • Xin Lv, Yankai Lin, Yixin Cao, Lei Hou, Juanzi Li, Zhiyuan Liu, Peng Li, Jie zhou

In recent years, pre-trained language models (PLMs) have been shown to capture factual knowledge from massive texts, which encourages the proposal of PLM-based knowledge graph completion (KGC) models.

Knowledge Graph Completion Link Prediction

Paper
Code

TAKE: Topic-shift Aware Knowledge sElection for Dialogue Generation

1 code implementation • COLING 2022 • Chenxu Yang, Zheng Lin, Jiangnan Li, Fandong Meng, Weiping Wang, Lanrui Wang, Jie zhou

The knowledge selector generally constructs a query based on the dialogue context and selects the most appropriate knowledge to help response generation.

Dialogue Generation Knowledge Distillation +1

Paper
Code

NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods and Results

1 code implementation • 17 Apr 2024 • Xin Li, Kun Yuan, Yajing Pei, Yiting Lu, Ming Sun, Chao Zhou, Zhibo Chen, Radu Timofte, Wei Sun, HaoNing Wu, ZiCheng Zhang, Jun Jia, Zhichao Zhang, Linhan Cao, Qiubo Chen, Xiongkuo Min, Weisi Lin, Guangtao Zhai, Jianhui Sun, Tianyi Wang, Lei LI, Han Kong, Wenxuan Wang, Bing Li, Cheng Luo, Haiqiang Wang, Xiangguang Chen, Wenhui Meng, Xiang Pan, Huiying Shi, Han Zhu, Xiaozhong Xu, Lei Sun, Zhenzhong Chen, Shan Liu, Fangyuan Kong, Haotian Fan, Yifang Xu, Haoran Xu, Mengduo Yang, Jie zhou, Jiaze Li, Shijie Wen, Mai Xu, Da Li, Shunyu Yao, Jiazhi Du, WangMeng Zuo, Zhibo Li, Shuai He, Anlong Ming, Huiyuan Fu, Huadong Ma, Yong Wu, Fie Xue, Guozhi Zhao, Lina Du, Jie Guo, Yu Zhang, huimin zheng, JunHao Chen, Yue Liu, Dulan Zhou, Kele Xu, Qisheng Xu, Tao Sun, Zhixiang Ding, Yuhang Hu

This paper reviews the NTIRE 2024 Challenge on Shortform UGC Video Quality Assessment (S-UGC VQA), where various excellent solutions are submitted and evaluated on the collected dataset KVQ from popular short-form video platform, i. e., Kuaishou/Kwai Platform.

valid Video Quality Assessment +1

Paper
Code

Automatic Knowledge Graph Construction for Judicial Cases

no code implementations • 15 Apr 2024 • Jie zhou, Xin Chen, Hang Zhang, Zhe Li

Building on these results, we detail the automatic construction process of case knowledge graphs for judicial cases, enabling the assembly of knowledge graphs for hundreds of thousands of judgments.

graph construction Knowledge Graphs

Paper
Add Code

Comments as Natural Logic Pivots: Improve Code Generation via Comment Perspective

2 code implementations • 11 Apr 2024 • Yijie Chen, Yijin Liu, Fandong Meng, Yufeng Chen, Jinan Xu, Jie zhou

In this paper, we suggest that code comments are the natural logic pivot between natural language and code language and propose using comments to boost the code generation ability of code LLMs.

Code Generation

152

Paper
Code

UltraEval: A Lightweight Platform for Flexible and Comprehensive Evaluation for LLMs

1 code implementation • 11 Apr 2024 • Chaoqun He, Renjie Luo, Shengding Hu, Yuanqian Zhao, Jie zhou, Hanghao Wu, Jiajie Zhang, Xu Han, Zhiyuan Liu, Maosong Sun

The rapid development of LLMs calls for a lightweight and easy-to-use framework for swift evaluation deployment.

146

Paper
Code

Accelerating Inference in Large Language Models with a Unified Layer Skipping Strategy

2 code implementations • 10 Apr 2024 • Yijin Liu, Fandong Meng, Jie zhou

Recently, dynamic computation methods have shown notable acceleration for Large Language Models (LLMs) by skipping several layers of computations through elaborate heuristics or additional predictors.

Machine Translation Text Summarization

152

Paper
Code

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

2 code implementations • 9 Apr 2024 • Shengding Hu, Yuge Tu, Xu Han, Chaoqun He, Ganqu Cui, Xiang Long, Zhi Zheng, Yewei Fang, Yuxiang Huang, Weilin Zhao, Xinrong Zhang, Zheng Leng Thai, Kaihuo Zhang, Chongyi Wang, Yuan YAO, Chenyang Zhao, Jie zhou, Jie Cai, Zhongwu Zhai, Ning Ding, Chao Jia, Guoyang Zeng, Dahai Li, Zhiyuan Liu, Maosong Sun

For data scaling, we introduce a Warmup-Stable-Decay (WSD) learning rate scheduler (LRS), conducive to continuous training and domain adaptation.

Domain Adaptation

3,778

Paper
Code

CodeEnhance: A Codebook-Driven Approach for Low-Light Image Enhancement

no code implementations • 8 Apr 2024 • Xu Wu, Xianxu Hou, Zhihui Lai, Jie zhou, Ya-nan Zhang, Witold Pedrycz, Linlin Shen

Low-light image enhancement (LLIE) aims to improve low-illumination images.

Image Reconstruction Low-Light Image Enhancement

Paper
Add Code

LOGO: A Long-Form Video Dataset for Group Action Quality Assessment

3 code implementations • CVPR 2023 • Shiyi Zhang, Wenxun Dai, Sujia Wang, Xiangwei Shen, Jiwen Lu, Jie zhou, Yansong Tang

Action quality assessment (AQA) has become an emerging topic since it can be extensively applied in numerous scenarios.

Action Quality Assessment Action Segmentation

152

Paper
Code

DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery

1 code implementation • 1 Apr 2024 • Yixuan Zhu, Ao Li, Yansong Tang, Wenliang Zhao, Jie zhou, Jiwen Lu

The recovery of occluded human meshes presents challenges for current methods due to the difficulty in extracting effective image features under severe occlusion.

Denoising Human Mesh Recovery

Paper
Code

On Large Language Models' Hallucination with Regard to Known Facts

no code implementations • 29 Mar 2024 • Che Jiang, Biqing Qi, Xiangyu Hong, Dayuan Fu, Yang Cheng, Fandong Meng, Mo Yu, BoWen Zhou, Jie zhou

In hallucinated cases, the output token's information rarely demonstrates abrupt increases and consistent superiority in the later stages of the model.

Hallucination

Paper
Add Code

Boosting Conversational Question Answering with Fine-Grained Retrieval-Augmentation and Self-Check

no code implementations • 27 Mar 2024 • Linhao Ye, Zhikai Lei, Jianghao Yin, Qin Chen, Jie zhou, Liang He

Retrieval-Augmented Generation (RAG) aims to generate more reliable and accurate responses, by augmenting large language models (LLMs) with the external vast and dynamic knowledge.

Conversational Question Answering Retrieval

Paper
Add Code

Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models

1 code implementation • 19 Mar 2024 • Zuyan Liu, Yuhao Dong, Yongming Rao, Jie zhou, Jiwen Lu

In the realm of vision-language understanding, the proficiency of models in interpreting and reasoning over visual content has become a cornerstone for numerous applications.

Ranked #42 on Visual Question Answering on MM-Vet

visual instruction following Visual Question Answering

Paper
Code

RCdpia: A Renal Carcinoma Digital Pathology Image Annotation dataset based on pathologists

no code implementations • 17 Mar 2024 • Qingrong Sun, Weixiang Zhong, Jie zhou, Chong Lai, Xiaodong Teng, Maode Lai

The annotation of digital pathological slide data for renal cell carcinoma is of paramount importance for correct diagnosis of artificial intelligence models due to the heterogeneous nature of the tumor.

Paper
Add Code

Enhancing Event Causality Identification with Rationale and Structure-Aware Causal Question Answering

no code implementations • 17 Mar 2024 • Baiyan Zhang, Qin Chen, Jie zhou, Jian Jin, Liang He

In addition, we generate the rationales to explain why these events have causal relations.

Event Causality Identification Multiple-choice +2

Paper
Add Code

Learning Dual-Level Deformable Implicit Representation for Real-World Scale Arbitrary Super-Resolution

no code implementations • 16 Mar 2024 • Zhiheng Li, Muheng Li, Jixuan Fan, Lei Chen, Yansong Tang, Jie zhou, Jiwen Lu

Scale arbitrary super-resolution based on implicit image function gains increasing popularity since it can better represent the visual world in a continuous manner.

Super-Resolution

Paper
Add Code

Enhancing Depression-Diagnosis-Oriented Chat with Psychological State Tracking

no code implementations • 12 Mar 2024 • Yiyang Gu, Yougen Zhou, Qin Chen, Ningning Zhou, Jie zhou, Aimin Zhou, Liang He

Depression-diagnosis-oriented chat aims to guide patients in self-expression to collect key symptoms for depression detection.

Depression Detection Language Modelling +2

Paper
Add Code

Memory-based Adapters for Online 3D Scene Perception

no code implementations • 11 Mar 2024 • Xiuwei Xu, Chong Xia, Ziwei Wang, Linqing Zhao, Yueqi Duan, Jie zhou, Jiwen Lu

To this end, we propose an adapter-based plug-and-play module for the backbone of 3D scene perception model, which constructs memory to cache and aggregate the extracted RGB-D features to empower offline models with temporal learning ability.

Paper
Add Code

DiaHalu: A Dialogue-level Hallucination Evaluation Benchmark for Large Language Models

no code implementations • 1 Mar 2024 • Kedi Chen, Qin Chen, Jie zhou, Yishen He, Liang He

Since large language models (LLMs) achieve significant success in recent years, the hallucination issue remains a challenge, numerous benchmarks are proposed to detect the hallucination.

Hallucination Hallucination Evaluation +1

Paper
Add Code

A Regularization-based Transfer Learning Method for Information Extraction via Instructed Graph Decoder

1 code implementation • 1 Mar 2024 • Kedi Chen, Jie zhou, Qin Chen, Shunyu Liu, Liang He

Information extraction (IE) aims to extract complex structured information from the text.

Transfer Learning

Paper
Code

Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment

no code implementations • 29 Feb 2024 • Yiju Guo, Ganqu Cui, Lifan Yuan, Ning Ding, Jiexin Wang, Huimin Chen, Bowen Sun, Ruobing Xie, Jie zhou, Yankai Lin, Zhiyuan Liu, Maosong Sun

In practice, the multifaceted nature of human preferences inadvertently introduces what is known as the "alignment tax" -a compromise where enhancements in alignment within one objective (e. g., harmlessness) can diminish performance in others (e. g., helpfulness).

Navigate

Paper
Add Code

Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation

1 code implementation • 28 Feb 2024 • Shicheng Xu, Liang Pang, Mo Yu, Fandong Meng, HuaWei Shen, Xueqi Cheng, Jie zhou

Retrieval-augmented generation (RAG) enhances large language models (LLMs) by incorporating additional information from retrieval.

Code Generation In-Context Learning +5

Paper
Code

Learning Intrinsic Dimension via Information Bottleneck for Explainable Aspect-based Sentiment Analysis

no code implementations • 28 Feb 2024 • Zhenxiao Cheng, Jie zhou, Wen Wu, Qin Chen, Liang He

To address this, we propose the Information Bottleneck-based Gradient (\texttt{IBG}) explanation framework for ABSA.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +1

Paper
Add Code

Let's Rectify Step by Step: Improving Aspect-based Sentiment Analysis with Diffusion Models

1 code implementation • 23 Feb 2024 • Shunyu Liu, Jie zhou, Qunxi Zhu, Qin Chen, Qingchun Bai, Jun Xiao, Liang He

Aspect-Based Sentiment Analysis (ABSA) stands as a crucial task in predicting the sentiment polarity associated with identified aspects within text.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +1

Paper
Code

LLM-DA: Data Augmentation via Large Language Models for Few-Shot Named Entity Recognition

no code implementations • 22 Feb 2024 • Junjie Ye, Nuo Xu, Yikun Wang, Jie zhou, Qi Zhang, Tao Gui, Xuanjing Huang

To overcome the limitations of existing data augmentation methods that compromise semantic integrity and address the uncertainty inherent in LLM-generated text, we leverage the distinctive characteristics of the NER task by augmenting the original data at both the contextual and entity levels.

Data Augmentation few-shot-ner +5

Paper
Add Code

Domain Generalization via Causal Adjustment for Cross-Domain Sentiment Analysis

no code implementations • 22 Feb 2024 • Siyin Wang, Jie zhou, Qin Chen, Qi Zhang, Tao Gui, Xuanjing Huang

Domain adaption has been widely adapted for cross-domain sentiment analysis to transfer knowledge from the source domain to the target domain.

Domain Generalization Sentiment Analysis

Paper
Add Code

Graph Representation of Narrative Context: Coherence Dependency via Retrospective Questions

no code implementations • 21 Feb 2024 • Liyan Xu, Jiangnan Li, Mo Yu, Jie zhou

This work introduces a novel and practical paradigm for narrative comprehension, stemming from the observation that individual passages within narratives are often cohesively related than being isolated.

Retrieval

Paper
Add Code

Identifying Factual Inconsistency in Summaries: Towards Effective Utilization of Large Language Model

no code implementations • 20 Feb 2024 • Liyan Xu, Zhenlin Su, Mo Yu, Jin Xu, Jinho D. Choi, Jie zhou, Fei Liu

Factual inconsistency poses a significant hurdle for the commercial deployment of abstractive summarizers.

Language Modelling Large Language Model +2

Paper
Add Code

3D Vascular Segmentation Supervised by 2D Annotation of Maximum Intensity Projection

1 code implementation • 19 Feb 2024 • Zhanqiang Guo, Zimeng Tan, Jianjiang Feng, Jie zhou

To alleviate this issue, we employ maximum intensity projection (MIP) to decrease the dimensionality of 3D volume to 2D image for efficient annotation, and the 2D labels are utilized to provide guidance and oversight for training 3D vessel segmentation model.

Organ Segmentation Segmentation

Paper
Code

Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents

1 code implementation • 17 Feb 2024 • Wenkai Yang, Xiaohan Bi, Yankai Lin, Sishuo Chen, Jie zhou, Xu sun

We first formulate a general framework of agent backdoor attacks, then we present a thorough analysis on the different forms of agent backdoor attacks.

Backdoor Attack Data Poisoning

Paper
Code

Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents

1 code implementation • 14 Feb 2024 • Cheng Qian, Bingxiang He, Zhong Zhuang, Jia Deng, Yujia Qin, Xin Cong, Zhong Zhang, Jie zhou, Yankai Lin, Zhiyuan Liu, Maosong Sun

Current language model-driven agents often lack mechanisms for effective user participation, which is crucial given the vagueness commonly found in user instructions.

Language Modelling

Paper
Code

Previously on the Stories: Recap Snippet Identification for Story Reading

no code implementations • 11 Feb 2024 • Jiangnan Li, Qiujing Wang, Liyan Xu, Wenjie Pang, Mo Yu, Zheng Lin, Weiping Wang, Jie zhou

Similar to the "previously-on" scenes in TV shows, recaps can help book reading by recalling the readers' memory about the important elements in previous texts to better understand the ongoing plot.

Paper
Add Code

NPSVC++: Nonparallel Classifiers Encounter Representation Learning

1 code implementation • 8 Feb 2024 • Junhong Zhang, Zhihui Lai, Jie zhou, Guangfei Liang

This paper focuses on a specific family of classifiers called nonparallel support vector classifiers (NPSVCs).

Representation Learning

Paper
Code

On Prompt-Driven Safeguarding for Large Language Models

1 code implementation • 31 Jan 2024 • Chujie Zheng, Fan Yin, Hao Zhou, Fandong Meng, Jie zhou, Kai-Wei Chang, Minlie Huang, Nanyun Peng

Prepending model inputs with safety prompts is a common practice for safeguarding large language models (LLMs) from complying with queries that contain harmful intents.

Paper
Code

Path Choice Matters for Clear Attribution in Path Methods

1 code implementation • 19 Jan 2024 • Borui Zhang, Wenzhao Zheng, Jie zhou, Jiwen Lu

Rigorousness and clarity are both essential for interpretations of DNNs to engender human trust.

Paper
Code

MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer

1 code implementation • 18 Jan 2024 • Changyao Tian, Xizhou Zhu, Yuwen Xiong, Weiyun Wang, Zhe Chen, Wenhai Wang, Yuntao Chen, Lewei Lu, Tong Lu, Jie zhou, Hongsheng Li, Yu Qiao, Jifeng Dai

Developing generative models for interleaved image-text data has both research and practical value.

157

Paper
Code

Generative Multi-Modal Knowledge Retrieval with Large Language Models

no code implementations • 16 Jan 2024 • Xinwei Long, Jiali Zeng, Fandong Meng, Zhiyuan Ma, Kaiyan Zhang, BoWen Zhou, Jie zhou

Knowledge retrieval with multi-modal queries plays a crucial role in supporting knowledge-intensive multi-modal applications.

Retrieval

Paper
Add Code

Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications

1 code implementation • 11 Jan 2024 • Yuwen Xiong, Zhiqi Li, Yuntao Chen, Feng Wang, Xizhou Zhu, Jiapeng Luo, Wenhai Wang, Tong Lu, Hongsheng Li, Yu Qiao, Lewei Lu, Jie zhou, Jifeng Dai

The advancements in speed and efficiency of DCNv4, combined with its robust performance across diverse vision tasks, show its potential as a foundational building block for future vision models.

Image Classification Image Generation +1

326

Paper
Code

Domain Similarity-Perceived Label Assignment for Domain Generalized Underwater Object Detection

no code implementations • 20 Dec 2023 • Xisheng Li, Wei Li, Pinhao Song, Mingjun Zhang, Jie zhou

The inherent characteristics and light fluctuations of water bodies give rise to the huge difference between different layers and regions in underwater environments.

Data Augmentation object-detection +1

Paper
Add Code

MELO: Enhancing Model Editing with Neuron-Indexed Dynamic LoRA

1 code implementation • 19 Dec 2023 • Lang Yu, Qin Chen, Jie zhou, Liang He

Large language models (LLMs) have shown great success in various Natural Language Processing (NLP) tasks, whist they still need updates after deployment to fix errors or keep pace with the changing knowledge in the world.

Document Classification Hallucination +2

Paper
Code

A Soft Contrastive Learning-based Prompt Model for Few-shot Sentiment Analysis

no code implementations • 16 Dec 2023 • Jingyi Zhou, Jie zhou, Jiabao Zhao, Siyin Wang, Haijun Shan, Gui Tao, Qi Zhang, Xuanjing Huang

Few-shot text classification has attracted great interest in both academia and industry due to the lack of labeled data in many fields.

Contrastive Learning Few-Shot Text Classification +4

Paper
Add Code

Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft

no code implementations • 14 Dec 2023 • Hao Li, Xue Yang, Zhaokai Wang, Xizhou Zhu, Jie zhou, Yu Qiao, Xiaogang Wang, Hongsheng Li, Lewei Lu, Jifeng Dai

Many reinforcement learning environments (e. g., Minecraft) provide only sparse rewards that indicate task completion or failure with binary values.

reinforcement-learning

Paper
Add Code

Mathematical Language Models: A Survey

no code implementations • 12 Dec 2023 • Wentao Liu, Hanglei Hu, Jie zhou, Yuyang Ding, Junsong Li, Jiayi Zeng, Mengliang He, Qin Chen, Bo Jiang, Aimin Zhou, Liang He

In recent years, there has been remarkable progress in leveraging Language Models (LMs), encompassing Pre-trained Language Models (PLMs) and Large-scale Language Models (LLMs), within the domain of mathematics.

Paper
Add Code

PointVoxel: A Simple and Effective Pipeline for Multi-View Multi-Modal 3D Human Pose Estimation

no code implementations • 11 Dec 2023 • Zhiyu Pan, Zhicheng Zhong, Wenxuan Guo, Yifan Chen, Jianjiang Feng, Jie zhou

We fill this gap by introducing a pipeline called PointVoxel that fuses multi-view RGB and pointcloud inputs to obtain 3D human poses.

3D Human Pose Estimation Unsupervised Domain Adaptation

Paper
Add Code

HumanReg: Self-supervised Non-rigid Registration of Human Point Cloud

1 code implementation • 9 Dec 2023 • Yifan Chen, Zhiyu Pan, Zhicheng Zhong, Wenxuan Guo, Jianjiang Feng, Jie zhou

In this paper, we present a novel registration framework, HumanReg, that learns a non-rigid transformation between two human point clouds end-to-end.

Paper
Code

LiDAR-based Person Re-identification

1 code implementation • 5 Dec 2023 • Wenxuan Guo, Zhiyu Pan, Yingping Liang, Ziheng Xi, Zhi Chen Zhong, Jianjiang Feng, Jie zhou

Camera-based person re-identification (ReID) systems have been widely applied in the field of public security.

Person Re-Identification Point Cloud Completion

Paper
Code

Fingerprint Matching with Localized Deep Representation

no code implementations • 30 Nov 2023 • Yongjie Duan, Zhiyu Pan, Jianjiang Feng, Jie zhou

The matching scores produced by LDRF also exhibit intuitive statistical characteristics, which led us to propose a matching score normalization technique to mitigate the uncertainty in the cases of very small overlapping area.

Paper
Add Code

SelfOcc: Self-Supervised Vision-Based 3D Occupancy Prediction

1 code implementation • 21 Nov 2023 • Yuanhui Huang, Wenzhao Zheng, Borui Zhang, Jie zhou, Jiwen Lu

Our SelfOcc outperforms the previous best method SceneRF by 58. 7% using a single frame as input on SemanticKITTI and is the first self-supervised work that produces reasonable 3D occupancy for surround cameras on nuScenes.

Autonomous Driving Monocular Depth Estimation

222

Paper
Code

AKConv: Convolutional Kernel with Arbitrary Sampled Shapes and Arbitrary Number of Parameters

1 code implementation • 20 Nov 2023 • Xin Zhang, Yingze Song, Tingting Song, Degang Yang, Yichen Ye, Jie zhou, Liming Zhang

In response to the above questions, the Alterable Kernel Convolution (AKConv) is explored in this work, which gives the convolution kernel an arbitrary number of parameters and arbitrary sampled shapes to provide richer options for the trade-off between network overhead and performance.

object-detection Object Detection

Paper
Code

LiDAR-HMR: 3D Human Mesh Recovery from LiDAR

2 code implementations • 20 Nov 2023 • Bohao Fan, Wenzhao Zheng, Jianjiang Feng, Jie zhou

In recent years, point cloud perception tasks have been garnering increasing attention.

Ranked #1 on 3D Human Pose Estimation on SLOPER4D

3D Human Pose Estimation Human Mesh Recovery

Paper
Code

MAVEN-Arg: Completing the Puzzle of All-in-One Event Understanding Dataset with Event Argument Annotation

1 code implementation • 15 Nov 2023 • Xiaozhi Wang, Hao Peng, Yong Guan, Kaisheng Zeng, Jianhui Chen, Lei Hou, Xu Han, Yankai Lin, Zhiyuan Liu, Ruobing Xie, Jie zhou, Juanzi Li

Understanding events in texts is a core objective of natural language understanding, which requires detecting event occurrences, extracting event arguments, and analyzing inter-event relationships.

Event Argument Extraction Event Detection +3

Paper
Code

Enabling Large Language Models to Learn from Rules

no code implementations • 15 Nov 2023 • Wenkai Yang, Yankai Lin, Jie zhou, JiRong Wen

The current knowledge learning paradigm of LLMs is mainly based on learning from examples, in which LLMs learn the internal rule implicitly from a certain number of supervised examples.

Paper
Add Code

Eval-GCSC: A New Metric for Evaluating ChatGPT's Performance in Chinese Spelling Correction

1 code implementation • 14 Nov 2023 • Kunting Li, Yong Hu, Shaolei Wang, Hanhan Ma, Liang He, Fandong Meng, Jie zhou

However, in the Chinese Spelling Correction (CSC) task, we observe a discrepancy: while ChatGPT performs well under human evaluation, it scores poorly according to traditional metrics.

Paper
Code

RECALL: A Benchmark for LLMs Robustness against External Counterfactual Knowledge

no code implementations • 14 Nov 2023 • Yi Liu, Lianzhe Huang, Shicheng Li, Sishuo Chen, Hao Zhou, Fandong Meng, Jie zhou, Xu sun

Therefore, to evaluate the ability of LLMs to discern the reliability of external knowledge, we create a benchmark from existing knowledge bases.

counterfactual Knowledge Graphs +2

Paper
Add Code

TEAL: Tokenize and Embed ALL for Multi-modal Large Language Models

no code implementations • 8 Nov 2023 • Zhen Yang, Yingxue Zhang, Fandong Meng, Jie zhou

Specifically, for the input from any modality, TEAL first discretizes it into a token sequence with the off-the-shelf tokenizer and embeds the token sequence into a joint embedding space with a learnable embedding matrix.

Paper
Add Code

Improving Machine Translation with Large Language Models: A Preliminary Study with Cooperative Decoding

no code implementations • 6 Nov 2023 • Jiali Zeng, Fandong Meng, Yongjing Yin, Jie zhou

Contemporary translation engines built upon the encoder-decoder framework have reached a high level of development, while the emergence of Large Language Models (LLMs) has disrupted their position by offering the potential for achieving superior translation quality.

Machine Translation NMT +1

Paper
Add Code

Universal Multi-modal Multi-domain Pre-trained Recommendation

no code implementations • 3 Nov 2023 • Wenqi Sun, Ruobing Xie, Shuqing Bian, Wayne Xin Zhao, Jie zhou

There is a rapidly-growing research interest in modeling user preferences via pre-training multi-domain interactions for recommender systems.

Recommendation Systems

Paper
Add Code

Plot Retrieval as an Assessment of Abstract Semantic Association

no code implementations • 3 Nov 2023 • Shicheng Xu, Liang Pang, Jiangnan Li, Mo Yu, Fandong Meng, HuaWei Shen, Xueqi Cheng, Jie zhou

Readers usually only give an abstract and vague description as the query based on their own understanding, summaries, or speculations of the plot, which requires the retrieval model to have a strong ability to estimate the abstract semantic associations between the query and candidate plots.

Information Retrieval Retrieval

Paper
Add Code

Exploring Unified Perspective For Fast Shapley Value Estimation

1 code implementation • 2 Nov 2023 • Borui Zhang, Baotong Tian, Wenzhao Zheng, Jie zhou, Jiwen Lu

Shapley values have emerged as a widely accepted and trustworthy tool, grounded in theoretical axioms, for addressing challenges posed by black-box models like deep neural networks.

Paper
Code

MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory

1 code implementation • NeurIPS 2023 • Yinan Liang, Ziwei Wang, Xiuwei Xu, Yansong Tang, Jie zhou, Jiwen Lu

Due to the high price and heavy energy consumption of GPUs, deploying deep models on IoT devices such as microcontrollers makes significant contributions for ecological AI.

Image Classification

Paper
Code

Variator: Accelerating Pre-trained Models with Plug-and-Play Compression Modules

1 code implementation • 24 Oct 2023 • Chaojun Xiao, Yuqi Luo, Wenbin Zhang, Pengle Zhang, Xu Han, Yankai Lin, Zhengyan Zhang, Ruobing Xie, Zhiyuan Liu, Maosong Sun, Jie zhou

Pre-trained language models (PLMs) have achieved remarkable results on NLP tasks but at the expense of huge parameter sizes and the consequent computational costs.

Computational Efficiency

Paper
Code

Thoroughly Modeling Multi-domain Pre-trained Recommendation as Language

no code implementations • 20 Oct 2023 • Zekai Qu, Ruobing Xie, Chaojun Xiao, Yuan YAO, Zhiyuan Liu, Fengzong Lian, Zhanhui Kang, Jie zhou

With the thriving of pre-trained language model (PLM) widely verified in various of NLP tasks, pioneer efforts attempt to explore the possible cooperation of the general textual information in PLM with the personalized behavioral information in user historical behavior sequences to enhance sequential recommendation (SR).

Informativeness Language Modelling +1

Paper
Add Code

Boosting Inference Efficiency: Unleashing the Power of Parameter-Shared Pre-trained Language Models

no code implementations • 19 Oct 2023 • Weize Chen, Xiaoyue Xu, Xu Han, Yankai Lin, Ruobing Xie, Zhiyuan Liu, Maosong Sun, Jie zhou

Parameter-shared pre-trained language models (PLMs) have emerged as a successful approach in resource-constrained environments, enabling substantial reductions in model storage and memory costs without significant performance compromise.

Paper
Add Code

DCRNN: A Deep Cross approach based on RNN for Partial Parameter Sharing in Multi-task Learning

no code implementations • 18 Oct 2023 • Jie zhou, Qian Yu

The model has three innovations: 1) It adopts the idea of cross network and uses RNN network to cross-process the features, thereby effectively improves the expressive ability of the model; 2) It innovatively proposes the structure of partial parameter sharing; 3) It can effectively capture the potential correlation between different tasks to optimize the efficiency and methods for learning different tasks.

Multi-Task Learning Recommendation Systems

Paper
Add Code

RethinkingTMSC: An Empirical Study for Target-Oriented Multimodal Sentiment Classification

1 code implementation • 14 Oct 2023 • Junjie Ye, Jie zhou, Junfeng Tian, Rui Wang, Qi Zhang, Tao Gui, Xuanjing Huang

Recently, Target-oriented Multimodal Sentiment Classification (TMSC) has gained significant attention among scholars.

Sentiment Analysis Sentiment Classification

Paper
Code

XAL: EXplainable Active Learning Makes Classifiers Better Low-resource Learners

1 code implementation • 9 Oct 2023 • Yun Luo, Zhen Yang, Fandong Meng, Yingjie Li, Fang Guo, Qinglin Qi, Jie zhou, Yue Zhang

Active learning (AL), which aims to construct an effective training set by iteratively curating the most formative unlabeled data for annotation, has been widely used in low-resource tasks.

Active Learning text-classification +1

Paper
Code

C^2M-DoT: Cross-modal consistent multi-view medical report generation with domain transfer network

no code implementations • 9 Oct 2023 • Ruizhi Wang, Xiangtao Wang, Jie zhou, Thomas Lukasiewicz, Zhenghua Xu

In addition, word-level optimization based on numbers ignores the semantics of reports and medical images, and the generated reports often cannot achieve good performance.

Contrastive Learning Medical Report Generation

Paper
Add Code

Enhancing Argument Structure Extraction with Efficient Leverage of Contextual Information

1 code implementation • 8 Oct 2023 • Yun Luo, Zhen Yang, Fandong Meng, Yingjie Li, Jie zhou, Yue Zhang

However, we observe that merely concatenating sentences in a contextual window does not fully utilize contextual information and can sometimes lead to excessive attention on less informative sentences.

Paper
Code

Skip-Plan: Procedure Planning in Instructional Videos via Condensed Action Space Learning

1 code implementation • ICCV 2023 • Zhiheng Li, Wenjia Geng, Muheng Li, Lei Chen, Yansong Tang, Jiwen Lu, Jie zhou

By this means, our model explores all sorts of reliable sub-relations within an action sequence in the condensed action space.

Paper
Code

TCOVIS: Temporally Consistent Online Video Instance Segmentation

1 code implementation • ICCV 2023 • Junlong Li, Bingyao Yu, Yongming Rao, Jie zhou, Jiwen Lu

The core of our method consists of a global instance assignment strategy and a spatio-temporal enhancement module, which improve the temporal consistency of the features from two aspects.

Instance Segmentation Semantic Segmentation +1

Paper
Code

Introspective Deep Metric Learning

2 code implementations • 11 Sep 2023 • Chengkun Wang, Wenzhao Zheng, Zheng Zhu, Jie zhou, Jiwen Lu

This paper proposes an introspective deep metric learning (IDML) framework for uncertainty-aware comparisons of images.

Image Retrieval Metric Learning

Paper
Code

Towards Better Multi-modal Keyphrase Generation via Visual Entity Enhancement and Multi-granularity Image Noise Filtering

1 code implementation • 9 Sep 2023 • Yifan Dong, Suhang Wu, Fandong Meng, Jie zhou, Xiaoli Wang, Jianxin Lin, Jinsong Su

2) the input text and image are often not perfectly matched, and thus the image may introduce noise into the model.

Image Captioning Image-text matching +2

Paper
Code

AMLP:Adaptive Masking Lesion Patches for Self-supervised Medical Image Segmentation

no code implementations • 8 Sep 2023 • Xiangtao Wang, Ruizhi Wang, Jie zhou, Thomas Lukasiewicz, Zhenghua Xu

The proposed strategies effectively address limitations in applying masked modeling to medical images, tailored to capturing fine lesion details vital for segmentation tasks.

Image Segmentation Medical Image Segmentation +3

Paper
Add Code

Large Language Models Are Not Robust Multiple Choice Selectors

1 code implementation • 7 Sep 2023 • Chujie Zheng, Hao Zhou, Fandong Meng, Jie zhou, Minlie Huang

This work shows that modern LLMs are vulnerable to option position changes in MCQs due to their inherent "selection bias", namely, they prefer to select specific option IDs as answers (like "Option A").

Computational Efficiency Multiple-choice +1

Paper
Code

Exploring the Robustness of Human Parsers Towards Common Corruptions

no code implementations • 2 Sep 2023 • Sanyi Zhang, Xiaochun Cao, Rui Wang, Guo-Jun Qi, Jie zhou

The experimental results show that the proposed method demonstrates good universality which can improve the robustness of the human parsing models and even the semantic segmentation models when facing various image common corruptions.

Data Augmentation Human Parsing +1

Paper
Add Code

PointOcc: Cylindrical Tri-Perspective View for Point-based 3D Semantic Occupancy Prediction

1 code implementation • 31 Aug 2023 • Sicheng Zuo, Wenzhao Zheng, Yuanhui Huang, Jie zhou, Jiwen Lu

To address this, we propose a cylindrical tri-perspective view to represent point clouds effectively and comprehensively and a PointOcc model to process them efficiently.

3D Semantic Occupancy Prediction Autonomous Driving +2

102

Paper
Code

Improving Translation Faithfulness of Large Language Models via Augmenting Instructions

1 code implementation • 24 Aug 2023 • Yijie Chen, Yijin Liu, Fandong Meng, Yufeng Chen, Jinan Xu, Jie zhou

The experimental results demonstrate significant improvements in translation performance with SWIE based on BLOOMZ-3b, particularly in zero-shot and long text translations due to reduced instruction forgetting risk.

Instruction Following Machine Translation +2

Paper
Code

Instruction Position Matters in Sequence Generation with Large Language Models

1 code implementation • 23 Aug 2023 • Yijin Liu, Xianfeng Zeng, Fandong Meng, Jie zhou

Large language models (LLMs) are capable of performing conditional sequence generation tasks, such as translation or summarization, through instruction fine-tuning.

Instruction Following Position +2

Paper
Code

AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors

1 code implementation • 21 Aug 2023 • Weize Chen, Yusheng Su, Jingwei Zuo, Cheng Yang, Chenfei Yuan, Chi-Min Chan, Heyang Yu, Yaxi Lu, Yi-Hsin Hung, Chen Qian, Yujia Qin, Xin Cong, Ruobing Xie, Zhiyuan Liu, Maosong Sun, Jie zhou

Autonomous agents empowered by Large Language Models (LLMs) have undergone significant improvements, enabling them to generalize across a broad spectrum of tasks.

3,633

Paper
Code

An Empirical Study of Catastrophic Forgetting in Large Language Models During Continual Fine-tuning

1 code implementation • 17 Aug 2023 • Yun Luo, Zhen Yang, Fandong Meng, Yafu Li, Jie zhou, Yue Zhang

Catastrophic forgetting (CF) is a phenomenon that occurs in machine learning when a model forgets previously learned information while acquiring new knowledge.

Reading Comprehension

Paper
Code

Towards Multiple References Era -- Addressing Data Leakage and Limited Reference Diversity in NLG Evaluation

1 code implementation • 6 Aug 2023 • Xianfeng Zeng, Yijin Liu, Fandong Meng, Jie zhou

To address this issue, we propose to utilize \textit{multiple references} to enhance the consistency between these metrics and human evaluations.

nlg evaluation Text Generation

Paper
Code

EduChat: A Large-Scale Language Model-based Chatbot System for Intelligent Education

1 code implementation • 5 Aug 2023 • Yuhao Dan, Zhikai Lei, Yiyang Gu, Yong Li, Jianghao Yin, Jiaju Lin, Linhao Ye, Zhiyan Tie, Yougen Zhou, Yilei Wang, Aimin Zhou, Ze Zhou, Qin Chen, Jie zhou, Liang He, Xipeng Qiu

Currently, EduChat is available online as an open-source project, with its code, data, and model parameters available on platforms (e. g., GitHub https://github. com/icalk-nlp/EduChat, Hugging Face https://huggingface. co/ecnu-icalk ).

Chatbot Language Modelling +1

594

Paper
Code

Human-M3: A Multi-view Multi-modal Dataset for 3D Human Pose Estimation in Outdoor Scenes

1 code implementation • 1 Aug 2023 • Bohao Fan, Siqi Wang, Wenxuan Guo, Wenzhao Zheng, Jianjiang Feng, Jie zhou

In this article, we propose Human-M3, an outdoor multi-modal multi-view multi-person human pose database which includes not only multi-view RGB videos of outdoor scenes but also corresponding pointclouds.

3D Human Pose Estimation

Paper
Code

ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs

1 code implementation • 31 Jul 2023 • Yujia Qin, Shihao Liang, Yining Ye, Kunlun Zhu, Lan Yan, Yaxi Lu, Yankai Lin, Xin Cong, Xiangru Tang, Bill Qian, Sihan Zhao, Lauren Hong, Runchu Tian, Ruobing Xie, Jie zhou, Mark Gerstein, Dahai Li, Zhiyuan Liu, Maosong Sun

Based on ToolBench, we fine-tune LLaMA to obtain an LLM ToolLLaMA, and equip it with a neural API retriever to recommend appropriate APIs for each instruction.

Ranked #3 on Trajectory Planning on ToolBench

Trajectory Planning Zero-shot Generalization

4,404

Paper
Code

Towards Codable Watermarking for Injecting Multi-bits Information to LLMs

1 code implementation • 29 Jul 2023 • Lean Wang, Wenkai Yang, Deli Chen, Hao Zhou, Yankai Lin, Fandong Meng, Jie zhou, Xu sun

As large language models (LLMs) generate texts with increasing fluency and realism, there is a growing need to identify the source of texts to prevent the abuse of LLMs.

Language Modelling

Paper
Code

Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models

1 code implementation • ICCV 2023 • Ziyi Wang, Xumin Yu, Yongming Rao, Jie zhou, Jiwen Lu

In this paper, we propose a novel 3D-to-2D generative pre-training method that is adaptable to any point cloud model.

Ranked #6 on 3D Part Segmentation on ShapeNet-Part

3D Part Segmentation 3D Point Cloud Classification

Paper
Code

TIM: Teaching Large Language Models to Translate with Comparison

1 code implementation • 10 Jul 2023 • Jiali Zeng, Fandong Meng, Yongjing Yin, Jie zhou

Open-sourced large language models (LLMs) have demonstrated remarkable efficacy in various tasks with instruction tuning.

Translation

Paper
Code

Soft Language Clustering for Multilingual Model Pre-training

no code implementations • 13 Jun 2023 • Jiali Zeng, Yufan Jiang, Yongjing Yin, Yi Jing, Fandong Meng, Binghuai Lin, Yunbo Cao, Jie zhou

Multilingual pre-trained language models have demonstrated impressive (zero-shot) cross-lingual transfer abilities, however, their performance is hindered when the target language has distant typology from source languages or when pre-training data is limited in size.

Clustering Question Answering +5

Paper
Add Code

Towards Accurate Data-free Quantization for Diffusion Models

no code implementations • 30 May 2023 • Changyuan Wang, Ziwei Wang, Xiuwei Xu, Yansong Tang, Jie zhou, Jiwen Lu

On the contrary, we design group-wise quantization functions for activation discretization in different timesteps and sample the optimal timestep for informative calibration image generation, so that our quantized diffusion model can reduce the discretization errors with negligible computational overhead.

Data Free Quantization Image Generation

Paper
Add Code

Emergent Modularity in Pre-trained Transformers

1 code implementation • 28 May 2023 • Zhengyan Zhang, Zhiyuan Zeng, Yankai Lin, Chaojun Xiao, Xiaozhi Wang, Xu Han, Zhiyuan Liu, Ruobing Xie, Maosong Sun, Jie zhou

In analogy to human brains, we consider two main characteristics of modularity: (1) functional specialization of neurons: we evaluate whether each neuron is mainly specialized in a certain function, and find that the answer is yes.

Paper
Code

Stochastic Bridges as Effective Regularizers for Parameter-Efficient Tuning

1 code implementation • 28 May 2023 • Weize Chen, Xu Han, Yankai Lin, Zhiyuan Liu, Maosong Sun, Jie zhou

Since it is non-trivial to directly model the intermediate states and design a running cost function, we propose to use latent stochastic bridges to regularize the intermediate states and use the regularization as the running cost of PETs.

Paper
Code

Plug-and-Play Knowledge Injection for Pre-trained Language Models

1 code implementation • 28 May 2023 • Zhengyan Zhang, Zhiyuan Zeng, Yankai Lin, Huadong Wang, Deming Ye, Chaojun Xiao, Xu Han, Zhiyuan Liu, Peng Li, Maosong Sun, Jie zhou

Experimental results on three knowledge-driven NLP tasks show that existing injection methods are not suitable for the new paradigm, while map-tuning effectively improves the performance of downstream models.

Paper
Code

Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning

1 code implementation • 23 May 2023 • Lean Wang, Lei LI, Damai Dai, Deli Chen, Hao Zhou, Fandong Meng, Jie zhou, Xu sun

In-context learning (ICL) emerges as a promising capability of large language models (LLMs) by providing them with demonstration examples to perform diverse tasks.

In-Context Learning

116

Paper
Code

D$^2$TV: Dual Knowledge Distillation and Target-oriented Vision Modeling for Many-to-Many Multimodal Summarization

1 code implementation • 22 May 2023 • Yunlong Liang, Fandong Meng, Jiaan Wang, Jinan Xu, Yufeng Chen, Jie zhou

Further, we propose a dual knowledge distillation and target-oriented vision modeling framework for the M$^3$S task.

Knowledge Distillation

Paper
Code

Farewell to Aimless Large-scale Pretraining: Influential Subset Selection for Language Model

1 code implementation • 22 May 2023 • Xiao Wang, Weikang Zhou, Qi Zhang, Jie zhou, Songyang Gao, Junzhe Wang, Menghan Zhang, Xiang Gao, Yunwen Chen, Tao Gui

Pretrained language models have achieved remarkable success in various natural language processing tasks.

Language Modelling

Paper
Code

A Confidence-based Partial Label Learning Model for Crowd-Annotated Named Entity Recognition

1 code implementation • 21 May 2023 • Limao Xiong, Jie zhou, Qunxi Zhu, Xiao Wang, Yuanbin Wu, Qi Zhang, Tao Gui, Xuanjing Huang, Jin Ma, Ying Shan

Particularly, we propose a Confidence-based Partial Label Learning (CPLL) method to integrate the prior confidence (given by annotators) and posterior confidences (learned by models) for crowd-annotated NER.

named-entity-recognition Named Entity Recognition +2

Paper
Code

GFDC: A Granule Fusion Density-Based Clustering with Evidential Reasoning

no code implementations • 20 May 2023 • Mingjie Cai, Zhishan Wu, Qingguo Li, Feng Xu, Jie zhou

Further, three novel granule fusion strategies are utilized to combine granules into stable cluster structures, helping to detect clusters with arbitrary shapes.

Clustering

Paper
Add Code

Mitigating Catastrophic Forgetting in Task-Incremental Continual Learning with Adaptive Classification Criterion

no code implementations • 20 May 2023 • Yun Luo, Xiaotian Lin, Zhen Yang, Fandong Meng, Jie zhou, Yue Zhang

It is seldom considered to adapt the decision boundary for new representations and in this paper we propose a Supervised Contrastive learning framework with adaptive classification criterion for Continual Learning (SCCL), In our method, a contrastive loss is used to directly learn representations for different tasks and a limited number of data samples are saved as the classification criterion.

Classification Continual Learning +1

Paper
Add Code

VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks

2 code implementations • NeurIPS 2023 • Wenhai Wang, Zhe Chen, Xiaokang Chen, Jiannan Wu, Xizhou Zhu, Gang Zeng, Ping Luo, Tong Lu, Jie zhou, Yu Qiao, Jifeng Dai

We hope this model can set a new baseline for generalist vision and language models.

Language Modelling Large Language Model

3,121

Paper
Code

Personality Understanding of Fictional Characters during Book Reading

1 code implementation • 17 May 2023 • Mo Yu, Jiangnan Li, Shunyu Yao, Wenjie Pang, Xiaochen Zhou, Zhou Xiao, Fandong Meng, Jie zhou

As readers engage with a story, their understanding of a character evolves based on new events and information; and multiple fine-grained aspects of personalities can be perceived.

Paper
Code

Towards Unifying Multi-Lingual and Cross-Lingual Summarization

no code implementations • 16 May 2023 • Jiaan Wang, Fandong Meng, Duo Zheng, Yunlong Liang, Zhixu Li, Jianfeng Qu, Jie zhou

In this paper, we aim to unify MLS and CLS into a more general setting, i. e., many-to-many summarization (M2MS), where a single model could process documents in any language and generate their summaries also in any language.

Language Modelling Text Summarization

Paper
Add Code

Recyclable Tuning for Continual Pre-training

1 code implementation • 15 May 2023 • Yujia Qin, Cheng Qian, Xu Han, Yankai Lin, Huadong Wang, Ruobing Xie, Zhiyuan Liu, Maosong Sun, Jie zhou

In pilot studies, we find that after continual pre-training, the upgraded PLM remains compatible with the outdated adapted weights to some extent.

Paper
Code

RC3: Regularized Contrastive Cross-lingual Cross-modal Pre-training

no code implementations • 13 May 2023 • Chulun Zhou, Yunlong Liang, Fandong Meng, Jinan Xu, Jinsong Su, Jie zhou

In this paper, we propose Regularized Contrastive Cross-lingual Cross-modal (RC^3) pre-training, which further exploits more abundant weakly-aligned multilingual image-text pairs.

Contrastive Learning Machine Translation

Paper
Add Code

WeLayout: WeChat Layout Analysis System for the ICDAR 2023 Competition on Robust Layout Segmentation in Corporate Documents

no code implementations • 11 May 2023 • Mingliang Zhang, Zhen Cao, Juntao Liu, LiQiang Niu, Fandong Meng, Jie zhou

Our approach effectively demonstrates the benefits of combining query-based and anchor-free models for achieving robust layout segmentation in corporate documents.

Bayesian Optimization Segmentation

Paper
Add Code

WebCPM: Interactive Web Search for Chinese Long-form Question Answering

1 code implementation • 11 May 2023 • Yujia Qin, Zihan Cai, Dian Jin, Lan Yan, Shihao Liang, Kunlun Zhu, Yankai Lin, Xu Han, Ning Ding, Huadong Wang, Ruobing Xie, Fanchao Qi, Zhiyuan Liu, Maosong Sun, Jie zhou

We recruit annotators to search for relevant information using our interface and then answer questions.

Information Retrieval Long Form Question Answering +1

950

Paper
Code

Investigating Forgetting in Pre-Trained Representations Through Continual Learning

no code implementations • 10 May 2023 • Yun Luo, Zhen Yang, Xuefeng Bai, Fandong Meng, Jie zhou, Yue Zhang

Intuitively, the representation forgetting can influence the general knowledge stored in pre-trained language models (LMs), but the concrete effect is still unclear.

Continual Learning General Knowledge

Paper
Add Code

Diffusion Theory as a Scalpel: Detecting and Purifying Poisonous Dimensions in Pre-trained Language Models Caused by Backdoor or Bias

no code implementations • 8 May 2023 • Zhiyuan Zhang, Deli Chen, Hao Zhou, Fandong Meng, Jie zhou, Xu sun

To settle this issue, we propose the Fine-purifying approach, which utilizes the diffusion theory to study the dynamic process of fine-tuning for finding potentially poisonous dimensions.

Paper
Add Code

Attacking Pre-trained Recommendation

1 code implementation • 6 May 2023 • Yiqing Wu, Ruobing Xie, Zhao Zhang, Yongchun Zhu, Fuzhen Zhuang, Jie zhou, Yongjun Xu, Qing He

Recently, a series of pioneer studies have shown the potency of pre-trained models in sequential recommendation, illuminating the path of building an omniscient unified pre-trained recommendation model for different downstream recommendation tasks.

Sequential Recommendation

Paper
Code

3D Small Object Detection with Dynamic Spatial Pruning

1 code implementation • 5 May 2023 • Xiuwei Xu, Zhihao Sun, Ziwei Wang, Hongmin Liu, Jie zhou, Jiwen Lu

Specifically, we theoretically derive a dynamic spatial pruning (DSP) strategy to prune the redundant spatial representation of 3D scene in a cascade manner according to the distribution of objects.

3D Object Detection Object +2

Paper
Code

Unified Model Learning for Various Neural Machine Translation

no code implementations • 4 May 2023 • Yunlong Liang, Fandong Meng, Jinan Xu, Jiaan Wang, Yufeng Chen, Jie zhou

Specifically, we propose a ``versatile'' model, i. e., the Unified Model Learning for NMT (UMLNMT) that works with data from different tasks, and can translate well in multiple settings simultaneously, and theoretically it can be as many as possible.

Document Translation Machine Translation +3

Paper
Add Code

BranchNorm: Robustly Scaling Extremely Deep Transformers

no code implementations • 4 May 2023 • Yijin Liu, Xianfeng Zeng, Fandong Meng, Jie zhou

Recently, DeepNorm scales Transformers into extremely deep (i. e., 1000 layers) and reveals the promising potential of deep scaling.

Paper
Add Code

Learning Accurate Performance Predictors for Ultrafast Automated Model Compression

1 code implementation • 13 Apr 2023 • Ziwei Wang, Jiwen Lu, Han Xiao, Shengyu Liu, Jie zhou

On the contrary, we obtain the optimal efficient networks by directly optimizing the compression policy with an accurate performance predictor, where the ultrafast automated model compression for various computational cost constraint is achieved without complex compression policy search and evaluation.

Image Classification Model Compression +3

Paper
Code

Triple Sequence Learning for Cross-domain Recommendation

no code implementations • 11 Apr 2023 • Haokai Ma, Ruobing Xie, Lei Meng, Xin Chen, Xu Zhang, Leyu Lin, Jie zhou

To address this issue, we present a novel framework, termed triple sequence learning for cross-domain recommendation (Tri-CDR), which jointly models the source, target, and mixed behavior sequences to highlight the global and target preference and precisely model the triple correlation in CDR.

Contrastive Learning

Paper
Add Code

Binarizing Sparse Convolutional Networks for Efficient Point Cloud Analysis

no code implementations • CVPR 2023 • Xiuwei Xu, Ziwei Wang, Jie zhou, Jiwen Lu

In this paper, we propose binary sparse convolutional networks called BSC-Net for efficient point cloud analysis.

Binarization Quantization

Paper
Add Code

Efficient Meshy Neural Fields for Animatable Human Avatars

1 code implementation • 23 Mar 2023 • Xiaoke Huang, Yiji Cheng, Yansong Tang, Xiu Li, Jie zhou, Jiwen Lu

Moreover, only minutes of optimization is enough for plausible reconstruction results.

Disentanglement Inverse Rendering

Paper
Code

A Comprehensive Capability Analysis of GPT-3 and GPT-3.5 Series Models

no code implementations • 18 Mar 2023 • Junjie Ye, Xuanting Chen, Nuo Xu, Can Zu, Zekai Shao, Shichun Liu, Yuhan Cui, Zeyang Zhou, Chao Gong, Yang shen, Jie zhou, Siming Chen, Tao Gui, Qi Zhang, Xuanjing Huang

GPT series models, such as GPT-3, CodeX, InstructGPT, ChatGPT, and so on, have gained considerable attention due to their exceptional natural language processing capabilities.

Natural Language Understanding

Paper
Add Code

SurroundOcc: Multi-Camera 3D Occupancy Prediction for Autonomous Driving

2 code implementations • ICCV 2023 • Yi Wei, Linqing Zhao, Wenzhao Zheng, Zheng Zhu, Jie zhou, Jiwen Lu

Towards a more comprehensive perception of a 3D scene, in this paper, we propose a SurroundOcc method to predict the 3D occupancy with multi-camera images.

3D Object Detection Autonomous Driving +2

679

Paper
Code

Precise Facial Landmark Detection by Reference Heatmap Transformer

no code implementations • 14 Mar 2023 • Jun Wan, Jun Liu, Jie zhou, Zhihui Lai, Linlin Shen, Hang Sun, Ping Xiong, Wenwen Min

Most facial landmark detection methods predict landmarks by mapping the input facial appearance features to landmark heatmaps and have achieved promising results.

Facial Landmark Detection

Paper
Add Code

HiNet: Novel Multi-Scenario & Multi-Task Learning with Hierarchical Information Extraction

1 code implementation • 10 Mar 2023 • Jie zhou, Xianshuai Cao, Wenhao Li, Lin Bo, Kun Zhang, Chuan Luo, Qian Yu

Multi-scenario & multi-task learning has been widely applied to many recommendation systems in industrial applications, wherein an effective and practical approach is to carry out multi-scenario transfer learning on the basis of the Mixture-of-Expert (MoE) architecture.

Multi-Task Learning Recommendation Systems

Paper
Code

Is ChatGPT a Good NLG Evaluator? A Preliminary Study

1 code implementation • 7 Mar 2023 • Jiaan Wang, Yunlong Liang, Fandong Meng, Zengkui Sun, Haoxiang Shi, Zhixu Li, Jinan Xu, Jianfeng Qu, Jie zhou

In detail, we regard ChatGPT as a human evaluator and give task-specific (e. g., summarization) and aspect-specific (e. g., relevance) instruction to prompt ChatGPT to evaluate the generated results of NLG models.

nlg evaluation Story Generation

Paper
Code

Unleashing Text-to-Image Diffusion Models for Visual Perception

2 code implementations • ICCV 2023 • Wenliang Zhao, Yongming Rao, Zuyan Liu, Benlin Liu, Jie zhou, Jiwen Lu

In this paper, we propose VPD (Visual Perception with a pre-trained Diffusion model), a new framework that exploits the semantic information of a pre-trained text-to-image diffusion model in visual perception tasks.

Ranked #7 on Referring Expression Segmentation on RefCoCo val

Denoising Image Segmentation +4

7,406

Paper
Code

How Robust is GPT-3.5 to Predecessors? A Comprehensive Study on Language Understanding Tasks

no code implementations • 1 Mar 2023 • Xuanting Chen, Junjie Ye, Can Zu, Nuo Xu, Rui Zheng, Minlong Peng, Jie zhou, Tao Gui, Qi Zhang, Xuanjing Huang

The GPT-3. 5 models have demonstrated impressive performance in various Natural Language Processing (NLP) tasks, showcasing their strong understanding and reasoning capabilities.

Natural Language Inference Natural Language Understanding +1

Paper
Add Code

Zero-Shot Cross-Lingual Summarization via Large Language Models

no code implementations • 28 Feb 2023 • Jiaan Wang, Yunlong Liang, Fandong Meng, Beiqi Zou, Zhixu Li, Jianfeng Qu, Jie zhou

Given a document in a source language, cross-lingual summarization (CLS) aims to generate a summary in a different target language.

Informativeness

Paper
Add Code

Tell Model Where to Attend: Improving Interpretability of Aspect-Based Sentiment Classification via Small Explanation Annotations

no code implementations • 21 Feb 2023 • Zhenxiao Cheng, Jie zhou, Wen Wu, Qin Chen, Liang He

Gradient-based explanation methods play an important role in the field of interpreting complex deep neural networks for NLP models.

Sentence Sentiment Analysis +1

Paper
Add Code

A Flexible Multi-view Multi-modal Imaging System for Outdoor Scenes

no code implementations • 21 Feb 2023 • Meng Zhang, Wenxuan Guo, Bohao Fan, Yifan Chen, Jianjiang Feng, Jie zhou

The experimental results show that multi-view point clouds greatly improve 3D object detection and tracking accuracy regardless of complex and various outdoor environments.

3D Object Detection Object +1

Paper
Add Code

Tri-Perspective View for Vision-Based 3D Semantic Occupancy Prediction

2 code implementations • CVPR 2023 • Yuanhui Huang, Wenzhao Zheng, Yunpeng Zhang, Jie zhou, Jiwen Lu

To lift image features to the 3D TPV space, we further propose a transformer-based TPV encoder (TPVFormer) to obtain the TPV features effectively.

Ranked #1 on Prediction Of Occupancy Grid Maps on nuScenes

3D Semantic Scene Completion Autonomous Driving +1

4,810

Paper
Code

Feature Decomposition for Reducing Negative Transfer: A Novel Multi-task Learning Method for Recommender System

1 code implementation • 10 Feb 2023 • Jie zhou, Qian Yu, Chuan Luo, Jing Zhang

In recent years, thanks to the rapid development of deep learning (DL), DL-based multi-task learning (MTL) has made significant progress, and it has been successfully applied to recommendation systems (RS).

Multi-Task Learning Recommendation Systems

Paper
Code

A Multi-task Multi-stage Transitional Training Framework for Neural Chat Translation

no code implementations • 27 Jan 2023 • Chulun Zhou, Yunlong Liang, Fandong Meng, Jie zhou, Jinan Xu, Hongji Wang, Min Zhang, Jinsong Su

To address these issues, in this paper, we propose a multi-task multi-stage transitional (MMT) training framework, where an NCT model is trained using the bilingual chat translation dataset and additional monolingual dialogues.

NMT Sentence +1

Paper
Add Code

When to Trust Aggregated Gradients: Addressing Negative Client Sampling in Federated Learning

no code implementations • 25 Jan 2023 • Wenkai Yang, Yankai Lin, Guangxiang Zhao, Peng Li, Jie zhou, Xu sun

Federated Learning has become a widely-used framework which allows learning a global model on decentralized local datasets under the condition of protecting local data privacy.

Federated Learning text-classification +1

Paper
Add Code

Integrating Local Real Data with Global Gradient Prototypes for Classifier Re-Balancing in Federated Long-Tailed Learning

no code implementations • 25 Jan 2023 • Wenkai Yang, Deli Chen, Hao Zhou, Fandong Meng, Jie zhou, Xu sun

Federated Learning (FL) has become a popular distributed learning paradigm that involves multiple clients training a global model collaboratively in a data privacy-preserving manner.

Federated Learning Privacy Preserving

Paper
Add Code

Transformer-Patcher: One Mistake worth One Neuron

1 code implementation • 24 Jan 2023 • Zeyu Huang, Yikang Shen, Xiaofeng Zhang, Jie zhou, Wenge Rong, Zhang Xiong

Our method outperforms previous fine-tuning and HyperNetwork-based methods and achieves state-of-the-art performance for Sequential Model Editing (SME).

Model Editing

Paper
Code

AdaPoinTr: Diverse Point Cloud Completion with Adaptive Geometry-Aware Transformers

1 code implementation • 11 Jan 2023 • Xumin Yu, Yongming Rao, Ziyi Wang, Jiwen Lu, Jie zhou

In this paper, we present a new method that reformulates point cloud completion as a set-to-set translation problem and design a new model, called PoinTr, which adopts a Transformer encoder-decoder architecture for point cloud completion.

Ranked #2 on Point Cloud Completion on ShapeNet

Denoising Inductive Bias +1

521

Paper
Code

DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation

1 code implementation • CVPR 2023 • Shuai Shen, Wenliang Zhao, Zibin Meng, Wanhua Li, Zheng Zhu, Jie zhou, Jiwen Lu

In this way, the proposed DiffTalk is capable of producing high-quality talking head videos in synchronization with the source audio, and more importantly, it can be naturally generalized across different identities without any further fine-tuning.

Denoising Talking Head Generation

403

Paper
Code

DiffSwap: High-Fidelity and Controllable Face Swapping via 3D-Aware Masked Diffusion

1 code implementation • CVPR 2023 • Wenliang Zhao, Yongming Rao, Weikang Shi, Zuyan Liu, Jie zhou, Jiwen Lu

Unlike previous work that relies on carefully designed network architectures and loss functions to fuse the information from the source and target faces, we reformulate the face swapping as a conditional inpainting task, performed by a powerful diffusion model guided by the desired face attributes (e. g., identity and landmarks).

Face Swapping

Paper
Code

CLIP-Cluster: CLIP-Guided Attribute Hallucination for Face Clustering

no code implementations • ICCV 2023 • Shuai Shen, Wanhua Li, Xiaobing Wang, Dafeng Zhang, Zhezhu Jin, Jie zhou, Jiwen Lu

Furthermore, we develop a neighbor-aware proxy generator that fuses the features describing various attributes into a proxy feature to build a bridge among different sub-clusters and reduce the intra-class variance.

Attribute Clustering +2

Paper
Add Code

Deep Factorized Metric Learning

1 code implementation • CVPR 2023 • Chengkun Wang, Wenzhao Zheng, Junlong Li, Jie zhou, Jiwen Lu

Learning a generalizable and comprehensive similarity metric to depict the semantic discrepancies between images is the foundation of many computer vision tasks.

Image Classification Metric Learning

Paper
Code

Deep learning for size-agnostic inverse design of random-network 3D printed mechanical metamaterials

no code implementations • 22 Dec 2022 • Helda Pahlavani, Kostas Tsifoutis-Kazolis, Prerak Mody, Jie zhou, Mohammad J. Mirzaali, Amir A. Zadpoor

Practical applications of mechanical metamaterials often involve solving inverse problems where the objective is to find the (multiple) microarchitectures that give rise to a given set of properties.

Paper
Add Code

Bort: Towards Explainable Neural Networks with Bounded Orthogonal Constraint

1 code implementation • 18 Dec 2022 • Borui Zhang, Wenzhao Zheng, Jie zhou, Jiwen Lu

Deep learning has revolutionized human society, yet the black-box nature of deep neural networks hinders further application to reliability-demanded industries.

Paper
Code

Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization

1 code implementation • 15 Dec 2022 • Yunlong Liang, Fandong Meng, Jinan Xu, Jiaan Wang, Yufeng Chen, Jie zhou

However, less attention has been paid to the visual features from the perspective of the summary, which may limit the model performance, especially in the low- and zero-resource scenarios.

Abstractive Text Summarization

Paper
Code

Understanding Translationese in Cross-Lingual Summarization

no code implementations • 14 Dec 2022 • Jiaan Wang, Fandong Meng, Yunlong Liang, Tingyi Zhang, Jiarong Xu, Zhixu Li, Jie zhou

In detail, we find that (1) the translationese in documents or summaries of test sets might lead to the discrepancy between human judgment and automatic evaluation; (2) the translationese in training sets would harm model performance in real-world applications; (3) though machine-translated documents involve translationese, they are very useful for building CLS systems on low-resource languages under specific training strategies.

Paper
Add Code

FLAG3D: A 3D Fitness Activity Dataset with Language Instruction

1 code implementation • CVPR 2023 • Yansong Tang, Jinpeng Liu, Aoyang Liu, Bin Yang, Wenxun Dai, Yongming Rao, Jiwen Lu, Jie zhou, Xiu Li

With the continuously thriving popularity around the world, fitness activity analytic has become an emerging research topic in computer vision.

Action Generation Action Recognition +2

Paper
Code

DC-MBR: Distributional Cooling for Minimum Bayesian Risk Decoding

no code implementations • 8 Dec 2022 • Jianhao Yan, Jin Xu, Fandong Meng, Jie zhou, Yue Zhang

In this work, we show that the issue arises from the un-consistency of label smoothing on the token-level and sequence-level distributions.

Machine Translation NMT

Paper
Add Code

Diffusion-SDF: Text-to-Shape via Voxelized Diffusion

1 code implementation • CVPR 2023 • Muheng Li, Yueqi Duan, Jie zhou, Jiwen Lu

With the rising industrial attention to 3D virtual modeling technology, generating novel 3D content based on specified conditions (e. g. text) has become a hot issue.

156

Paper
Code

Findings of the WMT 2022 Shared Task on Translation Suggestion

no code implementations • 30 Nov 2022 • Zhen Yang, Fandong Meng, Yingxue Zhang, Ernan Li, Jie zhou

We report the result of the first edition of the WMT shared task on Translation Suggestion (TS).

Machine Translation Task 2 +1

Paper
Add Code

Rephrasing the Reference for Non-Autoregressive Machine Translation

no code implementations • 30 Nov 2022 • Chenze Shao, Jinchao Zhang, Jie zhou, Yang Feng

In response to this problem, we introduce a rephraser to provide a better training target for NAT by rephrasing the reference sentence according to the NAT output.

Machine Translation Sentence +1

Paper
Add Code

AutoCAD: Automatically Generating Counterfactuals for Mitigating Shortcut Learning

1 code implementation • 29 Nov 2022 • Jiaxin Wen, Yeshuang Zhu, Jinchao Zhang, Jie zhou, Minlie Huang

Recent studies have shown the impressive efficacy of counterfactually augmented data (CAD) for reducing NLU models' reliance on spurious features and improving their generalizability.

Paper
Code

BJTU-WeChat's Systems for the WMT22 Chat Translation Task

no code implementations • 28 Nov 2022 • Yunlong Liang, Fandong Meng, Jinan Xu, Yufeng Chen, Jie zhou

Our systems achieve 0. 810 and 0. 946 COMET scores.

Denoising Knowledge Distillation +2

Paper
Add Code

Summer: WeChat Neural Machine Translation Systems for the WMT22 Biomedical Translation Task

no code implementations • 28 Nov 2022 • Ernan Li, Fandong Meng, Jie zhou

This paper introduces WeChat's participation in WMT 2022 shared biomedical translation task on Chinese to English.

Machine Translation Translation

Paper
Add Code

SGCE-Font: Skeleton Guided Channel Expansion for Chinese Font Generation

no code implementations • 26 Nov 2022 • Jie zhou, Yefei Wang, Yiyang Yuan, Qing Huang, Jinshan Zeng

Numerical results show that the mode collapse issue suffered by the known CycleGAN can be effectively alleviated by equipping with the proposed SGCE module, and the CycleGAN equipped with SGCE outperforms the state-of-the-art models in terms of four important evaluation metrics and visualization quality.

Font Generation

Paper
Add Code

Reconstructing high-order sequence features of dynamic functional connectivity networks based on diversified covert attention patterns for Alzheimer's disease classification

no code implementations • 19 Nov 2022 • Zhixiang Zhang, Biao Jie, Zhengdong Wang, Jie zhou, Yang Yang

Recent studies have applied deep learning methods such as convolutional recurrent neural networks (CRNs) and Transformers to brain disease classification based on dynamic functional connectivity networks (dFCNs), such as Alzheimer's disease (AD), achieving better performance than traditional machine learning methods.

Classification

Paper
Add Code

BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision

2 code implementations • CVPR 2023 • Chenyu Yang, Yuntao Chen, Hao Tian, Chenxin Tao, Xizhou Zhu, Zhaoxiang Zhang, Gao Huang, Hongyang Li, Yu Qiao, Lewei Lu, Jie zhou, Jifeng Dai

The proposed method is verified with a wide spectrum of traditional and modern image backbones and achieves new SoTA results on the large-scale nuScenes dataset.

Ranked #5 on 3D Object Detection on Rope3D

3D Object Detection

2,870

Paper
Code

Planning Irregular Object Packing via Hierarchical Reinforcement Learning

no code implementations • 17 Nov 2022 • Sichao Huang, Ziwei Wang, Jie zhou, Jiwen Lu

We compare our approach with existing robotic packing methods for irregular objects in a physics simulator.

Hierarchical Reinforcement Learning Object +3

Paper
Add Code

Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information

1 code implementation • CVPR 2023 • Weijie Su, Xizhou Zhu, Chenxin Tao, Lewei Lu, Bin Li, Gao Huang, Yu Qiao, Xiaogang Wang, Jie zhou, Jifeng Dai

It has been proved that combining multiple pre-training strategies and data from various modalities/sources can greatly boost the training of large-scale models.

Ranked #2 on Semantic Segmentation on ADE20K (using extra training data)

Image Classification Long-tailed Object Detection +3

Paper
Code

Cross-Modal Adapter for Text-Video Retrieval

1 code implementation • 17 Nov 2022 • Haojun Jiang, Jianke Zhang, Rui Huang, Chunjiang Ge, Zanlin Ni, Jiwen Lu, Jie zhou, Shiji Song, Gao Huang

However, as pre-trained models are scaling up, fully fine-tuning them on text-video retrieval datasets has a high risk of overfitting.

Retrieval Video Retrieval

Paper
Code

CSCD-IME: Correcting Spelling Errors Generated by Pinyin IME

1 code implementation • 16 Nov 2022 • Yong Hu, Fandong Meng, Jie zhou

In fact, most of Chinese input is based on pinyin input method, so the study of spelling errors in this process is more practical and valuable.

Spelling Correction

Paper
Code

Probabilistic Deep Metric Learning for Hyperspectral Image Classification

1 code implementation • 15 Nov 2022 • Chengkun Wang, Wenzhao Zheng, Xian Sun, Jiwen Lu, Jie zhou

We propose to learn a global probabilistic distribution for each pixel in the patch and a probabilistic metric to model the distance between distributions.

Classification Hyperspectral Image Classification +1

Paper
Code

MAVEN-ERE: A Unified Large-scale Dataset for Event Coreference, Temporal, Causal, and Subevent Relation Extraction

1 code implementation • 14 Nov 2022 • Xiaozhi Wang, Yulin Chen, Ning Ding, Hao Peng, Zimu Wang, Yankai Lin, Xu Han, Lei Hou, Juanzi Li, Zhiyuan Liu, Peng Li, Jie zhou

It contains 103, 193 event coreference chains, 1, 216, 217 temporal relations, 57, 992 causal relations, and 15, 841 subevent relations, which is larger than existing datasets of all the ERE tasks by at least an order of magnitude.

Event Relation Extraction Relation +1

Paper
Code

Demystify Transformers & Convolutions in Modern Image Deep Networks

1 code implementation • 10 Nov 2022 • Xiaowei Hu, Min Shi, Weiyun Wang, Sitong Wu, Linjie Xing, Wenhai Wang, Xizhou Zhu, Lewei Lu, Jie zhou, Xiaogang Wang, Yu Qiao, Jifeng Dai

Our experiments on various tasks and an analysis of inductive bias show a significant performance boost due to advanced network-level and block-level designs, but performance differences persist among different STMs.

Image Deep Networks Spatial Token Mixer

Paper
Code

Few-Shot Character Understanding in Movies as an Assessment to Meta-Learning of Theory-of-Mind

1 code implementation • 9 Nov 2022 • Mo Yu, Qiujing Wang, Shunchi Zhang, Yisi Sang, Kangsheng Pu, Zekai Wei, Han Wang, Liyan Xu, Jing Li, Yue Yu, Jie zhou

Our dataset consists of ~1, 000 parsed movie scripts, each corresponding to a few-shot character understanding task that requires models to mimic humans' ability of fast digesting characters with a few starting scenes in a new movie.

Meta-Learning Metric Learning

Paper
Code

Counterfactual Data Augmentation via Perspective Transition for Open-Domain Dialogues

1 code implementation • 30 Oct 2022 • Jiao Ou, Jinchao Zhang, Yang Feng, Jie zhou

The dialogue data admits a wide variety of responses for a given dialogue history, especially responses with different semantics.

counterfactual Counterfactual Inference +1

Paper
Code

Question-Interlocutor Scope Realized Graph Modeling over Key Utterances for Dialogue Reading Comprehension

no code implementations • 26 Oct 2022 • Jiangnan Li, Mo Yu, Fandong Meng, Zheng Lin, Peng Fu, Weiping Wang, Jie zhou

Although these tasks are effective, there are still urging problems: (1) randomly masking speakers regardless of the question cannot map the speaker mentioned in the question to the corresponding speaker in the dialogue, and ignores the speaker-centric nature of utterances.

Reading Comprehension

Paper
Add Code

Exploring Mode Connectivity for Pre-trained Language Models

1 code implementation • 25 Oct 2022 • Yujia Qin, Cheng Qian, Jing Yi, Weize Chen, Yankai Lin, Xu Han, Zhiyuan Liu, Maosong Sun, Jie zhou

(3) How does the PLM's task knowledge change along the path connecting two minima?

Paper
Code

Different Tunes Played with Equal Skill: Exploring a Unified Optimization Subspace for Delta Tuning

1 code implementation • 24 Oct 2022 • Jing Yi, Weize Chen, Yujia Qin, Yankai Lin, Ning Ding, Xu Han, Zhiyuan Liu, Maosong Sun, Jie zhou

To fathom the mystery, we hypothesize that the adaptations of different DETs could all be reparameterized as low-dimensional optimizations in a unified optimization subspace, which could be found by jointly decomposing independent solutions of different DETs.

Paper
Code

Empathetic Dialogue Generation via Sensitive Emotion Recognition and Sensible Knowledge Selection

1 code implementation • 21 Oct 2022 • Lanrui Wang, Jiangnan Li, Zheng Lin, Fandong Meng, Chenxu Yang, Weiping Wang, Jie zhou

We use a fine-grained encoding strategy which is more sensitive to the emotion dynamics (emotion flow) in the conversations to predict the emotion-intent characteristic of response.

Dialogue Generation Emotion Recognition +2

Paper
Code

ROSE: Robust Selective Fine-tuning for Pre-trained Language Models

1 code implementation • 18 Oct 2022 • Lan Jiang, Hao Zhou, Yankai Lin, Peng Li, Jie zhou, Rui Jiang

Even though the large-scale language models have achieved excellent performances, they suffer from various adversarial attacks.

Adversarial Robustness

Paper
Code

Cerebrovascular Segmentation via Vessel Oriented Filtering Network

no code implementations • 17 Oct 2022 • Zhanqiang Guo, Yao Luan, Jianjiang Feng, Wangsheng Lu, Yin Yin, Guangming Yang, Jie zhou

Accurate cerebrovascular segmentation from Magnetic Resonance Angiography (MRA) and Computed Tomography Angiography (CTA) is of great significance in diagnosis and treatment of cerebrovascular pathology.

Segmentation

Paper
Add Code

Towards Robust k-Nearest-Neighbor Machine Translation

3 code implementations • 17 Oct 2022 • Hui Jiang, Ziyao Lu, Fandong Meng, Chulun Zhou, Jie zhou, Degen Huang, Jinsong Su

Meanwhile we inject two types of perturbations into the retrieved pairs for robust training.

Machine Translation NMT +1

Paper
Code

Dynamics-aware Adversarial Attack of Adaptive Neural Networks

1 code implementation • 15 Oct 2022 • An Tao, Yueqi Duan, Yingqi Wang, Jiwen Lu, Jie zhou

To address this issue, we propose a Leaded Gradient Method (LGM) and show the significant effects of the lagged gradient.

Adversarial Attack Computational Efficiency

Paper
Code

Categorizing Semantic Representations for Neural Machine Translation

no code implementations • COLING 2022 • Yongjing Yin, Yafu Li, Fandong Meng, Jie zhou, Yue Zhang

Modern neural machine translation (NMT) models have achieved competitive performance in standard benchmarks.

Machine Translation NMT +2

Paper
Add Code

Token-Label Alignment for Vision Transformers

1 code implementation • ICCV 2023 • Han Xiao, Wenzhao Zheng, Zheng Zhu, Jie zhou, Jiwen Lu

Data mixing strategies (e. g., CutMix) have shown the ability to greatly improve the performance of convolutional neural networks (CNNs).

Image Classification Semantic Segmentation +1

Paper
Code

OPERA: Omni-Supervised Representation Learning with Hierarchical Supervisions

1 code implementation • ICCV 2023 • Chengkun Wang, Wenzhao Zheng, Zheng Zhu, Jie zhou, Jiwen Lu

The pretrain-finetune paradigm in modern computer vision facilitates the success of self-supervised learning, which tends to achieve better transferability than supervised learning.

Image Classification object-detection +3

Paper
Code

Mixture of Attention Heads: Selecting Attention Heads Per Token

1 code implementation • 11 Oct 2022 • Xiaofeng Zhang, Yikang Shen, Zeyu Huang, Jie zhou, Wenge Rong, Zhang Xiong

This paper proposes the Mixture of Attention Heads (MoA), a new architecture that combines multi-head attention with the MoE mechanism.

Computational Efficiency Language Modelling +2

Paper
Code

A Win-win Deal: Towards Sparse and Robust Pre-trained Language Models

1 code implementation • 11 Oct 2022 • Yuanxin Liu, Fandong Meng, Zheng Lin, Jiangnan Li, Peng Fu, Yanan Cao, Weiping Wang, Jie zhou

In response to the efficiency problem, recent studies show that dense PLMs can be replaced with sparse subnetworks without hurting the performance.

Natural Language Understanding

Paper
Code

From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models

1 code implementation • 11 Oct 2022 • Lei LI, Yankai Lin, Xuancheng Ren, Guangxiang Zhao, Peng Li, Jie zhou, Xu sun

We then design a Model Uncertainty--aware Knowledge Integration (MUKI) framework to recover the golden supervision for the student.

Paper
Code

Towards Robust Visual Question Answering: Making the Most of Biased Samples via Contrastive Learning

1 code implementation • 10 Oct 2022 • Qingyi Si, Yuanxin Liu, Fandong Meng, Zheng Lin, Peng Fu, Yanan Cao, Weiping Wang, Jie zhou

However, these models reveal a trade-off that the improvements on OOD data severely sacrifice the performance on the in-distribution (ID) data (which is dominated by the biased samples).

Contrastive Learning Question Answering +1

Paper
Code

Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA

1 code implementation • 10 Oct 2022 • Qingyi Si, Fandong Meng, Mingyu Zheng, Zheng Lin, Yuanxin Liu, Peng Fu, Yanan Cao, Weiping Wang, Jie zhou

To overcome this limitation, we propose a new dataset that considers varying types of shortcuts by constructing different distribution shifts in multiple OOD test sets.

Question Answering Visual Question Answering

Paper
Code

Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment

1 code implementation • 9 Oct 2022 • Siyu Lai, Zhen Yang, Fandong Meng, Yufeng Chen, Jinan Xu, Jie zhou

Word alignment which aims to extract lexicon translation equivalents between source and target sentences, serves as a fundamental tool for natural language processing.

Language Modelling Sentence +2

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.