Search Results for author: Lei LI

Found 363 papers, 185 papers with code

Dispersed EM-VAEs for Interpretable Text Generation

no code implementations ICML 2020 Wenxian Shi, Hao Zhou, Ning Miao, Lei LI

Interpretability is important in text generation for guiding the generation with interpretable attributes.

Text Generation

Structural Supervision for Word Alignment and Machine Translation

no code implementations Findings (ACL) 2022 Lei LI, Kai Fan, Hongjia Li, Chun Yuan

Syntactic structure has long been argued to be potentially useful for enforcing accurate word alignment and improving generalization performance of machine translation.

Machine Translation Multi-Task Learning +2

Gradient-Based Adversarial Factual Consistency Evaluation for Abstractive Summarization

no code implementations EMNLP 2021 Zhiyuan Zeng, Jiaze Chen, Weiran Xu, Lei LI

Based on the artificial dataset, we train an evaluation model that can not only make accurate and robust factual consistency discrimination but is also capable of making interpretable factual errors tracing by backpropagated gradient distribution on token embeddings.

Abstractive Text Summarization Data Augmentation

Extractive Financial Narrative Summarisation based on DPPs

no code implementations FNP (COLING) 2020 Lei LI, Yafei Jiang, Yinan Liu

We participate in the FNS-Summarisation 2020 shared task to be held at FNP 2020 workshop at COLING 2020.

Point Processes

GLAT: Glancing at Latent Variables for Parallel Text Generation

1 code implementation ACL 2022 Yu Bao, Hao Zhou, ShuJian Huang, Dongqi Wang, Lihua Qian, Xinyu Dai, Jiajun Chen, Lei LI

Recently, parallel text generation has received widespread attention due to its success in generation efficiency.

Text Generation

Augmenting Legal Judgment Prediction with Contrastive Case Relations

1 code implementation COLING 2022 Dugang Liu, Weihao Du, Lei LI, Weike Pan, Zhong Ming

Existing legal judgment prediction methods usually only consider one single case fact description as input, which may not fully utilize the information in the data such as case relations and frequency.

CMU-Flownet: Exploring Point Cloud Scene Flow Estimation in Occluded Scenario

no code implementations16 Apr 2024 Jingze Chen, Junfeng Yao, Qiqin Lin, Lei LI

Occlusions hinder point cloud frame alignment in LiDAR data, a challenge inadequately addressed by scene flow models tested mainly on occlusion-free datasets.

Occlusion Estimation Occlusion Handling +1

CHOPS: CHat with custOmer Profile Systems for Customer Service with LLMs

1 code implementation31 Mar 2024 Jingzhe Shi, Jialuo Li, Qinwei Ma, Zaiwen Yang, Huan Ma, Lei LI

We have conducted extensive experiments to validate the performance of our proposed CHOPS architecture using the CPHOS-dataset, with the aim of demonstrating how LLMs can enhance or serve as alternatives to human customer service.

Unleashing the Potential of Large Language Models for Predictive Tabular Tasks in Data Science

no code implementations29 Mar 2024 Yazheng Yang, Yuqi Wang, Sankalok Sen, Lei LI, Qi Liu

Despite their proficiency in comprehending natural language, LLMs fall short in dealing with structured tabular data.

Imputation In-Context Learning

Towards Multimodal Video Paragraph Captioning Models Robust to Missing Modality

1 code implementation28 Mar 2024 Sishuo Chen, Lei LI, Shuhuai Ren, Rundong Gao, Yuanxin Liu, Xiaohan Bi, Xu sun, Lu Hou

Video paragraph captioning (VPC) involves generating detailed narratives for long videos, utilizing supportive modalities such as speech and event boundaries.

Data Augmentation Video Understanding

Convergence analysis of OT-Flow for sample generation

no code implementations24 Mar 2024 Yang Jing, Lei LI

Second, since the loss function will be approximated by Monte Carlo method in training, we established the convergence between the discrete loss function and the continuous one when the sample number $N$ goes to infinity as well.

Word Order's Impacts: Insights from Reordering and Generation Analysis

no code implementations18 Mar 2024 Qinghua Zhao, Jiaang Li, Lei LI, Zenghui Zhou, Junfeng Liu

Existing works have studied the impacts of the order of words within natural text.

Large Language Model-informed ECG Dual Attention Network for Heart Failure Risk Prediction

no code implementations15 Mar 2024 Chen Chen, Lei LI, Marcel Beetz, Abhirup Banerjee, Ramneek Gupta, Vicente Grau

We present a novel, lightweight dual-attention ECG network designed to capture complex ECG features essential for early HF risk prediction, despite the notable imbalance between low and high-risk groups.

Language Modelling Large Language Model

MedFLIP: Medical Vision-and-Language Self-supervised Fast Pre-Training with Masked Autoencoder

no code implementations7 Mar 2024 Lei LI, Tianfang Zhang, Xinglin Zhang, Jiaqi Liu, Bingqi Ma, Yan Luo, Tao Chen

Within the domain of medical analysis, extensive research has explored the potential of mutual learning between Masked Autoencoders(MAEs) and multimodal data.

Representation Learning Zero-Shot Learning

SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM

no code implementations7 Mar 2024 JieLin Qiu, Andrea Madotto, Zhaojiang Lin, Paul A. Crook, Yifan Ethan Xu, Xin Luna Dong, Christos Faloutsos, Lei LI, Babak Damavandi, Seungwhan Moon

We have developed the \textbf{SnapNTell Dataset}, distinct from traditional VQA datasets: (1) It encompasses a wide range of categorized entities, each represented by images and explicitly named in the answers; (2) It features QA pairs that require extensive knowledge for accurate responses.

Question Answering Retrieval +1

ImgTrojan: Jailbreaking Vision-Language Models with ONE Image

1 code implementation5 Mar 2024 Xijia Tao, Shuai Zhong, Lei LI, Qi Liu, Lingpeng Kong

In this paper, we propose a novel jailbreaking attack against VLMs, aiming to bypass their safety barrier when a user inputs harmful instructions.

Tree Counting by Bridging 3D Point Clouds with Imagery

no code implementations4 Mar 2024 Lei LI, Tianfang Zhang, Zhongyu Jiang, Cheng-Yen Yang, Jenq-Neng Hwang, Stefan Oehmcke, Dimitri Pierre Johannes Gominski, Fabian Gieseke, Christian Igel

We leverage the fusion of three-dimensional LiDAR measurements and 2D imagery to facilitate the accurate counting of trees.

Management

TempCompass: Do Video LLMs Really Understand Videos?

1 code implementation1 Mar 2024 Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, Lei LI, Sishuo Chen, Xu sun, Lu Hou

Motivated by these two problems, we propose the \textbf{TempCompass} benchmark, which introduces a diversity of temporal aspects and task formats.

Hire a Linguist!: Learning Endangered Languages with In-Context Linguistic Descriptions

no code implementations28 Feb 2024 Kexun Zhang, Yee Man Choi, Zhenqiao Song, Taiqi He, William Yang Wang, Lei LI

On the contrary, we observe that 2000 endangered languages, though without a large corpus, have a grammar book or a dictionary.

Structure-Based Drug Design via 3D Molecular Generative Pre-training and Sampling

no code implementations22 Feb 2024 Yuwei Yang, Siqi Ouyang, Xueyu Hu, Mingyue Zheng, Hao Zhou, Lei LI

We develop a novel 3D graph editing model to generate molecules using fragments, and pre-train this model on abundant 3D ligands for learning target-independent properties.

Molecular Docking Self-Learning

Where It Really Matters: Few-Shot Environmental Conservation Media Monitoring for Low-Resource Languages

no code implementations19 Feb 2024 Sameer Jain, Sedrick Scott Keh, Shova Chettri, Karun Dewan, Pablo Izquierdo, Johanna Prussman, Pooja Shreshtha, Cesar Suarez, Zheyuan Ryan Shi, Lei LI, Fei Fang

Environmental conservation organizations routinely monitor news content on conservation in protected areas to maintain situational awareness of developments that can have an environmental impact.

Perils of Self-Feedback: Self-Bias Amplifies in Large Language Models

no code implementations18 Feb 2024 Wenda Xu, Guanglei Zhu, Xuandong Zhao, Liangming Pan, Lei LI, William Yang Wang

Recent studies show that self-feedback improves large language models (LLMs) on certain tasks while worsens other tasks.

Mathematical Reasoning Text Generation

DE-COP: Detecting Copyrighted Content in Language Models Training Data

1 code implementation15 Feb 2024 André V. Duarte, Xuandong Zhao, Arlindo L. Oliveira, Lei LI

We are motivated by the premise that a language model is likely to identify verbatim excerpts from its training text.

Language Modelling Multiple-choice

Permute-and-Flip: An optimally robust and watermarkable decoder for LLMs

1 code implementation8 Feb 2024 Xuandong Zhao, Lei LI, Yu-Xiang Wang

In this paper, we propose a new decoding method called Permute-and-Flip (PF) decoder.

Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation

1 code implementation7 Feb 2024 Ziyang Wang, Jian-Qing Zheng, Yichi Zhang, Ge Cui, Lei LI

Mamba-UNet adopts a pure Visual Mamba (VMamba)-based encoder-decoder structure, infused with skip connections to preserve spatial information across different scales of the network.

Cardiac Segmentation Computational Efficiency +3

Progress and Opportunities of Foundation Models in Bioinformatics

no code implementations6 Feb 2024 Qing Li, Zhihang Hu, YiXuan Wang, Lei LI, Yimin Fan, Irwin King, Le Song, Yu Li

Central to our focus is the application of FMs to specific biological problems, aiming to guide the research community in choosing appropriate FMs for their research needs.

EasyInstruct: An Easy-to-use Instruction Processing Framework for Large Language Models

3 code implementations5 Feb 2024 Yixin Ou, Ningyu Zhang, Honghao Gui, Ziwen Xu, Shuofei Qiao, Yida Xue, Runnan Fang, Kangwei Liu, Lei LI, Zhen Bi, Guozhou Zheng, Huajun Chen

In recent years, instruction tuning has gained increasing attention and emerged as a crucial technique to enhance the capabilities of Large Language Models (LLMs).

KS-Lottery: Finding Certified Lottery Tickets for Multilingual Language Models

no code implementations5 Feb 2024 Fei Yuan, Chang Ma, Shuai Yuan, Qiushi Sun, Lei LI

We further theoretically prove that KS-Lottery can find the certified winning tickets in the embedding layer, fine-tuning on the found parameters is guaranteed to perform as well as full fine-tuning.

Translation

Weak-to-Strong Jailbreaking on Large Language Models

1 code implementation30 Jan 2024 Xuandong Zhao, Xianjun Yang, Tianyu Pang, Chao Du, Lei LI, Yu-Xiang Wang, William Yang Wang

In this paper, we propose the weak-to-strong jailbreaking attack, an efficient method to attack aligned LLMs to produce harmful text.

Red Teaming Visual Language Models

no code implementations23 Jan 2024 Mukai Li, Lei LI, Yuwei Yin, Masood Ahmed, Zhenguang Liu, Qi Liu

Additionally, we simply apply red teaming alignment to LLaVA-v1. 5 with Supervised Fine-tuning (SFT) using RTVLM, and this bolsters the models' performance with 10% in RTVLM test set, 13% in MM-Hal, and without noticeable decline in MM-Bench, overpassing other LLaVA-based models with regular alignment data.

Fairness

Developing ChatGPT for Biology and Medicine: A Complete Review of Biomedical Question Answering

no code implementations15 Jan 2024 Qing Li, Lei LI, Yu Li

Central to our focus is the utilizing of language models and multimodal paradigms for medical question answering, aiming to guide the research community in selecting appropriate mechanisms for their specific medical research requirements.

Cross-Modal Retrieval Medical Diagnosis +3

Ada-Retrieval: An Adaptive Multi-Round Retrieval Paradigm for Sequential Recommendations

1 code implementation12 Jan 2024 Lei LI, Jianxun Lian, Xiao Zhou, Xing Xie

However, most existing retrieval models employ a single-round inference paradigm, which may not adequately capture the dynamic nature of user preferences and stuck in one area in the item space.

Recommendation Systems Retrieval

DarkShot: Lighting Dark Images with Low-Compute and High-Quality

no code implementations28 Dec 2023 Jiazhang Zheng, Lei LI, Qiuping Liao, Cheng Li, Li Li, Yangxing Liu

This paper proposes a lightweight network that outperforms existing state-of-the-art (SOTA) methods in low-light enhancement tasks while minimizing computation.

4k Image Restoration

SSFlowNet: Semi-supervised Scene Flow Estimation On Point Clouds With Pseudo Label

no code implementations23 Dec 2023 Jingze Chen, Junfeng Yao, Qiqin Lin, Rongzhou Zhou, Lei LI

This paper introduces SSFlowNet, a semi-supervised approach for scene flow estimation, that utilizes a blend of labeled and unlabeled data, optimizing the balance between the cost of labeling and the precision of model training.

Pseudo Label Scene Flow Estimation

Silkie: Preference Distillation for Large Visual Language Models

no code implementations17 Dec 2023 Lei LI, Zhihui Xie, Mukai Li, Shunian Chen, Peiyi Wang, Liang Chen, Yazheng Yang, Benyou Wang, Lingpeng Kong

This paper explores preference distillation for large vision language models (LVLMs), improving their ability to generate helpful and faithful responses anchoring the visual context.

Hallucination Visual Question Answering

Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations

1 code implementation14 Dec 2023 Peiyi Wang, Lei LI, Zhihong Shao, R. X. Xu, Damai Dai, Yifei Li, Deli Chen, Y. Wu, Zhifang Sui

In this paper, we present an innovative process-oriented math process reward model called \textbf{Math-Shepherd}, which assigns a reward score to each step of math problem solutions.

Ranked #13 on Arithmetic Reasoning on GSM8K (using extra training data)

Arithmetic Reasoning GSM8K +2

Seam-guided local alignment and stitching for large parallax images

no code implementations30 Nov 2023 Tianli Liao, Chenyang Zhao, Lei LI, Heling Cao

However, the effectiveness of seam-cutting usually depends on that images can be roughly aligned such that there exists a local region where a plausible seam can be found.

Image Stitching

GenZI: Zero-Shot 3D Human-Scene Interaction Generation

no code implementations29 Nov 2023 Lei LI, Angela Dai

Given a natural language description and a coarse point location of the desired interaction in a 3D scene, we first leverage VLMs to imagine plausible 2D human interactions inpainted into multiple rendered views of the scene.

UniHPE: Towards Unified Human Pose Estimation via Contrastive Learning

no code implementations24 Nov 2023 Zhongyu Jiang, Wenhao Chai, Lei LI, Zhuoran Zhou, Cheng-Yen Yang, Jenq-Neng Hwang

In this paper, we propose UniHPE, a unified Human Pose Estimation pipeline, which aligns features from all three modalities, i. e., 2D human pose estimation, lifting-based and image-based 3D human pose estimation, in the same pipeline.

2D Human Pose Estimation 3D Human Pose Estimation +3

How Multilingual is Multilingual LLM?

no code implementations15 Nov 2023 Fei Yuan, Shuai Yuan, Zhiyong Wu, Lei LI

Large Language Models (LLMs), trained predominantly on extensive English data, often exhibit limitations when applied to other languages.

Object-centric Cross-modal Feature Distillation for Event-based Object Detection

no code implementations9 Nov 2023 Lei LI, Alexander Liniger, Mario Millhaeusler, Vagia Tsiminaki, Yuanyou Li, Dengxin Dai

In this paper, we develop a novel knowledge distillation approach to shrink the performance gap between these two modalities.

Knowledge Distillation Object +2

RPCANet: Deep Unfolding RPCA Based Infrared Small Target Detection

1 code implementation2 Nov 2023 Fengyi Wu, Tianfang Zhang, Lei LI, Yian Huang, Zhenming Peng

Deep learning (DL) networks have achieved remarkable performance in infrared small target detection (ISTD).

Image Reconstruction

CPSeg: Finer-grained Image Semantic Segmentation via Chain-of-Thought Language Prompting

no code implementations24 Oct 2023 Lei LI

In light of this, we introduce the CPSeg, Chain-of-Thought Language Prompting for Finer-grained Semantic Segmentation), an innovative framework designed to augment image segmentation performance by integrating a novel "Chain-of-Thought" process that harnesses textual information associated with images.

Image Segmentation object-detection +4

Syntax Error-Free and Generalizable Tool Use for LLMs via Finite-State Decoding

1 code implementation10 Oct 2023 Kexun Zhang, Hongqiao Chen, Lei LI, William Wang

Large language models (LLMs) have shown promising capabilities in using external tools to solve complex problems.

Math valid

Functional Geometry Guided Protein Sequence and Backbone Structure Co-Design

1 code implementation6 Oct 2023 Zhenqiao Song, Yunlong Zhao, Wenxian Shi, Yang Yang, Lei LI

In this paper, we propose NAEPro, a model to jointly design Protein sequence and structure based on automatically detected functional sites.

Learning Personalized Story Evaluation

no code implementations5 Oct 2023 Danqing Wang, Kevin Yang, Hanlin Zhu, Xiaomeng Yang, Andrew Cohen, Lei LI, Yuandong Tian

We further develop a personalized story evaluation model PERSE to infer reviewer preferences and provide a personalized evaluation.

Retrieval Text Generation

Joint Design of Protein Sequence and Structure based on Motifs

no code implementations4 Oct 2023 Zhenqiao Song, Yunlong Zhao, Yufei Song, Wenxian Shi, Yang Yang, Lei LI

Designing novel proteins with desired functions is crucial in biology and chemistry.

Segment Any Building

no code implementations2 Oct 2023 Lei LI

The task of identifying and segmenting buildings within remote sensing imagery has perennially stood at the forefront of scholarly investigations.

Management Representation Learning +1

Tool-Augmented Reward Modeling

1 code implementation2 Oct 2023 Lei LI, Yekun Chai, Shuohuan Wang, Yu Sun, Hao Tian, Ningyu Zhang, Hua Wu

We validate our approach across a wide range of domains, incorporating seven distinct external tools.

Edge Aware Learning for 3D Point Cloud

no code implementations23 Sep 2023 Lei LI

This paper proposes an innovative approach to Hierarchical Edge Aware 3D Point Cloud Learning (HEA-Net) that seeks to address the challenges of noise in point cloud data, and improve object recognition and segmentation by focusing on edge features.

Object Object Recognition +2

Making Large Language Models Better Reasoners with Alignment

no code implementations5 Sep 2023 Peiyi Wang, Lei LI, Liang Chen, Feifan Song, Binghuai Lin, Yunbo Cao, Tianyu Liu, Zhifang Sui

To address this problem, we introduce an \textit{Alignment Fine-Tuning (AFT)} paradigm, which involves three steps: 1) fine-tuning LLMs with COT training data; 2) generating multiple COT responses for each question, and categorizing them into positive and negative ones based on whether they achieve the correct answer; 3) calibrating the scores of positive and negative responses given by LLMs with a novel constraint alignment loss.

Large Language Models for Generative Recommendation: A Survey and Visionary Discussions

no code implementations3 Sep 2023 Lei LI, Yongfeng Zhang, Dugang Liu, Li Chen

Large language models (LLM) not only have revolutionized the field of natural language processing (NLP) but also have the potential to reshape many other fields, e. g., recommender systems (RS).

Recommendation Systems Re-Ranking

Extrapolating Large Language Models to Non-English by Aligning Languages

2 code implementations9 Aug 2023 Wenhao Zhu, Yunzhe Lv, Qingxiu Dong, Fei Yuan, Jingjing Xu, ShuJian Huang, Lingpeng Kong, Jiajun Chen, Lei LI

We start from targeting individual languages by performing cross-lingual instruction-tuning (CoIT) on LLaMA, i. e. tuning it with translation task data and cross-lingual general task data to obtain cross-lingual models (x-LLaMAs), and formulate underlying scaling laws to investigate the advantages of using scalable translation data.

Translation

3D Shape-Based Myocardial Infarction Prediction Using Point Cloud Classification Networks

no code implementations14 Jul 2023 Marcel Beetz, Yilong Yang, Abhirup Banerjee, Lei LI, Vicente Grau

Myocardial infarction (MI) is one of the most prevalent cardiovascular diseases with associated clinical decision-making typically based on single-valued imaging biomarkers.

Anatomy Decision Making +2

Towards Enabling Cardiac Digital Twins of Myocardial Infarction Using Deep Computational Models for Inverse Inference

no code implementations10 Jul 2023 Lei LI, Julia Camps, Zhinuo, Wang, Abhirup Banerjee, Marcel Beetz, Blanca Rodriguez, Vicente Grau

In this work, we investigate the feasibility of inferring myocardial tissue properties from the electrocardiogram (ECG) within a CDT platform.

Back to Optimization: Diffusion-based Zero-Shot 3D Human Pose Estimation

1 code implementation7 Jul 2023 Zhongyu Jiang, Zhuoran Zhou, Lei LI, Wenhao Chai, Cheng-Yen Yang, Jenq-Neng Hwang

Learning-based methods have dominated the 3D human pose estimation (HPE) tasks with significantly better performance in most benchmarks than traditional optimization-based methods.

Ranked #11 on 3D Human Pose Estimation on 3DPW (PA-MPJPE metric)

3D Human Pose Estimation Image to 3D

Provable Robust Watermarking for AI-Generated Text

4 code implementations30 Jun 2023 Xuandong Zhao, Prabhanjan Ananth, Lei LI, Yu-Xiang Wang

We propose a robust and high-quality watermark method, Unigram-Watermark, by extending an existing approach with a simplified fixed grouping strategy.

Language Modelling

Progression Cognition Reinforcement Learning with Prioritized Experience for Multi-Vehicle Pursuit

1 code implementation8 Jun 2023 Xinhang Li, Yiying Yang, Zheng Yuan, Zhe Wang, Qinwen Wang, Chen Xu, Lei LI, Jianhua He, Lin Zhang

For the more challenging problem of pursuing multiple evading vehicles, these algorithms typically select a fixed target evading vehicle for pursuing vehicles without considering dynamic traffic situation, which significantly reduces pursuing success rate.

Multi-agent Reinforcement Learning reinforcement-learning

M$^3$IT: A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning

no code implementations7 Jun 2023 Lei LI, Yuwei Yin, Shicheng Li, Liang Chen, Peiyi Wang, Shuhuai Ren, Mukai Li, Yazheng Yang, Jingjing Xu, Xu sun, Lingpeng Kong, Qi Liu

To tackle this challenge and promote research in the vision-language field, we introduce the Multi-Modal, Multilingual Instruction Tuning (M$^3$IT) dataset, designed to optimize VLM alignment with human instructions.

World Knowledge

Invisible Image Watermarks Are Provably Removable Using Generative AI

1 code implementation2 Jun 2023 Xuandong Zhao, Kexun Zhang, Zihao Su, Saastha Vasan, Ilya Grishchenko, Christopher Kruegel, Giovanni Vigna, Yu-Xiang Wang, Lei LI

However, if we do not require the watermarked image to look the same as the original one, watermarks that keep the image semantically similar can be an alternative defense against our attack.

Image Denoising

Large Language Models are not Fair Evaluators

1 code implementation29 May 2023 Peiyi Wang, Lei LI, Liang Chen, Zefan Cai, Dawei Zhu, Binghuai Lin, Yunbo Cao, Qi Liu, Tianyu Liu, Zhifang Sui

In this paper, we uncover a systematic bias in the evaluation paradigm of adopting large language models~(LLMs), e. g., GPT-4, as a referee to score and compare the quality of responses generated by candidate models.

Language Modelling Large Language Model +1

Neural Machine Translation with Dynamic Graph Convolutional Decoder

no code implementations28 May 2023 Lei LI, Kai Fan, Lingyu Yang, Hongjia Li, Chun Yuan

Existing wisdom demonstrates the significance of syntactic knowledge for the improvement of neural machine translation models.

Machine Translation Translation

ImageNetVC: Zero- and Few-Shot Visual Commonsense Evaluation on 1000 ImageNet Categories

1 code implementation24 May 2023 Heming Xia, Qingxiu Dong, Lei LI, Jingjing Xu, Tianyu Liu, Ziwei Qin, Zhifang Sui

Recently, Large Language Models (LLMs) have been serving as general-purpose interfaces, posing a significant demand for comprehensive visual knowledge.

Common Sense Reasoning

ALGO: Synthesizing Algorithmic Programs with LLM-Generated Oracle Verifiers

1 code implementation NeurIPS 2023 Kexun Zhang, Danqing Wang, Jingtao Xia, William Yang Wang, Lei LI

To address these challenges, we propose ALGO, a framework that synthesizes Algorithmic programs with LLM-Generated Oracles to guide the generation and verify their correctness.

Code Generation

AutoPlan: Automatic Planning of Interactive Decision-Making Tasks With Large Language Models

1 code implementation24 May 2023 Siqi Ouyang, Lei LI

However, LLMs frequently fail in complex decision-making tasks due to the misalignment between the pre-trained knowledge in LLMs and the actual rules in the environment.

Decision Making Language Modelling +1

INSTRUCTSCORE: Explainable Text Generation Evaluation with Finegrained Feedback

1 code implementation23 May 2023 Wenda Xu, Danqing Wang, Liangming Pan, Zhenqiao Song, Markus Freitag, William Yang Wang, Lei LI

By harnessing both explicit human instruction and the implicit knowledge of GPT-4, we fine-tune a text evaluation metric based on LLaMA, producing both a score for generated text and a human readable diagnostic report.

Text Generation

Learning from Mistakes via Cooperative Study Assistant for Large Language Models

1 code implementation23 May 2023 Danqing Wang, Lei LI

In this paper, we propose Study Assistant for Large LAnguage Model (SALAM), a novel framework with an auxiliary agent to assist the main LLM in learning from mistakes through interactive cooperation.

Imitation Learning Language Modelling +1

Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning

1 code implementation23 May 2023 Lean Wang, Lei LI, Damai Dai, Deli Chen, Hao Zhou, Fandong Meng, Jie zhou, Xu sun

In-context learning (ICL) emerges as a promising capability of large language models (LLMs) by providing them with demonstration examples to perform diverse tasks.

In-Context Learning

Can Language Models Understand Physical Concepts?

1 code implementation23 May 2023 Lei LI, Jingjing Xu, Qingxiu Dong, Ce Zheng, Qi Liu, Lingpeng Kong, Xu sun

Language models~(LMs) gradually become general-purpose interfaces in the interactive and embodied world, where the understanding of physical concepts is an essential prerequisite.

Extrapolating Multilingual Understanding Models as Multilingual Generators

no code implementations22 May 2023 Bohong Wu, Fei Yuan, Hai Zhao, Lei LI, Jingjing Xu

Considering that encoder-based models have the advantage of efficient generation and self-correction abilities, this paper explores methods to empower multilingual understanding models the generation abilities to get a unified model.

Denoising Machine Translation +5

Can We Edit Factual Knowledge by In-Context Learning?

2 code implementations22 May 2023 Ce Zheng, Lei LI, Qingxiu Dong, Yuxuan Fan, Zhiyong Wu, Jingjing Xu, Baobao Chang

Inspired by in-context learning (ICL), a new paradigm based on demonstration contexts without parameter updating, we explore whether ICL can edit factual knowledge.

In-Context Learning knowledge editing

Communication Efficient Federated Learning for Multilingual Neural Machine Translation with Adapter

1 code implementation21 May 2023 Yi Liu, Xiaohan Bi, Lei LI, Sishuo Chen, Wenkai Yang, Xu sun

However, as pre-trained language models (PLMs) continue to increase in size, the communication cost for transmitting parameters during synchronization has become a training speed bottleneck.

Clustering Federated Learning +2

Importance Weighted Expectation-Maximization for Protein Sequence Design

1 code implementation30 Apr 2023 Zhenqiao Song, Lei LI

How can we efficiently generate diverse and novel protein sequences with high fitness?

Segment Anything Model for Medical Images?

1 code implementation28 Apr 2023 Yuhao Huang, Xin Yang, Lian Liu, Han Zhou, Ao Chang, Xinrui Zhou, Rusi Chen, Junxuan Yu, Jiongquan Chen, Chaoyu Chen, Sijing Liu, Haozhe Chi, Xindi Hu, Kejuan Yue, Lei LI, Vicente Grau, Deng-Ping Fan, Fajin Dong, Dong Ni

To fully validate SAM's performance on medical data, we collected and sorted 53 open-source datasets and built a large medical segmentation dataset with 18 modalities, 84 objects, 125 object-modality paired targets, 1050K 2D images, and 6033K masks.

Image Segmentation Medical Image Segmentation +3

Revisiting k-NN for Fine-tuning Pre-trained Language Models

1 code implementation18 Apr 2023 Lei LI, Jing Chen, Bozhong Tian, Ningyu Zhang

Pre-trained Language Models (PLMs), as parametric-based eager learners, have become the de-facto choice for current paradigms of Natural Language Processing (NLP).

Influence of Myocardial Infarction on QRS Properties: A Simulation Study

no code implementations4 Apr 2023 Lei LI, Julia Camps, Zhinuo, Wang, Abhirup Banerjee, Blanca Rodriguez, Vicente Grau

However, the influence of various MI properties on the QRS is not intuitively predictable. In this work, we have systematically investigated the effects of 17 post-MI scenarios, varying the location, size, transmural extent, and conductive level of scarring and border zone area, on the forward-calculated QRS.

Generalizable Local Feature Pre-training for Deformable Shape Analysis

1 code implementation CVPR 2023 Souhaib Attaiki, Lei LI, Maks Ovsjanikov

We observe that with proper training, learned features can be useful in such tasks, but, crucially, only with an appropriate choice of the receptive field size.

Transfer Learning

ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models

no code implementations9 Feb 2023 Pengfei Zhu, Chao Pang, Yekun Chai, Lei LI, Shuohuan Wang, Yu Sun, Hao Tian, Hua Wu

In response to this lacuna, this paper introduces a pioneering contribution in the form of a text-to-waveform music generation model, underpinned by the utilization of diffusion models.

Music Generation Text-to-Music Generation

Aligning Multi-Sequence CMR Towards Fully Automated Myocardial Pathology Segmentation

no code implementations7 Feb 2023 Wangbin Ding, Lei LI, Junyi Qiu, Sihan Wang, Liqin Huang, Yinyin Chen, Shan Yang, Xiahai Zhuang

For instance, balanced steady-state free precession cine sequences present clear anatomical boundaries, while late gadolinium enhancement and T2-weighted CMR sequences visualize myocardial scar and edema of MI, respectively.

Image Registration

Protecting Language Generation Models via Invisible Watermarking

2 code implementations6 Feb 2023 Xuandong Zhao, Yu-Xiang Wang, Lei LI

We can then detect the secret message by probing a suspect model to tell if it is distilled from the protected one.

Model extraction Text Generation

Design and Implementation of A Soccer Ball Detection System with Multiple Cameras

no code implementations31 Jan 2023 Lei LI, Tianfang Zhang, Zhongfeng Kang, Wenhan Zhang

This paper designed and implemented football detection system under multiple cameras for the detection and capture of targets in real-time matches.

Position

One Model for All Domains: Collaborative Domain-Prefix Tuning for Cross-Domain NER

2 code implementations25 Jan 2023 Xiang Chen, Lei LI, Shuofei Qiao, Ningyu Zhang, Chuanqi Tan, Yong Jiang, Fei Huang, Huajun Chen

Previous typical solutions mainly obtain a NER model by pre-trained language models (PLMs) with data from a rich-resource domain and adapt it to the target domain.

NER Text Generation

Geometric ergodicity of SGLD via reflection coupling

no code implementations17 Jan 2023 Lei LI, Jian-Guo Liu, Yuliang Wang

We consider the geometric ergodicity of the Stochastic Gradient Langevin Dynamics (SGLD) algorithm under nonconvexity settings.

BuildSeg: A General Framework for the Segmentation of Buildings

no code implementations15 Jan 2023 Lei LI, Tianfang Zhang, Stefan Oehmcke, Fabian Gieseke, Christian Igel

Building segmentation from aerial images and 3D laser scanning (LiDAR) is a challenging task due to the diversity of backgrounds, building textures, and image quality.

Multi-Target Landmark Detection with Incomplete Images via Reinforcement Learning and Shape Prior

no code implementations13 Jan 2023 Kaiwen Wan, Lei LI, Dengqiang Jia, Shangqi Gao, Wei Qian, Yingzhi Wu, Huandong Lin, Xiongzheng Mu, Xin Gao, Sijia Wang, Fuping Wu, Xiahai Zhuang

This is particularly evident for the learning-based multi-target landmark detection, where algorithms could be misleading to learn primarily the variation of background due to the varying FOV, failing the detection of targets.

Reinforcement Learning (RL)

VQNet 2.0: A New Generation Machine Learning Framework that Unifies Classical and Quantum

no code implementations9 Jan 2023 Huanyu Bian, Zhilong Jia, Menghan Dou, Yuan Fang, Lei LI, Yiming Zhao, Hanchao Wang, Zhaohui Zhou, Wei Wang, Wenyu Zhu, Ye Li, Yang Yang, Weiming Zhang, Nenghai Yu, Zhaoyun Chen, Guoping Guo

Therefore, based on VQNet 1. 0, we further propose VQNet 2. 0, a new generation of unified classical and quantum machine learning framework that supports hybrid optimization.

Quantum Machine Learning Unity

A Survey on In-context Learning

1 code implementation31 Dec 2022 Qingxiu Dong, Damai Dai, Ce Zheng, Zhiyong Wu, Baobao Chang, Xu sun, Jingjing Xu, Lei LI, Zhifang Sui

With the increasing ability of large language models (LLMs), in-context learning (ICL) has become a new paradigm for natural language processing (NLP), where LLMs make predictions only based on contexts augmented with a few examples.

In-Context Learning

Go-tuning: Improving Zero-shot Learning Abilities of Smaller Language Models

no code implementations20 Dec 2022 Jingjing Xu, Qingxiu Dong, Hongyi Liu, Lei LI

With increasing scale, large language models demonstrate both quantitative improvement and new qualitative capabilities, especially as zero-shot learners, like GPT-3.

Language Modelling Masked Language Modeling +2

Lego-MT: Learning Detachable Models for Massively Multilingual Machine Translation

1 code implementation20 Dec 2022 Fei Yuan, Yinquan Lu, Wenhao Zhu, Lingpeng Kong, Lei LI, Yu Qiao, Jingjing Xu

To address the needs of learning representations for all languages in a unified space, we propose a novel efficient training recipe, upon which we build an effective detachable model, Lego-MT.

Machine Translation Translation

WACO: Word-Aligned Contrastive Learning for Speech Translation

1 code implementation19 Dec 2022 Siqi Ouyang, Rong Ye, Lei LI

In this paper, we propose Word-Aligned COntrastive learning (WACO), a simple and effective method for extremely low-resource speech-to-text translation.

Contrastive Learning Speech-to-Text Translation +1

SESCORE2: Learning Text Generation Evaluation via Synthesizing Realistic Mistakes

1 code implementation19 Dec 2022 Wenda Xu, Xian Qian, Mingxuan Wang, Lei LI, William Yang Wang

In this paper, we propose SESCORE2, a self-supervised approach for training a model-based metric for text generation evaluation.

Dialogue Generation Machine Translation +2

Mask-FPAN: Semi-Supervised Face Parsing in the Wild With De-Occlusion and UV GAN

no code implementations18 Dec 2022 Lei LI, Tianfang Zhang, Zhongfeng Kang, Xikun Jiang

Fine-grained semantic segmentation of a person's face and head, including facial parts and head components, has progressed a great deal in recent years.

Face Model Face Parsing +1

Pre-trained Language Models Can be Fully Zero-Shot Learners

2 code implementations14 Dec 2022 Xuandong Zhao, Siqi Ouyang, Zhiguo Yu, Ming Wu, Lei LI

How can we extend a pre-trained model to many language understanding tasks, without labeled or additional unlabeled data?

Retrieval text-classification +3

Accelerating Antimicrobial Peptide Discovery with Latent Structure

1 code implementation28 Nov 2022 Danqing Wang, Zeyu Wen, Fei Ye, Lei LI, Hao Zhou

By sampling in the latent space, LSSAMP can simultaneously generate peptides with ideal sequence attributes and secondary structures.

Quantization

MyoPS-Net: Myocardial Pathology Segmentation with Flexible Combination of Multi-Sequence CMR Images

no code implementations6 Nov 2022 Junyi Qiu, Lei LI, Sihan Wang, Ke Zhang, Yinyin Chen, Shan Yang, Xiahai Zhuang

We therefore conducted extensive experiments to investigate the performance of the proposed method in dealing with such complex combinations of different CMR sequences.

Segmentation

Gradient Knowledge Distillation for Pre-trained Language Models

1 code implementation2 Nov 2022 Lean Wang, Lei LI, Xu sun

Knowledge distillation (KD) is an effective framework to transfer knowledge from a large-scale teacher to a compact yet well-performing student.

Knowledge Distillation

Learning Multi-resolution Functional Maps with Spectral Attention for Robust Shape Matching

1 code implementation12 Oct 2022 Lei LI, Nicolas Donati, Maks Ovsjanikov

Our approach is not only accurate with near-isometric input, for which a high spectral resolution is typically preferred, but also robust and able to produce reasonable matching even in the presence of significant non-isometric distortion, which poses great challenges to existing methods.

From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models

1 code implementation11 Oct 2022 Lei LI, Yankai Lin, Xuancheng Ren, Guangxiang Zhao, Peng Li, Jie zhou, Xu sun

We then design a Model Uncertainty--aware Knowledge Integration (MUKI) framework to recover the golden supervision for the student.

PARAGEN : A Parallel Generation Toolkit

1 code implementation7 Oct 2022 Jiangtao Feng, Yi Zhou, Jun Zhang, Xian Qian, Liwei Wu, Zhexi Zhang, Yanming Liu, Mingxuan Wang, Lei LI, Hao Zhou

PARAGEN is a PyTorch-based NLP toolkit for further development on parallel generation.

Model Selection

Just ClozE! A Novel Framework for Evaluating the Factual Consistency Faster in Abstractive Summarization

1 code implementation6 Oct 2022 Yiyang Li, Lei LI, Marina Litvak, Natalia Vanetik, Dingxin Hu, Yuze Li, Yanquan Zhou

The issue of factual consistency in abstractive summarization has received extensive attention in recent years, and the evaluation of factual consistency between summary and document has become an important and urgent task.

Abstractive Text Summarization Language Modelling +2

Block-Structured Optimization for Subgraph Detection in Interdependent Networks

no code implementations6 Oct 2022 Fei Jie, Chunpai Wang, Feng Chen, Lei LI, Xindong Wu

We propose a generalized framework for block-structured nonconvex optimization, which can be applied to structured subgraph detection in interdependent networks, such as multi-layer networks, temporal networks, networks of networks, and many others.

Safety-based Speed Control of a Wheelchair using Robust Adaptive Model Predictive Control

no code implementations6 Oct 2022 Meng Yuan, Ye Wang, Lei LI, Tianyou Chai, Wei Tech Ang

Electric-powered wheelchair plays an important role in providing accessibility for people with mobility impairment.

Model Predictive Control

SRFeat: Learning Locally Accurate and Globally Consistent Non-Rigid Shape Correspondence

1 code implementation16 Sep 2022 Lei LI, Souhaib Attaiki, Maks Ovsjanikov

In this work, we present a novel learning-based framework that combines the local accuracy of contrastive learning with the global consistency of geometric approaches, for robust non-rigid matching.

Contrastive Learning

Rethinking the Unpretentious U-net for Medical Ultrasound Image Segmentation

2 code implementations15 Sep 2022 Gongping Chen, Lei LI, Jianxun Zhang, Yu Dai

However, variable tumor morphology, blurred boundary, and similar intensity distributions bring challenges for accurate segmentation of breast tumors.

Image Segmentation Segmentation +1

Multi-Modality Cardiac Image Computing: A Survey

no code implementations26 Aug 2022 Lei LI, Wangbin Ding, Liqun Huang, Xiahai Zhuang, Vicente Grau

Multi-modality cardiac imaging plays a key role in the management of patients with cardiovascular diseases.

Management

A deep learning framework for geodesics under spherical Wasserstein-Fisher-Rao metric and its application for weighted sample generation

no code implementations25 Aug 2022 Yang Jing, Jiaheng Chen, Lei LI, Jianfeng Lu

In this paper, we develop a deep learning framework to compute the geodesics under the spherical WFR metric, and the learned geodesics can be adopted to generate weighted samples.

Bayesian Inference

Deep Computational Model for the Inference of Ventricular Activation Properties

no code implementations8 Aug 2022 Lei LI, Julia Camps, Abhirup Banerjee, Marcel Beetz, Blanca Rodriguez, Vicente Grau

Cardiac digital twins can provide non-invasive characterizations of cardiac functions for individual patients, and therefore are promising for the patient-specific diagnosis and therapy stratification.

Anatomy

Distributional Correlation--Aware Knowledge Distillation for Stock Trading Volume Prediction

1 code implementation4 Aug 2022 Lei LI, Zhiyuan Zhang, Ruihan Bao, Keiko Harimoto, Xu sun

Traditional knowledge distillation in classification problems transfers the knowledge via class correlations in the soft label produced by teacher models, which are not available in regression problems like stock trading volume prediction.

Knowledge Distillation regression

A sharp uniform-in-time error estimate for Stochastic Gradient Langevin Dynamics

no code implementations19 Jul 2022 Lei LI, Yuliang Wang

We establish a sharp uniform-in-time error estimate for the Stochastic Gradient Langevin Dynamics (SGLD), which is a popular sampling algorithm.

valid

On uniform-in-time diffusion approximation for stochastic gradient descent

no code implementations11 Jul 2022 Lei LI, Yuliang Wang

The main technique is to establish the exponential decay rates of the derivatives of the solution to the backward Kolmogorov equation.

valid

On the Learning of Non-Autoregressive Transformers

no code implementations13 Jun 2022 Fei Huang, Tianhua Tao, Hao Zhou, Lei LI, Minlie Huang

Non-autoregressive Transformer (NAT) is a family of text generation models, which aims to reduce the decoding latency by predicting the whole sentences in parallel.

Text Generation

Decoupling Predictions in Distributed Learning for Multi-Center Left Atrial MRI Segmentation

1 code implementation10 Jun 2022 Zheyao Gao, Lei LI, Fuping Wu, Sihan Wang, Xiahai Zhuang

In this work, we propose a new framework of distributed learning that bridges the gap between two groups, and improves the performance for both generic and local data.

MRI segmentation

Delving into the Openness of CLIP

1 code implementation4 Jun 2022 Shuhuai Ren, Lei LI, Xuancheng Ren, Guangxiang Zhao, Xu sun

However, evaluating the openness of CLIP-like models is challenging, as the models are open to arbitrary vocabulary in theory, but their accuracy varies in practice.

Image Classification Text Matching

Decoupling Knowledge from Memorization: Retrieval-augmented Prompt Learning

2 code implementations29 May 2022 Xiang Chen, Lei LI, Ningyu Zhang, Xiaozhuan Liang, Shumin Deng, Chuanqi Tan, Fei Huang, Luo Si, Huajun Chen

Specifically, vanilla prompt learning may struggle to utilize atypical instances by rote during fully-supervised training or overfit shallow patterns with low-shot data.

Few-Shot Text Classification Memorization +5

Enhancing Cross-lingual Transfer by Manifold Mixup

1 code implementation ICLR 2022 Huiyun Yang, Huadong Chen, Hao Zhou, Lei LI

Based on large-scale pre-trained multilingual representations, recent cross-lingual transfer methods have achieved impressive transfer performances.

Cross-Lingual Transfer

Good Visual Guidance Makes A Better Extractor: Hierarchical Visual Prefix for Multimodal Entity and Relation Extraction

1 code implementation7 May 2022 Xiang Chen, Ningyu Zhang, Lei LI, Yunzhi Yao, Shumin Deng, Chuanqi Tan, Fei Huang, Luo Si, Huajun Chen

To deal with these issues, we propose a novel Hierarchical Visual Prefix fusion NeTwork (HVPNeT) for visual-enhanced entity and relation extraction, aiming to achieve more effective and robust performance.

named-entity-recognition Named Entity Recognition +3

Cross-modal Contrastive Learning for Speech Translation

1 code implementation NAACL 2022 Rong Ye, Mingxuan Wang, Lei LI

Learning similar representations for semantically similar speech and text is important for speech translation.

Contrastive Learning Retrieval +3

Relation Extraction as Open-book Examination: Retrieval-enhanced Prompt Tuning

1 code implementation4 May 2022 Xiang Chen, Lei LI, Ningyu Zhang, Chuanqi Tan, Fei Huang, Luo Si, Huajun Chen

Note that the previous parametric learning paradigm can be viewed as memorization regarding training data as a book and inference as the close-book test.

Few-Shot Learning Memorization +3

Hybrid Transformer with Multi-level Fusion for Multimodal Knowledge Graph Completion

1 code implementation4 May 2022 Xiang Chen, Ningyu Zhang, Lei LI, Shumin Deng, Chuanqi Tan, Changliang Xu, Fei Huang, Luo Si, Huajun Chen

Since most MKGs are far from complete, extensive knowledge graph completion studies have been proposed focusing on the multimodal entity, relation extraction and link prediction.

Information Retrieval Link Prediction +4

Provably Confidential Language Modelling

1 code implementation NAACL 2022 Xuandong Zhao, Lei LI, Yu-Xiang Wang

Large language models are shown to memorize privacy information such as social security numbers in training data.

Language Modelling Memorization +1

Learning Design and Construction with Varying-Sized Materials via Prioritized Memory Resets

1 code implementation12 Apr 2022 Yunfei Li, Tao Kong, Lei LI, Yi Wu

Can a robot autonomously learn to design and construct a bridge from varying-sized blocks without a blueprint?

Motion Planning

$\textit{latent}$-GLAT: Glancing at Latent Variables for Parallel Text Generation

1 code implementation5 Apr 2022 Yu Bao, Hao Zhou, ShuJian Huang, Dongqi Wang, Lihua Qian, Xinyu Dai, Jiajun Chen, Lei LI

Recently, parallel text generation has received widespread attention due to its success in generation efficiency.

Text Generation

E-KAR: A Benchmark for Rationalizing Natural Language Analogical Reasoning

no code implementations Findings (ACL) 2022 Jiangjie Chen, Rui Xu, Ziquan Fu, Wei Shi, Zhongqiao Li, Xinbo Zhang, Changzhi Sun, Lei LI, Yanghua Xiao, Hao Zhou

Holding the belief that models capable of reasoning should be right for the right reasons, we propose a first-of-its-kind Explainable Knowledge-intensive Analogical Reasoning benchmark (E-KAR).

Explanation Generation Question Answering

$ \text{T}^3 $OMVP: A Transformer-based Time and Team Reinforcement Learning Scheme for Observation-constrained Multi-Vehicle Pursuit in Urban Area

1 code implementation1 Mar 2022 Zheng Yuan, Tianhao Wu, Qinwen Wang, Yiying Yang, Lei LI, Lin Zhang

Although there are some achievements in the field of MVP in the open space environment, the urban area brings complicated road structures and restricted moving spaces as challenges to the resolution of MVP games.

Decision Making

KMIR: A Benchmark for Evaluating Knowledge Memorization, Identification and Reasoning Abilities of Language Models

no code implementations28 Feb 2022 Daniel Gao, Yantao Jia, Lei LI, Chengzhen Fu, Zhicheng Dou, Hao Jiang, Xinyu Zhang, Lei Chen, Zhao Cao

However, to figure out whether PLMs can be reliable knowledge sources and used as alternative knowledge bases (KBs), we need to further explore some critical features of PLMs.

General Knowledge Memorization +1

Deepfake Network Architecture Attribution

1 code implementation28 Feb 2022 Tianyun Yang, Ziyao Huang, Juan Cao, Lei LI, Xirong Li

With the rapid progress of generation technology, it has become necessary to attribute the origin of fake images.

Attribute DeepFake Detection +2

Personalized Prompt Learning for Explainable Recommendation

1 code implementation15 Feb 2022 Lei LI, Yongfeng Zhang, Li Chen

In the latter case, ID vectors are randomly initialized but the model is trained in advance on large corpora, so they are actually in different learning stages.

Explainable Recommendation Recommendation Systems +1

Cross-Modality Multi-Atlas Segmentation via Deep Registration and Label Fusion

1 code implementation4 Feb 2022 Wangbin Ding, Lei LI, Xiahai Zhuang, Liqin Huang

For the label fusion, we design a similarity estimation network (SimNet), which estimates the fusion weight of each atlas by measuring its similarity to the target image.

Computational Efficiency Image Registration +4

AWSnet: An Auto-weighted Supervision Attention Network for Myocardial Scar and Edema Segmentation in Multi-sequence Cardiac Magnetic Resonance Images

1 code implementation14 Jan 2022 Kai-Ni Wang, Xin Yang, Juzheng Miao, Lei LI, Jing Yao, Ping Zhou, Wufeng Xue, Guang-Quan Zhou, Xiahai Zhuang, Dong Ni

Extensive experimental results on a publicly available dataset from Myocardial pathology segmentation combining multi-sequence CMR (MyoPS 2020) demonstrate our method can achieve promising performance compared with other state-of-the-art methods.

Segmentation

Deep Learning Based 3D Point Cloud Regression for Estimating Forest Biomass

no code implementations21 Dec 2021 Stefan Oehmcke, Lei LI, Katerina Trepekli, Jaime Revenga, Thomas Nord-Larsen, Fabian Gieseke, Christian Igel

Quantification of forest biomass stocks and their dynamics is important for implementing effective climate change mitigation measures.

Management regression

Model Uncertainty-Aware Knowledge Amalgamation for Pre-Trained Language Models

no code implementations14 Dec 2021 Lei LI, Yankai Lin, Xuancheng Ren, Guangxiang Zhao, Peng Li, Jie zhou, Xu sun

As many fine-tuned pre-trained language models~(PLMs) with promising performance are generously released, investigating better ways to reuse these models is vital as it can greatly reduce the retraining computational cost and the potential environmental side-effects.

Unsupervised Editing for Counterfactual Stories

1 code implementation10 Dec 2021 Jiangjie Chen, Chun Gan, Sijie Cheng, Hao Zhou, Yanghua Xiao, Lei LI

We also propose a new metric to alleviate the shortcomings of current automatic metrics and better evaluate the trade-off.

counterfactual

StrokeNet: Stroke Assisted and Hierarchical Graph Reasoning Networks

no code implementations23 Nov 2021 Lei LI, Kai Fan, Chun Yuan

Scene text detection is still a challenging task, as there may be extremely small or low-resolution strokes, and close or arbitrary-shaped texts.

Node Classification Relational Reasoning +2

A Survey on Green Deep Learning

no code implementations8 Nov 2021 Jingjing Xu, Wangchunshu Zhou, Zhiyi Fu, Hao Zhou, Lei LI

In recent years, larger and deeper models are springing up and continuously pushing state-of-the-art (SOTA) results across various fields like natural language processing (NLP) and computer vision (CV).

Knowledge Distillation Model Compression

Multi-Modality Cardiac Image Analysis with Deep Learning

no code implementations8 Nov 2021 Lei LI, Fuping Wu, Sihang Wang, Xiahai Zhuang

Accurate cardiac computing, analysis and modeling from multi-modality images are important for the diagnosis and treatment of cardiac disease.

Image Segmentation Segmentation +2

Self-Supervised Speech Denoising Using Only Noisy Audio Signals

1 code implementation30 Oct 2021 Jiasong Wu, Qingchun Li, Guanyu Yang, Lei LI, Lotfi Senhadji, Huazhong Shu

The first module adopts a random audio sub-sampler on each noisy audio to generate training pairs.

Audio Denoising Denoising +1

CNewSum: A Large-scale Chinese News Summarization Dataset with Human-annotated Adequacy and Deducibility Level

no code implementations21 Oct 2021 Danqing Wang, Jiaze Chen, Xianze Wu, Hao Zhou, Lei LI

In this paper, we present a large-scale Chinese news summarization dataset CNewSum, which consists of 304, 307 documents and human-written summaries for the news feed.

News Summarization Text Summarization

Well-classified Examples are Underestimated in Classification with Deep Neural Networks

1 code implementation13 Oct 2021 Guangxiang Zhao, Wenkai Yang, Xuancheng Ren, Lei LI, Yunfang Wu, Xu sun

The conventional wisdom behind learning deep classification models is to focus on bad-classified examples and ignore well-classified examples that are far from the decision boundary.

Graph Classification imbalanced classification +4

LightSeq2: Accelerated Training for Transformer-based Models on GPUs

1 code implementation12 Oct 2021 Xiaohui Wang, Yang Wei, Ying Xiong, Guyue Huang, Xian Qian, Yufei Ding, Mingxuan Wang, Lei LI

In this paper, we present LightSeq2, a system to accelerate training for a general family of Transformer models on GPUs.

Machine Translation Speech Recognition +1

NAIL: A Challenging Benchmark for Na\"ive Logical Reasoning

no code implementations29 Sep 2021 Xinbo Zhang, Changzhi Sun, Yue Zhang, Lei LI, Hao Zhou

Logical reasoning over natural text is an important capability towards human level intelligence.

Logical Reasoning

Generating Antimicrobial Peptides from Latent Secondary Structure Space

no code implementations29 Sep 2021 Danqing Wang, Zeyu Wen, Lei LI, Hao Zhou

By sampling in the latent secondary structure space, we can generate peptides with ideal amino acids and secondary structures at the same time.

Drug Discovery

Dynamic Knowledge Distillation for Pre-trained Language Models

1 code implementation EMNLP 2021 Lei LI, Yankai Lin, Shuhuai Ren, Peng Li, Jie zhou, Xu sun

Knowledge distillation~(KD) has been proved effective for compressing large-scale pre-trained language models.

Knowledge Distillation

Learning When to Translate for Streaming Speech

1 code implementation ACL 2022 Qianqian Dong, Yaoming Zhu, Mingxuan Wang, Lei LI

Given a usually long speech sequence, we develop an efficient monotonic segmentation module inside an encoder-decoder model to accumulate acoustic information incrementally and detect proper speech unit boundaries for the input in speech translation task.

Sentence Speech-to-Text Translation +1

Right Ventricular Segmentation from Short- and Long-Axis MRIs via Information Transition

1 code implementation5 Sep 2021 Lei LI, Wangbin Ding, Liqun Huang, Xiahai Zhuang

In this work, we propose an automatic RV segmentation framework, where the information from long-axis (LA) views is utilized to assist the segmentation of short-axis (SA) views via information transition.

Segmentation

Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification

1 code implementation EMNLP 2021 Shuhuai Ren, Jinchao Zhang, Lei LI, Xu sun, Jie zhou

Data augmentation aims to enrich training samples for alleviating the overfitting issue in low-resource or class-imbalanced situations.

Bayesian Optimization Data Augmentation +2

Secoco: Self-Correcting Encoding for Neural Machine Translation

no code implementations Findings (EMNLP) 2021 Tao Wang, Chengqi Zhao, Mingxuan Wang, Lei LI, Hang Li, Deyi Xiong

This paper presents Self-correcting Encoding (Secoco), a framework that effectively deals with input noise for robust neural machine translation by introducing self-correcting predictors.

Machine Translation NMT +1

WSDesc: Weakly Supervised 3D Local Descriptor Learning for Point Cloud Registration

1 code implementation5 Aug 2021 Lei LI, Hongbo Fu, Maks Ovsjanikov

Instead of using a predefined fixed-size local support in voxelization, we propose to learn the optimal support in a data-driven manner.

Metric Learning Point Cloud Registration

Learning to Design and Construct Bridge without Blueprint

no code implementations5 Aug 2021 Yunfei Li, Tao Kong, Lei LI, Yifeng Li, Yi Wu

In this task, the robot needs to first design a feasible bridge architecture for arbitrarily wide cliffs and then manipulate the blocks reliably to construct a stable bridge according to the proposed design.

Motion Planning

Simultaneous Semantic and Collision Learning for 6-DoF Grasp Pose Estimation

no code implementations5 Aug 2021 Yiming Li, Tao Kong, Ruihang Chu, Yifeng Li, Peng Wang, Lei LI

In a unified framework, we jointly predict the feasible 6-DoF grasp poses, instance semantic segmentation, and collision information.

Multi-Task Learning Pose Estimation +1

Pre-training Methods for Neural Machine Translation

no code implementations ACL 2021 Mingxuan Wang, Lei LI

This tutorial provides a comprehensive guide to make the most of pre-training for neural machine translation.

Machine Translation NMT +1

Cannot find the paper you are looking for? You can Submit a new open access paper.