Search Results for author: Wei Liu

Found 470 papers, 185 papers with code

QuickGraph: A Rapid Annotation Tool for Knowledge Graph Extraction from Technical Text

1 code implementation ACL 2022 Tyler Bikaun, Michael Stewart, Wei Liu

Acquiring high-quality annotated corpora for complex multi-task information extraction (MT-IE) is an arduous and costly process for human-annotators.

Clustering

PointPWC-Net: Cost Volume on Point Clouds for (Self-)Supervised Scene Flow Estimation

1 code implementation ECCV 2020 Wenxuan Wu, Zhi Yuan Wang, Zhuwen Li, Wei Liu, Li Fuxin

We propose a novel end-to-end deep scene flow model, called PointPWC-Net, that directly processes 3D point cloud scenes with large motions in a coarse-to-fine fashion.

Self-supervised Scene Flow Estimation

Lexicon-Based Graph Convolutional Network for Chinese Word Segmentation

no code implementations Findings (EMNLP) 2021 Kaiyu Huang, Hao Yu, Junpeng Liu, Wei Liu, Jingxiang Cao, Degen Huang

Experimental results on five benchmarks and four cross-domain datasets show the lexicon-based graph convolutional network successfully captures the information of candidate words and helps to improve performance on the benchmarks (Bakeoff-2005 and CTB6) and the cross-domain datasets (SIGHAN-2010).

Chinese Word Segmentation

LexiClean: An annotation tool for rapid multi-task lexical normalisation

1 code implementation EMNLP (ACL) 2021 Tyler Bikaun, Tim French, Melinda Hodkiewicz, Michael Stewart, Wei Liu

LexiClean’s main contribution is support for simultaneous in situ token-level modification and annotation that can be rapidly applied corpus wide.

What Causes the Failure of Explicit to Implicit Discourse Relation Recognition?

1 code implementation1 Apr 2024 Wei Liu, Stephen Wan, Michael Strube

We consider an unanswered question in the discourse processing community: why do relation classifiers trained on explicit examples (with connectives removed) perform poorly in real implicit scenarios?

Relation

Variational Graph Auto-Encoder Based Inductive Learning Method for Semi-Supervised Classification

no code implementations26 Mar 2024 Hanxuan Yang, Zhaoxin Yu, Qingchao Kong, Wei Liu, Wenji Mao

Graph representation learning is a fundamental research issue in various domains of applications, of which the inductive learning problem is particularly challenging as it requires models to generalize to unseen graph structures during inference.

Graph Representation Learning Node Classification

CodeS: Natural Language to Code Repository via Multi-Layer Sketch

2 code implementations25 Mar 2024 Daoguang Zan, Ailun Yu, Wei Liu, Dong Chen, Bo Shen, Wei Li, Yafen Yao, Yongshun Gong, Xiaolin Chen, Bei guan, Zhiguang Yang, Yongji Wang, Qianxiang Wang, Lizhen Cui

For feedback-based evaluation, we develop a VSCode plugin for CodeS and engage 30 participants in conducting empirical studies.

Benchmarking

Event-Triggered State Estimation Through Confidence Level

no code implementations22 Mar 2024 Wei Liu

This paper considers the state estimation problem for discrete-time linear systems under event-triggered scheme.

Reversible Jump Attack to Textual Classifiers with Modification Reduction

1 code implementation21 Mar 2024 Mingze Ni, Zhensu Sun, Wei Liu

Recent studies on adversarial examples expose vulnerabilities of natural language processing (NLP) models.

LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models

1 code implementation18 Mar 2024 Yang Yang, Wen Wang, Liang Peng, Chaotian Song, Yao Chen, Hengjia Li, Xiaolong Yang, Qinglin Lu, Deng Cai, Boxi Wu, Wei Liu

Customization generation techniques have significantly advanced the synthesis of specific concepts across varied contexts.

LocalStyleFool: Regional Video Style Transfer Attack Using Segment Anything Model

no code implementations18 Mar 2024 Yuxin Cao, Jinghao Li, Xi Xiao, Derui Wang, Minhui Xue, Hao Ge, Wei Liu, Guangwu Hu

Benefiting from the popularity and scalably usability of Segment Anything Model (SAM), we first extract different regions according to semantic information and then track them through the video stream to maintain the temporal consistency.

Adversarial Attack Style Transfer +2

OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models

1 code implementation16 Mar 2024 Zhe Kong, Yong Zhang, Tianyu Yang, Tao Wang, Kaihao Zhang, Bizhu Wu, GuanYing Chen, Wei Liu, Wenhan Luo

We also observe that the initiation denoising timestep for noise blending is the key to identity preservation and layout.

Denoising Text-to-Image Generation

Securely Fine-tuning Pre-trained Encoders Against Adversarial Examples

1 code implementation16 Mar 2024 Ziqi Zhou, Minghui Li, Wei Liu, Shengshan Hu, Yechao Zhang, Wei Wan, Lulu Xue, Leo Yu Zhang, Dezhong Yao, Hai Jin

In response to these challenges, we propose Genetic Evolution-Nurtured Adversarial Fine-tuning (Gen-AF), a two-stage adversarial fine-tuning approach aimed at enhancing the robustness of downstream models.

Self-Supervised Learning

DialogGen: Multi-modal Interactive Dialogue System for Multi-turn Text-to-Image Generation

no code implementations13 Mar 2024 Minbin Huang, Yanxin Long, Xinchi Deng, Ruihang Chu, Jiangfeng Xiong, Xiaodan Liang, Hong Cheng, Qinglin Lu, Wei Liu

However, many of these works face challenges in identifying correct output modalities and generating coherent images accordingly as the number of output modalities increases and the conversations go deeper.

Prompt Engineering Text-to-Image Generation

Category-Agnostic Pose Estimation for Point Clouds

no code implementations12 Mar 2024 Bowen Liu, Wei Liu, Siang Chen, Pengwei Xie, Guijin Wang

The goal of object pose estimation is to visually determine the pose of a specific object in the RGB-D input.

Category-Agnostic Pose Estimation Object +1

RIS-Enabled Joint Near-Field 3D Localization and Synchronization in SISO Multipath Environments

no code implementations11 Mar 2024 Han Yan, Hua Chen, Wei Liu, Songjie Yang, Gang Wang, Chau Yuen

Reconfigurable Intelligent Surfaces (RIS) show great promise in the realm of 6th generation (6G) wireless systems, particularly in the areas of localization and communication.

ToolRerank: Adaptive and Hierarchy-Aware Reranking for Tool Retrieval

no code implementations11 Mar 2024 Yuanhang Zheng, Peng Li, Wei Liu, Yang Liu, Jian Luan, Bin Wang

Specifically, our proposed ToolRerank includes Adaptive Truncation, which truncates the retrieval results related to seen and unseen tools at different positions, and Hierarchy-Aware Reranking, which makes retrieval results more concentrated for single-tool queries and more diverse for multi-tool queries.

Retrieval

Large Language Models are In-Context Molecule Learners

no code implementations7 Mar 2024 Jiatong Li, Wei Liu, Zhihao Ding, Wenqi Fan, Yuqiang Li, Qing Li

Specifically, ICMA incorporates the following three stages: Hybrid Context Retrieval, Post-retrieval Re-ranking, and In-context Molecule Tuning.

Cross-Modal Retrieval Re-Ranking +2

A Comprehensive Evaluation of Quantization Strategies for Large Language Models

no code implementations26 Feb 2024 Renren Jin, Jiangcun Du, Wuwei Huang, Wei Liu, Jian Luan, Bin Wang, Deyi Xiong

Our experimental results indicate that LLMs with 4-bit quantization can retain performance comparable to their non-quantized counterparts, and perplexity can serve as a proxy metric for quantized LLMs on most benchmarks.

Language Modelling Quantization

Analysing The Impact of Sequence Composition on Language Model Pre-Training

1 code implementation21 Feb 2024 Yu Zhao, Yuanbin Qu, Konrad Staniszewski, Szymon Tworkowski, Wei Liu, Piotr Miłoś, Yuxiang Wu, Pasquale Minervini

In this work, we find that applying causal masking can lead to the inclusion of distracting information from previous documents during pre-training, which negatively impacts the performance of the models on language modelling and downstream tasks.

In-Context Learning Language Modelling +1

AICAttack: Adversarial Image Captioning Attack with Attention-Based Optimization

no code implementations19 Feb 2024 Jiyao Li, Mingze Ni, Yifei Dong, Tianqing Zhu, Wei Liu

At the intersection of CV and NLP is the problem of image captioning, where the related models' robustness against adversarial attacks has not been well studied.

Adversarial Attack Image Captioning

ChemLLM: A Chemical Large Language Model

no code implementations10 Feb 2024 Di Zhang, Wei Liu, Qian Tan, Jingdan Chen, Hang Yan, Yuliang Yan, Jiatong Li, Weiran Huang, Xiangyu Yue, Dongzhan Zhou, Shufei Zhang, Mao Su, Hansen Zhong, Yuqiang Li, Wanli Ouyang

ChemLLM beats GPT-3. 5 on all three principal tasks in chemistry, i. e., name conversion, molecular caption, and reaction prediction, and surpasses GPT-4 on two of them.

Language Modelling Large Language Model +2

Poisson Process for Bayesian Optimization

no code implementations5 Feb 2024 Xiaoxing Wang, Jiaxing Li, Chao Xue, Wei Liu, Weifeng Liu, Xiaokang Yang, Junchi Yan, DaCheng Tao

BayesianOptimization(BO) is a sample-efficient black-box optimizer, and extensive methods have been proposed to build the absolute function response of the black-box function through a probabilistic surrogate model, including Tree-structured Parzen Estimator (TPE), random forest (SMAC), and Gaussian process (GP).

Bayesian Optimization Hyperparameter Optimization +2

NFT1000: A Visual Text Dataset For Non-Fungible Token Retrieval

no code implementations29 Jan 2024 Shuxun Wang, Yunfei Lei, Ziqi Zhang, Wei Liu, Haowei Liu, Li Yang, Wenjuan Li, Bing Li, Weiming Hu

With the rise of 'Metaverse' and 'Web3. 0', NFT ( Non-Fungible Token ) has emerged as a kind of pivotal digital asset, garnering significant attention.

Retrieval

Enhancing Human Experience in Human-Agent Collaboration: A Human-Centered Modeling Approach Based on Positive Human Gain

no code implementations28 Jan 2024 Yiming Gao, Feiyu Liu, Liang Wang, Zhenjie Lian, Dehua Zheng, Weixuan Wang, Wenjin Yang, Siqin Li, Xianliang Wang, Wenhui Chen, Jing Dai, Qiang Fu, Wei Yang, Lanxiao Huang, Wei Liu

We expect that agents should learn to enhance the extent to which humans achieve these goals while maintaining agents' original abilities (e. g., winning games).

A Systematic Literature Review on Explainability for Machine/Deep Learning-based Software Engineering Research

no code implementations26 Jan 2024 Sicong Cao, Xiaobing Sun, Ratnadira Widyasari, David Lo, Xiaoxue Wu, Lili Bo, Jiale Zhang, Bin Li, Wei Liu, Di wu, Yixin Chen

The remarkable achievements of Artificial Intelligence (AI) algorithms, particularly in Machine Learning (ML) and Deep Learning (DL), have fueled their extensive deployment across multiple sectors, including Software Engineering (SE).

Decision Making Vulnerability Detection

Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images

1 code implementation20 Jan 2024 Kuofeng Gao, Yang Bai, Jindong Gu, Shu-Tao Xia, Philip Torr, Zhifeng Li, Wei Liu

Once attackers maliciously induce high energy consumption and latency time (energy-latency cost) during inference of VLMs, it will exhaust computational resources.

The Radiation Oncology NLP Database

1 code implementation19 Jan 2024 Zhengliang Liu, Jason Holmes, Wenxiong Liao, Chenbin Liu, Lian Zhang, Hongying Feng, Peilong Wang, Muhammad Ali Elahi, Hongmin Cai, Lichao Sun, Quanzheng Li, Xiang Li, Tianming Liu, Jiajian Shen, Wei Liu

ROND is specifically designed to address this gap in the domain of radiation oncology, a field that offers many opportunities for NLP exploration.

Language Modelling Large Language Model +7

TopCoW: Benchmarking Topology-Aware Anatomical Segmentation of the Circle of Willis (CoW) for CTA and MRA

1 code implementation29 Dec 2023 Kaiyuan Yang, Fabio Musio, Yihui Ma, Norman Juchler, Johannes C. Paetzold, Rami Al-Maskari, Luciano Höher, Hongwei Bran Li, Ibrahim Ethem Hamamci, Anjany Sekuboyina, Suprosanna Shit, Houjing Huang, Diana Waldmannstetter, Florian Kofler, Fernando Navarro, Martin Menten, Ivan Ezhov, Daniel Rueckert, Iris Vos, Ynte Ruigrok, Birgitta Velthuis, Hugo Kuijf, Julien Hämmerli, Catherine Wurster, Philippe Bijlenga, Laura Westphal, Jeroen Bisschop, Elisa Colombo, Hakim Baazaoui, Andrew Makmur, James Hallinan, Bene Wiestler, Jan S. Kirschke, Roland Wiest, Emmanuel Montagnon, Laurent Letourneau-Guillon, Adrian Galdran, Francesco Galati, Daniele Falcetta, Maria A. Zuluaga, Chaolong Lin, Haoran Zhao, Zehan Zhang, Sinyoung Ra, Jongyun Hwang, HyunJin Park, Junqiang Chen, Marek Wodzinski, Henning Müller, Pengcheng Shi, Wei Liu, Ting Ma, Cansu Yalçin, Rachika E. Hamadache, Joaquim Salvi, Xavier Llado, Uma Maria Lal-Trehan Estrada, Valeriia Abramova, Luca Giancardo, Arnau Oliver, Jialu Liu, Haibin Huang, Yue Cui, Zehang Lin, Yusheng Liu, Shunzhi Zhu, Tatsat R. Patel, Vincent M. Tutino, Maysam Orouskhani, Huayu Wang, Mahmud Mossa-Basha, Chengcheng Zhu, Maximilian R. Rokuss, Yannick Kirchhoff, Nico Disch, Julius Holzschuh, Fabian Isensee, Klaus Maier-Hein, Yuki Sato, Sven Hirsch, Susanne Wegener, Bjoern Menze

The TopCoW dataset was the first public dataset with voxel-level annotations for thirteen possible CoW vessel components, enabled by virtual-reality (VR) technology.

Anatomy Benchmarking +1

DrugAssist: A Large Language Model for Molecule Optimization

1 code implementation28 Dec 2023 Geyan Ye, Xibao Cai, Houtim Lai, Xing Wang, Junhong Huang, Longyue Wang, Wei Liu, Xiangxiang Zeng

Recently, the impressive performance of large language models (LLMs) on a wide range of tasks has attracted an increasing number of attempts to apply LLMs in drug discovery.

Drug Discovery Language Modelling +1

Experiential Co-Learning of Software-Developing Agents

1 code implementation28 Dec 2023 Chen Qian, Yufan Dang, Jiahao Li, Wei Liu, Weize Chen, Cheng Yang, Zhiyuan Liu, Maosong Sun

Recent advancements in large language models (LLMs) have brought significant changes to various domains, especially through LLM-driven autonomous agents.

What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning

1 code implementation25 Dec 2023 Wei Liu, Weihao Zeng, Keqing He, Yong Jiang, Junxian He

We present deita (short for Data-Efficient Instruction Tuning for Alignment), a series of models fine-tuned from LLaMA and Mistral models using data samples automatically selected with our proposed approach.

DreamTuner: Single Image is Enough for Subject-Driven Generation

no code implementations21 Dec 2023 Miao Hua, Jiawei Liu, Fei Ding, Wei Liu, Jie Wu, Qian He

Diffusion-based models have demonstrated impressive capabilities for text-to-image generation and are expected for personalized applications of subject-driven generation, which require the generation of customized concepts with one or a few reference images.

Text-to-Image Generation

Decoupling Representation and Knowledge for Few-Shot Intent Classification and Slot Filling

no code implementations21 Dec 2023 Jie Han, Yixiong Zou, Haozhao Wang, Jun Wang, Wei Liu, Yao Wu, Tao Zhang, Ruixuan Li

Therefore, current works first train a model on source domains with sufficiently labeled data, and then transfer the model to target domains where only rarely labeled data is available.

intent-classification Intent Classification +4

Enhancing the Rationale-Input Alignment for Self-explaining Rationalization

no code implementations7 Dec 2023 Wei Liu, Haozhao Wang, Jun Wang, Zhiying Deng, Yuankai Zhang, Cheng Wang, Ruixuan Li

Rationalization empowers deep learning models with self-explaining capabilities through a cooperative game, where a generator selects a semantically consistent subset of the input as a rationale, and a subsequent predictor makes predictions based on the selected rationale.

MobileUtr: Revisiting the relationship between light-weight CNN and Transformer for efficient medical image segmentation

1 code implementation4 Dec 2023 Fenghe Tang, Bingkun Nian, Jianrui Ding, Quan Quan, Jie Yang, Wei Liu, S. Kevin Zhou

This work revisits the relationship between CNNs and Transformers in lightweight universal networks for medical image segmentation, aiming to integrate the advantages of both worlds at the infrastructure design level.

Image Segmentation Inductive Bias +3

SRSNetwork: Siamese Reconstruction-Segmentation Networks based on Dynamic-Parameter Convolution

1 code implementation4 Dec 2023 Bingkun Nian, Fenghe Tang, Jianrui Ding, Pingping Zhang, Jie Yang, S. Kevin Zhou, Wei Liu

In this paper, we present a high-performance deep neural network for weak target image segmentation, including medical image segmentation and infrared image segmentation.

Image Segmentation Medical Image Segmentation +2

Noisy probing dose facilitated dose prediction for pencil beam scanning proton therapy: physics enhances generalizability

no code implementations2 Dec 2023 Lian Zhang, Jason M. Holmes, Zhengliang Liu, Hongying Feng, Terence T. Sio, Carlos E. Vargas, Sameer R. Keole, Kristin Stützer, Sheng Li, Tianming Liu, Jiajian Shen, William W. Wong, Sujay A. Vora, Wei Liu

The noisy probing dose method showed better generalizability in the 6 outlier cases than the ROI-based and beam mask-based methods with 3D Gamma passing rates (for prostate cancer, targets: 89. 32%$\pm$1. 45% vs. 93. 48%$\pm$1. 51% vs. 96. 79%$\pm$0. 83%, OARs: 85. 87%$\pm$1. 73% vs. 91. 15%$\pm$1. 13% vs. 94. 29%$\pm$1. 01%).

SmoothVideo: Smooth Video Synthesis with Noise Constraints on Diffusion Models for One-shot Video Tuning

1 code implementation29 Nov 2023 Liang Peng, Haoran Cheng, Zheng Yang, Ruisi Zhao, Linxuan Xia, Chaotian Song, Qinglin Lu, Boxi Wu, Wei Liu

By applying the loss to existing one-shot video tuning methods, we significantly improve the overall consistency and smoothness of the generated videos.

BadCLIP: Trigger-Aware Prompt Learning for Backdoor Attacks on CLIP

no code implementations26 Nov 2023 Jiawang Bai, Kuofeng Gao, Shaobo Min, Shu-Tao Xia, Zhifeng Li, Wei Liu

Contrastive Vision-Language Pre-training, known as CLIP, has shown promising effectiveness in addressing downstream image recognition tasks.

FBChain: A Blockchain-based Federated Learning Model with Efficiency and Secure Communication

no code implementations21 Nov 2023 Yang Li, Chunhe Xia, Wei Liu, Weidong Zhou, Chen Chen, Tianbo Wang

This article proposes Blockchain-based Federated Learning (FBChain) model for federated learning parameter communication to overcome the above two problems.

Federated Learning

Damped Proximal Augmented Lagrangian Method for weakly-Convex Problems with Convex Constraints

no code implementations15 Nov 2023 Hari Dahal, Wei Liu, Yangyang Xu

For the former case, DPALM achieves the complexity of $\widetilde{\mathcal{O}}\left(\varepsilon^{-2. 5} \right)$ to produce an $\varepsilon$-KKT point by applying an accelerated proximal gradient (APG) method to each DPALM subproblem.

Evaluating Large Language Models in Ophthalmology

no code implementations7 Nov 2023 Jason Holmes, Shuyuan Ye, Yiwei Li, Shi-Nan Wu, Zhengliang Liu, Zihao Wu, Jinyu Hu, Huan Zhao, Xi Jiang, Wei Liu, Hong Wei, Jie Zou, Tianming Liu, Yi Shao

Methods: A 100-item ophthalmology single-choice test was administered to three different LLMs (GPT-3. 5, GPT-4, and PaLM2) and three different professional levels (medical undergraduates, medical masters, and attending physicians), respectively.

Decision Making

Evaluating multiple large language models in pediatric ophthalmology

no code implementations7 Nov 2023 Jason Holmes, Rui Peng, Yiwei Li, Jinyu Hu, Zhengliang Liu, Zihao Wu, Huan Zhao, Xi Jiang, Wei Liu, Hong Wei, Jie Zou, Tianming Liu, Yi Shao

IMPORTANCE The response effectiveness of different large language models (LLMs) and various individuals, including medical students, graduate students, and practicing physicians, in pediatric ophthalmology consultations, has not been clearly established yet.

Multiple-choice

Evaluating the Potential of Leading Large Language Models in Reasoning Biology Questions

no code implementations5 Nov 2023 Xinyu Gong, Jason Holmes, Yiwei Li, Zhengliang Liu, Qi Gan, Zihao Wu, Jianli Zhang, Yusong Zou, Yuxi Teng, Tian Jiang, Hongtu Zhu, Wei Liu, Tianming Liu, Yajun Yan

Recent advances in Large Language Models (LLMs) have presented new opportunities for integrating Artificial General Intelligence (AGI) into biological research and education.

Logical Reasoning Multiple-choice

Discussing the Spectra of Physics-Enhanced Machine Learning via a Survey on Structural Mechanics Applications

no code implementations31 Oct 2023 Marcus Haywood-Alexander, Wei Liu, Kiran Bacsa, Zhilu Lai, Eleni Chatzi

The intersection of physics and machine learning has given rise to a paradigm that we refer to here as physics-enhanced machine learning (PEML), aiming to improve the capabilities and reduce the individual shortcomings of data- or physics-only methods.

From Indeterminacy to Determinacy: Augmenting Logical Reasoning Capabilities with Large Language Models

1 code implementation28 Oct 2023 Hongda Sun, Weikai Xu, Wei Liu, Jian Luan, Bin Wang, Shuo Shang, Ji-Rong Wen, Rui Yan

To address these challenges, we propose DetermLR, a novel reasoning framework that formulates the reasoning process as a transformational journey from indeterminate premises to determinate ones.

Logical Reasoning

Joint Entity and Relation Extraction with Span Pruning and Hypergraph Neural Networks

1 code implementation26 Oct 2023 Zhaohui Yan, Songlin Yang, Wei Liu, Kewei Tu

Also, most of current ERE models do not take into account higher-order interactions between multiple entities and relations, while higher-order modeling could be beneficial. In this work, we propose HyperGraph neural network for ERE ($\hgnn{}$), which is built upon the PL-marker (a state-of-the-art marker-based pipleline model).

Joint Entity and Relation Extraction NER +1

Simple Hardware-Efficient PCFGs with Independent Left and Right Productions

1 code implementation23 Oct 2023 Wei Liu, Songlin Yang, Yoon Kim, Kewei Tu

Scaling dense PCFGs to thousands of nonterminals via a low-rank parameterization of the rule probability tensor has been shown to be beneficial for unsupervised parsing.

Constituency Grammar Induction Language Modelling

Benchmarking a foundation LLM on its ability to re-label structure names in accordance with the AAPM TG-263 report

no code implementations5 Oct 2023 Jason Holmes, Lian Zhang, Yuzhen Ding, Hongying Feng, Zhengliang Liu, Tianming Liu, William W. Wong, Sujay A. Vora, Jonathan B. Ashman, Wei Liu

Conclusions: Given the accuracy of GPT-4 in re-labeling structure names of both target volumes and normal tissues as presented in this work, LLMs are poised to be the preferred method for standardizing structure names in radiation oncology, especially considering the rapid advancements in LLM capabilities that are likely to continue.

Benchmarking

D-Separation for Causal Self-Explanation

1 code implementation NeurIPS 2023 Wei Liu, Jun Wang, Haozhao Wang, Ruixuan Li, Zhiying Deng, Yuankai Zhang, Yang Qiu

Instead of attempting to rectify the issues of the MMI criterion, we propose a novel criterion to uncover the causal rationale, termed the Minimum Conditional Dependence (MCD) criterion, which is grounded on our finding that the non-causal features and the target label are \emph{d-separated} by the causal rationale.

CoMFLP: Correlation Measure based Fast Search on ASR Layer Pruning

1 code implementation21 Sep 2023 Wei Liu, Zhiyuan Peng, Tan Lee

The search process is carried out in two steps: (1) coarse search: to determine top $K$ candidates by pruning the most redundant layers based on the correlation matrix; (2) fine search: to select the best pruning proposal among $K$ candidates using a task-specific evaluation metric.

speech-recognition Speech Recognition

Towards Better Data Exploitation in Self-Supervised Monocular Depth Estimation

1 code implementation11 Sep 2023 Jinfeng Liu, Lingtong Kong, Jie Yang, Wei Liu

Additionally, we introduce the detail-enhanced DepthNet with an extra full-scale branch in the encoder and a grid decoder to enhance the restoration of fine details in depth maps.

Data Augmentation Monocular Depth Estimation

MathAttack: Attacking Large Language Models Towards Math Solving Ability

no code implementations4 Sep 2023 ZiHao Zhou, Qiufeng Wang, Mingyu Jin, Jie Yao, Jianan Ye, Wei Liu, Wei Wang, Xiaowei Huang, Kaizhu Huang

Instead of attacking prompts in the use of LLMs, we propose a MathAttack model to attack MWP samples which are closer to the essence of security in solving math problems.

Adversarial Attack GSM8K +1

Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints

1 code implementation24 Aug 2023 Hanchi Huang, Li Shen, Deheng Ye, Wei Liu

We propose a novel master-slave architecture to solve the top-$K$ combinatorial multi-armed bandits problem with non-linear bandit feedback and diversity constraints, which, to the best of our knowledge, is the first combinatorial bandits setting considering diversity constraints under bandit feedback.

Multi-Armed Bandits

Recurrent Multi-scale Transformer for High-Resolution Salient Object Detection

1 code implementation7 Aug 2023 Xinhao Deng, Pingping Zhang, Wei Liu, Huchuan Lu

To address above issues, in this work, we first propose a new HRS10K dataset, which contains 10, 500 high-quality annotated images at 2K-8K resolution.

2k 8k +3

LLDiffusion: Learning Degradation Representations in Diffusion Models for Low-Light Image Enhancement

1 code implementation27 Jul 2023 Tao Wang, Kaihao Zhang, Ziqian Shao, Wenhan Luo, Bjorn Stenger, Tae-Kyun Kim, Wei Liu, Hongdong Li

In this paper, we address this limitation by proposing a degradation-aware learning scheme for LLIE using diffusion models, which effectively integrates degradation and image priors into the diffusion process, resulting in improved image enhancement.

Image Generation Low-Light Image Enhancement

Semi-supervised Cycle-GAN for face photo-sketch translation in the wild

no code implementations18 Jul 2023 Chaofeng Chen, Wei Liu, Xiao Tan, Kwan-Yee K. Wong

Experiments show that SCG achieves competitive performance on public benchmarks and superior results on photos in the wild.

Translation

A Novel Multi-Task Model Imitating Dermatologists for Accurate Differential Diagnosis of Skin Diseases in Clinical Images

no code implementations17 Jul 2023 Yan-Jie Zhou, Wei Liu, Yuan Gao, Jing Xu, Le Lu, Yuping Duan, Hao Cheng, Na Jin, Xiaoyong Man, Shuang Zhao, Yu Wang

Skin diseases are among the most prevalent health issues, and accurate computer-aided diagnosis methods are of importance for both dermatologists and patients.

Multi-Task Learning

Communicative Agents for Software Development

1 code implementation16 Jul 2023 Chen Qian, Xin Cong, Wei Liu, Cheng Yang, Weize Chen, Yusheng Su, Yufan Dang, Jiahao Li, Juyuan Xu, Dahai Li, Zhiyuan Liu, Maosong Sun

At the core of this paradigm lies ChatDev, a virtual chat-powered software development company that mirrors the established waterfall model, meticulously dividing the development process into four distinct chronological stages: designing, coding, testing, and documenting.

Decision Making

First-order Methods for Affinely Constrained Composite Non-convex Non-smooth Problems: Lower Complexity Bound and Near-optimal Methods

no code implementations14 Jul 2023 Wei Liu, Qihang Lin, Yangyang Xu

In this paper, we make the first attempt to establish lower complexity bounds of FOMs for solving a class of composite non-convex non-smooth optimization with linear constraints.

DreamIdentity: Improved Editability for Efficient Face-identity Preserved Image Generation

no code implementations1 Jul 2023 Zhuowei Chen, Shancheng Fang, Wei Liu, Qian He, Mengqi Huang, Yongdong Zhang, Zhendong Mao

While large-scale pre-trained text-to-image models can synthesize diverse and high-quality human-centric images, an intractable problem is how to preserve the face identity for conditioned face images.

Image Generation

CMATH: Can Your Language Model Pass Chinese Elementary School Math Test?

no code implementations29 Jun 2023 Tianwen Wei, Jian Luan, Wei Liu, Shuang Dong, Bin Wang

We present the Chinese Elementary School Math Word Problems (CMATH) dataset, comprising 1. 7k elementary school-level math word problems with detailed annotations, source from actual Chinese workbooks and exams.

Language Modelling Math +1

Segment Anything Model (SAM) for Radiation Oncology

no code implementations20 Jun 2023 Lian Zhang, Zhengliang Liu, Lu Zhang, Zihao Wu, Xiaowei Yu, Jason Holmes, Hongying Feng, Haixing Dai, Xiang Li, Quanzheng Li, Dajiang Zhu, Tianming Liu, Wei Liu

Given that SAM, a model pre-trained purely on natural images, can handle the delineation of OARs from medical images with clinically acceptable accuracy, these results highlight SAM's robust generalization capabilities with consistent accuracy in automatic segmentation for radiotherapy.

Segmentation

UniMC: A Unified Framework for Long-Term Memory Conversation via Relevance Representation Learning

no code implementations18 Jun 2023 Kang Zhao, Wei Liu, Jian Luan, Minglei Gao, Li Qian, Hanlin Teng, Bin Wang

In this paper, we propose a Unified framework for Long-term Memory Conversations (UniMC), which increases the connection between different stages by learning relevance representation.

Representation Learning Retrieval

Global and Local Semantic Completion Learning for Vision-Language Pre-training

1 code implementation12 Jun 2023 Rong-Cheng Tu, Yatai Ji, Jie Jiang, Weijie Kong, Chengfei Cai, Wenzhe Zhao, Hongfa Wang, Yujiu Yang, Wei Liu

MGSC promotes learning more representative global features, which have a great impact on the performance of downstream tasks, while MLTC reconstructs modal-fusion local tokens, further enhancing accurate comprehension of multimodal data.

Language Modelling Masked Language Modeling +5

Modeling Structural Similarities between Documents for Coherence Assessment with Graph Convolutional Networks

1 code implementation10 Jun 2023 Wei Liu, Xiyan Fu, Michael Strube

Coherence is an important aspect of text quality, and various approaches have been applied to coherence modeling.

Automated Essay Scoring

Artificial General Intelligence for Medical Imaging

no code implementations8 Jun 2023 Xiang Li, Lu Zhang, Zihao Wu, Zhengliang Liu, Lin Zhao, Yixuan Yuan, Jun Liu, Gang Li, Dajiang Zhu, Pingkun Yan, Quanzheng Li, Wei Liu, Tianming Liu, Dinggang Shen

In this review, we explore the potential applications of Artificial General Intelligence (AGI) models in healthcare, focusing on foundational Large Language Models (LLMs), Large Vision Models, and Large Multimodal Models.

Exploring the Compositional Generalization in Context Dependent Text-to-SQL Parsing

no code implementations29 May 2023 Aiwei Liu, Wei Liu, Xuming Hu, Shuang Li, Fukun Ma, Yawen Yang, Lijie Wen

Based on these observations, we propose a method named \texttt{p-align} to improve the compositional generalization of Text-to-SQL models.

SQL Parsing Text-To-SQL

Decoupled Rationalization with Asymmetric Learning Rates: A Flexible Lipschitz Restraint

1 code implementation23 May 2023 Wei Liu, Jun Wang, Haozhao Wang, Ruixuan Li, Yang Qiu, Yuankai Zhang, Jie Han, Yixiong Zou

However, such a cooperative game may incur the degeneration problem where the predictor overfits to the uninformative pieces generated by a not yet well-trained generator and in turn, leads the generator to converge to a sub-optimal model that tends to select senseless pieces.

Diffusion-Based Mel-Spectrogram Enhancement for Personalized Speech Synthesis with Found Data

1 code implementation18 May 2023 Yusheng Tian, Wei Liu, Tan Lee

One way to address this problem is to pre-enhance the speech with an enhancement model and then use the enhanced data for text-to-speech (TTS) model training.

Speech Enhancement Speech Synthesis

MGR: Multi-generator Based Rationalization

1 code implementation8 May 2023 Wei Liu, Haozhao Wang, Jun Wang, Ruixuan Li, Xinyang Li, Yuankai Zhang, Yang Qiu

Rationalization is to employ a generator and a predictor to construct a self-explaining NLP model in which the generator selects a subset of human-intelligible pieces of the input text to the following predictor.

Instruction-ViT: Multi-Modal Prompts for Instruction Learning in ViT

no code implementations29 Apr 2023 Zhenxiang Xiao, Yuzhong Chen, Lu Zhang, Junjie Yao, Zihao Wu, Xiaowei Yu, Yi Pan, Lin Zhao, Chong Ma, Xinyu Liu, Wei Liu, Xiang Li, Yixuan Yuan, Dinggang Shen, Dajiang Zhu, Tianming Liu, Xi Jiang

Prompts have been proven to play a crucial role in large language models, and in recent years, vision models have also been using prompts to improve scalability for multiple downstream tasks.

Image Classification

Img2Vec: A Teacher of High Token-Diversity Helps Masked AutoEncoders

no code implementations25 Apr 2023 Heng Pan, Chenyang Liu, Wenxiao Wang, Li Yuan, Hongfa Wang, Zhifeng Li, Wei Liu

To study which type of deep features is appropriate for MIM as a learning target, we propose a simple MIM framework with serials of well-trained self-supervised models to convert an Image to a feature Vector as the learning target of MIM, where the feature extractor is also known as a teacher model.

Attribute Vocal Bursts Intensity Prediction

Towards Effective and Interpretable Human-Agent Collaboration in MOBA Games: A Communication Perspective

no code implementations23 Apr 2023 Yiming Gao, Feiyu Liu, Liang Wang, Zhenjie Lian, Weixuan Wang, Siqin Li, Xianliang Wang, Xianhan Zeng, Rundong Wang, Jiawei Wang, Qiang Fu, Wei Yang, Lanxiao Huang, Wei Liu

MOBA games, e. g., Dota2 and Honor of Kings, have been actively used as the testbed for the recent AI research on games, and various AI systems have been developed at the human level so far.

Deep-Learning-based Fast and Accurate 3D CT Deformable Image Registration in Lung Cancer

no code implementations21 Apr 2023 Yuzhen Ding, Hongying Feng, Yunze Yang, Jason Holmes, Zhengliang Liu, David Liu, William W. Wong, Nathan Y. Yu, Terence T. Sio, Steven E. Schild, Baoxin Li, Wei Liu

Conclusion: A patient-specific vision-transformer-based network was developed and shown to be accurate and efficient to reconstruct 3D CT images from kV images.

Anatomy Image Registration

Exploring the Trade-Offs: Unified Large Language Models vs Local Fine-Tuned Models for Highly-Specific Radiology NLI Task

no code implementations18 Apr 2023 Zihao Wu, Lu Zhang, Chao Cao, Xiaowei Yu, Haixing Dai, Chong Ma, Zhengliang Liu, Lin Zhao, Gang Li, Wei Liu, Quanzheng Li, Dinggang Shen, Xiang Li, Dajiang Zhu, Tianming Liu

To this end, in this study, we evaluate the performance of ChatGPT/GPT-4 on a radiology NLI task and compare it to other models fine-tuned specifically on task-related data samples.

Specificity Task 2

SoftCLIP: Softer Cross-modal Alignment Makes CLIP Stronger

no code implementations30 Mar 2023 Yuting Gao, Jinfeng Liu, Zihan Xu, Tong Wu Enwei Zhang, Wei Liu, Jie Yang, Ke Li, Xing Sun

During the preceding biennium, vision-language pre-training has achieved noteworthy success on several downstream tasks.

Zero-Shot Learning

Plug-and-Play Regulators for Image-Text Matching

1 code implementation23 Mar 2023 Haiwen Diao, Ying Zhang, Wei Liu, Xiang Ruan, Huchuan Lu

Exploiting fine-grained correspondence and visual-semantic alignments has shown great potential in image-text matching.

Image Retrieval Image-text matching +1

DeID-GPT: Zero-shot Medical Text De-Identification by GPT-4

1 code implementation20 Mar 2023 Zhengliang Liu, Yue Huang, Xiaowei Yu, Lu Zhang, Zihao Wu, Chao Cao, Haixing Dai, Lin Zhao, Yiwei Li, Peng Shu, Fang Zeng, Lichao Sun, Wei Liu, Dinggang Shen, Quanzheng Li, Tianming Liu, Dajiang Zhu, Xiang Li

The digitization of healthcare has facilitated the sharing and re-using of medical data but has also raised concerns about confidentiality and privacy.

Benchmarking De-identification +4

STGIC: a graph and image convolution-based method for spatial transcriptomic clustering

no code implementations19 Mar 2023 Chen Zhang, Junhui Gao, Lingxin Kong, Guangshuo cao, Xiangyu Guo, Wei Liu

Spatial transcriptomic (ST) clustering employs spatial and transcription information to group spots spatially coherent and transcriptionally similar together into the same spatial domain.

Clustering Contrastive Learning +1

CrossFormer++: A Versatile Vision Transformer Hinging on Cross-scale Attention

1 code implementation13 Mar 2023 Wenxiao Wang, Wei Chen, Qibo Qiu, Long Chen, Boxi Wu, Binbin Lin, Xiaofei He, Wei Liu

On the one hand, CEL blends each token with multiple patches of different scales, providing the self-attention module itself with cross-scale features.

Image Classification Instance Segmentation +3

Frauds Bargain Attack: Generating Adversarial Text Samples via Word Manipulation Process

1 code implementation1 Mar 2023 Mingze Ni, Zhensu Sun, Wei Liu

In response, this study proposes a new method called the Fraud's Bargain Attack (FBA), which uses a randomization mechanism to expand the search space and produce high-quality adversarial examples with a higher probability of success.

Adversarial Text Sentence

Q-Cogni: An Integrated Causal Reinforcement Learning Framework

no code implementations26 Feb 2023 Cris Cunha, Wei Liu, Tim French, Ajmal Mian

We present Q-Cogni, an algorithmically integrated causal reinforcement learning framework that redesigns Q-Learning with an autonomous causal structure discovery method to improve the learning process with causal inference.

Causal Inference Decision Making +3

Leveraging phone-level linguistic-acoustic similarity for utterance-level pronunciation scoring

no code implementations21 Feb 2023 Wei Liu, Kaiqi Fu, Xiaohai Tian, Shuju Shi, Wei Li, Zejun Ma, Tan Lee

Recent studies on pronunciation scoring have explored the effect of introducing phone embeddings as reference pronunciation, but mostly in an implicit manner, i. e., addition or concatenation of reference phone embedding and actual pronunciation of the target phone as the phone-level pronunciation quality representation.

An ASR-free Fluency Scoring Approach with Self-Supervised Learning

no code implementations20 Feb 2023 Wei Liu, Kaiqi Fu, Xiaohai Tian, Shuju Shi, Wei Li, Zejun Ma, Tan Lee

A typical fluency scoring system generally relies on an automatic speech recognition (ASR) system to obtain time stamps in input speech for either the subsequent calculation of fluency-related features or directly modeling speech fluency with an end-to-end approach.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization

1 code implementation5 Feb 2023 Zichuan Lin, Xiapeng Wu, Mingfei Sun, Deheng Ye, Qiang Fu, Wei Yang, Wei Liu

Recent success in Deep Reinforcement Learning (DRL) methods has shown that policy optimization with respect to an off-policy distribution via importance sampling is effective for sample reuse.

CFFT-GAN: Cross-domain Feature Fusion Transformer for Exemplar-based Image Translation

no code implementations3 Feb 2023 Tianxiang Ma, Bingchuan Li, Wei Liu, Miao Hua, Jing Dong, Tieniu Tan

In this paper, we propose a more general learning approach by considering two domain features as a whole and learning both inter-domain correspondence and intra-domain potential information interactions.

Translation

ReGANIE: Rectifying GAN Inversion Errors for Accurate Real Image Editing

no code implementations31 Jan 2023 Bingchuan Li, Tianxiang Ma, Peng Zhang, Miao Hua, Wei Liu, Qian He, Zili Yi

Specifically, in Phase I, a W-space-oriented StyleGAN inversion network is trained and used to perform image inversion and editing, which assures the editability but sacrifices reconstruction quality.

Image Generation

Planning and Tracking Control of Full Drive-by-Wire Electric Vehicles in Unstructured Scenario

no code implementations7 Jan 2023 Guoying Chen, Min Hua, Wei Liu, Jinhai Wang, Shunhui Song, Changsheng Liu

Full drive-by-wire electric vehicles (FDWEV) with X-by-wire technology can achieve independent driving, braking, and steering of each wheel, providing a good application platform for autonomous driving technology.

Autonomous Driving Model Predictive Control

Heterogeneous Diversity Driven Active Learning for Multi-Object Tracking

no code implementations ICCV 2023 Rui Li, Baopeng Zhang, Jun Liu, Wei Liu, Jian Zhao, Zhu Teng

HD-AMOT defines the diversified informative representation by encoding the geometric and semantic information, and formulates the frame inference strategy as a Markov decision process to learn an optimal sampling policy based on the designed informative representation.

Active Learning Multi-Object Tracking

Tencent AVS: A Holistic Ads Video Dataset for Multi-modal Scene Segmentation

no code implementations9 Dec 2022 Jie Jiang, Zhimin Li, Jiangfeng Xiong, Rongwei Quan, Qinglin Lu, Wei Liu

Therefore, TAVS is distinguished from previous temporal segmentation datasets due to its multi-modal information, holistic view of categories, and hierarchical granularities.

Multi-Label Classification Scene Segmentation +3

PointCA: Evaluating the Robustness of 3D Point Cloud Completion Models Against Adversarial Examples

no code implementations22 Nov 2022 Shengshan Hu, Junwei Zhang, Wei Liu, Junhui Hou, Minghui Li, Leo Yu Zhang, Hai Jin, Lichao Sun

In addition, existing attack approaches towards point cloud classifiers cannot be applied to the completion models due to different output forms and attack purposes.

Adversarial Attack Point Cloud Classification +2

Curriculum-based Asymmetric Multi-task Reinforcement Learning

1 code implementation7 Nov 2022 Hanchi Huang, Deheng Ye, Li Shen, Wei Liu

To mitigate the negative influence of customizing the one-off training order in curriculum-based AMTL, CAMRL switches its training mode between parallel single-task RL and asymmetric multi-task RL (MTRL), according to an indicator regarding the training time, the overall performance, and the performance gap among tasks.

Multi-Task Learning reinforcement-learning +1

Online LiDAR-Camera Extrinsic Parameters Self-checking

1 code implementation19 Oct 2022 Pengjin Wei, Guohang Yan, Yikang Li, Kun Fang, Jie Yang, Wei Liu

This calibration task is multi-modal, where the rich color and texture information captured by the camera and the accurate three-dimensional spatial information from the LiDAR is incredibly significant for downstream tasks.

Autonomous Driving Binary Classification

Neural Extended Kalman Filters for Learning and Predicting Dynamics of Structural Systems

1 code implementation9 Oct 2022 Wei Liu, Zhilu Lai, Kiran Bacsa, Eleni Chatzi

Typically, conventional variational inference models are parameterized by neural networks independent of the latent dynamics models.

Variational Inference

FR: Folded Rationalization with a Unified Encoder

1 code implementation17 Sep 2022 Wei Liu, Haozhao Wang, Jun Wang, Ruixuan Li, Chao Yue, Yuankai Zhang

Conventional works generally employ a two-phase model in which a generator selects the most important pieces, followed by a predictor that makes predictions based on the selected pieces.

DPTNet: A Dual-Path Transformer Architecture for Scene Text Detection

no code implementations21 Aug 2022 Jingyu Lin, Jie Jiang, Yan Yan, Chunchao Guo, Hongfa Wang, Wei Liu, Hanzi Wang

We further propose a parallel design that integrates the convolutional network with a powerful self-attention mechanism to provide complementary clues between the attention path and convolutional path.

Scene Text Detection Text Detection

CircuitNet: An Open-Source Dataset for Machine Learning Applications in Electronic Design Automation (EDA)

no code implementations1 Aug 2022 Zhuomin Chai, Yuxiang Zhao, Yibo Lin, Wei Liu, Runsheng Wang, Ru Huang

The electronic design automation (EDA) community has been actively exploring machine learning (ML) for very large-scale integrated computer-aided design (VLSI CAD).

BIG-bench Machine Learning

Hardly Perceptible Trojan Attack against Neural Networks with Bit Flips

1 code implementation27 Jul 2022 Jiawang Bai, Kuofeng Gao, Dihong Gong, Shu-Tao Xia, Zhifeng Li, Wei Liu

The security of deep neural networks (DNNs) has attracted increasing attention due to their widespread use in various applications.

Towards Efficient Adversarial Training on Vision Transformers

no code implementations21 Jul 2022 Boxi Wu, Jindong Gu, Zhifeng Li, Deng Cai, Xiaofei He, Wei Liu

Vision Transformer (ViT), as a powerful alternative to Convolutional Neural Network (CNN), has received much attention.

Neural modal ordinary differential equations: Integrating physics-based modeling with neural ordinary differential equations for modeling high-dimensional monitored structures

1 code implementation16 Jul 2022 Zhilu Lai, Wei Liu, Xudong Jian, Kiran Bacsa, Limin Sun, Eleni Chatzi

In the scope of physics-informed machine learning, this paper proposes a framework -- termed Neural Modal ODEs -- to integrate physics-based modeling with deep learning for modeling the dynamics of monitored and high-dimensional engineered systems.

Physics-informed machine learning

Boosting Multi-Modal E-commerce Attribute Value Extraction via Unified Learning Scheme and Dynamic Range Minimization

no code implementations15 Jul 2022 Mengyin Liu, Chao Zhu, Hongyu Gao, Weibo Gu, Hongfa Wang, Wei Liu, Xu-Cheng Yin

2) Secondly, a text-guided information range minimization method is proposed to adaptively encode descriptive parts of each modality into an identical space with a powerful pretrained linguistic model.

Attribute Attribute Value Extraction +2

Egocentric Video-Language Pretraining @ Ego4D Challenge 2022

1 code implementation4 Jul 2022 Kevin Qinghong Lin, Alex Jinpeng Wang, Mattia Soldan, Michael Wray, Rui Yan, Eric Zhongcong Xu, Difei Gao, RongCheng Tu, Wenzhe Zhao, Weijie Kong, Chengfei Cai, Hongfa Wang, Dima Damen, Bernard Ghanem, Wei Liu, Mike Zheng Shou

In this report, we propose a video-language pretraining (VLP) based solution \cite{kevin2022egovlp} for four Ego4D challenge tasks, including Natural Language Query (NLQ), Moment Query (MQ), Object State Change Classification (OSCC), and PNR Localization (PNR).

Language Modelling Object State Change Classification

Hybridization of evolutionary algorithm and deep reinforcement learning for multi-objective orienteering optimization

no code implementations21 Jun 2022 Wei Liu, Rui Wang, Tao Zhang, Kaiwen Li, Wenhua Li, Hisao Ishibuchi

Multi-objective orienteering problems (MO-OPs) are classical multi-objective routing problems and have received a lot of attention in the past decades.

Problem Decomposition reinforcement-learning +1

Towards Generalizable Person Re-identification with a Bi-stream Generative Model

no code implementations19 Jun 2022 Xin Xu, Wei Liu, Zheng Wang, Ruiming Hu, Qi Tian

Guided by original pedestrian images, one stream is employed to learn a camera-invariant global feature for the CC problem via filtering cross-camera interference factors.

Domain Generalization Generalizable Person Re-identification

EDITnet: A Lightweight Network for Unsupervised Domain Adaptation in Speaker Verification

no code implementations15 Jun 2022 Jingyu Li, Wei Liu, Tan Lee

This paper proposes a domain transfer network, named EDITnet, to alleviate the language-mismatch problem on speaker embeddings without requiring speaker labels.

Self-Supervised Learning Speaker Verification +1

Unsupervised Knowledge Adaptation for Passenger Demand Forecasting

no code implementations8 Jun 2022 Can Li, Lei Bai, Wei Liu, Lina Yao, S Travis Waller

These multimodal forecasting models can improve accuracy but be less practical when different parts of multimodal datasets are owned by different institutions who cannot directly share data among them.

Efficient-Adam: Communication-Efficient Distributed Adam

no code implementations28 May 2022 Congliang Chen, Li Shen, Wei Liu, Zhi-Quan Luo

Distributed adaptive stochastic gradient methods have been widely used for large-scale nonconvex optimization, such as training deep learning models.

Quantization

An Investigation on Applying Acoustic Feature Conversion to ASR of Adult and Child Speech

no code implementations25 May 2022 Wei Liu, Jingyu Li, Tan Lee

The performance of child speech recognition is generally less satisfactory compared to adult speech due to limited amount of training data.

Attribute Automatic Speech Recognition +4

Demand Response Method Considering Multiple Types of Flexible Loads in Industrial Parks

no code implementations24 May 2022 Jia Cui, Mingze Gao, Xiaoming Zhou, Yang Li, Wei Liu, Jiazheng Tian, XiMing Zhang

With the rapid development of the energy internet, the proportion of flexible loads in smart grid is getting much higher than before.

Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning

1 code implementation CVPR 2022 Li Yang, Yan Xu, Chunfeng Yuan, Wei Liu, Bing Li, Weiming Hu

They base the visual grounding on the features from pre-generated proposals or anchors, and fuse these features with the text embeddings to locate the target mentioned by the text.

Attribute object-detection +2

Deep Reinforcement Learning for Orienteering Problems Based on Decomposition

no code implementations25 Apr 2022 Wei Liu, Tao Zhang, Rui Wang, Kaiwen Li, Wenhua Li, Kang Yang

A dynamic pointer network (DYPN) is introduced as the TSP solver, which takes city locations as inputs and immediately outputs a permutation of nodes.

reinforcement-learning Reinforcement Learning (RL) +1

ChildPredictor: A Child Face Prediction Framework with Disentangled Learning

1 code implementation21 Apr 2022 Yuzhi Zhao, Lai-Man Po, Xuehui Wang, Qiong Yan, Wei Shen, Yujia Zhang, Wei Liu, Chun-Kit Wong, Chiu-Sing Pang, Weifeng Ou, Wing-Yin Yu, Buhua Liu

On this basis, we formulate predictions as a mapping from parents' genetic factors to children's genetic factors, and disentangle them from external and variety factors.

Age-Invariant Face Recognition Image-to-Image Translation +2

XMP-Font: Self-Supervised Cross-Modality Pre-training for Few-Shot Font Generation

no code implementations CVPR 2022 Wei Liu, Fangyue Liu, Fei Ding, Qian He, Zili Yi

The cross-modality encoder is pre-trained in a self-supervised manner to allow effective capture of cross- and intra-modality correlations, which facilitates the content-style disentanglement and modeling style representations of all scales (stroke-level, component-level and character-level).

Disentanglement Font Generation

Tencent Text-Video Retrieval: Hierarchical Cross-Modal Interactions with Multi-Level Representations

no code implementations7 Apr 2022 Jie Jiang, Shaobo Min, Weijie Kong, Dihong Gong, Hongfa Wang, Zhifeng Li, Wei Liu

With multi-level representations for video and text, hierarchical contrastive learning is designed to explore fine-grained cross-modal relationships, i. e., frame-word, clip-phrase, and video-sentence, which enables HCMI to achieve a comprehensive semantic comparison between video and text modalities.

 Ranked #1 on Video Retrieval on MSR-VTT-1kA (using extra training data)

Contrastive Learning Denoising +4

Improving Vision Transformers by Revisiting High-frequency Components

1 code implementation3 Apr 2022 Jiawang Bai, Li Yuan, Shu-Tao Xia, Shuicheng Yan, Zhifeng Li, Wei Liu

Inspired by this finding, we first investigate the effects of existing techniques for improving ViT models from a new frequency perspective, and find that the success of some techniques (e. g., RandAugment) can be attributed to the better usage of the high-frequency components.

Domain Generalization Image Classification +1

Masked Autoencoders for Point Cloud Self-supervised Learning

3 code implementations13 Mar 2022 Yatian Pang, Wenxiao Wang, Francis E. H. Tay, Wei Liu, Yonghong Tian, Li Yuan

Then, a standard Transformer based autoencoder, with an asymmetric design and a shifting mask tokens operation, learns high-level latent features from unmasked point patches, aiming to reconstruct the masked point patches.

3D Part Segmentation Few-Shot 3D Point Cloud Classification +2

CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP

2 code implementations1 Mar 2022 ZiHao Wang, Wei Liu, Qian He, Xinglong Wu, Zili Yi

Once trained, the transformer can generate coherent image tokens based on the text embedding extracted from the text encoder of CLIP upon an input text.

Text-to-Image Generation

Unpaired Quad-Path Cycle Consistent Adversarial Networks for Single Image Defogging

no code implementations19 Feb 2022 Wei Liu, Cheng Chen, Rui Jiang, Tao Lu, Zixiang Xiong

To address these issues, we develop a novel generative adversarial network, called quad-path cycle consistent adversarial network (QPC-Net), for single image defogging.

Generative Adversarial Network

Deep Single Image Deraining using An Asymetric Cycle Generative and Adversarial Framework

no code implementations19 Feb 2022 Wei Liu, Rui Jiang, Cheng Chen, Tao Lu, Zixiang Xiong

The former consists of parallel rain removal path and rain-fog feature extraction path by the rain and derain-fog network and the attention rain-fog feature extraction network (ARFE) , while the latter only contains a synthetic rain transformation path.

Single Image Deraining

Exploring Structural Sparsity in Neural Image Compression

no code implementations9 Feb 2022 Shanzhi Yin, Chao Li, Wen Tan, Youneng Bao, Yongsheng Liang, Wei Liu

Neural image compression have reached or out-performed traditional methods (such as JPEG, BPG, WebP).

Image Compression

Constrained Variational Policy Optimization for Safe Reinforcement Learning

2 code implementations28 Jan 2022 Zuxin Liu, Zhepeng Cen, Vladislav Isenbaev, Wei Liu, Zhiwei Steven Wu, Bo Li, Ding Zhao

Safe reinforcement learning (RL) aims to learn policies that satisfy certain constraints before deploying them to safety-critical applications.

reinforcement-learning Reinforcement Learning (RL) +1

DynaMixer: A Vision MLP Architecture with Dynamic Mixing

2 code implementations28 Jan 2022 Ziyu Wang, Wenhao Jiang, Yiming Zhu, Li Yuan, Yibing Song, Wei Liu

In contrast with vision transformers and CNNs, the success of MLP-like models shows that simple information fusion operations among tokens and channels can yield a good representation power for deep recognition models.

Image Classification

Spatio-Temporal Graph Representation Learning for Fraudster Group Detection

no code implementations7 Jan 2022 Saeedreza Shehnepoor, Roberto Togneri, Wei Liu, Mohammed Bennamoun

Then we use an RNN on the spatial relations to predict the spatio-temporal relations of reviewers in the group.

Graph Representation Learning

Coherent Point Drift Revisited for Non-Rigid Shape Matching and Registration

no code implementations CVPR 2022 Aoxiang Fan, Jiayi Ma, Xin Tian, Xiaoguang Mei, Wei Liu

In this paper, we explore a new type of extrinsic method to directly align two geometric shapes with point-to-point correspondences in ambient space by recovering a deformation, which allows more continuous and smooth maps to be obtained.

RFNet: Unsupervised Network for Mutually Reinforcing Multi-Modal Image Registration and Fusion

no code implementations CVPR 2022 Han Xu, Jiayi Ma, Jiteng Yuan, Zhuliang Le, Wei Liu

Specifically, for image registration, we solve the bottlenecks of defining registration metrics applicable for multi-modal images and facilitating the network convergence.

Image Registration

DGL-GAN: Discriminator Guided Learning for GAN Compression

no code implementations13 Dec 2021 Yuesong Tian, Li Shen, Xiang Tian, DaCheng Tao, Zhifeng Li, Wei Liu, Yaowu Chen

Moreover, DGL-GAN is also effective in boosting the performance of original uncompressed GANs.

Triangle Attack: A Query-efficient Decision-based Adversarial Attack

1 code implementation13 Dec 2021 Xiaosen Wang, Zeliang Zhang, Kangheng Tong, Dihong Gong, Kun He, Zhifeng Li, Wei Liu

Decision-based attack poses a severe threat to real-world applications since it regards the target model as a black box and only accesses the hard prediction label.

Adversarial Attack Dimensionality Reduction

CO2Sum:Contrastive Learning for Factual-Consistent Abstractive Summarization

no code implementations2 Dec 2021 Wei Liu, Huanqin Wu, Wenjing Mu, Zhen Li, Tao Chen, Dan Nie

We propose CO2Sum (Contrastive for Consistency), a contrastive learning scheme that can be easily applied on sequence-to-sequence models for factual-consistent abstractive summarization, proving that the model can be fact-aware without modifying the architecture.

Abstractive Text Summarization Contrastive Learning

Generalized and Discriminative Few-Shot Object Detection via SVD-Dictionary Enhancement

1 code implementation NeurIPS 2021 Aming Wu, Suqi Zhao, Cheng Deng, Wei Liu

To alleviate the impact of few samples, enhancing the generalization and discrimination abilities of detectors on new objects plays an important role.

Dictionary Learning Few-Shot Object Detection +1

MC-Blur: A Comprehensive Benchmark for Image Deblurring

2 code implementations1 Dec 2021 Kaihao Zhang, Tao Wang, Wenhan Luo, Boheng Chen, Wenqi Ren, Bjorn Stenger, Wei Liu, Hongdong Li, Ming-Hsuan Yang

Blur artifacts can seriously degrade the visual quality of images, and numerous deblurring methods have been proposed for specific scenarios.

Benchmarking Deblurring +1

Neural Routing by Memory

no code implementations NeurIPS 2021 Kaipeng Zhang, Zhenqiang Li, Zhifeng Li, Wei Liu, Yoichi Sato

However, they use the same procedure sequence for all inputs, regardless of the intermediate features. This paper proffers a simple yet effective idea of constructing parallel procedures and assigning similar intermediate features to the same specialized procedures in a divide-and-conquer fashion.

PlantStereo: A Stereo Matching Benchmark for Plant Surface Dense Reconstruction

1 code implementation30 Nov 2021 Qingyu Wang, Baojian Ma, Wei Liu, Mingzhao Lou, Mingchuan Zhou, Huanyu Jiang, Yibin Ying

In this paper, we aim to address the issue between datasets and models and propose a large scale stereo dataset with high accuracy disparity ground truth named PlantStereo.

Camera Calibration Image Registration +1

Social Fraud Detection Review: Methods, Challenges and Analysis

no code implementations10 Nov 2021 Saeedreza Shehnepoor, Roberto Togneri, Wei Liu, Mohammed Bennamoun

Many studies proposed approaches based on user behaviors and review text to address the challenges of fraud detection.

Decision Making Fraud Detection

Meter-Range Wireless Motor Drive for Pipeline Transportation

no code implementations26 Oct 2021 Wei Liu, K. T. Chau, Hui Wang, Tengbo Yang

This paper proposes and implements a meter-range wireless motor drive (WMD) system for promising applications of underground pipeline transportations or in-pipe robots.

Physics-guided Deep Markov Models for Learning Nonlinear Dynamical Systems with Uncertainty

1 code implementation16 Oct 2021 Wei Liu, Zhilu Lai, Kiran Bacsa, Eleni Chatzi

To address this, we bridge physics-based state space models with Deep Markov Models, thus delivering a hybrid modeling framework for unsupervised learning and identification of nonlinear dynamical systems.

Variational Inference

Rethinking the Spatial Route Prior in Vision-and-Language Navigation

no code implementations12 Oct 2021 Xinzhe Zhou, Wei Liu, Yadong Mu

In a most information-rich case of knowing environment maps and admitting shortest-path prior, we observe that given an origin-destination node pair, the internal route can be uniquely determined.

Navigate Vision and Language Navigation

A Simplified System Model for Optical Camera Communication

no code implementations Conference 2021 Anqi Liu, Wenxiao Shi, Wei Liu, Zhuo Wang

Data rate and communication distance are two important criteria for measuring the performance of optical camera communication (OCC) systems.

DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing

1 code implementation22 Sep 2021 Bingchuan Li, Shaofei Cai, Wei Liu, Peng Zhang, Qian He, Miao Hua, Zili Yi

To address these limitations, we design a Dynamic Style Manipulation Network (DyStyle) whose structure and parameters vary by input samples, to perform nonlinear and adaptive manipulation of latent codes for flexible and precise attribute control.

Attribute Contrastive Learning

Utterance-level neural confidence measure for end-to-end children speech recognition

no code implementations16 Sep 2021 Wei Liu, Tan Lee

The investigation is focused on evaluating and comparing the efficacies of predictor features that are derived from different internal and external modules of the E2E system.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Semantic-Preserving Adversarial Text Attacks

2 code implementations23 Aug 2021 Xinghao Yang, Weifeng Liu, James Bailey, DaCheng Tao, Wei Liu

In this paper, we propose a Bigram and Unigram based adaptive Semantic Preservation Optimization (BU-SPO) method to examine the vulnerability of deep models.

Adversarial Text Semantic Similarity +4

End2End Occluded Face Recognition by Masking Corrupted Features

1 code implementation21 Aug 2021 Haibo Qiu, Dihong Gong, Zhifeng Li, Wei Liu, DaCheng Tao

However, the state-of-the-art general face recognition models do not generalize well to occluded face images, which are exactly the common cases in real-world scenarios.

Face Recognition

SynFace: Face Recognition with Synthetic Data

1 code implementation ICCV 2021 Haibo Qiu, Baosheng Yu, Dihong Gong, Zhifeng Li, Wei Liu, DaCheng Tao

We then analyze the underlying causes behind the performance gap, e. g., the poor intra-class variations and the domain gap between synthetic and real face images.

Face Generation Face Recognition

Structure-Aware Feature Generation for Zero-Shot Learning

no code implementations16 Aug 2021 Lianbo Zhang, Shaoli Huang, Xinchao Wang, Wei Liu, DaCheng Tao

In this paper, we introduce a novel structure-aware feature generation scheme, termed as SA-GAN, to explicitly account for the topological structure in learning both the latent space and the generative networks.

Attribute Generative Adversarial Network +1

CrossFormer: A Versatile Vision Transformer Hinging on Cross-scale Attention

3 code implementations ICLR 2022 Wenxiao Wang, Lu Yao, Long Chen, Binbin Lin, Deng Cai, Xiaofei He, Wei Liu

On the one hand, CEL blends each embedding with multiple patches of different scales, providing the self-attention module itself with cross-scale features.

Image Classification Instance Segmentation +4

Decentralized Federated Learning: Balancing Communication and Computing Costs

1 code implementation26 Jul 2021 Wei Liu, Li Chen, Wenyi Zhang

The performance of decentralized SGD is jointly influenced by inter-node communications and local updates.

Federated Learning

A Generalized Framework for Edge-preserving and Structure-preserving Image Smoothing

1 code implementation15 Jul 2021 Wei Liu, Pingping Zhang, Yinjie Lei, Xiaolin Huang, Jie Yang, Michael Ng

The effectiveness and superior performance of our approach are validated through comprehensive experiments in a range of applications.

image smoothing

Controlled Caption Generation for Images Through Adversarial Attacks

no code implementations7 Jul 2021 Nayyer Aafaq, Naveed Akhtar, Wei Liu, Mubarak Shah, Ajmal Mian

In contrast, we propose a GAN-based algorithm for crafting adversarial examples for neural image captioning that mimics the internal representation of the CNN such that the resulting deep features of the input image enable a controlled incorrect caption generation through the recurrent network.

Caption Generation Image Captioning +1

Robust Pose Transfer with Dynamic Details using Neural Video Rendering

no code implementations27 Jun 2021 Yang-tian Sun, Hao-Zhi Huang, Xuan Wang, Yu-Kun Lai, Wei Liu, Lin Gao

Moreover, we introduce a concise temporal loss in the training stage to suppress the detail flickering that is made more visible due to high-quality dynamic details generated by our method.

2k 4k +3

Stock Market Analysis with Text Data: A Review

no code implementations23 Jun 2021 Kamaladdin Fataliyev, Aneesh Chivukula, Mukesh Prasad, Wei Liu

Then, we cover the analysis techniques and create a taxonomy of the main stock market forecast models.

Cannot find the paper you are looking for? You can Submit a new open access paper.