Search Results for author: Rui Wang

Found 468 papers, 155 papers with code

The AISP-SJTU Simultaneous Translation System for IWSLT 2022

no code implementations • IWSLT (ACL) 2022 • Qinpei Zhu, Renshou Wu, Guangfeng Liu, Xinyu Zhu, Xingyu Chen, Yang Zhou, Qingliang Miao, Rui Wang, Kai Yu

This paper describes AISP-SJTU’s submissions for the IWSLT 2022 Simultaneous Translation task.

Translation

Paper
Add Code

Stacked AMR Parsing with Silver Data

1 code implementation • Findings (EMNLP) 2021 • Qingrong Xia, Zhenghua Li, Rui Wang, Min Zhang

In particular, one recent seq-to-seq work directly fine-tunes AMR graph sequences on the encoder-decoder pre-trained language model and achieves new state-of-the-art results, outperforming previous works by a large margin.

AMR Parsing Decoder +1

Paper
Code

Syntax in End-to-End Natural Language Processing

no code implementations • EMNLP (ACL) 2021 • Hai Zhao, Rui Wang, Kehai Chen

This tutorial surveys the latest technical progress of syntactic parsing and the role of syntax in end-to-end natural language processing (NLP) tasks, in which semantic role labeling (SRL) and machine translation (MT) are the representative NLP tasks that have always been beneficial from informative syntactic clues since a long time ago, though the advance from end-to-end deep learning models shows new results.

Machine Translation NMT +2

Paper
Add Code

Chinese Opinion Role Labeling with Corpus Translation: A Pivot Study

1 code implementation • EMNLP 2021 • Ranran Zhen, Rui Wang, Guohong Fu, Chengguo Lv, Meishan Zhang

Opinion Role Labeling (ORL), aiming to identify the key roles of opinion, has received increasing interest.

Cross-Lingual Transfer Translation

Paper
Code

Chinese Grammatical Error Diagnosis with Graph Convolution Network and Multi-task Learning

no code implementations • AACL (NLP-TEA) 2020 • Yikang Luo, Zuyi Bao, Chen Li, Rui Wang

For the correction subtask, we utilize the masked language model, the seq2seq model and the spelling check model to generate corrections based on the detection results.

Language Modelling Multi-Task Learning +1

Paper
Add Code

Estimating Q(s,s') with Deterministic Dynamics Gradients

no code implementations • ICML 2020 • Ashley Edwards, Himanshu Sahni, Rosanne Liu, Jane Hung, Ankit Jain, Rui Wang, Adrien Ecoffet, Thomas Miconi, Charles Isbell, Jason Yosinski

In this paper, we introduce a novel form of a value function, $Q(s, s')$, that expresses the utility of transitioning from a state $s$ to a neighboring state $s'$ and then acting optimally thereafter.

Transfer Learning

Paper
Add Code

Unsupervised Paraphrasing Consistency Training for Low Resource Named Entity Recognition

no code implementations • EMNLP 2021 • Rui Wang, Ricardo Henao

Unsupervised consistency training is a way of semi-supervised learning that encourages consistency in model predictions between the original and augmented data.

Data Augmentation Low Resource Named Entity Recognition +4

Paper
Add Code

Few-Shot Class-Incremental Learning for Named Entity Recognition

no code implementations • ACL 2022 • Rui Wang, Tong Yu, Handong Zhao, Sungchul Kim, Subrata Mitra, Ruiyi Zhang, Ricardo Henao

In this work, we study a more challenging but practical problem, i. e., few-shot class-incremental learning for NER, where an NER model is trained with only few labeled samples of the new classes, without forgetting knowledge of the old ones.

Few-Shot Class-Incremental Learning Incremental Learning +3

Paper
Add Code

SJTU-NICT’s Supervised and Unsupervised Neural Machine Translation Systems for the WMT20 News Translation Task

no code implementations • WMT (EMNLP) 2020 • Zuchao Li, Hai Zhao, Rui Wang, Kehai Chen, Masao Utiyama, Eiichiro Sumita

In this paper, we introduced our joint team SJTU-NICT ‘s participation in the WMT 2020 machine translation shared task.

Collaborative Filtering Language Modelling +3

Paper
Add Code

English-Myanmar NMT and SMT with Pre-ordering: NICT’s Machine Translation Systems at WAT-2018

no code implementations • PACLIC 2018 • Rui Wang, Chenchen Ding, Masao Utiyama, Eiichiro Sumita

Machine Translation NMT +1

Paper
Add Code

Synchronous Refinement for Neural Machine Translation

no code implementations • Findings (ACL) 2022 • Kehai Chen, Masao Utiyama, Eiichiro Sumita, Rui Wang, Min Zhang

Machine translation typically adopts an encoder-to-decoder framework, in which the decoder generates the target sentence word-by-word in an auto-regressive manner.

Decoder Machine Translation +2

Paper
Add Code

ReinWiFi: A Reinforcement-Learning-Based Framework for the Application-Layer QoS Optimization of WiFi Networks

1 code implementation • 6 May 2024 • Qianren Li, Bojie Lv, Yuncong Hong, Rui Wang

In this paper, a reinforcement-learning-based scheduling framework is proposed and implemented to optimize the application-layer quality-of-service (QoS) of a practical wireless local area network (WLAN) suffering from unknown interference.

reinforcement-learning Scheduling

Paper
Code

A Comprehensive Survey of Dynamic Graph Neural Networks: Models, Frameworks, Benchmarks, Experiments and Challenges

no code implementations • 1 May 2024 • ZhengZhao Feng, Rui Wang, Tianxing Wang, Mingli Song, Sai Wu, Shuibing He

From the analysis and evaluation results, we identify key challenges and offer principles for future research to enhance the design of models and frameworks in the dynamic GNNs field.

Paper
Add Code

PAD: Patch-Agnostic Defense against Adversarial Patch Attacks

1 code implementation • 25 Apr 2024 • Lihua Jing, Rui Wang, Wenqi Ren, Xin Dong, Cong Zou

Adversarial patch attacks present a significant threat to real-world object detectors due to their practical feasibility.

Paper
Code

Beyond Chain-of-Thought: A Survey of Chain-of-X Paradigms for LLMs

no code implementations • 24 Apr 2024 • Yu Xia, Rui Wang, Xu Liu, Mingyan Li, Tong Yu, Xiang Chen, Julian McAuley, Shuai Li

Chain-of-Thought (CoT) has been a widely adopted prompting method, eliciting impressive reasoning abilities of Large Language Models (LLMs).

Paper
Add Code

Nyonic Technical Report

1 code implementation • 24 Apr 2024 • Junfeng Tian, Rui Wang, Cong Li, Yudong Zhou, Jun Liu, Jun Wang

This report details the development and key achievements of our latest language model designed for custom large language models.

Language Modelling

Paper
Code

Research on OPF control of three-phase four-wire low-voltage distribution network considering uncertainty

no code implementations • 24 Apr 2024 • Rui Wang, Xiaoqing Bai, Shengquan Huang, Shoupu Wei

As power systems become more complex and uncertain, low-voltage distribution networks face numerous challenges, including three-phase imbalances caused by asymmetrical loads and distributed energy resources.

Stochastic Optimization

Paper
Add Code

SNR Maximization and Localization for UAV-IRS-Assisted Near-Field Systems

no code implementations • 24 Apr 2024 • Hanfu Zhang, Yidan Mei, Erwu Liu, Rui Wang

This letter introduces a novel unmanned aerial vehicle (UAV)-intelligent reflecting surface (IRS) structure into near-field localization systems to enhance the design flexibility of IRS, thereby obtaining additional performance gains.

Paper
Add Code

Rechargeable UAV Trajectory Optimization for Real-Time Persistent Data Collection of Large-Scale Sensor Networks

no code implementations • 24 Apr 2024 • Rui Wang, Deshi Li, Kaitao Meng

By exploiting the convex optimization techniques and proving the total time is non-decreasing with the cluster number, a periodic trajectory optimization algorithm based on successive convex approximation (SCA) and bisection search is proposed to solve the main problem.

Paper
Add Code

A SER-based Device Selection Mechanism in Multi-bits Quantization Federated Learning

no code implementations • 20 Apr 2024 • Pengcheng Sun, Erwu Liu, Rui Wang

The quality of wireless communication will directly affect the performance of federated learning (FL), so this paper analyze the influence of wireless communication on FL through symbol error rate (SER).

Federated Learning Quantization

Paper
Add Code

AED-PADA:Improving Generalizability of Adversarial Example Detection via Principal Adversarial Domain Adaptation

no code implementations • 19 Apr 2024 • Heqi Peng, Yunhong Wang, Ruijie Yang, Beichen Li, Rui Wang, Yuanfang Guo

Specifically, our approach identifies the Principal Adversarial Domains (PADs), i. e., a combination of features of the adversarial examples from different attacks, which possesses large coverage of the entire adversarial feature space.

Adversarial Attack Adversarial Defense +1

Paper
Add Code

The Victim and The Beneficiary: Exploiting a Poisoned Model to Train a Clean Model on Poisoned Data

1 code implementation • ICCV 2023 • Zixuan Zhu, Rui Wang, Cong Zou, Lihua Jing

This inspires us to propose a novel dual-network training framework: The Victim and The Beneficiary (V&B), which exploits a poisoned model to train a clean model without extra benign samples.

Data Augmentation

Paper
Code

Cross-Modality Gait Recognition: Bridging LiDAR and Camera Modalities for Human Identification

no code implementations • 4 Apr 2024 • Rui Wang, Chuanfu Shen, Manuel J. Marin-Jimenez, George Q. Huang, Shiqi Yu

Current gait recognition research mainly focuses on identifying pedestrians captured by the same type of sensor, neglecting the fact that individuals may be captured by different sensors in order to adapt to various environments.

Gait Recognition

Paper
Add Code

Learn When (not) to Trust Language Models: A Privacy-Centric Adaptive Model-Aware Approach

no code implementations • 4 Apr 2024 • Chengkai Huang, Rui Wang, Kaige Xie, Tong Yu, Lina Yao

Despite their great success, the knowledge provided by the retrieval process is not always useful for improving the model prediction, since in some samples LLMs may already be quite knowledgeable and thus be able to answer the question correctly without retrieval.

Continual Learning Retrieval

Paper
Add Code

Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs

no code implementations • 29 Mar 2024 • Luchang Li, Sheng Qian, Jie Lu, Lunxi Yuan, Rui Wang, Qin Xie

The Large Language Model (LLM) is widely employed for tasks such as intelligent assistants, text summarization, translation, and multi-modality on mobile phones.

Language Modelling Large Language Model +2

Paper
Add Code

Enhanced Short Text Modeling: Leveraging Large Language Models for Topic Refinement

1 code implementation • 26 Mar 2024 • Shuyu Chang, Rui Wang, Peng Ren, Haiping Huang

Crafting effective topic models for brief texts, like tweets and news headlines, is essential for capturing the swift shifts in social dynamics.

Prompt Engineering Topic Models

Paper
Code

InternLM2 Technical Report

1 code implementation • 26 Mar 2024 • Zheng Cai, Maosong Cao, Haojiong Chen, Kai Chen, Keyu Chen, Xin Chen, Xun Chen, Zehui Chen, Zhi Chen, Pei Chu, Xiaoyi Dong, Haodong Duan, Qi Fan, Zhaoye Fei, Yang Gao, Jiaye Ge, Chenya Gu, Yuzhe Gu, Tao Gui, Aijia Guo, Qipeng Guo, Conghui He, Yingfan Hu, Ting Huang, Tao Jiang, Penglong Jiao, Zhenjiang Jin, Zhikai Lei, Jiaxing Li, Jingwen Li, Linyang Li, Shuaibin Li, Wei Li, Yining Li, Hongwei Liu, Jiangning Liu, Jiawei Hong, Kaiwen Liu, Kuikun Liu, Xiaoran Liu, Chengqi Lv, Haijun Lv, Kai Lv, Li Ma, Runyuan Ma, Zerun Ma, Wenchang Ning, Linke Ouyang, Jiantao Qiu, Yuan Qu, FuKai Shang, Yunfan Shao, Demin Song, Zifan Song, Zhihao Sui, Peng Sun, Yu Sun, Huanze Tang, Bin Wang, Guoteng Wang, Jiaqi Wang, Jiayu Wang, Rui Wang, Yudong Wang, Ziyi Wang, Xingjian Wei, Qizhen Weng, Fan Wu, Yingtong Xiong, Chao Xu, Ruiliang Xu, Hang Yan, Yirong Yan, Xiaogui Yang, Haochen Ye, Huaiyuan Ying, JIA YU, Jing Yu, Yuhang Zang, Chuyu Zhang, Li Zhang, Pan Zhang, Peng Zhang, Ruijie Zhang, Shuo Zhang, Songyang Zhang, Wenjian Zhang, Wenwei Zhang, Xingcheng Zhang, Xinyue Zhang, Hui Zhao, Qian Zhao, Xiaomeng Zhao, Fengzhe Zhou, Zaida Zhou, Jingming Zhuo, Yicheng Zou, Xipeng Qiu, Yu Qiao, Dahua Lin

The evolution of Large Language Models (LLMs) like ChatGPT and GPT-4 has sparked discussions on the advent of Artificial General Intelligence (AGI).

Ranked #5 on Long-Context Understanding on Ada-LEval (BestAnswer)

4k Long-Context Understanding

5,268

Paper
Code

Towards Human-Like Machine Comprehension: Few-Shot Relational Learning in Visually-Rich Documents

no code implementations • 23 Mar 2024 • Hao Wang, Tang Li, Chenhui Chu, Nengjun Zhu, Rui Wang, Pinpin Zhu

This approach aims to generate relation representations that are more aware of the spatial context and unseen relation in a manner similar to human perception.

Document AI Reading Comprehension +2

Paper
Add Code

AVT2-DWF: Improving Deepfake Detection with Audio-Visual Fusion and Dynamic Weighting Strategies

1 code implementation • 22 Mar 2024 • Rui Wang, Dengpan Ye, Long Tang, Yunming Zhang, Jiacheng Deng

With the continuous improvements of deepfake methods, forgery messages have transitioned from single-modality to multi-modal fusion, posing new challenges for existing forgery detection algorithms.

DeepFake Detection Face Swapping

Paper
Code

VXP: Voxel-Cross-Pixel Large-scale Image-LiDAR Place Recognition

no code implementations • 21 Mar 2024 • Yun-Jin Li, Mariia Gladkova, Yan Xia, Rui Wang, Daniel Cremers

Recent works on the global place recognition treat the task as a retrieval problem, where an off-the-shelf global descriptor is commonly designed in image-based and LiDAR-based modalities.

Cross-Modal Retrieval Retrieval

Paper
Add Code

AffineQuant: Affine Transformation Quantization for Large Language Models

1 code implementation • 19 Mar 2024 • Yuexiao Ma, Huixia Li, Xiawu Zheng, Feng Ling, Xuefeng Xiao, Rui Wang, Shilei Wen, Fei Chao, Rongrong Ji

Among these techniques, Post-Training Quantization (PTQ) has emerged as a subject of considerable interest due to its noteworthy compression efficiency and cost-effectiveness in the context of training.

Quantization

Paper
Code

StableGarment: Garment-Centric Generation via Stable Diffusion

no code implementations • 16 Mar 2024 • Rui Wang, Hailong Guo, Jiaming Liu, Huaxia Li, Haibo Zhao, Xu Tang, Yao Hu, Hao Tang, Peipei Li

In this paper, we introduce StableGarment, a unified framework to tackle garment-centric(GC) generation tasks, including GC text-to-image, controllable GC text-to-image, stylized GC text-to-image, and robust virtual try-on.

Denoising Image Generation +1

Paper
Add Code

ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment

1 code implementation • 8 Mar 2024 • XiWei Hu, Rui Wang, Yixiao Fang, Bin Fu, Pei Cheng, Gang Yu

Diffusion models have demonstrated remarkable performance in the domain of text-to-image generation.

Denoising Language Modelling +2

256

Paper
Code

RIS-empowered Topology Control for Distributed Learning in Urban Air Mobility

no code implementations • 8 Mar 2024 • Kai Xiong, Rui Wang, Supeng Leng, Wenyang Che, Chongwen Huang, Chau Yuen

Urban Air Mobility (UAM) expands vehicles from the ground to the near-ground space, envisioned as a revolution for transportation systems.

Federated Learning MULTI-VIEW LEARNING

Paper
Add Code

Drug resistance revealed by in silico deep mutational scanning and mutation tracker

no code implementations • 5 Mar 2024 • Dong Chen, Gengzhuo Liu, Hongyan Du, JunJie Wee, Rui Wang, Jiahui Chen, Jana Shen, Guo-Wei Wei

As COVID-19 enters its fifth year, it continues to pose a significant global health threat, with the constantly mutating SARS-CoV-2 virus challenging drug effectiveness.

Drug Discovery

Paper
Add Code

Hypothesis Spaces for Deep Learning

no code implementations • 5 Mar 2024 • Rui Wang, Yuesheng Xu, Mingsong Yan

The representer theorems unfold that solutions of these learning models can be expressed as linear combination of a finite number of kernel sessions determined by given data and the reproducing kernel.

Paper
Add Code

F$^3$Loc: Fusion and Filtering for Floorplan Localization

no code implementations • 5 Mar 2024 • Changan Chen, Rui Wang, Christoph Vogel, Marc Pollefeys

In this paper we propose an efficient data-driven solution to self-localization within a floorplan.

Paper
Add Code

Role Prompting Guided Domain Adaptation with General Capability Preserve for Large Language Models

no code implementations • 5 Mar 2024 • Rui Wang, Fei Mi, Yi Chen, Boyang Xue, Hongru Wang, Qi Zhu, Kam-Fai Wong, Ruifeng Xu

2) Role Prompting assigns a central prompt to the general domain and a unique role prompt to each specific domain to minimize inter-domain confusion during training.

Domain Adaptation

Paper
Add Code

Logit Standardization in Knowledge Distillation

2 code implementations • 3 Mar 2024 • Shangquan Sun, Wenqi Ren, Jingzhi Li, Rui Wang, Xiaochun Cao

Knowledge distillation involves transferring soft labels from a teacher to a student using a shared temperature-based softmax function.

Ranked #1 on Knowledge Distillation on CIFAR-100

Knowledge Distillation

1,285

Paper
Code

Mitigating Reversal Curse in Large Language Models via Semantic-aware Permutation Training

no code implementations • 1 Mar 2024 • Qingyan Guo, Rui Wang, Junliang Guo, Xu Tan, Jiang Bian, Yujiu Yang

Accordingly, permutation on the training data is considered as a potential solution, since this can make the model predict antecedent words or tokens.

Language Modelling

Paper
Add Code

Beyond Language Models: Byte Models are Digital World Simulators

no code implementations • 29 Feb 2024 • Shangda Wu, Xu Tan, Zili Wang, Rui Wang, Xiaobing Li, Maosong Sun

Traditional deep learning often overlooks bytes, the basic units of the digital world, where all forms of information and operations are encoded and manipulated in binary format.

Paper
Add Code

WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset

no code implementations • 29 Feb 2024 • Jiantao Qiu, Haijun Lv, Zhenjiang Jin, Rui Wang, Wenchang Ning, JIA YU, Chaobin Zhang, Zhenxiang Li, Pei Chu, Yuan Qu, Jin Shi, Lindong Lu, Runyu Peng, Zhiyuan Zeng, Huanze Tang, Zhikai Lei, Jiawei Hong, Keyu Chen, Zhaoye Fei, Ruiliang Xu, Wei Li, Zhongying Tu, Lin Dahua, Yu Qiao, Hang Yan, Conghui He

To evaluate the quality and utility of the dataset, we trained 1B-parameter and 3B-parameter models using WanJuan-CC and another dataset, RefinedWeb.

Paper
Add Code

Improving Open-Ended Text Generation via Adaptive Decoding

1 code implementation • 28 Feb 2024 • Wenhong Zhu, Hongkun Hao, Zhiwei He, Yiming Ai, Rui Wang

Current language models decode text token by token according to probabilistic distribution, and determining the appropriate candidates for the next token is crucial to ensure generation quality.

Story Generation

Paper
Code

UniRetriever: Multi-task Candidates Selection for Various Context-Adaptive Conversational Retrieval

no code implementations • 26 Feb 2024 • Hongru Wang, Boyang Xue, Baohang Zhou, Rui Wang, Fei Mi, Weichao Wang, Yasheng Wang, Kam-Fai Wong

Conversational retrieval refers to an information retrieval system that operates in an iterative and interactive manner, requiring the retrieval of various external resources, such as persona, knowledge, and even response, to effectively engage with the user and successfully complete the dialogue.

Information Retrieval Retrieval

Paper
Add Code

Measuring Bargaining Abilities of LLMs: A Benchmark and A Buyer-Enhancement Method

1 code implementation • 24 Feb 2024 • Tian Xia, Zhiwei He, Tong Ren, Yibo Miao, Zhuosheng Zhang, Yang Yang, Rui Wang

Bargaining is an important and unique part of negotiation between humans.

Paper
Code

Low-Frequency Black-Box Backdoor Attack via Evolutionary Algorithm

no code implementations • 23 Feb 2024 • Yanqi Qiao, Dazhuang Liu, Rui Wang, Kaitai Liang

Extensive experiments on real-world datasets verify the effectiveness and robustness of LFBA against image processing operations and the state-of-the-art backdoor defenses, as well as its inherent stealthiness in both spatial and frequency space, making it resilient against frequency inspection.

Backdoor Attack

Paper
Add Code

Is Cognition and Action Consistent or Not: Investigating Large Language Model's Personality

no code implementations • 22 Feb 2024 • Yiming Ai, Zhiwei He, Ziyin Zhang, Wenhong Zhu, Hongkun Hao, Kai Yu, Lingjun Chen, Rui Wang

In this study, we investigate the reliability of Large Language Models (LLMs) in professing human-like personality traits through responses to personality questionnaires.

Paper
Add Code

Can Watermarks Survive Translation? On the Cross-lingual Consistency of Text Watermark for Large Language Models

no code implementations • 21 Feb 2024 • Zhiwei He, Binglin Zhou, Hongkun Hao, Aiwei Liu, Xing Wang, Zhaopeng Tu, Zhuosheng Zhang, Rui Wang

Furthermore, we analyze two key factors that contribute to the cross-lingual consistency in text watermarking and propose a defense method that increases the AUC from 0. 67 to 0. 88 under CWRA.

TAG

Paper
Add Code

A Comprehensive Study of Multilingual Confidence Estimation on Large Language Models

no code implementations • 21 Feb 2024 • Boyang Xue, Hongru Wang, Weichao Wang, Rui Wang, Sheng Wang, Zeming Liu, Kam-Fai Wong

The tendency of Large Language Models to generate hallucinations and exhibit overconfidence in predictions raises concerns regarding their reliability.

Paper
Add Code

FViT: A Focal Vision Transformer with Gabor Filter

1 code implementation • 17 Feb 2024 • Yulong Shi, Mingwei Sun, Yongshuai Wang, Rui Wang, Hui Sun, Zengqiang Chen

Vision transformers have achieved encouraging progress in various computer vision tasks.

Computational Efficiency Inductive Bias

Paper
Code

Unsupervised Sign Language Translation and Generation

no code implementations • 12 Feb 2024 • Zhengsheng Guo, Zhiwei He, Wenxiang Jiao, Xing Wang, Rui Wang, Kehai Chen, Zhaopeng Tu, Yong Xu, Min Zhang

Motivated by the success of unsupervised neural machine translation (UNMT), we introduce an unsupervised sign language translation and generation network (USLNet), which learns from abundant single-modality (text and video) data without parallel sign language data.

Machine Translation Sign Language Translation +1

Paper
Add Code

A Novel Paradigm in Solving Multiscale Problems

no code implementations • 7 Feb 2024 • Jing Wang, Zheng Li, Pengyu Lai, Rui Wang, Di Yang, Dewu Yang, Hui Xu, Wen-Quan Tao

By enabling the acquisition of large-scale data with minimal computational demands, coupled with the efficient and accurate characterization of small-scale dynamics via Spectral PINN, our approach offers a valuable and promising approach for researchers seeking to tackle multiscale phenomena effectively.

Paper
Add Code

Partial Identification of Binary Choice Models with Misreported Outcomes

no code implementations • 30 Jan 2024 • Orville Mondal, Rui Wang

In the first approach, the instrument is assumed to only affect the true dependent variable but not misreporting probabilities.

Paper
Add Code

UniMS-RAG: A Unified Multi-source Retrieval-Augmented Generation for Personalized Dialogue Systems

no code implementations • 24 Jan 2024 • Hongru Wang, WenYu Huang, Yang Deng, Rui Wang, Zezhong Wang, YuFei Wang, Fei Mi, Jeff Z. Pan, Kam-Fai Wong

To better plan and incorporate the use of multiple sources in generating personalized response, we firstly decompose it into three sub-tasks: Knowledge Source Selection, Knowledge Retrieval, and Response Generation.

Response Generation Retrieval

Paper
Add Code

Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model

1 code implementation • 23 Jan 2024 • Zhiwei He, Xing Wang, Wenxiang Jiao, Zhuosheng Zhang, Rui Wang, Shuming Shi, Zhaopeng Tu

In this work, we investigate the potential of employing the QE model as the reward model to predict human preferences for feedback training.

Machine Translation Translation

Paper
Code

T2MAC: Targeted and Trusted Multi-Agent Communication through Selective Engagement and Evidence-Driven Integration

no code implementations • 19 Jan 2024 • Chuxiong Sun, Zehua Zang, Jiabao Li, Jiangmeng Li, Xiao Xu, Rui Wang, Changwen Zheng

This process enables agents to collectively use evidence garnered from multiple perspectives, fostering trusted and cooperative behaviors.

SMAC+

Paper
Add Code

DiffusionGPT: LLM-Driven Text-to-Image Generation System

no code implementations • 18 Jan 2024 • Jie Qin, Jie Wu, Weifeng Chen, Yuxi Ren, Huixia Li, Hefeng Wu, Xuefeng Xiao, Rui Wang, Shilei Wen

Diffusion models have opened up new avenues for the field of image generation, resulting in the proliferation of high-quality models shared on open-source platforms.

Model Selection Text-to-Image Generation

Paper
Add Code

R-Judge: Benchmarking Safety Risk Awareness for LLM Agents

1 code implementation • 18 Jan 2024 • Tongxin Yuan, Zhiwei He, Lingzhong Dong, Yiming Wang, Ruijie Zhao, Tian Xia, Lizhen Xu, Binglin Zhou, Fangqi Li, Zhuosheng Zhang, Rui Wang, Gongshen Liu

We introduce R-Judge, a benchmark crafted to evaluate the proficiency of LLMs in judging and identifying safety risks given agent interaction records.

Benchmarking

Paper
Code

Passive Beamforming For Practical RIS-Assisted Communication Systems With Non-Ideal Hardware

no code implementations • 15 Jan 2024 • Yiming Liu, Rui Wang, Zhu Han

Reconfigurable intelligent surface (RIS) technology is a promising solution to improve the performance of existing wireless communications.

Paper
Add Code

Toward distortion-aware change detection in realistic scenarios

no code implementations • 10 Jan 2024 • Yitao Zhao, Heng-Chao Li, Nanqing Liu, Rui Wang

The whole framework is composed of Pretext Representation Pre-training, Bitemporal Image Alignment, and Down-stream Decoder Fine-Tuning.

Change Detection Decoder

Paper
Add Code

LEAP-VO: Long-term Effective Any Point Tracking for Visual Odometry

no code implementations • 3 Jan 2024 • Weirong Chen, Le Chen, Rui Wang, Marc Pollefeys

Visual odometry estimates the motion of a moving camera based on visual input.

Point Tracking Visual Odometry

Paper
Add Code

Boosting Large Language Model for Speech Synthesis: An Empirical Study

no code implementations • 30 Dec 2023 • Hongkun Hao, Long Zhou, Shujie Liu, Jinyu Li, Shujie Hu, Rui Wang, Furu Wei

In this paper, we conduct a comprehensive empirical exploration of boosting LLMs with the ability to generate speech, by combining pre-trained LLM LLaMA/OPT and text-to-speech synthesis model VALL-E. We compare three integration methods between LLMs and speech synthesis models, including directly fine-tuned LLMs, superposed layers of LLMs and VALL-E, and coupled LLMs and VALL-E using LLMs as a powerful text encoder.

Language Modelling Large Language Model +2

Paper
Add Code

Identification of Nonlinear Dynamic Panels under Partial Stationarity

no code implementations • 30 Dec 2023 • Wayne Yuan Gao, Rui Wang

This paper studies identification for a wide range of nonlinear panel data models, including binary choice, ordered response, and other types of limited dependent variable models.

Paper
Add Code

Exploring 3D-aware Lifespan Face Aging via Disentangled Shape-Texture Representations

no code implementations • 28 Dec 2023 • Qianrui Teng, Rui Wang, Xing Cui, Peipei Li, Zhaofeng He

Existing face aging methods often focus on modeling either texture aging or using an entangled shape-texture representation to achieve face aging.

3D Face Reconstruction Texture Synthesis

Paper
Add Code

SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation

1 code implementation • 26 Dec 2023 • Yuxuan Zhang, Yiren Song, Jiaming Liu, Rui Wang, Jinpeng Yu, Hao Tang, Huaxia Li, Xu Tang, Yao Hu, Han Pan, Zhongliang Jing

Recent advancements in subject-driven image generation have led to zero-shot generation, yet precise selection and focus on crucial subject representations remain challenging.

Image Generation

Paper
Code

Near-Field Localization and Phase Shift Optimization for RIS-Assisted Non-Ideal OFDM Systems

no code implementations • 19 Dec 2023 • Hanfu Zhang, Erwu Liu, Rui Wang, Zhe Xing, Yan Liu

By incorporating reconfigurable intelligent surface (RIS) into communication-assisted localization systems, the issue of signal blockage caused by obstacles can be addressed, and passive beamforming can be employed to enhance localization accuracy.

Paper
Add Code

Rethinking Dimensional Rationale in Graph Contrastive Learning from Causal Perspective

1 code implementation • 16 Dec 2023 • Qirui Ji, Jiangmeng Li, Jie Hu, Rui Wang, Changwen Zheng, Fanjiang Xu

To this end, with the purpose of exploring the intrinsic rationale of graphs, we accordingly propose to capture the dimensional rationale from graphs, which has not received sufficient attention in the literature.

Contrastive Learning Meta-Learning

Paper
Code

Transferring CLIP's Knowledge into Zero-Shot Point Cloud Semantic Segmentation

no code implementations • 12 Dec 2023 • Yuanbin Wang, Shaofei Huang, Yulu Gao, Zhen Wang, Rui Wang, Kehua Sheng, Bo Zhang, Si Liu

In this work, we focus on zero-shot point cloud semantic segmentation and propose a simple yet effective baseline to transfer the visual-linguistic knowledge implied in CLIP to point cloud encoder at both feature and output levels.

3D Semantic Segmentation Point Cloud Segmentation +2

Paper
Add Code

Vision-language Assisted Attribute Learning

no code implementations • 12 Dec 2023 • Kongming Liang, Xinran Wang, Rui Wang, Donghui Gao, Ling Jin, Weidong Liu, Xiatian Zhu, Zhanyu Ma, Jun Guo

Attribute labeling at large scale is typically incomplete and partial, posing significant challenges to model optimization.

Attribute Language Modelling +2

Paper
Add Code

EgoPlan-Bench: Benchmarking Egocentric Embodied Planning with Multimodal Large Language Models

1 code implementation • 11 Dec 2023 • Yi Chen, Yuying Ge, Yixiao Ge, Mingyu Ding, Bohao Li, Rui Wang, Ruifeng Xu, Ying Shan, Xihui Liu

Given diverse environmental inputs, including real-time task progress, visual observations, and open-form language instructions, a proficient task planner is expected to predict feasible actions, which is a feat inherently achievable by Multimodal Large Language Models (MLLMs).

Benchmarking Human-Object Interaction Detection

Paper
Code

DiffCast: A Unified Framework via Residual Diffusion for Precipitation Nowcasting

1 code implementation • 11 Dec 2023 • Demin Yu, Xutao Li, Yunming Ye, Baoquan Zhang, Chuyao Luo, Kuai Dai, Rui Wang, Xunlai Chen

A unified and flexible framework that can equip any type of spatio-temporal models is proposed based on residual diffusion, which effectively tackles the shortcomings of previous methods.

Paper
Code

Unsupervised Social Event Detection via Hybrid Graph Contrastive Learning and Reinforced Incremental Clustering

1 code implementation • 8 Dec 2023 • Yuanyuan Guo, Zehua Zang, Hang Gao, Xiao Xu, Rui Wang, Lixiang Liu, Jiangmeng Li

To this end, recent works explore learning discriminative information from social messages by leveraging graph contrastive learning (GCL) and embedding clustering in an unsupervised manner.

Clustering Contrastive Learning +1

Paper
Code

Multi-scale Residual Transformer for VLF Lightning Transients Classification

no code implementations • 7 Dec 2023 • Jinghao Sun, Tingting Ji, Guoyu Wang, Rui Wang

The utilization of Very Low Frequency (VLF) electromagnetic signals in navigation systems is widespread.

Classification

Paper
Add Code

FaceStudio: Put Your Face Everywhere in Seconds

no code implementations • 5 Dec 2023 • Yuxuan Yan, Chi Zhang, Rui Wang, Yichao Zhou, Gege Zhang, Pei Cheng, Gang Yu, Bin Fu

This study investigates identity-preserving image synthesis, an intriguing task in image generation that seeks to maintain a subject's identity while adding a personalized, stylistic touch.

Image Generation

Paper
Add Code

VIoTGPT: Learning to Schedule Vision Tools towards Intelligent Video Internet of Things

no code implementations • 1 Dec 2023 • Yaoyao Zhong, Mengshi Qi, Rui Wang, Yuhan Qiu, Yang Zhang, Huadong Ma

Video Internet of Things (VIoT) has shown full potential in collecting an unprecedented volume of video data.

Paper
Add Code

Riemannian Self-Attention Mechanism for SPD Networks

no code implementations • 28 Nov 2023 • Rui Wang, Xiao-Jun Wu, Hui Li, Josef Kittler

Symmetric positive definite (SPD) matrix has been demonstrated to be an effective feature descriptor in many scientific areas, as it can encode spatiotemporal statistics of the data adequately on a curved Riemannian manifold, i. e., SPD manifold.

Benchmarking Riemannian optimization

Paper
Add Code

SEED-Bench-2: Benchmarking Multimodal Large Language Models

1 code implementation • 28 Nov 2023 • Bohao Li, Yuying Ge, Yixiao Ge, Guangzhi Wang, Rui Wang, Ruimao Zhang, Ying Shan

Multimodal large language models (MLLMs), building upon the foundation of powerful large language models (LLMs), have recently demonstrated exceptional capabilities in generating not only texts but also images given interleaved multimodal inputs (acting like a combination of GPT-4V and DALL-E 3).

Benchmarking Image Generation +1

251

Paper
Code

Hessian Aware Low-Rank Weight Perturbation for Continual Learning

1 code implementation • 26 Nov 2023 • Jiaqi Li, Rui Wang, Yuanhao Lai, Changjian Shui, Sabyasachi Sahoo, Charles X. Ling, Shichun Yang, Boyu Wang, Christian Gagné, Fan Zhou

We conduct extensive experiments on various benchmarks, including a dataset with large-scale tasks, and compare our method against some recent state-of-the-art methods to demonstrate the effectiveness and scalability of our proposed method.

Continual Learning

Paper
Code

RetroDiff: Retrosynthesis as Multi-stage Distribution Interpolation

no code implementations • 23 Nov 2023 • Yiming Wang, Yuxuan Song, Minkai Xu, Rui Wang, Hao Zhou, WeiYing Ma

Our key innovation is to develop a multi-stage diffusion process.

Graph Generation Retrosynthesis

Paper
Add Code

Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents

1 code implementation • 20 Nov 2023 • Zhuosheng Zhang, Yao Yao, Aston Zhang, Xiangru Tang, Xinbei Ma, Zhiwei He, Yiming Wang, Mark Gerstein, Rui Wang, Gongshen Liu, Hai Zhao

Large language models (LLMs) have dramatically enhanced the field of language intelligence, as demonstrably evidenced by their formidable empirical performance across a spectrum of complex reasoning tasks.

309

Paper
Code

MELA: Multilingual Evaluation of Linguistic Acceptability

no code implementations • 15 Nov 2023 • Ziyin Zhang, Yikang Liu, Weifang Huang, Junyu Mao, Rui Wang, Hai Hu

Recent benchmarks for Large Language Models (LLMs) have mostly focused on application-driven tasks such as complex reasoning and code generation, and this has led to a scarcity in purely linguistic evaluation of LLMs.

Code Generation Cross-Lingual Transfer +3

Paper
Add Code

CLEAN-EVAL: Clean Evaluation on Contaminated Large Language Models

no code implementations • 15 Nov 2023 • Wenhong Zhu, Hongkun Hao, Zhiwei He, Yunze Song, Yumeng Zhang, Hanxu Hu, Yiran Wei, Rui Wang, Hongyuan Lu

The best candidate is finally selected from this set based on the BLEURT score.

Few-Shot Learning

Paper
Add Code

Unifying the Perspectives of NLP and Software Engineering: A Survey on Language Models for Code

1 code implementation • 14 Nov 2023 • Ziyin Zhang, Chaoyu Chen, Bingchang Liu, Cong Liao, Zi Gong, Hang Yu, Jianguo Li, Rui Wang

In this work we systematically review the recent advancements in code processing with language models, covering 50+ models, 30+ evaluation tasks, 170+ datasets, and 800 related works.

802

Paper
Code

CASTER: A Computer-Vision-Assisted Wireless Channel Simulator for Gesture Recognition

1 code implementation • 13 Nov 2023 • Zhenyu Ren, Guoliang Li, Chenqing Ji, Chao Yu, Shuai Wang, Rui Wang

In the proposed CASTER simulator, however, the training dataset can be simulated via existing videos.

Hand Gesture Recognition Hand-Gesture Recognition

Paper
Code

Passive Handwriting Tracking via Weak mmWave Communication Signals

no code implementations • 3 Nov 2023 • Chao Yu, Yan Luo, Renqi Chen, Rui Wang

In this letter, a cooperative sensing framework based on millimeter wave (mmWave) communication systems is proposed to detect tiny motions with a millimeter-level resolution.

Paper
Add Code

Dynamic Uploading Scheduling in mmWave-Based Sensor Networks via Mobile Blocker Detection

no code implementations • 2 Nov 2023 • Yifei Sun, Bojie Lv, Rui Wang, Haisheng Tan, Francis C. M. Lau

As a result, the AoI degradation arising from link blockage can be forecast and mitigated.

Scheduling

Paper
Add Code

Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models

1 code implementation • 31 Oct 2023 • Tian Liang, Zhiwei He, Jen-tse Huang, Wenxuan Wang, Wenxiang Jiao, Rui Wang, Yujiu Yang, Zhaopeng Tu, Shuming Shi, Xing Wang

Ideally, an advanced agent should possess the ability to accurately describe a given word using an aggressive description while concurrently maximizing confusion in the conservative description, enhancing its participation in the game.

Paper
Code

Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning

no code implementations • 23 Oct 2023 • Hao Wang, Xiahua Chen, Rui Wang, Chenhui Chu

Extracting meaningful entities belonging to predefined categories from Visually-rich Form-like Documents (VFDs) is a challenging task.

Paper
Add Code

DocTrack: A Visually-Rich Document Dataset Really Aligned with Human Eye Movement for Machine Reading

1 code implementation • 23 Oct 2023 • Hao Wang, Qingxuan Wang, Yue Li, Changqing Wang, Chenhui Chu, Rui Wang

The use of visually-rich documents (VRDs) in various fields has created a demand for Document AI models that can read and comprehend documents like humans, which requires the overcoming of technical, linguistic, and cognitive barriers.

document understanding Reading Comprehension

Paper
Code

Penalty Decoding: Well Suppress the Self-Reinforcement Effect in Open-Ended Text Generation

1 code implementation • 23 Oct 2023 • Wenhong Zhu, Hongkun Hao, Rui Wang

This paper investigates the self-reinforcement effect in text generation and the effectiveness of a repetition penalty to mitigate it.

Text Generation

Paper
Code

MCC-KD: Multi-CoT Consistent Knowledge Distillation

1 code implementation • 23 Oct 2023 • Hongzhan Chen, Siyue Wu, Xiaojun Quan, Rui Wang, Ming Yan, Ji Zhang

Large language models (LLMs) have showcased remarkable capabilities in complex reasoning through chain of thought (CoT) prompting.

Knowledge Distillation Mathematical Reasoning

Paper
Code

Rethinking Word-Level Auto-Completion in Computer-Aided Translation

1 code implementation • 23 Oct 2023 • Xingyu Chen, Lemao Liu, Guoping Huang, Zhirui Zhang, Mingming Yang, Shuming Shi, Rui Wang

Word-Level Auto-Completion (WLAC) plays a crucial role in Computer-Assisted Translation.

Translation

Paper
Code

Large-Scale and Multi-Perspective Opinion Summarization with Diverse Review Subsets

1 code implementation • 20 Oct 2023 • Han Jiang, Rui Wang, Zhihua Wei, Yu Li, Xinpeng Wang

Furthermore, our in-depth analysis verifies that the advanced selection of review subsets and the two-stage training scheme are vital to boosting the summarization performance.

Opinion Summarization

Paper
Code

FuseSR: Super Resolution for Real-time Rendering through Efficient Multi-resolution Fusion

no code implementations • 15 Oct 2023 • Zhihua Zhong, Jingsen Zhu, Yuxin Dai, Chuankun Zheng, Yuchi Huo, Guanlin Chen, Hujun Bao, Rui Wang

To mitigate this problem, one of the most popular solutions is to render images at a low resolution to reduce rendering overhead, and then manage to accurately upsample the low-resolution rendered image to the target resolution, a. k. a.

4k Super-Resolution

Paper
Add Code

RethinkingTMSC: An Empirical Study for Target-Oriented Multimodal Sentiment Classification

1 code implementation • 14 Oct 2023 • Junjie Ye, Jie zhou, Junfeng Tian, Rui Wang, Qi Zhang, Tao Gui, Xuanjing Huang

Recently, Target-oriented Multimodal Sentiment Classification (TMSC) has gained significant attention among scholars.

Sentiment Analysis Sentiment Classification

Paper
Code

Large Language Models as Source Planner for Personalized Knowledge-grounded Dialogue

no code implementations • 13 Oct 2023 • Hongru Wang, Minda Hu, Yang Deng, Rui Wang, Fei Mi, Weichao Wang, Yasheng Wang, Wai-Chung Kwan, Irwin King, Kam-Fai Wong

Open-domain dialogue system usually requires different sources of knowledge to generate more informative and evidential responses.

Response Generation

Paper
Add Code

Improving Factual Consistency for Knowledge-Grounded Dialogue Systems via Knowledge Enhancement and Alignment

1 code implementation • 12 Oct 2023 • Boyang Xue, Weichao Wang, Hongru Wang, Fei Mi, Rui Wang, Yasheng Wang, Lifeng Shang, Xin Jiang, Qun Liu, Kam-Fai Wong

Inspired by previous work which identified that feed-forward networks (FFNs) within Transformers are responsible for factual knowledge expressions, we investigate two methods to efficiently improve the factual expression capability {of FFNs} by knowledge enhancement and alignment respectively.

Paper
Code

Leveraging Neural Radiance Fields for Uncertainty-Aware Visual Localization

no code implementations • 10 Oct 2023 • Le Chen, Weirong Chen, Rui Wang, Marc Pollefeys

As a promising fashion for visual localization, scene coordinate regression (SCR) has seen tremendous progress in the past decade.

regression Visual Localization

Paper
Add Code

Discovering Symmetry Breaking in Physical Systems with Relaxed Group Convolution

no code implementations • 3 Oct 2023 • Rui Wang, Elyssa Hofgard, Han Gao, Robin Walters, Tess E. Smidt

Modeling symmetry breaking is essential for understanding the fundamental changes in the behaviors and properties of physical systems, from microscopic particle interactions to macroscopic phenomena like fluid dynamics and cosmic structures.

Super-Resolution

Paper
Add Code

TPE: Towards Better Compositional Reasoning over Conceptual Tools with Multi-persona Collaboration

no code implementations • 28 Sep 2023 • Hongru Wang, Huimin Wang, Lingzhi Wang, Minda Hu, Rui Wang, Boyang Xue, Hongyuan Lu, Fei Mi, Kam-Fai Wong

Large language models (LLMs) have demonstrated exceptional performance in planning the use of various functional tools, such as calculators and retrievers, particularly in question-answering tasks.

Question Answering Response Generation

Paper
Add Code

Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack

no code implementations • 27 Sep 2023 • Xiaoliang Dai, Ji Hou, Chih-Yao Ma, Sam Tsai, Jialiang Wang, Rui Wang, Peizhao Zhang, Simon Vandenhende, Xiaofang Wang, Abhimanyu Dubey, Matthew Yu, Abhishek Kadian, Filip Radenovic, Dhruv Mahajan, Kunpeng Li, Yue Zhao, Vladan Petrovic, Mitesh Kumar Singh, Simran Motwani, Yi Wen, Yiwen Song, Roshan Sumbaly, Vignesh Ramanathan, Zijian He, Peter Vajda, Devi Parikh

Training text-to-image models with web scale image-text pairs enables the generation of a wide range of visual concepts from text.

Image Generation

Paper
Add Code

Learning Point-wise Abstaining Penalty for Point Cloud Anomaly Detection

1 code implementation • 19 Sep 2023 • Shaocong Xu, Pengfei Li, Xinyu Liu, Qianpu Sun, Yang Li, Shihui Guo, Zhen Wang, Bo Jiang, Rui Wang, Kehua Sheng, Bo Zhang, Hao Zhao

We demonstrate that learning different abstaining penalties, apart from point-wise penalty, for different types of (synthesized) outliers can further improve the performance.

Anomaly Detection Autonomous Driving +1

Paper
Code

AutoDiffusion: Training-Free Optimization of Time Steps and Architectures for Automated Diffusion Model Acceleration

1 code implementation • ICCV 2023 • Lijiang Li, Huixia Li, Xiawu Zheng, Jie Wu, Xuefeng Xiao, Rui Wang, Min Zheng, Xin Pan, Fei Chao, Rongrong Ji

Therefore, we propose to search the optimal time steps sequence and compressed model architecture in a unified framework to achieve effective image generation for diffusion models without any further training.

Image Generation single-image-generation

Paper
Code

UGC: Unified GAN Compression for Efficient Image-to-Image Translation

no code implementations • ICCV 2023 • Yuxi Ren, Jie Wu, Peng Zhang, Manlin Zhang, Xuefeng Xiao, Qian He, Rui Wang, Min Zheng, Xin Pan

Recent years have witnessed the prevailing progress of Generative Adversarial Networks (GANs) in image-to-image translation.

Image-to-Image Translation Translation

Paper
Add Code

A Benchmark for Text Expansion: Datasets, Metrics, and Baselines

no code implementations • 17 Sep 2023 • Yi Chen, Haiyun Jiang, Wei Bi, Rui Wang, Longyue Wang, Shuming Shi, Ruifeng Xu

This work presents a new task of Text Expansion (TE), which aims to insert fine-grained modifiers into proper locations of the plain text to concretize or vivify human writings.

2k Informativeness +1

Paper
Add Code

Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers

1 code implementation • 15 Sep 2023 • Qingyan Guo, Rui Wang, Junliang Guo, Bei Li, Kaitao Song, Xu Tan, Guoqing Liu, Jiang Bian, Yujiu Yang

Large Language Models (LLMs) excel in various tasks, but they rely on carefully crafted prompts that often demand substantial human effort.

Evolutionary Algorithms

Paper
Code

DiffusionEngine: Diffusion Model is Scalable Data Engine for Object Detection

no code implementations • 7 Sep 2023 • Manlin Zhang, Jie Wu, Yuxi Ren, Ming Li, Jie Qin, Xuefeng Xiao, Wei Liu, Rui Wang, Min Zheng, Andy J. Ma

This paper reveals that the recently developed Diffusion Model is a scalable data engine for object detection.

Data Augmentation object-detection +1

Paper
Add Code

What are Public Concerns about ChatGPT? A Novel Self-Supervised Neural Topic Model Tells You

no code implementations • 4 Sep 2023 • Rui Wang, Xing Liu, Yanan Wang, Haiping Huang

The recently released artificial intelligence conversational agent, ChatGPT, has gained significant attention in academia and real life.

Representation Learning

Paper
Add Code

M2HGCL: Multi-Scale Meta-Path Integrated Heterogeneous Graph Contrastive Learning

no code implementations • 3 Sep 2023 • Yuanyuan Guo, Yu Xia, Rui Wang, Rongcheng Duan, Lu Li, Jiangmeng Li

Orthogonal to homogeneous graphs, the types of nodes and edges in heterogeneous graphs are diverse so that specialized graph contrastive learning methods are required.

Contrastive Learning

Paper
Add Code

Exploring the Robustness of Human Parsers Towards Common Corruptions

no code implementations • 2 Sep 2023 • Sanyi Zhang, Xiaochun Cao, Rui Wang, Guo-Jun Qi, Jie zhou

The experimental results show that the proposed method demonstrates good universality which can improve the robustness of the human parsing models and even the semantic segmentation models when facing various image common corruptions.

Data Augmentation Human Parsing +1

Paper
Add Code

FTA: Stealthy and Adaptive Backdoor Attack with Flexible Triggers on Federated Learning

no code implementations • 31 Aug 2023 • Yanqi Qiao, Dazhuang Liu, Congwen Chen, Rui Wang, Kaitai Liang

In this work, we propose a new stealthy and robust backdoor attack with flexible triggers against FL defenses.

Backdoor Attack Federated Learning

Paper
Add Code

Lifelike Agility and Play on Quadrupedal Robots using Reinforcement Learning and Generative Pre-trained Models

no code implementations • 29 Aug 2023 • Lei Han, Qingxu Zhu, Jiapeng Sheng, Chong Zhang, Tingguang Li, Yizheng Zhang, He Zhang, Yuzhen Liu, Cheng Zhou, Rui Zhao, Jie Li, Yufeng Zhang, Rui Wang, Wanchao Chi, Xiong Li, Yonghui Zhu, Lingzhu Xiang, Xiao Teng, Zhengyou Zhang

In this work, we propose a framework for driving legged robots act like real animals with lifelike agility and strategy in complex environments.

TAG

Paper
Add Code

DLIP: Distilling Language-Image Pre-training

no code implementations • 24 Aug 2023 • Huafeng Kuang, Jie Wu, Xiawu Zheng, Ming Li, Xuefeng Xiao, Rui Wang, Min Zheng, Rongrong Ji

Furthermore, DLIP succeeds in retaining more than 95% of the performance with 22. 4% parameters and 24. 8% FLOPs compared to the teacher model and accelerates inference speed by 2. 7x.

Image Captioning Knowledge Distillation +5

Paper
Add Code

ChatGPT in Drug Discovery: A Case Study on Anti-Cocaine Addiction Drug Development with Chatbots

no code implementations • 14 Aug 2023 • Rui Wang, Hongsong Feng, Guo-Wei Wei

This paper not only explores the integration of advanced AI in drug discovery but also reimagines the landscape by advocating for AI-powered chatbots as trailblazers in revolutionizing therapeutic innovation.

Chatbot Drug Discovery +1

Paper
Add Code

Face Encryption via Frequency-Restricted Identity-Agnostic Attacks

no code implementations • 11 Aug 2023 • Xin Dong, Rui Wang, Siyuan Liang, Aishan Liu, Lihua Jing

As for the weak black-box scenario feasibility, we obverse that representations of the average feature in multiple face recognition models are similar, thus we propose to utilize the average feature via the crawled dataset from the Internet as the target to guide the generation, which is also agnostic to identities of unknown face recognition systems; in nature, the low-frequency perturbations are more visually perceptible by the human vision system.

Face Recognition

Paper
Add Code

A Quantize-then-Estimate Protocol for CSI Acquisition in IRS-Aided Downlink Communication

no code implementations • 4 Aug 2023 • Rui Wang, Zhaorui Wang, Liang Liu, Shuowen Zhang, Shi Jin

Different from the uplink counterpart where the BS possesses the pilot signals containing the CSI of all the users, in downlink communication, the distributed users merely receive the pilot signals containing their own CSI and cannot leverage the correlation in different users' channels revealed in [1].

Quantization

Paper
Add Code

SEED-Bench: Benchmarking Multimodal LLMs with Generative Comprehension

2 code implementations • 30 Jul 2023 • Bohao Li, Rui Wang, Guangzhi Wang, Yuying Ge, Yixiao Ge, Ying Shan

Based on powerful Large Language Models (LLMs), recent generative Multimodal Large Language Models (MLLMs) have gained prominence as a pivotal research area, exhibiting remarkable capability for both comprehension and generation.

Benchmarking Multiple-choice

369

Paper
Code

Phase Matching for Out-of-Distribution Generalization

no code implementations • 24 Jul 2023 • Chengming Hu, Yeqian Du, Rui Wang, Hao Chen

In this paper, we aim to clarify the relationships between Domain Generalization (DG) and the frequency components, and explore the spatial relationships of the phase spectrum.

Domain Generalization Out-of-Distribution Generalization +1

Paper
Add Code

AlignDet: Aligning Pre-training and Fine-tuning in Object Detection

1 code implementation • ICCV 2023 • Ming Li, Jie Wu, Xionghui Wang, Chen Chen, Jie Qin, Xuefeng Xiao, Rui Wang, Min Zheng, Xin Pan

To this end, we propose AlignDet, a unified pre-training framework that can be adapted to various existing detectors to alleviate the discrepancies.

object-detection Object Detection

131

Paper
Code

POV-Surgery: A Dataset for Egocentric Hand and Tool Pose Estimation During Surgical Activities

1 code implementation • 19 Jul 2023 • Rui Wang, Sophokles Ktistakis, Siwei Zhang, Mirko Meboldt, Quentin Lohmeyer

The surgical usage of Mixed Reality (MR) has received growing attention in areas such as surgical navigation systems, skill assessment, and robot-assisted surgeries.

hand-object pose Mixed Reality +3

Paper
Code

TableGPT: Towards Unifying Tables, Nature Language and Commands into One GPT

no code implementations • 17 Jul 2023 • Liangyu Zha, Junlin Zhou, Liyao Li, Rui Wang, Qingyi Huang, Saisai Yang, Jing Yuan, Changbao Su, Xiang Li, Aofeng Su, Tao Zhang, Chen Zhou, Kaizhe Shou, Miao Wang, Wufang Zhu, Guoshan Lu, Chao Ye, Yali Ye, Wentao Ye, Yiming Zhang, Xinglong Deng, Jie Xu, Haobo Wang, Gang Chen, Junbo Zhao

Tables are prevalent in real-world databases, requiring significant time and effort for humans to analyze and manipulate.

Data Visualization Question Answering

Paper
Add Code

Artificial Intelligence for Science in Quantum, Atomistic, and Continuum Systems

1 code implementation • 17 Jul 2023 • Xuan Zhang, Limei Wang, Jacob Helwig, Youzhi Luo, Cong Fu, Yaochen Xie, Meng Liu, Yuchao Lin, Zhao Xu, Keqiang Yan, Keir Adams, Maurice Weiler, Xiner Li, Tianfan Fu, Yucheng Wang, Haiyang Yu, Yuqing Xie, Xiang Fu, Alex Strasser, Shenglong Xu, Yi Liu, Yuanqi Du, Alexandra Saxton, Hongyi Ling, Hannah Lawrence, Hannes Stärk, Shurui Gui, Carl Edwards, Nicholas Gao, Adriana Ladera, Tailin Wu, Elyssa F. Hofgard, Aria Mansouri Tehrani, Rui Wang, Ameya Daigavane, Montgomery Bohde, Jerry Kurtin, Qian Huang, Tuong Phung, Minkai Xu, Chaitanya K. Joshi, Simon V. Mathis, Kamyar Azizzadenesheli, Ada Fang, Alán Aspuru-Guzik, Erik Bekkers, Michael Bronstein, Marinka Zitnik, Anima Anandkumar, Stefano Ermon, Pietro Liò, Rose Yu, Stephan Günnemann, Jure Leskovec, Heng Ji, Jimeng Sun, Regina Barzilay, Tommi Jaakkola, Connor W. Coley, Xiaoning Qian, Xiaofeng Qian, Tess Smidt, Shuiwang Ji

Advances in artificial intelligence (AI) are fueling a new paradigm of discoveries in natural sciences.

Out-of-Distribution Generalization Transfer Learning +1

413

Paper
Code

Shadow operator: Effective dynamic load change operation training in air separation processes based on industrial nonlinear MPC and Bloom's taxonomy

no code implementations • 6 Jul 2023 • Guanghui Yang, Zhijiang Shao, Rui Wang, Zuhua Xu, Lidan Cui

A novel human-machine interactive training method for dynamic load change operation in air separation processes (ASPs) is proposed.

Model Predictive Control

Paper
Add Code

Learning to Branch in Combinatorial Optimization with Graph Pointer Networks

no code implementations • 4 Jul 2023 • Rui Wang, Zhiming Zhou, Tao Zhang, Ling Wang, Xin Xu, Xiangke Liao, Kaiwen Li

The proposed model, which combines the graph neural network and the pointer mechanism, can effectively map from the solver state to the branching variable decisions.

Combinatorial Optimization Variable Selection

Paper
Add Code

Meta-Reasoning: Semantics-Symbol Deconstruction for Large Language Models

1 code implementation • 30 Jun 2023 • Yiming Wang, Zhuosheng Zhang, Pei Zhang, Baosong Yang, Rui Wang

Neural-symbolic methods have demonstrated efficiency in enhancing the reasoning abilities of large language models (LLMs).

Domain Generalization In-Context Learning +1

Paper
Code

Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation

1 code implementation • NeurIPS 2023 • Zibo Zhao, Wen Liu, Xin Chen, Xianfang Zeng, Rui Wang, Pei Cheng, Bin Fu, Tao Chen, Gang Yu, Shenghua Gao

We present a novel alignment-before-generation approach to tackle the challenging task of generating general 3D shapes based on 2D images or texts.

3D Shape Generation Decoder

271

Paper
Code

Learning-to-Rank Meets Language: Boosting Language-Driven Ordering Alignment for Ordinal Classification

2 code implementations • NeurIPS 2023 • Rui Wang, Peipei Li, Huaibo Huang, Chunshui Cao, Ran He, Zhaofeng He

Consequently, we propose a cross-modal ordinal pairwise loss to refine the CLIP feature space, where texts and images maintain both semantic alignment and ordering alignment.

Age Estimation Classification +2

Paper
Code

Structured Learning in Time-dependent Cox Models

1 code implementation • 21 Jun 2023 • Guanbo Wang, Yi Lian, Archer Y. Yang, Robert W. Platt, Rui Wang, Sylvie Perreault, Marc Dorais, Mireille E. Schnitzer

We propose a flexible framework for variable selection in time-dependent Cox models, accommodating complex selection rules.

Survival Analysis Variable Selection

Paper
Code

Multi-objective Molecular Optimization for Opioid Use Disorder Treatment Using Generative Network Complex

no code implementations • 13 Jun 2023 • Hongsong Feng, Rui Wang, Chang-Guo Zhan, Guo-Wei Wei

Opioid Use Disorder (OUD) has emerged as a significant global public health issue, with complex multifaceted conditions.

Paper
Add Code

Rethinking Translation Memory Augmented Neural Machine Translation

no code implementations • 12 Jun 2023 • Hongkun Hao, Guoping Huang, Lemao Liu, Zhirui Zhang, Shuming Shi, Rui Wang

The finding demonstrates that TM-augmented NMT is good at the ability of fitting data (i. e., lower bias) but is more sensitive to the fluctuations in the training data (i. e., higher variance), which provides an explanation to a recently reported contradictory phenomenon on the same translation task: TM-augmented NMT substantially advances vanilla NMT under the high-resource scenario whereas it fails under the low-resource scenario.

Machine Translation NMT +2

Paper
Add Code

PLPCA: Persistent Laplacian Enhanced-PCA for Microarray Data Analysis

1 code implementation • 9 Jun 2023 • Sean Cottrell, Rui Wang, GuoWei Wei

Over the years, Principal Component Analysis (PCA) has served as the baseline approach for dimensionality reduction in gene expression data analysis.

Dimensionality Reduction

Paper
Code

Extract and Attend: Improving Entity Translation in Neural Machine Translation

no code implementations • 4 Jun 2023 • Zixin Zeng, Rui Wang, Yichong Leng, Junliang Guo, Xu Tan, Tao Qin, Tie-Yan Liu

Inspired by this translation process, we propose an Extract-and-Attend approach to enhance entity translation in NMT, where the translation candidates of source entities are first extracted from a dictionary and then attended to by the NMT model to generate the target sentence.

Decoder Machine Translation +3

Paper
Add Code

Deliberate then Generate: Enhanced Prompting Framework for Text Generation

no code implementations • 31 May 2023 • Bei Li, Rui Wang, Junliang Guo, Kaitao Song, Xu Tan, Hany Hassan, Arul Menezes, Tong Xiao, Jiang Bian, Jingbo Zhu

Large language models (LLMs) have shown remarkable success across a wide range of natural language generation tasks, where proper prompt designs make great impacts.

Text Generation

Paper
Add Code

Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate

1 code implementation • 30 May 2023 • Tian Liang, Zhiwei He, Wenxiang Jiao, Xing Wang, Yan Wang, Rui Wang, Yujiu Yang, Zhaopeng Tu, Shuming Shi

To address the DoT problem, we propose a Multi-Agent Debate (MAD) framework, in which multiple agents express their arguments in the state of "tit for tat" and a judge manages the debate process to obtain a final solution.

Arithmetic Reasoning Machine Translation

183

Paper
Code

Revisiting Acceptability Judgements

1 code implementation • 23 May 2023 • Hai Hu, Ziyin Zhang, Weifang Huang, Jackie Yan-Ki Lai, Aini Li, Yina Patterson, Jiahui Huang, Peng Zhang, Chien-Jer Charles Lin, Rui Wang

We introduce CoLAC - Corpus of Linguistic Acceptability in Chinese, the first large-scale acceptability dataset for a non-Indo-European language.

Cross-Lingual Transfer Linguistic Acceptability

Paper
Code

Enhancing Large Language Models Against Inductive Instructions with Dual-critique Prompting

1 code implementation • 23 May 2023 • Rui Wang, Hongru Wang, Fei Mi, Yi Chen, Boyang Xue, Kam-Fai Wong, Ruifeng Xu

Numerous works are proposed to align large language models (LLMs) with human intents to better fulfill instructions, ensuring they are trustful and helpful.

counterfactual Fact Checking

Paper
Code

TeCS: A Dataset and Benchmark for Tense Consistency of Machine Translation

1 code implementation • 23 May 2023 • Yiming Ai, Zhiwei He, Kai Yu, Rui Wang

Tense inconsistency frequently occurs in machine translation.

Machine Translation Translation

Paper
Code

Element-aware Summarization with Large Language Models: Expert-aligned Evaluation and Chain-of-Thought Method

1 code implementation • 22 May 2023 • Yiming Wang, Zhuosheng Zhang, Rui Wang

Further, we propose a Summary Chain-of-Thought (SumCoT) technique to elicit LLMs to generate summaries step by step, which helps them integrate more fine-grained details of source documents into the final summaries that correlate with the human writing mindset.

Benchmarking Hallucination

Paper
Code

Nearest Neighbor Machine Translation is Meta-Optimizer on Output Projection Layer

1 code implementation • 22 May 2023 • Ruize Gao, Zhirui Zhang, Yichao Du, Lemao Liu, Rui Wang

Nearest Neighbor Machine Translation ($k$NN-MT) has achieved great success in domain adaptation tasks by integrating pre-trained Neural Machine Translation (NMT) models with domain-specific token-level retrieval.

Domain Adaptation Machine Translation +3

Paper
Code

Sparse Representer Theorems for Learning in Reproducing Kernel Banach Spaces

no code implementations • 21 May 2023 • Rui Wang, Yuesheng Xu, Mingsong Yan

Sparsity of a learning solution is a desirable feature in machine learning.

Sparse Learning

Paper
Add Code

Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMs

2 code implementations • 19 May 2023 • Hongru Wang, Rui Wang, Fei Mi, Yang Deng, Zezhong Wang, Bin Liang, Ruifeng Xu, Kam-Fai Wong

Large Language Models (LLMs), such as \texttt{ChatGPT}, greatly empower dialogue systems with strong language understanding and generation capabilities.

Question Answering Semantic Similarity +1

Paper
Code

Clustering-Aware Negative Sampling for Unsupervised Sentence Representation

1 code implementation • 17 May 2023 • Jinghao Deng, Fanqi Wan, Tao Yang, Xiaojun Quan, Rui Wang

Contrastive learning has been widely studied in sentence representation learning.

Clustering Contrastive Learning +4

Paper
Code

AD-KD: Attribution-Driven Knowledge Distillation for Language Model Compression

1 code implementation • 17 May 2023 • Siyue Wu, Hongzhan Chen, Xiaojun Quan, Qifan Wang, Rui Wang

To enhance the knowledge transfer of model reasoning and generalization, we further explore multi-view attribution distillation on all potential decisions of the teacher.

Knowledge Distillation Language Modelling +2

Paper
Code

Exploring Human-Like Translation Strategy with Large Language Models

2 code implementations • 6 May 2023 • Zhiwei He, Tian Liang, Wenxiang Jiao, Zhuosheng Zhang, Yujiu Yang, Rui Wang, Zhaopeng Tu, Shuming Shi, Xing Wang

Compared to typical machine translation that focuses solely on source-to-target mapping, LLM-based translation can potentially mimic the human translation process which might take preparatory steps to ensure high-quality translation.

Hallucination Machine Translation +2

183

Paper
Code

Unsupervised Dialogue Topic Segmentation with Topic-aware Utterance Representation

1 code implementation • 4 May 2023 • Haoyu Gao, Rui Wang, Ting-En Lin, Yuchuan Wu, Min Yang, Fei Huang, Yongbin Li

Dialogue Topic Segmentation (DTS) plays an essential role in a variety of dialogue modeling tasks.

Segmentation Semantic Similarity +1

1,009

Paper
Code

Variational Bayesian Multiuser Tracking for Reconfigurable Intelligent Surface Aided MIMO-OFDM Systems

no code implementations • 24 Apr 2023 • Boyu Teng, Xiaojun Yuan, Rui Wang

Reconfigurable intelligent surface (RIS) has attracted enormous interest for its potential advantages in assisting both wireless communication and environmental sensing.

Paper
Add Code

Delving into Shape-aware Zero-shot Semantic Segmentation

1 code implementation • CVPR 2023 • Xinyu Liu, Beiwen Tian, Zhen Wang, Rui Wang, Kehua Sheng, Bo Zhang, Hao Zhao, Guyue Zhou

Thanks to the impressive progress of large-scale vision-language pretraining, recent recognition models can classify arbitrary objects in a zero-shot and open-set manner, with a surprisingly high accuracy.

Image Segmentation Segmentation +2

108

Paper
Code

Testing and Identifying Substitution and Complementarity Patterns

no code implementations • 3 Apr 2023 • Rui Wang

This paper studies semiparametric identification of substitution and complementarity patterns between two goods using a panel multinomial choice model with bundles.

Paper
Add Code

Exploring the Use of Large Language Models for Reference-Free Text Quality Evaluation: An Empirical Study

1 code implementation • 3 Apr 2023 • Yi Chen, Rui Wang, Haiyun Jiang, Shuming Shi, Ruifeng Xu

Evaluating the quality of generated text is a challenging task in NLP, due to the inherent complexity and diversity of text.

Language Modelling Large Language Model

Paper
Code

IV Regressions without Exclusion Restrictions

no code implementations • 2 Apr 2023 • Wayne Yuan Gao, Rui Wang

We study identification and estimation of endogenous linear and nonlinear regression models without excluded instrumental variables, based on the standard mean independence condition and a nonlinear relevance condition.

regression

Paper
Add Code

FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation

no code implementations • CVPR 2023 • Jie Qin, Jie Wu, Pengxiang Yan, Ming Li, Ren Yuxi, Xuefeng Xiao, Yitong Wang, Rui Wang, Shilei Wen, Xin Pan, Xingang Wang

Recently, open-vocabulary learning has emerged to accomplish segmentation for arbitrary categories of text-based descriptions, which popularizes the segmentation system to more general-purpose application scenarios.

Ranked #6 on Open Vocabulary Panoptic Segmentation on ADE20K

Image Segmentation Instance Segmentation +3

Paper
Add Code

Predictive Resource Allocation in mmWave Systems with Rotation Detection

no code implementations • 29 Mar 2023 • Yifei Sun, Bojie Lv, Rui Wang, Haisheng Tan, Francis C. M. Lau

Millimeter wave (MmWave) has been regarded as a promising technology to support high-capacity communications in 5G era.

Scheduling

Paper
Add Code

Point Identification of LATE with Two Imperfect Instruments

no code implementations • 24 Mar 2023 • Rui Wang

This paper characterizes point identification results of the local average treatment effect (LATE) using two imperfect instruments.

Vocal Bursts Valence Prediction

Paper
Add Code

Solving Oscillation Problem in Post-Training Quantization Through a Theoretical Perspective

1 code implementation • CVPR 2023 • Yuexiao Ma, Huixia Li, Xiawu Zheng, Xuefeng Xiao, Rui Wang, Shilei Wen, Xin Pan, Fei Chao, Rongrong Ji

In particular, we first formulate the oscillation in PTQ and prove the problem is caused by the difference in module capacity.

Quantization

Paper
Code

Pluralistic Aging Diffusion Autoencoder

no code implementations • ICCV 2023 • Peipei Li, Rui Wang, Huaibo Huang, Ran He, Zhaofeng He

Face aging is an ill-posed problem because multiple plausible aging patterns may correspond to a given input.

Denoising

Paper
Add Code

Spatial-temporal Transformer for Affective Behavior Analysis

no code implementations • 19 Mar 2023 • Peng Zou, Rui Wang, Kehua Wen, Yasi Peng, Xiao Sun

The in-the-wild affective behavior analysis has been an important study.

Data Augmentation

Paper
Add Code

Uncertainty-Aware Pedestrian Trajectory Prediction via Distributional Diffusion

no code implementations • 15 Mar 2023 • Yao Liu, Zesheng Ye, Rui Wang, Binghao Li, Quan Z. Sheng, Lina Yao

Tremendous efforts have been put forth on predicting pedestrian trajectory with generative models to accommodate uncertainty and multi-modality in human behaviors.

Denoising Pedestrian Trajectory Prediction +1

Paper
Add Code

I$^2$-SDF: Intrinsic Indoor Scene Reconstruction and Editing via Raytracing in Neural SDFs

no code implementations • 14 Mar 2023 • Jingsen Zhu, Yuchi Huo, Qi Ye, Fujun Luan, Jifan Li, Dianbing Xi, Lisha Wang, Rui Tang, Wei Hua, Hujun Bao, Rui Wang

In this work, we present I$^2$-SDF, a new method for intrinsic indoor scene reconstruction and editing using differentiable Monte Carlo raytracing on neural signed distance fields (SDFs).

Indoor Scene Reconstruction Novel View Synthesis

Paper
Add Code

Super-Resolution Information Enhancement For Crowd Counting

1 code implementation • 13 Mar 2023 • Jiahao Xie, Wei Xu, Dingkang Liang, Zhanyu Ma, Kongming Liang, Weidong Liu, Rui Wang, Ling Jin

As the proposed method requires SR labels, we further propose a Super-Resolution Crowd Counting dataset (SR-Crowd).

Crowd Counting Super-Resolution

Paper
Code

Toward Fairness in Text Generation via Mutual Information Minimization based on Importance Sampling

no code implementations • 25 Feb 2023 • Rui Wang, Pengyu Cheng, Ricardo Henao

To improve the fairness of PLMs in text generation, we propose to minimize the mutual information between the semantics in the generated text sentences and their demographic polarity, i. e., the demographic group to which the sentence is referring.

Fairness Language Modelling +2

Paper
Add Code

mmAlert: mmWave Link Blockage Prediction via Passive Sensing

no code implementations • 22 Feb 2023 • Chao Yu, Yifei Sun, Yan Luo, Rui Wang

It is demonstrated via experiments that the mmAlert system can always detect the motions of the walking person close to the LoS path, and predict 90\% of the LoS blockage with sensing time of 1. 4 seconds.

Paper
Add Code

Unique Identification of 50,000+ Virtual Reality Users from Head & Hand Motion Data

1 code implementation • 17 Feb 2023 • Vivek Nair, Wenbo Guo, Justus Mattern, Rui Wang, James F. O'Brien, Louis Rosenberg, Dawn Song

With the recent explosive growth of interest and investment in virtual reality (VR) and the so-called "metaverse," public attention has rightly shifted toward the unique security and privacy threats that these platforms may pose.

Paper
Code

A Study on ReLU and Softmax in Transformer

no code implementations • 13 Feb 2023 • Kai Shen, Junliang Guo, Xu Tan, Siliang Tang, Rui Wang, Jiang Bian

This paper sheds light on the following points: 1) Softmax and ReLU use different normalization methods over elements which lead to different variances of results, and ReLU is good at dealing with a large number of key-value slots; 2) FFN and key-value memory are equivalent, and thus the Transformer can be viewed as a memory network where FFNs and self-attention networks are both key-value memories.

Document Translation

Paper
Add Code

N-Gram Nearest Neighbor Machine Translation

no code implementations • 30 Jan 2023 • Rui Lv, Junliang Guo, Rui Wang, Xu Tan, Qi Liu, Tao Qin

Nearest neighbor machine translation augments the Autoregressive Translation~(AT) with $k$-nearest-neighbor retrieval, by comparing the similarity between the token-level context representations of the target tokens in the query and the datastore.

Domain Adaptation Machine Translation +2

Paper
Add Code

Universal Multimodal Representation for Language Understanding

no code implementations • 9 Jan 2023 • Zhuosheng Zhang, Kehai Chen, Rui Wang, Masao Utiyama, Eiichiro Sumita, Zuchao Li, Hai Zhao

Representation learning is the foundation of natural language processing (NLP).

Machine Translation Natural Language Inference +6

Paper
Add Code

PACO: Parts and Attributes of Common Objects

1 code implementation • CVPR 2023 • Vignesh Ramanathan, Anmol Kalia, Vladan Petrovic, Yi Wen, Baixue Zheng, Baishan Guo, Rui Wang, Aaron Marquez, Rama Kovvuri, Abhishek Kadian, Amir Mousavi, Yiwen Song, Abhimanyu Dubey, Dhruv Mahajan

This motivates the need for large datasets which go beyond traditional object masks and provide richer annotations such as part masks and attributes.

2D Object Detection Attribute +1

257

Paper
Code

A Theory of Human-Like Few-Shot Learning

no code implementations • 3 Jan 2023 • Zhiying Jiang, Rui Wang, Dongbo Bu, Ming Li

We aim to bridge the gap between our common-sense few-sample human learning and large-data machine learning.

Common Sense Reasoning Few-Shot Learning

Paper
Add Code

I2-SDF: Intrinsic Indoor Scene Reconstruction and Editing via Raytracing in Neural SDFs

no code implementations • CVPR 2023 • Jingsen Zhu, Yuchi Huo, Qi Ye, Fujun Luan, Jifan Li, Dianbing Xi, Lisha Wang, Rui Tang, Wei Hua, Hujun Bao, Rui Wang

Further, we propose to decompose the neural radiance field into spatially-varying material of the scene as a neural field through surface-based, differentiable Monte Carlo raytracing and emitter semantic segmentations, which enables physically based and photorealistic scene relighting and editing applications.

Indoor Scene Reconstruction Novel View Synthesis

Paper
Add Code

4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions

no code implementations • 31 Dec 2022 • Patrick Wenzel, Nan Yang, Rui Wang, Niclas Zeller, Daniel Cremers

In this paper, we present a novel visual SLAM and long-term localization benchmark for autonomous driving in challenging conditions based on the large-scale 4Seasons dataset.

Autonomous Driving Benchmarking +2

Paper
Add Code

Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning

no code implementations • CVPR 2023 • Jishnu Mukhoti, Tsung-Yu Lin, Omid Poursaeed, Rui Wang, Ashish Shah, Philip H. S. Torr, Ser-Nam Lim

We introduce Patch Aligned Contrastive Learning (PACL), a modified compatibility function for CLIP's contrastive loss, intending to train an alignment between the patch tokens of the vision encoder and the CLS token of the text encoder.

Ranked #1 on Open Vocabulary Semantic Segmentation on Cityscape-171

Contrastive Learning Image Classification +5

Paper
Add Code

Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning

4 code implementations • CVPR 2023 • Rui Wang, Dongdong Chen, Zuxuan Wu, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Lu Yuan, Yu-Gang Jiang

For the choice of teacher models, we observe that students taught by video teachers perform better on temporally-heavy video tasks, while image teachers transfer stronger spatial representations for spatially-heavy video tasks.

Ranked #1 on Self-Supervised Action Recognition on HMDB51

Action Classification Representation Learning +1

Paper
Code

SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition

1 code implementation • 2 Dec 2022 • Yichong Leng, Xu Tan, Wenjie Liu, Kaitao Song, Rui Wang, Xiang-Yang Li, Tao Qin, Edward Lin, Tie-Yan Liu

In this paper, we propose SoftCorrect with a soft error detection mechanism to avoid the limitations of both explicit and implicit error detection.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

1,303

Paper
Code

Coevolutionary Framework for Generalized Multimodal Multi-objective Optimization

1 code implementation • 2 Dec 2022 • Wenhua Li, Xingyi Yao, Kaiwen Li, Rui Wang, Tao Zhang, Ling Wang

To address the above two issues, in this study, a novel coevolutionary framework termed CoMMEA for multimodal multi-objective optimization is proposed to better obtain both global and local PSs, and simultaneously, to improve the convergence performance in dealing with high-dimension MMOPs.

Evolutionary Algorithms Transfer Learning

Paper
Code

CASSPR: Cross Attention Single Scan Place Recognition

1 code implementation • ICCV 2023 • Yan Xia, Mariia Gladkova, Rui Wang, Qianyun Li, Uwe Stilla, João F. Henriques, Daniel Cremers

CASSPR uses queries from one branch to try to match structures in the other branch, ensuring that both extract self-contained descriptors of the point cloud (rather than one branch dominating), but using both to inform the output global descriptor of the point cloud.

Paper
Code

Unifying Tracking and Image-Video Object Detection

no code implementations • 20 Nov 2022 • Peirong Liu, Rui Wang, Pengchuan Zhang, Omid Poursaeed, Yipin Zhou, Xuefei Cao, Sreya Dutta Roy, Ashish Shah, Ser-Nam Lim

We propose TrIVD (Tracking and Image-Video Detection), the first framework that unifies image OD, video OD, and MOT within one end-to-end model.

Multi-Object Tracking Object +2

Paper
Add Code

LidarGait: Benchmarking 3D Gait Recognition with Point Clouds

1 code implementation • CVPR 2023 • Chuanfu Shen, Chao Fan, Wei Wu, Rui Wang, George Q. Huang, Shiqi Yu

Video-based gait recognition has achieved impressive results in constrained scenarios.

Benchmarking Gait Recognition in the Wild

641

Paper
Code

Learning-based Inverse Rendering of Complex Indoor Scenes with Differentiable Monte Carlo Raytracing

no code implementations • 6 Nov 2022 • Jingsen Zhu, Fujun Luan, Yuchi Huo, Zihao Lin, Zhihua Zhong, Dianbing Xi, Jiaxiang Zheng, Rui Tang, Hujun Bao, Rui Wang

Indoor scenes typically exhibit complex, spatially-varying appearance from global illumination, making inverse rendering a challenging ill-posed problem.

Inverse Rendering

Paper
Add Code

Dial2vec: Self-Guided Contrastive Learning of Unsupervised Dialogue Embeddings

1 code implementation • 27 Oct 2022 • Che Liu, Rui Wang, Junfeng Jiang, Yongbin Li, Fei Huang

In this paper, we introduce the task of learning unsupervised dialogue embeddings.

Contrastive Learning Retrieval +2

1,009

Paper
Code

Museformer: Transformer with Fine- and Coarse-Grained Attention for Music Generation

1 code implementation • 19 Oct 2022 • Botao Yu, Peiling Lu, Rui Wang, Wei Hu, Xu Tan, Wei Ye, Shikun Zhang, Tao Qin, Tie-Yan Liu

A recent trend is to use Transformer or its variants in music generation, which is, however, suboptimal, because the full attention cannot efficiently model the typically long music sequences (e. g., over 10, 000 tokens), and the existing models have shortcomings in generating musical repetition structures.

Music Generation

4,234

Paper
Code

Emerging dominant SARS-CoV-2 variants

no code implementations • 18 Oct 2022 • Jiahui Chen, Rui Wang, Yuta Hozumi, Gengzhuo Liu, Yuchi Qiu, Xiaoqi Wei, Guo-Wei Wei

Accurate and reliable forecasting of emerging dominant severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) variants enables policymakers and vaccine makers to get prepared for future waves of infections.

Paper
Add Code

Large-Scale Bandwidth and Power Optimization for Multi-Modal Edge Intelligence Autonomous Driving

no code implementations • 18 Oct 2022 • Xinrao Li, Tong Zhang, Shuai Wang, Guangxu Zhu, Rui Wang, Tsung-Hui Chang

However, wireless channels between the edge server and the autonomous vehicles are time-varying due to the high-mobility of vehicles.

Autonomous Driving

Paper
Add Code

Tencent AI Lab - Shanghai Jiao Tong University Low-Resource Translation System for the WMT22 Translation Task

1 code implementation • 17 Oct 2022 • Zhiwei He, Xing Wang, Zhaopeng Tu, Shuming Shi, Rui Wang

Finally, our unconstrained system achieves BLEU scores of 17. 0 and 30. 4 for English to/from Livonian.

Data Augmentation Translation

Paper
Code

Koopman Neural Forecaster for Time Series with Temporal Distribution Shifts

1 code implementation • 7 Oct 2022 • Rui Wang, Yihe Dong, Sercan Ö. Arik, Rose Yu

Temporal distributional shifts, with underlying dynamics changing over time, frequently occur in real-world time series and pose a fundamental challenge for deep neural networks (DNNs).

Time Series Time Series Forecasting

32,981

Paper
Code

Strong Transferable Adversarial Attacks via Ensembled Asymptotically Normal Distribution Learning

1 code implementation • 24 Sep 2022 • Zhengwei Fang, Rui Wang, Tao Huang, Liping Jing

Strong adversarial examples are crucial for evaluating and enhancing the robustness of deep neural networks.

Adversarial Attack

Paper
Code

Fengshenbang 1.0: Being the Foundation of Chinese Cognitive Intelligence

1 code implementation • 7 Sep 2022 • Jiaxing Zhang, Ruyi Gan, Junjie Wang, Yuxiang Zhang, Lin Zhang, Ping Yang, Xinyu Gao, Ziwei Wu, Xiaoqun Dong, Junqing He, Jianheng Zhuo, Qi Yang, Yongfeng Huang, Xiayu Li, Yanghan Wu, Junyu Lu, Xinyu Zhu, Weifeng Chen, Ting Han, Kunhao Pan, Rui Wang, Hao Wang, XiaoJun Wu, Zhongshen Zeng, Chongpei Chen

We hope that this project will be the foundation of Chinese cognitive intelligence.

3,912

Paper
Code

TOSE: A Fast Capacity Estimation Algorithm Based on Spike Approximations

no code implementations • 2 Sep 2022 • Dandan Jiang, Han Hao, Lu Yang, Rui Wang

Instead, fast eigenvalue estimations can be realized based on the spike approximations in our TOSE algorithm.

Capacity Estimation

Paper
Add Code

Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling

no code implementations • 25 Aug 2022 • Rui Wang, Zuxuan Wu, Dongdong Chen, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Luowei Zhou, Lu Yuan, Yu-Gang Jiang

To avoid significant computational cost incurred by computing self-attention between the large number of local patches in videos, we propose to use very few global tokens (e. g., 6) for a whole video in Transformers to exchange information with 3D-CNNs with a cross-attention mechanism.

Video Recognition

Paper
Add Code

A novel method for data augmentation: Nine Dot Moving Least Square (ND-MLS)

no code implementations • 24 Aug 2022 • Wen Yang, Rui Wang, Yanchao Zhang

However, the ND-MLS method has stable performance and obtains 96. 5 top-1 acc in Res-Net on 100 different handwritten character classification tasks; 2) in segmentation, under the premise of only ten original images, DeepLab obtains 93. 5%, 85%, and 73. 3% m_IOU(10) on the bottle, horse, and grass test datasets, respectively, while the cat test dataset obtains 86. 7% m_IOU(10) with the SegNet model; 3) with only 10 original images from each category in object detection, YOLO v4 obtains 100% and 97. 2% bottle and horse detection, respectively, while the cat dataset obtains 93. 6% with YOLO v3.

Classification Data Augmentation +3

Paper
Add Code

MUDGUARD: Taming Malicious Majorities in Federated Learning using Privacy-Preserving Byzantine-Robust Clustering

no code implementations • 22 Aug 2022 • Rui Wang, Xingkai Wang, Huanhuan Chen, Jérémie Decouchant, Stjepan Picek, Nikolaos Laoutaris, Kaitai Liang

It is therefore currently impossible to ensure Byzantine robustness and confidentiality of updates without assuming a semi-honest majority.

Clustering Federated Learning +1

Paper
Add Code

Large-scale matrix optimization based multi microgrid topology design with a constrained differential evolution algorithm

no code implementations • 18 Jul 2022 • Wenhua Li, Shengjun Huang, Tao Zhang, Rui Wang, Ling Wang

Binary matrix optimization commonly arise in the real world, e. g., multi-microgrid network structure design problem (MGNSDP), which is to minimize the total length of the power supply line under certain constraints.

Paper
Add Code

Next-ViT: Next Generation Vision Transformer for Efficient Deployment in Realistic Industrial Scenarios

4 code implementations • 12 Jul 2022 • Jiashi Li, Xin Xia, Wei Li, Huixia Li, Xing Wang, Xuefeng Xiao, Rui Wang, Min Zheng, Xin Pan

Then, Next Hybrid Strategy (NHS) is designed to stack NCB and NTB in an efficient hybrid paradigm, which boosts performance in various downstream tasks.

Ranked #281 on Image Classification on ImageNet

Image Classification

29,974

Paper
Code

Multimodal Multi-objective Optimization: Comparative Study of the State-of-the-Art

1 code implementation • 11 Jul 2022 • Wenhua Li, Tao Zhang, Rui Wang, Jing Liang

Multimodal multi-objective problems (MMOPs) commonly arise in real-world problems where distant solutions in decision space correspond to very similar objective values.

Evolutionary Algorithms

Paper
Code

A Study of Syntactic Multi-Modality in Non-Autoregressive Machine Translation

no code implementations • NAACL 2022 • Kexun Zhang, Rui Wang, Xu Tan, Junliang Guo, Yi Ren, Tao Qin, Tie-Yan Liu

Furthermore, we take the best of both and design a new loss function to better handle the complicated syntactic multi-modality in real-world datasets.

Machine Translation Translation

Paper
Add Code

Deep Contrastive One-Class Time Series Anomaly Detection

1 code implementation • 4 Jul 2022 • Rui Wang, Chongwei Liu, Xudong Mou, Kai Gao, Xiaohui Guo, Pin Liu, Tianyu Wo, Xudong Liu

To overcome the shortcomings, a deep Contrastive One-Class Anomaly detection method of time series (COCA) is proposed by authors, following the normality assumptions of CL and one-class classification.

Contrastive Learning One-Class Classification +2

Paper
Code

Effect of Homomorphic Encryption on the Performance of Training Federated Learning Generative Adversarial Networks

no code implementations • 1 Jul 2022 • Ignjat Pejic, Rui Wang, Kaitai Liang

In this ML technique, only parameters and certain metadata would be communicated.

Federated Learning Generative Adversarial Network +1

Paper
Add Code

RAW-GNN: RAndom Walk Aggregation based Graph Neural Network

no code implementations • 28 Jun 2022 • Di Jin, Rui Wang, Meng Ge, Dongxiao He, Xiang Li, Wei Lin, Weixiong Zhang

Due to the homophily assumption of Graph Convolutional Networks (GCNs) that these methods use, they are not suitable for heterophily graphs where nodes with different labels or dissimilar attributes tend to be adjacent.

Representation Learning

Paper
Add Code

MULTI-FLGANs: Multi-Distributed Adversarial Networks for Non-IID distribution

no code implementations • 24 Jun 2022 • Akash Amalan, Rui Wang, Yanqi Qiao, Emmanouil Panaousis, Kaitai Liang

Federated learning is an emerging concept in the domain of distributed machine learning.

Federated Learning

Paper
Add Code

Using Autoencoders on Differentially Private Federated Learning GANs

1 code implementation • 24 Jun 2022 • Gregor Schram, Rui Wang, Kaitai Liang

In order to maintain user privacy, a combination of federated learning, differential privacy and GANs can be used to work with private data without giving away a users' privacy.

Avg Denoising +1

Paper
Code

FLVoogd: Robust And Privacy Preserving Federated Learning

no code implementations • 24 Jun 2022 • Yuhang Tian, Rui Wang, Yanqi Qiao, Emmanouil Panaousis, Kaitai Liang

In this work, we propose FLVoogd, an updated federated learning method in which servers and clients collaboratively eliminate Byzantine attacks while preserving privacy.

Federated Learning Image Classification +1

Paper
Add Code

Parallel Pre-trained Transformers (PPT) for Synthetic Data-based Instance Segmentation

no code implementations • 22 Jun 2022 • Ming Li, Jie Wu, Jinhang Cai, Jie Qin, Yuxi Ren, Xuefeng Xiao, Min Zheng, Rui Wang, Xin Pan

Recently, Synthetic data-based Instance Segmentation has become an exceedingly favorable optimization paradigm since it leverages simulation rendering and physics to generate high-quality image-annotation pairs.

Instance Segmentation Segmentation +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.