Search Results for author: Rui Wang

Found 449 papers, 145 papers with code

Few-Shot Class-Incremental Learning for Named Entity Recognition

no code implementations ACL 2022 Rui Wang, Tong Yu, Handong Zhao, Sungchul Kim, Subrata Mitra, Ruiyi Zhang, Ricardo Henao

In this work, we study a more challenging but practical problem, i. e., few-shot class-incremental learning for NER, where an NER model is trained with only few labeled samples of the new classes, without forgetting knowledge of the old ones.

Few-Shot Class-Incremental Learning Incremental Learning +3

Synchronous Refinement for Neural Machine Translation

no code implementations Findings (ACL) 2022 Kehai Chen, Masao Utiyama, Eiichiro Sumita, Rui Wang, Min Zhang

Machine translation typically adopts an encoder-to-decoder framework, in which the decoder generates the target sentence word-by-word in an auto-regressive manner.

Machine Translation Sentence +1

Unsupervised Paraphrasing Consistency Training for Low Resource Named Entity Recognition

no code implementations EMNLP 2021 Rui Wang, Ricardo Henao

Unsupervised consistency training is a way of semi-supervised learning that encourages consistency in model predictions between the original and augmented data.

Data Augmentation Low Resource Named Entity Recognition +4

Estimating Q(s,s') with Deterministic Dynamics Gradients

no code implementations ICML 2020 Ashley Edwards, Himanshu Sahni, Rosanne Liu, Jane Hung, Ankit Jain, Rui Wang, Adrien Ecoffet, Thomas Miconi, Charles Isbell, Jason Yosinski

In this paper, we introduce a novel form of a value function, $Q(s, s')$, that expresses the utility of transitioning from a state $s$ to a neighboring state $s'$ and then acting optimally thereafter.

Transfer Learning

Syntax in End-to-End Natural Language Processing

no code implementations EMNLP (ACL) 2021 Hai Zhao, Rui Wang, Kehai Chen

This tutorial surveys the latest technical progress of syntactic parsing and the role of syntax in end-to-end natural language processing (NLP) tasks, in which semantic role labeling (SRL) and machine translation (MT) are the representative NLP tasks that have always been beneficial from informative syntactic clues since a long time ago, though the advance from end-to-end deep learning models shows new results.

Machine Translation NMT +2

Chinese Grammatical Error Diagnosis with Graph Convolution Network and Multi-task Learning

no code implementations AACL (NLP-TEA) 2020 Yikang Luo, Zuyi Bao, Chen Li, Rui Wang

For the correction subtask, we utilize the masked language model, the seq2seq model and the spelling check model to generate corrections based on the detection results.

Language Modelling Multi-Task Learning +1

Stacked AMR Parsing with Silver Data

1 code implementation Findings (EMNLP) 2021 Qingrong Xia, Zhenghua Li, Rui Wang, Min Zhang

In particular, one recent seq-to-seq work directly fine-tunes AMR graph sequences on the encoder-decoder pre-trained language model and achieves new state-of-the-art results, outperforming previous works by a large margin.

AMR Parsing Language Modelling

StableGarment: Garment-Centric Generation via Stable Diffusion

no code implementations16 Mar 2024 Rui Wang, Hailong Guo, Jiaming Liu, Huaxia Li, Haibo Zhao, Xu Tang, Yao Hu, Hao Tang, Peipei Li

In this paper, we introduce StableGarment, a unified framework to tackle garment-centric(GC) generation tasks, including GC text-to-image, controllable GC text-to-image, stylized GC text-to-image, and robust virtual try-on.

ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment

1 code implementation8 Mar 2024 XiWei Hu, Rui Wang, Yixiao Fang, Bin Fu, Pei Cheng, Gang Yu

Diffusion models have demonstrated remarkable performance in the domain of text-to-image generation.

Denoising Language Modelling +2

RIS-empowered Topology Control for Distributed Learning in Urban Air Mobility

no code implementations8 Mar 2024 Kai Xiong, Rui Wang, Supeng Leng, Wenyang Che, Chongwen Huang, Chau Yuen

Urban Air Mobility (UAM) expands vehicles from the ground to the near-ground space, envisioned as a revolution for transportation systems.

Federated Learning MULTI-VIEW LEARNING

Hypothesis Spaces for Deep Learning

no code implementations5 Mar 2024 Rui Wang, Yuesheng Xu, Mingsong Yan

The representer theorems unfold that solutions of these learning models can be expressed as linear combination of a finite number of kernel sessions determined by given data and the reproducing kernel.

Drug resistance revealed by in silico deep mutational scanning and mutation tracker

no code implementations5 Mar 2024 Dong Chen, Gengzhuo Liu, Hongyan Du, JunJie Wee, Rui Wang, Jiahui Chen, Jana Shen, Guo-Wei Wei

As COVID-19 enters its fifth year, it continues to pose a significant global health threat, with the constantly mutating SARS-CoV-2 virus challenging drug effectiveness.

Drug Discovery

F$^3$Loc: Fusion and Filtering for Floorplan Localization

no code implementations5 Mar 2024 Changan Chen, Rui Wang, Christoph Vogel, Marc Pollefeys

In this paper we propose an efficient data-driven solution to self-localization within a floorplan.

Role Prompting Guided Domain Adaptation with General Capability Preserve for Large Language Models

no code implementations5 Mar 2024 Rui Wang, Fei Mi, Yi Chen, Boyang Xue, Hongru Wang, Qi Zhu, Kam-Fai Wong, Ruifeng Xu

2) Role Prompting assigns a central prompt to the general domain and a unique role prompt to each specific domain to minimize inter-domain confusion during training.

Domain Adaptation

Logit Standardization in Knowledge Distillation

1 code implementation3 Mar 2024 Shangquan Sun, Wenqi Ren, Jingzhi Li, Rui Wang, Xiaochun Cao

Knowledge distillation involves transferring soft labels from a teacher to a student using a shared temperature-based softmax function.

Knowledge Distillation

Mitigating Reversal Curse via Semantic-aware Permutation Training

no code implementations1 Mar 2024 Qingyan Guo, Rui Wang, Junliang Guo, Xu Tan, Jiang Bian, Yujiu Yang

Accordingly, permutation on the training data is considered as a potential solution, since this can make the model predict antecedent words or tokens.

Language Modelling

Beyond Language Models: Byte Models are Digital World Simulators

no code implementations29 Feb 2024 Shangda Wu, Xu Tan, Zili Wang, Rui Wang, Xiaobing Li, Maosong Sun

Traditional deep learning often overlooks bytes, the basic units of the digital world, where all forms of information and operations are encoded and manipulated in binary format.

Improving Open-Ended Text Generation via Adaptive Decoding

1 code implementation28 Feb 2024 Wenhong Zhu, Hongkun Hao, Zhiwei He, Yiming Ai, Rui Wang

Current language models decode text token by token according to probabilistic distribution, and determining the appropriate candidates for the next token is crucial to ensure generation quality.

Story Generation

UniRetriever: Multi-task Candidates Selection for Various Context-Adaptive Conversational Retrieval

no code implementations26 Feb 2024 Hongru Wang, Boyang Xue, Baohang Zhou, Rui Wang, Fei Mi, Weichao Wang, Yasheng Wang, Kam-Fai Wong

Conversational retrieval refers to an information retrieval system that operates in an iterative and interactive manner, requiring the retrieval of various external resources, such as persona, knowledge, and even response, to effectively engage with the user and successfully complete the dialogue.

Information Retrieval Retrieval

Low-Frequency Black-Box Backdoor Attack via Evolutionary Algorithm

no code implementations23 Feb 2024 Yanqi Qiao, Dazhuang Liu, Rui Wang, Kaitai Liang

Extensive experiments on real-world datasets verify the effectiveness and robustness of LFBA against image processing operations and the state-of-the-art backdoor defenses, as well as its inherent stealthiness in both spatial and frequency space, making it resilient against frequency inspection.

Backdoor Attack

Is Cognition and Action Consistent or Not: Investigating Large Language Model's Personality

no code implementations22 Feb 2024 Yiming Ai, Zhiwei He, Ziyin Zhang, Wenhong Zhu, Hongkun Hao, Kai Yu, Lingjun Chen, Rui Wang

In this study, we investigate the reliability of Large Language Models (LLMs) in professing human-like personality traits through responses to personality questionnaires.

A Comprehensive Study of Multilingual Confidence Estimation on Large Language Models

no code implementations21 Feb 2024 Boyang Xue, Hongru Wang, Weichao Wang, Rui Wang, Sheng Wang, Zeming Liu, Kam-Fai Wong

The tendency of Large Language Models to generate hallucinations and exhibit overconfidence in predictions raises concerns regarding their reliability.

Can Watermarks Survive Translation? On the Cross-lingual Consistency of Text Watermark for Large Language Models

no code implementations21 Feb 2024 Zhiwei He, Binglin Zhou, Hongkun Hao, Aiwei Liu, Xing Wang, Zhaopeng Tu, Zhuosheng Zhang, Rui Wang

Furthermore, we analyze two key factors that contribute to the cross-lingual consistency in text watermarking and propose a defense method that increases the AUC from 0. 67 to 0. 88 under CWRA.

TAG

Unsupervised Sign Language Translation and Generation

no code implementations12 Feb 2024 Zhengsheng Guo, Zhiwei He, Wenxiang Jiao, Xing Wang, Rui Wang, Kehai Chen, Zhaopeng Tu, Yong Xu, Min Zhang

Motivated by the success of unsupervised neural machine translation (UNMT), we introduce an unsupervised sign language translation and generation network (USLNet), which learns from abundant single-modality (text and video) data without parallel sign language data.

Machine Translation Sign Language Translation +1

Insights into Multiscale Complexity: from Macroscopic Patterns to Microscopic Simulations via Deep Learning

no code implementations7 Feb 2024 Jing Wang, Zheng Li, Pengyu Lai, Rui Wang, Di Yang, Dewu Yang, Hui Xu

Multiscale phenomena manifest across various scientific domains, presenting a ubiquitous challenge in accurately and effectively simulating multiscale dynamics in complex systems.

Partial Identification of Binary Choice Models with Misreported Outcomes

no code implementations30 Jan 2024 Orville Mondal, Rui Wang

In the first approach, the instrument is assumed to only affect the true dependent variable but not misreporting probabilities.

UniMS-RAG: A Unified Multi-source Retrieval-Augmented Generation for Personalized Dialogue Systems

no code implementations24 Jan 2024 Hongru Wang, WenYu Huang, Yang Deng, Rui Wang, Zezhong Wang, YuFei Wang, Fei Mi, Jeff Z. Pan, Kam-Fai Wong

To better plan and incorporate the use of multiple sources in generating personalized response, we firstly decompose it into three sub-tasks: Knowledge Source Selection, Knowledge Retrieval, and Response Generation.

Response Generation Retrieval

Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model

1 code implementation23 Jan 2024 Zhiwei He, Xing Wang, Wenxiang Jiao, Zhuosheng Zhang, Rui Wang, Shuming Shi, Zhaopeng Tu

In this work, we investigate the potential of employing the QE model as the reward model (the QE-based reward model) to predict human preferences for feedback training.

Machine Translation Translation

T2MAC: Targeted and Trusted Multi-Agent Communication through Selective Engagement and Evidence-Driven Integration

no code implementations19 Jan 2024 Chuxiong Sun, Zehua Zang, Jiabao Li, Jiangmeng Li, Xiao Xu, Rui Wang, Changwen Zheng

This process enables agents to collectively use evidence garnered from multiple perspectives, fostering trusted and cooperative behaviors.

SMAC+

R-Judge: Benchmarking Safety Risk Awareness for LLM Agents

1 code implementation18 Jan 2024 Tongxin Yuan, Zhiwei He, Lingzhong Dong, Yiming Wang, Ruijie Zhao, Tian Xia, Lizhen Xu, Binglin Zhou, Fangqi Li, Zhuosheng Zhang, Rui Wang, Gongshen Liu

We introduce R-Judge, a benchmark crafted to evaluate the proficiency of LLMs in judging and identifying safety risks given agent interaction records.

Benchmarking

DiffusionGPT: LLM-Driven Text-to-Image Generation System

no code implementations18 Jan 2024 Jie Qin, Jie Wu, Weifeng Chen, Yuxi Ren, Huixia Li, Hefeng Wu, Xuefeng Xiao, Rui Wang, Shilei Wen

Diffusion models have opened up new avenues for the field of image generation, resulting in the proliferation of high-quality models shared on open-source platforms.

Model Selection Text-to-Image Generation

Passive Beamforming For Practical RIS-Assisted Communication Systems With Non-Ideal Hardware

no code implementations15 Jan 2024 Yiming Liu, Rui Wang, Zhu Han

Reconfigurable intelligent surface (RIS) technology is a promising solution to improve the performance of existing wireless communications.

Toward distortion-aware change detection in realistic scenarios

no code implementations10 Jan 2024 Yitao Zhao, Heng-Chao Li, Nanqing Liu, Rui Wang

The whole framework is composed of Pretext Representation Pre-training, Bitemporal Image Alignment, and Down-stream Decoder Fine-Tuning.

Change Detection

Boosting Large Language Model for Speech Synthesis: An Empirical Study

no code implementations30 Dec 2023 Hongkun Hao, Long Zhou, Shujie Liu, Jinyu Li, Shujie Hu, Rui Wang, Furu Wei

In this paper, we conduct a comprehensive empirical exploration of boosting LLMs with the ability to generate speech, by combining pre-trained LLM LLaMA/OPT and text-to-speech synthesis model VALL-E. We compare three integration methods between LLMs and speech synthesis models, including directly fine-tuned LLMs, superposed layers of LLMs and VALL-E, and coupled LLMs and VALL-E using LLMs as a powerful text encoder.

Language Modelling Large Language Model +2

Identification of Dynamic Nonlinear Panel Models under Partial Stationarity

no code implementations30 Dec 2023 Wayne Yuan Gao, Rui Wang

This paper studies identification for a wide range of nonlinear panel data models, including binary choice, ordered repsonse, and other types of limited dependent variable models.

Exploring 3D-aware Lifespan Face Aging via Disentangled Shape-Texture Representations

no code implementations28 Dec 2023 Qianrui Teng, Rui Wang, Xing Cui, Peipei Li, Zhaofeng He

Existing face aging methods often focus on modeling either texture aging or using an entangled shape-texture representation to achieve face aging.

3D Face Reconstruction Texture Synthesis

SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation

no code implementations26 Dec 2023 Yuxuan Zhang, Yiren Song, Jiaming Liu, Rui Wang, Jinpeng Yu, Hao Tang, Huaxia Li, Xu Tang, Yao Hu, Han Pan, Zhongliang Jing

Recent advancements in subject-driven image generation have led to zero-shot generation, yet precise selection and focus on crucial subject representations remain challenging.

Image Generation

Near-Field Localization and Phase Shift Optimization for RIS-Assisted Non-Ideal OFDM Systems

no code implementations19 Dec 2023 Hanfu Zhang, Erwu Liu, Rui Wang, Zhe Xing, Yan Liu

By incorporating reconfigurable intelligent surface (RIS) into communication-assisted localization systems, the issue of signal blockage caused by obstacles can be addressed, and passive beamforming can be employed to enhance localization accuracy.

Rethinking Dimensional Rationale in Graph Contrastive Learning from Causal Perspective

1 code implementation16 Dec 2023 Qirui Ji, Jiangmeng Li, Jie Hu, Rui Wang, Changwen Zheng, Fanjiang Xu

To this end, with the purpose of exploring the intrinsic rationale of graphs, we accordingly propose to capture the dimensional rationale from graphs, which has not received sufficient attention in the literature.

Contrastive Learning Meta-Learning

Transferring CLIP's Knowledge into Zero-Shot Point Cloud Semantic Segmentation

no code implementations12 Dec 2023 Yuanbin Wang, Shaofei Huang, Yulu Gao, Zhen Wang, Rui Wang, Kehua Sheng, Bo Zhang, Si Liu

In this work, we focus on zero-shot point cloud semantic segmentation and propose a simple yet effective baseline to transfer the visual-linguistic knowledge implied in CLIP to point cloud encoder at both feature and output levels.

3D Semantic Segmentation Point Cloud Segmentation +2

Vision-language Assisted Attribute Learning

no code implementations12 Dec 2023 Kongming Liang, Xinran Wang, Rui Wang, Donghui Gao, Ling Jin, Weidong Liu, Xiatian Zhu, Zhanyu Ma, Jun Guo

Attribute labeling at large scale is typically incomplete and partial, posing significant challenges to model optimization.

Attribute Language Modelling +2

DiffCast: A Unified Framework via Residual Diffusion for Precipitation Nowcasting

no code implementations11 Dec 2023 Demin Yu, Xutao Li, Yunming Ye, Baoquan Zhang, Chuyao Luo, Kuai Dai, Rui Wang, Xunlai Chen

A unified and flexible framework that can equip any type of spatio-temporal models is proposed based on residual diffusion, which effectively tackles the shortcomings of previous methods.

EgoPlan-Bench: Benchmarking Egocentric Embodied Planning with Multimodal Large Language Models

1 code implementation11 Dec 2023 Yi Chen, Yuying Ge, Yixiao Ge, Mingyu Ding, Bohao Li, Rui Wang, Ruifeng Xu, Ying Shan, Xihui Liu

Multimodal Large Language Models (MLLMs), building upon the powerful Large Language Models (LLMs) with exceptional reasoning and generalization capability, have opened up new avenues for embodied task planning.

Benchmarking Human-Object Interaction Detection

Unsupervised Social Event Detection via Hybrid Graph Contrastive Learning and Reinforced Incremental Clustering

1 code implementation8 Dec 2023 Yuanyuan Guo, Zehua Zang, Hang Gao, Xiao Xu, Rui Wang, Lixiang Liu, Jiangmeng Li

To this end, recent works explore learning discriminative information from social messages by leveraging graph contrastive learning (GCL) and embedding clustering in an unsupervised manner.

Clustering Contrastive Learning +1

Multi-scale Residual Transformer for VLF Lightning Transients Classification

no code implementations7 Dec 2023 Jinghao Sun, Tingting Ji, Guoyu Wang, Rui Wang

The utilization of Very Low Frequency (VLF) electromagnetic signals in navigation systems is widespread.

Classification

FaceStudio: Put Your Face Everywhere in Seconds

no code implementations5 Dec 2023 Yuxuan Yan, Chi Zhang, Rui Wang, Yichao Zhou, Gege Zhang, Pei Cheng, Gang Yu, Bin Fu

This study investigates identity-preserving image synthesis, an intriguing task in image generation that seeks to maintain a subject's identity while adding a personalized, stylistic touch.

Image Generation

VIoTGPT: Learning to Schedule Vision Tools towards Intelligent Video Internet of Things

no code implementations1 Dec 2023 Yaoyao Zhong, Mengshi Qi, Rui Wang, Yuhan Qiu, Yang Zhang, Huadong Ma

Video Internet of Things (VIoT) has shown full potential in collecting an unprecedented volume of video data.

Riemannian Self-Attention Mechanism for SPD Networks

no code implementations28 Nov 2023 Rui Wang, Xiao-Jun Wu, Hui Li, Josef Kittler

Symmetric positive definite (SPD) matrix has been demonstrated to be an effective feature descriptor in many scientific areas, as it can encode spatiotemporal statistics of the data adequately on a curved Riemannian manifold, i. e., SPD manifold.

Benchmarking Riemannian optimization

SEED-Bench-2: Benchmarking Multimodal Large Language Models

1 code implementation28 Nov 2023 Bohao Li, Yuying Ge, Yixiao Ge, Guangzhi Wang, Rui Wang, Ruimao Zhang, Ying Shan

Multimodal large language models (MLLMs), building upon the foundation of powerful large language models (LLMs), have recently demonstrated exceptional capabilities in generating not only texts but also images given interleaved multimodal inputs (acting like a combination of GPT-4V and DALL-E 3).

Benchmarking Image Generation +1

Hessian Aware Low-Rank Weight Perturbation for Continual Learning

1 code implementation26 Nov 2023 Jiaqi Li, Rui Wang, Yuanhao Lai, Changjian Shui, Sabyasachi Sahoo, Charles X. Ling, Shichun Yang, Boyu Wang, Christian Gagné, Fan Zhou

We conduct extensive experiments on various benchmarks, including a dataset with large-scale tasks, and compare our method against some recent state-of-the-art methods to demonstrate the effectiveness and scalability of our proposed method.

Continual Learning

Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents

1 code implementation20 Nov 2023 Zhuosheng Zhang, Yao Yao, Aston Zhang, Xiangru Tang, Xinbei Ma, Zhiwei He, Yiming Wang, Mark Gerstein, Rui Wang, Gongshen Liu, Hai Zhao

Large language models (LLMs) have dramatically enhanced the field of language intelligence, as demonstrably evidenced by their formidable empirical performance across a spectrum of complex reasoning tasks.

MELA: Multilingual Evaluation of Linguistic Acceptability

no code implementations15 Nov 2023 Ziyin Zhang, Yikang Liu, Weifang Huang, Junyu Mao, Rui Wang, Hai Hu

Recent benchmarks for Large Language Models (LLMs) have mostly focused on application-driven tasks such as complex reasoning and code generation, and this has led to a scarcity in purely linguistic evaluation of LLMs.

Code Generation Cross-Lingual Transfer +3

Unifying the Perspectives of NLP and Software Engineering: A Survey on Language Models for Code

1 code implementation14 Nov 2023 Ziyin Zhang, Chaoyu Chen, Bingchang Liu, Cong Liao, Zi Gong, Hang Yu, Jianguo Li, Rui Wang

In this work we systematically review the recent advancements in code processing with language models, covering 50+ models, 30+ evaluation tasks, 170+ datasets, and 700+ related works.

Passive Handwriting Tracking via Weak mmWave Communication Signals

no code implementations3 Nov 2023 Chao Yu, Yan Luo, Renqi Chen, Rui Wang

In this letter, a cooperative sensing framework based on millimeter wave (mmWave) communication systems is proposed to detect tiny motions with a millimeter-level resolution.

Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models

1 code implementation31 Oct 2023 Tian Liang, Zhiwei He, Jen-tse Huang, Wenxuan Wang, Wenxiang Jiao, Rui Wang, Yujiu Yang, Zhaopeng Tu, Shuming Shi, Xing Wang

Ideally, an advanced agent should possess the ability to accurately describe a given word using an aggressive description while concurrently maximizing confusion in the conservative description, enhancing its participation in the game.

Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning

no code implementations23 Oct 2023 Hao Wang, Xiahua Chen, Rui Wang, Chenhui Chu

Extracting meaningful entities belonging to predefined categories from Visually-rich Form-like Documents (VFDs) is a challenging task.

Penalty Decoding: Well Suppress the Self-Reinforcement Effect in Open-Ended Text Generation

1 code implementation23 Oct 2023 Wenhong Zhu, Hongkun Hao, Rui Wang

This paper investigates the self-reinforcement effect in text generation and the effectiveness of a repetition penalty to mitigate it.

Text Generation

MCC-KD: Multi-CoT Consistent Knowledge Distillation

1 code implementation23 Oct 2023 Hongzhan Chen, Siyue Wu, Xiaojun Quan, Rui Wang, Ming Yan, Ji Zhang

Large language models (LLMs) have showcased remarkable capabilities in complex reasoning through chain of thought (CoT) prompting.

Knowledge Distillation Mathematical Reasoning

DocTrack: A Visually-Rich Document Dataset Really Aligned with Human Eye Movement for Machine Reading

1 code implementation23 Oct 2023 Hao Wang, Qingxuan Wang, Yue Li, Changqing Wang, Chenhui Chu, Rui Wang

The use of visually-rich documents (VRDs) in various fields has created a demand for Document AI models that can read and comprehend documents like humans, which requires the overcoming of technical, linguistic, and cognitive barriers.

document understanding Reading Comprehension

Large-Scale and Multi-Perspective Opinion Summarization with Diverse Review Subsets

1 code implementation20 Oct 2023 Han Jiang, Rui Wang, Zhihua Wei, Yu Li, Xinpeng Wang

Furthermore, our in-depth analysis verifies that the advanced selection of review subsets and the two-stage training scheme are vital to boosting the summarization performance.

Opinion Summarization

FuseSR: Super Resolution for Real-time Rendering through Efficient Multi-resolution Fusion

no code implementations15 Oct 2023 Zhihua Zhong, Jingsen Zhu, Yuxin Dai, Chuankun Zheng, Yuchi Huo, Guanlin Chen, Hujun Bao, Rui Wang

To mitigate this problem, one of the most popular solutions is to render images at a low resolution to reduce rendering overhead, and then manage to accurately upsample the low-resolution rendered image to the target resolution, a. k. a.

Super-Resolution

Large Language Models as Source Planner for Personalized Knowledge-grounded Dialogue

no code implementations13 Oct 2023 Hongru Wang, Minda Hu, Yang Deng, Rui Wang, Fei Mi, Weichao Wang, Yasheng Wang, Wai-Chung Kwan, Irwin King, Kam-Fai Wong

Open-domain dialogue system usually requires different sources of knowledge to generate more informative and evidential responses.

Response Generation

Improving Factual Consistency for Knowledge-Grounded Dialogue Systems via Knowledge Enhancement and Alignment

1 code implementation12 Oct 2023 Boyang Xue, Weichao Wang, Hongru Wang, Fei Mi, Rui Wang, Yasheng Wang, Lifeng Shang, Xin Jiang, Qun Liu, Kam-Fai Wong

Inspired by previous work which identified that feed-forward networks (FFNs) within Transformers are responsible for factual knowledge expressions, we investigate two methods to efficiently improve the factual expression capability {of FFNs} by knowledge enhancement and alignment respectively.

Leveraging Neural Radiance Fields for Uncertainty-Aware Visual Localization

no code implementations10 Oct 2023 Le Chen, Weirong Chen, Rui Wang, Marc Pollefeys

As a promising fashion for visual localization, scene coordinate regression (SCR) has seen tremendous progress in the past decade.

regression Visual Localization

Discovering Symmetry Breaking in Physical Systems with Relaxed Group Convolution

no code implementations3 Oct 2023 Rui Wang, Elyssa Hofgard, Han Gao, Robin Walters, Tess E. Smidt

Modeling symmetry breaking is essential for understanding the fundamental changes in the behaviors and properties of physical systems, from microscopic particle interactions to macroscopic phenomena like fluid dynamics and cosmic structures.

Super-Resolution

TPE: Towards Better Compositional Reasoning over Conceptual Tools with Multi-persona Collaboration

no code implementations28 Sep 2023 Hongru Wang, Huimin Wang, Lingzhi Wang, Minda Hu, Rui Wang, Boyang Xue, Hongyuan Lu, Fei Mi, Kam-Fai Wong

Large language models (LLMs) have demonstrated exceptional performance in planning the use of various functional tools, such as calculators and retrievers, particularly in question-answering tasks.

Question Answering Response Generation

Learning Point-wise Abstaining Penalty for Point Cloud Anomaly Detection

1 code implementation19 Sep 2023 Shaocong Xu, Pengfei Li, Xinyu Liu, Qianpu Sun, Yang Li, Shihui Guo, Zhen Wang, Bo Jiang, Rui Wang, Kehua Sheng, Bo Zhang, Hao Zhao

We demonstrate that learning different abstaining penalties, apart from point-wise penalty, for different types of (synthesized) outliers can further improve the performance.

Anomaly Detection Autonomous Driving +1

AutoDiffusion: Training-Free Optimization of Time Steps and Architectures for Automated Diffusion Model Acceleration

1 code implementation ICCV 2023 Lijiang Li, Huixia Li, Xiawu Zheng, Jie Wu, Xuefeng Xiao, Rui Wang, Min Zheng, Xin Pan, Fei Chao, Rongrong Ji

Therefore, we propose to search the optimal time steps sequence and compressed model architecture in a unified framework to achieve effective image generation for diffusion models without any further training.

Image Generation single-image-generation

A Benchmark for Text Expansion: Datasets, Metrics, and Baselines

no code implementations17 Sep 2023 Yi Chen, Haiyun Jiang, Wei Bi, Rui Wang, Longyue Wang, Shuming Shi, Ruifeng Xu

This work presents a new task of Text Expansion (TE), which aims to insert fine-grained modifiers into proper locations of the plain text to concretize or vivify human writings.

Informativeness Text Infilling

Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers

1 code implementation15 Sep 2023 Qingyan Guo, Rui Wang, Junliang Guo, Bei Li, Kaitao Song, Xu Tan, Guoqing Liu, Jiang Bian, Yujiu Yang

Large Language Models (LLMs) excel in various tasks, but they rely on carefully crafted prompts that often demand substantial human effort.

Evolutionary Algorithms

What are Public Concerns about ChatGPT? A Novel Self-Supervised Neural Topic Model Tells You

no code implementations4 Sep 2023 Rui Wang, Xing Liu, Yanan Wang, Haiping Huang

The recently released artificial intelligence conversational agent, ChatGPT, has gained significant attention in academia and real life.

Representation Learning

M2HGCL: Multi-Scale Meta-Path Integrated Heterogeneous Graph Contrastive Learning

no code implementations3 Sep 2023 Yuanyuan Guo, Yu Xia, Rui Wang, Rongcheng Duan, Lu Li, Jiangmeng Li

Orthogonal to homogeneous graphs, the types of nodes and edges in heterogeneous graphs are diverse so that specialized graph contrastive learning methods are required.

Contrastive Learning

Exploring the Robustness of Human Parsers Towards Common Corruptions

no code implementations2 Sep 2023 Sanyi Zhang, Xiaochun Cao, Rui Wang, Guo-Jun Qi, Jie zhou

The experimental results show that the proposed method demonstrates good universality which can improve the robustness of the human parsing models and even the semantic segmentation models when facing various image common corruptions.

Data Augmentation Human Parsing +1

DLIP: Distilling Language-Image Pre-training

no code implementations24 Aug 2023 Huafeng Kuang, Jie Wu, Xiawu Zheng, Ming Li, Xuefeng Xiao, Rui Wang, Min Zheng, Rongrong Ji

Furthermore, DLIP succeeds in retaining more than 95% of the performance with 22. 4% parameters and 24. 8% FLOPs compared to the teacher model and accelerates inference speed by 2. 7x.

Image Captioning Knowledge Distillation +5

ChatGPT in Drug Discovery: A Case Study on Anti-Cocaine Addiction Drug Development with Chatbots

no code implementations14 Aug 2023 Rui Wang, Hongsong Feng, Guo-Wei Wei

This paper not only explores the integration of advanced AI in drug discovery but also reimagines the landscape by advocating for AI-powered chatbots as trailblazers in revolutionizing therapeutic innovation.

Chatbot Drug Discovery +1

Face Encryption via Frequency-Restricted Identity-Agnostic Attacks

no code implementations11 Aug 2023 Xin Dong, Rui Wang, Siyuan Liang, Aishan Liu, Lihua Jing

As for the weak black-box scenario feasibility, we obverse that representations of the average feature in multiple face recognition models are similar, thus we propose to utilize the average feature via the crawled dataset from the Internet as the target to guide the generation, which is also agnostic to identities of unknown face recognition systems; in nature, the low-frequency perturbations are more visually perceptible by the human vision system.

Face Recognition

A Quantize-then-Estimate Protocol for CSI Acquisition in IRS-Aided Downlink Communication

no code implementations4 Aug 2023 Rui Wang, Zhaorui Wang, Liang Liu, Shuowen Zhang, Shi Jin

Different from the uplink counterpart where the BS possesses the pilot signals containing the CSI of all the users, in downlink communication, the distributed users merely receive the pilot signals containing their own CSI and cannot leverage the correlation in different users' channels revealed in [1].

Quantization

SEED-Bench: Benchmarking Multimodal LLMs with Generative Comprehension

2 code implementations30 Jul 2023 Bohao Li, Rui Wang, Guangzhi Wang, Yuying Ge, Yixiao Ge, Ying Shan

Based on powerful Large Language Models (LLMs), recent generative Multimodal Large Language Models (MLLMs) have gained prominence as a pivotal research area, exhibiting remarkable capability for both comprehension and generation.

Benchmarking Multiple-choice

Phase Matching for Out-of-Distribution Generalization

no code implementations24 Jul 2023 Chengming Hu, Yeqian Du, Rui Wang, Hao Chen

In this paper, we aim to clarify the relationships between Domain Generalization (DG) and the frequency components, and explore the spatial relationships of the phase spectrum.

Domain Generalization Out-of-Distribution Generalization +1

AlignDet: Aligning Pre-training and Fine-tuning in Object Detection

1 code implementation ICCV 2023 Ming Li, Jie Wu, Xionghui Wang, Chen Chen, Jie Qin, Xuefeng Xiao, Rui Wang, Min Zheng, Xin Pan

To this end, we propose AlignDet, a unified pre-training framework that can be adapted to various existing detectors to alleviate the discrepancies.

object-detection Object Detection

POV-Surgery: A Dataset for Egocentric Hand and Tool Pose Estimation During Surgical Activities

1 code implementation19 Jul 2023 Rui Wang, Sophokles Ktistakis, Siwei Zhang, Mirko Meboldt, Quentin Lohmeyer

The surgical usage of Mixed Reality (MR) has received growing attention in areas such as surgical navigation systems, skill assessment, and robot-assisted surgeries.

3D Hand Pose Estimation hand-object pose +4

Learning to Branch in Combinatorial Optimization with Graph Pointer Networks

no code implementations4 Jul 2023 Rui Wang, Zhiming Zhou, Tao Zhang, Ling Wang, Xin Xu, Xiangke Liao, Kaiwen Li

The proposed model, which combines the graph neural network and the pointer mechanism, can effectively map from the solver state to the branching variable decisions.

Combinatorial Optimization Variable Selection

Meta-Reasoning: Semantics-Symbol Deconstruction for Large Language Models

1 code implementation30 Jun 2023 Yiming Wang, Zhuosheng Zhang, Pei Zhang, Baosong Yang, Rui Wang

Neural-symbolic methods have demonstrated efficiency in enhancing the reasoning abilities of large language models (LLMs).

Domain Generalization In-Context Learning +1

Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation

1 code implementation NeurIPS 2023 Zibo Zhao, Wen Liu, Xin Chen, Xianfang Zeng, Rui Wang, Pei Cheng, Bin Fu, Tao Chen, Gang Yu, Shenghua Gao

We present a novel alignment-before-generation approach to tackle the challenging task of generating general 3D shapes based on 2D images or texts.

3D Shape Generation

Learning-to-Rank Meets Language: Boosting Language-Driven Ordering Alignment for Ordinal Classification

2 code implementations NeurIPS 2023 Rui Wang, Peipei Li, Huaibo Huang, Chunshui Cao, Ran He, Zhaofeng He

Consequently, we propose a cross-modal ordinal pairwise loss to refine the CLIP feature space, where texts and images maintain both semantic alignment and ordering alignment.

Age Estimation Classification +2

Structured Learning in Time-dependent Cox Models

1 code implementation21 Jun 2023 Guanbo Wang, Yi Lian, Archer Y. Yang, Robert W. Platt, Rui Wang, Sylvie Perreault, Marc Dorais, Mireille E. Schnitzer

We propose a flexible framework for variable selection in time-dependent Cox models, accommodating complex selection rules.

Survival Analysis Variable Selection

Multi-objective Molecular Optimization for Opioid Use Disorder Treatment Using Generative Network Complex

no code implementations13 Jun 2023 Hongsong Feng, Rui Wang, Chang-Guo Zhan, Guo-Wei Wei

Opioid Use Disorder (OUD) has emerged as a significant global public health issue, with complex multifaceted conditions.

Rethinking Translation Memory Augmented Neural Machine Translation

no code implementations12 Jun 2023 Hongkun Hao, Guoping Huang, Lemao Liu, Zhirui Zhang, Shuming Shi, Rui Wang

The finding demonstrates that TM-augmented NMT is good at the ability of fitting data (i. e., lower bias) but is more sensitive to the fluctuations in the training data (i. e., higher variance), which provides an explanation to a recently reported contradictory phenomenon on the same translation task: TM-augmented NMT substantially advances vanilla NMT under the high-resource scenario whereas it fails under the low-resource scenario.

Machine Translation NMT +2

PLPCA: Persistent Laplacian Enhanced-PCA for Microarray Data Analysis

1 code implementation9 Jun 2023 Sean Cottrell, Rui Wang, GuoWei Wei

Over the years, Principal Component Analysis (PCA) has served as the baseline approach for dimensionality reduction in gene expression data analysis.

Dimensionality Reduction

Extract and Attend: Improving Entity Translation in Neural Machine Translation

no code implementations4 Jun 2023 Zixin Zeng, Rui Wang, Yichong Leng, Junliang Guo, Xu Tan, Tao Qin, Tie-Yan Liu

Inspired by this translation process, we propose an Extract-and-Attend approach to enhance entity translation in NMT, where the translation candidates of source entities are first extracted from a dictionary and then attended to by the NMT model to generate the target sentence.

Machine Translation NMT +2

Deliberate then Generate: Enhanced Prompting Framework for Text Generation

no code implementations31 May 2023 Bei Li, Rui Wang, Junliang Guo, Kaitao Song, Xu Tan, Hany Hassan, Arul Menezes, Tong Xiao, Jiang Bian, Jingbo Zhu

Large language models (LLMs) have shown remarkable success across a wide range of natural language generation tasks, where proper prompt designs make great impacts.

Text Generation

Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate

1 code implementation30 May 2023 Tian Liang, Zhiwei He, Wenxiang Jiao, Xing Wang, Yan Wang, Rui Wang, Yujiu Yang, Zhaopeng Tu, Shuming Shi

To address the DoT problem, we propose a Multi-Agent Debate (MAD) framework, in which multiple agents express their arguments in the state of "tit for tat" and a judge manages the debate process to obtain a final solution.

Arithmetic Reasoning Machine Translation

Revisiting Acceptability Judgements

1 code implementation23 May 2023 Hai Hu, Ziyin Zhang, Weifang Huang, Jackie Yan-Ki Lai, Aini Li, Yina Patterson, Jiahui Huang, Peng Zhang, Chien-Jer Charles Lin, Rui Wang

We introduce CoLAC - Corpus of Linguistic Acceptability in Chinese, the first large-scale acceptability dataset for a non-Indo-European language.

Cross-Lingual Transfer Linguistic Acceptability

Enhancing Large Language Models Against Inductive Instructions with Dual-critique Prompting

1 code implementation23 May 2023 Rui Wang, Hongru Wang, Fei Mi, Yi Chen, Boyang Xue, Kam-Fai Wong, Ruifeng Xu

Numerous works are proposed to align large language models (LLMs) with human intents to better fulfill instructions, ensuring they are trustful and helpful.

counterfactual Fact Checking

Element-aware Summarization with Large Language Models: Expert-aligned Evaluation and Chain-of-Thought Method

1 code implementation22 May 2023 Yiming Wang, Zhuosheng Zhang, Rui Wang

Further, we propose a Summary Chain-of-Thought (SumCoT) technique to elicit LLMs to generate summaries step by step, which helps them integrate more fine-grained details of source documents into the final summaries that correlate with the human writing mindset.

Benchmarking Hallucination

Nearest Neighbor Machine Translation is Meta-Optimizer on Output Projection Layer

1 code implementation22 May 2023 Ruize Gao, Zhirui Zhang, Yichao Du, Lemao Liu, Rui Wang

Nearest Neighbor Machine Translation ($k$NN-MT) has achieved great success in domain adaptation tasks by integrating pre-trained Neural Machine Translation (NMT) models with domain-specific token-level retrieval.

Domain Adaptation Machine Translation +3

Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMs

2 code implementations19 May 2023 Hongru Wang, Rui Wang, Fei Mi, Yang Deng, Zezhong Wang, Bin Liang, Ruifeng Xu, Kam-Fai Wong

Large Language Models (LLMs), such as \texttt{ChatGPT}, greatly empower dialogue systems with strong language understanding and generation capabilities.

Question Answering Semantic Similarity +1

AD-KD: Attribution-Driven Knowledge Distillation for Language Model Compression

1 code implementation17 May 2023 Siyue Wu, Hongzhan Chen, Xiaojun Quan, Qifan Wang, Rui Wang

To enhance the knowledge transfer of model reasoning and generalization, we further explore multi-view attribution distillation on all potential decisions of the teacher.

Knowledge Distillation Language Modelling +2

Exploring Human-Like Translation Strategy with Large Language Models

2 code implementations6 May 2023 Zhiwei He, Tian Liang, Wenxiang Jiao, Zhuosheng Zhang, Yujiu Yang, Rui Wang, Zhaopeng Tu, Shuming Shi, Xing Wang

Compared to typical machine translation that focuses solely on source-to-target mapping, LLM-based translation can potentially mimic the human translation process which might take preparatory steps to ensure high-quality translation.

Hallucination Machine Translation +2

Variational Bayesian Multiuser Tracking for Reconfigurable Intelligent Surface Aided MIMO-OFDM Systems

no code implementations24 Apr 2023 Boyu Teng, Xiaojun Yuan, Rui Wang

Reconfigurable intelligent surface (RIS) has attracted enormous interest for its potential advantages in assisting both wireless communication and environmental sensing.

Delving into Shape-aware Zero-shot Semantic Segmentation

1 code implementation CVPR 2023 Xinyu Liu, Beiwen Tian, Zhen Wang, Rui Wang, Kehua Sheng, Bo Zhang, Hao Zhao, Guyue Zhou

Thanks to the impressive progress of large-scale vision-language pretraining, recent recognition models can classify arbitrary objects in a zero-shot and open-set manner, with a surprisingly high accuracy.

Image Segmentation Segmentation +2

Testing and Identifying Substitution and Complementarity Patterns

no code implementations3 Apr 2023 Rui Wang

This paper studies semiparametric identification of substitution and complementarity patterns between two goods using a panel multinomial choice model with bundles.

IV Regressions without Exclusion Restrictions

no code implementations2 Apr 2023 Wayne Yuan Gao, Rui Wang

We study identification and estimation of endogenous linear and nonlinear regression models without excluded instrumental variables, based on the standard mean independence condition and a nonlinear relevance condition.

regression

FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation

no code implementations CVPR 2023 Jie Qin, Jie Wu, Pengxiang Yan, Ming Li, Ren Yuxi, Xuefeng Xiao, Yitong Wang, Rui Wang, Shilei Wen, Xin Pan, Xingang Wang

Recently, open-vocabulary learning has emerged to accomplish segmentation for arbitrary categories of text-based descriptions, which popularizes the segmentation system to more general-purpose application scenarios.

Image Segmentation Instance Segmentation +3

Predictive Resource Allocation in mmWave Systems with Rotation Detection

no code implementations29 Mar 2023 Yifei Sun, Bojie Lv, Rui Wang, Haisheng Tan, Francis C. M. Lau

Millimeter wave (MmWave) has been regarded as a promising technology to support high-capacity communications in 5G era.

Scheduling

Point Identification of LATE with Two Imperfect Instruments

no code implementations24 Mar 2023 Rui Wang

This paper characterizes point identification results of the local average treatment effect (LATE) using two imperfect instruments.

Vocal Bursts Valence Prediction

Pluralistic Aging Diffusion Autoencoder

no code implementations ICCV 2023 Peipei Li, Rui Wang, Huaibo Huang, Ran He, Zhaofeng He

Face aging is an ill-posed problem because multiple plausible aging patterns may correspond to a given input.

Denoising

I$^2$-SDF: Intrinsic Indoor Scene Reconstruction and Editing via Raytracing in Neural SDFs

no code implementations14 Mar 2023 Jingsen Zhu, Yuchi Huo, Qi Ye, Fujun Luan, Jifan Li, Dianbing Xi, Lisha Wang, Rui Tang, Wei Hua, Hujun Bao, Rui Wang

In this work, we present I$^2$-SDF, a new method for intrinsic indoor scene reconstruction and editing using differentiable Monte Carlo raytracing on neural signed distance fields (SDFs).

Indoor Scene Reconstruction Novel View Synthesis

Super-Resolution Information Enhancement For Crowd Counting

1 code implementation13 Mar 2023 Jiahao Xie, Wei Xu, Dingkang Liang, Zhanyu Ma, Kongming Liang, Weidong Liu, Rui Wang, Ling Jin

As the proposed method requires SR labels, we further propose a Super-Resolution Crowd Counting dataset (SR-Crowd).

Crowd Counting Super-Resolution

Toward Fairness in Text Generation via Mutual Information Minimization based on Importance Sampling

no code implementations25 Feb 2023 Rui Wang, Pengyu Cheng, Ricardo Henao

To improve the fairness of PLMs in text generation, we propose to minimize the mutual information between the semantics in the generated text sentences and their demographic polarity, i. e., the demographic group to which the sentence is referring.

Fairness Language Modelling +2

mmAlert: mmWave Link Blockage Prediction via Passive Sensing

no code implementations22 Feb 2023 Chao Yu, Yifei Sun, Yan Luo, Rui Wang

It is demonstrated via experiments that the mmAlert system can always detect the motions of the walking person close to the LoS path, and predict 90\% of the LoS blockage with sensing time of 1. 4 seconds.

Unique Identification of 50,000+ Virtual Reality Users from Head & Hand Motion Data

1 code implementation17 Feb 2023 Vivek Nair, Wenbo Guo, Justus Mattern, Rui Wang, James F. O'Brien, Louis Rosenberg, Dawn Song

With the recent explosive growth of interest and investment in virtual reality (VR) and the so-called "metaverse," public attention has rightly shifted toward the unique security and privacy threats that these platforms may pose.

A Study on ReLU and Softmax in Transformer

no code implementations13 Feb 2023 Kai Shen, Junliang Guo, Xu Tan, Siliang Tang, Rui Wang, Jiang Bian

This paper sheds light on the following points: 1) Softmax and ReLU use different normalization methods over elements which lead to different variances of results, and ReLU is good at dealing with a large number of key-value slots; 2) FFN and key-value memory are equivalent, and thus the Transformer can be viewed as a memory network where FFNs and self-attention networks are both key-value memories.

Document Translation

N-Gram Nearest Neighbor Machine Translation

no code implementations30 Jan 2023 Rui Lv, Junliang Guo, Rui Wang, Xu Tan, Qi Liu, Tao Qin

Nearest neighbor machine translation augments the Autoregressive Translation~(AT) with $k$-nearest-neighbor retrieval, by comparing the similarity between the token-level context representations of the target tokens in the query and the datastore.

Domain Adaptation Machine Translation +2

A Theory of Human-Like Few-Shot Learning

no code implementations3 Jan 2023 Zhiying Jiang, Rui Wang, Dongbo Bu, Ming Li

We aim to bridge the gap between our common-sense few-sample human learning and large-data machine learning.

Common Sense Reasoning Few-Shot Learning

I2-SDF: Intrinsic Indoor Scene Reconstruction and Editing via Raytracing in Neural SDFs

no code implementations CVPR 2023 Jingsen Zhu, Yuchi Huo, Qi Ye, Fujun Luan, Jifan Li, Dianbing Xi, Lisha Wang, Rui Tang, Wei Hua, Hujun Bao, Rui Wang

Further, we propose to decompose the neural radiance field into spatially-varying material of the scene as a neural field through surface-based, differentiable Monte Carlo raytracing and emitter semantic segmentations, which enables physically based and photorealistic scene relighting and editing applications.

Indoor Scene Reconstruction Novel View Synthesis

The Victim and The Beneficiary: Exploiting a Poisoned Model to Train a Clean Model on Poisoned Data

1 code implementation ICCV 2023 Zixuan Zhu, Rui Wang, Cong Zou, Lihua Jing

This inspires us to propose a novel dual-network training framework: The Victim and The Beneficiary (V&B), which exploits a poisoned model to train a clean model without extra benign samples.

Data Augmentation

4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions

no code implementations31 Dec 2022 Patrick Wenzel, Nan Yang, Rui Wang, Niclas Zeller, Daniel Cremers

In this paper, we present a novel visual SLAM and long-term localization benchmark for autonomous driving in challenging conditions based on the large-scale 4Seasons dataset.

Autonomous Driving Benchmarking +2

Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning

no code implementations CVPR 2023 Jishnu Mukhoti, Tsung-Yu Lin, Omid Poursaeed, Rui Wang, Ashish Shah, Philip H. S. Torr, Ser-Nam Lim

We introduce Patch Aligned Contrastive Learning (PACL), a modified compatibility function for CLIP's contrastive loss, intending to train an alignment between the patch tokens of the vision encoder and the CLS token of the text encoder.

Contrastive Learning Image Classification +5

Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning

4 code implementations CVPR 2023 Rui Wang, Dongdong Chen, Zuxuan Wu, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Lu Yuan, Yu-Gang Jiang

For the choice of teacher models, we observe that students taught by video teachers perform better on temporally-heavy video tasks, while image teachers transfer stronger spatial representations for spatially-heavy video tasks.

Action Classification Representation Learning +1

SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition

1 code implementation2 Dec 2022 Yichong Leng, Xu Tan, Wenjie Liu, Kaitao Song, Rui Wang, Xiang-Yang Li, Tao Qin, Edward Lin, Tie-Yan Liu

In this paper, we propose SoftCorrect with a soft error detection mechanism to avoid the limitations of both explicit and implicit error detection.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Coevolutionary Framework for Generalized Multimodal Multi-objective Optimization

1 code implementation2 Dec 2022 Wenhua Li, Xingyi Yao, Kaiwen Li, Rui Wang, Tao Zhang, Ling Wang

To address the above two issues, in this study, a novel coevolutionary framework termed CoMMEA for multimodal multi-objective optimization is proposed to better obtain both global and local PSs, and simultaneously, to improve the convergence performance in dealing with high-dimension MMOPs.

Evolutionary Algorithms Transfer Learning

CASSPR: Cross Attention Single Scan Place Recognition

1 code implementation ICCV 2023 Yan Xia, Mariia Gladkova, Rui Wang, Qianyun Li, Uwe Stilla, João F. Henriques, Daniel Cremers

CASSPR uses queries from one branch to try to match structures in the other branch, ensuring that both extract self-contained descriptors of the point cloud (rather than one branch dominating), but using both to inform the output global descriptor of the point cloud.

Unifying Tracking and Image-Video Object Detection

no code implementations20 Nov 2022 Peirong Liu, Rui Wang, Pengchuan Zhang, Omid Poursaeed, Yipin Zhou, Xuefei Cao, Sreya Dutta Roy, Ashish Shah, Ser-Nam Lim

We propose TrIVD (Tracking and Image-Video Detection), the first framework that unifies image OD, video OD, and MOT within one end-to-end model.

Multi-Object Tracking Object +2

Learning-based Inverse Rendering of Complex Indoor Scenes with Differentiable Monte Carlo Raytracing

no code implementations6 Nov 2022 Jingsen Zhu, Fujun Luan, Yuchi Huo, Zihao Lin, Zhihua Zhong, Dianbing Xi, Jiaxiang Zheng, Rui Tang, Hujun Bao, Rui Wang

Indoor scenes typically exhibit complex, spatially-varying appearance from global illumination, making inverse rendering a challenging ill-posed problem.

Inverse Rendering

Museformer: Transformer with Fine- and Coarse-Grained Attention for Music Generation

1 code implementation19 Oct 2022 Botao Yu, Peiling Lu, Rui Wang, Wei Hu, Xu Tan, Wei Ye, Shikun Zhang, Tao Qin, Tie-Yan Liu

A recent trend is to use Transformer or its variants in music generation, which is, however, suboptimal, because the full attention cannot efficiently model the typically long music sequences (e. g., over 10, 000 tokens), and the existing models have shortcomings in generating musical repetition structures.

Music Generation

Emerging dominant SARS-CoV-2 variants

no code implementations18 Oct 2022 Jiahui Chen, Rui Wang, Yuta Hozumi, Gengzhuo Liu, Yuchi Qiu, Xiaoqi Wei, Guo-Wei Wei

Accurate and reliable forecasting of emerging dominant severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) variants enables policymakers and vaccine makers to get prepared for future waves of infections.

Large-Scale Bandwidth and Power Optimization for Multi-Modal Edge Intelligence Autonomous Driving

no code implementations18 Oct 2022 Xinrao Li, Tong Zhang, Shuai Wang, Guangxu Zhu, Rui Wang, Tsung-Hui Chang

However, wireless channels between the edge server and the autonomous vehicles are time-varying due to the high-mobility of vehicles.

Autonomous Driving

Koopman Neural Forecaster for Time Series with Temporal Distribution Shifts

1 code implementation7 Oct 2022 Rui Wang, Yihe Dong, Sercan Ö. Arik, Rose Yu

Temporal distributional shifts, with underlying dynamics changing over time, frequently occur in real-world time series and pose a fundamental challenge for deep neural networks (DNNs).

Time Series Time Series Forecasting

Approximate better, Attack stronger: Adversarial Example Generation via Asymptotically Gaussian Mixture Distribution

no code implementations24 Sep 2022 Zhengwei Fang, Rui Wang, Tao Huang, Liping Jing

In this paper, we propose Multiple Asymptotically Normal Distribution Attacks (MultiANDA), a novel method that explicitly characterizes adversarial perturbations from a learned distribution.

Adversarial Attack

TOSE: A Fast Capacity Estimation Algorithm Based on Spike Approximations

no code implementations2 Sep 2022 Dandan Jiang, Han Hao, Lu Yang, Rui Wang

Instead, fast eigenvalue estimations can be realized based on the spike approximations in our TOSE algorithm.

Capacity Estimation

Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling

no code implementations25 Aug 2022 Rui Wang, Zuxuan Wu, Dongdong Chen, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Luowei Zhou, Lu Yuan, Yu-Gang Jiang

To avoid significant computational cost incurred by computing self-attention between the large number of local patches in videos, we propose to use very few global tokens (e. g., 6) for a whole video in Transformers to exchange information with 3D-CNNs with a cross-attention mechanism.

Video Recognition

A novel method for data augmentation: Nine Dot Moving Least Square (ND-MLS)

no code implementations24 Aug 2022 Wen Yang, Rui Wang, Yanchao Zhang

However, the ND-MLS method has stable performance and obtains 96. 5 top-1 acc in Res-Net on 100 different handwritten character classification tasks; 2) in segmentation, under the premise of only ten original images, DeepLab obtains 93. 5%, 85%, and 73. 3% m_IOU(10) on the bottle, horse, and grass test datasets, respectively, while the cat test dataset obtains 86. 7% m_IOU(10) with the SegNet model; 3) with only 10 original images from each category in object detection, YOLO v4 obtains 100% and 97. 2% bottle and horse detection, respectively, while the cat dataset obtains 93. 6% with YOLO v3.

Classification Data Augmentation +3

Large-scale matrix optimization based multi microgrid topology design with a constrained differential evolution algorithm

no code implementations18 Jul 2022 Wenhua Li, Shengjun Huang, Tao Zhang, Rui Wang, Ling Wang

Binary matrix optimization commonly arise in the real world, e. g., multi-microgrid network structure design problem (MGNSDP), which is to minimize the total length of the power supply line under certain constraints.

Next-ViT: Next Generation Vision Transformer for Efficient Deployment in Realistic Industrial Scenarios

4 code implementations12 Jul 2022 Jiashi Li, Xin Xia, Wei Li, Huixia Li, Xing Wang, Xuefeng Xiao, Rui Wang, Min Zheng, Xin Pan

Then, Next Hybrid Strategy (NHS) is designed to stack NCB and NTB in an efficient hybrid paradigm, which boosts performance in various downstream tasks.

Image Classification

Multimodal Multi-objective Optimization: Comparative Study of the State-of-the-Art

1 code implementation11 Jul 2022 Wenhua Li, Tao Zhang, Rui Wang, Jing Liang

Multimodal multi-objective problems (MMOPs) commonly arise in real-world problems where distant solutions in decision space correspond to very similar objective values.

Evolutionary Algorithms

A Study of Syntactic Multi-Modality in Non-Autoregressive Machine Translation

no code implementations NAACL 2022 Kexun Zhang, Rui Wang, Xu Tan, Junliang Guo, Yi Ren, Tao Qin, Tie-Yan Liu

Furthermore, we take the best of both and design a new loss function to better handle the complicated syntactic multi-modality in real-world datasets.

Machine Translation Translation

Deep Contrastive One-Class Time Series Anomaly Detection

1 code implementation4 Jul 2022 Rui Wang, Chongwei Liu, Xudong Mou, Kai Gao, Xiaohui Guo, Pin Liu, Tianyu Wo, Xudong Liu

To overcome the shortcomings, a deep Contrastive One-Class Anomaly detection method of time series (COCA) is proposed by authors, following the normality assumptions of CL and one-class classification.

Contrastive Learning One-Class Classification +2

RAW-GNN: RAndom Walk Aggregation based Graph Neural Network

no code implementations28 Jun 2022 Di Jin, Rui Wang, Meng Ge, Dongxiao He, Xiang Li, Wei Lin, Weixiong Zhang

Due to the homophily assumption of Graph Convolutional Networks (GCNs) that these methods use, they are not suitable for heterophily graphs where nodes with different labels or dissimilar attributes tend to be adjacent.

Representation Learning

FLVoogd: Robust And Privacy Preserving Federated Learning

no code implementations24 Jun 2022 Yuhang Tian, Rui Wang, Yanqi Qiao, Emmanouil Panaousis, Kaitai Liang

In this work, we propose FLVoogd, an updated federated learning method in which servers and clients collaboratively eliminate Byzantine attacks while preserving privacy.

Federated Learning Image Classification +1

Using Autoencoders on Differentially Private Federated Learning GANs

1 code implementation24 Jun 2022 Gregor Schram, Rui Wang, Kaitai Liang

In order to maintain user privacy, a combination of federated learning, differential privacy and GANs can be used to work with private data without giving away a users' privacy.

Avg Denoising +1

Parallel Pre-trained Transformers (PPT) for Synthetic Data-based Instance Segmentation

no code implementations22 Jun 2022 Ming Li, Jie Wu, Jinhang Cai, Jie Qin, Yuxi Ren, Xuefeng Xiao, Min Zheng, Rui Wang, Xin Pan

Recently, Synthetic data-based Instance Segmentation has become an exceedingly favorable optimization paradigm since it leverages simulation rendering and physics to generate high-quality image-annotation pairs.

Instance Segmentation Segmentation +1

Hybridization of evolutionary algorithm and deep reinforcement learning for multi-objective orienteering optimization

no code implementations21 Jun 2022 Wei Liu, Rui Wang, Tao Zhang, Kaiwen Li, Wenhua Li, Hisao Ishibuchi

Multi-objective orienteering problems (MO-OPs) are classical multi-objective routing problems and have received a lot of attention in the past decades.

Problem Decomposition reinforcement-learning +1

Data Augmentation vs. Equivariant Networks: A Theory of Generalization on Dynamics Forecasting

no code implementations19 Jun 2022 Rui Wang, Robin Walters, Rose Yu

In this work, we derive the generalization bounds for data augmentation and equivariant networks, characterizing their effect on learning in a unified framework.

Data Augmentation Generalization Bounds

DreamNet: A Deep Riemannian Network based on SPD Manifold Learning for Visual Classification

no code implementations16 Jun 2022 Rui Wang, Xiao-Jun Wu, Ziheng Chen, Tianyang Xu, Josef Kittler

Image set-based visual classification methods have achieved remarkable performance, via characterising the image set in terms of a non-singular covariance matrix on a symmetric positive definite (SPD) manifold.

CCP: Correlated Clustering and Projection for Dimensionality Reduction

2 code implementations8 Jun 2022 Yuta Hozumi, Rui Wang, Guo-Wei Wei

Most dimensionality reduction methods employ frequency domain representations obtained from matrix diagonalization and may not be efficient for large datasets with relatively high intrinsic dimensions.

Clustering Dimensionality Reduction

An Indoor Environment Sensing and Localization System via mmWave Phased Array

no code implementations7 Jun 2022 Yifei Sun, Jie Li, Tong Zhang, Rui Wang, Xiaohui Peng, Tony Xiao Han, Haisheng Tan

At the end, we show that the reconstructed room layout can be utilized to locate a mobile device according to its AoA spectrum, even with single access point.

Hyperspectral Unmixing Based on Nonnegative Matrix Factorization: A Comprehensive Review

no code implementations20 May 2022 Xin-Ru Feng, Heng-Chao Li, Rui Wang, Qian Du, Xiuping Jia, Antonio Plaza

Hyperspectral unmixing has been an important technique that estimates a set of endmembers and their corresponding abundances from a hyperspectral image (HSI).

Hyperspectral Unmixing

TRT-ViT: TensorRT-oriented Vision Transformer

no code implementations19 May 2022 Xin Xia, Jiashi Li, Jie Wu, Xing Wang, Xuefeng Xiao, Min Zheng, Rui Wang

We revisit the existing excellent Transformers from the perspective of practical application.

Image Classification object-detection +2

Persistent Laplacian projected Omicron BA.4 and BA.5 to become new dominating variants

no code implementations1 May 2022 Jiahui Chen, Yuchi Qiu, Rui Wang, Guo-Wei Wei

In particular, BA. 4 and BA. 5 are about 36\% more infectious than BA. 2 and are projected to become new dominating variants by natural selection.

Deep Reinforcement Learning for Orienteering Problems Based on Decomposition

no code implementations25 Apr 2022 Wei Liu, Tao Zhang, Rui Wang, Kaiwen Li, Wenhua Li, Kang Yang

A dynamic pointer network (DYPN) is introduced as the TSP solver, which takes city locations as inputs and immediately outputs a permutation of nodes.

reinforcement-learning Reinforcement Learning (RL) +1

Multi-objective Pointer Network for Combinatorial Optimization

1 code implementation25 Apr 2022 Le-yang Gao, Rui Wang, Chuang Liu, Zhao-hong Jia

Recently, a number of deep reinforcement learning (DRL) methods have been proposed to generate approximate optimal solutions to the combinatorial optimization problems.

Combinatorial Optimization reinforcement-learning +2

WikiDiverse: A Multimodal Entity Linking Dataset with Diversified Contextual Topics and Entity Types

3 code implementations ACL 2022 Xuwu Wang, Junfeng Tian, Min Gui, Zhixu Li, Rui Wang, Ming Yan, Lihan Chen, Yanghua Xiao

In this paper, we present WikiDiverse, a high-quality human-annotated MEL dataset with diversified contextual topics and entity types from Wikinews, which uses Wikipedia as the corresponding knowledge base.

Entity Linking

Accelerating Federated Edge Learning via Topology Optimization

no code implementations1 Apr 2022 Shanfeng Huang, Zezhong Zhang, Shuai Wang, Rui Wang, Kaibin Huang

In this paper, a novel topology-optimized federated edge learning (TOFEL) scheme is proposed to tackle the heterogeneity issue in federated learning and to improve the communication-and-computation efficiency.

3D Object Detection Federated Learning +3

LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT

1 code implementation29 Mar 2022 Rui Wang, Qibing Bai, Junyi Ao, Long Zhou, Zhixiang Xiong, Zhihua Wei, Yu Zhang, Tom Ko, Haizhou Li

LightHuBERT outperforms the original HuBERT on ASR and five SUPERB tasks with the HuBERT size, achieves comparable performance to the teacher model in most tasks with a reduction of 29% parameters, and obtains a $3. 5\times$ compression ratio in three SUPERB tasks, e. g., automatic speaker verification, keyword spotting, and intent classification, with a slight accuracy loss.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +6

Passive Motion Detection via mmWave Communication System

no code implementations28 Mar 2022 Jie Li, Chao Yu, Yan Luo, Yifei Sun, Rui Wang

Relying on the passive sensing system, a dataset of received signals, where three types of hand gestures are sensed, is collected by using Line-of-Sight (LoS) and Non-Line-of-Sight (NLoS) paths as the reference channel respectively.

Hand Gesture Recognition Hand-Gesture Recognition +1

An Interactive Image-based Modeling System

no code implementations28 Mar 2022 Zhi He, Rui Wang, Wei Hua, Yuchi Huo

This paper propose a interactive 3D modeling method and corresponding system based on single or multiple uncalibrated images.

Camera Calibration

Lazy Rearrangement Planning in Confined Spaces

2 code implementations19 Mar 2022 Rui Wang, Kai Gao, Jingjin Yu, Kostas Bekris

Object rearrangement is important for many applications but remains challenging, especially in confined spaces, such as shelves, where objects cannot be accessed from above and they block reachability to each other.

Motion Planning

Efficient conditioned face animation using frontally-viewed embedding

no code implementations16 Mar 2022 Maxime Oquab, Daniel Haziza, Ludovic Schwartz, Tao Xu, Katayoun Zand, Rui Wang, Peirong Liu, Camille Couprie

As the quality of few shot facial animation from landmarks increases, new applications become possible, such as ultra low bandwidth video chat compression with a high degree of realism.

Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translation

1 code implementation ACL 2022 Zhiwei He, Xing Wang, Rui Wang, Shuming Shi, Zhaopeng Tu

By carefully designing experiments, we identify two representative characteristics of the data gap in source: (1) style gap (i. e., translated vs. natural text style) that leads to poor generalization capability; (2) content gap that induces the model to produce hallucination content biased towards the target language.

Hallucination Machine Translation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.