Search Results for author: Xin Wang

Found 465 papers, 163 papers with code

OIE@OIA: an Adaptable and Efficient Open Information Extraction Framework

no code implementations • ACL 2022 • Xin Wang, Minlong Peng, Mingming Sun, Ping Li

OIE@OIA follows the methodology of Open Information eXpression (OIX): parsing a sentence to an Open Information Annotation (OIA) Graph and then adapting the OIA graph to different OIE tasks with simple rules.

Open Information Extraction Sentence

Paper
Add Code

A Predicate-Function-Argument Annotation of Natural Language for Open-Domain Information eXpression

no code implementations • EMNLP 2020 • Mingming Sun, Wenyue Hua, Zoey Liu, Xin Wang, Kangjie Zheng, Ping Li

Based on the same platform of OIX, the OIE strategies are reusable, and people can select a set of strategies to assemble their algorithm for a specific task so that the adaptability may be significantly increased.

Open Information Extraction Sentence

Paper
Add Code

Dependency Position Encoding for Relation Extraction

no code implementations • Findings (NAACL) 2022 • Qiushi Guo, Xin Wang, Dehong Gao

Leveraging the dependency tree of the input sentence is able to improve the model performance for relation extraction.

Position Relation +2

Paper
Add Code

Efficient Sharpness-aware Minimization for Molecular Graph Transformer Models

1 code implementation • ICLR 2024 • Yili Wang, Kaixiong Zhou, Ninghao Liu, Ying Wang, Xin Wang

Sharpness-aware minimization (SAM) has received increasing attention in computer vision since it can effectively eliminate the sharp local minima from the training trajectory and mitigate generalization degradation.

Molecular Property Prediction

Paper
Code

Self-Selected Attention Span for Accelerating Large Language Model Inference

no code implementations • 14 Apr 2024 • Tian Jin, Wanzin Yazar, Zifei Xu, Sayeh Sharify, Xin Wang

We demonstrate that using this custom CUDA kernel improves the throughput of LLM inference by 28%.

Language Modelling Large Language Model

Paper
Add Code

The VoicePrivacy 2024 Challenge Evaluation Plan

1 code implementation • 3 Apr 2024 • Natalia Tomashenko, Xiaoxiao Miao, Pierre Champion, Sarina Meyer, Xin Wang, Emmanuel Vincent, Michele Panariello, Nicholas Evans, Junichi Yamagishi, Massimiliano Todisco

The task of the challenge is to develop a voice anonymization system for speech data which conceals the speaker's voice identity while protecting linguistic content and emotional states.

Paper
Code

iMD4GC: Incomplete Multimodal Data Integration to Advance Precise Treatment Response Prediction and Survival Analysis for Gastric Cancer

1 code implementation • 1 Apr 2024 • Fengtao Zhou, Yingxue Xu, Yanfen Cui, Shenyan Zhang, Yun Zhu, Weiyang He, Jiguang Wang, Xin Wang, Ronald Chan, Louis Ho Shing Lau, Chu Han, Dafu Zhang, Zhenhui Li, Hao Chen

The limited availability of modalities for each patient would cause information loss, adversely affecting predictive accuracy.

Data Integration Survival Analysis

Paper
Code

Qibo: A Large Language Model for Traditional Chinese Medicine

no code implementations • 24 Mar 2024 • Heyi Zhang, Xin Wang, Zhaopeng Meng, Yongzhe Jia, Dawei Xu

Furthermore, we develop the Qibo-benchmark, a specialized tool for evaluating the performance of LLMs, which is a specialized tool for evaluating the performance of LLMs in the TCM domain.

Language Modelling Large Language Model

Paper
Add Code

Exploring the Potential of Large Language Models in Graph Generation

no code implementations • 21 Mar 2024 • Yang Yao, Xin Wang, Zeyang Zhang, Yijian Qin, Ziwei Zhang, Xu Chu, Yuekui Yang, Wenwu Zhu, Hong Mei

In this paper, we propose LLM4GraphGen to explore the ability of LLMs for graph generation with systematical task designs and extensive experiments.

Drug Discovery Graph Generation +1

Paper
Add Code

When Do We Not Need Larger Vision Models?

1 code implementation • 19 Mar 2024 • Baifeng Shi, Ziyang Wu, Maolin Mao, Xin Wang, Trevor Darrell

Our results show that a multi-scale smaller model has comparable learning capacity to a larger model, and pre-training smaller models with S$^2$ can match or even exceed the advantage of larger models.

Depth Estimation

172

Paper
Code

MIntRec2.0: A Large-scale Benchmark Dataset for Multimodal Intent Recognition and Out-of-scope Detection in Conversations

1 code implementation • 16 Mar 2024 • Hanlei Zhang, Xin Wang, Hua Xu, Qianrui Zhou, Kai Gao, Jianhua Su, jinyue Zhao, Wenrui Li, Yanting Chen

We believe that MIntRec2. 0 will serve as a valuable resource, providing a pioneering foundation for research in human-machine conversational interactions, and significantly facilitating related applications.

Multimodal Intent Recognition

Paper
Code

Robust Light-Weight Facial Affective Behavior Recognition with CLIP

1 code implementation • 14 Mar 2024 • Li Lin, Sarah Papabathini, Xin Wang, Shu Hu

Human affective behavior analysis aims to delve into human expressions and behaviors to deepen our understanding of human emotions.

Paper
Code

Robust COVID-19 Detection in CT Images with CLIP

1 code implementation • 13 Mar 2024 • Li Lin, Yamini Sri Krubha, Zhenhuan Yang, Cheng Ren, Thuc Duy Le, Irene Amerini, Xin Wang, Shu Hu

In the realm of medical imaging, particularly for COVID-19 detection, deep learning models face substantial challenges such as the necessity for extensive computational resources, the paucity of well-annotated datasets, and a significant amount of unlabeled data.

Paper
Code

SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression

1 code implementation • 12 Mar 2024 • Xin Wang, Yu Zheng, Zhongwei Wan, Mi Zhang

The advancements in Large Language Models (LLMs) have been hindered by their substantial sizes, which necessitate LLM compression methods for practical deployment.

Language Modelling Large Language Model +1

Paper
Code

Optimizing Latent Graph Representations of Surgical Scenes for Zero-Shot Domain Transfer

no code implementations • 11 Mar 2024 • Siddhant Satyanaik, Aditya Murali, Deepak Alapatt, Xin Wang, Pietro Mascagni, Nicolas Padoy

Purpose: Advances in deep learning have resulted in effective models for surgical video analysis; however, these models often fail to generalize across medical centers due to domain shift caused by variations in surgical workflow, camera setups, and patient demographics.

Anatomy Disentanglement +3

Paper
Add Code

UAV-Enabled Asynchronous Federated Learning

no code implementations • 11 Mar 2024 • Zhiyuan Zhai, Xiaojun Yuan, Xin Wang, Huiyuan Yang

To exploit unprecedented data generation in mobile edge networks, federated learning (FL) has emerged as a promising alternative to the conventional centralized machine learning (ML).

Federated Learning

Paper
Add Code

Spectral Invariant Learning for Dynamic Graphs under Distribution Shifts

1 code implementation • NeurIPS 2023 • Zeyang Zhang, Xin Wang, Ziwei Zhang, Zhou Qin, Weigao Wen, Hui Xue, Haoyang Li, Wenwu Zhu

In this paper, we discover that there exist cases with distribution shifts unobservable in the time domain while observable in the spectral domain, and propose to study distribution shifts on dynamic graphs in the spectral domain for the first time.

Link Prediction Node Classification

Paper
Code

Unsupervised Graph Neural Architecture Search with Disentangled Self-supervision

no code implementations • NeurIPS 2023 • Zeyang Zhang, Xin Wang, Ziwei Zhang, Guangyao Shen, Shiqi Shen, Wenwu Zhu

To address the challenge, we propose a novel Disentangled Self-supervised Graph Neural Architecture Search (DSGAS) model, which is able to discover the optimal architectures capturing various latent graph factors in a self-supervised fashion based on unlabeled graph data.

Disentanglement Neural Architecture Search

Paper
Add Code

Electrocardiogram Instruction Tuning for Report Generation

no code implementations • 7 Mar 2024 • Zhongwei Wan, Che Liu, Xin Wang, Chaofan Tao, Hui Shen, Zhenwu Peng, Jie Fu, Rossella Arcucci, Huaxiu Yao, Mi Zhang

Electrocardiogram (ECG) serves as the primary non-invasive diagnostic tool for cardiac conditions monitoring, are crucial in assisting clinicians.

Paper
Add Code

Parameterized quantum comb and simpler circuits for reversing unknown qubit-unitary operations

no code implementations • 6 Mar 2024 • Yin Mo, Lei Zhang, Yu-Ao Chen, Yingjian Liu, Tengxiang Lin, Xin Wang

Quantum comb is an essential tool for characterizing complex quantum protocols in quantum information processing.

Quantum Machine Learning

Paper
Add Code

Preserving Fairness Generalization in Deepfake Detection

1 code implementation • 27 Feb 2024 • Li Lin, Xinan He, Yan Ju, Xin Wang, Feng Ding, Shu Hu

The existing method for addressing this problem is providing a fair loss function.

DeepFake Detection Disentanglement +2

Paper
Code

Neural Radiance Fields in Medical Imaging: Challenges and Next Steps

no code implementations • 26 Feb 2024 • Xin Wang, Shu Hu, Heng Fan, Hongtu Zhu, Xin Li

Neural Radiance Fields (NeRF), as a pioneering technique in computer vision, offer great potential to revolutionize medical imaging by synthesizing three-dimensional representations from the projected two-dimensional image data.

Paper
Add Code

Two-stage Cytopathological Image Synthesis for Augmenting Cervical Abnormality Screening

no code implementations • 22 Feb 2024 • Zhenrong Shen, Manman Fei, Xin Wang, Jiangdong Cai, Sheng Wang, Lichi Zhang, Qian Wang

In the first Global Image Generation stage, a Normal Image Generator is designed to generate cytopathological images full of normal cervical cells.

Cell Detection Data Augmentation +1

Paper
Add Code

HyCubE: Efficient Knowledge Hypergraph 3D Circular Convolutional Embedding

no code implementations • 14 Feb 2024 • Zhao Li, Xin Wang, JianXin Li, Wenbin Guo, Jun Zhao

Existing knowledge hypergraph embedding methods mainly focused on improving model performance, but their model structures are becoming more complex and redundant.

hypergraph embedding

Paper
Add Code

Rethinking Propagation for Unsupervised Graph Domain Adaptation

1 code implementation • 8 Feb 2024 • Meihan Liu, Zeyu Fang, Zhen Zhang, Ming Gu, Sheng Zhou, Xin Wang, Jiajun Bu

Motivated by our empirical analysis, we reevaluate the role of GNNs in graph domain adaptation and uncover the pivotal role of the propagation process in GNNs for adapting to different graph domains.

Domain Adaptation

Paper
Code

Failure Analysis in Next-Generation Critical Cellular Communication Infrastructures

no code implementations • 6 Feb 2024 • Siguo Bi, Xin Yuan, Shuyan Hu, Kai Li, Wei Ni, Ekram Hossain, Xin Wang

The advent of communication technologies marks a transformative phase in critical infrastructure construction, where the meticulous analysis of failures becomes paramount in achieving the fundamental objectives of continuity, security, and availability.

Paper
Add Code

Revisiting VAE for Unsupervised Time Series Anomaly Detection: A Frequency Perspective

1 code implementation • 5 Feb 2024 • Zexin Wang, Changhua Pei, Minghua Ma, Xin Wang, Zhihan Li, Dan Pei, Saravan Rajmohan, Dongmei Zhang, QIngwei Lin, Haiming Zhang, Jianhui Li, Gaogang Xie

To ensure an accurate AD, FCVAE exploits an innovative approach to concurrently integrate both the global and local frequency features into the condition of Conditional Variational Autoencoder (CVAE) to significantly increase the accuracy of reconstructing the normal data.

Anomaly Detection Time Series +1

Paper
Code

Artificial Intelligence in Image-based Cardiovascular Disease Analysis: A Comprehensive Survey and Future Outlook

no code implementations • 4 Feb 2024 • Xin Wang, Hongtu Zhu

Our review encompasses these modalities, giving a broad perspective on the diverse imaging techniques integrated with AI for CVD analysis.

Paper
Add Code

Masked Conditional Diffusion Model for Enhancing Deepfake Detection

no code implementations • 1 Feb 2024 • Tiewen Chen, Shanmin Yang, Shu Hu, Zhenghan Fang, Ying Fu, Xi Wu, Xin Wang

this paper present we put a new insight into diffusion model-based data augmentation, and propose a Masked Conditional Diffusion Model (MCDM) for enhancing deepfake detection.

Data Augmentation DeepFake Detection +1

Paper
Add Code

Uncertainty-Aware Explainable Recommendation with Large Language Models

no code implementations • 31 Jan 2024 • Yicui Peng, Hao Chen, ChingSheng Lin, Guo Huang, Jinrong Hu, Hui Guo, Bin Kong, Shu Hu, Xi Wu, Xin Wang

Providing explanations within the recommendation system would boost user satisfaction and foster trust, especially by elaborating on the reasons for selecting recommended items tailored to the user.

Explainable Recommendation Multi-Task Learning

Paper
Add Code

Active Generation Network of Human Skeleton for Action Recognition

no code implementations • 30 Jan 2024 • Long Liu, Xin Wang, Fangming Li, Jiayu Chen

To solve those problems, We propose a novel active generative network (AGN), which can adaptively learn various action categories by motion style transfer to generate new actions when the data for a particular action is only a single sample or few samples.

Action Generation Action Recognition +4

Paper
Add Code

Detecting Multimedia Generated by Large AI Models: A Survey

1 code implementation • 22 Jan 2024 • Li Lin, Neeraj Gupta, Yue Zhang, Hainan Ren, Chun-Hao Liu, Feng Ding, Xin Wang, Xin Li, Luisa Verdoliva, Shu Hu

The rapid advancement of Large AI Models (LAIMs), particularly diffusion models and large language models, has marked a new era where AI-generated multimedia is increasingly integrated into various aspects of daily life.

Paper
Code

Efficient Image Super-Resolution via Symmetric Visual Attention Network

no code implementations • 17 Jan 2024 • Chengxu Wu, Qinrui Fan, Shu Hu, Xi Wu, Xin Wang, Jing Hu

An important development direction in the Single-Image Super-Resolution (SISR) algorithms is to improve the efficiency of the algorithms.

Ranked #53 on Image Super-Resolution on Set14 - 4x upscaling

Image Super-Resolution

Paper
Add Code

To deform or not: treatment-aware longitudinal registration for breast DCE-MRI during neoadjuvant chemotherapy via unsupervised keypoints detection

1 code implementation • 17 Jan 2024 • Luyi Han, Tao Tan, Tianyu Zhang, Yuan Gao, Xin Wang, Valentina Longo, Sofía Ventura-Díaz, Anna D'Angelo, Jonas Teuwen, Ritse Mann

We use a clinical dataset with 1630 MRI scans from 314 patients treated with NAC.

Keypoint Detection Tumor Segmentation +1

Paper
Code

TeleChat Technical Report

no code implementations • 8 Jan 2024 • Zhongjiang He, Zihan Wang, Xinzhang Liu, Shixuan Liu, Yitong Yao, Yuyao Huang, Xuelong Li, Yongxiang Li, Zhonghao Che, Zhaoxi Zhang, Yan Wang, Xin Wang, Luwen Pu, Huinan Xu, Ruiyu Fang, Yu Zhao, Jie Zhang, Xiaomeng Huang, Zhilong Lu, Jiaxin Peng, Wenjun Zheng, Shiquan Wang, Bingkai Yang, Xuewei he, Zhuoru Jiang, Qiyi Xie, Yanhan Zhang, Zhongqiu Li, Lingling Shi, Weiwei Fu, Yin Zhang, Zilu Huang, Sishi Xiong, Yuxiang Zhang, Chao Wang, Shuangyong Song

Subsequently, the model undergoes fine-tuning to align with human preferences, following a detailed methodology that we describe.

Code Generation Question Answering

Paper
Add Code

Bayesian Intrinsic Groupwise Image Registration: Unsupervised Disentanglement of Anatomy and Geometry

no code implementations • 4 Jan 2024 • Xinzhe Luo, Xin Wang, Linda Shapiro, Chun Yuan, Jianfeng Feng, Xiahai Zhuang

This article presents a general Bayesian learning framework for multi-modal groupwise registration on medical images.

Anatomy Bayesian Inference +2

Paper
Add Code

IoT in the Era of Generative AI: Vision and Challenges

no code implementations • 3 Jan 2024 • Xin Wang, Zhongwei Wan, Arvin Hekmati, Mingyu Zong, Samiul Alam, Mi Zhang, Bhaskar Krishnamachari

Equipped with sensing, networking, and computing capabilities, Internet of Things (IoT) such as smartphones, wearables, smart speakers, and household robots have been seamlessly weaved into our daily lives.

Federated Learning Prompt Engineering

Paper
Add Code

Grounding-Prompter: Prompting LLM with Multimodal Information for Temporal Sentence Grounding in Long Videos

no code implementations • 28 Dec 2023 • Houlun Chen, Xin Wang, Hong Chen, Zihan Song, Jia Jia, Wenwu Zhu

To tackle these challenges, in this work we propose a Grounding-Prompter method, which is capable of conducting TSG in long videos through prompting LLM with multimodal information.

Denoising In-Context Learning +3

Paper
Add Code

PokeMQA: Programmable knowledge editing for Multi-hop Question Answering

1 code implementation • 23 Dec 2023 • Hengrui Gu, Kaixiong Zhou, Xiaotian Han, Ninghao Liu, Ruobing Wang, Xin Wang

Multi-hop question answering (MQA) is one of the challenging tasks to evaluate machine's comprehension and reasoning abilities, where large language models (LLMs) have widely achieved the human-comparable performance.

Answer Generation knowledge editing +3

Paper
Code

LLM4VG: Large Language Models Evaluation for Video Grounding

no code implementations • 21 Dec 2023 • Wei Feng, Xin Wang, Hong Chen, Zeyang Zhang, Zihan Song, Yuwei Zhou, Wenwu Zhu

Recently, researchers have attempted to investigate the capability of LLMs in handling videos and proposed several video LLM models.

Image Captioning Video Grounding +1

Paper
Add Code

In2SET: Intra-Inter Similarity Exploiting Transformer for Dual-Camera Compressive Hyperspectral Imaging

no code implementations • 20 Dec 2023 • Xin Wang, Lizhi Wang, Xiangtian Ma, Maoqing Zhang, Lin Zhu, Hua Huang

Dual-Camera Compressed Hyperspectral Imaging (DCCHI) offers the capability to reconstruct 3D Hyperspectral Image (HSI) by fusing compressive and Panchromatic (PAN) image, which has shown great potential for snapshot hyperspectral imaging in practice.

Paper
Add Code

ConvD: Attention Enhanced Dynamic Convolutional Embeddings for Knowledge Graph Completion

no code implementations • 11 Dec 2023 • Wenbin Guo, Zhao Li, Xin Wang, Zirui Chen

In this paper, we propose a novel dynamic convolutional embedding model ConvD for knowledge graph completion, which directly reshapes the relation embeddings into multiple internal convolution kernels to improve the external convolution kernels of the traditional convolutional embedding model.

Entity Embeddings Relation

Paper
Add Code

Detection and Mitigation of Position Spoofing Attacks on Cooperative UAV Swarm Formations

no code implementations • 6 Dec 2023 • Siguo Bi, Kai Li, Shuyan Hu, Wei Ni, Cong Wang, Xin Wang

Detecting spoofing attacks on the positions of unmanned aerial vehicles (UAVs) within a swarm is challenging.

Position

Paper
Add Code

Efficient Large Language Models: A Survey

3 code implementations • 6 Dec 2023 • Zhongwei Wan, Xin Wang, Che Liu, Samiul Alam, Yu Zheng, Jiachen Liu, Zhongnan Qu, Shen Yan, Yi Zhu, Quanlu Zhang, Mosharaf Chowdhury, Mi Zhang

Large Language Models (LLMs) have demonstrated remarkable capabilities in important tasks such as natural language understanding, language generation, and complex reasoning and have the potential to make a substantial impact on our society.

Natural Language Understanding Text Generation

829

Paper
Code

Virtual Quantum Markov Chains

no code implementations • 4 Dec 2023 • Yu-Ao Chen, Chengkai Zhu, Keming He, Mingrui Jing, Xin Wang

In this work, we propose the concept of virtual quantum Markov chains (VQMCs), focusing on scenarios where subsystems retain classical information about global systems from measurement statistics.

Paper
Add Code

VTimeLLM: Empower LLM to Grasp Video Moments

1 code implementation • 30 Nov 2023 • Bin Huang, Xin Wang, Hong Chen, Zihan Song, Wenwu Zhu

Large language models (LLMs) have shown remarkable text understanding capabilities, which have been extended as Video LLMs to handle video data for comprehending visual details.

Ranked #1 on Video-based Generative Performance Benchmarking (Detail Orientation)) on VideoInstruct

Dense Video Captioning Video-based Generative Performance Benchmarking (Consistency) +5

108

Paper
Code

Out-of-Distribution Generalized Dynamic Graph Neural Network for Human Albumin Prediction

no code implementations • 27 Nov 2023 • Zeyang Zhang, Xingwang Li, Fei Teng, Ning Lin, Xueling Zhu, Xin Wang, Wenwu Zhu

We first model human albumin prediction as a dynamic graph regression problem to model the dynamics and patient relationship.

Graph Attention Graph Regression +1

Paper
Add Code

A Generic Stochastic Hybrid Car-following Model Based on Approximate Bayesian Computation

no code implementations • 27 Nov 2023 • Jiwan Jiang, Yang Zhou, Xin Wang, Soyoung Ahn

However, the CF behavior of human drivers is highly stochastic and nonlinear.

Paper
Add Code

OFDMA-F$^2$L: Federated Learning With Flexible Aggregation Over an OFDMA Air Interface

no code implementations • 25 Nov 2023 • Shuyan Hu, Xin Yuan, Wei Ni, Xin Wang, Ekram Hossain, H. Vincent Poor

Federated learning (FL) can suffer from a communication bottleneck when deployed in mobile networks, limiting participating clients and deterring FL convergence.

Federated Learning

Paper
Add Code

Out-of-Distribution Generalized Dynamic Graph Neural Network with Disentangled Intervention and Invariance Promotion

no code implementations • 24 Nov 2023 • Zeyang Zhang, Xin Wang, Ziwei Zhang, Haoyang Li, Wenwu Zhu

In this paper, we propose Disentangled Intervention-based Dynamic graph Attention networks with Invariance Promotion (I-DIDA) to handle spatio-temporal distribution shifts in dynamic graphs by discovering and utilizing invariant patterns, i. e., structures and features whose predictive abilities are stable across distribution shifts.

Graph Attention

Paper
Add Code

Self-organized biodiversity in biotic resource systems

no code implementations • 23 Nov 2023 • Ju Kang, Shijie Zhang, Yiyuan Niu, Xin Wang

What determines biodiversity in nature is a prominent issue in ecology, especially in biotic resource systems that are typically devoid of cross-feeding.

Paper
Add Code

Adversarial Prompt Tuning for Vision-Language Models

1 code implementation • 19 Nov 2023 • Jiaming Zhang, Xingjun Ma, Xin Wang, Lingyu Qiu, Jiaqi Wang, Yu-Gang Jiang, Jitao Sang

With the rapid advancement of multimodal learning, pre-trained Vision-Language Models (VLMs) such as CLIP have demonstrated remarkable capacities in bridging the gap between visual and language modalities.

Adversarial Robustness

Paper
Code

MeLo: Low-rank Adaptation is Better than Fine-tuning for Medical Image Diagnosis

1 code implementation • 14 Nov 2023 • Yitao Zhu, Zhenrong Shen, Zihao Zhao, Sheng Wang, Xin Wang, Xiangyu Zhao, Dinggang Shen, Qian Wang

By fixing the weight of ViT models and only adding small low-rank plug-ins, we achieve competitive results on various diagnosis tasks across different imaging modalities using only a few trainable parameters.

280

Paper
Code

Uni-COAL: A Unified Framework for Cross-Modality Synthesis and Super-Resolution of MR Images

no code implementations • 14 Nov 2023 • Zhiyun Song, Zengxin Qi, Xin Wang, Xiangyu Zhao, Zhenrong Shen, Sheng Wang, Manman Fei, Zhe Wang, Di Zang, Dongdong Chen, Linlin Yao, Qian Wang, Xuehai Wu, Lichi Zhang

Cross-modality synthesis (CMS), super-resolution (SR), and their combination (CMSR) have been extensively studied for magnetic resonance imaging (MRI).

Attribute Image Generation +2

Paper
Add Code

Post-training Quantization with Progressive Calibration and Activation Relaxing for Text-to-Image Diffusion Models

no code implementations • 10 Nov 2023 • Siao Tang, Xin Wang, Hong Chen, Chaoyu Guan, Zewen Wu, Yansong Tang, Wenwu Zhu

In this paper, we propose a novel post-training quantization method PCR (Progressive Calibration and Relaxing) for text-to-image diffusion models, which consists of a progressive calibration strategy that considers the accumulated quantization error across timesteps, and an activation relaxing strategy that improves the performance with negligible cost.

Quantization

Paper
Add Code

UMedNeRF: Uncertainty-aware Single View Volumetric Rendering for Medical Neural Radiance Fields

no code implementations • 10 Nov 2023 • Jing Hu, Qinrui Fan, Shu Hu, Siwei Lyu, Xi Wu, Xin Wang

In the field of clinical medicine, computed tomography (CT) is an effective medical imaging modality for the diagnosis of various pathologies.

Computed Tomography (CT)

Paper
Add Code

3D Pose Estimation of Tomato Peduncle Nodes using Deep Keypoint Detection and Point Cloud

no code implementations • 8 Nov 2023 • Jianchao Ci, Xin Wang, David Rapado-Rincón, Akshay K. Burusa, Gert Kootstra

A 21 comprehensive evaluation was conducted in a commercial greenhouse to gain insight into the 22 performance of different parts of the method.

3D Pose Estimation Keypoint Detection

Paper
Add Code

Lightweight Diffusion Models with Distillation-Based Block Neural Architecture Search

no code implementations • 8 Nov 2023 • Siao Tang, Xin Wang, Hong Chen, Chaoyu Guan, Yansong Tang, Wenwu Zhu

When retraining the searched architecture, we adopt a dynamic joint loss to maintain the consistency between supernet training and subnet retraining, which also provides informative objectives for each block and shortens the paths of gradient propagation.

Neural Architecture Search

Paper
Add Code

Dissecting the Runtime Performance of the Training, Fine-tuning, and Inference of Large Language Models

no code implementations • 7 Nov 2023 • Longteng Zhang, Xiang Liu, Zeyu Li, Xinglin Pan, Peijie Dong, Ruibo Fan, Rui Guo, Xin Wang, Qiong Luo, Shaohuai Shi, Xiaowen Chu

For end users, our benchmark and findings help better understand different optimization techniques, training and inference frameworks, together with hardware platforms in choosing configurations for deploying LLMs.

Quantization

Paper
Add Code

VideoDreamer: Customized Multi-Subject Text-to-Video Generation with Disen-Mix Finetuning

no code implementations • 2 Nov 2023 • Hong Chen, Xin Wang, Guanning Zeng, YiPeng Zhang, Yuwei Zhou, Feilin Han, Wenwu Zhu

The video generator is further customized for the given multiple subjects by the proposed Disen-Mix Finetuning and Human-in-the-Loop Re-finetuning strategy, which can tackle the attribute binding problem of multi-subject generation.

Attribute Text-to-Video Generation +1

Paper
Add Code

A Systematic Review for Transformer-based Long-term Series Forecasting

no code implementations • 31 Oct 2023 • Liyilei Su, Xumin Zuo, Rui Li, Xin Wang, Heng Zhao, Bingding Huang

Various variants have enabled transformer architecture to effectively handle long-term time series forecasting (LTSF) tasks.

Time Series Time Series Forecasting

Paper
Add Code

Towards Generalized Multi-stage Clustering: Multi-view Self-distillation

no code implementations • 29 Oct 2023 • Jiatai Wang, Zhiwei Xu, Xin Wang, Tao Li

MVC aims at exploring common semantics and pseudo-labels from multiple views and clustering in a self-supervised manner.

Clustering Contrastive Learning +1

Paper
Add Code

Hierarchical Mutual Information Analysis: Towards Multi-view Clustering in The Wild

no code implementations • 28 Oct 2023 • Jiatai Wang, Zhiwei Xu, Xuewen Yang, Xin Wang

Multi-view clustering (MVC) can explore common semantics from unsupervised views generated by different sources, and thus has been extensively used in applications of practical computer vision.

Clustering

Paper
Add Code

Disentangled Representation Learning with Large Language Models for Text-Attributed Graphs

no code implementations • 27 Oct 2023 • Yijian Qin, Xin Wang, Ziwei Zhang, Wenwu Zhu

Text-attributed graphs (TAGs) are prevalent on the web and research over TAGs such as citation networks, e-commerce networks and social networks has attracted considerable attention in the web community.

Representation Learning

Paper
Add Code

LLM4DyG: Can Large Language Models Solve Spatial-Temporal Problems on Dynamic Graphs?

no code implementations • 26 Oct 2023 • Zeyang Zhang, Xin Wang, Ziwei Zhang, Haoyang Li, Yijian Qin, Wenwu Zhu

Our main observations are: 1) LLMs have preliminary spatial-temporal understanding abilities on dynamic graphs, 2) Dynamic graph tasks show increasing difficulties for LLMs as the graph size and density increase, while not sensitive to the time span and data generation mechanism, 3) the proposed DST2 prompting method can help to improve LLMs' spatial-temporal understanding abilities on dynamic graphs for most tasks.

Paper
Add Code

Self-triggered Consensus Control of Multi-agent Systems from Data

no code implementations • 19 Oct 2023 • Yifei Li, Xin Wang, Jian Sun, Gang Wang, Jie Chen

In the presence of external disturbances, a model-based STC scheme is put forth for $\mathcal{H}_{\infty}$-consensus of MASs, serving as a baseline for the data-driven STC.

Paper
Add Code

Provable Advantage of Parameterized Quantum Circuit in Function Approximation

no code implementations • 11 Oct 2023 • Zhan Yu, Qiuhao Chen, Yuling Jiao, Yinan Li, Xiliang Lu, Xin Wang, Jerry Zhijian Yang

To achieve this, we utilize techniques from quantum signal processing and linear combinations of unitaries to construct PQCs that implement multivariate polynomials.

Quantum Machine Learning

Paper
Add Code

Decentralized Federated Learning via MIMO Over-the-Air Computation: Consensus Analysis and Performance Optimization

no code implementations • 8 Oct 2023 • Zhiyuan Zhai, Xiaojun Yuan, Xin Wang

We conduct a general convergence analysis to quantitatively capture the influence of aggregation weight and communication error on the MIMO OA-DFL performance in \emph{ad hoc} networks.

Distributed Optimization Federated Learning

Paper
Add Code

X-Transfer: A Transfer Learning-Based Framework for GAN-Generated Fake Image Detection

no code implementations • 7 Oct 2023 • Lei Zhang, Hao Chen, Shu Hu, Bin Zhu, Ching Sheng Lin, Xi Wu, Jinrong Hu, Xin Wang

Generative adversarial networks (GANs) have remarkably advanced in diverse domains, especially image generation and editing.

Fake Image Detection Image Generation +1

Paper
Add Code

Controlling Neural Style Transfer with Deep Reinforcement Learning

no code implementations • 30 Sep 2023 • Chengming Feng, Jing Hu, Xin Wang, Shu Hu, Bin Zhu, Xi Wu, Hongtu Zhu, Siwei Lyu

Controlling the degree of stylization in the Neural Style Transfer (NST) is a little tricky since it usually needs hand-engineering on hyper-parameters.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Improving Cross-dataset Deepfake Detection with Deep Information Decomposition

no code implementations • 30 Sep 2023 • Shanmin Yang, Shu Hu, Bin Zhu, Ying Fu, Siwei Lyu, Xi Wu, Xin Wang

Deepfake technology poses a significant threat to security and social trust.

DeepFake Detection Face Swapping

Paper
Add Code

HoloAssist: an Egocentric Human Interaction Dataset for Interactive AI Assistants in the Real World

no code implementations • ICCV 2023 • Xin Wang, Taein Kwon, Mahdi Rad, Bowen Pan, Ishani Chakraborty, Sean Andrist, Dan Bohus, Ashley Feniello, Bugra Tekin, Felipe Vieira Frujeri, Neel Joshi, Marc Pollefeys

Building an interactive AI assistant that can perceive, reason, and collaborate with humans in the real world has been a long-standing pursuit in the AI community.

Mistake Detection Mixed Reality +1

Paper
Add Code

Collaborative Watermarking for Adversarial Speech Synthesis

no code implementations • 26 Sep 2023 • Lauri Juvela, Xin Wang

Advances in neural speech synthesis have brought us technology that is not only close to human naturalness, but is also capable of instant voice cloning with little data, and is highly accessible with pre-trained models available.

Speaker Verification Speech Synthesis +2

Paper
Add Code

Statistical Analysis of Quantum State Learning Process in Quantum Neural Networks

1 code implementation • NeurIPS 2023 • Hao-Kai Zhang, Chenghong Zhu, Mingrui Jing, Xin Wang

As a quantum analog of probability distribution learning, quantum state learning is theoretically and practically essential in quantum machine learning.

Quantum Machine Learning

Paper
Code

Image-to-Image Translation with Deep Reinforcement Learning

1 code implementation • 24 Sep 2023 • Xin Wang, Ziwei Luo, Jing Hu, Chengming Feng, Shu Hu, Bin Zhu, Xi Wu, Xin Li, Siwei Lyu

The key feature in the RL-I2IT framework is to decompose a monolithic learning process into small steps with a lightweight model to progressively transform a source image successively to a target image.

Auxiliary Learning Decision Making +3

Paper
Code

WiCV@CVPR2023: The Eleventh Women In Computer Vision Workshop at the Annual CVPR Conference

no code implementations • 22 Sep 2023 • Doris Antensteiner, Marah Halawa, Asra Aslam, Ivaxi Sheth, Sachini Herath, Ziqi Huang, Sunnie S. Y. Kim, Aparna Akula, Xin Wang

In this paper, we present the details of Women in Computer Vision Workshop - WiCV 2023, organized alongside the hybrid CVPR 2023 in Vancouver, Canada.

Paper
Add Code

On-the-Fly SfM: What you capture is What you get

1 code implementation • 21 Sep 2023 • Zongqian Zhan, Rui Xia, Yifei Yu, Yibo Xu, Xin Wang

Over the last decades, ample achievements have been made on Structure from motion (SfM).

Image Registration Image Retrieval +1

Paper
Code

For A More Comprehensive Evaluation of 6DoF Object Pose Tracking

no code implementations • 14 Sep 2023 • Yang Li, Fan Zhong, Xin Wang, Shuangbing Song, Jiachen Li, Xueying Qin, Changhe Tu

The limitations of previous scoring methods and error metrics are analyzed, based on which we introduce our improved evaluation methods.

Pose Tracking

Paper
Add Code

Can large-scale vocoded spoofed data improve speech spoofing countermeasure with a self-supervised front end?

1 code implementation • 12 Sep 2023 • Xin Wang, Junichi Yamagishi

While many datasets use spoofed data generated by speech synthesis systems, it was recently found that data vocoded by neural vocoders were also effective as the spoofed training data.

Self-Supervised Learning Speech Synthesis

294

Paper
Code

Outlier Robust Adversarial Training

1 code implementation • 10 Sep 2023 • Shu Hu, Zhenhuan Yang, Xin Wang, Yiming Ying, Siwei Lyu

Theoretically, we show that the learning objective of ORAT satisfies the $\mathcal{H}$-consistency in binary classification, which establishes it as a proper surrogate to adversarial 0/1 loss.

Adversarial Attack Binary Classification

Paper
Code

Control-Oriented Modeling and Layer-to-Layer Spatial Control of Powder Bed Fusion Processes

no code implementations • 8 Sep 2023 • Xin Wang, Bumsoo Park, Robert G. Landers, Sandipan Mishra, Douglas A. Bristow

However, due to inherent process variability, it is still very costly and time consuming to certify the process and the part.

Paper
Add Code

DRAG: Divergence-based Adaptive Aggregation in Federated learning on Non-IID Data

no code implementations • 4 Sep 2023 • Feng Zhu, Jingjing Zhang, Shengyun Liu, Xin Wang

Local stochastic gradient descent (SGD) is a fundamental approach in achieving communication efficiency in Federated Learning (FL) by allowing individual workers to perform local updates.

Federated Learning

Paper
Add Code

BenchTemp: A General Benchmark for Evaluating Temporal Graph Neural Networks

1 code implementation • 31 Aug 2023 • Qiang Huang, Jiawei Jiang, Xi Susie Rao, Ce Zhang, Zhichao Han, Zitao Zhang, Xin Wang, Yongjun He, Quanqing Xu, Yang Zhao, Chuang Hu, Shuo Shang, Bo Du

To handle graphs in which features or connectivities are evolving over time, a series of temporal graph neural networks (TGNNs) have been proposed.

Link Prediction Node Classification

Paper
Code

Graph Meets LLMs: Towards Large Graph Models

1 code implementation • 28 Aug 2023 • Ziwei Zhang, Haoyang Li, Zeyang Zhang, Yijian Qin, Xin Wang, Wenwu Zhu

In order to promote applying large models for graphs forward, we present a perspective paper to discuss the challenges and opportunities associated with developing large graph models.

208

Paper
Code

A Survey on Fairness in Large Language Models

no code implementations • 20 Aug 2023 • Yingji Li, Mengnan Du, Rui Song, Xin Wang, Ying Wang

Large Language Models (LLMs) have shown powerful performance and development prospects and are widely deployed in the real world.

Fairness

Paper
Add Code

Unsupervised Multiplex Graph Learning with Complementary and Consistent Information

1 code implementation • 3 Aug 2023 • Liang Peng, Xin Wang, Xiaofeng Zhu

Unsupervised multiplex graph learning (UMGL) has been shown to achieve significant effectiveness for different downstream tasks by exploring both complementary information and consistent information among multiple graphs.

Graph Learning Representation Learning

Paper
Code

SphereNet: Learning a Noise-Robust and General Descriptor for Point Cloud Registration

no code implementations • 18 Jul 2023 • Guiyu Zhao, Zhentao Guo, Xin Wang, Hongbin Ma

However, most methods are susceptible to noise and have poor generalization ability on unseen datasets.

Point Cloud Registration

Paper
Add Code

Mixed-Precision Quantization with Cross-Layer Dependencies

no code implementations • 11 Jul 2023 • Zihao Deng, Xin Wang, Sayeh Sharify, Michael Orshansky

Quantization assigning the same bit-width to all layers leads to large accuracy degradation at low precision and is wasteful at high precision settings.

Quantization

Paper
Add Code

DisAsymNet: Disentanglement of Asymmetrical Abnormality on Bilateral Mammograms using Self-adversarial Learning

no code implementations • 6 Jul 2023 • Xin Wang, Tao Tan, Yuan Gao, Luyi Han, Tianyu Zhang, Chunyao Lu, Regina Beets-Tan, Ruisheng Su, Ritse Mann

The question of 'what the symmetrical Bi-MG would look like when the asymmetrical abnormalities have been removed ?'

Anatomy Disentanglement

Paper
Add Code

Unlocking the Potential of Deep Learning in Peak-Hour Series Forecasting

1 code implementation • 4 Jul 2023 • Zhenwei Zhang, Xin Wang, Jingyuan Xie, Heling Zhang, Yuantao Gu

Unlocking the potential of deep learning in Peak-Hour Series Forecasting (PHSF) remains a critical yet underexplored task in various domains.

Time Series Time Series Forecasting

Paper
Code

Prompt Tuning Pushes Farther, Contrastive Learning Pulls Closer: A Two-Stage Approach to Mitigate Social Biases

no code implementations • 4 Jul 2023 • Yingji Li, Mengnan Du, Xin Wang, Ying Wang

Meanwhile, experimental results on the GLUE benchmark show that CCPA retains the language modeling capability of PLMs.

Contrastive Learning counterfactual +2

Paper
Add Code

An Explainable Deep Framework: Towards Task-Specific Fusion for Multi-to-One MRI Synthesis

1 code implementation • 3 Jul 2023 • Luyi Han, Tianyu Zhang, Yunzhi Huang, Haoran Dou, Xin Wang, Yuan Gao, Chunyao Lu, Tan Tao, Ritse Mann

Multi-sequence MRI is valuable in clinical settings for reliable diagnosis and treatment prognosis, but some sequences may be unusable or missing for various reasons.

Paper
Code

Synthesis of Contrast-Enhanced Breast MRI Using Multi-b-Value DWI-based Hierarchical Fusion Network with Attention Mechanism

1 code implementation • 3 Jul 2023 • Tianyu Zhang, Luyi Han, Anna D'Angelo, Xin Wang, Yuan Gao, Chunyao Lu, Jonas Teuwen, Regina Beets-Tan, Tao Tan, Ritse Mann

DWIs with different b-values are fused to efficiently utilize the difference features of DWIs.

Breast Cancer Detection

Paper
Code

Over-The-Air Federated Learning: Status Quo, Open Challenges, and Future Directions

no code implementations • 3 Jul 2023 • Bingnan Xiao, Xichen Yu, Wei Ni, Xin Wang, H. Vincent Poor

The development of applications based on artificial intelligence and implemented over wireless networks is increasingly rapidly and is expected to grow dramatically in the future.

Federated Learning

Paper
Add Code

Textbooks Are All You Need

no code implementations • 20 Jun 2023 • Suriya Gunasekar, Yi Zhang, Jyoti Aneja, Caio César Teodoro Mendes, Allie Del Giorno, Sivakanth Gopi, Mojan Javaheripi, Piero Kauffmann, Gustavo de Rosa, Olli Saarikivi, Adil Salim, Shital Shah, Harkirat Singh Behl, Xin Wang, Sébastien Bubeck, Ronen Eldan, Adam Tauman Kalai, Yin Tat Lee, Yuanzhi Li

Despite this small scale, phi-1 attains pass@1 accuracy 50. 6% on HumanEval and 55. 5% on MBPP.

Ranked #41 on Code Generation on HumanEval

Code Generation Language Modelling +1

Paper
Add Code

Efficient Search and Detection of Relevant Plant Parts using Semantics-Aware Active Vision

no code implementations • 16 Jun 2023 • Akshay K. Burusa, Joost Scholten, David Rapado Rincon, Xin Wang, Eldert J. van Henten, Gert Kootstra

To automate harvesting and de-leafing of tomato plants using robots, it is important to search and detect the relevant plant parts, namely tomatoes, peduncles, and petioles.

Paper
Add Code

Towards single integrated spoofing-aware speaker verification embeddings

1 code implementation • 30 May 2023 • Sung Hwan Mun, Hye-jin Shim, Hemlata Tak, Xin Wang, Xuechen Liu, Md Sahidullah, Myeonghun Jeong, Min Hyun Han, Massimiliano Todisco, Kong Aik Lee, Junichi Yamagishi, Nicholas Evans, Tomi Kinnunen, Nam Soo Kim, Jee-weon Jung

Second, competitive performance should be demonstrated compared to the fusion of automatic speaker verification (ASV) and countermeasure (CM) embeddings, which outperformed single embedding solutions by a large margin in the SASV2022 challenge.

Speaker Verification

Paper
Code

Controllable Text-to-Image Generation with GPT-4

no code implementations • 29 May 2023 • Tianjun Zhang, Yi Zhang, Vibhav Vineet, Neel Joshi, Xin Wang

Control-GPT works by querying GPT-4 to write TikZ code, and the generated sketches are used as references alongside the text instructions for diffusion models (e. g., ControlNet) to generate photo-realistic images.

Instruction Following Text-to-Image Generation

Paper
Add Code

Range-Based Equal Error Rate for Spoof Localization

1 code implementation • 28 May 2023 • Lin Zhang, Xin Wang, Erica Cooper, Nicholas Evans, Junichi Yamagishi

To properly measure misclassified ranges and better evaluate spoof localization performance, we upgrade point-based EER to range-based EER.

Paper
Code

Integrating Listwise Ranking into Pairwise-based Image-Text Retrieval

1 code implementation • 26 May 2023 • Zheng Li, Caili Guo, Xin Wang, Zerun Feng, Yanjun Wang

Given a query caption, the goal is to rank candidate images by relevance, from large to small.

Retrieval Text Retrieval

Paper
Code

Gorilla: Large Language Model Connected with Massive APIs

1 code implementation • 24 May 2023 • Shishir G. Patil, Tianjun Zhang, Xin Wang, Joseph E. Gonzalez

Large Language Models (LLMs) have seen an impressive wave of advances recently, with models now excelling in a variety of tasks, such as mathematical reasoning and program synthesis.

Hallucination Language Modelling +4

9,945

Paper
Code

TOAST: Transfer Learning via Attention Steering

1 code implementation • 24 May 2023 • Baifeng Shi, Siyu Gai, Trevor Darrell, Xin Wang

We introduce Top-Down Attention Steering (TOAST), a novel transfer learning algorithm that keeps the pre-trained backbone frozen, selects task-relevant features in the output, and feeds those features back to the model to steer the attention to the task-specific features.

Fine-Grained Image Classification Instruction Following +2

181

Paper
Code

Federated Learning Model Aggregation in Heterogenous Aerial and Space Networks

no code implementations • 24 May 2023 • Fan Dong, Ali Abbasi, Henry Leung, Xin Wang, Jiayu Zhou, Steve Drew

Direct sharing of the data distribution may be prohibitive due to the additional private information that is sent from the clients.

Federated Learning Privacy Preserving

Paper
Add Code

Improving Generalization Ability of Countermeasures for New Mismatch Scenario by Combining Multiple Advanced Regularization Terms

1 code implementation • Interspeech 2023 • Chang Zeng, Xin Wang, Xiaoxiao Miao, Erica Cooper, Junichi Yamagishi

The ability of countermeasure models to generalize from seen speech synthesis methods to unseen ones has been investigated in the ASVspoof challenge.

Speech Synthesis

Paper
Code

Efficient information recovery from Pauli noise via classical shadow

no code implementations • 6 May 2023 • Yifei Chen, Zhan Yu, Chenghong Zhu, Xin Wang

The rapid advancement of quantum computing has led to an extensive demand for effective techniques to extract classical information from quantum systems, particularly in fields like quantum machine learning and quantum chemistry.

Quantum Machine Learning

Paper
Add Code

DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven Text-to-Image Generation

1 code implementation • 5 May 2023 • Hong Chen, YiPeng Zhang, Simin Wu, Xin Wang, Xuguang Duan, Yuwei Zhou, Wenwu Zhu

To tackle the problems, we propose DisenBooth, an identity-preserving disentangled tuning framework for subject-driven text-to-image generation.

Denoising Disentanglement +1

Paper
Code

Clothes Grasping and Unfolding Based on RGB-D Semantic Segmentation

no code implementations • 5 May 2023 • Xingyu Zhu, Xin Wang, Jonathan Freer, Hyung Jin Chang, Yixing Gao

These methods often utilize physics engines to synthesize depth images to reduce the cost of real labeled data collection.

Data Augmentation Semantic Segmentation

Paper
Add Code

DELTA: Dynamic Embedding Learning with Truncated Conscious Attention for CTR Prediction

no code implementations • 3 May 2023 • Chen Zhu, Liang Du, Hong Chen, Shuang Zhao, Zixun Sun, Xin Wang, Wenwu Zhu

To tackle this problem, inspired by the Global Workspace Theory in conscious processing, which posits that only a specific subset of the product features are pertinent while the rest can be noisy and even detrimental to human-click behaviors, we propose a CTR model that enables Dynamic Embedding Learning with Truncated Conscious Attention for CTR prediction, termed DELTA.

Click-Through Rate Prediction

Paper
Add Code

Enhancing Video Super-Resolution via Implicit Resampling-based Alignment

1 code implementation • arXiv 2024 • Kai Xu, Ziwei Yu, Xin Wang, Michael Bi Mi, Angela Yao

We show that bilinear interpolation inherently attenuates high-frequency information while an MLP-based coordinate network can approximate more frequencies.

Ranked #1 on Video Super-Resolution on Vid4 - 4x upscaling

Video Super-Resolution

Paper
Code

JaxPruner: A concise library for sparsity research

1 code implementation • 27 Apr 2023 • Joo Hyung Lee, Wonpyo Park, Nicole Mitchell, Jonathan Pilault, Johan Obando-Ceron, Han-Byul Kim, Namhoon Lee, Elias Frantar, Yun Long, Amir Yazdanbakhsh, Shivani Agrawal, Suvinay Subramanian, Xin Wang, Sheng-Chun Kao, Xingyao Zhang, Trevor Gale, Aart Bik, Woohyun Han, Milen Ferev, Zhonglin Han, Hong-Seok Kim, Yann Dauphin, Gintare Karolina Dziugaite, Pablo Samuel Castro, Utku Evci

This paper introduces JaxPruner, an open-source JAX-based pruning and sparse training library for machine learning research.

196

Paper
Code

LayerNAS: Neural Architecture Search in Polynomial Complexity

no code implementations • 23 Apr 2023 • Yicheng Fan, Dana Alon, Jingyue Shen, Daiyi Peng, Keshav Kumar, Yun Long, Xin Wang, Fotis Iliopoulos, Da-Cheng Juan, Erik Vee

For a model architecture with $L$ layers, we perform layerwise-search for each layer, selecting from a set of search options $\mathbb{S}$.

Ranked #1 on Neural Architecture Search on NATS-Bench Topology, CIFAR-100

Combinatorial Optimization Neural Architecture Search

Paper
Add Code

Harnessing the Power of Text-image Contrastive Models for Automatic Detection of Online Misinformation

no code implementations • 19 Apr 2023 • Hao Chen, Peng Zheng, Xin Wang, Shu Hu, Bin Zhu, Jinrong Hu, Xi Wu, Siwei Lyu

As growing usage of social media websites in the recent decades, the amount of news articles spreading online rapidly, resulting in an unprecedented scale of potentially fraudulent information.

Contrastive Learning Misinformation +1

Paper
Add Code

A Clustering Framework for Unsupervised and Semi-supervised New Intent Discovery

1 code implementation • 16 Apr 2023 • Hanlei Zhang, Hua Xu, Xin Wang, Fei Long, Kai Gao

New intent discovery is of great value to natural language processing, allowing for a better understanding of user needs and providing friendly services.

Clustering Intent Discovery +3

177

Paper
Code

Arbitrary Reduction of MRI Inter-slice Spacing Using Hierarchical Feature Conditional Diffusion

no code implementations • 16 Apr 2023 • Xin Wang, Zhenrong Shen, Zhiyun Song, Sheng Wang, Mengjun Liu, Lichi Zhang, Kai Xuan, Qian Wang

Magnetic resonance (MR) images collected in 2D scanning protocols typically have large inter-slice spacing, resulting in high in-plane resolution but reduced through-plane resolution.

Super-Resolution

Paper
Add Code

Adversarially Robust Neural Architecture Search for Graph Neural Networks

no code implementations • CVPR 2023 • Beini Xie, Heng Chang, Ziwei Zhang, Xin Wang, Daixin Wang, Zhiqiang Zhang, Rex Ying, Wenwu Zhu

To tackle these challenges, we propose a novel Robust Neural Architecture search framework for GNNs (G-RNA).

Neural Architecture Search Robust Design

Paper
Add Code

CIMI4D: A Large Multimodal Climbing Motion Dataset under Human-scene Interactions

no code implementations • CVPR 2023 • Ming Yan, Xin Wang, Yudi Dai, Siqi Shen, Chenglu Wen, Lan Xu, Yuexin Ma, Cheng Wang

The core of this dataset is a blending optimization process, which corrects for the pose as it drifts and is affected by the magnetic conditions.

Pose Prediction

Paper
Add Code

Domain Adaptive Semantic Segmentation by Optimal Transport

no code implementations • 29 Mar 2023 • Yaqian Guo, Xin Wang, Ce Li, Shihui Ying

Second, we utilize OT to achieve a more robust alignment of source and target domains in output space, where the OT plan defines a well attention mechanism to improve the adaptation of the model.

Autonomous Driving Domain Adaptation +2

Paper
Add Code

Top-Down Visual Attention from Analysis by Synthesis

1 code implementation • CVPR 2023 • Baifeng Shi, Trevor Darrell, Xin Wang

In this paper, we consider top-down attention from a classic Analysis-by-Synthesis (AbS) perspective of vision.

Retrieval Semantic Segmentation +1

157

Paper
Code

Damage detection of high-speed railway box girder using train-induced dynamic responses

no code implementations • 23 Mar 2023 • Xin Wang, Yi Zhuo, Shunlong Li

This paper proposes a damage detection method based on the train-induced responses of high-speed railway box girder.

Paper
Add Code

A Survey of Graph Prompting Methods: Techniques, Applications, and Challenges

no code implementations • 13 Mar 2023 • Xuansheng Wu, Kaixiong Zhou, Mingchen Sun, Xin Wang, Ninghao Liu

In particular, we introduce the basic concepts of graph prompt learning, organize the existing work of designing graph prompting functions, and describe their applications and future challenges.

Paper
Add Code

Optimal Beamforming for MIMO DFRC Systems with Transmit Covariance Constraints

no code implementations • 6 Mar 2023 • Chenhao Yang, Xin Wang, Wei Ni, Yi Jiang

Under this approach, we reveal that the optimal receive beamforming is given by the classic MMSE one and the optimal transmit beamforming design amounts to solving an orthogonal Procrustes problem, thereby allowing for closed-form solutions to subproblems in each BCD step and fast convergence of the proposed algorithm to a high-quality (near-optimal) overall beamforming design.

Paper
Add Code

Selectively Hard Negative Mining for Alleviating Gradient Vanishing in Image-Text Matching

no code implementations • 1 Mar 2023 • Zheng Li, Caili Guo, Xin Wang, Zerun Feng, Zhongtian Du

To alleviate the gradient vanishing problem, we propose a Selectively Hard Negative Mining (SelHN) strategy, which chooses whether to mine hard negative samples according to the gradient vanishing condition.

Image-text matching Text Matching

Paper
Add Code

RIS-Assisted Jamming Rejection and Path Planning for UAV-Borne IoT Platform: A New Deep Reinforcement Learning Framework

no code implementations • 10 Feb 2023 • Shuyan Hu, Xin Yuan, Wei Ni, Xin Wang, Abbas Jamalipour

This paper presents a new deep reinforcement learning (DRL)-based approach to the trajectory planning and jamming rejection of an unmanned aerial vehicle (UAV) for the Internet-of-Things (IoT) applications.

Trajectory Planning

Paper
Add Code

Unsupervised Deep Learning for IoT Time Series

no code implementations • 7 Feb 2023 • Ya Liu, Yingjie Zhou, Kai Yang, Xin Wang

IoT time series analysis has found numerous applications in a wide variety of areas, ranging from health informatics to network security.

Clustering Representation Learning +3

Paper
Add Code

Curriculum Graph Machine Learning: A Survey

no code implementations • 6 Feb 2023 • Haoyang Li, Xin Wang, Wenwu Zhu

To the best of our knowledge, this paper is the first survey for curriculum graph machine learning.

Model Optimization

Paper
Add Code

IMPORTANT-Net: Integrated MRI Multi-Parameter Reinforcement Fusion Generator with Attention Network for Synthesizing Absent Data

1 code implementation • 3 Feb 2023 • Tianyu Zhang, Tao Tan, Luyi Han, Xin Wang, Yuan Gao, Jonas Teuwen, Regina Beets-Tan, Ritse Mann

Then the multi-parameter fusion with attention module enables the interaction of the encoded information from different parameters through a set of algorithmic strategies, and applies different weights to the information through the attention mechanism after information fusion to obtain refined representation information.

Lesion Classification Lesion Detection

Paper
Code

Synthesis-based Imaging-Differentiation Representation Learning for Multi-Sequence 3D/4D MRI

1 code implementation • 1 Feb 2023 • Luyi Han, Tao Tan, Tianyu Zhang, Yunzhi Huang, Xin Wang, Yuan Gao, Jonas Teuwen, Ritse Mann

Multi-sequence MRIs can be necessary for reliable diagnosis in clinical practice due to the complimentary information within sequences.

Representation Learning

Paper
Code

The Power of External Memory in Increasing Predictive Model Capacity

no code implementations • 31 Jan 2023 • Cenk Baykal, Dylan J Cutler, Nishanth Dikkala, Nikhil Ghosh, Rina Panigrahy, Xin Wang

One way of introducing sparsity into deep networks is by attaching an external table of parameters that is sparsely looked up at different layers of the network.

Language Modelling

Paper
Add Code

Attacking Important Pixels for Anchor-free Detectors

no code implementations • 26 Jan 2023 • Yunxu Xie, Shu Hu, Xin Wang, Quanyu Liao, Bin Zhu, Xi Wu, Siwei Lyu

Existing adversarial attacks on object detection focus on attacking anchor-based detectors, which may not work well for anchor-free detectors.

Adversarial Attack object-detection +2

Paper
Add Code

Decouple Before Interact: Multi-Modal Prompt Learning for Continual Visual Question Answering

no code implementations • ICCV 2023 • Zi Qian, Xin Wang, Xuguang Duan, Pengda Qin, Yuhong Li, Wenwu Zhu

Based on our formulation, we further propose MulTi-Modal PRompt LearnIng with DecouPLing bEfore InTeraction (TRIPLET), a novel approach that builds on a pre-trained vision-language model and consists of decoupled prompts and prompt interaction strategies to capture the complex interactions between modalities.

Continual Learning Language Modelling +2

Paper
Add Code

You Do Not Need Additional Priors or Regularizers in Retinex-Based Low-Light Image Enhancement

no code implementations • CVPR 2023 • Huiyuan Fu, Wenkai Zheng, Xiangyu Meng, Xin Wang, Chuanming Wang, Huadong Ma

The Retinex-based methods require decomposing the image into reflectance and illumination components, which is a highly ill-posed problem and there is no available ground truth.

Contrastive Learning Low-Light Image Enhancement +1

Paper
Add Code

HDG-ODE: A Hierarchical Continuous-Time Model for Human Pose Forecasting

1 code implementation • ICCV 2023 • Yucheng Xing, Xin Wang

Considering the structural-property of the skeleton data in representing human poses and the possible irregularity caused by occlusion, we propose the use of dynamic graph convolution as the basic operator.

Human Pose Forecasting

Paper
Code

Understanding Zero-Shot Adversarial Robustness for Large-Scale Models

2 code implementations • 14 Dec 2022 • Chengzhi Mao, Scott Geng, Junfeng Yang, Xin Wang, Carl Vondrick

We apply this training loss to two adaption methods, model finetuning and visual prompt tuning.

Adversarial Robustness Contrastive Learning +1

Paper
Code

Doubly Right Object Recognition: A Why Prompt for Visual Rationales

1 code implementation • CVPR 2023 • Chengzhi Mao, Revant Teotia, Amrutha Sundar, Sachit Menon, Junfeng Yang, Xin Wang, Carl Vondrick

We propose a ``doubly right'' object recognition benchmark, where the metric requires the model to simultaneously produce both the right labels as well as the right rationales.

Object Recognition

Paper
Code

Hiding speaker's sex in speech using zero-evidence speaker representation in an analysis/synthesis pipeline

1 code implementation • 29 Nov 2022 • Paul-Gauthier Noé, Xiaoxiao Miao, Xin Wang, Junichi Yamagishi, Jean-François Bonastre, Driss Matrouf

The use of modern vocoders in an analysis/synthesis pipeline allows us to investigate high-quality voice conversion that can be used for privacy purposes.

Voice Conversion

Paper
Code

Object Detection in Foggy Scenes by Embedding Depth and Reconstruction into Domain Adaptation

1 code implementation • 24 Nov 2022 • Xin Yang, Michael Bi Mi, Yuan Yuan, Xin Wang, Robby T. Tan

In our DA framework, we retain the depth and background information during the domain feature alignment.

Domain Adaptation Object +2

Paper
Code

Disentangled Representation Learning

no code implementations • 21 Nov 2022 • Xin Wang, Hong Chen, Si'ao Tang, Zihao Wu, Wenwu Zhu

Disentangled Representation Learning (DRL) aims to learn a model capable of identifying and disentangling the underlying factors hidden in the observable data in representation form.

Representation Learning

Paper
Add Code

FedSiam-DA: Dual-aggregated Federated Learning via Siamese Network under Non-IID Data

no code implementations • 17 Nov 2022 • Ming Yang, Yanhan Wang, Xin Wang, Zhenyong Zhang, Xiaoming Wu, Peng Cheng

Federated learning is a distributed learning that allows each client to keep the original data locally and only upload the parameters of the local model to the server.

Contrastive Learning Federated Learning

Paper
Add Code

Super-resolution Reconstruction of Single Image for Latent features

no code implementations • 16 Nov 2022 • Xin Wang, Jing-Ke Yan, Jing-Ye Cai, Jian-Hua Deng, Qin Qin, Yao Cheng

Single-image super-resolution (SISR) typically focuses on restoring various degraded low-resolution (LR) images to a single high-resolution (HR) image.

Denoising Image Reconstruction +2

Paper
Add Code

Shared Loss between Generators of GANs

no code implementations • 14 Nov 2022 • Xin Wang

Traditional GANs fall prey to the mode collapse problem, which means that they are unable to generate the different variations of data present in the input dataset.

Paper
Add Code

LiDAL: Inter-frame Uncertainty Based Active Learning for 3D LiDAR Semantic Segmentation

1 code implementation • 11 Nov 2022 • Zeyu Hu, Xuyang Bai, Runze Zhang, Xin Wang, Guangyuan Sun, Hongbo Fu, Chiew-Lan Tai

We propose LiDAL, a novel active learning method for 3D LiDAR semantic segmentation by exploiting inter-frame uncertainty among LiDAR frames.

Active Learning LIDAR Semantic Segmentation +1

Paper
Code

Large Language Models with Controllable Working Memory

no code implementations • 9 Nov 2022 • Daliang Li, Ankit Singh Rawat, Manzil Zaheer, Xin Wang, Michal Lukasik, Andreas Veit, Felix Yu, Sanjiv Kumar

By contrast, when the context is irrelevant to the task, the model should ignore it and fall back on its internal knowledge.

counterfactual World Knowledge

Paper
Add Code

Lightweight Neural Network with Knowledge Distillation for CSI Feedback

no code implementations • 31 Oct 2022 • Yiming Cui, Jiajia Guo, Zheng Cao, Huaze Tang, Chao-Kai Wen, Shi Jin, Xin Wang, Xiaolin Hou

Firstly, an autoencoder KD-based method is introduced by training a student autoencoder to mimic the reconstructed CSI of a pretrained teacher autoencoder.

Knowledge Distillation

Paper
Add Code

Detection of Real-time DeepFakes in Video Conferencing with Active Probing and Corneal Reflection

no code implementations • 21 Oct 2022 • Hui Guo, Xin Wang, Siwei Lyu

Specifically, we authenticate video calls by displaying a distinct pattern on the screen and using the corneal reflection extracted from the images of the call participant's face.

Paper
Add Code

RMBench: Benchmarking Deep Reinforcement Learning for Robotic Manipulator Control

1 code implementation • 20 Oct 2022 • Yanfei Xiang, Xin Wang, Shu Hu, Bin Zhu, Xiaomeng Huang, Xi Wu, Siwei Lyu

Reinforcement learning is applied to solve actual complex tasks from high-dimensional, sensory inputs.

Benchmarking Data Augmentation +2

Paper
Code

Spoofed training data for speech spoofing countermeasure can be efficiently created using neural vocoders

1 code implementation • 19 Oct 2022 • Xin Wang, Junichi Yamagishi

To make better use of pairs of bona fide and spoofed data, this study introduces a contrastive feature loss that can be plugged into the standard training criterion.

294

Paper
Code

AdaGCL: Adaptive Subgraph Contrastive Learning to Generalize Large-scale Graph Training

1 code implementation • ACM International Conference on Information & Knowledge Management (CIKM) 2022 • Yili Wang, Kaixiong Zhou, Rui Miao, Ninghao Liu, Xin Wang

To bridge the gap between large-scale graph training and contrastive learning, we propose adaptive subgraph contrastive learning (AdaGCL).

Contrastive Learning Data Augmentation +1

Paper
Code

InFIP: An Explainable DNN Intellectual Property Protection Method based on Intrinsic Features

no code implementations • 14 Oct 2022 • Mingfu Xue, Xin Wang, Yinghao Wu, Shifeng Ni, Yushu Zhang, Weiqiang Liu

Since the intrinsic feature is composed of unique interpretation of the model's decision, the intrinsic feature can be regarded as fingerprint of the model.

Explainable artificial intelligence

Paper
Add Code

GGViT:Multistream Vision Transformer Network in Face2Face Facial Reenactment Detection

no code implementations • 12 Oct 2022 • Haotian Wu, Peipei Wang, Xin Wang, Ji Xiang, Rui Gong

The compression of videos on social media has destroyed some pixel details that could be used to detect forgeries.

Paper
Add Code

Block Format Error Bounds and Optimal Block Size Selection

no code implementations • 11 Oct 2022 • Ilya Soloveychik, Ilya Lyubomirsky, Xin Wang, Sudeep Bhoja

This measure allows us to determine the optimal parameters, such as the block size, yielding highest accuracy.

Paper
Add Code

3D Matting: A Benchmark Study on Soft Segmentation Method for Pulmonary Nodules Applied in Computed Tomography

no code implementations • 11 Oct 2022 • Lin Wang, Xiufen Ye, Donghao Zhang, Wanji He, Lie Ju, Yi Luo, Huan Luo, Xin Wang, Wei Feng, Kaimin Song, Xin Zhao, ZongYuan Ge

In this work, we introduce the image matting into the 3D scenes and use the alpha matte, i. e., a soft mask, to describe lesions in a 3D medical image.

Binarization Image Matting

Paper
Add Code

STSyn: Speeding Up Local SGD with Straggler-Tolerant Synchronization

no code implementations • 6 Oct 2022 • Feng Zhu, Jingjing Zhang, Xin Wang

Synchronous local stochastic gradient descent (local SGD) suffers from some workers being idle and random delays due to slow and straggling workers, as it waits for the workers to complete the same amount of local updates.

Paper
Add Code

Attention Augmented ConvNeXt UNet For Rectal Tumour Segmentation

no code implementations • 1 Oct 2022 • Hongwei Wu, Junlin Wang, Xin Wang, Hui Nan, Yaxin Wang, Haonan Jing, Kaixuan Shi

It is a challenge to segment the location and size of rectal cancer tumours through deep learning.

Image Segmentation Segmentation +1

Paper
Add Code

Unified Loss of Pair Similarity Optimization for Vision-Language Retrieval

no code implementations • 28 Sep 2022 • Zheng Li, Caili Guo, Xin Wang, Zerun Feng, Jenq-Neng Hwang, Zhongtian Du

More specifically, Triplet loss with Hard Negative mining (Triplet-HN), which is widely used in existing retrieval models to improve the discriminative ability, is easy to fall into local minima in training.

Contrastive Learning Retrieval +2

Paper
Add Code

3D Matting: A Soft Segmentation Method Applied in Computed Tomography

no code implementations • 16 Sep 2022 • Lin Wang, Xiufen Ye, Donghao Zhang, Wanji He, Lie Ju, Xin Wang, Wei Feng, Kaimin Song, Xin Zhao, ZongYuan Ge

It can be caused by many factors, such as the imaging properties, pathological anatomy, and the weak representation of the binary masks, which brings challenges to accurate 3D segmentation.

Anatomy Image Matting

Paper
Add Code

MIntRec: A New Dataset for Multimodal Intent Recognition

1 code implementation • 9 Sep 2022 • Hanlei Zhang, Hua Xu, Xin Wang, Qianrui Zhou, Shaojie Zhao, Jiayan Teng

This paper introduces a novel dataset for multimodal intent recognition (MIntRec) to address this issue.

Ranked #1 on Multimodal Intent Recognition on MIntRec

Multimodal Intent Recognition

Paper
Code

Joint Speaker Encoder and Neural Back-end Model for Fully End-to-End Automatic Speaker Verification with Multiple Enrollment Utterances

no code implementations • 1 Sep 2022 • Chang Zeng, Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi

Conventional automatic speaker verification systems can usually be decomposed into a front-end model such as time delay neural network (TDNN) for extracting speaker embeddings and a back-end model such as statistics-based probabilistic linear discriminant analysis (PLDA) or neural network-based neural PLDA (NPLDA) for similarity scoring.

Data Augmentation Speaker Verification

Paper
Add Code

NeurIPS'22 Cross-Domain MetaDL competition: Design and baseline results

no code implementations • 31 Aug 2022 • Dustin Carrión-Ojeda, Hong Chen, Adrian El Baz, Sergio Escalera, Chaoyu Guan, Isabelle Guyon, Ihsan Ullah, Xin Wang, Wenwu Zhu

We present the design and baseline results for a new challenge in the ChaLearn meta-learning series, accepted at NeurIPS'22, focusing on "cross-domain" meta-learning.

Few-Shot Image Classification Few-Shot Learning +1

Paper
Add Code

NL2GDPR: Automatically Develop GDPR Compliant Android Application Features from Natural Language

no code implementations • 29 Aug 2022 • Faysal Hossain Shezan, Yingjie Lao, Minlong Peng, Xin Wang, Mingming Sun, Ping Li

At the core, NL2GDPR is a privacy-centric information extraction model, appended with a GDPR policy finder and a policy generator.

Paper
Add Code

Data-Driven Control of Distributed Event-Triggered Network Systems

no code implementations • 22 Aug 2022 • Xin Wang, Jian Sun, Gang Wang, Frank Allgöwer, Jie Chen

The present paper deals with data-driven event-triggered control of a class of unknown discrete-time interconnected systems (a. k. a.

Paper
Add Code

Context-Aware Streaming Perception in Dynamic Environments

1 code implementation • 16 Aug 2022 • Gur-Eyal Sela, Ionel Gog, Justin Wong, Kumar Krishna Agrawal, Xiangxi Mo, Sukrit Kalra, Peter Schafhalter, Eric Leong, Xin Wang, Bharathan Balaji, Joseph Gonzalez, Ion Stoica

These works evaluate accuracy offline, one image at a time.

Autonomous Driving

Paper
Code

Efficient Climate Simulation via Machine Learning Method

no code implementations • 15 Aug 2022 • Xin Wang, Wei Xue, Yilun Han, Guangwen Yang

We develop a user-friendly platform NeuroGCM for efficiently developing hybrid modeling in climate simulation.

Paper
Add Code

GPPT: Graph Pre-training and Prompt Tuning to Generalize Graph Neural Networks

1 code implementation • SIGKDD 2022 • Mingchen Sun, Kaixiong Zhou, Xin He, Ying Wang, Xin Wang

Based on the pre-trained model, we propose the graph prompting function to modify the standalone node into a token pair, and reformulate the downstream node classification looking the same as edge prediction.

Few-Shot Learning Node Classification +3

Paper
Code

Revisiting Adversarial Attacks on Graph Neural Networks for Graph Classification

no code implementations • 13 Aug 2022 • Xin Wang, Heng Chang, Beini Xie, Tian Bian, Shiji Zhou, Daixin Wang, Zhiqiang Zhang, Wenwu Zhu

Graph neural networks (GNNs) have achieved tremendous success in the task of graph classification and its diverse downstream real-world applications.

Graph Classification

Paper
Add Code

A Theoretical View on Sparsely Activated Networks

no code implementations • 8 Aug 2022 • Cenk Baykal, Nishanth Dikkala, Rina Panigrahy, Cyrus Rashtchian, Xin Wang

After representing LSH-based sparse networks with our model, we prove that sparse networks can match the approximation power of dense networks on Lipschitz functions.

Paper
Add Code

Event-triggered Consensus Control of Heterogeneous Multi-agent Systems: Model- and Data-based Analysis

no code implementations • 1 Aug 2022 • Xin Wang, Jian Sun, Gang Wang, Jie Chen

This article deals with model- and data-based consensus control of heterogenous leader-following multi-agent systems (MASs) under an event-triggering transmission scheme.

Paper
Add Code

Trajectory Planning of Cellular-Connected UAV for Communication-assisted Radar Sensing

no code implementations • 27 Jul 2022 • Shuyan Hu, Xin Yuan, Wei Ni, Xin Wang

Being a key technology for beyond fifth-generation wireless systems, joint communication and radar sensing (JCAS) utilizes the reflections of communication signals to detect foreign objects and deliver situational awareness.

Trajectory Planning

Paper
Add Code

Proving Common Mechanisms Shared by Twelve Methods of Boosting Adversarial Transferability

no code implementations • 24 Jul 2022 • Quanshi Zhang, Xin Wang, Jie Ren, Xu Cheng, Shuyun Lin, Yisen Wang, Xiangming Zhu

This paper summarizes the common mechanism shared by twelve previous transferability-boosting methods in a unified view, i. e., these methods all reduce game-theoretic interactions between regional adversarial perturbations.

Paper
Add Code

PanGu-Coder: Program Synthesis with Function-Level Language Modeling

1 code implementation • 22 Jul 2022 • Fenia Christopoulou, Gerasimos Lampouras, Milan Gritta, Guchun Zhang, Yinpeng Guo, Zhongqi Li, Qi Zhang, Meng Xiao, Bo Shen, Lin Li, Hao Yu, Li Yan, Pingyi Zhou, Xin Wang, Yuchi Ma, Ignacio Iacobacci, Yasheng Wang, Guangtai Liang, Jiansheng Wei, Xin Jiang, Qianxiang Wang, Qun Liu

We present PanGu-Coder, a pretrained decoder-only language model adopting the PanGu-Alpha architecture for text-to-code generation, i. e. the synthesis of programming language solutions given a natural language problem description.

Code Generation Language Modelling +2

Paper
Code

Neural-Sim: Learning to Generate Training Data with NeRF

1 code implementation • 22 Jul 2022 • Yunhao Ge, Harkirat Behl, Jiashu Xu, Suriya Gunasekar, Neel Joshi, Yale Song, Xin Wang, Laurent Itti, Vibhav Vineet

However, existing approaches either require human experts to manually tune each scene property or use automatic methods that provide little to no control; this requires rendering large amounts of random data variations, which is slow and is often suboptimal for the target domain.

Object Detection

154

Paper
Code

Scene Recognition with Objectness, Attribute and Category Learning

no code implementations • 20 Jul 2022 • Ji Zhang, Jean-Paul Ainam, Li-hui Zhao, Wenai Song, Xin Wang

Based on the complementarity of attribute and category labels, we propose a Multi-task Attribute-Scene Recognition (MASR) network which learns a category embedding and at the same time predicts scene attributes.

Attribute Scene Classification +1

Paper
Add Code

Rank-based Decomposable Losses in Machine Learning: A Survey

no code implementations • 18 Jul 2022 • Shu Hu, Xin Wang, Siwei Lyu

Following these categories, we review the literature on rank-based aggregate losses and rank-based individual losses.

BIG-bench Machine Learning

Paper
Add Code

Scaling Novel Object Detection with Weakly Supervised Detection Transformers

1 code implementation • 11 Jul 2022 • Tyler LaBonte, Yale Song, Xin Wang, Vibhav Vineet, Neel Joshi

A critical object detection task is finetuning an existing model to detect novel objects, but the standard workflow requires bounding box annotations which are time-consuming and expensive to collect.

Multiple Instance Learning Novel Object Detection +4

Paper
Code

Unsupervised Domain Adaptive Fundus Image Segmentation with Category-level Regularization

1 code implementation • 8 Jul 2022 • Wei Feng, Lin Wang, Lie Ju, Xin Zhao, Xin Wang, Xiaoyu Shi, ZongYuan Ge

Existing unsupervised domain adaptation methods based on adversarial learning have achieved good performance in several medical imaging tasks.

Image Segmentation Semantic Segmentation +1

Paper
Code

Enhanced brain structure-function tethering in transmodal cortex revealed by high-frequency eigenmodes

no code implementations • 7 Jul 2022 • Yaqian Yang, Zhiming Zheng, Longzhao Liu, Hongwei Zheng, Yi Zhen, Yi Zheng, Xin Wang, Shaoting Tang

Specifically, low-frequency eigenmodes, which are considered sufficient to capture the essence of the functional network, contribute little to functional connectivity reconstruction in transmodal regions, resulting in structure-function decoupling along the unimodal-transmodal gradient.

Paper
Add Code

NAS-Bench-Graph: Benchmarking Graph Neural Architecture Search

1 code implementation • 18 Jun 2022 • Yijian Qin, Ziwei Zhang, Xin Wang, Zeyang Zhang, Wenwu Zhu

To the best of our knowledge, our work is the first benchmark for graph neural architecture search.

Benchmarking Neural Architecture Search

Paper
Code

Concentration of Data Encoding in Parameterized Quantum Circuits

no code implementations • 16 Jun 2022 • Guangxi Li, Ruilin Ye, Xuanqiang Zhao, Xin Wang

This result in particular implies that the average encoded state will concentrate on the maximally mixed state at an exponential speed on depth.

Combinatorial Optimization

Paper
Add Code

Lessons learned from the NeurIPS 2021 MetaDL challenge: Backbone fine-tuning without episodic meta-learning dominates for few-shot learning image classification

no code implementations • 15 Jun 2022 • Adrian El Baz, Ihsan Ullah, Edesio Alcobaça, André C. P. L. F. Carvalho, Hong Chen, Fabio Ferreira, Henry Gouk, Chaoyu Guan, Isabelle Guyon, Timothy Hospedales, Shell Hu, Mike Huisman, Frank Hutter, Zhengying Liu, Felix Mohr, Ekrem Öztürk, Jan N. van Rijn, Haozhe Sun, Xin Wang, Wenwu Zhu

Although deep neural networks are capable of achieving performance superior to humans on various tasks, they are notorious for requiring large amounts of data and computing resources, restricting their success to domains where such resources are available.

Few-Shot Learning Image Classification +1

Paper
Add Code

A Comprehensive Survey on Deep Clustering: Taxonomy, Challenges, and Future Directions

1 code implementation • 15 Jun 2022 • Sheng Zhou, Hongjia Xu, Zhuonan Zheng, Jiawei Chen, Zhao Li, Jiajun Bu, Jia Wu, Xin Wang, Wenwu Zhu, Martin Ester

Motivated by the tremendous success of deep learning in clustering, one of the most fundamental machine learning tasks, and the large number of recent advances in this direction, in this paper we conduct a comprehensive survey on deep clustering by proposing a new taxonomy of different state-of-the-art approaches.

Clustering Deep Clustering +1

2,663

Paper
Code

Deep Learning-based Massive MIMO CSI Acquisition for 5G Evolution and 6G

no code implementations • 10 Jun 2022 • Xin Wang, Xiaolin Hou, Lan Chen, Yoshihisa Kishiyama, Takahiro Asai

Considering its large impact on air-interface design, it will be a candidate technology for 6th generation (6G) networks, in which an air interface designed by artificial intelligence can be used.

Paper
Add Code

BInGo: Bayesian Intrinsic Groupwise Registration via Explicit Hierarchical Disentanglement

no code implementations • 6 Jun 2022 • Xin Wang, Xinzhe Luo, Xiahai Zhuang

Multimodal groupwise registration aligns internal structures in a group of medical images.

Bayesian Inference Computational Efficiency +1

Paper
Add Code

Mitigating barren plateaus of variational quantum eigensolvers

no code implementations • 26 May 2022 • Xia Liu, Geng Liu, Jiaxin Huang, Hao-Kai Zhang, Xin Wang

Variational quantum algorithms (VQAs) are expected to establish valuable applications on near-term quantum computers.

Paper
Add Code

Spatial Attention-based Implicit Neural Representation for Arbitrary Reduction of MRI Slice Spacing

no code implementations • 23 May 2022 • Xin Wang, Sheng Wang, Honglin Xiong, Kai Xuan, Zixu Zhuang, Mengjun Liu, Zhenrong Shen, Xiangyu Zhao, Lichi Zhang, Qian Wang

Magnetic resonance (MR) images collected in 2D clinical protocols typically have large inter-slice spacing, resulting in high in-plane resolution and reduced through-plane resolution.

Computational Efficiency Super-Resolution

Paper
Add Code

Power and limitations of single-qubit native quantum neural networks

no code implementations • 16 May 2022 • Zhan Yu, Hongshun Yao, Mujin Li, Xin Wang

Quantum neural networks (QNNs) have emerged as a leading strategy to establish applications in machine learning, chemistry, and optimization.

Paper
Add Code

The VoicePrivacy 2020 Challenge Evaluation Plan

1 code implementation • 14 May 2022 • Natalia Tomashenko, Brij Mohan Lal Srivastava, Xin Wang, Emmanuel Vincent, Andreas Nautsch, Junichi Yamagishi, Nicholas Evans, Jose Patino, Jean-François Bonastre, Paul-Gauthier Noé, Massimiliano Todisco

The VoicePrivacy Challenge aims to promote the development of privacy preservation tools for speech technology by gathering a new community to define the tasks of interest and the evaluation methodology, and benchmarking solutions through a series of challenges.

Benchmarking

Paper
Code

Open-Eye: An Open Platform to Study Human Performance on Identifying AI-Synthesized Faces

no code implementations • 13 May 2022 • Hui Guo, Shu Hu, Xin Wang, Ming-Ching Chang, Siwei Lyu

In this work, we develop an online platform called Open-eye to study the human performance of AI-synthesized face detection.

Face Detection

Paper
Add Code

Quantum Self-Attention Neural Networks for Text Classification

1 code implementation • 11 May 2022 • Guangxi Li, Xuanqiang Zhao, Xin Wang

An emerging direction of quantum computing is to establish meaningful quantum applications in various fields of artificial intelligence, including natural language processing (NLP).

text-classification Text Classification

Paper
Code

An Edge-Cloud Integrated Framework for Flexible and Dynamic Stream Analytics

no code implementations • 10 May 2022 • Xin Wang, Azim Khan, Jianwu Wang, Aryya Gangopadhyay, Carl E. Busart, Jade Freeman

In this paper, we study how to best leverage edge and cloud resources to achieve better accuracy and latency for stream analytics using a type of RNN model called long short-term memory (LSTM).

Cloud Computing Edge-computing +3

Paper
Add Code

Fundamental limitations on optimization in variational quantum algorithms

no code implementations • 10 May 2022 • Hao-Kai Zhang, Chengkai Zhu, Geng Liu, Xin Wang

Exploring quantum applications of near-term quantum devices is a rapidly growing field of quantum information science with both theoretical and practical interests.

Paper
Add Code

CODE-MVP: Learning to Represent Source Code from Multiple Views with Contrastive Pre-Training

no code implementations • Findings (NAACL) 2022 • Xin Wang, Yasheng Wang, Yao Wan, Jiawei Wang, Pingyi Zhou, Li Li, Hao Wu, Jin Liu

Specifically, we first extract multiple code views using compiler tools, and learn the complementary information among them under a contrastive learning framework.

Contrastive Learning Defect Detection +2

Paper
Add Code

Receiver Design for MIMO Unsourced Random Access with SKP Coding

no code implementations • 30 Apr 2022 • Zeyu Han, Xiaojun Yuan, Chongbin Xu, Xin Wang

In this letter, we extend the sparse Kronecker-product (SKP) coding scheme, originally designed for the additive white Gaussian noise (AWGN) channel, to multiple input multiple output (MIMO) unsourced random access (URA).

Paper
Add Code

Visual Attention Emerges from Recurrent Sparse Reconstruction

1 code implementation • 23 Apr 2022 • Baifeng Shi, Yale Song, Neel Joshi, Trevor Darrell, Xin Wang

We present VARS, Visual Attention from Recurrent Sparse reconstruction, a new attention formulation built on two prominent features of the human visual attention mechanism: recurrency and sparsity.

Paper
Code

A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes

no code implementations • 13 Apr 2022 • Shaojin Ding, Weiran Wang, Ding Zhao, Tara N. Sainath, Yanzhang He, Robert David, Rami Botros, Xin Wang, Rina Panigrahy, Qiao Liang, Dongseong Hwang, Ian McGraw, Rohit Prabhavalkar, Trevor Strohman

In this paper, we propose a dynamic cascaded encoder Automatic Speech Recognition (ASR) model, which unifies models for different deployment scenarios.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

The PartialSpoof Database and Countermeasures for the Detection of Short Fake Speech Segments Embedded in an Utterance

no code implementations • 11 Apr 2022 • Lin Zhang, Xin Wang, Erica Cooper, Nicholas Evans, Junichi Yamagishi

Since the short spoofed speech segments to be embedded by attackers are of variable length, six different temporal resolutions are considered, ranging from as short as 20 ms to as large as 640 ms. Third, we propose a new CM that enables the simultaneous use of the segment-level labels at different temporal resolutions as well as utterance-level labels to execute utterance- and segment-level detection at the same time.

Speaker Verification Speech Synthesis +2

Paper
Add Code

Flexible Sampling for Long-tailed Skin Lesion Classification

no code implementations • 7 Apr 2022 • Lie Ju, Yicheng Wu, Lin Wang, Zhen Yu, Xin Zhao, Xin Wang, Paul Bonnington, ZongYuan Ge

To address this, in this paper, we propose a curriculum learning-based framework called Flexible Sampling for the long-tailed skin lesion classification task.

Classification Lesion Classification +1

Paper
Add Code

Learning to Solve Travelling Salesman Problem with Hardness-adaptive Curriculum

1 code implementation • 7 Apr 2022 • Zeyang Zhang, Ziwei Zhang, Xin Wang, Wenwu Zhu

To solve these challenges, we first propose a principled hardness measurement to quantify the hardness of TSP instances.

Combinatorial Optimization

Paper
Code

Investigating Active-learning-based Training Data Selection for Speech Spoofing Countermeasure

1 code implementation • 28 Mar 2022 • Xin Wang, Junich Yamagishi

This study took the initiative and investigated CM training using active learning (AL), a framework that iteratively selects useful data from a large pool set and fine-tunes the CM.

Active Learning Data Augmentation +1

294

Paper
Code

Energy-Efficient UAV-Mounted RIS Assisted Mobile Edge Computing

no code implementations • 24 Mar 2022 • Zhiyuan Zhai, Xinhong Dai, Bin Duo, Xin Wang, Xiaojun Yuan

Unmanned aerial vehicle (UAV) and reconfigurable intelligent surface (RIS) have been recently applied in the field of mobile edge computing (MEC) to improve the data exchange environment by proactively changing the wireless channels through maneuverable location deployment and intelligent signals reflection, respectively.

Edge-computing

Paper
Add Code

The VoicePrivacy 2022 Challenge Evaluation Plan

1 code implementation • 23 Mar 2022 • Natalia Tomashenko, Xin Wang, Xiaoxiao Miao, Hubert Nourtel, Pierre Champion, Massimiliano Todisco, Emmanuel Vincent, Nicholas Evans, Junichi Yamagishi, Jean-François Bonastre

Participants apply their developed anonymization systems, run evaluation scripts and submit objective evaluation results and anonymized speech data to the organizers.

Speaker Verification

Paper
Code

A Closer Look at Debiased Temporal Sentence Grounding in Videos: Dataset, Metric, and Approach

no code implementations • 10 Mar 2022 • Xiaohan Lan, Yitian Yuan, Xin Wang, Long Chen, Zhi Wang, Lin Ma, Wenwu Zhu

New benchmarking results indicate that our proposed evaluation protocols can better monitor the research progress.

Benchmarking Sentence +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.