Search Results for author: Yuxin Wang

Found 54 papers, 29 papers with code

A Survey on Game Playing Agents and Large Models: Methods, Applications, and Challenges

1 code implementation 15 Mar 2024 Xinrun Xu, Yuxin Wang, Chaoyi Xu, Ziluo Ding, Jiechuan Jiang, Zhiming Ding, Börje F. Karlsson

The swift evolution of Large-scale Models (LMs), either language-focused or multi-modal, has garnered extensive attention in both academia and industry.

Mixer is more than just a model

no code implementations 28 Feb 2024 Qingfeng Ji, Yuxin Wang, Letong Sun

In the field of computer vision, MLP-Mixer is noted for its ability to extract data information from both channel and token perspectives, effectively acting as a fusion of channel and token information.

Environmental Sound Classification, Speech Emotion Recognition

Angle Robustness Unmanned Aerial Vehicle Navigation in GNSS-Denied Scenarios

no code implementations 4 Feb 2024 Yuxin Wang, Zunlei Feng, Haofei Zhang, Yang Gao, Jie Lei, Li Sun, Mingli Song

Due to the inability to receive signals from the Global Navigation Satellite System (GNSS) in extreme conditions, achieving accurate and robust navigation for Unmanned Aerial Vehicles (UAVs) is a challenging task.

Sample-based Dynamic Hierarchical Transformer with Layer and Head Flexibility via Contextual Bandit

no code implementations 5 Dec 2023 Fanfei Meng, LeLe Zhang, Yu Chen, Yuxin Wang

Transformers require a fixed number of layers and heads, which makes them inflexible to the complexity of individual samples and expensive in training and inference.

Thompson Sampling

Optimizing the Passenger Flow for Airport Security Check

no code implementations 30 Nov 2023 Yuxin Wang, Fanfei Meng, Xiaotian Wang, Chaoyu Xie

To ensure the security of airports and flights, passengers are required to undergo a strict security check before boarding.

FedEmb: A Vertical and Hybrid Federated Learning Algorithm using Network And Feature Embedding Aggregation

no code implementations 30 Nov 2023 Fanfei Meng, LeLe Zhang, Yu Chen, Yuxin Wang

Federated learning (FL) is an emerging paradigm for decentralized training of machine learning models on distributed clients, without revealing the data to the central server.

Federated Learning, Privacy Preserving

Joint Detection Algorithm for Multiple Cognitive Users in Spectrum Sensing

no code implementations 30 Nov 2023 Fanfei Meng, Yuxin Wang, LeLe Zhang, Yingxin Zhao

This paper first introduces three common logical circuit decision criteria in hard decisions and analyzes their decision rigor.

Symmetrical Linguistic Feature Distillation with CLIP for Scene Text Recognition

1 code implementation 8 Oct 2023 Zixiao Wang, Hongtao Xie, Yuxin Wang, Jianjun Xu, Boqiang Zhang, Yongdong Zhang

In this paper, we explore the potential of the Contrastive Language-Image Pretraining (CLIP) model in scene text recognition (STR), and establish a novel Symmetrical Linguistic Feature Distillation framework (named CLIP-OCR) to leverage both visual and linguistic knowledge in CLIP.

Optical Character Recognition (OCR), Scene Text Recognition

Energy-efficient Integrated Sensing and Communication System and DNLFM Waveform

no code implementations 18 Sep 2023 Yihua Ma, Zhifeng Yuan, Shuqiang Xia, Chen Bai, Zhongbin Wang, Yuxin Wang

In this paper, the ISAC system employs dedicated sensing signals, which can be reused as the communication reference signal.

FusionAI: Decentralized Training and Deploying LLMs with Massive Consumer-Level GPUs

no code implementations 3 Sep 2023 Zhenheng Tang, Yuxin Wang, Xin He, Longteng Zhang, Xinglin Pan, Qiang Wang, Rongfei Zeng, Kaiyong Zhao, Shaohuai Shi, Bingsheng He, Xiaowen Chu

The rapid growth of memory and computation requirements of large language models (LLMs) has outpaced the development of hardware, hindering people who lack large-scale high-end GPUs from training or deploying LLMs.

Scheduling

Learning Unified Decompositional and Compositional NeRF for Editable Novel View Synthesis

no code implementations ICCV 2023 Yuxin Wang, Wayne Wu, Dan Xu

State-of-the-art methods in this direction typically consider building separate networks for these two tasks (i.e., view synthesis and editing).

Novel View Synthesis

From Hypergraph Energy Functions to Hypergraph Neural Networks

1 code implementation 16 Jun 2023 Yuxin Wang, Quan Gan, Xipeng Qiu, Xuanjing Huang, David Wipf

Hypergraphs are a powerful abstraction for representing higher-order interactions between entities of interest.

Bilevel Optimization, Node Classification

RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars

1 code implementation NeurIPS 2023 Dongwei Pan, Long Zhuo, Jingtan Piao, Huiwen Luo, Wei Cheng, Yuxin Wang, Siming Fan, Shengqi Liu, Lei Yang, Bo Dai, Ziwei Liu, Chen Change Loy, Chen Qian, Wayne Wu, Dahua Lin, Kwan-Yee Lin

It is a large-scale digital library for head avatars with three key attributes: 1) High Fidelity: all subjects are captured by 60 synchronized, high-resolution 2K cameras in 360 degrees.

2k, Image Matting

Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition

1 code implementation 9 May 2023 Boqiang Zhang, Hongtao Xie, Yuxin Wang, Jianjun Xu, Yongdong Zhang

Vision models have gained increasing attention due to their simplicity and efficiency in the Scene Text Recognition (STR) task.

Scene Text Recognition

FedML Parrot: A Scalable Federated Learning System via Heterogeneity-aware Scheduling on Sequential and Hierarchical Training

1 code implementation 3 Mar 2023 Zhenheng Tang, Xiaowen Chu, Ryan Yide Ran, Sunwoo Lee, Shaohuai Shi, Yonggang Zhang, Yuxin Wang, Alex Qiaozhong Liang, Salman Avestimehr, Chaoyang He

It improves the training efficiency, remarkably relaxes the requirements on the hardware, and supports efficient large-scale FL experiments with stateful clients by: (1) sequentially training clients on devices; (2) decomposing the original aggregation into local and global aggregation on devices and the server, respectively; (3) scheduling tasks to mitigate straggler problems and enhance computing utility; (4) using a distributed client state manager to support various FL algorithms.

Federated Learning, Scheduling

GAR: Generalized Autoregression for Multi-Fidelity Fusion

no code implementations 13 Jan 2023 Yuxin Wang, Zheng Xing, Wei W. Xing

In many scientific research and engineering applications where repeated simulations of complex systems are conducted, a surrogate is commonly adopted to quickly estimate the whole system.

Rethinking Label Smoothing on Multi-hop Question Answering

2 code implementations 19 Dec 2022 Zhangyue Yin, Yuxin Wang, Xiannian Hu, Yiguang Wu, Hang Yan, Xinyu Zhang, Zhao Cao, Xuanjing Huang, Xipeng Qiu

Multi-Hop Question Answering (MHQA) is a significant area in question answering, requiring multiple reasoning components, including document retrieval, supporting sentence prediction, and answer span extraction.

Image Classification, Machine Reading Comprehension

Open-world Story Generation with Structured Knowledge Enhancement: A Comprehensive Survey

no code implementations 9 Dec 2022 Yuxin Wang, Jieru Lin, Zhiwei Yu, Wei Hu, Börje F. Karlsson

Storytelling and narrative are fundamental to human experience, intertwined with our social and cultural engagement.

Story Generation

Exploring Stroke-Level Modifications for Scene Text Editing

1 code implementation 5 Dec 2022 Yadong Qu, Qingfeng Tan, Hongtao Xie, Jianjun Xu, Yuxin Wang, Yongdong Zhang

Moreover, two new datasets (Tamper-Syn2k and Tamper-Scene) are proposed to fill the gap in public evaluation datasets.

Attribute, Scene Text Editing

Lifelong Embedding Learning and Transfer for Growing Knowledge Graphs

1 code implementation 29 Nov 2022 Yuanning Cui, Yuxin Wang, Zequn Sun, Wenqiang Liu, Yiqiao Jiang, Kexin Han, Wei Hu

We consider knowledge transfer and retention of the learning on growing snapshots of a KG without having to learn embeddings from scratch.

Knowledge Graphs, Transfer Learning

NAS-LID: Efficient Neural Architecture Search with Local Intrinsic Dimension

1 code implementation 23 Nov 2022 Xin He, Jiangchao Yao, Yuxin Wang, Zhenheng Tang, Ka Chu Cheung, Simon See, Bo Han, Xiaowen Chu

One-shot neural architecture search (NAS) substantially improves the search efficiency by training one supernet to estimate the performance of every possible child architecture (i.e., subnet).

Neural Architecture Search

ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting

1 code implementation 19 Nov 2022 Shancheng Fang, Zhendong Mao, Hongtao Xie, Yuxin Wang, Chenggang Yan, Yongdong Zhang

In this paper, we argue that the limited capacity of language models comes from 1) implicit language modeling; 2) unidirectional feature representation; and 3) language model with noise input.

Blocking, Language Modelling

Discovering New Intents Using Latent Variables

no code implementations 21 Oct 2022 Yunhua Zhou, Peiju Liu, Yuxin Wang, Xipeng Qiu

In this paper, starting from the intuition that discovering intents could be beneficial to the identification of the known intents, we propose a probabilistic framework for discovering intents where intent assignments are treated as latent variables.

The Open-World Lottery Ticket Hypothesis for OOD Intent Classification

1 code implementation 13 Oct 2022 Yunhua Zhou, Pengyu Wang, Peiju Liu, Yuxin Wang, Xipeng Qiu

Most existing methods of Out-of-Domain (OOD) intent classification rely on extensive auxiliary OOD corpora or specific training paradigms.

Intent Classification

Wider and Higher: Intensive Integration and Global Foreground Perception for Image Matting

no code implementations 13 Oct 2022 Yu Qiao, Ziqi Wei, Yuhao Liu, Yuxin Wang, Dongsheng Zhou, Qiang Zhang, Xin Yang

This paper reviews recent deep-learning-based matting research and conceives our wider and higher motivation for image matting.

Image Matting

Hierarchical and Progressive Image Matting

no code implementations 13 Oct 2022 Yu Qiao, Yuhao Liu, Ziqi Wei, Yuxin Wang, Qiang Cai, Guofeng Zhang, Xin Yang

In this paper, we propose an end-to-end Hierarchical and Progressive Attention Matting Network (HAttMatting++), which can better predict the opacity of the foreground from single RGB images without additional input.

Image Matting, SSIM

Progressive Glass Segmentation

no code implementations 6 Sep 2022 Letian Yu, Haiyang Mei, Wen Dong, Ziqi Wei, Li Zhu, Yuxin Wang, Xin Yang

First, we attempt to bridge the characteristic gap between different levels of features by developing a Discriminability Enhancement (DE) module, which makes level-specific features more discriminative and alleviates feature incompatibility during fusion.

Segmentation

Facing Changes: Continual Entity Alignment for Growing Knowledge Graphs

1 code implementation 23 Jul 2022 Yuxin Wang, Yuanning Cui, Wenqiang Liu, Zequn Sun, Yiqiao Jiang, Kexin Han, Wei Hu

To avoid retraining an entire model on the whole KGs whenever new entities and triples arrive, we present a continual alignment method for this task.

Entity Alignment, Knowledge Graphs

What Dense Graph Do You Need for Self-Attention?

1 code implementation 27 May 2022 Yuxin Wang, Chu-Tak Lee, Qipeng Guo, Zhangyue Yin, Yunhua Zhou, Xuanjing Huang, Xipeng Qiu

Transformers have made progress in miscellaneous tasks, but suffer from quadratic computational and memory complexities.

Miscellaneous

CPRAL: Collaborative Panoptic-Regional Active Learning for Semantic Segmentation

no code implementations 11 Dec 2021 Yu Qiao, Jincheng Zhu, Chengjiang Long, Zeyao Zhang, Yuxin Wang, Zhenjun Du, Xin Yang

Acquiring the most representative examples via active learning (AL) can benefit many data-dependent computer vision tasks by minimizing efforts of image-level or pixel-wise annotations.

Active Learning, Semantic Segmentation

From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network

4 code implementations ICCV 2021 Yuxin Wang, Hongtao Xie, Shancheng Fang, Jing Wang, Shenggao Zhu, Yongdong Zhang

Such operation guides the vision model to use not only the visual texture of characters, but also the linguistic information in visual context for recognition when the visual cues are confused (e.g., occlusion, noise, etc.).

Language Modelling, Scene Text Recognition

Multi-domain Collaborative Feature Representation for Robust Visual Object Tracking

no code implementations 10 Aug 2021 Jiqing Zhang, Kai Zhao, Bo Dong, Yingkai Fu, Yuxin Wang, Xin Yang, BaoCai Yin

Jointly exploiting multiple different yet complementary domain information has been proven to be an effective way to perform robust object tracking.

Visual Object Tracking

PERT: A Progressively Region-based Network for Scene Text Removal

1 code implementation 24 Jun 2021 Yuxin Wang, Hongtao Xie, Shancheng Fang, Yadong Qu, Yongdong Zhang

However, there exist two problems: 1) the implicit erasure guidance causes excessive erasure of non-text areas; 2) the one-stage erasure lacks exhaustive removal of text regions.

First Place Solution of KDD Cup 2021 & OGB Large-Scale Challenge Graph Prediction Track

4 code implementations 15 Jun 2021 Chengxuan Ying, Mingqi Yang, Shuxin Zheng, Guolin Ke, Shengjie Luo, Tianle Cai, Chenglin Wu, Yuxin Wang, Yanming Shen, Di He

In this technical report, we present our solution of KDD Cup 2021 OGB Large-Scale Challenge - PCQM4M-LSC Track.

A Survey of Transformers

1 code implementation 8 Jun 2021 Tianyang Lin, Yuxin Wang, Xiangyang Liu, Xipeng Qiu

X-formers) have been proposed, however, a systematic and comprehensive literature review on these Transformer variants is still missing.

A Two-Stage Attentive Network for Single Image Super-Resolution

1 code implementation 21 Apr 2021 Jiqing Zhang, Chengjiang Long, Yuxin Wang, Haiyin Piao, Haiyang Mei, Xin Yang, BaoCai Yin

Recently, deep convolutional neural networks (CNNs) have been widely explored in single image super-resolution (SISR) and contribute remarkable progress.

Image Reconstruction, Image Super-Resolution

Multi-scale Information Assembly for Image Matting

no code implementations 7 Jan 2021 Yu Qiao, Yuhao Liu, Qiang Zhu, Xin Yang, Yuxin Wang, Qiang Zhang, Xiaopeng Wei

Image matting is a long-standing problem in computer graphics and vision, mostly identified as the accurate estimation of the foreground in input images.

Image Matting

Optimizing Fitness-For-Use of Differentially Private Linear Queries

1 code implementation 30 Nov 2020 Yingtai Xiao, Zeyu Ding, Yuxin Wang, Danfeng Zhang, Daniel Kifer

In practice, differentially private data releases are designed to support a variety of applications.

Databases

CheckDP: An Automated and Integrated Approach for Proving Differential Privacy or Finding Precise Counterexamples

no code implementations 17 Aug 2020 Yuxin Wang, Zeyu Ding, Daniel Kifer, Danfeng Zhang

We propose CheckDP, the first automated and integrated approach for proving or disproving claims that a mechanism is differentially private.

Programming Languages D.3.1

ContourNet: Taking a Further Step toward Accurate Arbitrary-shaped Scene Text Detection

1 code implementation CVPR 2020 Yuxin Wang, Hongtao Xie, Zheng-Jun Zha, Mengting Xing, Zilong Fu, Yongdong Zhang

Then a novel Local Orthogonal Texture-aware Module (LOTM) models the local texture information of proposal features in two orthogonal directions and represents the text region with a set of contour points.

Region Proposal, Scene Text Detection

A Survey of Deep Learning Techniques for Neural Machine Translation

1 code implementation 18 Feb 2020 Shuoheng Yang, Yuxin Wang, Xiaowen Chu

In recent years, natural language processing (NLP) has advanced greatly with deep learning techniques.

Machine Translation, NMT

Benchmarking the Performance and Energy Efficiency of AI Accelerators for AI Training

no code implementations 15 Sep 2019 Yuxin Wang, Qiang Wang, Shaohuai Shi, Xin He, Zhenheng Tang, Kaiyong Zhao, Xiaowen Chu

Different from the existing end-to-end benchmarks, which only present the training time, we try to investigate the impact of hardware, vendor's software library, and deep learning framework on the performance and energy consumption of AI training.

Benchmarking

Free Gap Information from the Differentially Private Sparse Vector and Noisy Max Mechanisms

no code implementations 29 Apr 2019 Zeyu Ding, Yuxin Wang, Danfeng Zhang, Daniel Kifer

We show that it can also release for free the noisy gap between the approximate maximizer and runner-up.

Proving Differential Privacy with Shadow Execution

1 code implementation 28 Mar 2019 Yuxin Wang, Zeyu Ding, Guanhong Wang, Daniel Kifer, Danfeng Zhang

Sometimes, combining those two requires substantial changes to program logics: one recent paper is able to verify Report Noisy Max automatically, but it involves a complex verification system using customized program logics and verifiers.

Programming Languages D.2.4

A Distributed Synchronous SGD Algorithm with Global Top-$k$ Sparsification for Low Bandwidth Networks

1 code implementation 14 Jan 2019 Shaohuai Shi, Qiang Wang, Kaiyong Zhao, Zhenheng Tang, Yuxin Wang, Xiang Huang, Xiaowen Chu

Current methods that use AllGather to accumulate the sparse gradients have a communication complexity of $O(kP)$, where $P$ is the number of workers, which is inefficient on low bandwidth networks with a large number of workers.

Toward Detecting Violations of Differential Privacy

2 code implementations 25 May 2018 Ding Ding, Yuxin Wang, Guanhong Wang, Danfeng Zhang, Daniel Kifer

The widespread acceptance of differential privacy has led to the publication of many sophisticated algorithms for protecting privacy.

Cryptography and Security
