no code implementations • 13 Nov 2024 • Yifei Jin, Ali Maatouk, Sarunas Girdzijauskas, Shugong Xu, Leandros Tassiulas, Rex Ying
Wireless ray-tracing (RT) is emerging as a key tool for three-dimensional (3D) wireless channel modeling, driven by advances in graphical rendering.
no code implementations • 28 Oct 2024 • Yanliang Jin, Yifan Wu, Yuan Gao, Shunqing Zhang, Shugong Xu, Cheng-Xiang Wang
The emergence of 6th generation (6G) mobile networks brings new challenges in supporting high-mobility communications, particularly in addressing the issue of channel aging.
no code implementations • 28 Sep 2024 • Xiaoxiang Han, Xinyu Li, Jiang Shang, Yiman Liu, Keyan Chen, Shugong Xu, Qiaohong Liu, Qi Zhang
Therefore, we propose leveraging predictions near decision boundaries effectively.
no code implementations • 24 Sep 2024 • Zhiyong Chen, Xinnuo Li, Zhiqi Ai, Shugong Xu
We introduce StyleFusion-TTS, a prompt and/or audio referenced, style and speaker-controllable, zero-shot text-to-speech (TTS) synthesis system designed to enhance the editability and naturalness of current research literature.
no code implementations • 24 Sep 2024 • Zhiyong Chen, Zhiqi Ai, Xinnuo Li, Shugong Xu
This paper introduces a novel framework for open-set speaker identification in household environments, playing a crucial role in facilitating seamless human-computer interactions.
no code implementations • 4 Sep 2024 • Anqi Liu, Shiyi Mu, Shugong Xu
Autonomous driving algorithms usually employ sRGB images as model input due to their compatibility with the human visual system.
no code implementations • 4 Sep 2024 • Jinhao Chai, Shiyi Mu, Shugong Xu
To our knowledge, TLD is the first dataset to separately annotate brake lights and turn signals in real driving scenarios.
no code implementations • 21 Jun 2024 • Xiaojing Chen, Zhenyuan Li, Wei Ni, Xin Wang, Shunqing Zhang, Yanzan Sun, Shugong Xu, Qingqi Pei
Federated learning (FL) is a viable technique to train a shared machine learning model without sharing data.
1 code implementation • 11 Jun 2024 • Zhiqi Ai, Zhiyong Chen, Shugong Xu
In this paper, we propose MM-KWS, a novel approach to user-defined keyword spotting leveraging multi-modal enrollments of text and speech templates.
no code implementations • 21 Aug 2022 • Jun Yu, Shunqing Zhang, Jiayun Sun, Shugong Xu, Shan Cao
Multi-stream carrier aggregation is a key technology to expand bandwidth and improve the throughput of fifth-generation wireless communication systems.
no code implementations • 6 Aug 2022 • Heng Zhang, Guangjin Pan, Shugong Xu, Shunqing Zhang, Zhiyuan Jiang
In the proposal, LSTM networks are employed to predict the traffic demand and location of each user at the slicing-window level.
no code implementations • 3 Aug 2022 • Guangjin Pan, Heng Zhang, Shugong Xu, Shunqing Zhang, Xiaojing Chen
The high computational complexity and high energy consumption of artificial intelligence (AI) algorithms hinder their application in augmented reality (AR) systems.
no code implementations • 16 Nov 2021 • Yue Tao, Zhiwei Jia, Runze Ma, Shugong Xu
We propose a 1-D split to address the challenges of complexity and replace the CNN with the transformer encoder to reduce the need for a context modeling module.
no code implementations • 29 Oct 2021 • Dinghao Fan, Hengjie Lu, Shugong Xu, Shan Cao
Our framework is trained to learn a representation for multi-task learning: gesture segmentation and gesture recognition.
1 code implementation • 21 Oct 2021 • Jingchao Chen, Shiyi Mu, Shugong Xu, Youdong Ding
Although much progress has been made in text recognition/OCR in recent years, the task of font recognition remains challenging.
Ranked #1 on Font Recognition on Explor_all (Top 1 Accuracy metric)
1 code implementation • 21 Oct 2021 • Hang Cheng, Shugong Xu, Xiufeng Jiang, Rongrong Wang
In this paper, we propose a matting method that uses Flexible Guidance Input as a user hint, which means our method can use a trimap, scribblemap, or clickmap as guidance information, or even work without any guidance input.
no code implementations • 24 Aug 2021 • Kaixuan Huang, Chenlu Xiang, Shunqing Zhang, Shugong Xu, Xianfeng Ma, Qinglong Xian, Hua Yang
With the rising demand for indoor localization, high-precision fingerprint-based techniques have become increasingly important.
no code implementations • 13 Aug 2021 • Zhiwei Jia, Shugong Xu, Shiyi Mu, Yue Tao, Shan Cao, Zhiyong Chen
In this paper, we propose an Iterative Fusion based Recognizer (IFR) for low quality scene text recognition, taking advantage of refined text images input and robust feature representation.
no code implementations • 12 Aug 2021 • Youxuan Ma, Zongze Ren, Shugong Xu
In recent years, synthetic speech generated by advanced text-to-speech (TTS) and voice conversion (VC) systems has caused great harm to automatic speaker verification (ASV) systems, urging us to design a synthetic speech detection system to protect them.
no code implementations • 24 Jun 2021 • Hengjie Lu, Shugong Xu, Shan Cao
Therefore, we propose a method to tackle the problem of single-line depth completion, in which we aim to generate a dense depth map from the single-line LiDAR info and the aligned RGB image.
no code implementations • 21 Apr 2021 • Yanzan Sun, Qinggang Xie, Guangjin Pan, Shunqing Zhang, Shugong Xu
With the rapid development of indoor location-based services (LBSs), the demand for accurate localization keeps growing as well.
no code implementations • 1 Apr 2021 • Xiufeng Jiang, Shugong Xu, Shunqing Zhang, Shan Cao
In this paper, we propose a novel text region representation method, with a robust pipeline, which can precisely detect dense adjacent text instances with arbitrary shapes.
no code implementations • 29 Mar 2021 • Jiajun Zhu, Xiufeng Jiang, Zhiwei Jia, Shugong Xu, Shan Cao
Moreover, a paired low-quality scene text video dataset named Text-RBL is proposed, consisting of raw videos, blurry videos, and low-resolution videos, labeled by the proposed convenient semi-automatic labeling strategy.
no code implementations • 29 Mar 2021 • Xudong Chen, Shugong Xu, Qiaobin Ji, Shan Cao
Besides, we propose an Attention based Face Anti-spoofing network with Feature Augment (AFA) to address face anti-spoofing (FAS) for low-quality face images.
no code implementations • 26 Jan 2021 • Chenlu Xiang, Shunqing Zhang, Shugong Xu, George C. Alexandropoulos
Precise indoor localization is one of the key requirements for fifth-generation (5G) and beyond wireless communication systems, whose applications span different vertical sectors.
no code implementations • NeurIPS 2020 • Peiyao Wang, Weixin Luo, Yanyu Xu, Haojie Li, Shugong Xu, Jianyu Yang, Shenghua Gao
Spatial Description Resolution, as a language-guided localization task, is proposed for target location in a panoramic street view, given corresponding language descriptions.
no code implementations • 8 Sep 2020 • Asim Ihsan, Wen Chen, Shunqing Zhang, Shugong Xu
The proposed system multicasts the information through low-complexity optimal power allocation algorithms, subject to the channel outage probability constraints of vehicles with imperfect CSI, the QoS constraints of vehicles, and the transmit power limits of RSUs.
no code implementations • 30 Aug 2020 • Yuxi Li, Weiyao Lin, Tao Wang, John See, Rui Qian, Ning Xu, Li-Min Wang, Shugong Xu
The task of spatial-temporal action detection has attracted increasing attention among researchers.
Ranked #3 on Action Detection on UCF Sports (Video-mAP 0.2 metric)
no code implementations • 20 Aug 2020 • Yu Wang, Guangbing Zhou, Chenlu Xiang, Shunqing Zhang, Shugong Xu
Existing localization systems for indoor applications rely mainly on wireless signals.
no code implementations • ECCV 2020 • Yuxi Li, Weiyao Lin, John See, Ning Xu, Shugong Xu, Ke Yan, Cong Yang
Most current pipelines for spatio-temporal action localization connect frame-wise or clip-wise detection results to generate action proposals, where only local information is exploited and the efficiency is hindered by dense per-frame localization.
no code implementations • 18 Aug 2020 • Guangjin Pan, Tao Wang, Shunqing Zhang, Shugong Xu
Conventional schemes often require extra reference signals or more complicated algorithms to improve the time-of-arrival (TOA) estimation accuracy.
no code implementations • 19 Jun 2020 • Xiaojing Chen, Zhouyu Lu, Wei Ni, Xin Wang, Feng Wang, Shunqing Zhang, Shugong Xu
Driven by explosive computation demands of Internet of Things (IoT), mobile edge computing (MEC) provides a promising technique to enhance the computation capability for mobile users.
no code implementations • 24 Feb 2020 • Fei Peng, Zhiyuan Jiang, Shunqing Zhang, Shugong Xu
Real-time status update in future vehicular networks is vital to enable control-level cooperative autonomous driving.
Information Theory • Networking and Internet Architecture
no code implementations • 13 Feb 2020 • Yan Liu, Zhiyuan Jiang, Shunqing Zhang, Shugong Xu
Ultra-Reliable and Low-Latency Communications (URLLC) services in vehicular networks on millimeter-wave bands present a significant challenge, considering the necessity of constantly adjusting the beam directions.
no code implementations • 16 Aug 2019 • Tianhao Qiao, Shunqing Zhang, Zhichao Zhang, Shan Cao, Shugong Xu
Environmental Sound Classification (ESC) is an important and challenging problem, and feature representation is a critical and even decisive factor in ESC.
no code implementations • 12 Aug 2019 • Zhiyong Chen, Zongze Ren, Shugong Xu
Learning a good speaker embedding is important for many automatic speaker recognition tasks, including verification, identification and diarization.
no code implementations • 6 Aug 2019 • Zongze Ren, Guofu Yang, Shugong Xu
In this paper, we present a two-stage language identification (LID) system based on a shallow ResNet14 followed by a simple 2-layer recurrent neural network (RNN) architecture, which was used in the Xunfei (iFlyTek) Chinese Dialect Recognition Challenge and won first place among 110 teams.
no code implementations • 6 Aug 2019 • Zongze Ren, Zhiyong Chen, Shugong Xu
Both improvements are based on triplets, because the training stage and the evaluation stage of the baseline x-vector system focus on different aims.
Speaker Recognition • Text-Independent Speaker Verification
no code implementations • 4 Jul 2019 • Zhichao Zhang, Shugong Xu, Tianhao Qiao, Shunqing Zhang, Shan Cao
In order to deal with this, we employ a frame-level attention model to focus on the semantically relevant frames and salient frames.
no code implementations • 9 Apr 2019 • Xiaoyu Chen, Shugong Xu, Xudong Chen, Shan Cao, Shunqing Zhang, Yanzan Sun
TCP congestion control algorithm identification (TCP identification) can be used to significantly improve network efficiency.
no code implementations • 25 Aug 2018 • Changhao Wu, Shugong Xu, Guocong Song, Shunqing Zhang
As a large amount of labeled data is typically difficult to collect and even more difficult to annotate, data augmentation and data generation are widely used in the process of training deep neural networks.
no code implementations • 2 Jun 2018 • Yuanzhouhan Cao, Tianqi Zhao, Ke Xian, Chunhua Shen, Zhiguo Cao, Shugong Xu
In this paper, we propose to improve the performance of metric depth estimation with relative depths collected from stereo movie videos using an existing stereo matching algorithm.