1 code implementation • 12 Apr 2023 • Dong Wang, Jia Guo, Qiqi Shao, Haochi He, Zhian Chen, Chuanbao Xiao, Ajian Liu, Sergio Escalera, Hugo Jair Escalante, Zhen Lei, Jun Wan, Jiankang Deng
Leveraging the WFAS dataset and Protocol 1 (Known-Type), we host the Wild Face Anti-Spoofing Challenge at the CVPR2023 workshop.
1 code implementation • International Conference on Computer Vision Workshops 2019 • Dawei Du, Pengfei Zhu, Longyin Wen, Xiao Bian, Haibin Lin, QinGhua Hu, Tao Peng, Jiayu Zheng, Xinyao Wang, Yue Zhang, Liefeng Bo, Hailin Shi, Rui Zhu, Aashish Kumar, Aijin Li, Almaz Zinollayev, Anuar Askergaliyev, Arne Schumann, Binjie Mao, Byeongwon Lee, Chang Liu, Changrui Chen, Chunhong Pan, Chunlei Huo, Da Yu, Dechun Cong, Dening Zeng, Dheeraj Reddy Pailla, Di Li, Dong Wang, Donghyeon Cho, Dongyu Zhang, Furui Bai, George Jose, Guangyu Gao, Guizhong Liu, Haitao Xiong, Hao Qi, Haoran Wang, Heqian Qiu, Hongliang Li, Huchuan Lu, Ildoo Kim, Jaekyum Kim, Jane Shen, Jihoon Lee, Jing Ge, Jingjing Xu, Jingkai Zhou, Jonas Meier, Jun Won Choi, Junhao Hu, Junyi Zhang, Junying Huang, Kaiqi Huang, Keyang Wang, Lars Sommer, Lei Jin, Lei Zhang
Results of 33 object detection algorithms are presented.
2 code implementations • 1 Aug 2023 • Mingzhan Yang, Guangxin Han, Bin Yan, Wenhua Zhang, Jinqing Qi, Huchuan Lu, Dong Wang
Also, our method shows strong generalization for diverse trackers and scenarios in a plug-and-play and training-free manner.
Ranked #9 on Multi-Object Tracking on DanceTrack
12 code implementations • ECCV 2018 • Ze Yang, Tiange Luo, Dong Wang, Zhiqiang Hu, Jun Gao, Li-Wei Wang
In consideration of intrinsic consistency between informativeness of the regions and their probability being ground-truth class, we design a novel training paradigm, which enables Navigator to detect most informative regions under the guidance from Teacher.
Ranked #42 on Fine-Grained Image Classification on FGVC Aircraft
1 code implementation • CVPR 2023 • Bin Yan, Yi Jiang, Jiannan Wu, Dong Wang, Ping Luo, Zehuan Yuan, Huchuan Lu
All instance perception tasks aim at finding certain objects specified by some queries such as category names, language expressions, and target annotations, but this complete field has been split into multiple independent subtasks.
Ranked #1 on Referring Expression Segmentation on RefCoCo val (using extra training data)
Described Object Detection Generalized Referring Expression Comprehension +15
1 code implementation • 14 Jul 2022 • Bin Yan, Yi Jiang, Peize Sun, Dong Wang, Zehuan Yuan, Ping Luo, Huchuan Lu
We present a unified method, termed Unicorn, that can simultaneously solve four tracking problems (SOT, MOT, VOS, MOTS) with a single network using the same model parameters.
Multi-Object Tracking Multi-Object Tracking and Segmentation +3
2 code implementations • 26 Jan 2024 • Zifan Wu, Bo Tang, Qian Lin, Chao Yu, Shangqin Mao, Qianlong Xie, Xingxing Wang, Dong Wang
Results on benchmark tasks show that our method not only achieves an asymptotic performance comparable to state-of-the-art on-policy methods while using much fewer samples, but also significantly reduces constraint violation during training.
1 code implementation • 26 Jul 2023 • Jiawen Zhu, Zhenyu Chen, Zeqi Hao, Shijie Chang, Lu Zhang, Dong Wang, Huchuan Lu, Bin Luo, Jun-Yan He, Jin-Peng Lan, Hanyuan Chen, Chenyang Li
To further improve the quality of tracking masks, a pretrained MR model is employed to refine the tracking results.
Ranked #5 on Semi-Supervised Video Object Segmentation on YouTube-VOS 2019 (using extra training data)
3 code implementations • 5 Jul 2019 • Junyu. Gao, Wei. Lin, Bin Zhao, Dong Wang, Chenyu Gao, Jun Wen
This technical report attempts to provide efficient and solid kits addressed on the field of crowd counting, which is denoted as Crowd Counting Code Framework (C$^3$F).
4 code implementations • NeurIPS 2018 • Wei Cao, Dong Wang, Jian Li, Hao Zhou, Lei LI, Yitan Li
It is ubiquitous that time series contains many missing values.
General Classification Multivariate Time Series Forecasting +5
1 code implementation • ICCV 2021 • Bin Yan, Houwen Peng, Jianlong Fu, Dong Wang, Huchuan Lu
In this paper, we present a new tracking architecture with an encoder-decoder transformer as the key component.
Ranked #18 on Visual Object Tracking on TrackingNet
1 code implementation • CVPR 2021 • Xin Chen, Bin Yan, Jiawen Zhu, Dong Wang, Xiaoyun Yang, Huchuan Lu
The correlation operation is a simple fusion manner to consider the similarity between the template and the search region.
Ranked #5 on Visual Tracking on TNL2K
1 code implementation • CVPR 2021 • Bin Yan, Houwen Peng, Kan Wu, Dong Wang, Jianlong Fu, Huchuan Lu
Object tracking has achieved significant progress over the past few years.
1 code implementation • Geoscientific Model Development 2019 • Xiaomeng Huang, Xing Huang, Dong Wang, Qi Wu, Yi Li, Shixun Zhang, YuWen Chen, Mingqing Wang, Yuan Gao, Qiang Tang, Yue Chen, Zheng Fang, Zhenya Song, Guangwen Yang
In this work, we design a simple computing library to bridge the gap and decouple the work of ocean modeling from parallel computing.
2 code implementations • CVPR 2020 • Kenan Dai, Yunhua Zhang, Dong Wang, Jianhua Li, Huchuan Lu, Xiaoyun Yang
Most top-ranked long-term trackers adopt the offline-trained Siamese architectures, thus, they cannot benefit from great progress of short-term trackers with online update.
Ranked #10 on Visual Object Tracking on LaSOT-ext
1 code implementation • CVPR 2023 • Jiawen Zhu, Simiao Lai, Xin Chen, Dong Wang, Huchuan Lu
To inherit the powerful representations of the foundation model, a natural modus operandi for multi-modal tracking is full fine-tuning on the RGB-based parameters.
Ranked #10 on Rgb-T Tracking on LasHeR
3 code implementations • 12 Sep 2018 • Yunhua Zhang, Dong Wang, Lijun Wang, Jinqing Qi, Huchuan Lu
Compared with short-term tracking, the long-term tracking task requires determining the tracked object is present or absent, and then estimating the accurate bounding box if present or conducting image-wide re-detection if absent.
1 code implementation • CVPR 2022 • Xiaokang Peng, Yake Wei, Andong Deng, Dong Wang, Di Hu
Multimodal learning helps to comprehensively understand the world, by integrating different senses.
3 code implementations • 28 May 2022 • Renrui Zhang, Ziyu Guo, Rongyao Fang, Bin Zhao, Dong Wang, Yu Qiao, Hongsheng Li, Peng Gao
By fine-tuning on downstream tasks, Point-M2AE achieves 86. 43% accuracy on ScanObjectNN, +3. 36% to the second-best, and largely benefits the few-shot classification, part segmentation and 3D object detection with the hierarchical pre-training scheme.
Ranked #4 on 3D Point Cloud Linear Classification on ModelNet40 (using extra training data)
1 code implementation • 4 Jul 2020 • Bin Yan, Dong Wang, Huchuan Lu, Xiaoyun Yang
In recent years, the multiple-stage strategy has become a popular trend for visual tracking.
1 code implementation • CVPR 2021 • Bin Yan, Xinyu Zhang, Dong Wang, Huchuan Lu, Xiaoyun Yang
Many recent trackers adopt the multiple-stage tracking strategy to improve the quality of bounding box estimation.
Ranked #15 on Semi-Supervised Video Object Segmentation on VOT2020
Semi-Supervised Video Object Segmentation Visual Object Tracking
1 code implementation • 29 May 2023 • Haojun Yu, Youcheng Li, Quanlin Wu, Ziwei Zhao, Dengbo Chen, Dong Wang, LiWei Wang
To address this issue, we propose to extract contexts from previous frames, including NTC, with the guidance of inverse optical flow.
1 code implementation • 21 Oct 2021 • Yepeng Liu, Zaiwang Gu, Shenghua Gao, Dong Wang, Yusheng Zeng, Jun Cheng
Very often, the pose is estimated after the face detection.
1 code implementation • 10 Mar 2021 • Botao He, Haojia Li, Siyuan Wu, Dong Wang, Zhiwei Zhang, Qianli Dong, Chao Xu, Fei Gao
The bottleneck of solving this problem is the accurate perception of rapid dynamic objects.
Motion Compensation Robust Object Detection +1 Robotics
1 code implementation • ICCV 2019 • Bin Yan, Haojie Zhao, Dong Wang, Huchuan Lu, Xiaoyun Yang
In this work, we present a novel robust and real-time long-term tracking framework based on the proposed skimming and perusal modules.
1 code implementation • ICCV 2023 • Xiangyang Zhu, Renrui Zhang, Bowei He, Aojun Zhou, Dong Wang, Bin Zhao, Peng Gao
The popularity of Contrastive Language-Image Pre-training (CLIP) has propelled its application to diverse downstream vision tasks.
1 code implementation • 25 Oct 2023 • Zhenrui Yue, Sara Rabhi, Gabriel de Souza Pereira Moreira, Dong Wang, Even Oldridge
Recently, large language models (LLMs) have exhibited significant progress in language understanding and generation.
2 code implementations • ICCV 2019 • Peixia Li, Bo-Yu Chen, Wanli Ouyang, Dong Wang, Xiaoyun Yang, Huchuan Lu
In this work, we propose a novel gradient-guided network to exploit the discriminative information in gradients and update the template in the siamese network through feed-forward and backward operations.
Ranked #3 on Visual Object Tracking on OTB-2015 (Precision metric)
2 code implementations • 8 Dec 2020 • Pengyu Zhang, Dong Wang, Huchuan Lu
Visual object tracking, as a fundamental task in computer vision, has drawn much attention in recent years.
1 code implementation • 5 Sep 2022 • Qian Chen, Xingjian Dong, Guowei Tu, Dong Wang, Baoxuan Zhao, Zhike Peng
However, the CNN is a typical black-box model, and the mechanism of CNN's decision-making are not clear, which limits its application in high-reliability-required fault diagnosis scenarios.
1 code implementation • 18 Mar 2024 • Jiazuo Yu, Yunzhi Zhuge, Lu Zhang, Dong Wang, Huchuan Lu, You He
Continual learning can empower vision-language models to continuously acquire new knowledge, without the need for access to the entire historical dataset.
1 code implementation • CVPR 2020 • Bin Yan, Dong Wang, Huchuan Lu, Xiaoyun Yang
An effective and efficient perturbation generator is trained with a carefully designed adversarial loss, which can simultaneously cool hot regions where the target exists on the heatmaps and force the predicted bounding box to shrink, making the tracked target invisible to trackers.
1 code implementation • 22 May 2022 • Jie Zhao, Jingshu Zhang, Dongdong Li, Dong Wang
It contains a detection dataset with a total of 10, 000 images and a tracking dataset with 20 videos that include short-term and long-term sequences.
7 code implementations • 29 Mar 2023 • Zoey Guo, Yiwen Tang, Ray Zhang, Dong Wang, Zhigang Wang, Bin Zhao, Xuelong Li
In this paper, we propose ViewRefer, a multi-view framework for 3D visual grounding exploring how to grasp the view knowledge from both text and 3D modalities.
1 code implementation • ICCV 2017 • Pingping Zhang, Dong Wang, Huchuan Lu, Hongyu Wang, Bao-Cai Yin
In this paper, we propose a novel deep fully convolutional network model for accurate salient object detection.
Ranked #5 on Saliency Detection on DUT-OMRON
1 code implementation • ICCV 2017 • Pingping Zhang, Dong Wang, Huchuan Lu, Hongyu Wang, Xiang Ruan
In addition, to achieve accurate boundary inference and semantic enhancement, edge-aware feature maps in low-level layers and the predicted results of low resolution features are recursively embedded into the learning framework.
Ranked #20 on RGB Salient Object Detection on DUTS-TE (max F-measure metric)
1 code implementation • ACL 2021 • Dong Wang, Ning Ding, Piji Li, Hai-Tao Zheng
Recent works aimed to improve the robustness of pre-trained models mainly focus on adversarial training from perturbed examples with similar semantics, neglecting the utilization of different or even opposite semantics.
5 code implementations • 4 Oct 2023 • Yiwen Tang, Ray Zhang, Zoey Guo, Dong Wang, Zhigang Wang, Bin Zhao, Xuelong Li
To this end, we introduce Point-PEFT, a novel framework for adapting point cloud pre-trained models with minimal learnable parameters.
5 code implementations • 11 Apr 2024 • Yiwen Tang, Jiaming Liu, Dong Wang, Zhigang Wang, Shanghang Zhang, Bin Zhao, Xuelong Li
The adapter incorporates prior spatial knowledge from the source modality to guide the local feature aggregation of 3D tokens, compelling the semantic adaption of any-modality transformers.
1 code implementation • 25 Mar 2022 • Xin Chen, Ben Kang, Dong Wang, Dongdong Li, Huchuan Lu
Most state-of-the-art trackers are satisfied with the real-time speed on powerful GPUs.
1 code implementation • NeurIPS 2023 • Haoran He, Chenjia Bai, Kang Xu, Zhuoran Yang, Weinan Zhang, Dong Wang, Bin Zhao, Xuelong Li
Specifically, we propose Multi-Task Diffusion Model (\textsc{MTDiff}), a diffusion-based method that incorporates Transformer backbones and prompt learning for generative planning and data synthesis in multi-task offline settings.
1 code implementation • 7 Dec 2015 • Dong Wang, Xuewei Zhang
Speech data is crucially important for speech recognition research.
1 code implementation • ICCV 2021 • Yingquan Wang, Pingping Zhang, Shang Gao, Xia Geng, Hu Lu, Dong Wang
Video-based person re-identification aims to associate the video clips of the same person across multiple non-overlapping cameras.
Ranked #1 on Person Re-Identification on DukeMTMC-VideoReID
1 code implementation • CVPR 2022 • Pengyu Zhang, Jie Zhao, Dong Wang, Huchuan Lu, Xiang Ruan
With the popularity of multi-modal sensors, visible-thermal (RGB-T) object tracking is to achieve robust performance and wider application scenarios with the guidance of objects' temperature information.
Ranked #2 on Rgb-T Tracking on GTOT
1 code implementation • 13 Sep 2022 • Dong Wang, Zhao Zhang, Ziwei Zhao, Yuhang Liu, Yihong Chen, LiWei Wang
Inspired by this, we propose PointScatter, an alternative to the segmentation models for the tubular structure extraction task.
1 code implementation • CVPR 2023 • Haozhe Si, Bin Zhao, Dong Wang, Yunpeng Gao, Mulin Chen, Zhigang Wang, Xuelong Li
We show that our framework circumvents the needs for the depth and AIF image ground-truth, and receives superior predictions, thus closing the gap between the theoretical success of DFD works and their applications in the real world.
1 code implementation • ACL 2020 • Wentao Ma, Yiming Cui, Ting Liu, Dong Wang, Shijin Wang, Guoping Hu
Human conversations contain many types of information, e. g., knowledge, common sense, and language habits.
1 code implementation • 19 Mar 2021 • Zhe Xie, Chengxuan Liu, Yichi Zhang, Hongtao Lu, Dong Wang, Yue Ding
To solve the above problem, in this work, we propose a novel method called Adversarial and Contrastive Variational Autoencoder (ACVAE) for sequential recommendation.
1 code implementation • 29 May 2019 • Haoyu Song, Wei-Nan Zhang, Yiming Cui, Dong Wang, Ting Liu
Giving conversational context with persona information to a chatbot, how to exploit the information to generate diverse and sustainable conversations is still a non-trivial task.
1 code implementation • 3 Oct 2023 • Zhenrui Yue, Yueqi Wang, Zhankui He, Huimin Zeng, Julian McAuley, Dong Wang
State-of-the-art sequential recommendation relies heavily on self-attention-based recommender models.
1 code implementation • 30 Jul 2020 • Dong Wang, Bo Jiang, W. K. Chan
Furthermore, WANA proposes a set of test oracles to detect the vulnerabilities in EOSIO and Ethereum smart contracts based on WebAssembly bytecode analysis.
Software Engineering D.2.5
1 code implementation • CVPR 2018 • Chong Sun, Dong Wang, Huchuan Lu, Ming-Hsuan Yang
To address this issue, we propose a novel CF-based optimization problem to jointly model the discrimination and reliability information.
1 code implementation • CVPR 2023 • Xin Chen, Ben Kang, Jiawen Zhu, Dong Wang, Houwen Peng, Huchuan Lu
In this paper, we introduce a new sequence-to-sequence learning framework for RGB-based and multi-modal object tracking.
Ranked #1 on Rgb-T Tracking on LasHeR
1 code implementation • 25 Mar 2022 • Xin Chen, Bin Yan, Jiawen Zhu, Huchuan Lu, Xiang Ruan, Dong Wang
First, we present a transformer tracking (named TransT) method based on the Siamese-like feature extraction backbone, the designed attention-based fusion mechanism, and the classification and regression head.
1 code implementation • 22 May 2023 • Zhenrui Yue, Huimin Zeng, Yang Zhang, Lanyu Shang, Dong Wang
As such, MetaAdapt can learn how to adapt the misinformation detection model and exploit the source data for improved performance in the target domain.
1 code implementation • 2 May 2022 • Tingyu Fan, Linyao Gao, Yiling Xu, Zhu Li, Dong Wang
This paper proposes a novel 3D sparse convolution-based Deep Dynamic Point Cloud Compression (D-DPCC) network to compensate and compress the DPC geometry with 3D motion estimation and motion compensation in the feature space.
1 code implementation • 14 Dec 2020 • Dong Wang, Di Hu, Xingjian Li, Dejing Dou
The main reason is that large number of nodes (i. e., video frames) makes GCNs hard to capture and model temporal relations in videos.
Ranked #23 on Action Segmentation on Breakfast
1 code implementation • 18 Oct 2021 • Lantian Li, Ruiqian Nai, Dong Wang
The additive margin softmax (AM-Softmax) loss has delivered remarkable performance in speaker verification.
1 code implementation • CVPR 2023 • Haojie Zhao, Dong Wang, Huchuan Lu
However, for the template, we make the decoder reconstruct the target appearance within the search region.
1 code implementation • CVPR 2018 • Chong Sun, Dong Wang, Huchuan Lu, Ming-Hsuan Yang
Second, we propose a fully convolutional neural network with spatially regularized kernels, through which the filter kernel corresponding to each output channel is forced to focus on a specific region of the target.
Ranked #12 on Visual Object Tracking on VOT2017/18
2 code implementations • 6 Nov 2023 • Wenke Xia, Dong Wang, Xincheng Pang, Zhigang Wang, Bin Zhao, Di Hu, Xuelong Li
Generalizable articulated object manipulation is essential for home-assistant robots.
1 code implementation • 22 Jul 2023 • Zhixing Zhang, Ziwei Zhao, Dong Wang, Shishuang Zhao, Yuhang Liu, Jia Liu, LiWei Wang
Automatic labeling of coronary arteries is an essential task in the practical diagnosis process of cardiovascular diseases.
1 code implementation • 27 May 2023 • Zhenrui Yue, Huimin Zeng, Mengfei Lan, Heng Ji, Dong Wang
With emerging online topics as a source for numerous new events, detecting unseen / rare event types presents an elusive challenge for existing event detection methods, where only limited data access is provided for training.
1 code implementation • 2 Jul 2021 • Qing Guo, Junya Chen, Dong Wang, Yuewei Yang, Xinwei Deng, Lawrence Carin, Fan Li, Jing Huang, Chenyang Tao
Successful applications of InfoNCE and its variants have popularized the use of contrastive variational mutual information (MI) estimators in machine learning.
1 code implementation • CVPR 2022 • Bing Liu, Dong Wang, Xu Yang, Yong Zhou, Rui Yao, Zhiwen Shao, Jiaqi Zhao
In the encoding stage, the IOD is able to disentangle the region-based visual features by deconfounding the visual confounder.
2 code implementations • 31 Oct 2019 • Yue Fan, Jiawen Kang, Lantian Li, Kaicheng Li, Haolin Chen, Sitong Cheng, Pengyuan Zhang, Ziya Zhou, Yunqi Cai, Dong Wang
These datasets tend to deliver over optimistic performance and do not meet the request of research on speaker recognition in unconstrained conditions.
1 code implementation • 17 Mar 2023 • Dongsheng Wang, Xu Jia, Yang Zhang, Xinyu Zhang, Yaoyuan Wang, Ziyang Zhang, Dong Wang, Huchuan Lu
To fully exploit information with event streams to detect objects, a dual-memory aggregation network (DMANet) is proposed to leverage both long and short memory along event streams to aggregate effective information for object detection.
1 code implementation • 1 Jun 2023 • Qian Lin, Bo Tang, Zifan Wu, Chao Yu, Shangqin Mao, Qianlong Xie, Xingxing Wang, Dong Wang
Aiming at promoting the safe real-world deployment of Reinforcement Learning (RL), research on safe RL has made significant progress in recent years.
1 code implementation • CVPR 2019 • Yuxuan Sun, Chong Sun, Dong Wang, You He, Huchuan Lu
The ROI (region-of-interest) based pooling method performs pooling operations on the cropped ROI regions for various samples and has shown great success in the object detection methods.
1 code implementation • ICCV 2021 • Kenan Dai, Jie Zhao, Lijun Wang, Dong Wang, Jianhua Li, Huchuan Lu, Xuesheng Qian, Xiaoyun Yang
Deep learning based visual trackers entail offline pre-training on large volumes of video datasets with accurate bounding box annotations that are labor-expensive to achieve.
1 code implementation • 9 Sep 2021 • Guogang Liao, Ze Wang, Xiaoxu Wu, Xiaowen Shi, Chuheng Zhang, Yongkang Wang, Xingxing Wang, Dong Wang
Our model results in higher revenue and better user experience than state-of-the-art baselines in offline experiments.
1 code implementation • 6 Feb 2023 • Xiaowen Shi, Fan Yang, Ze Wang, Xiaoxu Wu, Muzhi Guan, Guogang Liao, Yongkang Wang, Xingxing Wang, Dong Wang
Then we design a novel omnidirectional attention mechanism in OCPM to capture the context information in the permutation.
1 code implementation • 30 Oct 2020 • Yunqi Cai, Lantian Li, Dong Wang, Andrew Abel
In this paper, we argue that this problem is largely attributed to the maximum-likelihood (ML) training criterion of the DNF model, which aims to maximize the likelihood of the observations but not necessarily improve the Gaussianality of the latent codes.
1 code implementation • 15 Jun 2021 • Haibin Wu, Yang Zhang, Zhiyong Wu, Dong Wang, Hung-Yi Lee
Automatic speaker verification (ASV) is a well developed technology for biometric identification, and has been ubiquitous implemented in security-critic applications, such as banking and access control.
2 code implementations • 20 Aug 2022 • Zhenrui Yue, Huimin Zeng, Ziyi Kou, Lanyu Shang, Dong Wang
However, early misinformation often demonstrates both conditional and label shifts against existing misinformation data (e. g., class imbalance in COVID-19 datasets), rendering such methods less effective for detecting early misinformation.
1 code implementation • COLING 2022 • Zhenrui Yue, Huimin Zeng, Ziyi Kou, Lanyu Shang, Dong Wang
In this work, we investigate the potential benefits of question classification for QA domain adaptation.
1 code implementation • 19 Oct 2022 • Zhenrui Yue, Huimin Zeng, Bernhard Kratzwald, Stefan Feuerriegel, Dong Wang
Unlike existing approaches, we generate pseudo labels and propose to train the model via a novel attention-based contrastive adaptation method.
1 code implementation • CVPR 2023 • Simin Li, Shuing Zhang, Gujun Chen, Dong Wang, Pu Feng, Jiakai Wang, Aishan Liu, Xin Yi, Xianglong Liu
First, to benchmark attack naturalness, we contribute the first Physical Attack Naturalness (PAN) dataset with human rating and gaze.
1 code implementation • 2 Jun 2018 • Zhiyuan Tang, Dong Wang, Qing Chen
The third oriental language recognition (OLR) challenge AP18-OLR is introduced in this paper, including the data profile, the tasks and the evaluation principles.
1 code implementation • 28 Jun 2017 • Zhiyuan Tang, Dong Wang, Yixiang Chen, Qing Chen
We present the data profile and the evaluation plan of the second oriental language recognition (OLR) challenge AP17-OLR.
1 code implementation • 23 May 2023 • Xinyu Zhang, Hefei Huang, Xu Jia, Dong Wang, Huchuan Lu
In this work, we aim to re-expose the captured photo in post-processing to provide a more flexible way of addressing those issues within a unified framework.
Ranked #4 on Deblurring on GoPro (using extra training data)
1 code implementation • 21 Jul 2019 • Dong Wang, Yicheng Liu, Wenwo Tang, Fanhua Shang, Hongying Liu, Qigong Sun, Licheng Jiao
In this paper, we propose a new first-order gradient-based algorithm to train deep neural networks.
1 code implementation • 25 May 2020 • Jiawen Kang, Ruiqi Liu, Lantian Li, Yunqi Cai, Dong Wang, Thomas Fang Zheng
Domain generalization remains a critical problem for speaker recognition, even with the state-of-the-art architectures based on deep neural nets.
Audio and Speech Processing
1 code implementation • 8 Mar 2022 • Mingqi Yuan, Man-on Pun, Dong Wang
One of the most critical challenges in deep reinforcement learning is to maintain the long-term exploration capability of the agent.
1 code implementation • 30 Oct 2020 • Yunqi Cai, Dong Wang
Limited by its linear form and the underlying Gaussian assumption, however, LDA is not applicable in situations where the data distribution is complex.
1 code implementation • 7 Mar 2024 • Huimin Zeng, Zhenrui Yue, Qian Jiang, Dong Wang
To this end, we propose GPT-FedRec, a federated recommendation framework leveraging ChatGPT and a novel hybrid Retrieval Augmented Generation (RAG) mechanism.
1 code implementation • NeurIPS 2019 • Chenyang Tao, Liqun Chen, Shuyang Dai, Junya Chen, Ke Bai, Dong Wang, Jianfeng Feng, Wenlian Lu, Georgiy Bobashev, Lawrence Carin
Inference, estimation, sampling and likelihood evaluation are four primary goals of probabilistic modeling.
1 code implementation • 7 Apr 2020 • Yunqi Cai, Lantian Li, Dong Wang, Andrew Abel
Deep speaker embedding has demonstrated state-of-the-art performance in speaker recognition tasks.
1 code implementation • 20 Dec 2020 • Faisal M. Almutairi, Yunlong Wang, Dong Wang, Emily Zhao, Nicholas D. Sidiropoulos
In many applications, the categories of items exhibit a hierarchical tree structure.
1 code implementation • ICCV 2023 • Delin Qu, Yizhen Lao, Zhigang Wang, Dong Wang, Bin Zhao, Xuelong Li
This paper addresses the problem of rolling shutter correction in complex nonlinear and dynamic scenes with extreme occlusion.
1 code implementation • 22 May 2023 • Olga Saukh, Dong Wang, Xiaoxi He, Lothar Thiele
The obtained subspace is low-dimensional and has a surprisingly simple structure even for complex, non-invertible transformations of the input, leading to an exceptionally high efficiency of subspace-configurable networks (SCNs) when limited storage and computing resources are at stake.
1 code implementation • 4 Aug 2016 • Dong Wang, Haohan Li, Xiaoyu Wei, Xiao-Ping Wang
We proposed an efficient iterative thresholding method for multi-phase image segmentation.
1 code implementation • ICLR 2022 • Qitong Gao, Dong Wang, Joshua D. Amason, Siyang Yuan, Chenyang Tao, Ricardo Henao, Majda Hadziahmetovic, Lawrence Carin, Miroslav Pajic
Though recent works have developed methods that can generate estimates (or imputations) of the missing entries in a dataset to facilitate downstream analysis, most depend on assumptions that may not align with real-world applications and could suffer from poor performance in subsequent tasks such as classification.
1 code implementation • 19 Jul 2022 • Zhenrui Yue, Huimin Zeng, Ziyi Kou, Lanyu Shang, Dong Wang
Additionally, we design an adversarial training method tailored for sequential recommender systems.
1 code implementation • 17 Jan 2024 • Dong Wang, Giovanni Beltrame
Unfortunately, the system should be controlled at the highest, worst-case frequency to ensure stability, which can demand significant computational and energy resources and hinder the deployability of the controller on onboard hardware.
no code implementations • 15 Mar 2018 • Dong Wang, Lei Zhou, Xueni Zhang, Xiao Bai, Jun Zhou
In this way, most of the representative information in the network can be retained in each cluster.
no code implementations • 27 Feb 2018 • Lantian Li, Dong Wang, Yixiang Chen, Ying Shi, Zhiyuan Tang, Thomas Fang Zheng
Various informative factors mixed in speech signals, leading to great difficulty when decoding any of the factors.
no code implementations • 31 Oct 2017 • Lantian Li, Zhiyuan Tang, Dong Wang, Thomas Fang Zheng
In recent studies, it has shown that speaker patterns can be learned from very short speech segments (e. g., 0. 3 seconds) by a carefully designed convolutional & time-delay deep neural network (CT-DNN) model.
no code implementations • 22 Feb 2018 • Pingping Zhang, Wei Liu, Dong Wang, Yinjie Lei, Hongyu Wang, Chunhua Shen, Huchuan Lu
Extensive experiments demonstrate that the proposed algorithm achieves competitive performance in both saliency detection and visual tracking, especially outperforming other related trackers on the non-rigid object tracking datasets.
no code implementations • 20 Feb 2018 • Pingping Zhang, Luyao Wang, Dong Wang, Huchuan Lu, Chunhua Shen
This paper proposes an Agile Aggregating Multi-Level feaTure framework (Agile Amulet) for salient object detection.
no code implementations • 15 Nov 2017 • Miao Zhang, Xiaofei Kang, Yanqing Wang, Lantian Li, Zhiyuan Tang, Haisheng Dai, Dong Wang
Trivial events are ubiquitous in human to human conversations, e. g., cough, laugh and sniff.
no code implementations • 12 Nov 2017 • Shiyue Zhang, Pengtao Xie, Dong Wang, Eric P. Xing
In hospital, physicians rely on massive clinical data to make diagnosis decisions, among which laboratory tests are one of the most important resources.
2 code implementations • 4 Oct 2017 • Aodong Li, Shiyue Zhang, Dong Wang, Thomas Fang Zheng
Neural machine translation (NMT) has recently achieved impressive results.
no code implementations • 9 May 2017 • Zhiyuan Tang, Dong Wang, Yixiang Chen, Lantian Li, Andrew Abel
Deep neural models, particularly the LSTM-RNN model, have shown great potential for language identification (LID).
no code implementations • EMNLP 2017 • Yang Feng, Shiyue Zhang, Andi Zhang, Dong Wang, Andrew Abel
Neural machine translation (NMT) has achieved notable success in recent times, however it is also widely recognized that this approach has limitations with handling infrequent words and word pairs.
no code implementations • 27 Jun 2017 • Shiyue Zhang, Gulnigar Mahmut, Dong Wang, Askar Hamdulla
Neural machine translation (NMT) has achieved notable performance recently.
no code implementations • 5 Jun 2017 • Dong Wang, Lantian Li, Ying Shi, Yixiang Chen, Zhiyuan Tang
In this paper, we demonstrated that the speaker factor is also a short-time spectral pattern and can be largely identified with just a few frames using a simple deep neural network (DNN).
no code implementations • 22 Jun 2017 • Dong Wang, Lantian Li, Zhiyuan Tang, Thomas Fang Zheng
This principle has recently been applied to several prototype research on speaker verification (SV), where the feature learning and classifier are learned together with an objective function that is consistent with the evaluation metric.
no code implementations • 22 Jun 2017 • Lantian Li, Dong Wang, Askar Rozi, Thomas Fang Zheng
The experiments demonstrated that the feature-based system outperformed the i-vector system with a large margin, particularly with language mismatch between enrollment and test.
no code implementations • 22 Jun 2017 • Miao Zhang, Yixiang Chen, Lantian Li, Dong Wang
This paper proposes a speaker recognition (SRE) task with trivial speech events, such as cough and laugh.
no code implementations • 27 Sep 2016 • Lantian Li, Yixiang Chen, Dong Wang, Chenghui Zhao
PLDA is a popular normalization approach for the i-vector model, and it has delivered state-of-the-art performance in speaker verification.
no code implementations • 27 Sep 2016 • Lantian Li, Zhiyuan Tang, Dong Wang, Andrew Abel, Yang Feng, Shiyue Zhang
This paper presents a unified model to perform language and speaker recognition simultaneously and altogether.
no code implementations • 9 May 2017 • Zhiyuan Tang, Dong Wang, Yixiang Chen, Ying Shi, Lantian Li
Pure acoustic neural models, particularly the LSTM-RNN model, have shown great potential in language identification (LID).
no code implementations • ACL 2017 • Jiyuan Zhang, Yang Feng, Dong Wang, Yang Wang, Andrew Abel, Shiyue Zhang, Andi Zhang
It has been shown that Chinese poems can be successfully generated by sequence-to-sequence neural models, particularly with the attention mechanism.
no code implementations • 10 May 2017 • Lantian Li, Yixiang Chen, Ying Shi, Zhiyuan Tang, Dong Wang
Recently deep neural networks (DNNs) have been used to learn speaker features.
no code implementations • 28 Sep 2016 • Zhiyuan Tang, Ying Shi, Dong Wang, Yang Feng, Shiyue Zhang
Recurrent neural networks (RNNs) have shown clear superiority in sequence modeling, particularly the ones with gated units, such as long short-term memory (LSTM) and gated recurrent unit (GRU).
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 27 Sep 2016 • Dong Wang, Lantian Li, Difei Tang, Qing Chen
We present the AP16-OL7 database which was released as the training and test data for the oriental language recognition (OLR) challenge on APSIPA 2016.
no code implementations • 31 Mar 2016 • Lantian Li, Dong Wang, Xiaodong Zhang, Thomas Fang Zheng, Panshi Jin
This paper presents a combination approach to the SUSR tasks with two phonetic-aware systems: one is the DNN-based i-vector system and the other is our recently proposed subregion-based GMM-UBM system.
no code implementations • 27 Sep 2016 • Chenghui Zhao, Lantian Li, Dong Wang, April Pu
PLDA is a popular normalization approach for the i-vector model, and it has delivered state-of-the-art performance in speaker verification.
no code implementations • 27 Sep 2016 • Dong Wang, Zhiyuan Tang, Difei Tang, Qing Chen
We present the OC16-CE80 Chinese-English mixlingual speech database which was released as a main resource for training, development and test for the Chinese-English mixlingual speech recognition (MixASR-CHEN) challenge on O-COCOSDA 2016.
no code implementations • 31 Mar 2016 • Zhiyuan Tang, Lantian Li, Dong Wang
Although highly correlated, speech and speaker recognition have been regarded as two independent tasks and studied by two communities.
no code implementations • 27 Sep 2016 • Zhiyuan Tang, Lantian Li, Dong Wang
Research on multilingual speech recognition remains attractive yet challenging.
no code implementations • 21 Apr 2016 • Qixin Wang, Tianyi Luo, Dong Wang, Chao Xing
Learning and generating Chinese poems is a charming yet challenging task.
no code implementations • 19 Jun 2016 • Qixin Wang, Tianyi Luo, Dong Wang
Recent progress in neural learning demonstrated that machines can do well in regularized tasks, e. g., the game of Go.
no code implementations • 18 May 2015 • Zhiyuan Tang, Dong Wang, Zhiyong Zhang
Recent research found that a well-trained model can be used as a teacher to train other child models, by using the predictions generated by the teacher model as supervision.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 8 Apr 2016 • Dong Wang, Xiaoyang Tan
Learning a good distance metric in feature space potentially improves the performance of the KNN classifier and is useful in many real-world applications.
no code implementations • 20 Oct 2015 • Lantian Li, Dong Wang, Chao Xing, Kaimin Yu, Thomas Fang Zheng
The popular i-vector model represents speakers as low-dimensional continuous vectors (i-vectors), and hence it is a way of continuous speaker embedding.
no code implementations • 20 Oct 2015 • Lantian Li, Dong Wang, Chao Xing, Thomas Fang Zheng
Probabilistic linear discriminant analysis (PLDA) is a popular normalization approach for the i-vector model, and has delivered state-of-the-art performance in speaker recognition.
1 code implementation • 1 Mar 2016 • Zexi Hu, Yuefang Gao, Dong Wang, Xuhong Tian
Given a base tracker, an ensemble of trackers is generated, in which each tracker's update behavior will be paced and then traces the target object forward and backward to generate a pair of trajectories in an interval.
1 code implementation • 5 Aug 2015 • Dongxu Zhang, Dong Wang
Deep learning has gained much success in sentence-level relation classification.
no code implementations • 19 Nov 2015 • Dong Wang, Thomas Fang Zheng
Transfer learning is a vital technique that generalizes models trained for one setting or task to other settings or tasks.
no code implementations • 10 Nov 2015 • Huibin Li, Jian Sun, Dong Wang, Zongben Xu, Liming Chen
In this paper, we present a novel approach to automatic 3D Facial Expression Recognition (FER) based on deep representation of facial 3D geometric and 2D photometric attributes.
3D Facial Expression Recognition Facial Expression Recognition
no code implementations • EMNLP 2015 • Tianyi Luo, Dong Wang, Rong Liu, Yiqiao Pan
ListNet is a well-known listwise learning to rank model and has gained much attention in recent years.
no code implementations • 5 Aug 2015 • Dongxu Zhang, Tianyi Luo, Dong Wang, Rong Liu
Latent Dirichlet Allocation (LDA) is a three-level hierarchical Bayesian model for topic inference.
no code implementations • 30 Jul 2015 • Mian Wang, Dong Wang
This assumption does not hold for a wide range of data types in practical applications, for instance spherical data for which the local proximity is better modelled by the von Mises-Fisher (vMF) distribution instead of the Gaussian.
no code implementations • 28 Jun 2015 • Lantian Li, Yiye Lin, Zhiyong Zhang, Dong Wang
A deep learning approach has been proposed recently to derive speaker identifies (d-vector) by a deep neural network (DNN).
no code implementations • 16 Jun 2015 • Xi Ma, Xiaoxi Wang, Dong Wang, Zhiyong Zhang
We also employ this approach to deal with out-of-language words in the task of multi-lingual speech recognition.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 7 Jun 2015 • Zhiyuan Tang, Dong Wang, Yiqiao Pan, Zhiyong Zhang
Compared to the conventional layer-wise methods, this new method does not care about the model structure, so can be used to pre-train very complex models.
no code implementations • 2 Jun 2015 • Xiangyu Zeng, Shi Yin, Dong Wang
A significant performance reduction is often observed in speech recognition when the rate of speech (ROS) is too low or too high.
no code implementations • 23 Dec 2014 • Dong Wang, Xiaoyang Tan
To address this issue, we propose a SVDD based feature learning algorithm that describes the density and distribution of each cluster from K-means with an SVDD ball for more robust feature representation.
Ranked #23 on Image Classification on MNIST
no code implementations • 24 May 2015 • Lantian Li, Dong Wang, Zhiyong Zhang, Thomas Fang Zheng
Recent research shows that deep neural networks (DNNs) can be used to extract deep speaker vectors (d-vectors) that preserve speaker characteristics and can be used in speaker verification.
no code implementations • 17 Jul 2018 • Jiyuan Zhang, Dong Wang
Research has shown that sequence-to-sequence neural models, particularly those with the attention mechanism, can successfully generate classical Chinese poems.
no code implementations • 8 Nov 2018 • Lantian Li, Zhiyuan Tang, Ying Shi, Dong Wang
This paper proposes a Gaussian-constrained training approach that (1) discards the parametric classifier, and (2) enforces the distribution of the derived speaker vectors to be Gaussian.
no code implementations • 8 Nov 2018 • Lantian Li, Zhiyuan Tang, Ying Shi, Dong Wang
This score reflects the similarity of the two frames in phonetic content, and is used to weigh the contribution of this frame pair in the utterance-based scoring.
no code implementations • ACL 2017 • Wei Song, Dong Wang, Ruiji Fu, Lizhen Liu, Ting Liu, Guoping Hu
Evaluation results show that discourse modes can be identified automatically with an average F1-score of 0. 7.
no code implementations • CVPR 2018 • Wenda Zhao, Fan Zhao, Dong Wang, Huchuan Lu
To address these issues, we propose a multi-stream bottom-top-bottom fully convolutional network (BTBNet), which is the first attempt to develop an end-to-end deep network for DBD.
Ranked #2 on Defocus Estimation on CUHK - Blur Detection Dataset (MAE metric)
no code implementations • ECCV 2018 • Yunhua Zhang, Lijun Wang, Jinqing Qi, Dong Wang, Mengyang Feng, Huchuan Lu
In this paper, we circumvent this issue by proposing a local structure learning method, which simultaneously considers the local patterns of the target and their structural relationships for more accurate target tracking.
no code implementations • ECCV 2018 • Boyu Chen, Dong Wang, Peixia Li, Shuang Wang, Huchuan Lu
In this work, we propose a novel tracking algorithm with real-time performance based on the âActor-Criticâ framework.
no code implementations • ICLR 2019 • Ke Xu, Xiao-Yun Wang, Qun Jia, Jianjing An, Dong Wang
Therefore, accumulating the saliency of the filter over the entire data set can provide more accurate guidance for pruning.
no code implementations • 9 Jan 2019 • Huimin Lu, Dong Wang, Yujie Li, Jianru Li, Xin Li, Hyoungseop Kim, Seiichi Serikawa, Iztok Humar
The Cognitive Ocean Network (CONet) will become the mainstream of future ocean science and engineering developments.
no code implementations • CVPR 2013 • Dong Wang, Huchuan Lu, Ming-Hsuan Yang
In this paper, we propose a generative tracking method based on a novel robust linear regression algorithm.
no code implementations • CVPR 2014 • Dong Wang, Huchuan Lu
In this paper, we present a novel online visual tracking method based on linear representation.
no code implementations • CVPR 2017 • Lijun Wang, Huchuan Lu, Yifan Wang, Mengyang Feng, Dong Wang, Bao-Cai Yin, Xiang Ruan
In the second stage, FIN is fine-tuned with its predicted saliency maps as ground truth.
no code implementations • ICCV 2017 • Zimo Liu, Dong Wang, Huchuan Lu
The intensive annotation cost and the rich but unlabeled data contained in videos motivate us to propose an unsupervised video-based person re-identification (re-ID) method.
Ranked #7 on Person Re-Identification on PRID2011
no code implementations • LREC 2012 • Dong Wang, Fei Xia
Our experiments show that the predicted scores are close to the real scores when tested on the CTB data.
no code implementations • 7 Apr 2019 • Yang Zhang, Lantian Li, Dong Wang
Deep speaker embedding has achieved state-of-the-art performance in speaker recognition.
no code implementations • CVPR 2019 • Di Hu, Dong Wang, Xuelong. Li, Feiping Nie, Qi. Wang
different encoding schemes indicate that using machine model to accelerate optimization evaluation and reduce experimental cost is feasible to some extent, which could dramatically promote the upgrading of encoding scheme then help the blind to improve their visual perception ability.
no code implementations • 24 Apr 2019 • Dong Wang, Xiao-Ping Wang
In this paper, we propose a novel iterative convolution-thresholding method (ICTM) that is applicable to a range of variational models for image segmentation.
no code implementations • 30 Apr 2019 • Dong Wang, Yuan Yuan, Qi. Wang
Action Prediction is aimed to determine what action is occurring in a video as early as possible, which is crucial to many online applications, such as predicting a traffic accident before it happens and detecting malicious actions in the monitoring system.
no code implementations • 30 Apr 2019 • Yuan Yuan, Dong Wang, Qi. Wang
Human actions captured in video sequences contain two crucial factors for action recognition, i. e., visual appearance and motion dynamics.
no code implementations • 30 Apr 2019 • Yuan Yuan, Dong Wang, Qi. Wang
3) Results of motion orientation and magnitude are adaptively weighted and fused by a Bayesian model, which makes the proposed method more robust and handle more kinds of abnormal events.
no code implementations • 30 Apr 2019 • Dong Wang, Yuan Yuan, Qi. Wang
The classification object ensures that each modal network predicts the true action category while the competing objective encourages each modal network to outperform the other one.
no code implementations • 28 May 2019 • Tianle Cai, Ruiqi Gao, Jikai Hou, Siyu Chen, Dong Wang, Di He, Zhihua Zhang, Li-Wei Wang
First-order methods such as stochastic gradient descent (SGD) are currently the standard algorithm for training deep neural networks.
no code implementations • 24 Jan 2019 • Yuan Zhang, Dong Wang, Yan Zhang
Recently, neural models for information retrieval are becoming increasingly popular.
no code implementations • 18 Jun 2019 • Dong Wang, Lei Zhou, Xiao Bai, Jun Zhou
Our method accelerates the network in one-step pruning-recovery manner with a novel optimization objective function, which achieves higher accuracy with much less cost compared with existing pruning methods.
no code implementations • 24 Jun 2019 • Dong Wang, Yitong Li, Wei Cao, Liqun Chen, Qi Wei, Lawrence Carin
We propose a Leaked Motion Video Predictor (LMVP) to predict future frames by capturing the spatial and temporal dependencies from given inputs.
no code implementations • 9 Jun 2019 • Benlin Hu, Cheng Lei, Dong Wang, Shu Zhang, Zhenyu Chen
Deep learning models have a large number of freeparameters that need to be calculated by effective trainingof the models on a great deal of training data to improvetheir generalization performance.
no code implementations • 17 Jul 2019 • Lanyu Shang, Daniel Zhang, Michael Wang, Shuyue Lai, Dong Wang
Current clickbait detection solutions that mainly focus on analyzing the text of the title, the image of the thumbnail, or the content of the video are shown to be suboptimal in detecting the online clickbait videos.
no code implementations • 16 Jul 2019 • Zhiyuan Tang, Dong Wang, Li-Ming Song
The participants can refer to these online-published recipes to deploy LID systems for convenience.
no code implementations • 27 Aug 2019 • Xueyi Wang, Lantian Li, Dong Wang
By enforcing the neural model to discriminate the speakers in the training set, deep speaker embedding (called `x-vectors`) can be derived from the hidden layers.
no code implementations • 11 Sep 2019 • Yang Zhang, Daniel Zhang, Nathan Vance, Dong Wang
Social sensing has emerged as a new sensing paradigm where humans (or devices on their behalf) collectively report measurements about the physical world.
no code implementations • 29 Oct 2019 • Haoran Sun, Yunqi Cai, Lantian Li, Dong Wang
Speech signals are complex composites of various information, including phonetic content, speaker traits, channel effect, etc.
no code implementations • 10 Nov 2019 • Dhanasekar Sundararaman, Vivek Subramanian, Guoyin Wang, Shijing Si, Dinghan Shen, Dong Wang, Lawrence Carin
Attention-based models have shown significant improvement over traditional algorithms in several NLP tasks.
no code implementations • 20 Dec 2019 • Xi Liu, Rui Zhang, Yongsheng Zhou, Qianyi Jiang, Qi Song, Nan Li, Kai Zhou, Lei Wang, Dong Wang, Minghui Liao, Mingkun Yang, Xiang Bai, Baoguang Shi, Dimosthenis Karatzas, Shijian Lu, C. V. Jawahar
21 teams submit results for Task 1, 23 teams submit results for Task 2, 24 teams submit results for Task 3, and 13 teams submit results for Task 4.
no code implementations • 26 Jan 2020 • Di Hu, Zheng Wang, Haoyi Xiong, Dong Wang, Feiping Nie, Dejing Dou
Associating sound and its producer in complex audiovisual scene is a challenging task, especially when we are lack of annotated training data.
no code implementations • 12 Feb 2020 • Dong Wang, Feng Zhou, Zheng Yan, Guang Yao, Zongxuan Liu, Wennan Ma, Cewu Lu
Our model builds upon an variational encoder which transforms the input video into a latent feature space and a Luenberger-type observer which captures the dynamic evolution of the latent features.
no code implementations • 27 Feb 2020 • Ziqi Liu, Dong Wang, Qianyu Yu, Zhiqiang Zhang, Yue Shen, Jian Ma, Wenliang Zhong, Jinjie Gu, Jun Zhou, Shuang Yang, Yuan Qi
In this paper, we present a graph representation learning method atop of transaction networks for merchant incentive optimization in mobile payment marketing.
no code implementations • CVPR 2020 • Dong Wang, Yuan Zhang, Kexin Zhang, Li-Wei Wang
Applying artificial intelligence techniques in medical imaging is one of the most promising areas in medicine.
no code implementations • 9 Apr 2020 • Md Tahmid Rashid, Dong Wang
In this vision paper, we discuss the roles of CovidSens and identify potential challenges in developing reliable social sensing based risk alert systems.
no code implementations • 22 Apr 2020 • Dong Wang, Xiaoqian Qin, Fengyi Song, Li Cheng
Generative adversarial networks (GANs), famous for the capability of learning complex underlying data distribution, are however known to be tricky in the training process, which would probably result in mode collapse or performance deterioration.
no code implementations • 25 May 2020 • Dong Wang
In this paper, we develop an efficient iterative method on a variational model for the surface reconstruction from point clouds.
no code implementations • 27 May 2020 • Dong Wang, Kexin Zhang, Jia Ding, Li-Wei Wang
In the clinical practice, Tanner and Whitehouse (TW2) method is a widely-used method for radiologists to perform BAA.
no code implementations • 4 Jun 2020 • Md Tahmid Rashid, Daniel, Zhang, Dong Wang
iii) How to efficiently guide the cars to the event locations with little prior knowledge of the road damage caused by the disaster, while also handling the dynamics of the physical world and social media?
no code implementations • 4 Jun 2020 • Zheng Li, Miao Zhao, Qingyang Hong, Lin Li, Zhiyuan Tang, Dong Wang, Li-Ming Song, Cheng Yang
Based on Kaldi and Pytorch, recipes for i-vector and x-vector systems are also conducted as baselines for the three tasks.
no code implementations • 1 Jul 2020 • Jun Ma, Dong Wang, Xiao-Ping Wang, Xiaoping Yang
Active contour models have been widely used in image segmentation, and the level set method (LSM) is the most popular approach for solving the models, via implicitly representing the contour by a level set function.
no code implementations • 4 Jul 2020 • Pengyu Zhang, Jie Zhao, Dong Wang, Huchuan Lu, Xiaoyun Yang
In this study, we propose a novel RGB-T tracking framework by jointly modeling both appearance and motion cues.
Ranked #4 on Rgb-T Tracking on GTOT
no code implementations • 1 Jan 2021 • Liqun Chen, Yizhe Zhang, Dianqi Li, Chenyang Tao, Dong Wang, Lawrence Carin
There has been growing interest in representation learning for text data, based on theoretical arguments and empirical evidence.
no code implementations • 10 Oct 2020 • Dong Wang
In this article, we first establish the theory of optimal scores for speaker recognition.
no code implementations • 16 Oct 2020 • Braxton Osting, Dong Wang, Yiming Xu, Dominique Zosso
Archetypal analysis is an unsupervised learning method that uses a convex polytope to summarize multivariate data.
no code implementations • 27 Oct 2020 • Lantian Li, Yang Zhang, Jiawen Kang, Thomas Fang Zheng, Dong Wang
Domain mismatch often occurs in real applications and causes serious performance reduction on speaker verification systems.
no code implementations • 27 Oct 2020 • Haoran Sun, Lantian Li, Yunqi Cai, Yang Zhang, Thomas Fang Zheng, Dong Wang
Various information factors are blended in speech signals, which forms the primary difficulty for most speech information processing tasks.
no code implementations • 4 Nov 2020 • Ying Shi, Haolin Chen, Zhiyuan Tang, Lantian Li, Dong Wang, Jiqing Han
Recently, speech enhancement (SE) based on deep speech prior has attracted much attention, such as the variational auto-encoder with non-negative matrix factorization (VAE-NMF) architecture.
no code implementations • 6 Dec 2020 • Dong Wang, Yuewei Yang, Chenyang Tao, Zhe Gan, Liqun Chen, Fanjie Kong, Ricardo Henao, Lawrence Carin
Deep neural networks excel at comprehending complex visual signals, delivering on par or even superior performance to that of human experts.
no code implementations • CVPR 2021 • Liqun Chen, Dong Wang, Zhe Gan, Jingjing Liu, Ricardo Henao, Lawrence Carin
The primary goal of knowledge distillation (KD) is to encapsulate the information of a model learned from a teacher network into a student network, with the latter being more compact than the former.
Ranked #10 on Knowledge Distillation on CIFAR-100
no code implementations • COLING 2020 • Dongming Sheng, Dong Wang, Ying Shen, Haitao Zheng, Haozhuang Liu
Local dependencies, which captures short-term emotional effects between neighbouring utterances, are further injected via an Aggregation Graph to distinguish the subtle differences between utterances containing emotional phrases.
Ranked #29 on Emotion Recognition in Conversation on IEMOCAP
no code implementations • 20 May 2021 • Nihal Potdar, Anderson R. Avila, Chao Xing, Dong Wang, Yiran Cao, Xiao Chen
In this paper, we propose a streaming end-to-end framework that can process multiple intentions in an online and incremental way.