no code implementations • ACL 2022 • Shuang Liu, Dong Wang, Xiaoguang Li, Minghui Huang, Meizhen Ding
Open-domain question answering is a challenging task with a wide variety of practical applications.
no code implementations • 20 Sep 2023 • Xuyang Chen, Dong Wang, Konrad Schindler, Mingwei Sun, Yongliang Wang, Nicolo Savioli, Liqiu Meng
Recently, Transformer-based text detection techniques have sought to predict polygons by encoding the coordinates of individual boundary vertices using distinct query features.
no code implementations • 15 Sep 2023 • Jie Zhao, Johan Edstedt, Michael Felsberg, Dong Wang, Huchuan Lu
Due to long-distance correlation and powerful pretrained models, transformer-based methods have initiated a breakthrough in visual object tracking performance.
no code implementations • 17 Aug 2023 • Dong Wang, Kavé Salamatian, Yunqing Xia, Weiwei Deng, Qi Zhiang
Although deep pre-trained language models have shown promising benefit in a large set of industrial scenarios, including Click-Through-Rate (CTR) prediction, how to integrate pre-trained language models that handle only textual signals into a prediction pipeline with non-textual features is challenging.
1 code implementation • 14 Aug 2023 • Ben Kang, Xin Chen, Dong Wang, Houwen Peng, Huchuan Lu
The Bridge Module incorporates the high-level information of deep features into the shallow large-resolution features.
1 code implementation • 1 Aug 2023 • Mingzhan Yang, Guangxin Han, Bin Yan, Wenhua Zhang, Jinqing Qi, Huchuan Lu, Dong Wang
Furthermore, our method shows strong generalization for diverse trackers and scenarios in a plug-and-play and training-free manner.
1 code implementation • 26 Jul 2023 • Jiawen Zhu, Zhenyu Chen, Zeqi Hao, Shijie Chang, Lu Zhang, Dong Wang, Huchuan Lu, Bin Luo, Jun-Yan He, Jin-Peng Lan, Hanyuan Chen, Chenyang Li
To further improve the quality of tracking masks, a pretrained MR model is employed to refine the tracking results.
Ranked #4 on Semi-Supervised Video Object Segmentation on YouTube-VOS 2019 (using extra training data)
Semantic Segmentation
Semi-Supervised Video Object Segmentation
1 code implementation • 22 Jul 2023 • Zhixing Zhang, Ziwei Zhao, Dong Wang, Shishuang Zhao, Yuhang Liu, Jia Liu, LiWei Wang
Automatic labeling of coronary arteries is an essential task in the practical diagnosis process of cardiovascular diseases.
no code implementations • 4 Jul 2023 • Wei Zhang, Ping Zhang, Jian Dong, Yongkang Wang, Pengye Zhang, Bo Zhang, Xingxing Wang, Dong Wang
The effectiveness of ad creatives is greatly influenced by their visual appearance.
no code implementations • 26 Jun 2023 • Wei Zhang, Pengye Zhang, Bo Zhang, Xingxing Wang, Dong Wang
The disadvantage of the former is that a single-domain model does not use data from other domains, while the latter leverages all the data from different domains, but fine-tuning in transfer learning may trap the model in a local optimum of the source domain, making it difficult to fit the target domain.
no code implementations • 21 Jun 2023 • Chanyue Wu, Dong Wang, Hanyu Mao, Ying Li
Despite the proven significance of hyperspectral images (HSIs) in performing various computer vision tasks, their potential is adversely affected by the low-resolution (LR) property in the spatial domain, resulting from multiple physical factors.
no code implementations • 12 Jun 2023 • AnLan Sun, Zhao Zhang, Meng Lei, Yuting Dai, Dong Wang, LiWei Wang
The coherence loss uses the feature centers generated by the static images to guide the frame attention in the video model.
no code implementations • 5 Jun 2023 • Huinan Sun, Guangliang Yu, Pengye Zhang, Bo Zhang, Xingxing Wang, Dong Wang
It consists of a multi-interest graph structure for capturing long-term user behavior, a multi-scenario heterogeneous sequence model for modeling short-term information, and an adaptive fusion mechanism to fuse information from long-term and short-term behaviors.
1 code implementation • 1 Jun 2023 • Qian Lin, Bo Tang, Zifan Wu, Chao Yu, Shangqin Mao, Qianlong Xie, Xingxing Wang, Dong Wang
Aiming at promoting the safe real-world deployment of Reinforcement Learning (RL), research on safe RL has made significant progress in recent years.
1 code implementation • 29 May 2023 • Haojun Yu, Youcheng Li, Quanlin Wu, Ziwei Zhao, Dengbo Chen, Dong Wang, LiWei Wang
To address this issue, we propose to extract contexts from previous frames, including NTC, with the guidance of inverse optical flow.
1 code implementation • 29 May 2023 • Haoran He, Chenjia Bai, Kang Xu, Zhuoran Yang, Weinan Zhang, Dong Wang, Bin Zhao, Xuelong Li
Specifically, we propose Multi-Task Diffusion Model (MTDiff), a diffusion-based method that incorporates Transformer backbones and prompt learning for generative planning and data synthesis in multi-task offline settings.
no code implementations • 28 May 2023 • Kang Xu, Chenjia Bai, Xiaoteng Ma, Dong Wang, Bin Zhao, Zhen Wang, Xuelong Li, Wei Li
Specifically, we present the Value-Guided Data Filtering (VGDF) algorithm, which selectively shares transitions from the source domain based on the proximity of paired value targets across the two domains.
no code implementations • 28 May 2023 • Ying Shi, Dong Wang, Lantian Li, Jiqing Han, Shi Yin
We propose a novel Mix Training (MT) strategy that encourages the model to discover low-energy keywords from noisy and mixed speech.
1 code implementation • 27 May 2023 • Zhenrui Yue, Huimin Zeng, Mengfei Lan, Heng Ji, Dong Wang
With emerging online topics as a source for numerous new events, detecting unseen / rare event types presents an elusive challenge for existing event detection methods, where only limited data access is provided for training.
no code implementations • 25 May 2023 • Lantian Li, Xiaolou Li, Haoyu Jiang, Chen Chen, Ruihai Hou, Dong Wang
A comprehensive study was conducted to compare CN-Celeb-AV with two popular public AVPR benchmark datasets, and the results demonstrated that CN-Celeb-AV is more in line with real-world scenarios and can be regarded as a new benchmark dataset for AVPR research.
no code implementations • 25 May 2023 • Jiaying Wang, Xianglong Wang, Namin Wang, Lantian Li, Dong Wang
Modern speaker recognition systems represent utterances by embedding vectors.
1 code implementation • 23 May 2023 • Xinyu Zhang, Hefei Huang, Xu Jia, Dong Wang, Huchuan Lu
In this work, we aim to re-expose the captured photo in post-processing to provide a more flexible way of addressing those issues within a unified framework.
Ranked #4 on Deblurring on GoPro (using extra training data)
1 code implementation • 22 May 2023 • Zhenrui Yue, Huimin Zeng, Yang Zhang, Lanyu Shang, Dong Wang
As such, MetaAdapt can learn how to adapt the misinformation detection model and exploit the source data for improved performance in the target domain.
1 code implementation • CVPR 2023 • Simin Li, Shuing Zhang, Gujun Chen, Dong Wang, Pu Feng, Jiakai Wang, Aishan Liu, Xin Yi, Xianglong Liu
First, to benchmark attack naturalness, we contribute the first Physical Attack Naturalness (PAN) dataset with human rating and gaze.
1 code implementation • 22 May 2023 • Olga Saukh, Dong Wang, Xiaoxi He, Lothar Thiele
Deep models lack robustness to simple input transformations such as rotation, scaling, and translation, unless they feature a particular invariant architecture or undergo specific training, e.g., learning the desired robustness from data augmentations.
1 code implementation • CVPR 2023 • Xin Chen, Houwen Peng, Dong Wang, Huchuan Lu, Han Hu
In this paper, we present a new sequence-to-sequence learning framework for visual tracking, dubbed SeqTrack.
Ranked #2 on Visual Object Tracking on TNL2K
no code implementations • 24 Apr 2023 • Pengcheng Ai, Le Xiao, Zhi Deng, Yi Wang, Xiangming Sun, Guangming Huang, Dong Wang, Yulei Li, Xinchi Ran
We mathematically demonstrate the existence of the optimal function desired by the method, and give a systematic algorithm for training and calibration of the model.
no code implementations • 17 Apr 2023 • Xiaowen Shi, Ze Wang, Yuanying Cai, Xiaoxu Wu, Fan Yang, Guogang Liao, Yongkang Wang, Xingxing Wang, Dong Wang
Two types of data are employed to train a reinforcement learning (RL) model for position allocation: strategy data and random data.
1 code implementation • 12 Apr 2023 • Dong Wang, Jia Guo, Qiqi Shao, Haochi He, Zhian Chen, Chuanbao Xiao, Ajian Liu, Sergio Escalera, Hugo Jair Escalante, Zhen Lei, Jun Wan, Jiankang Deng
Leveraging the WFAS dataset and Protocol 1 (Known-Type), we host the Wild Face Anti-Spoofing Challenge at the CVPR2023 workshop.
no code implementations • CVPR 2023 • Weichuang Li, Longhao Zhang, Dong Wang, Bin Zhao, Zhigang Wang, Mulin Chen, Bang Zhang, Zhongjian Wang, Liefeng Bo, Xuelong Li
Talking head generation aims to generate faces that maintain the identity information of the source image and imitate the motion of the driving image.
1 code implementation • 3 Apr 2023 • Xiangyang Zhu, Renrui Zhang, Bowei He, Aojun Zhou, Dong Wang, Bin Zhao, Peng Gao
The popularity of Contrastive Language-Image Pre-training (CLIP) has propelled its application to diverse downstream vision tasks.
1 code implementation • 31 Mar 2023 • Delin Qu, Yizhen Lao, Zhigang Wang, Dong Wang, Bin Zhao, Xuelong Li
This paper addresses the problem of rolling shutter correction in complex nonlinear and dynamic scenes with extreme occlusion.
1 code implementation • 29 Mar 2023 • Zoey Guo, Yiwen Tang, Ray Zhang, Dong Wang, Zhigang Wang, Bin Zhao, Xuelong Li
In this paper, we propose ViewRefer, a multi-view framework for 3D visual grounding exploring how to grasp the view knowledge from both text and 3D modalities.
no code implementations • CVPR 2023 • Yihao Wang, Zhigang Wang, Bin Zhao, Dong Wang, Mulin Chen, Xuelong Li
In contrast, we propose a purely passive method to track a person walking in an invisible room by only observing a relay wall, which is more in line with real application scenarios, e.g., security.
1 code implementation • CVPR 2023 • Jiawen Zhu, Simiao Lai, Xin Chen, Dong Wang, Huchuan Lu
To inherit the powerful representations of the foundation model, a natural modus operandi for multi-modal tracking is full fine-tuning on the RGB-based parameters.
1 code implementation • CVPR 2023 • Haozhe Si, Bin Zhao, Dong Wang, Yunpeng Gao, Mulin Chen, Zhigang Wang, Xuelong Li
We show that our framework circumvents the need for depth and AIF image ground truth and yields superior predictions, thus closing the gap between the theoretical success of DFD works and their applications in the real world.
1 code implementation • 17 Mar 2023 • Dongsheng Wang, Xu Jia, Yang Zhang, Xinyu Zhang, Yaoyuan Wang, Ziyang Zhang, Dong Wang, Huchuan Lu
To fully exploit information with event streams to detect objects, a dual-memory aggregation network (DMANet) is proposed to leverage both long and short memory along event streams to aggregate effective information for object detection.
1 code implementation • CVPR 2023 • Bin Yan, Yi Jiang, Jiannan Wu, Dong Wang, Ping Luo, Zehuan Yuan, Huchuan Lu
All instance perception tasks aim at finding certain objects specified by some queries such as category names, language expressions, and target annotations, but this complete field has been split into multiple independent subtasks.
Generalized Referring Expression Comprehension
Multi-Object Tracking and Segmentation
1 code implementation • 6 Feb 2023 • Xiaowen Shi, Fan Yang, Ze Wang, Xiaoxu Wu, Muzhi Guan, Guogang Liao, Yongkang Wang, Xingxing Wang, Dong Wang
Then we design a novel omnidirectional attention mechanism in OCPM to capture the context information in the permutation.
no code implementations • 1 Feb 2023 • Jian Dong, Yisong Yu, Yapeng Zhang, Yimin Lv, Shuli Wang, Beihong Jin, Yongkang Wang, Xingxing Wang, Dong Wang
User behaviors on an e-commerce app not only contain different kinds of feedback on items but also sometimes imply the cognitive clue of the user's decision-making.
no code implementations • 29 Jan 2023 • Xiang Li, Shuwei Chen, Jian Dong, Jin Zhang, Yongkang Wang, Xingxing Wang, Dong Wang
Click-through rate (CTR) prediction is crucial in recommendation and online advertising systems.
1 code implementation • CVPR 2023 • Haojie Zhao, Dong Wang, Huchuan Lu
However, for the template, we make the decoder reconstruct the target appearance within the search region.
no code implementations • 28 Nov 2022 • Hao Zhou, Shaoming Li, Guibin Jiang, Jiaqi Zheng, Dong Wang
Our key intuition is that we introduce the decision factor to establish a bridge between ML and OR such that the solution can be directly obtained in OR by only performing the sorting or comparison operations on the decision factor.
no code implementations • 25 Oct 2022 • Katy Craig, Braxton Osting, Dong Wang, Yiming Xu
We prove a consistency result for the regularized problem, ensuring that if the data are iid samples from a probability measure, then as the number of samples is increased, a subsequence of the archetype points converges to the archetype points for the limiting data distribution, almost surely.
1 code implementation • 19 Oct 2022 • Zhenrui Yue, Huimin Zeng, Bernhard Kratzwald, Stefan Feuerriegel, Dong Wang
Unlike existing approaches, we generate pseudo labels and propose to train the model via a novel attention-based contrastive adaptation method.
no code implementations • 6 Oct 2022 • Huimin Zeng, Zhenrui Yue, Ziyi Kou, Lanyu Shang, Yang Zhang, Dong Wang
Moreover, we leverage the power of domain adversarial examples to establish an intermediate domain mixup, where the latent representations of the input text from both domains could be mixed during the training process.
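As a rough illustration of the latent mixup mentioned in this abstract, the sketch below linearly interpolates two representation vectors. It assumes the standard mixup form (a convex combination with weight `lam`); the abstract does not state the paper's exact mixing rule, and the function name `mixup_latents` and its parameters are hypothetical.

```python
def mixup_latents(source_vec, target_vec, lam):
    """Convex combination of two latent vectors (standard mixup form).

    Assumption: lam in [0, 1] weights the source-domain representation;
    the paper's actual mixing scheme is not given in the abstract.
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    if len(source_vec) != len(target_vec):
        raise ValueError("vectors must have the same length")
    return [lam * s + (1.0 - lam) * t for s, t in zip(source_vec, target_vec)]


# Example: an even mix of a source-domain and a target-domain representation.
mixed = mixup_latents([1.0, 2.0], [3.0, 4.0], lam=0.5)
```

With `lam=1.0` the result is the source representation and with `lam=0.0` the target one, so intermediate values trace a path between the two domains.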
no code implementations • 3 Oct 2022 • Huimin Zeng, Zhenrui Yue, Yang Zhang, Ziyi Kou, Lanyu Shang, Dong Wang
In many applications with real-world consequences, it is crucial to develop reliable uncertainty estimation for the predictions made by the AI decision systems.
no code implementations • 26 Sep 2022 • Tingyu Fan, Linyao Gao, Yiling Xu, Dong Wang, Zhu Li
Besides, we propose a residual coding framework for the compression of the latent variable, which explores the spatial correlation of each layer by progressive downsampling, and model the corresponding residual with a fully-factorized entropy model.
1 code implementation • 13 Sep 2022 • Dong Wang, Zhao Zhang, Ziwei Zhao, Yuhang Liu, Yihong Chen, LiWei Wang
Inspired by this, we propose PointScatter, an alternative to the segmentation models for the tubular structure extraction task.
no code implementations • 13 Sep 2022 • Ziwei Zhao, Dong Wang, Yihong Chen, Ziteng Wang, LiWei Wang
In mammogram mass detection, modeling pairwise lesion correspondence explicitly is particularly important.
1 code implementation • COLING 2022 • Zhenrui Yue, Huimin Zeng, Ziyi Kou, Lanyu Shang, Dong Wang
In this work, we investigate the potential benefits of question classification for QA domain adaptation.
1 code implementation • 5 Sep 2022 • Qian Chen, Xingjian Dong, Guowei Tu, Dong Wang, Baoxuan Zhao, Zhike Peng
However, the CNN is a typical black-box model, and the mechanism of its decision-making is not clear, which limits its application in fault diagnosis scenarios that require high reliability.
2 code implementations • 20 Aug 2022 • Zhenrui Yue, Huimin Zeng, Ziyi Kou, Lanyu Shang, Dong Wang
However, early misinformation often demonstrates both conditional and label shifts against existing misinformation data (e.g., class imbalance in COVID-19 datasets), rendering such methods less effective for detecting early misinformation.
no code implementations • 11 Aug 2022 • Yuxiang Shi, Yue Ding, Bo Chen, YuYang Huang, Ruiming Tang, Dong Wang
In this paper, we propose a Task aligned Meta-learning based Augmented Graph (TMAG) to address cold-start recommendation.
1 code implementation • 19 Jul 2022 • Zhenrui Yue, Huimin Zeng, Ziyi Kou, Lanyu Shang, Dong Wang
Additionally, we design an adversarial training method tailored for sequential recommender systems.
1 code implementation • 14 Jul 2022 • Bin Yan, Yi Jiang, Peize Sun, Dong Wang, Zehuan Yuan, Ping Luo, Huchuan Lu
We present a unified method, termed Unicorn, that can simultaneously solve four tracking problems (SOT, MOT, VOS, MOTS) with a single network using the same model parameters.
Multi-Object Tracking
Multi-Object Tracking and Segmentation
no code implementations • 10 Jul 2022 • Jiawen Zhu, Xin Chen, Pengyu Zhang, Xinying Wang, Dong Wang, Wenda Zhao, Huchuan Lu
Trackers tend to lose the target object due to the limited search region or be interfered with by distractors due to the excessive search region.
3 code implementations • 28 May 2022 • Renrui Zhang, Ziyu Guo, Rongyao Fang, Bin Zhao, Dong Wang, Yu Qiao, Hongsheng Li, Peng Gao
By fine-tuning on downstream tasks, Point-M2AE achieves 86.43% accuracy on ScanObjectNN, +3.36% to the second-best, and largely benefits the few-shot classification, part segmentation and 3D object detection with the hierarchical pre-training scheme.
Ranked #3 on 3D Point Cloud Linear Classification on ModelNet40 (using extra training data)
1 code implementation • 22 May 2022 • Jie Zhao, Jingshu Zhang, Dongdong Li, Dong Wang
It contains a detection dataset with a total of 10,000 images and a tracking dataset with 20 videos that include short-term and long-term sequences.
no code implementations • 20 May 2022 • Guogang Liao, Xuejian Li, Ze Wang, Fan Yang, Muzhi Guan, Bingqi Zhu, Yongkang Wang, Xingxing Wang, Dong Wang
Although VCG-based multi-slot auctions (e.g., VCG, WVCG) make it theoretically possible to model global externalities (e.g., the order and positions of ads), they lack an efficient balance of both revenue and social welfare.
1 code implementation • 2 May 2022 • Tingyu Fan, Linyao Gao, Yiling Xu, Zhu Li, Dong Wang
This paper proposes a novel 3D sparse convolution-based Deep Dynamic Point Cloud Compression (D-DPCC) network to compensate and compress the DPC geometry with 3D motion estimation and motion compensation in the feature space.
1 code implementation • 19 Apr 2022 • Atoosa Parsa, Dong Wang, Corey S. O'Hern, Mark D. Shattuck, Rebecca Kramer-Bottiglio, Josh Bongard
Granular metamaterials are a promising choice for the realization of mechanical computing devices.
no code implementations • CVPR 2022 • Pengyu Zhang, Jie Zhao, Dong Wang, Huchuan Lu, Xiang Ruan
With the popularity of multi-modal sensors, visible-thermal (RGB-T) object tracking aims to achieve robust performance and wider application scenarios with the guidance of objects' temperature information.
no code implementations • 2 Apr 2022 • Ze Wang, Guogang Liao, Xiaowen Shi, Xiaoxu Wu, Chuheng Zhang, Bingqi Zhu, Yongkang Wang, Xingxing Wang, Dong Wang
Ads allocation, which involves allocating ads and organic items to limited slots in feed with the purpose of maximizing platform revenue, has become a research hotspot.
no code implementations • 2 Apr 2022 • Ze Wang, Guogang Liao, Xiaowen Shi, Xiaoxu Wu, Chuheng Zhang, Yongkang Wang, Xingxing Wang, Dong Wang
With the recent prevalence of reinforcement learning (RL), there has been tremendous interest in utilizing RL for ads allocation in recommendation platforms (e.g., e-commerce and news feed sites).
no code implementations • 1 Apr 2022 • Guogang Liao, Xiaowen Shi, Ze Wang, Xiaoxu Wu, Chuheng Zhang, Yongkang Wang, Xingxing Wang, Dong Wang
A mixed list of ads and organic items is usually displayed in feed and how to allocate the limited slots to maximize the overall revenue is a key problem.
no code implementations • 29 Mar 2022 • Zhenrui Yue, Huimin Zeng, Ziyi Kou, Lanyu Shang, Dong Wang
Modern smart sensor-based energy management systems leverage non-intrusive load monitoring (NILM) to predict and optimize appliance load distribution in real-time.
1 code implementation • CVPR 2022 • Xiaokang Peng, Yake Wei, Andong Deng, Dong Wang, Di Hu
Multimodal learning helps to comprehensively understand the world, by integrating different senses.
1 code implementation • 25 Mar 2022 • Xin Chen, Ben Kang, Dong Wang, Dongdong Li, Huchuan Lu
Most state-of-the-art trackers are satisfied with the real-time speed on powerful GPUs.
1 code implementation • 25 Mar 2022 • Xin Chen, Bin Yan, Jiawen Zhu, Huchuan Lu, Xiang Ruan, Dong Wang
First, we present a transformer tracking (named TransT) method based on the Siamese-like feature extraction backbone, the designed attention-based fusion mechanism, and the classification and regression head.
no code implementations • 12 Mar 2022 • Kang Xu, Xiaoqiu Lu, Yuan-Fang Li, Tongtong Wu, Guilin Qi, Ning Ye, Dong Wang, Zheng Zhou
NTM-DMIE is a neural network method for topic learning which maximizes the mutual information between the input documents and their latent topic representation.
1 code implementation • 8 Mar 2022 • Mingqi Yuan, Man-on Pun, Dong Wang
One of the most critical challenges in deep reinforcement learning is to maintain the long-term exploration capability of the agent.
1 code implementation • CVPR 2022 • Bing Liu, Dong Wang, Xu Yang, Yong Zhou, Rui Yao, Zhiwen Shao, Jiaqi Zhao
In the encoding stage, the IOD is able to disentangle the region-based visual features by deconfounding the visual confounder.
1 code implementation • 21 Oct 2021 • Yepeng Liu, Zaiwang Gu, Shenghua Gao, Dong Wang, Yusheng Zeng, Jun Cheng
Very often, the pose is estimated after the face detection.
1 code implementation • 18 Oct 2021 • Lantian Li, Ruiqian Nai, Dong Wang
The additive margin softmax (AM-Softmax) loss has delivered remarkable performance in speaker verification.
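For readers unfamiliar with the loss named in this abstract, here is a minimal sketch of AM-Softmax for a single example in its commonly used form: the target-class cosine is reduced by a margin `m` before scaling by `s` and applying softmax cross-entropy. The function name and the default values `scale=30.0` and `margin=0.35` are illustrative assumptions, not settings taken from this paper.

```python
import math


def am_softmax_loss(cosines, target, scale=30.0, margin=0.35):
    """AM-Softmax loss for one example.

    cosines[j] is cos(theta_j) between the length-normalized embedding and
    the normalized weight vector of class j. Only the target class has the
    additive margin subtracted from its cosine.
    """
    logits = [
        scale * (c - margin) if j == target else scale * c
        for j, c in enumerate(cosines)
    ]
    # Numerically stable log-sum-exp over all class logits.
    top = max(logits)
    log_sum_exp = top + math.log(sum(math.exp(z - top) for z in logits))
    return log_sum_exp - logits[target]


# With margin=0 this reduces to ordinary scaled softmax cross-entropy;
# a positive margin makes the same example harder, so the loss grows.
plain = am_softmax_loss([0.8, 0.1, -0.2], target=0, margin=0.0)
with_margin = am_softmax_loss([0.8, 0.1, -0.2], target=0, margin=0.35)
```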
no code implementations • 18 Oct 2021 • Haoran Sun, Chen Chen, Lantian Li, Dong Wang
SpeechFlow is a powerful factorization model based on information bottleneck (IB), and its effectiveness has been reported by several studies.
no code implementations • 29 Sep 2021 • Chao Xing, Dong Wang, LiRong Dai, Qun Liu, Anderson Avila
Overparameterized transformer-based architectures have shown remarkable performance in recent years, achieving state-of-the-art results in speech processing tasks such as speech recognition, speech synthesis, keyword spotting, and speech enhancement.
no code implementations • 29 Sep 2021 • Jingpu Shi, Dong Wang, Gino Tesei, Beau Norgeot
Validation of these models, however, has been a challenge because the ground truth is unknown: only one treatment-outcome pair for each person can be observed.
no code implementations • 28 Sep 2021 • Yunzhe Li, Yue Ding, Bo Chen, Xin Xin, Yule Wang, Yuxiang Shi, Ruiming Tang, Dong Wang
In this paper, we propose a novel time-aware sequential recommendation framework called Social Temporal Excitation Networks (STEN), which introduces temporal point processes to model the fine-grained impact of friends' behaviors on the user's dynamic interests in an event-level direct paradigm.
no code implementations • 27 Sep 2021 • Yule Wang, Xin Xin, Yue Ding, Yunzhe Li, Dong Wang
Specifically, our item cluster-wise optimization target requires the recommender model to balance all item clusters that differ in popularity, so we treat model learning on each item cluster as a separate optimization objective.
no code implementations • 26 Sep 2021 • Yule Wang, Qiang Luo, Yue Ding, Dong Wang, Hongbo Deng
In this paper, we propose a novel model named DemiNet (short for DEpendency-Aware Multi-Interest Network) to address the above two issues.
1 code implementation • 9 Sep 2021 • Guogang Liao, Ze Wang, Xiaoxu Wu, Xiaowen Shi, Chuheng Zhang, Yongkang Wang, Xingxing Wang, Dong Wang
Our model results in higher revenue and better user experience than state-of-the-art baselines in offline experiments.
no code implementations • 13 Aug 2021 • Minghui Huang, Wei Peng, Dong Wang
Ranking models have achieved promising results, but it remains challenging to design personalized ranking systems to leverage user profiles and semantic representations between queries and documents.
no code implementations • 13 Aug 2021 • Minghui Huang, Dong Wang, Shuang Liu, Meizhen Ding
To leverage the strength of text generation for information retrieval, in this article, we propose a novel approach which effectively integrates text generation models into PRF-based query expansion.
no code implementations • 12 Aug 2021 • Ruijian Han, Braxton Osting, Dong Wang, Yiming Xu
Archetypal analysis is an unsupervised learning method for exploratory data analysis.
1 code implementation • ICCV 2021 • Kenan Dai, Jie Zhao, Lijun Wang, Dong Wang, Jianhua Li, Huchuan Lu, Xuesheng Qian, Xiaoyun Yang
Deep learning based visual trackers entail offline pre-training on large volumes of video datasets with accurate bounding box annotations that are labor-expensive to achieve.
no code implementations • 23 Jul 2021 • Binling Wang, Wenxuan Hu, Jing Li, Yiming Zhi, Zheng Li, Qingyang Hong, Lin Li, Dong Wang, Liming Song, Cheng Yang
In addition to the Language Identification (LID) tasks, multilingual Automatic Speech Recognition (ASR) tasks are introduced to OLR 2021 Challenge for the first time.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
no code implementations • 19 Jul 2021 • Mingqi Yuan, Mon-on Pun, Dong Wang, Yi Chen, Haojun Li
Furthermore, we leverage a variational auto-encoder (VAE) model to capture the life-long novelty of states, which is combined with the global JFI score to form multimodal intrinsic rewards.
no code implementations • 5 Jul 2021 • Jing Li, Binling Wang, Yiming Zhi, Zheng Li, Lin Li, Qingyang Hong, Dong Wang
The fifth Oriental Language Recognition (OLR) Challenge focuses on language recognition in a variety of complex environments to promote its development.
1 code implementation • ICLR 2022 • Qitong Gao, Dong Wang, Joshua D. Amason, Siyang Yuan, Chenyang Tao, Ricardo Henao, Majda Hadziahmetovic, Lawrence Carin, Miroslav Pajic
Though recent works have developed methods that can generate estimates (or imputations) of the missing entries in a dataset to facilitate downstream analysis, most depend on assumptions that may not align with real-world applications and could suffer from poor performance in subsequent tasks such as classification.
1 code implementation • 2 Jul 2021 • Qing Guo, Junya Chen, Dong Wang, Yuewei Yang, Xinwei Deng, Lawrence Carin, Fan Li, Jing Huang, Chenyang Tao
Successful applications of InfoNCE and its variants have popularized the use of contrastive variational mutual information (MI) estimators in machine learning.
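The InfoNCE estimator referred to in this abstract can be sketched in its usual form: given a K×K matrix of critic scores whose diagonal holds the positive pairs, average the log-ratio of each positive score to the mean exponentiated score in its row. The function name `infonce_estimate` and the score-matrix layout are assumptions made for illustration; the paper's own variants are not reproduced here.

```python
import math


def infonce_estimate(scores):
    """InfoNCE lower-bound estimate of mutual information (in nats).

    scores[i][j] is a critic value f(x_i, y_j); the diagonal entries are
    the positive (jointly drawn) pairs and the rest serve as negatives.
    The estimate can never exceed log(K) for a K x K score matrix.
    """
    k = len(scores)
    total = 0.0
    for i, row in enumerate(scores):
        if len(row) != k:
            raise ValueError("scores must be a square matrix")
        top = max(row)
        # Stable log of the row-wise mean of exp(score).
        log_mean_exp = top + math.log(sum(math.exp(s - top) for s in row) / k)
        total += row[i] - log_mean_exp
    return total / k


# An uninformative critic gives 0; a sharply diagonal one approaches log(K).
flat = infonce_estimate([[0.0] * 4 for _ in range(4)])
sharp = infonce_estimate(
    [[10.0 if i == j else 0.0 for j in range(4)] for i in range(4)]
)
```

The `log(K)` ceiling is the reason batch size matters for this family of estimators: with few negatives, a large mutual information cannot be reported no matter how good the critic is.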
1 code implementation • ACL 2021 • Dong Wang, Ning Ding, Piji Li, Hai-Tao Zheng
Recent works aimed to improve the robustness of pre-trained models mainly focus on adversarial training from perturbed examples with similar semantics, neglecting the utilization of different or even opposite semantics.
no code implementations • 21 Jun 2021 • Lanyu Shang, Yang Zhang, Yuheng Zha, Yingxi Chen, Christina Youn, Dong Wang
To address the above challenges, we develop a deep learning based Analogy-aware Offensive Meme Detection (AOMD) framework to learn the implicit analogy from the multi-modal contents of the meme and effectively detect offensive analogy memes.
1 code implementation • 15 Jun 2021 • Haibin Wu, Yang Zhang, Zhiyong Wu, Dong Wang, Hung-Yi Lee
Automatic speaker verification (ASV) is a well-developed technology for biometric identification and has been ubiquitously implemented in security-critical applications, such as banking and access control.
no code implementations • 10 Jun 2021 • Chongwei Liu, Haojie Li, Shuchang Wang, Ming Zhu, Dong Wang, Xin Fan, Zhihui Wang
Towards these challenges we introduce a dataset, Detecting Underwater Objects (DUO), and a corresponding benchmark, based on the collection and re-annotation of all relevant datasets.
no code implementations • 20 May 2021 • Nihal Potdar, Anderson R. Avila, Chao Xing, Dong Wang, Yiran Cao, Xiao Chen
In this paper, we propose a streaming end-to-end framework that can process multiple intentions in an online and incremental way.
1 code implementation • CVPR 2021 • Bin Yan, Houwen Peng, Kan Wu, Dong Wang, Jianlong Fu, Huchuan Lu
Object tracking has achieved significant progress over the past few years.
1 code implementation • ICCV 2021 • Bin Yan, Houwen Peng, Jianlong Fu, Dong Wang, Huchuan Lu
In this paper, we present a new tracking architecture with an encoder-decoder transformer as the key component.
Ranked #14 on Visual Object Tracking on TrackingNet
1 code implementation • CVPR 2021 • Xin Chen, Bin Yan, Jiawen Zhu, Dong Wang, Xiaoyun Yang, Huchuan Lu
The correlation operation is a simple fusion manner to consider the similarity between the template and the search region.
Ranked #5 on Visual Tracking on TNL2K
1 code implementation • 19 Mar 2021 • Zhe Xie, Chengxuan Liu, Yichi Zhang, Hongtao Lu, Dong Wang, Yue Ding
To solve the above problem, in this work, we propose a novel method called Adversarial and Contrastive Variational Autoencoder (ACVAE) for sequential recommendation.
1 code implementation • 10 Mar 2021 • Botao He, Haojia Li, Siyuan Wu, Dong Wang, Zhiwei Zhang, Qianli Dong, Chao Xu, Fei Gao
The bottleneck of solving this problem is the accurate perception of rapid dynamic objects.
Motion Compensation
Robust Object Detection
Robotics
1 code implementation • ICCV 2021 • Yingquan Wang, Pingping Zhang, Shang Gao, Xia Geng, Hu Lu, Dong Wang
Video-based person re-identification aims to associate the video clips of the same person across multiple non-overlapping cameras.
Ranked #1 on Person Re-Identification on DukeMTMC-VideoReID
no code implementations • 1 Jan 2021 • Liqun Chen, Yizhe Zhang, Dianqi Li, Chenyang Tao, Dong Wang, Lawrence Carin
There has been growing interest in representation learning for text data, based on theoretical arguments and empirical evidence.
1 code implementation • 20 Dec 2020 • Faisal M. Almutairi, Yunlong Wang, Dong Wang, Emily Zhao, Nicholas D. Sidiropoulos
In many applications, the categories of items exhibit a hierarchical tree structure.
no code implementations • CVPR 2021 • Liqun Chen, Dong Wang, Zhe Gan, Jingjing Liu, Ricardo Henao, Lawrence Carin
The primary goal of knowledge distillation (KD) is to encapsulate the information of a model learned from a teacher network into a student network, with the latter being more compact than the former.
Ranked #7 on Knowledge Distillation on CIFAR-100
1 code implementation • 14 Dec 2020 • Dong Wang, Di Hu, Xingjian Li, Dejing Dou
The main reason is that the large number of nodes (i.e., video frames) makes it hard for GCNs to capture and model temporal relations in videos.
Ranked #22 on Action Segmentation on Breakfast
1 code implementation • CVPR 2021 • Bin Yan, Xinyu Zhang, Dong Wang, Huchuan Lu, Xiaoyun Yang
Many recent trackers adopt the multiple-stage tracking strategy to improve the quality of bounding box estimation.
Ranked #13 on Semi-Supervised Video Object Segmentation on VOT2020
Semi-Supervised Video Object Segmentation
Visual Object Tracking
2 code implementations • 8 Dec 2020 • Pengyu Zhang, Dong Wang, Huchuan Lu
Visual object tracking, as a fundamental task in computer vision, has drawn much attention in recent years.
no code implementations • 6 Dec 2020 • Dong Wang, Yuewei Yang, Chenyang Tao, Zhe Gan, Liqun Chen, Fanjie Kong, Ricardo Henao, Lawrence Carin
Deep neural networks excel at comprehending complex visual signals, delivering on par or even superior performance to that of human experts.
no code implementations • 1 Dec 2020 • Ming Lu, Tong Chen, Zhenyu Dai, Dong Wang, Dandan Ding, Zhan Ma
This paper proposes a decoder-side Cross Resolution Synthesis (CRS) module to pursue better compression efficiency beyond the latest Versatile Video Coding (VVC), where we encode intra frames at original high resolution (HR), compress inter frames at a lower resolution (LR), and then super-resolve decoded LR inter frames with the help from preceding HR intra and neighboring LR inter frames.
no code implementations • COLING 2020 • Dongming Sheng, Dong Wang, Ying Shen, Haitao Zheng, Haozhuang Liu
Local dependencies, which capture short-term emotional effects between neighbouring utterances, are further injected via an Aggregation Graph to distinguish the subtle differences between utterances containing emotional phrases.
Ranked #20 on Emotion Recognition in Conversation on IEMOCAP
no code implementations • COLING 2020 • Dong Wang, Ziran Li, Haitao Zheng, Ying Shen
Dialogue Act Recognition (DAR) is a challenging problem in Natural Language Understanding, which aims to attach Dialogue Act (DA) labels to each utterance in a conversation.
no code implementations • 4 Nov 2020 • Ying Shi, Haolin Chen, Zhiyuan Tang, Lantian Li, Dong Wang, Jiqing Han
Recently, speech enhancement (SE) based on deep speech prior has attracted much attention, such as the variational auto-encoder with non-negative matrix factorization (VAE-NMF) architecture.
1 code implementation • 30 Oct 2020 • Yunqi Cai, Lantian Li, Dong Wang, Andrew Abel
In this paper, we argue that this problem is largely attributed to the maximum-likelihood (ML) training criterion of the DNF model, which aims to maximize the likelihood of the observations but not necessarily improve the Gaussianality of the latent codes.
1 code implementation • 30 Oct 2020 • Yunqi Cai, Dong Wang
Limited by its linear form and the underlying Gaussian assumption, however, LDA is not applicable in situations where the data distribution is complex.
no code implementations • 27 Oct 2020 • Lantian Li, Yang Zhang, Jiawen Kang, Thomas Fang Zheng, Dong Wang
Domain mismatch often occurs in real applications and causes serious performance reduction on speaker verification systems.
no code implementations • 27 Oct 2020 • Haoran Sun, Lantian Li, Yunqi Cai, Yang Zhang, Thomas Fang Zheng, Dong Wang
Various information factors are blended in speech signals, which forms the primary difficulty for most speech information processing tasks.
no code implementations • 16 Oct 2020 • Braxton Osting, Dong Wang, Yiming Xu, Dominique Zosso
Archetypal analysis is an unsupervised learning method that uses a convex polytope to summarize multivariate data.
no code implementations • 10 Oct 2020 • Dong Wang
In this article, we first establish the theory of optimal scores for speaker recognition.
1 code implementation • 30 Jul 2020 • Dong Wang, Bo Jiang, W. K. Chan
Furthermore, WANA proposes a set of test oracles to detect the vulnerabilities in EOSIO and Ethereum smart contracts based on WebAssembly bytecode analysis.
Software Engineering D.2.5
1 code implementation • 4 Jul 2020 • Bin Yan, Dong Wang, Huchuan Lu, Xiaoyun Yang
In recent years, the multiple-stage strategy has become a popular trend for visual tracking.
no code implementations • 4 Jul 2020 • Pengyu Zhang, Jie Zhao, Dong Wang, Huchuan Lu, Xiaoyun Yang
In this study, we propose a novel RGB-T tracking framework by jointly modeling both appearance and motion cues.
no code implementations • 1 Jul 2020 • Jun Ma, Dong Wang, Xiao-Ping Wang, Xiaoping Yang
Active contour models have been widely used in image segmentation, and the level set method (LSM) is the most popular approach for solving the models, via implicitly representing the contour by a level set function.
no code implementations • 4 Jun 2020 • Zheng Li, Miao Zhao, Qingyang Hong, Lin Li, Zhiyuan Tang, Dong Wang, Li-Ming Song, Cheng Yang
Based on Kaldi and PyTorch, recipes for i-vector and x-vector systems are also provided as baselines for the three tasks.
no code implementations • 4 Jun 2020 • Md Tahmid Rashid, Daniel Zhang, Dong Wang
iii) How to efficiently guide the cars to the event locations with little prior knowledge of the road damage caused by the disaster, while also handling the dynamics of the physical world and social media?
no code implementations • 27 May 2020 • Dong Wang, Kexin Zhang, Jia Ding, Li-Wei Wang
In clinical practice, the Tanner and Whitehouse (TW2) method is widely used by radiologists to perform BAA.
no code implementations • 25 May 2020 • Dong Wang
In this paper, we develop an efficient iterative method on a variational model for the surface reconstruction from point clouds.
1 code implementation • 25 May 2020 • Jiawen Kang, Ruiqi Liu, Lantian Li, Yunqi Cai, Dong Wang, Thomas Fang Zheng
Domain generalization remains a critical problem for speaker recognition, even with the state-of-the-art architectures based on deep neural nets.
Audio and Speech Processing
1 code implementation • ACL 2020 • Wentao Ma, Yiming Cui, Ting Liu, Dong Wang, Shijin Wang, Guoping Hu
Human conversations contain many types of information, e.g., knowledge, common sense, and language habits.
no code implementations • 22 Apr 2020 • Dong Wang, Xiaoqian Qin, Fengyi Song, Li Cheng
Generative adversarial networks (GANs), famous for their capability of learning complex underlying data distributions, are nevertheless known to be tricky to train, which can result in mode collapse or performance deterioration.
no code implementations • 9 Apr 2020 • Md Tahmid Rashid, Dong Wang
In this vision paper, we discuss the roles of CovidSens and identify potential challenges in developing reliable social sensing based risk alert systems.
1 code implementation • 7 Apr 2020 • Yunqi Cai, Lantian Li, Dong Wang, Andrew Abel
Deep speaker embedding has demonstrated state-of-the-art performance in speaker recognition tasks.
2 code implementations • CVPR 2020 • Kenan Dai, Yunhua Zhang, Dong Wang, Jianhua Li, Huchuan Lu, Xiaoyun Yang
Most top-ranked long-term trackers adopt offline-trained Siamese architectures; thus, they cannot benefit from the great progress of short-term trackers with online update.
Ranked #7 on
Visual Object Tracking
on LaSOT-ext
1 code implementation • CVPR 2020 • Bin Yan, Dong Wang, Huchuan Lu, Xiaoyun Yang
An effective and efficient perturbation generator is trained with a carefully designed adversarial loss, which can simultaneously cool hot regions where the target exists on the heatmaps and force the predicted bounding box to shrink, making the tracked target invisible to trackers.
no code implementations • CVPR 2020 • Dong Wang, Yuan Zhang, Kexin Zhang, Li-Wei Wang
Applying artificial intelligence techniques in medical imaging is one of the most promising areas in medicine.
no code implementations • IEEE Access 2020 • Dezhong Xu, Heng Fu, Lifang Wu, Meng Jian, Dong Wang, Xu Liu
Then, we propose two types of inference models, opt-GRU and relation-GRU, which are used to encode the object relationship and motion representation effectively, and form the discriminative frame-level feature representation.
Ranked #4 on
Group Activity Recognition
on Volleyball
no code implementations • 27 Feb 2020 • Ziqi Liu, Dong Wang, Qianyu Yu, Zhiqiang Zhang, Yue Shen, Jian Ma, Wenliang Zhong, Jinjie Gu, Jun Zhou, Shuang Yang, Yuan Qi
In this paper, we present a graph representation learning method on top of transaction networks for merchant incentive optimization in mobile payment marketing.
no code implementations • 12 Feb 2020 • Dong Wang, Feng Zhou, Zheng Yan, Guang Yao, Zongxuan Liu, Wennan Ma, Cewu Lu
Our model builds upon a variational encoder which transforms the input video into a latent feature space and a Luenberger-type observer which captures the dynamic evolution of the latent features.
no code implementations • 26 Jan 2020 • Di Hu, Zheng Wang, Haoyi Xiong, Dong Wang, Feiping Nie, Dejing Dou
Associating a sound with its producer in complex audiovisual scenes is a challenging task, especially when annotated training data is lacking.
no code implementations • 20 Dec 2019 • Xi Liu, Rui Zhang, Yongsheng Zhou, Qianyi Jiang, Qi Song, Nan Li, Kai Zhou, Lei Wang, Dong Wang, Minghui Liao, Mingkun Yang, Xiang Bai, Baoguang Shi, Dimosthenis Karatzas, Shijian Lu, C. V. Jawahar
21 teams submit results for Task 1, 23 teams submit results for Task 2, 24 teams submit results for Task 3, and 13 teams submit results for Task 4.
1 code implementation • NeurIPS 2019 • Chenyang Tao, Liqun Chen, Shuyang Dai, Junya Chen, Ke Bai, Dong Wang, Jianfeng Feng, Wenlian Lu, Georgiy Bobashev, Lawrence Carin
Inference, estimation, sampling and likelihood evaluation are four primary goals of probabilistic modeling.
1 code implementation • Geoscientific Model Development 2019 • Xiaomeng Huang, Xing Huang, Dong Wang, Qi Wu, Yi Li, Shixun Zhang, YuWen Chen, Mingqing Wang, Yuan Gao, Qiang Tang, Yue Chen, Zheng Fang, Zhenya Song, Guangwen Yang
In this work, we design a simple computing library to bridge the gap and decouple the work of ocean modeling from parallel computing.
no code implementations • 10 Nov 2019 • Dhanasekar Sundararaman, Vivek Subramanian, Guoyin Wang, Shijing Si, Dinghan Shen, Dong Wang, Lawrence Carin
Attention-based models have shown significant improvement over traditional algorithms in several NLP tasks.
1 code implementation • CVPR 2019 • Yuxuan Sun, Chong Sun, Dong Wang, You He, Huchuan Lu
The ROI (region-of-interest) based pooling method performs pooling operations on the cropped ROI regions for various samples and has shown great success in object detection methods.
2 code implementations • 31 Oct 2019 • Yue Fan, Jiawen Kang, Lantian Li, Kaicheng Li, Haolin Chen, Sitong Cheng, Pengyuan Zhang, Ziya Zhou, Yunqi Cai, Dong Wang
These datasets tend to deliver overly optimistic performance and do not meet the needs of research on speaker recognition in unconstrained conditions.
no code implementations • 29 Oct 2019 • Haoran Sun, Yunqi Cai, Lantian Li, Dong Wang
Speech signals are complex composites of various information, including phonetic content, speaker traits, channel effect, etc.
1 code implementation • International Conference on Computer Vision Workshops 2019 • Dawei Du, Pengfei Zhu, Longyin Wen, Xiao Bian, Haibin Lin, Qinghua Hu, Tao Peng, Jiayu Zheng, Xinyao Wang, Yue Zhang, Liefeng Bo, Hailin Shi, Rui Zhu, Aashish Kumar, Aijin Li, Almaz Zinollayev, Anuar Askergaliyev, Arne Schumann, Binjie Mao, Byeongwon Lee, Chang Liu, Changrui Chen, Chunhong Pan, Chunlei Huo, Da Yu, Dechun Cong, Dening Zeng, Dheeraj Reddy Pailla, Di Li, Dong Wang, Donghyeon Cho, Dongyu Zhang, Furui Bai, George Jose, Guangyu Gao, Guizhong Liu, Haitao Xiong, Hao Qi, Haoran Wang, Heqian Qiu, Hongliang Li, Huchuan Lu, Ildoo Kim, Jaekyum Kim, Jane Shen, Jihoon Lee, Jing Ge, Jingjing Xu, Jingkai Zhou, Jonas Meier, Jun Won Choi, Junhao Hu, Junyi Zhang, Junying Huang, Kaiqi Huang, Keyang Wang, Lars Sommer, Lei Jin, Lei Zhang
Results of 33 object detection algorithms are presented.
2 code implementations • ICCV 2019 • Peixia Li, Bo-Yu Chen, Wanli Ouyang, Dong Wang, Xiaoyun Yang, Huchuan Lu
In this work, we propose a novel gradient-guided network to exploit the discriminative information in gradients and update the template in the siamese network through feed-forward and backward operations.
Ranked #3 on
Visual Object Tracking
on OTB-2015
(Precision metric)
no code implementations • 11 Sep 2019 • Yang Zhang, Daniel Zhang, Nathan Vance, Dong Wang
Social sensing has emerged as a new sensing paradigm where humans (or devices on their behalf) collectively report measurements about the physical world.
1 code implementation • ICCV 2019 • Bin Yan, Haojie Zhao, Dong Wang, Huchuan Lu, Xiaoyun Yang
In this work, we present a novel robust and real-time long-term tracking framework based on the proposed skimming and perusal modules.
no code implementations • 27 Aug 2019 • Xueyi Wang, Lantian Li, Dong Wang
By forcing the neural model to discriminate the speakers in the training set, deep speaker embeddings (called x-vectors) can be derived from the hidden layers.
1 code implementation • 21 Jul 2019 • Dong Wang, Yicheng Liu, Wenwo Tang, Fanhua Shang, Hongying Liu, Qigong Sun, Licheng Jiao
In this paper, we propose a new first-order gradient-based algorithm to train deep neural networks.
no code implementations • 17 Jul 2019 • Lanyu Shang, Daniel Zhang, Michael Wang, Shuyue Lai, Dong Wang
Current clickbait detection solutions that mainly focus on analyzing the text of the title, the image of the thumbnail, or the content of the video are shown to be suboptimal in detecting the online clickbait videos.
no code implementations • 16 Jul 2019 • Zhiyuan Tang, Dong Wang, Li-Ming Song
The participants can refer to these online-published recipes to deploy LID systems for convenience.
3 code implementations • 5 Jul 2019 • Junyu Gao, Wei Lin, Bin Zhao, Dong Wang, Chenyu Gao, Jun Wen
This technical report attempts to provide an efficient and solid toolkit for the field of crowd counting, which is denoted as Crowd Counting Code Framework (C$^3$F).
no code implementations • 24 Jun 2019 • Dong Wang, Yitong Li, Wei Cao, Liqun Chen, Qi Wei, Lawrence Carin
We propose a Leaked Motion Video Predictor (LMVP) to predict future frames by capturing the spatial and temporal dependencies from given inputs.
no code implementations • 18 Jun 2019 • Dong Wang, Lei Zhou, Xiao Bai, Jun Zhou
Our method accelerates the network in a one-step pruning-recovery manner with a novel optimization objective function, which achieves higher accuracy at much lower cost compared with existing pruning methods.
no code implementations • 9 Jun 2019 • Benlin Hu, Cheng Lei, Dong Wang, Shu Zhang, Zhenyu Chen
Deep learning models have a large number of free parameters that need to be calculated by effective training of the models on a great deal of training data to improve their generalization performance.
1 code implementation • 29 May 2019 • Haoyu Song, Wei-Nan Zhang, Yiming Cui, Dong Wang, Ting Liu
Given conversational context with persona information, how a chatbot can exploit that information to generate diverse and sustainable conversations is still a non-trivial task.
no code implementations • 28 May 2019 • Tianle Cai, Ruiqi Gao, Jikai Hou, Siyu Chen, Dong Wang, Di He, Zhihua Zhang, Li-Wei Wang
First-order methods such as stochastic gradient descent (SGD) are currently the standard algorithm for training deep neural networks.
no code implementations • ICLR 2019 • Ke Xu, Xiao-Yun Wang, Qun Jia, Jianjing An, Dong Wang
Therefore, accumulating the saliency of the filter over the entire data set can provide more accurate guidance for pruning.
no code implementations • 30 Apr 2019 • Dong Wang, Yuan Yuan, Qi Wang
Action Prediction is aimed to determine what action is occurring in a video as early as possible, whi