no code implementations • Findings (ACL) 2022 • Xianghong Fang, Jian Li, Lifeng Shang, Xin Jiang, Qun Liu, Dit-yan Yeung
While variational autoencoders (VAEs) have been widely applied in text generation tasks, they are troubled by two challenges: insufficient representation capacity and poor controllability.
1 code implementation • EMNLP 2020 • Nedjma Ousidhoum, Yangqiu Song, Dit-yan Yeung
Work on bias in hate speech typically aims to improve classification performance while relatively overlooking the quality of the data.
no code implementations • ICCV 2023 • Xuechao Chen, Shuangjie Xu, Xiaoyi Zou, Tongyi Cao, Dit-yan Yeung, Lu Fang
To take full advantage of the historical frames high-efficiently, we shunt the historical points into two groups with reference to the current points.
no code implementations • 4 Jul 2023 • Junjie Wu, Dit-yan Yeung
Specifically, SCAT modifies random augmentations of the data in a fully labelfree manner to generate adversarial examples.
no code implementations • 7 Jun 2023 • Kai Chen, Enze Xie, Zhe Chen, Lanqing Hong, Zhenguo Li, Dit-yan Yeung
However, the usage of diffusion models to generate high-quality object detection data remains an underexplored area, where not only the image-level perceptual quality but also geometric conditions such as bounding boxes and camera views are essential.
no code implementations • CVPR 2023 • Kai Chen, Zhili Liu, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-yan Yeung
Specifically, our MixedAE outperforms MAE by +0. 3% accuracy, +1. 7 mIoU and +0. 9 AP on ImageNet-1K, ADE20K and COCO respectively with a standard ViT-Base.
no code implementations • 22 Mar 2023 • Yihan Zeng, Chenhan Jiang, Jiageng Mao, Jianhua Han, Chaoqiang Ye, Qingqiu Huang, Dit-yan Yeung, Zhen Yang, Xiaodan Liang, Hang Xu
Contrastive Language-Image Pre-training, benefiting from large-scale unlabeled text-image pairs, has demonstrated great performance in open-world vision understanding tasks.
no code implementations • 2 Feb 2023 • Chung Yan Fong, Dit-yan Yeung
As such, it has a two-fold advantage: 1) more actual observations can be used for training, and 2) the model can be validated on data which has distribution closer to the expected situation.
no code implementations • CVPR 2023 • Zifan Shi, Yujun Shen, Yinghao Xu, Sida Peng, Yiyi Liao, Sheng Guo, Qifeng Chen, Dit-yan Yeung
Existing methods for 3D-aware image synthesis largely depend on the 3D pose distribution pre-estimated on the training set.
no code implementations • CVPR 2023 • Yihan Zeng, Chenhan Jiang, Jiageng Mao, Jianhua Han, Chaoqiang Ye, Qingqiu Huang, Dit-yan Yeung, Zhen Yang, Xiaodan Liang, Hang Xu
Contrastive Language-Image Pre-training, benefiting from large-scale unlabeled text-image pairs, has demonstrated great performance in open-world vision understanding tasks.
1 code implementation • 28 Nov 2022 • Yusen Sun, Liangyou Li, Qun Liu, Dit-yan Yeung
Although lyrics generation has achieved significant progress in recent years, it has limited practical applications because the generated lyrics cannot be performed without composing compatible melodies.
no code implementations • 30 Sep 2022 • Zifan Shi, Yinghao Xu, Yujun Shen, Deli Zhao, Qifeng Chen, Dit-yan Yeung
We argue that, considering the two-player game in the formulation of GANs, only making the generator 3D-aware is not enough.
no code implementations • 29 Sep 2022 • Meng Qin, Chaorui Zhang, Bo Bai, Gong Zhang, Dit-yan Yeung
The trained model is then directly generalized to new unseen graphs for online CD without additional optimization, where a better trade-off between quality and efficiency can be achieved.
1 code implementation • 12 Jul 2022 • Zhihan Gao, Xingjian Shi, Hao Wang, Yi Zhu, Yuyang Wang, Mu Li, Dit-yan Yeung
With the explosive growth of the spatiotemporal Earth observation data in the past decade, data-driven models that apply Deep Learning (DL) are demonstrating impressive potential for various Earth system forecasting tasks.
Ranked #1 on
Earth Surface Forecasting
on EarthNet2021 OOD Track
1 code implementation • 2 May 2022 • Jeongseok Hyun, Myunggu Kang, Dongyoon Wee, Dit-yan Yeung
The strong edge features allow SGT to track targets with tracking candidates selected by top-K scored detections with large K. As a result, even low-scored detections can be tracked, and the missed detections are also recovered.
Ranked #2 on
Multi-Object Tracking
on HiEve
no code implementations • 15 Mar 2022 • Kaican Li, Kai Chen, Haoyu Wang, Lanqing Hong, Chaoqiang Ye, Jianhua Han, Yukuai Chen, Wei zhang, Chunjing Xu, Dit-yan Yeung, Xiaodan Liang, Zhenguo Li, Hang Xu
One main reason that impedes the development of truly reliably self-driving systems is the lack of public datasets for evaluating the performance of object detectors on corner cases.
no code implementations • 17 Feb 2022 • Zifan Shi, Yujun Shen, Jiapeng Zhu, Dit-yan Yeung, Qifeng Chen
In this way, the discriminator can take the spatial arrangement into account and advise the generator to learn an appropriate depth condition.
no code implementations • 29 Sep 2021 • Chiu Wai Yan, Dit-yan Yeung
We start by looking into the effect of common image augmentation techniques and exploring novel augmentation with the aid of adversarial perturbations.
1 code implementation • ICLR 2022 • Tsz-Him Cheung, Dit-yan Yeung
However, the augmentation policies found are not adaptive to the dataset used, hindering the effectiveness of these AutoDA methods.
no code implementations • 29 Sep 2021 • Meng Qin, Chaorui Zhang, Bo Bai, Gong Zhang, Dit-yan Yeung
IGP is also a generic framework that can capture the permutation invariant partitioning ground-truth of historical snapshots in the offline training and tackle the online GP on graphs with non-fixed number of nodes and clusters.
1 code implementation • ICCV 2021 • Kai Chen, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-yan Yeung
By pre-training on SODA10M, a large-scale autonomous driving dataset, MultiSiam exceeds the ImageNet pre-trained MoCo-v2, demonstrating the potential of domain-specific pre-training.
1 code implementation • 7 Aug 2021 • Zifan Shi, Na Fan, Dit-yan Yeung, Qifeng Chen
Thus, we propose a learning-based model for waterdrop removal with stereo images.
1 code implementation • ACL 2021 • Nedjma Ousidhoum, Xinran Zhao, Tianqing Fang, Yangqiu Song, Dit-yan Yeung
Large pre-trained language models (PTLMs) have been shown to carry biases towards different social groups which leads to the reproduction of stereotypical and toxic content by major NLP systems.
1 code implementation • ICLR 2021 • Tsz-Him Cheung, Dit-yan Yeung
Data augmentation is an efficient way to expand a training dataset by creating additional artificial data.
1 code implementation • IJCNLP 2019 • Nedjma Ousidhoum, Zizheng Lin, Hongming Zhang, Yangqiu Song, Dit-yan Yeung
Current research on hate speech analysis is typically oriented towards monolingual and single classification tasks.
no code implementations • 3 Aug 2019 • Jinseok Lee, Dit-yan Yeung
This involves abstract concepts of students' states of knowledge and the interactions between those states and skills.
no code implementations • 9 Jun 2019 • Ting Sun, Yuxiang Sun, Ming Liu, Dit-yan Yeung
Moving objects can greatly jeopardize the performance of a visual simultaneous localization and mapping (vSLAM) system which relies on the static-world assumption.
Simultaneous Localization and Mapping
Weakly supervised Semantic Segmentation
+1
no code implementations • ICLR 2019 • Yuan Yuan, Yueming Lyu, Xi Shen, Ivor W. Tsang, Dit-yan Yeung
The MAAN employs a novel marginalized average aggregation (MAA) module and learns a set of latent discriminative probabilities in an end-to-end fashion.
Ranked #9 on
Weakly Supervised Action Localization
on ActivityNet-1.3
(mAP@0.5 metric)
Weakly Supervised Action Localization
Weakly-supervised Learning
+2
no code implementations • 3 Apr 2019 • Ting Sun, Lei Tai, Zhihan Gao, Ming Liu, Dit-yan Yeung
This paper proposes a novel weakly-supervised semantic segmentation method using image-level label only.
Weakly supervised Semantic Segmentation
Weakly-Supervised Semantic Segmentation
no code implementations • 12 Dec 2018 • Mucong Ding, Kai Yang, Dit-yan Yeung, Ting-Chuen Pong
A major challenge that has to be addressed when building such models is to design handcrafted features that are effective for the prediction task at hand.
no code implementations • 5 Nov 2018 • Ting Sun, Dezhen Song, Dit-yan Yeung, Ming Liu
In the back end, we optimize the map imposing the constraint that the line segments of the same cluster should be the same.
no code implementations • 23 Oct 2018 • Ting Sun, Ming Liu, Haoyang Ye, Dit-yan Yeung
This paper proposes a novel point-cloud-based place recognition system that adopts a deep learning approach for feature extraction.
no code implementations • 21 Aug 2018 • Xingjian Shi, Dit-yan Yeung
Forecasting the multi-step future of these spatiotemporal systems based on the past observations, or, Spatiotemporal Sequence Forecasting (STSF), is a significant and challenging problem.
1 code implementation • 6 Jun 2018 • Chun-kit Yeung, Zizheng Lin, Kai Yang, Dit-yan Yeung
The 2017 ASSISTments Data Mining competition aims to use data from a longitudinal study for predicting a brand-new outcome of students which had never been studied before by the educational data mining research community.
2 code implementations • 6 Jun 2018 • Chun-kit Yeung, Dit-yan Yeung
In recent years, a recurrent neural network model called deep knowledge tracing (DKT) has been proposed to handle the knowledge tracing task and literature has shown that DKT generally outperforms traditional methods.
1 code implementation • 20 Mar 2018 • Jiani Zhang, Xingjian Shi, Junyuan Xie, Hao Ma, Irwin King, Dit-yan Yeung
We propose a new network architecture, Gated Attention Networks (GaAN), for learning on graphs.
Ranked #1 on
Node Property Prediction
on ogbn-proteins
no code implementations • 1 Jan 2018 • Hong Chang, Dit-yan Yeung
In this paper, based on M-estimation from robust statistics, we develop a robust path-based spectral clustering method by defining a robust path-based similarity measure for spectral clustering under both unsupervised and semi-supervised settings.
no code implementations • 24 Sep 2017 • Siyi Li, Tianbo Liu, Chi Zhang, Dit-yan Yeung, Shaojie Shen
While deep reinforcement learning (RL) methods have achieved unprecedented successes in a range of challenging problems, their applicability has been mainly limited to simulation or game domains due to the high sample complexity of the trial-and-error learning process.
no code implementations • ICCV 2017 • Yuan Yuan, Xiaodan Liang, Xiaolong Wang, Dit-yan Yeung, Abhinav Gupta
A common issue, however, is that objects of interest that are not involved in human actions are often absent in global action descriptions known as "missing label".
Ranked #3 on
Weakly Supervised Object Detection
on Charades
no code implementations • ICCV 2017 • Feng Xiong, Xingjian Shi, Dit-yan Yeung
To exploit the otherwise very useful temporal information in video sequences, we propose a variant of a recent deep learning model called convolutional LSTM (ConvLSTM) for crowd counting.
no code implementations • 22 Jun 2017 • Ting Sun, Lin Sun, Dit-yan Yeung
Fine-grained categorization can benefit from part-based features which reveal subtle visual differences between object categories.
4 code implementations • NeurIPS 2017 • Xingjian Shi, Zhihan Gao, Leonard Lausen, Hao Wang, Dit-yan Yeung, Wai-kin Wong, Wang-chun Woo
To address these problems, we propose both a new model and a benchmark for precipitation nowcasting.
Ranked #1 on
Video Prediction
on KTH
(Cond metric)
no code implementations • 21 Mar 2017 • Hao Wang, Xiaodan Liang, Hao Zhang, Dit-yan Yeung, Eric P. Xing
We cast this problem as manipulating an input image according to a parametric model whose key parameters can be conditionally generated from any guiding signal (even unseen ones).
1 code implementation • 24 Nov 2016 • Jiani Zhang, Xingjian Shi, Irwin King, Dit-yan Yeung
Knowledge Tracing (KT) is a task of tracing evolving knowledge state of students with respect to one or more concepts as they engage in a sequence of learning activities.
1 code implementation • NeurIPS 2016 • Hao Wang, Xingjian Shi, Dit-yan Yeung
Another shortcoming of NN is the lack of flexibility to customize different distributions for the weights and neurons according to the data, as is often done in probabilistic graphical models.
no code implementations • NeurIPS 2016 • Hao Wang, Xingjian Shi, Dit-yan Yeung
To address this problem, we develop a collaborative recurrent autoencoder (CRAE) which is a denoising recurrent autoencoder (DRAE) that models the generation of content sequences in the collaborative filtering (CF) setting.
no code implementations • 17 Sep 2016 • Zhourong Chen, Nevin L. Zhang, Dit-yan Yeung, Peixian Chen
We are interested in exploring the possibility and benefits of structure learning for deep models.
no code implementations • 24 Aug 2016 • Hao Wang, Dit-yan Yeung
While perception tasks such as visual object recognition and text understanding play an important role in human intelligence, the subsequent tasks that involve inference, reasoning and planning require an even higher level of intelligence.
1 code implementation • 6 Apr 2016 • Hao Wang, Dit-yan Yeung
The past decade has seen major advances in many perception tasks such as visual object recognition and speech recognition using deep learning models.
no code implementations • ICCV 2015 • Lin Sun, Kui Jia, Dit-yan Yeung, Bertram E. Shi
Human actions in video sequences are three-dimensional (3D) spatio-temporal signals characterizing both the visual appearance and motion dynamics of the involved humans and objects.
16 code implementations • NeurIPS 2015 • Xingjian Shi, Zhourong Chen, Hao Wang, Dit-yan Yeung, Wai-kin Wong, Wang-chun Woo
The goal of precipitation nowcasting is to predict the future rainfall intensity in a local region over a relatively short period of time.
Ranked #1 on
Video Prediction
on KTH
(Cond metric)
no code implementations • CVPR 2015 • Chuang Gan, Naiyan Wang, Yi Yang, Dit-yan Yeung, Alex G. Hauptmann
Taking key frames of videos as input, we first detect the event of interest at the video level by aggregating the CNN features of the key frames.
no code implementations • CVPR 2015 • Peixian Chen, Naiyan Wang, Nevin L. Zhang, Dit-yan Yeung
Low-rank matrix factorization has long been recognized as a fundamental problem in many computer vision applications.
no code implementations • ICCV 2015 • Naiyan Wang, Jianping Shi, Dit-yan Yeung, Jiaya Jia
Surprisingly, our findings are discrepant with some common beliefs in the visual tracking research community.
no code implementations • 19 Jan 2015 • Naiyan Wang, Siyi Li, Abhinav Gupta, Dit-yan Yeung
To fit the characteristics of object tracking, we first pre-train the CNN to recognize what is an object, and then propose to generate a probability map instead of producing a simple class label.
1 code implementation • 10 Sep 2014 • Hao Wang, Naiyan Wang, Dit-yan Yeung
(CF-based) input and propose in this paper a hierarchical Bayesian model called collaborative deep learning (CDL), which jointly performs deep representation learning for the content information and collaborative filtering for the ratings (feedback) matrix.
no code implementations • NeurIPS 2013 • Naiyan Wang, Dit-yan Yeung
In this paper, we study the challenging problem of tracking the trajectory of a moving object in a video with possibly very complex background.
no code implementations • NeurIPS 2012 • Yi Zhen, Dit-yan Yeung
Hashing-based methods provide a very promising approach to large-scale similarity search.
no code implementations • 15 Mar 2012 • Yu Zhang, Dit-yan Yeung
In this paper, we propose a regularization formulation for learning the relationships between tasks in multi-task learning.
no code implementations • NeurIPS 2010 • Yu Zhang, Dit-yan Yeung, Qian Xu
In this paper, we unify the $l_{1, 2}$ and $l_{1,\infty}$ norms by considering a family of $l_{1, q}$ norms for $1 < q\le\infty$ and study the problem of determining the most appropriate sparsity enforcing norm to use in the context of multi-task feature selection.
no code implementations • NeurIPS 2010 • Yu Zhang, Dit-yan Yeung
In this paper, we first analyze the scatter measures used in the conventional linear discriminant analysis~(LDA) model and note that the formulation is based on the average-case view.
no code implementations • NeurIPS 2009 • Wu-Jun Li, Dit-yan Yeung, Zhihua Zhang
assumption is unreasonable for relational data.
no code implementations • NeurIPS 2008 • Zhihua Zhang, Michael. I. Jordan, Dit-yan Yeung
The duality between regularization and prior leads to interpreting regularization methods in terms of maximum a posteriori estimation and has motivated Bayesian interpretations of kernel methods.