1 code implementation • EMNLP 2020 • Nedjma Ousidhoum, Yangqiu Song, Dit-yan Yeung
Work on bias in hate speech typically aims to improve classification performance while relatively overlooking the quality of the data.
no code implementations • Findings (ACL) 2022 • Xianghong Fang, Jian Li, Lifeng Shang, Xin Jiang, Qun Liu, Dit-yan Yeung
While variational autoencoders (VAEs) have been widely applied in text generation tasks, they are troubled by two challenges: insufficient representation capacity and poor controllability.
2 code implementations • 19 Nov 2024 • Zhiwen Shao, Yichen Yuan, Lizhuang Ma, Dit-yan Yeung, Xiaojia Zhu
Automatic Cobb angle measurement from X-ray images is crucial for scoliosis screening and diagnosis.
1 code implementation • 30 Oct 2024 • Junjie Wu, Tsz Ting Chung, Kai Chen, Dit-yan Yeung
The core idea of our framework is to conduct hallucination evaluation on (object, relation, object) triplets extracted from LVLMs' responses, and thus, could be easily generalized to different vision-language tasks.
1 code implementation • 30 Oct 2024 • Chiu-Wai Yan, Shi Quan Foo, Van Hoan Trinh, Dit-yan Yeung, Ka-Hing Wong, Wai-kin Wong
Our method is generic, parameter-free and efficient.
no code implementations • 15 Oct 2024 • Tsz Ting Chung, Leyang Cui, Lemao Liu, Xinting Huang, Shuming Shi, Dit-yan Yeung
Large Language Models (LLMs) have demonstrated impressive capabilities in a wide range of natural language processing tasks when leveraging in-context learning.
no code implementations • 7 Oct 2024 • Jiaming Zhang, Junhong Ye, Xingjun Ma, Yige Li, Yunfan Yang, Jitao Sang, Dit-yan Yeung
Due to their multimodal capabilities, Vision-Language Models (VLMs) have found numerous impactful applications in real-world scenarios.
no code implementations • 26 Sep 2024 • Kai Chen, Yunhao Gou, Runhui Huang, Zhili Liu, Daxin Tan, Jing Xu, Chunwei Wang, Yi Zhu, Yihan Zeng, Kuo Yang, Dingdong Wang, Kun Xiang, Haoyuan Li, Haoli Bai, Jianhua Han, Xiaohui Li, Weike Jin, Nian Xie, Yu Zhang, James T. Kwok, Hengshuang Zhao, Xiaodan Liang, Dit-yan Yeung, Xiao Chen, Zhenguo Li, Wei zhang, Qun Liu, Jun Yao, Lanqing Hong, Lu Hou, Hang Xu
GPT-4o, an omni-modal model that enables vocal conversations with diverse emotions and tones, marks a milestone for omni-modal foundation models.
1 code implementation • 22 Jul 2024 • Zhili Chen, Shuangjie Xu, Maosheng Ye, Zian Qian, Xiaoyi Zou, Dit-yan Yeung, Qifeng Chen
The Bird's-Eye-View (BEV) representation is a critical factor that directly impacts the 3D object detection performance, but the traditional BEV grid representation induces quadratic computational cost as the spatial resolution grows.
no code implementations • 17 Jul 2024 • Chenhan Jiang, Yihan Zeng, Tianyang Hu, Songcun Xu, Wei zhang, Hang Xu, Dit-yan Yeung
However, this paradigm distills view-agnostic 2D image distributions into the rendering distribution of 3D representation for each view independently, overlooking the coherence across views and yielding 3D inconsistency in generations.
1 code implementation • 7 Jul 2024 • Junjie Wu, Lemao Liu, Wei Bi, Dit-yan Yeung
To this end, this paper presents a new setting for NMT targeted adversarial attacks that could lead to reliable attacking results.
no code implementations • 16 Apr 2024 • Kai Chen, Yanze Li, Wenhua Zhang, Yanxin Liu, Pengxiang Li, Ruiyuan Gao, Lanqing Hong, Meng Tian, Xinhai Zhao, Zhenguo Li, Dit-yan Yeung, Huchuan Lu, Xu Jia
Moreover, with our CODA-LM, we build CODA-VLM, a new driving LVLM surpassing all open-sourced counterparts on CODA-LM.
no code implementations • CVPR 2024 • Yibo Wang, Ruiyuan Gao, Kai Chen, Kaiqiang Zhou, Yingjie Cai, Lanqing Hong, Zhenguo Li, Lihui Jiang, Dit-yan Yeung, Qiang Xu, Kai Zhang
Furthermore, image syntheses from DetDiffusion can effectively augment training data, significantly enhancing downstream detection performance.
no code implementations • 19 Mar 2024 • Tsz-Him Cheung, Dit-yan Yeung
Sample-mixing is a popular data augmentation approach that creates additional data by combining existing samples.
no code implementations • 14 Mar 2024 • Yunhao Gou, Kai Chen, Zhili Liu, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-yan Yeung, James T. Kwok, Yu Zhang
To construct robust MLLMs, we propose ECSO (Eyes Closed, Safety On), a novel training-free protecting approach that exploits the inherent safety awareness of MLLMs, and generates safer responses via adaptively transforming unsafe images into texts to activate the intrinsic safety mechanism of pre-aligned LLMs in MLLMs.
no code implementations • 19 Dec 2023 • Yunhao Gou, Zhili Liu, Kai Chen, Lanqing Hong, Hang Xu, Aoxue Li, Dit-yan Yeung, James T. Kwok, Yu Zhang
Instruction tuning of Large Vision-language Models (LVLMs) has revolutionized the development of versatile models with zero-shot generalization across a wide range of downstream vision-language tasks.
1 code implementation • 1 Dec 2023 • Pengxiang Li, Kai Chen, Zhili Liu, Ruiyuan Gao, Lanqing Hong, Guo Zhou, Hua Yao, Dit-yan Yeung, Huchuan Lu, Xu Jia
Despite remarkable achievements in video synthesis, achieving granular control over complex dynamics, such as nuanced movement among multiple interacting objects, still presents a significant hurdle for dynamic world modeling, compounded by the necessity to manage appearance and disappearance, drastic scale changes, and ensure consistency for instances across frames.
1 code implementation • CVPR 2024 • Rameen Abdal, Wang Yifan, Zifan Shi, Yinghao Xu, Ryan Po, Zhengfei Kuang, Qifeng Chen, Dit-yan Yeung, Gordon Wetzstein
Instead of rasterizing the shells directly, we sample 3D Gaussians on the shells whose attributes are encoded in the texture features.
1 code implementation • 20 Oct 2023 • Junjie Wu, Lemao Liu, Dit-yan Yeung
Behavioral testing offers a crucial means of diagnosing linguistic errors and assessing capabilities of NLP models.
no code implementations • 16 Oct 2023 • Kai Chen, Chunwei Wang, Kuo Yang, Jianhua Han, Lanqing Hong, Fei Mi, Hang Xu, Zhengying Liu, Wenyong Huang, Zhenguo Li, Dit-yan Yeung, Lifeng Shang, Xin Jiang, Qun Liu
The rapid development of large language models (LLMs) has not only provided numerous opportunities but also presented significant challenges.
no code implementations • 9 Oct 2023 • Zhili Liu, Kai Chen, Yifan Zhang, Jianhua Han, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-yan Yeung, James Kwok
Subsequently, the model is optimized to identify and disentangle this information, which is then adopted as negative prompts during generation.
no code implementations • 4 Oct 2023 • Ruiyuan Gao, Kai Chen, Enze Xie, Lanqing Hong, Zhenguo Li, Dit-yan Yeung, Qiang Xu
Recent advancements in diffusion models have significantly enhanced the data synthesis with 2D control.
no code implementations • ICCV 2023 • Xuechao Chen, Shuangjie Xu, Xiaoyi Zou, Tongyi Cao, Dit-yan Yeung, Lu Fang
To take full advantage of the historical frames high-efficiently, we shunt the historical points into two groups with reference to the current points.
no code implementations • 4 Jul 2023 • Junjie Wu, Dit-yan Yeung
Specifically, SCAT modifies random augmentations of the data in a fully labelfree manner to generate adversarial examples.
no code implementations • 7 Jun 2023 • Kai Chen, Enze Xie, Zhe Chen, Yibo Wang, Lanqing Hong, Zhenguo Li, Dit-yan Yeung
Diffusion models have attracted significant attention due to the remarkable ability to create content and generate data for tasks like image classification.
1 code implementation • CVPR 2023 • Kai Chen, Zhili Liu, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-yan Yeung
Specifically, our MixedAE outperforms MAE by +0. 3% accuracy, +1. 7 mIoU and +0. 9 AP on ImageNet-1K, ADE20K and COCO respectively with a standard ViT-Base.
no code implementations • 22 Mar 2023 • Yihan Zeng, Chenhan Jiang, Jiageng Mao, Jianhua Han, Chaoqiang Ye, Qingqiu Huang, Dit-yan Yeung, Zhen Yang, Xiaodan Liang, Hang Xu
Contrastive Language-Image Pre-training, benefiting from large-scale unlabeled text-image pairs, has demonstrated great performance in open-world vision understanding tasks.
Ranked #3 on Zero-shot 3D Point Cloud Classification on ScanNetV2
no code implementations • 2 Feb 2023 • Chung Yan Fong, Dit-yan Yeung
As such, it has a two-fold advantage: 1) more actual observations can be used for training, and 2) the model can be validated on data which has distribution closer to the expected situation.
no code implementations • CVPR 2023 • Zifan Shi, Yujun Shen, Yinghao Xu, Sida Peng, Yiyi Liao, Sheng Guo, Qifeng Chen, Dit-yan Yeung
Existing methods for 3D-aware image synthesis largely depend on the 3D pose distribution pre-estimated on the training set.
no code implementations • CVPR 2023 • Yihan Zeng, Chenhan Jiang, Jiageng Mao, Jianhua Han, Chaoqiang Ye, Qingqiu Huang, Dit-yan Yeung, Zhen Yang, Xiaodan Liang, Hang Xu
Contrastive Language-Image Pre-training, benefiting from large-scale unlabeled text-image pairs, has demonstrated great performance in open-world vision understanding tasks.
1 code implementation • 28 Nov 2022 • Yusen Sun, Liangyou Li, Qun Liu, Dit-yan Yeung
Although lyrics generation has achieved significant progress in recent years, it has limited practical applications because the generated lyrics cannot be performed without composing compatible melodies.
no code implementations • 30 Sep 2022 • Zifan Shi, Yinghao Xu, Yujun Shen, Deli Zhao, Qifeng Chen, Dit-yan Yeung
We argue that, considering the two-player game in the formulation of GANs, only making the generator 3D-aware is not enough.
no code implementations • 29 Sep 2022 • Meng Qin, Chaorui Zhang, Bo Bai, Gong Zhang, Dit-yan Yeung
The trained model is then directly generalized to new unseen graphs for online CD without additional optimization, where a better trade-off between quality and efficiency can be achieved.
2 code implementations • 12 Jul 2022 • Zhihan Gao, Xingjian Shi, Hao Wang, Yi Zhu, Yuyang Wang, Mu Li, Dit-yan Yeung
With the explosive growth of the spatiotemporal Earth observation data in the past decade, data-driven models that apply Deep Learning (DL) are demonstrating impressive potential for various Earth system forecasting tasks.
Ranked #1 on Earth Surface Forecasting on EarthNet2021 OOD Track
1 code implementation • 2 May 2022 • Jeongseok Hyun, Myunggu Kang, Dongyoon Wee, Dit-yan Yeung
The strong edge features allow SGT to track targets with tracking candidates selected by top-K scored detections with large K. As a result, even low-scored detections can be tracked, and the missed detections are also recovered.
Ranked #2 on Multi-Object Tracking on HiEve
no code implementations • 15 Mar 2022 • Kaican Li, Kai Chen, Haoyu Wang, Lanqing Hong, Chaoqiang Ye, Jianhua Han, Yukuai Chen, Wei zhang, Chunjing Xu, Dit-yan Yeung, Xiaodan Liang, Zhenguo Li, Hang Xu
One main reason that impedes the development of truly reliably self-driving systems is the lack of public datasets for evaluating the performance of object detectors on corner cases.
no code implementations • 17 Feb 2022 • Zifan Shi, Yujun Shen, Jiapeng Zhu, Dit-yan Yeung, Qifeng Chen
In this way, the discriminator can take the spatial arrangement into account and advise the generator to learn an appropriate depth condition.
1 code implementation • ICLR 2022 • Tsz-Him Cheung, Dit-yan Yeung
However, the augmentation policies found are not adaptive to the dataset used, hindering the effectiveness of these AutoDA methods.
no code implementations • 29 Sep 2021 • Meng Qin, Chaorui Zhang, Bo Bai, Gong Zhang, Dit-yan Yeung
IGP is also a generic framework that can capture the permutation invariant partitioning ground-truth of historical snapshots in the offline training and tackle the online GP on graphs with non-fixed number of nodes and clusters.
no code implementations • 29 Sep 2021 • Chiu Wai Yan, Dit-yan Yeung
We start by looking into the effect of common image augmentation techniques and exploring novel augmentation with the aid of adversarial perturbations.
1 code implementation • ICCV 2021 • Kai Chen, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-yan Yeung
By pre-training on SODA10M, a large-scale autonomous driving dataset, MultiSiam exceeds the ImageNet pre-trained MoCo-v2, demonstrating the potential of domain-specific pre-training.
1 code implementation • 7 Aug 2021 • Zifan Shi, Na Fan, Dit-yan Yeung, Qifeng Chen
Thus, we propose a learning-based model for waterdrop removal with stereo images.
1 code implementation • ACL 2021 • Nedjma Ousidhoum, Xinran Zhao, Tianqing Fang, Yangqiu Song, Dit-yan Yeung
Large pre-trained language models (PTLMs) have been shown to carry biases towards different social groups which leads to the reproduction of stereotypical and toxic content by major NLP systems.
1 code implementation • ICLR 2021 • Tsz-Him Cheung, Dit-yan Yeung
Data augmentation is an efficient way to expand a training dataset by creating additional artificial data.
1 code implementation • IJCNLP 2019 • Nedjma Ousidhoum, Zizheng Lin, Hongming Zhang, Yangqiu Song, Dit-yan Yeung
Current research on hate speech analysis is typically oriented towards monolingual and single classification tasks.
no code implementations • 3 Aug 2019 • Jinseok Lee, Dit-yan Yeung
This involves abstract concepts of students' states of knowledge and the interactions between those states and skills.
no code implementations • 9 Jun 2019 • Ting Sun, Yuxiang Sun, Ming Liu, Dit-yan Yeung
Moving objects can greatly jeopardize the performance of a visual simultaneous localization and mapping (vSLAM) system which relies on the static-world assumption.
no code implementations • ICLR 2019 • Yuan Yuan, Yueming Lyu, Xi Shen, Ivor W. Tsang, Dit-yan Yeung
The MAAN employs a novel marginalized average aggregation (MAA) module and learns a set of latent discriminative probabilities in an end-to-end fashion.
Ranked #14 on Weakly Supervised Action Localization on ActivityNet-1.3 (mAP@0.5 metric)
Weakly Supervised Action Localization Weakly-supervised Learning +2
no code implementations • 3 Apr 2019 • Ting Sun, Lei Tai, Zhihan Gao, Ming Liu, Dit-yan Yeung
This paper proposes a novel weakly-supervised semantic segmentation method using image-level label only.
1 code implementation • International Conference on Learning Analytics & Knowledge 2019 • Jinseok Lee, Dit-yan Yeung
This involves abstract concepts of students' states of knowledge and the interactions between those states and skills.
no code implementations • 12 Dec 2018 • Mucong Ding, Kai Yang, Dit-yan Yeung, Ting-Chuen Pong
A major challenge that has to be addressed when building such models is to design handcrafted features that are effective for the prediction task at hand.
no code implementations • 5 Nov 2018 • Ting Sun, Dezhen Song, Dit-yan Yeung, Ming Liu
In the back end, we optimize the map imposing the constraint that the line segments of the same cluster should be the same.
no code implementations • 23 Oct 2018 • Ting Sun, Ming Liu, Haoyang Ye, Dit-yan Yeung
This paper proposes a novel point-cloud-based place recognition system that adopts a deep learning approach for feature extraction.
no code implementations • 21 Aug 2018 • Xingjian Shi, Dit-yan Yeung
Forecasting the multi-step future of these spatiotemporal systems based on the past observations, or, Spatiotemporal Sequence Forecasting (STSF), is a significant and challenging problem.
2 code implementations • 6 Jun 2018 • Chun-kit Yeung, Dit-yan Yeung
In recent years, a recurrent neural network model called deep knowledge tracing (DKT) has been proposed to handle the knowledge tracing task and literature has shown that DKT generally outperforms traditional methods.
1 code implementation • 6 Jun 2018 • Chun-kit Yeung, Zizheng Lin, Kai Yang, Dit-yan Yeung
The 2017 ASSISTments Data Mining competition aims to use data from a longitudinal study for predicting a brand-new outcome of students which had never been studied before by the educational data mining research community.
1 code implementation • 20 Mar 2018 • Jiani Zhang, Xingjian Shi, Junyuan Xie, Hao Ma, Irwin King, Dit-yan Yeung
We propose a new network architecture, Gated Attention Networks (GaAN), for learning on graphs.
Ranked #1 on Node Property Prediction on ogbn-proteins
no code implementations • 1 Jan 2018 • Hong Chang, Dit-yan Yeung
In this paper, based on M-estimation from robust statistics, we develop a robust path-based spectral clustering method by defining a robust path-based similarity measure for spectral clustering under both unsupervised and semi-supervised settings.
no code implementations • 24 Sep 2017 • Siyi Li, Tianbo Liu, Chi Zhang, Dit-yan Yeung, Shaojie Shen
While deep reinforcement learning (RL) methods have achieved unprecedented successes in a range of challenging problems, their applicability has been mainly limited to simulation or game domains due to the high sample complexity of the trial-and-error learning process.
no code implementations • ICCV 2017 • Yuan Yuan, Xiaodan Liang, Xiaolong Wang, Dit-yan Yeung, Abhinav Gupta
A common issue, however, is that objects of interest that are not involved in human actions are often absent in global action descriptions known as "missing label".
Ranked #3 on Weakly Supervised Object Detection on Charades
no code implementations • ICCV 2017 • Feng Xiong, Xingjian Shi, Dit-yan Yeung
To exploit the otherwise very useful temporal information in video sequences, we propose a variant of a recent deep learning model called convolutional LSTM (ConvLSTM) for crowd counting.
no code implementations • 22 Jun 2017 • Ting Sun, Lin Sun, Dit-yan Yeung
Fine-grained categorization can benefit from part-based features which reveal subtle visual differences between object categories.
4 code implementations • NeurIPS 2017 • Xingjian Shi, Zhihan Gao, Leonard Lausen, Hao Wang, Dit-yan Yeung, Wai-kin Wong, Wang-chun Woo
To address these problems, we propose both a new model and a benchmark for precipitation nowcasting.
Ranked #1 on Video Prediction on KTH (Cond metric)
no code implementations • 21 Mar 2017 • Hao Wang, Xiaodan Liang, Hao Zhang, Dit-yan Yeung, Eric P. Xing
We cast this problem as manipulating an input image according to a parametric model whose key parameters can be conditionally generated from any guiding signal (even unseen ones).
1 code implementation • 24 Nov 2016 • Jiani Zhang, Xingjian Shi, Irwin King, Dit-yan Yeung
Knowledge Tracing (KT) is a task of tracing evolving knowledge state of students with respect to one or more concepts as they engage in a sequence of learning activities.
1 code implementation • NeurIPS 2016 • Hao Wang, Xingjian Shi, Dit-yan Yeung
Another shortcoming of NN is the lack of flexibility to customize different distributions for the weights and neurons according to the data, as is often done in probabilistic graphical models.
no code implementations • NeurIPS 2016 • Hao Wang, Xingjian Shi, Dit-yan Yeung
To address this problem, we develop a collaborative recurrent autoencoder (CRAE) which is a denoising recurrent autoencoder (DRAE) that models the generation of content sequences in the collaborative filtering (CF) setting.
no code implementations • 17 Sep 2016 • Zhourong Chen, Nevin L. Zhang, Dit-yan Yeung, Peixian Chen
We are interested in exploring the possibility and benefits of structure learning for deep models.
no code implementations • 24 Aug 2016 • Hao Wang, Dit-yan Yeung
While perception tasks such as visual object recognition and text understanding play an important role in human intelligence, the subsequent tasks that involve inference, reasoning and planning require an even higher level of intelligence.
1 code implementation • 6 Apr 2016 • Hao Wang, Dit-yan Yeung
The past decade has seen major advances in many perception tasks such as visual object recognition and speech recognition using deep learning models.
no code implementations • ICCV 2015 • Lin Sun, Kui Jia, Dit-yan Yeung, Bertram E. Shi
Human actions in video sequences are three-dimensional (3D) spatio-temporal signals characterizing both the visual appearance and motion dynamics of the involved humans and objects.
18 code implementations • NeurIPS 2015 • Xingjian Shi, Zhourong Chen, Hao Wang, Dit-yan Yeung, Wai-kin Wong, Wang-chun Woo
The goal of precipitation nowcasting is to predict the future rainfall intensity in a local region over a relatively short period of time.
Ranked #1 on Video Prediction on KTH (Cond metric)
no code implementations • CVPR 2015 • Chuang Gan, Naiyan Wang, Yi Yang, Dit-yan Yeung, Alex G. Hauptmann
Taking key frames of videos as input, we first detect the event of interest at the video level by aggregating the CNN features of the key frames.
no code implementations • CVPR 2015 • Peixian Chen, Naiyan Wang, Nevin L. Zhang, Dit-yan Yeung
Low-rank matrix factorization has long been recognized as a fundamental problem in many computer vision applications.
no code implementations • ICCV 2015 • Naiyan Wang, Jianping Shi, Dit-yan Yeung, Jiaya Jia
Surprisingly, our findings are discrepant with some common beliefs in the visual tracking research community.
no code implementations • 19 Jan 2015 • Naiyan Wang, Siyi Li, Abhinav Gupta, Dit-yan Yeung
To fit the characteristics of object tracking, we first pre-train the CNN to recognize what is an object, and then propose to generate a probability map instead of producing a simple class label.
1 code implementation • 10 Sep 2014 • Hao Wang, Naiyan Wang, Dit-yan Yeung
(CF-based) input and propose in this paper a hierarchical Bayesian model called collaborative deep learning (CDL), which jointly performs deep representation learning for the content information and collaborative filtering for the ratings (feedback) matrix.
no code implementations • NeurIPS 2013 • Naiyan Wang, Dit-yan Yeung
In this paper, we study the challenging problem of tracking the trajectory of a moving object in a video with possibly very complex background.
no code implementations • NeurIPS 2012 • Yi Zhen, Dit-yan Yeung
Hashing-based methods provide a very promising approach to large-scale similarity search.
no code implementations • 15 Mar 2012 • Yu Zhang, Dit-yan Yeung
In this paper, we propose a regularization formulation for learning the relationships between tasks in multi-task learning.
no code implementations • NeurIPS 2010 • Yu Zhang, Dit-yan Yeung, Qian Xu
In this paper, we unify the $l_{1, 2}$ and $l_{1,\infty}$ norms by considering a family of $l_{1, q}$ norms for $1 < q\le\infty$ and study the problem of determining the most appropriate sparsity enforcing norm to use in the context of multi-task feature selection.
no code implementations • NeurIPS 2010 • Yu Zhang, Dit-yan Yeung
In this paper, we first analyze the scatter measures used in the conventional linear discriminant analysis~(LDA) model and note that the formulation is based on the average-case view.
no code implementations • NeurIPS 2009 • Wu-Jun Li, Dit-yan Yeung, Zhihua Zhang
assumption is unreasonable for relational data.
no code implementations • NeurIPS 2008 • Zhihua Zhang, Michael. I. Jordan, Dit-yan Yeung
The duality between regularization and prior leads to interpreting regularization methods in terms of maximum a posteriori estimation and has motivated Bayesian interpretations of kernel methods.