Search Results for author: Joey Tianyi Zhou

Found 95 papers, 45 papers with code

N-ary Error Correcting Coding Scheme

no code implementations18 Mar 2016 Joey Tianyi Zhou, Ivor W. Tsang, Shen-Shyang Ho, Klaus-Robert Muller

The coding matrix design plays a fundamental role in the prediction performance of the error correcting output codes (ECOC)-based multi-class task.

Classification General Classification

Simple and Efficient Learning using Privileged Information

no code implementations6 Apr 2016 Xinxing Xu, Joey Tianyi Zhou, IvorW. Tsang, Zheng Qin, Rick Siow Mong Goh, Yong liu

The Support Vector Machine using Privileged Information (SVM+) has been proposed to train a classifier to utilize the additional privileged information that is only available in the training phase but not available in the test phase.

Image Categorization

Transfer Hashing with Privileged Information

no code implementations13 May 2016 Joey Tianyi Zhou, Xinxing Xu, Sinno Jialin Pan, Ivor W. Tsang, Zheng Qin, Rick Siow Mong Goh

Specifically, we extend the standard learning to hash method, Iterative Quantization (ITQ), in a transfer learning manner, namely ITQ+.

Quantization Transfer Learning

Improving Multi-label Learning with Missing Labels by Structured Semantic Correlations

no code implementations4 Aug 2016 Hao Yang, Joey Tianyi Zhou, Jianfei Cai

Experimental results demonstrate the effectiveness of the proposed semantic descriptor and the usefulness of incorporating the structured semantic correlations.

Missing Labels Object Recognition

MIML-FCN+: Multi-instance Multi-label Learning via Fully Convolutional Networks with Privileged Information

no code implementations CVPR 2017 Hao Yang, Joey Tianyi Zhou, Jianfei Cai, Yew Soon Ong

As the proposed PI loss is convex and SGD compatible and the framework itself is a fully convolutional network, MIML-FCN+ can be easily integrated with state of-the-art deep learning networks.

Image Captioning Multi-Label Learning +1

Towards Good Practices on Building Effective CNN Baseline Model for Person Re-identification

1 code implementation29 Jul 2018 Fu Xiong, Yang Xiao, Zhiguo Cao, Kaicheng Gong, Zhiwen Fang, Joey Tianyi Zhou

Person re-identification is indeed a challenging visual recognition task due to the critical issues of human pose variation, human body occlusion, camera view variation, etc.

Open-Ended Question Answering Person Re-Identification

XAI Beyond Classification: Interpretable Neural Clustering

no code implementations22 Aug 2018 Xi Peng, Yunnan Li, Ivor W. Tsang, Hongyuan Zhu, Jiancheng Lv, Joey Tianyi Zhou

The second is implementing discrete $k$-means with a differentiable neural network that embraces the advantages of parallel computing, online clustering, and clustering-favorable representation learning.

Classification Clustering +3

Towards Real-time Eyeblink Detection in The Wild:Dataset,Theory and Practices

no code implementations21 Feb 2019 Guilei Hu, Yang Xiao, Zhiguo Cao, Lubin Meng, Zhiwen Fang, Joey Tianyi Zhou, Junsong Yuan

Effective and real-time eyeblink detection is of wide-range applications, such as deception detection, drive fatigue detection, face anti-spoofing, etc.

Attribute Deception Detection +1

Query-efficient Meta Attack to Deep Neural Networks

1 code implementation ICLR 2020 Jiawei Du, Hu Zhang, Joey Tianyi Zhou, Yi Yang, Jiashi Feng

Black-box attack methods aim to infer suitable attack patterns to targeted DNN models by only using output feedback of the models and the corresponding input queries.

Adversarial Attack Meta-Learning

Is BERT Really Robust? A Strong Baseline for Natural Language Attack on Text Classification and Entailment

6 code implementations27 Jul 2019 Di Jin, Zhijing Jin, Joey Tianyi Zhou, Peter Szolovits

Machine learning algorithms are often vulnerable to adversarial examples that have imperceptible alterations from the original counterparts but can fool the state-of-the-art models.

Adversarial Text General Classification +2

Robust Regression via Deep Negative Correlation Learning

no code implementations24 Aug 2019 Le Zhang, Zenglin Shi, Ming-Ming Cheng, Yun Liu, Jia-Wang Bian, Joey Tianyi Zhou, Guoyan Zheng, Zeng Zeng

Nonlinear regression has been extensively employed in many computer vision problems (e. g., crowd counting, age estimation, affective computing).

Age Estimation Crowd Counting +2

A2J: Anchor-to-Joint Regression Network for 3D Articulated Pose Estimation from a Single Depth Image

2 code implementations ICCV 2019 Fu Xiong, Boshen Zhang, Yang Xiao, Zhiguo Cao, Taidong Yu, Joey Tianyi Zhou, Junsong Yuan

For 3D hand and body pose estimation task in depth image, a novel anchor-based approach termed Anchor-to-Joint regression network (A2J) with the end-to-end learning ability is proposed.

3D Pose Estimation Depth Estimation +1

Multi-graph Fusion for Multi-view Spectral Clustering

1 code implementation16 Sep 2019 Zhao Kang, Guoxin Shi, Shudong Huang, Wenyu Chen, Xiaorong Pu, Joey Tianyi Zhou, Zenglin Xu

Most existing methods don't pay attention to the quality of the graphs and perform graph learning and spectral clustering separately.

Clustering Graph Learning

Unsupervised Domain Adaptation on Reading Comprehension

1 code implementation13 Nov 2019 Yu Cao, Meng Fang, Baosheng Yu, Joey Tianyi Zhou

On the other hand, it further reduces domain distribution discrepancy through conditional adversarial learning across domains.

Reading Comprehension Unsupervised Domain Adaptation

CPM-Nets: Cross Partial Multi-View Networks

1 code implementation NeurIPS 2019 Changqing Zhang, Zongbo Han, Yajie Cui, Huazhu Fu, Joey Tianyi Zhou, QinGhua Hu

Despite multi-view learning progressed fast in past decades, it is still challenging due to the difficulty in modeling complex correlation among different views, especially under the context of view missing.

MULTI-VIEW LEARNING

Ordered or Orderless: A Revisit for Video based Person Re-Identification

no code implementations24 Dec 2019 Le Zhang, Zenglin Shi, Joey Tianyi Zhou, Ming-Ming Cheng, Yun Liu, Jia-Wang Bian, Zeng Zeng, Chunhua Shen

Specifically, with a diagnostic analysis, we show that the recurrent structure may not be effective to learn temporal dependencies than what we expected and implicitly yields an orderless representation.

Video-Based Person Re-Identification

A Simple Baseline to Semi-Supervised Domain Adaptation for Machine Translation

1 code implementation22 Jan 2020 Di Jin, Zhijing Jin, Joey Tianyi Zhou, Peter Szolovits

State-of-the-art neural machine translation (NMT) systems are data-hungry and perform poorly on new domains with no supervised data.

Language Modelling Machine Translation +4

Hooks in the Headline: Learning to Generate Headlines with Controlled Styles

1 code implementation ACL 2020 Di Jin, Zhijing Jin, Joey Tianyi Zhou, Lisa Orii, Peter Szolovits

Current summarization systems only produce plain, factual headlines, but do not meet the practical needs of creating memorable titles to increase exposure.

Headline Generation

Heterogeneous Representation Learning: A Review

no code implementations28 Apr 2020 Joey Tianyi Zhou, Xi Peng, Yew-Soon Ong

The real-world data usually exhibits heterogeneous properties such as modalities, views, or resources, which brings some unique challenges wherein the key is Heterogeneous Representation Learning (HRL) termed in this paper.

Multi-Task Learning MULTI-VIEW LEARNING +1

Span-based Localizing Network for Natural Language Video Localization

1 code implementation ACL 2020 Hao Zhang, Aixin Sun, Wei Jing, Joey Tianyi Zhou

Given an untrimmed video and a text query, natural language video localization (NLVL) is to locate a matching span from the video that semantically corresponds to the query.

The Power of Triply Complementary Priors for Image Compressive Sensing

no code implementations16 May 2020 Zhiyuan Zha, Xin Yuan, Joey Tianyi Zhou, Jiantao Zhou, Bihan Wen, Ce Zhu

In this paper, we propose a joint low-rank and deep (LRD) image model, which contains a pair of triply complementary priors, namely \textit{external} and \textit{internal}, \textit{deep} and \textit{shallow}, and \textit{local} and \textit{non-local} priors.

Compressive Sensing Image Restoration

Omni-supervised Facial Expression Recognition via Distilled Data

no code implementations18 May 2020 Ping Liu, Yunchao Wei, Zibo Meng, Weihong Deng, Joey Tianyi Zhou, Yi Yang

However, the performance of the current state-of-the-art facial expression recognition (FER) approaches is directly related to the labeled data for training.

Facial Expression Recognition Facial Expression Recognition (FER)

EDCompress: Energy-Aware Model Compression for Dataflows

no code implementations8 Jun 2020 Zhehui Wang, Tao Luo, Joey Tianyi Zhou, Rick Siow Mong Goh

EDCompress could also find the optimal dataflow type for specific neural networks in terms of energy consumption, which can guide the deployment of CNN models on hardware systems.

Model Compression

You Only Look Yourself: Unsupervised and Untrained Single Image Dehazing Neural Network

1 code implementation30 Jun 2020 Boyun Li, Yuanbiao Gou, Shuhang Gu, Jerry Zitao Liu, Joey Tianyi Zhou, Xi Peng

In this paper, we study two challenging and less-touched problems in single image dehazing, namely, how to make deep learning achieve image dehazing without training on the ground-truth clean image (unsupervised) and a image collection (untrained).

Disentanglement Image Dehazing +1

Multi-source Meta Transfer for Low Resource Multiple-Choice Question Answering

no code implementations ACL 2020 Ming Yan, Hao Zhang, Di Jin, Joey Tianyi Zhou

Multiple-choice question answering (MCQA) is one of the most challenging tasks in machine reading comprehension since it requires more advanced reading comprehension skills such as logical reasoning, summarization, and arithmetic operations.

Logical Reasoning Machine Reading Comprehension +4

ECML: An Ensemble Cascade Metric Learning Mechanism towards Face Verification

1 code implementation11 Jul 2020 Fu Xiong, Yang Xiao, Zhiguo Cao, Yancheng Wang, Joey Tianyi Zhou, Jianxi Wu

Embedding RMML into the proposed ECML mechanism, our metric learning paradigm (EC-RMML) can run in the one-pass learning manner.

Face Verification Fine-Grained Visual Recognition +1

Attentive Graph Neural Networks for Few-Shot Learning

no code implementations14 Jul 2020 Hao Cheng, Joey Tianyi Zhou, Wee Peng Tay, Bihan Wen

Graph Neural Networks (GNN) has demonstrated the superior performance in many challenging applications, including the few-shot learning tasks.

Few-Shot Learning

Point Adversarial Self Mining: A Simple Method for Facial Expression Recognition

no code implementations26 Aug 2020 Ping Liu, Yuewei Lin, Zibo Meng, Lu Lu, Weihong Deng, Joey Tianyi Zhou, Yi Yang

In this paper, we propose a simple yet effective approach, named Point Adversarial Self Mining (PASM), to improve the recognition accuracy in facial expression recognition.

Adversarial Attack Data Augmentation +4

Contrastive Clustering

1 code implementation21 Sep 2020 Yunfan Li, Peng Hu, Zitao Liu, Dezhong Peng, Joey Tianyi Zhou, Xi Peng

In this paper, we propose a one-stage online clustering method called Contrastive Clustering (CC) which explicitly performs the instance- and cluster-level contrastive learning.

Ranked #4 on Image Clustering on STL-10 (using extra training data)

Clustering Contrastive Learning +1

Deep N-ary Error Correcting Output Codes

1 code implementation22 Sep 2020 Hao Zhang, Joey Tianyi Zhou, Tianying Wang, Ivor W. Tsang, Rick Siow Mong Goh

To facilitate the training of N-ary ECOC with deep learning base learners, we further propose three different variants of parameter sharing architectures for deep N-ary ECOC.

Ensemble Learning General Classification +3

Deep Neural Networks with Short Circuits for Improved Gradient Learning

no code implementations23 Sep 2020 Ming Yan, Xueli Xiao, Joey Tianyi Zhou, Yi Pan

Deep neural networks have achieved great success both in computer vision and natural language processing tasks.

Deep Partial Multi-View Learning

no code implementations12 Nov 2020 Changqing Zhang, Yajie Cui, Zongbo Han, Joey Tianyi Zhou, Huazhu Fu, QinGhua Hu

Although multi-view learning has made signifificant progress over the past few decades, it is still challenging due to the diffificulty in modeling complex correlations among different views, especially under the context of view missing.

Imputation MULTI-VIEW LEARNING +1

Partially View-aligned Clustering

no code implementations NeurIPS 2020 Zhenyu Huang, Peng Hu, Joey Tianyi Zhou, Jiancheng Lv, Xi Peng

To solve this practical and challenging problem, we propose a novel multi-view clustering method termed partially view-aligned clustering (PVC).

Clustering

Adaptive Precision Training for Resource Constrained Devices

no code implementations23 Dec 2020 Tian Huang, Tao Luo, Joey Tianyi Zhou

We use model of the same precision for both forward and backward pass in order to reduce memory usage for training.

Multi-View Disentangled Representation

no code implementations1 Jan 2021 Zongbo Han, Changqing Zhang, Huazhu Fu, QinGhua Hu, Joey Tianyi Zhou

Learning effective representations for data with multiple views is crucial in machine learning and pattern recognition.

Disentanglement

Deep Learning for Latent Events Forecasting in Twitter Aided Caching Networks

no code implementations4 Jan 2021 Zhong Yang, Yuanwei Liu, Yue Chen, Joey Tianyi Zhou

A novel Twitter context aided content caching (TAC) framework is proposed for enhancing the caching efficiency by taking advantage of the legibility and massive volume of Twitter data.

Popularity Forecasting

Trusted Multi-View Classification

5 code implementations ICLR 2021 Zongbo Han, Changqing Zhang, Huazhu Fu, Joey Tianyi Zhou

To this end, we propose a novel multi-view classification method, termed trusted multi-view classification, which provides a new paradigm for multi-view learning by dynamically integrating different views at an evidence level.

Classification General Classification +1

RCT: Resource Constrained Training for Edge AI

no code implementations26 Mar 2021 Tian Huang, Tao Luo, Ming Yan, Joey Tianyi Zhou, Rick Goh

For example, quantisation-aware training (QAT) method involves two copies of model parameters, which is usually beyond the capacity of on-chip memory in edge devices.

PointBA: Towards Backdoor Attacks in 3D Point Cloud

no code implementations ICCV 2021 Xinke Li, Zhirui Chen, Yue Zhao, Zekun Tong, Yabang Zhao, Andrew Lim, Joey Tianyi Zhou

We present the backdoor attacks in 3D point cloud with a unified framework that exploits the unique properties of 3D data and networks.

Backdoor Attack Disentanglement

Video Corpus Moment Retrieval with Contrastive Learning

1 code implementation13 May 2021 Hao Zhang, Aixin Sun, Wei Jing, Guoshun Nan, Liangli Zhen, Joey Tianyi Zhou, Rick Siow Mong Goh

We adopt the first approach and introduce two contrastive learning objectives to refine video encoder and text encoder to learn video and text representations separately but with better alignment for VCMR.

Contrastive Learning Moment Retrieval +2

Efficient Spiking Neural Networks with Radix Encoding

no code implementations14 May 2021 Zhehui Wang, Xiaozhe Gu, Rick Goh, Joey Tianyi Zhou, Tao Luo

Traditionally, a spike train needs around one thousand time steps to approach similar accuracy as its ANN counterpart.

Parallel Attention Network with Sequence Matching for Video Grounding

no code implementations Findings (ACL) 2021 Hao Zhang, Aixin Sun, Wei Jing, Liangli Zhen, Joey Tianyi Zhou, Rick Siow Mong Goh

In this work, we propose a Parallel Attention Network with Sequence matching (SeqPAN) to address the challenges in this task: multi-modal representation learning, and target moment boundary prediction.

Representation Learning Video Grounding

Adversarial Semantic Hallucination for Domain Generalized Semantic Segmentation

1 code implementation8 Jun 2021 Gabriel Tjio, Ping Liu, Joey Tianyi Zhou, Rick Siow Mong Goh

In this work, we propose an adversarial semantic hallucination approach (ASH), which combines a class-conditioned hallucination module and a semantic segmentation module.

Domain Generalization Hallucination +2

Automated Deepfake Detection

no code implementations20 Jun 2021 Ping Liu, Yuewei Lin, Yang He, Yunchao Wei, Liangli Zhen, Joey Tianyi Zhou, Rick Siow Mong Goh, Jingen Liu

In this paper, we propose to utilize Automated Machine Learning to adaptively search a neural architecture for deepfake detection.

BIG-bench Machine Learning DeepFake Detection +1

Word2Pix: Word to Pixel Cross Attention Transformer in Visual Grounding

no code implementations31 Jul 2021 Heng Zhao, Joey Tianyi Zhou, Yew-Soon Ong

Current one-stage methods for visual grounding encode the language query as one holistic sentence embedding before fusion with visual feature.

Sentence Sentence Embedding +2

Simultaneously Transmitting and Reflecting (STAR) Intelligent Omni-Surfaces, Their Modeling and Implementation

no code implementations13 Aug 2021 Jiaqi Xu, Yuanwei Liu, Xidong Mu, Joey Tianyi Zhou, Lingyang Song, H. Vincent Poor, Lajos Hanzo

With the rapid development of advanced electromagnetic manipulation technologies, researchers and engineers are starting to study smart surfaces that can achieve enhanced coverages, high reconfigurability, and are easy to deploy.

Efficient Sharpness-aware Minimization for Improved Training of Neural Networks

1 code implementation ICLR 2022 Jiawei Du, Hanshu Yan, Jiashi Feng, Joey Tianyi Zhou, Liangli Zhen, Rick Siow Mong Goh, Vincent Y. F. Tan

Recently, the relation between the sharpness of the loss landscape and the generalization error has been established by Foret et al. (2020), in which the Sharpness Aware Minimizer (SAM) was proposed to mitigate the degradation of the generalization.

Towards Debiasing Temporal Sentence Grounding in Video

no code implementations8 Nov 2021 Hao Zhang, Aixin Sun, Wei Jing, Joey Tianyi Zhou

In this paper, we propose two debiasing strategies, data debiasing and model debiasing, to "force" a TSGV model to capture cross-modal interactions.

Sentence Temporal Sentence Grounding

Trustworthy Multimodal Regression with Mixture of Normal-inverse Gamma Distributions

1 code implementation NeurIPS 2021 Huan Ma, Zongbo Han, Changqing Zhang, Huazhu Fu, Joey Tianyi Zhou, QinGhua Hu

Multimodal regression is a fundamental task, which integrates the information from different sources to improve the performance of follow-up applications.

Multimodal Sentiment Analysis regression

Temporal Sentence Grounding in Videos: A Survey and Future Directions

no code implementations20 Jan 2022 Hao Zhang, Aixin Sun, Wei Jing, Joey Tianyi Zhou

Temporal sentence grounding in videos (TSGV), \aka natural language video localization (NLVL) or video moment retrieval (VMR), aims to retrieve a temporal moment that semantically corresponds to a language query from an untrimmed video.

Moment Retrieval Retrieval +2

Multi-Scale Adaptive Network for Single Image Denoising

1 code implementation8 Mar 2022 Yuanbiao Gou, Peng Hu, Jiancheng Lv, Joey Tianyi Zhou, Xi Peng

AFuB devotes to adaptively sampling and transferring the features from one scale to another scale, which fuses the multi-scale features with varying characteristics from coarse to fine.

Image Denoising

Trusted Multi-View Classification with Dynamic Evidential Fusion

2 code implementations25 Apr 2022 Zongbo Han, Changqing Zhang, Huazhu Fu, Joey Tianyi Zhou

With this in mind, we propose a novel multi-view classification algorithm, termed trusted multi-view classification (TMC), providing a new paradigm for multi-view learning by dynamically integrating different views at an evidence level.

Classification MULTI-VIEW LEARNING

Sharpness-Aware Training for Free

1 code implementation27 May 2022 Jiawei Du, Daquan Zhou, Jiashi Feng, Vincent Y. F. Tan, Joey Tianyi Zhou

Intuitively, SAF achieves this by avoiding sudden drops in the loss in the sharp local minima throughout the trajectory of the updates of the weights.

Simultaneously Transmitting and Reflecting (STAR)-RISs: Are they Applicable to Dual-Sided Incidence?

no code implementations12 Sep 2022 Jiaqi Xu, Xidong Mu, Joey Tianyi Zhou, Yuanwei Liu

A hardware model and a signal model are proposed for dual-sided simultaneously transmitting and reflecting reconfigurable intelligent surfaces (STAR-RISs), where the signal simultaneously incident on both sides of the surface.

Meta Knowledge Condensation for Federated Learning

1 code implementation29 Sep 2022 Ping Liu, Xin Yu, Joey Tianyi Zhou

In this work, we first introduce a meta knowledge representation method that extracts meta knowledge from distributed clients.

Federated Learning

TFormer: 3D Tooth Segmentation in Mesh Scans with Geometry Guided Transformer

no code implementations29 Oct 2022 Huimin Xiong, Kunle Li, Kaiyuan Tan, Yang Feng, Joey Tianyi Zhou, Jin Hao, Zuozhu Liu

Optical Intra-oral Scanners (IOS) are widely used in digital dentistry, providing 3-Dimensional (3D) and high-resolution geometrical information of dental crowns and the gingiva.

Multi-Task Learning Segmentation

Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation

3 code implementations CVPR 2023 Jiawei Du, Yidi Jiang, Vincent Y. F. Tan, Joey Tianyi Zhou, Haizhou Li

To mitigate the adverse impact of this accumulated trajectory error, we propose a novel approach that encourages the optimization algorithm to seek a flat trajectory.

Neural Architecture Search

Deep Negative Correlation Classification

no code implementations14 Dec 2022 Le Zhang, Qibin Hou, Yun Liu, Jia-Wang Bian, Xun Xu, Joey Tianyi Zhou, Ce Zhu

Ensemble learning serves as a straightforward way to improve the performance of almost any machine learning algorithm.

Classification Ensemble Learning

Frequency Guidance Matters in Few-Shot Learning

no code implementations ICCV 2023 Hao Cheng, Siyuan Yang, Joey Tianyi Zhou, Lanqing Guo, Bihan Wen

Few-shot classification aims to learn a discriminative feature representation to recognize unseen classes with few labeled support samples.

Few-Shot Learning Metric Learning

Active Simultaneously Transmitting and Reflecting (STAR)-RISs: Modelling and Analysis

no code implementations9 Feb 2023 Jiaqi Xu, Jiakuo Zuo, Joey Tianyi Zhou, Yuanwei Liu

The amplitude gains of the STAR element are derived for both coupled and independent phase-shift scenarios.

A2J-Transformer: Anchor-to-Joint Transformer Network for 3D Interacting Hand Pose Estimation from a Single RGB Image

1 code implementation CVPR 2023 Changlong Jiang, Yang Xiao, Cunlin Wu, Mingyang Zhang, Jinghong Zheng, Zhiguo Cao, Joey Tianyi Zhou

3D interacting hand pose estimation from a single RGB image is a challenging task, due to serious self-occlusion and inter-occlusion towards hands, confusing similar appearance patterns between 2 hands, ill-posed joint position mapping from 2D to 3D, etc.. To address these, we propose to extend A2J-the state-of-the-art depth-based 3D single hand pose estimation method-to RGB domain under interacting hand condition.

3D Interacting Hand Pose Estimation Hand Pose Estimation +1

Dual Stage Stylization Modulation for Domain Generalized Semantic Segmentation

no code implementations18 Apr 2023 Gabriel Tjio, Ping Liu, Chee-Keong Kwoh, Joey Tianyi Zhou

To tackle this challenge, we introduce a dual-stage Feature Transform (dFT) layer within the Adversarial Semantic Hallucination+ (ASH+) framework.

Domain Generalization Hallucination +1

Calibrating Multimodal Learning

no code implementations2 Jun 2023 Huan Ma. Qingyang Zhang, Changqing Zhang, Bingzhe Wu, Huazhu Fu, Joey Tianyi Zhou, QinGhua Hu

Specifically, we find that the confidence estimated by current models could even increase when some modalities are corrupted.

dugMatting: Decomposed-Uncertainty-Guided Matting

1 code implementation2 Jun 2023 Jiawei Wu, Changqing Zhang, Zuoyong Li, Huazhu Fu, Xi Peng, Joey Tianyi Zhou

Cutting out an object and estimating its opacity mask, known as image matting, is a key task in image and video editing.

Image Matting Video Editing

Provable Dynamic Fusion for Low-Quality Multimodal Data

1 code implementation3 Jun 2023 Qingyang Zhang, Haitao Wu, Changqing Zhang, QinGhua Hu, Huazhu Fu, Joey Tianyi Zhou, Xi Peng

The inherent challenge of multimodal fusion is to precisely capture the cross-modal correlation and flexibly conduct cross-modal interaction.

Proceedings of the 40th International Conference on Machine Learning

1 code implementation journal 2023 Huan Ma, Qingyang Zhang, Changqing Zhang, Bingzhe Wu, Huazhu Fu, Joey Tianyi Zhou, QinGhua Hu

Specifically, we find that the confidence estimated by current models could even increase when some modalities are corrupted.

Risk-optimized Outlier Removal for Robust 3D Point Cloud Classification

1 code implementation20 Jul 2023 Xinke Li, Junchi Lu, Henghui Ding, Changsheng Sun, Joey Tianyi Zhou, Chee Yeow Meng

With the growth of 3D sensing technology, deep learning system for 3D point clouds has become increasingly important, especially in applications like autonomous vehicles where safety is a primary concern.

3D Point Cloud Classification Autonomous Vehicles +4

Noisy-Correspondence Learning for Text-to-Image Person Re-identification

1 code implementation19 Aug 2023 Yang Qin, Yingke Chen, Dezhong Peng, Xi Peng, Joey Tianyi Zhou, Peng Hu

Text-to-image person re-identification (TIReID) is a compelling topic in the cross-modal community, which aims to retrieve the target person based on a textual query.

Ranked #2 on Text based Person Retrieval on ICFG-PEDES (using extra training data)

Person Re-Identification Text based Person Retrieval +1

Ladder-of-Thought: Using Knowledge as Steps to Elevate Stance Detection

no code implementations31 Aug 2023 Kairui Hu, Ming Yan, Joey Tianyi Zhou, Ivor W. Tsang, Wen Haw Chong, Yong Keong Yap

In response to these identified gaps, we introduce the Ladder-of-Thought (LoT) for the stance detection task.

Stance Detection

Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering

no code implementations ICCV 2023 Chi Zhang, Wei Yin, Gang Yu, Zhibin Wang, Tao Chen, Bin Fu, Joey Tianyi Zhou, Chunhua Shen

In this paper, we propose a learning framework that trains models to predict geometry-preserving depth without requiring extra data or annotations.

Monocular Depth Estimation

Towards Distribution-Agnostic Generalized Category Discovery

1 code implementation NeurIPS 2023 Jianhong Bai, Zuozhu Liu, Hualiang Wang, Ruizhe Chen, Lianrui Mu, Xiaomeng Li, Joey Tianyi Zhou, Yang Feng, Jian Wu, Haoji Hu

In this paper, we formally define a more realistic task as distribution-agnostic generalized category discovery (DA-GCD): generating fine-grained predictions for both close- and open-set classes in a long-tailed open-world setting.

Contrastive Learning Transfer Learning

You Only Condense Once: Two Rules for Pruning Condensed Datasets

1 code implementation NeurIPS 2023 Yang He, Lingao Xiao, Joey Tianyi Zhou

However, these scenarios have two significant challenges: 1) the varying computational resources available on the devices require a dataset size different from the pre-defined condensed dataset, and 2) the limited computational resources often preclude the possibility of conducting additional condensation processes.

Dataset Condensation

Cross-modal Active Complementary Learning with Self-refining Correspondence

1 code implementation NeurIPS 2023 Yang Qin, Yuan Sun, Dezhong Peng, Joey Tianyi Zhou, Xi Peng, Peng Hu

Recently, image-text matching has attracted more and more attention from academia and industry, which is fundamental to understanding the latent correspondence across visual and textual modalities.

Image-text matching Text Matching

TSegFormer: 3D Tooth Segmentation in Intraoral Scans with Geometry Guided Transformer

1 code implementation22 Nov 2023 Huimin Xiong, Kunle Li, Kaiyuan Tan, Yang Feng, Joey Tianyi Zhou, Jin Hao, Haochao Ying, Jian Wu, Zuozhu Liu

Optical Intraoral Scanners (IOS) are widely used in digital dentistry to provide detailed 3D information of dental crowns and the gingiva.

Spanning Training Progress: Temporal Dual-Depth Scoring (TDDS) for Enhanced Dataset Pruning

1 code implementation22 Nov 2023 Xin Zhang, Jiawei Du, Yunsong Li, Weiying Xie, Joey Tianyi Zhou

Dataset pruning aims to construct a coreset capable of achieving performance comparable to the original, full dataset.

Classification

Direct Distillation between Different Domains

no code implementations12 Jan 2024 Jialiang Tang, Shuo Chen, Gang Niu, Hongyuan Zhu, Joey Tianyi Zhou, Chen Gong, Masashi Sugiyama

Then, we build a fusion-activation mechanism to transfer the valuable domain-invariant knowledge to the student network, while simultaneously encouraging the adapter within the teacher network to learn the domain-specific knowledge of the target data.

Domain Adaptation Knowledge Distillation

FedLoGe: Joint Local and Generic Federated Learning under Long-tailed Data

1 code implementation17 Jan 2024 Zikai Xiao, Zihan Chen, Liyinglan Liu, Yang Feng, Jian Wu, Wanlu Liu, Joey Tianyi Zhou, Howard Hao Yang, Zuozhu Liu

Federated Long-Tailed Learning (Fed-LT), a paradigm wherein data collected from decentralized local clients manifests a globally prevalent long-tailed distribution, has garnered considerable attention in recent times.

Personalized Federated Learning Representation Learning

Two Trades is not Baffled: Condensing Graph via Crafting Rational Gradient Matching

1 code implementation7 Feb 2024 Tianle Zhang, Yuchen Zhang, Kun Wang, Kai Wang, Beining Yang, Kaipeng Zhang, Wenqi Shao, Ping Liu, Joey Tianyi Zhou, Yang You

Training on large-scale graphs has achieved remarkable results in graph representation learning, but its cost and storage have raised growing concerns.

Graph Representation Learning

Selective Learning: Towards Robust Calibration with Dynamic Regularization

no code implementations13 Feb 2024 Zongbo Han, Yifeng Yang, Changqing Zhang, Linjun Zhang, Joey Tianyi Zhou, QinGhua Hu, Huaxiu Yao

The objective can be understood as seeking a model that fits the ground-truth labels by increasing the confidence while also maximizing the entropy of predicted probabilities by decreasing the confidence.

Multisize Dataset Condensation

1 code implementation10 Mar 2024 Yang He, Lingao Xiao, Joey Tianyi Zhou, Ivor Tsang

These two challenges connect to the "subset degradation problem" in traditional dataset condensation: a subset from a larger condensed dataset is often unrepresentative compared to directly condensing the whole dataset to that smaller size.

Dataset Condensation

CrossGLG: LLM Guides One-shot Skeleton-based 3D Action Recognition in a Cross-level Manner

no code implementations15 Mar 2024 Tingbing Yan, Wenzheng Zeng, Yang Xiao, Xingyu Tong, Bo Tan, Zhiwen Fang, Zhiguo Cao, Joey Tianyi Zhou

Most existing one-shot skeleton-based action recognition focuses on raw low-level information (e. g., joint location), and may suffer from local information loss and low generalization ability.

Skeleton Based Action Recognition

Collaborative Knowledge Infusion for Low-resource Stance Detection

no code implementations28 Mar 2024 Ming Yan, Joey Tianyi Zhou, Ivor W. Tsang

Specifically, our stance detection approach leverages target background knowledge collaboratively from different knowledge sources with the help of knowledge alignment.

Stance Detection

Shortcuts Arising from Contrast: Effective and Covert Clean-Label Attacks in Prompt-Based Learning

no code implementations30 Mar 2024 Xiaopeng Xie, Ming Yan, Xiwen Zhou, Chenlong Zhao, Suli Wang, Yong Zhang, Joey Tianyi Zhou

In addressing this issue, we are inspired by the notion that a backdoor acts as a shortcut and posit that this shortcut stems from the contrast between the trigger and the data utilized for poisoning.

Data Augmentation Few-Shot Text Classification +1

Cannot find the paper you are looking for? You can Submit a new open access paper.