Search Results for author: Dongyue Chen

Found 14 papers, 5 papers with code

CRoF: CLIP-based Robust Few-shot Learning on Noisy Labels

no code implementations17 Dec 2024 Shizhuo Deng, Bowen Han, Jiaqi Chen, Hao Wang, Dongyue Chen, Tong Jia

Noisy labels threaten the robustness of few-shot learning (FSL) due to the inexact features in a new domain.

Domain Generalization Few-Shot Learning +1

Learning Pattern-Specific Experts for Time Series Forecasting Under Patch-level Distribution Shift

1 code implementation13 Oct 2024 Yanru Sun, Zongxia Xie, Emadeldeen Eldele, Dongyue Chen, QinGhua Hu, Min Wu

To address these challenges, we propose \textbf{TFPS}, a novel architecture that leverages pattern-specific experts for more accurate and adaptable time series forecasting.

Time Series Time Series Forecasting

DriveScape: Towards High-Resolution Controllable Multi-View Driving Video Generation

no code implementations9 Sep 2024 Wei Wu, Xi Guo, Weixuan Tang, Tingxuan Huang, Chiyu Wang, Dongyue Chen, Chenjing Ding

However, existing approaches often struggle with multi-view video generation due to the challenges of integrating 3D information while maintaining spatial-temporal consistency and effectively learning from a unified model.

Autonomous Driving Video Generation

Open-Vocabulary X-ray Prohibited Item Detection via Fine-tuning CLIP

no code implementations16 Jun 2024 Shuyang Lin, Tong Jia, Hao Wang, Bowen Ma, Mingyuan Li, Dongyue Chen

To address aforementioned challenges, in this paper, we introduce distillation-based open-vocabulary object detection (OVOD) task into X-ray security inspection domain by extending CLIP to learn visual representations in our specific X-ray domain, aiming to detect novel prohibited item categories beyond base categories on which the detector is trained.

object-detection Open-vocabulary object detection +1

Fine-Grained Domain Generalization with Feature Structuralization

no code implementations13 Jun 2024 Wenlong Yu, Dongyue Chen, Qilong Wang, QinGhua Hu

Likewise, we propose a Feature Structuralized Domain Generalization (FSDG) model, wherein features experience structuralization into common, specific, and confounding segments, harmoniously aligned with their relevant semantic concepts, to elevate performance in FGDG.

Domain Generalization Specificity

MMCL: Boosting Deformable DETR-Based Detectors with Multi-Class Min-Margin Contrastive Learning for Superior Prohibited Item Detection

1 code implementation5 Jun 2024 Mingyuan Li, Tong Jia, Hui Lu, Bowen Ma, Hao Wang, Dongyue Chen

Prohibited Item detection in X-ray images is one of the most effective security inspection methods. However, differing from natural light images, the unique overlapping phenomena in X-ray images lead to the coupling of foreground and background features, thereby lowering the accuracy of general object detectors. Therefore, we propose a Multi-Class Min-Margin Contrastive Learning (MMCL) method that, by clarifying the category semantic information of content queries under the deformable DETR architecture, aids the model in extracting specific category foreground information from coupled features. Specifically, after grouping content queries by the number of categories, we employ the Multi-Class Inter-Class Exclusion (MIE) loss to push apart content queries from different groups.

Contrastive Learning

Hierarchical Classification Auxiliary Network for Time Series Forecasting

1 code implementation29 May 2024 Yanru Sun, Zongxia Xie, Dongyue Chen, Emadeldeen Eldele, QinGhua Hu

In this work, we introduce a novel approach by tokenizing time series values to train forecasting models via cross-entropy loss, while considering the continuous nature of time series data.

Classification Time Series +1

AO-DETR: Anti-Overlapping DETR for X-Ray Prohibited Items Detection

1 code implementation7 Mar 2024 Mingyuan Li, Tong Jia, Hao Wang, Bowen Ma, Shuyang Lin, Da Cai, Dongyue Chen

Considering the significant overlapping phenomenon in X-ray prohibited item images, we propose an Anti-Overlapping DETR (AO-DETR) based on one of the state-of-the-art general object detectors, DINO.

Decoder

Mask-adaptive Gated Convolution and Bi-directional Progressive Fusion Network for Depth Completion

no code implementations15 Jan 2024 Tingxuan Huang, Jiacheng Miao, Shizhuo Deng, Tong, Dongyue Chen

Depth completion is a critical task for handling depth images with missing pixels, which can negatively impact further applications.

Decoder Depth Completion

Prior Knowledge Guided Network for Video Anomaly Detection

no code implementations4 Sep 2023 Zhewen Deng, Dongyue Chen, Shizhuo Deng

Video Anomaly Detection (VAD) involves detecting anomalous events in videos, presenting a significant and intricate task within intelligent video surveillance.

Anomaly Detection Knowledge Distillation +1

AGG-Net: Attention Guided Gated-convolutional Network for Depth Image Completion

1 code implementation ICCV 2023 Dongyue Chen, Tingxuan Huang, Zhimin Song, Shizhuo Deng, Tong Jia

In the encoding stage, an Attention Guided Gated-Convolution (AG-GConv) module is proposed to realize the fusion of depth and color features at different scales, which can effectively reduce the negative impacts of invalid depth data on the reconstruction.

Learning Deep Representations by Mutual Information for Person Re-identification

no code implementations16 Aug 2019 Peng Chen, Tong Jia, Pengfei Wu, Jianjun Wu, Dongyue Chen

Most existing person re-identification (ReID) methods have good feature representations to distinguish pedestrians with deep convolutional neural network (CNN) and metric learning methods.

Metric Learning Person Re-Identification

Cannot find the paper you are looking for? You can Submit a new open access paper.