Search Results for author: Chuang Zhang

Found 29 papers, 17 papers with code

Optimal Transport Guided Correlation Assignment for Multimodal Entity Linking

1 code implementation4 Jun 2024 Zefeng Zhang, Jiawei Sheng, Chuang Zhang, Yunzhi Liang, Wenyuan Zhang, Siqi Wang, Tingwen Liu

Thereby, we exploit the correlation between multimodal features to enhance multimodal fusion, and the correlation between mentions and entities to enhance fine-grained matching.

Entity Linking Knowledge Distillation

SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation

2 code implementations30 May 2024 Wenchao Sun, Xuewu Lin, Yining Shi, Chuang Zhang, Haoran Wu, Sifa Zheng

To this end, we explore the sparse representation and review the task design for end-to-end autonomous driving, proposing a new paradigm named SparseDrive.

Attribute Autonomous Driving +1

Multi-Scale Representations by Varying Window Attention for Semantic Segmentation

2 code implementations25 Apr 2024 Haotian Yan, Ming Wu, Chuang Zhang

VWA leverages the local window attention (LWA) and disentangles LWA into the query window and context window, allowing the context's scale to vary for the query to learn representations at multiple scales.

Decoder Semantic Segmentation

SGV3D:Towards Scenario Generalization for Vision-based Roadside 3D Object Detection

1 code implementation29 Jan 2024 Lei Yang, Xinyu Zhang, Jun Li, Li Wang, Chuang Zhang, Li Ju, Zhiwei Li, Yang shen

Our method surpasses all previous methods by a significant margin in new scenes, including +42. 57% for vehicle, +5. 87% for pedestrian, and +14. 89% for cyclist compared to BEVHeight on the DAIR-V2X-I heterologous benchmark.

3D Object Detection Autonomous Vehicles +1

UAV Swarm-enabled Collaborative Secure Relay Communications with Time-domain Colluding Eavesdropper

no code implementations3 Oct 2023 Chuang Zhang, Geng Sun, Qingqing Wu, Jiahui Li, Shuang Liang, Dusit Niyato, Victor C. M. Leung

Unmanned aerial vehicles (UAVs) as aerial relays are practically appealing for assisting Internet of Things (IoT) network.

MonoGAE: Roadside Monocular 3D Object Detection with Ground-Aware Embeddings

no code implementations30 Sep 2023 Lei Yang, Jiaxin Yu, Xinyu Zhang, Jun Li, Li Wang, Yi Huang, Chuang Zhang, Hong Wang, Yiming Li

We discover that most existing monocular 3D object detectors rely on the ego-vehicle prior assumption that the optical axis of the camera is parallel to the ground.

Autonomous Driving Monocular 3D Object Detection +1

Cross-modal Contrastive Learning for Multimodal Fake News Detection

1 code implementation25 Feb 2023 Longzheng Wang, Chuang Zhang, Hongbo Xu, Yongxiu Xu, Xiaohan Xu, Siqi Wang

An attention mechanism with an attention guidance module is implemented to help effectively and interpretably aggregate the aligned unimodal representations and the cross-modality correlations.

Contrastive Learning Fake News Detection +1

SwiftAvatar: Efficient Auto-Creation of Parameterized Stylized Character on Arbitrary Avatar Engines

no code implementations19 Jan 2023 Shizun Wang, Weihong Zeng, Xu Wang, Hao Yang, Li Chen, Yi Yuan, Yunzhao Zeng, Min Zheng, Chuang Zhang, Ming Wu

To this end, we propose SwiftAvatar, a novel avatar auto-creation framework that is evidently superior to previous works.

Privileged Prior Information Distillation for Image Matting

no code implementations25 Nov 2022 Cheng Lyu, Jiake Xie, Bo Xu, Cheng Lu, Han Huang, Xin Huang, Ming Wu, Chuang Zhang, Yong Tang

Performance of trimap-free image matting methods is limited when trying to decouple the deterministic and undetermined regions, especially in the scenes where foregrounds are semantically ambiguous, chromaless, or high transmittance.

Image Matting

Mix-Teaching: A Simple, Unified and Effective Semi-Supervised Learning Framework for Monocular 3D Object Detection

1 code implementation10 Jul 2022 Lei Yang, Xinyu Zhang, Li Wang, Minghan Zhu, Chuang Zhang, Jun Li

Besides, by leveraging full training set and the additional 48K raw images of KITTI, it can further improve the MonoFlex by +4. 65% improvement on AP@0. 7 for car detection, reaching 18. 54% AP@0. 7, which ranks the 1st place among all monocular based methods on KITTI test leaderboard.

Autonomous Driving Model Optimization +2

Towards Harnessing Feature Embedding for Robust Learning with Noisy Labels

no code implementations27 Jun 2022 Chuang Zhang, Li Shen, Jian Yang, Chen Gong

To exploit this effect, the model prediction-based methods have been widely adopted, which aim to exploit the outputs of DNNs in the early stage of learning to correct noisy labels.

Learning with noisy labels Memorization

CLTS+: A New Chinese Long Text Summarization Dataset with Abstractive Summaries

no code implementations9 Jun 2022 Xiaojun Liu, Shunan Zang, Chuang Zhang, Xiaojun Chen, Yangyang Ding

In order to solve this problem, we paraphrase the reference summaries in CLTS, the Chinese Long Text Summarization dataset, correct errors of factual inconsistencies, and propose the first Chinese Long Text Summarization dataset with a high level of abstractiveness, CLTS+, which contains more than 180K article-summary pairs and is available online.

Text Summarization

Multi-class Label Noise Learning via Loss Decomposition and Centroid Estimation

no code implementations21 Mar 2022 Yongliang Ding, Tao Zhou, Chuang Zhang, Yijing Luo, Juan Tang, Chen Gong

Further, by defining a new form of data centroid, we transform the recovery problem of a label-dependent part to a centroid estimation problem.

Binary Classification

Lawin Transformer: Improving Semantic Segmentation Transformer with Multi-Scale Representations via Large Window Attention

2 code implementations5 Jan 2022 Haotian Yan, Chuang Zhang, Ming Wu

In this paper, we succeed in introducing multi-scale representations into semantic segmentation ViT via window attention mechanism and further improves the performance and efficiency.

Decoder Image Classification +2

Self-Supervised Modality-Aware Multiple Granularity Pre-Training for RGB-Infrared Person Re-Identification

1 code implementation12 Dec 2021 Lin Wan, Qianyan Jing, Zongyuan Sun, Chuang Zhang, Zhihang Li, Yehansen Chen

Much of that is due to the notorious modality bias training issue brought by the single-modality ImageNet pre-training, which might yield RGB-biased representations that severely hinder the cross-modality image retrieval.

Contrastive Learning Cross-Modality Person Re-identification +3

SamplingAug: On the Importance of Patch Sampling Augmentation for Single Image Super-Resolution

1 code implementation30 Nov 2021 Shizun Wang, Ming Lu, Kaixin Chen, Jiaming Liu, Xiaoqi Li, Chuang Zhang, Ming Wu

However, existing methods mostly train the DNNs on uniformly sampled LR-HR patch pairs, which makes them fail to fully exploit informative patches within the image.

Data Augmentation Image Super-Resolution

ConTNet: Why not use convolution and transformer at the same time?

2 code implementations27 Apr 2021 Haotian Yan, Zhe Li, Weijian Li, Changhu Wang, Ming Wu, Chuang Zhang

It is also worth pointing that, given identical strong data augmentations, the performance improvement of ConTNet is more remarkable than that of ResNet.

Image Classification object-detection +1

Contextual Graph Reasoning Networks

no code implementations1 Jan 2021 Zhaoqing Wang, Jiaming Liu, Yangyuxuan Kang, Mingming Gong, Chuang Zhang, Ming Lu, Ming Wu

Graph Reasoning has shown great potential recently in modeling long-range dependencies, which are crucial for various computer vision tasks.

2D Human Pose Estimation Instance Segmentation +4

GINet: Graph Interaction Network for Scene Parsing

1 code implementation ECCV 2020 Tianyi Wu, Yu Lu, Yu Zhu, Chuang Zhang, Ming Wu, Zhanyu Ma, Guodong Guo

GI unit is further improved by the SC-loss to enhance the semantic representations over the exemplar-based semantic graph.

Scene Parsing

FGSD: A Dataset for Fine-Grained Ship Detection in High Resolution Satellite Images

no code implementations15 Mar 2020 Kaiyan Chen, Ming Wu, Jiaming Liu, Chuang Zhang

To further promote the research of ship detection, we introduced a new fine-grained ship detection datasets, which is named as FGSD.

C-DLinkNet: considering multi-level semantic features for human parsing

1 code implementation31 Jan 2020 Yu Lu, Muyan Feng, Ming Wu, Chuang Zhang

Human parsing is an essential branch of semantic segmentation, which is a fine-grained semantic segmentation task to identify the constituent parts of human.

Decoder Human Parsing +2

Improving Abstractive Text Summarization with History Aggregation

no code implementations24 Dec 2019 Pengcheng Liao, Chuang Zhang, Xiaojun Chen, Xiaofei Zhou

Recent neural sequence to sequence models have provided feasible solutions for abstractive summarization.

Abstractive Text Summarization Decoder

Cannot find the paper you are looking for? You can Submit a new open access paper.