Search Results for author: Chuang Zhang

Found 26 papers, 14 papers with code

Improving Abstractive Text Summarization with History Aggregation

no code implementations24 Dec 2019 Pengcheng Liao, Chuang Zhang, Xiaojun Chen, Xiaofei Zhou

Recent neural sequence to sequence models have provided feasible solutions for abstractive summarization.

Abstractive Text Summarization

C-DLinkNet: considering multi-level semantic features for human parsing

1 code implementation31 Jan 2020 Yu Lu, Muyan Feng, Ming Wu, Chuang Zhang

Human parsing is an essential branch of semantic segmentation, which is a fine-grained semantic segmentation task to identify the constituent parts of human.

Human Parsing Segmentation +1

FGSD: A Dataset for Fine-Grained Ship Detection in High Resolution Satellite Images

no code implementations15 Mar 2020 Kaiyan Chen, Ming Wu, Jiaming Liu, Chuang Zhang

To further promote the research of ship detection, we introduced a new fine-grained ship detection datasets, which is named as FGSD.

GINet: Graph Interaction Network for Scene Parsing

1 code implementation ECCV 2020 Tianyi Wu, Yu Lu, Yu Zhu, Chuang Zhang, Ming Wu, Zhanyu Ma, Guodong Guo

GI unit is further improved by the SC-loss to enhance the semantic representations over the exemplar-based semantic graph.

Scene Parsing

Contextual Graph Reasoning Networks

no code implementations1 Jan 2021 Zhaoqing Wang, Jiaming Liu, Yangyuxuan Kang, Mingming Gong, Chuang Zhang, Ming Lu, Ming Wu

Graph Reasoning has shown great potential recently in modeling long-range dependencies, which are crucial for various computer vision tasks.

2D Human Pose Estimation Instance Segmentation +4

ConTNet: Why not use convolution and transformer at the same time?

2 code implementations27 Apr 2021 Haotian Yan, Zhe Li, Weijian Li, Changhu Wang, Ming Wu, Chuang Zhang

It is also worth pointing that, given identical strong data augmentations, the performance improvement of ConTNet is more remarkable than that of ResNet.

Image Classification object-detection +1

SamplingAug: On the Importance of Patch Sampling Augmentation for Single Image Super-Resolution

1 code implementation30 Nov 2021 Shizun Wang, Ming Lu, Kaixin Chen, Jiaming Liu, Xiaoqi Li, Chuang Zhang, Ming Wu

However, existing methods mostly train the DNNs on uniformly sampled LR-HR patch pairs, which makes them fail to fully exploit informative patches within the image.

Data Augmentation Image Super-Resolution

Self-Supervised Modality-Aware Multiple Granularity Pre-Training for RGB-Infrared Person Re-Identification

1 code implementation12 Dec 2021 Lin Wan, Qianyan Jing, Zongyuan Sun, Chuang Zhang, Zhihang Li, Yehansen Chen

Much of that is due to the notorious modality bias training issue brought by the single-modality ImageNet pre-training, which might yield RGB-biased representations that severely hinder the cross-modality image retrieval.

Contrastive Learning Cross-Modality Person Re-identification +3

Lawin Transformer: Improving Semantic Segmentation Transformer with Multi-Scale Representations via Large Window Attention

2 code implementations5 Jan 2022 Haotian Yan, Chuang Zhang, Ming Wu

In this paper, we succeed in introducing multi-scale representations into semantic segmentation ViT via window attention mechanism and further improves the performance and efficiency.

Image Classification Segmentation +1

Multi-class Label Noise Learning via Loss Decomposition and Centroid Estimation

no code implementations21 Mar 2022 Yongliang Ding, Tao Zhou, Chuang Zhang, Yijing Luo, Juan Tang, Chen Gong

Further, by defining a new form of data centroid, we transform the recovery problem of a label-dependent part to a centroid estimation problem.

Binary Classification

CLTS+: A New Chinese Long Text Summarization Dataset with Abstractive Summaries

no code implementations9 Jun 2022 Xiaojun Liu, Shunan Zang, Chuang Zhang, Xiaojun Chen, Yangyang Ding

In order to solve this problem, we paraphrase the reference summaries in CLTS, the Chinese Long Text Summarization dataset, correct errors of factual inconsistencies, and propose the first Chinese Long Text Summarization dataset with a high level of abstractiveness, CLTS+, which contains more than 180K article-summary pairs and is available online.

Text Summarization

Towards Harnessing Feature Embedding for Robust Learning with Noisy Labels

no code implementations27 Jun 2022 Chuang Zhang, Li Shen, Jian Yang, Chen Gong

To exploit this effect, the model prediction-based methods have been widely adopted, which aim to exploit the outputs of DNNs in the early stage of learning to correct noisy labels.

Learning with noisy labels Memorization

Mix-Teaching: A Simple, Unified and Effective Semi-Supervised Learning Framework for Monocular 3D Object Detection

1 code implementation10 Jul 2022 Lei Yang, Xinyu Zhang, Li Wang, Minghan Zhu, Chuang Zhang, Jun Li

Besides, by leveraging full training set and the additional 48K raw images of KITTI, it can further improve the MonoFlex by +4. 65% improvement on AP@0. 7 for car detection, reaching 18. 54% AP@0. 7, which ranks the 1st place among all monocular based methods on KITTI test leaderboard.

Autonomous Driving Model Optimization +2

Privileged Prior Information Distillation for Image Matting

no code implementations25 Nov 2022 Cheng Lyu, Jiake Xie, Bo Xu, Cheng Lu, Han Huang, Xin Huang, Ming Wu, Chuang Zhang, Yong Tang

Performance of trimap-free image matting methods is limited when trying to decouple the deterministic and undetermined regions, especially in the scenes where foregrounds are semantically ambiguous, chromaless, or high transmittance.

Image Matting

SwiftAvatar: Efficient Auto-Creation of Parameterized Stylized Character on Arbitrary Avatar Engines

no code implementations19 Jan 2023 Shizun Wang, Weihong Zeng, Xu Wang, Hao Yang, Li Chen, Yi Yuan, Yunzhao Zeng, Min Zheng, Chuang Zhang, Ming Wu

To this end, we propose SwiftAvatar, a novel avatar auto-creation framework that is evidently superior to previous works.

Cross-modal Contrastive Learning for Multimodal Fake News Detection

1 code implementation25 Feb 2023 Longzheng Wang, Chuang Zhang, Hongbo Xu, Yongxiu Xu, Xiaohan Xu, Siqi Wang

An attention mechanism with an attention guidance module is implemented to help effectively and interpretably aggregate the aligned unimodal representations and the cross-modality correlations.

Contrastive Learning Fake News Detection +1

MonoGAE: Roadside Monocular 3D Object Detection with Ground-Aware Embeddings

no code implementations30 Sep 2023 Lei Yang, Jiaxin Yu, Xinyu Zhang, Jun Li, Li Wang, Yi Huang, Chuang Zhang, Hong Wang, Yiming Li

We discover that most existing monocular 3D object detectors rely on the ego-vehicle prior assumption that the optical axis of the camera is parallel to the ground.

Autonomous Driving Monocular 3D Object Detection +1

UAV Swarm-enabled Collaborative Secure Relay Communications with Time-domain Colluding Eavesdropper

no code implementations3 Oct 2023 Chuang Zhang, Geng Sun, Qingqing Wu, Jiahui Li, Shuang Liang, Dusit Niyato, Victor C. M. Leung

Unmanned aerial vehicles (UAVs) as aerial relays are practically appealing for assisting Internet of Things (IoT) network.

VN-Net: Vision-Numerical Fusion Graph Convolutional Network for Sparse Spatio-Temporal Meteorological Forecasting

no code implementations26 Jan 2024 Yutong Xiong, Xun Zhu, Ming Wu, Weiqing Li, Fanbin Mo, Chuang Zhang, Bin Zhang

Sparse meteorological forecasting is indispensable for fine-grained weather forecasting and deserves extensive attention.

SGV3D:Towards Scenario Generalization for Vision-based Roadside 3D Object Detection

1 code implementation29 Jan 2024 Lei Yang, Xinyu Zhang, Jun Li, Li Wang, Chuang Zhang, Li Ju, Zhiwei Li, Yang shen

Our method surpasses all previous methods by a significant margin in new scenes, including +42. 57% for vehicle, +5. 87% for pedestrian, and +14. 89% for cyclist compared to BEVHeight on the DAIR-V2X-I heterologous benchmark.

3D Object Detection Autonomous Vehicles +1

Multi-Scale Representations by Varying Window Attention for Semantic Segmentation

1 code implementation25 Apr 2024 Haotian Yan, Ming Wu, Chuang Zhang

VWA leverages the local window attention (LWA) and disentangles LWA into the query window and context window, allowing the context's scale to vary for the query to learn representations at multiple scales.

Cannot find the paper you are looking for? You can Submit a new open access paper.