Search Results for author: Daoan Zhang

Found 16 papers, 6 papers with code

FINEMATCH: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction

no code implementations • 23 Apr 2024 • Hang Hua, Jing Shi, Kushal Kafle, Simon Jenni, Daoan Zhang, John Collomosse, Scott Cohen, Jiebo Luo

To address this, we propose FineMatch, a new aspect-based fine-grained text and image matching benchmark, focusing on text and image mismatch detection and correction.

Hallucination In-Context Learning +2

Emo-Avatar: Efficient Monocular Video Style Avatar through Texture Rendering

no code implementations • 1 Feb 2024 • Pinxin Liu, Luchuan Song, Daoan Zhang, Hang Hua, Yunlong Tang, Huaijin Tu, Jiebo Luo, Chenliang Xu

To address the above problems, we propose the Efficient Monotonic Video Style Avatar (Emo-Avatar) through deferred neural rendering that enhances StyleGAN's capacity for producing dynamic, drivable portrait videos.

Contrastive Learning Neural Rendering

CoCoT: Contrastive Chain-of-Thought Prompting for Large Multimodal Models with Multiple Image Inputs

no code implementations • 5 Jan 2024 • Daoan Zhang, Junming Yang, Hanjia Lyu, Zijian Jin, Yuan YAO, Mingkai Chen, Jiebo Luo

When exploring the development of Artificial General Intelligence (AGI), a critical task for these models involves interpreting and processing information from multiple image inputs.

Image Comprehension Text Matching +1

Video Understanding with Large Language Models: A Survey

1 code implementation • 29 Dec 2023 • Yunlong Tang, Jing Bi, Siting Xu, Luchuan Song, Susan Liang, Teng Wang, Daoan Zhang, Jie An, Jingyang Lin, Rongyi Zhu, Ali Vosoughi, Chao Huang, Zeliang Zhang, Feng Zheng, JianGuo Zhang, Ping Luo, Jiebo Luo, Chenliang Xu

With the burgeoning growth of online video platforms and the escalating volume of video content, the demand for proficient video understanding tools has intensified markedly.

Video Understanding

Semi-supervised Semantic Segmentation via Boosting Uncertainty on Unlabeled Data

no code implementations • 30 Nov 2023 • Daoan Zhang, Yunhao Luo, JianGuo Zhang

We first figure out that the distribution gap between labeled and unlabeled datasets cannot be ignored, even though the two datasets are sampled from the same distribution.

Segmentation Semi-Supervised Semantic Segmentation

GPT-4V(ision) as A Social Media Analysis Engine

1 code implementation • 13 Nov 2023 • Hanjia Lyu, Jinfa Huang, Daoan Zhang, Yongsheng Yu, Xinyi Mou, Jinsheng Pan, Zhengyuan Yang, Zhongyu Wei, Jiebo Luo

Our investigation begins with a preliminary quantitative analysis for each task using existing benchmark datasets, followed by a careful review of the results and a selection of qualitative samples that illustrate GPT-4V's potential in understanding multimodal social media content.

Hallucination Hate Speech Detection +1

Cross Contrasting Feature Perturbation for Domain Generalization

2 code implementations • ICCV 2023 • Chenming Li, Daoan Zhang, Wenjian Huang, JianGuo Zhang

Domain generalization (DG) aims to learn a robust model from source domains that generalizes well to unseen target domains.

Domain Generalization

DNAGPT: A Generalized Pre-trained Tool for Versatile DNA Sequence Analysis Tasks

no code implementations • 11 Jul 2023 • Daoan Zhang, Weitong Zhang, Yu Zhao, JianGuo Zhang, Bing He, Chenchen Qin, Jianhua Yao

Pre-trained large language models demonstrate potential in extracting information from DNA sequences, yet adapting to a variety of tasks and data modalities remains a challenge.

Binary Classification DNA analysis +1

Black-box Source-free Domain Adaptation via Two-stage Knowledge Distillation

no code implementations • 13 May 2023 • Shuai Wang, Daoan Zhang, Zipei Yan, Shitong Shao, Rui Li

In Stage I, we train the target model from scratch with soft pseudo-labels generated by the source model in a knowledge distillation manner.

Knowledge Distillation Source-Free Domain Adaptation +1

Towards Generalizable Medical Image Segmentation with Pixel-wise Uncertainty Estimation

no code implementations • 13 May 2023 • Shuai Wang, Zipei Yan, Daoan Zhang, Zhongsen Li, Sirui Wu, Wenxuan Chen, Rui Li

In contrast, the IID hypothesis is not universally guaranteed in numerous real-world applications, especially in medical image analysis.

Image Segmentation Medical Image Segmentation +1

Feature Alignment and Uniformity for Test Time Adaptation

1 code implementation • CVPR 2023 • Shuai Wang, Daoan Zhang, Zipei Yan, JianGuo Zhang, Rui Li

Test-time adaptation (TTA) aims to adapt deep neural networks when they receive out-of-distribution test domain samples.

Domain Generalization Image Segmentation +3

Prototype Knowledge Distillation for Medical Segmentation with Missing Modality

1 code implementation • 17 Mar 2023 • Shuai Wang, Zipei Yan, Daoan Zhang, Haining Wei, Zhongsen Li, Rui Li

Specifically, our ProtoKD not only distills the pixel-wise knowledge of multi-modality data into single-modality data but also transfers intra-class and inter-class feature variations, so that the student model can learn a more robust feature representation from the teacher model and perform inference with only single-modality data.

Image Segmentation Knowledge Distillation +3

Bootstrap The Original Latent: Learning a Private Model from a Black-box Model

no code implementations • 7 Mar 2023 • Shuai Wang, Daoan Zhang, JianGuo Zhang, Weiwei Zhang, Rui Li

In this paper, considering the balance of data/model privacy of model owners and user needs, we propose a new setting called Back-Propagated Black-Box Adaptation (BPBA) for users to better train their private models via the guidance of the back-propagated results of a Black-box foundation/source model.

Aggregation of Disentanglement: Reconsidering Domain Variations in Domain Generalization

no code implementations • 5 Feb 2023 • Daoan Zhang, Mingkai Chen, Chenming Li, Lingyun Huang, JianGuo Zhang

Different from learning domain invariant features from source domains, we decouple the input images into Domain Expert Features and noise.

Contrastive Learning Disentanglement +1

Rethinking Alignment and Uniformity in Unsupervised Image Semantic Segmentation

no code implementations • 26 Nov 2022 • Daoan Zhang, Chenming Li, Haoquan Li, Wenjian Huang, Lingyun Huang, JianGuo Zhang

Experimental results on multiple semantic segmentation benchmarks show that our unsupervised segmentation framework specializes in capturing semantic representations and outperforms all unpretrained methods and even several pretrained ones.

Representation Learning Segmentation +2
