no code implementations • 22 Aug 2024 • Wenhui Li, Xinqi Su, Dan Song, Lanjun Wang, Kun Zhang, An-An Liu
Prior image-text matching methods have shown remarkable performance on many benchmark datasets, but most of them overlook dataset bias, which exists both within and across modalities, and tend to learn spurious correlations that severely degrade the model's generalization ability.
1 code implementation • 12 Aug 2024 • Xuanpu Zhang, Dan Song, Pengxin Zhan, Tianyu Chang, Jianhao Zeng, QingGuo Chen, Weihua Luo, AnAn Liu
Recent methods model virtual try-on as an image mask-inpainting task, which requires masking the person image and results in a significant loss of spatial information.
no code implementations • 27 Jun 2024 • Okan Bulut, Maggie Beiting-Parrish, Jodi M. Casabianca, Sharon C. Slater, Hong Jiao, Dan Song, Christopher M. Ormerod, Deborah Gbemisola Fabiyi, Rodica Ivan, Cole Walsh, Oscar Rios, Joshua Wilson, Seyma N. Yildirim-Erbasli, Tarid Wongvorachan, Joyce Xinle Liu, Bin Tan, Polina Morilova
In this paper, a diverse group of AIME members examines the ethical implications of AI-powered tools in educational measurement, explores significant challenges such as automation bias and environmental impact, and proposes solutions to ensure AI's responsible and effective use in education.
no code implementations • 13 Mar 2024 • Dan Song, Xuanpu Zhang, Jianhao Zeng, Pengxin Zhan, QingGuo Chen, Weihua Luo, An-An Liu
Image-based virtual try-on aims to transfer target in-shop clothing to a dressed model image. Its objectives are to completely remove the original clothing while preserving the content outside the try-on area, to fit the target clothing naturally, and to correctly inpaint the gap between the target clothing and the original clothing.
1 code implementation • CVPR 2024 • Jianhao Zeng, Dan Song, Weizhi Nie, Hongshuo Tian, Tongtong Wang, AnAn Liu
Generative Adversarial Networks (GANs) dominate research on image-based virtual try-on, but have not resolved problems such as unnatural garment deformation and blurry generation quality.
no code implementations • 30 Nov 2023 • Dan Song, Xinwei Fu, Ning Liu, Weizhi Nie, Wenhui Li, Lanjun Wang, You Yang, AnAn Liu
Consequently, this paper aims to improve the confidence with view selection and hierarchical prompts.
1 code implementation • 8 Nov 2023 • Dan Song, Xuanpu Zhang, Juan Zhou, Weizhi Nie, Ruofeng Tong, Mohan Kankanhalli, An-An Liu
Image-based virtual try-on aims to synthesize a naturally dressed person image with a clothing image, which revolutionizes online shopping and inspires related topics within image generation, showing both research significance and commercial potential.
no code implementations • 3 Jun 2023 • Weizhi Nie, Yuhe Yu, Chen Zhang, Dan Song, Lina Zhao, Yunpeng Bai
Our method can also find key clinical indicators of important outcomes that can be used to improve treatment options.
1 code implementation • 2 Jun 2023 • Weizhi Nie, Chen Zhang, Dan Song, Lina Zhao, Yunpeng Bai, Keliang Xie, AnAn Liu
The chest X-ray is often utilized for diagnosing common thoracic diseases.
no code implementations • 20 May 2023 • Weizhi Nie, Chen Zhang, Dan Song, Yunpeng Bai, Keliang Xie, AnAn Liu
The chest X-ray (CXR) is commonly employed to diagnose thoracic illnesses, but accurate automatic diagnosis with this method remains challenging due to the complex relationships between pathologies.
no code implementations • 20 May 2023 • Weizhi Nie, Chen Zhang, Dan Song, Lina Zhao, Yunpeng Bai, Keliang Xie, AnAn Liu
The chest X-ray (CXR) is one of the most common and readily available medical tests used to diagnose common diseases of the chest.
no code implementations • 11 Apr 2023 • Yue Zhang, Chengtao Peng, Qiuli Wang, Dan Song, Kaiyan Li, S. Kevin Zhou
In addition, we propose a Dynamic Feature Unification Module to integrate information from a varying number of available modalities, which makes the network robust to randomly missing modalities.
1 code implementation • CVPR 2023 • Diqiong Jiang, Dan Song, Ruofeng Tong, Min Tang
StyleIPSB gives us a novel tool for high-fidelity face swapping, and we propose a three-stage framework for face swapping with StyleIPSB.
1 code implementation • 2 Aug 2022 • Weiwei Cui, Yaqi Wang, Yilong Li, Dan Song, Xingyong Zuo, Jiaojiao Wang, Yifan Zhang, Huiyu Zhou, Bung san Chong, Liaoyuan Zeng, Qianni Zhang
This work provides a new benchmark for the tooth volume segmentation task, and the experiment can serve as the baseline for future AI-based dental imaging research and clinical application development.
1 code implementation • 17 Jun 2022 • Weiwei Cui, Yaqi Wang, Qianni Zhang, Huiyu Zhou, Dan Song, Xingyong Zuo, Gangyong Jia, Liaoyuan Zeng
Several state-of-the-art segmentation methods are evaluated on this dataset.
1 code implementation • 14 Aug 2018 • Chengyang Li, Dan Song, Ruofeng Tong, Min Tang
To narrow this gap, we propose a network fusion architecture, which consists of a multispectral proposal network to generate pedestrian proposals, and a subsequent multispectral classification network to distinguish pedestrian instances from hard negatives.
no code implementations • 14 Mar 2018 • Chengyang Li, Dan Song, Ruofeng Tong, Min Tang
Multispectral images of color-thermal pairs have been shown to be more effective than a single color channel for pedestrian detection, especially under challenging illumination conditions.
no code implementations • 6 Jun 2014 • Xiaoyu Chen, Dan Song, Dongming Wang
We propose an approach to generate geometric theorems from electronic images of diagrams automatically.