no code implementations • 9 Dec 2023 • Chaoquan Jiang, Jinqiang Wang, Rui Hu, Jitao Sang
To address this issue, We propose a language-assisted diagnostic method that uses texts instead of images to diagnose bugs in vision models based on multi-modal models (eg CLIP).
no code implementations • 17 Nov 2021 • Jitao Sang, Jinqiang Wang, Rui Hu, Chaoquan Jiang
Deep network models perform excellently on In-Distribution (ID) data, but can significantly fail on Out-Of-Distribution (OOD) data.