no code implementations • 5 Jan 2024 • SeokHyun Seo, Jinwoo Hong, JungWoo Chae, Kyungyul Kim, Sangheum Hwang
Through experimental analysis using attention maps in ViT, we observe that the rich representations deteriorate when trained on a small dataset.