Search Results for author: Keisuke Doman

Found 1 paper, 0 papers with code

IPA-CLIP: Integrating Phonetic Priors into Vision and Language Pretraining

no code implementations • 6 Mar 2023 • Chihaya Matsuhira, Marc A. Kastner, Takahiro Komamizu, Takatsugu Hirayama, Keisuke Doman, Yasutomo Kawanishi, Ichiro Ide

Furthermore, in some multimodal retrieval tasks, we confirm that the proposed pronunciation encoder enhances the performance of the text encoder and that the pronunciation encoder handles nonsense words in a more phonetic manner than the text encoder.

Retrieval
