no code implementations • 13 Mar 2024 • Kento Kawaharazuka, Naoaki Kanazawa, Yoshiki Obinata, Kei Okada, Masayuki Inaba
By using models that can compute the similarity between images and texts continuously over time, we can capture the state changes of food while cooking.