1 code implementation • 29 Apr 2024 • Guillaume Astruc, Nicolas Dufour, Ioannis Siglidis, Constantin Aronssohn, Nacim Bouia, Stephanie Fu, Romain Loiseau, Van Nguyen Nguyen, Charles Raude, Elliot Vincent, Lintao XU, HongYu Zhou, Loic Landrieu
Determining the location of an image anywhere on Earth is a complex visual task, which makes it particularly relevant for evaluating computer vision algorithms.
Ranked #1 on Photo geolocation estimation on OpenStreetView-5M
1 code implementation • 15 Mar 2024 • Stephanie Fu, Mark Hamilton, Laura Brandt, Axel Feldman, Zhoutong Zhang, William T. Freeman
Deep features are a cornerstone of computer vision research, capturing image semantics and enabling the community to solve downstream tasks even in the zero- or few-shot regime.
Ranked #1 on Feature Upsampling on <h2>oi</h2>
no code implementations • 3 Jan 2024 • Pratyusha Sharma, Tamar Rott Shaham, Manel Baradad, Stephanie Fu, Adrian Rodriguez-Munoz, Shivam Duggal, Phillip Isola, Antonio Torralba
Although LLM-generated images do not look like natural images, results on image generation and the ability of models to correct these generated images indicate that precise modeling of strings can teach language models about numerous aspects of the visual world.
1 code implementation • NeurIPS 2023 • Stephanie Fu, Netanel Tamir, Shobhita Sundaram, Lucy Chai, Richard Zhang, Tali Dekel, Phillip Isola
Furthermore, our metric outperforms both prior learned metrics and recent large vision models on these tasks.
no code implementations • ICLR 2022 • Mark Hamilton, Scott Lundberg, Lei Zhang, Stephanie Fu, William T. Freeman
Visual search, recommendation, and contrastive similarity learning power technologies that impact billions of users worldwide.
1 code implementation • 14 Jul 2020 • Mark Hamilton, Stephanie Fu, Mindren Lu, Johnny Bui, Darius Bopp, Zhenbang Chen, Felix Tran, Margaret Wang, Marina Rogers, Lei Zhang, Chris Hoder, William T. Freeman
We introduce MosAIc, an interactive web app that allows users to find pairs of semantically related artworks that span different cultures, media, and millennia.
Cultural Vocal Bursts Intensity Prediction Image Retrieval +2