no code implementations • 25 Feb 2025 • Eric Xue, Zeyi Huang, Yuyang Ji, Haohan Wang
These findings establish Iterative Refinement as an effective new strategy for LLM-driven ML automation and position IMPROVE as an accessible solution for building high-quality computer vision models without requiring ML expertise.
no code implementations • 8 Jan 2025 • Zeyi Huang, Yuyang Ji, Xiaofang Wang, Nikhil Mehta, Tong Xiao, DongHyun Lee, Sigmund Vanvalkenburgh, Shengxin Zha, Bolin Lai, Licheng Yu, Ning Zhang, Yong Jae Lee, Miao Liu
Long-form video understanding with Large Vision Language Models is challenged by the need to analyze temporally dispersed yet spatially concentrated key moments within limited context windows.
1 code implementation • 5 Jul 2024 • Yuhan Zhu, Yuyang Ji, Zhiyu Zhao, Gangshan Wu, LiMin Wang
Pre-trained vision-language models (VLMs) have shown impressive results in various visual classification tasks.