no code implementations • 30 Sep 2024 • Haotian Zhang, Mingfei Gao, Zhe Gan, Philipp Dufter, Nina Wenzel, Forrest Huang, Dhruti Shah, Xianzhi Du, BoWen Zhang, Yanghao Li, Sam Dodge, Keen You, Zhen Yang, Aleksei Timofeev, Mingze Xu, Hong-You Chen, Jean-Philippe Fauconnier, Zhengfeng Lai, Haoxuan You, ZiRui Wang, Afshin Dehghan, Peter Grasch, Yinfei Yang
We present MM1. 5, a new family of multimodal large language models (MLLMs) designed to enhance capabilities in text-rich image understanding, visual referring and grounding, and multi-image reasoning.
Ranked #55 on
Visual Question Answering
on MM-Vet
no code implementations • 18 May 2024 • Chin-Yi Cheng, Ruiqi Gao, Forrest Huang, Yang Li
Layout design generation has recently gained significant attention due to its potential applications in various fields, including UI, graphic, and floor plan design.
no code implementations • 16 May 2024 • Amber Xie, Chin-Yi Cheng, Forrest Huang, Yang Li
Our method, Revision-Aware Reward Models ($\method$), allows a generative text-to-layout model to produce more modern, designer-aligned layouts, showing the potential for utilizing human revisions and stronger forms of feedback in improving generative models.
1 code implementation • 10 Oct 2023 • Forrest Huang, Gang Li, Tao Li, Yang Li
Macros are building block tasks of our everyday smartphone activity (e. g., "login", or "booking a flight").
no code implementations • 27 Jan 2023 • Chin-Yi Cheng, Forrest Huang, Gang Li, Yang Li
Layout design is an important task in various design fields, including user interface, document, and graphic design.
no code implementations • 19 Nov 2021 • Forrest Huang, Eldon Schoop, David Ha, Jeffrey Nichols, John Canny
Sketching is a natural and effective visual communication medium commonly used in creative processes.
no code implementations • 14 Oct 2021 • Forrest Huang, Gang Li, Xin Zhou, John F. Canny, Yang Li
The design process of user interfaces (UIs) often begins with articulating high-level design goals.
no code implementations • 12 May 2020 • Forrest Huang, Eldon Schoop, David Ha, John Canny
Iteratively refining and critiquing sketches are crucial steps to developing effective designs.
1 code implementation • 8 Apr 2019 • Forrest Huang, John F. Canny
Sketching and natural languages are effective communication media for interactive applications.
1 code implementation • 31 Jul 2018 • David M. Chan, Roshan Rao, Forrest Huang, John F. Canny
Modern datasets and models are notoriously difficult to explore and analyze due to their inherent high dimensionality and massive numbers of samples.