no code implementations • 4 Mar 2025 • Bingqing Song, Boran Han, Shuai Zhang, Hao Wang, Haoyang Fang, Bonan Min, Yuyang Wang, Mingyi Hong
However, their capabilities are typically limited to steering the model into one of the two directions (i. e., bidirectional steering), and there has been no theoretical understanding to guarantee their performance.
no code implementations • 6 Dec 2024 • Luca Masserano, Abdul Fatir Ansari, Boran Han, Xiyuan Zhang, Christos Faloutsos, Michael W. Mahoney, Andrew Gordon Wilson, Youngsuk Park, Syama Rangapuram, Danielle C. Maddix, Yuyang Wang
To address this question, we develop WaveToken, a wavelet-based tokenizer that allows models to learn complex representations directly in the space of time-localized frequencies.
no code implementations • 2 Dec 2024 • Chaoran Cheng, Boran Han, Danielle C. Maddix, Abdul Fatir Ansari, Andrew Stuart, Michael W. Mahoney, Yuyang Wang
Generative models that satisfy hard constraints are crucial in many scientific and engineering applications where physical laws or system requirements must be strictly respected.
no code implementations • 12 Nov 2024 • Bingqing Song, Boran Han, Shuai Zhang, Jie Ding, Mingyi Hong
While the Transformer architecture has achieved remarkable success across various domains, a thorough theoretical foundation explaining its optimization dynamics is yet to be fully developed.
1 code implementation • 19 Jul 2024 • Matthias Karlbauer, Danielle C. Maddix, Abdul Fatir Ansari, Boran Han, Gaurav Gupta, Yuyang Wang, Andrew Stuart, Michael W. Mahoney
Remarkable progress in the development of Deep Learning Weather Prediction (DLWP) models positions them to become competitive with traditional numerical weather prediction (NWP) models.
no code implementations • 11 Jun 2024 • Shikai Qiu, Boran Han, Danielle C. Maddix, Shuai Zhang, Yuyang Wang, Andrew Gordon Wilson
Furthermore, AFT reliably translates improvement in pre-trained models into improvement in downstream performance, even if the downstream model is over $50\times$ smaller, and can effectively transfer complementary information learned by multiple pre-trained models.
no code implementations • 5 Jun 2024 • Dyah Adila, Shuai Zhang, Boran Han, Yuyang Wang
The question-answering (QA) capabilities of foundation models are highly sensitive to prompt variations, rendering their performance susceptible to superficial, non-meaning-altering changes.
1 code implementation • 26 Apr 2024 • Pei Chen, Boran Han, Shuai Zhang
Specifically, we prompt LLMs to play different roles in a problem-solving team, and encourage different role-play agents to collaboratively solve the target task.
1 code implementation • CVPR 2024 • Boran Han, Shuai Zhang, Xingjian Shi, Markus Reichstein
A key discovery of our research is that representations derived from natural images are not always compatible with the distinct characteristics of geospatial remote sensors, underscoring the limitations of existing representations in this field.
no code implementations • 8 Mar 2024 • Wenqi Jiang, Shuai Zhang, Boran Han, Jie Wang, Bernie Wang, Tim Kraska
Retrieval-augmented generation (RAG) can enhance the generation quality of large language models (LLMs) by incorporating external token databases.
1 code implementation • 6 Jan 2024 • Yixin Chen, Shuai Zhang, Boran Han, Tong He, Bo Li
In this work, we introduce Context-Aware MultiModal Learner (CaMML), for tuning large multimodal models (LMMs).
Ranked #141 on
Visual Question Answering
on MM-Vet
no code implementations • 8 Oct 2023 • Yixin Chen, Shuai Zhang, Boran Han, Jiaya Jia
In-context learning (ICL) involves reasoning from given contextual examples.
1 code implementation • NeurIPS 2023 • Zhihan Gao, Xingjian Shi, Boran Han, Hao Wang, Xiaoyong Jin, Danielle Maddix, Yi Zhu, Mu Li, Yuyang Wang
We conduct empirical studies on two datasets: N-body MNIST, a synthetic dataset with chaotic behavior, and SEVIR, a real-world precipitation nowcasting dataset.
Ranked #1 on
Precipitation Forecasting
on SEVIR
1 code implementation • 30 May 2023 • Boran Han
The class-wise distribution of angular representation becomes a sum of these kernels.
2 code implementations • ICCV 2023 • Matias Mendieta, Boran Han, Xingjian Shi, Yi Zhu, Chen Chen
Geospatial technologies are becoming increasingly essential in our world for a wide range of applications, including agriculture, urban planning, and disaster response.
no code implementations • 8 Jan 2022 • Jiahe Wang, Boran Han
Microscopy imaging is vital in biology research and diagnosis.