Search Results for author: Boran Han

Found 16 papers, 7 papers with code

Effectively Steer LLM To Follow Preference via Building Confident Directions

no code implementations4 Mar 2025 Bingqing Song, Boran Han, Shuai Zhang, Hao Wang, Haoyang Fang, Bonan Min, Yuyang Wang, Mingyi Hong

However, their capabilities are typically limited to steering the model into one of the two directions (i. e., bidirectional steering), and there has been no theoretical understanding to guarantee their performance.

Hard Constraint Guided Flow Matching for Gradient-Free Generation of PDE Solutions

no code implementations2 Dec 2024 Chaoran Cheng, Boran Han, Danielle C. Maddix, Abdul Fatir Ansari, Andrew Stuart, Michael W. Mahoney, Yuyang Wang

Generative models that satisfy hard constraints are crucial in many scientific and engineering applications where physical laws or system requirements must be strictly respected.

Unraveling the Gradient Descent Dynamics of Transformers

no code implementations12 Nov 2024 Bingqing Song, Boran Han, Shuai Zhang, Jie Ding, Mingyi Hong

While the Transformer architecture has achieved remarkable success across various domains, a thorough theoretical foundation explaining its optimization dynamics is yet to be fully developed.

Comparing and Contrasting Deep Learning Weather Prediction Backbones on Navier-Stokes and Atmospheric Dynamics

1 code implementation19 Jul 2024 Matthias Karlbauer, Danielle C. Maddix, Abdul Fatir Ansari, Boran Han, Gaurav Gupta, Yuyang Wang, Andrew Stuart, Michael W. Mahoney

Remarkable progress in the development of Deep Learning Weather Prediction (DLWP) models positions them to become competitive with traditional numerical weather prediction (NWP) models.

Graph Neural Network Weather Forecasting

Transferring Knowledge from Large Foundation Models to Small Downstream Models

no code implementations11 Jun 2024 Shikai Qiu, Boran Han, Danielle C. Maddix, Shuai Zhang, Yuyang Wang, Andrew Gordon Wilson

Furthermore, AFT reliably translates improvement in pre-trained models into improvement in downstream performance, even if the downstream model is over $50\times$ smaller, and can effectively transfer complementary information learned by multiple pre-trained models.

Transfer Learning

Discovering Bias in Latent Space: An Unsupervised Debiasing Approach

no code implementations5 Jun 2024 Dyah Adila, Shuai Zhang, Boran Han, Yuyang Wang

The question-answering (QA) capabilities of foundation models are highly sensitive to prompt variations, rendering their performance susceptible to superficial, non-meaning-altering changes.

Question Answering

CoMM: Collaborative Multi-Agent, Multi-Reasoning-Path Prompting for Complex Problem Solving

1 code implementation26 Apr 2024 Pei Chen, Boran Han, Shuai Zhang

Specifically, we prompt LLMs to play different roles in a problem-solving team, and encourage different role-play agents to collaboratively solve the target task.

Bridging Remote Sensors with Multisensor Geospatial Foundation Models

1 code implementation CVPR 2024 Boran Han, Shuai Zhang, Xingjian Shi, Markus Reichstein

A key discovery of our research is that representations derived from natural images are not always compatible with the distinct characteristics of geospatial remote sensors, underscoring the limitations of existing representations in this field.

Cloud Removal Diversity +1

PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design

no code implementations8 Mar 2024 Wenqi Jiang, Shuai Zhang, Boran Han, Jie Wang, Bernie Wang, Tim Kraska

Retrieval-augmented generation (RAG) can enhance the generation quality of large language models (LLMs) by incorporating external token databases.

RAG Retrieval

CaMML: Context-Aware Multimodal Learner for Large Models

1 code implementation6 Jan 2024 Yixin Chen, Shuai Zhang, Boran Han, Tong He, Bo Li

In this work, we introduce Context-Aware MultiModal Learner (CaMML), for tuning large multimodal models (LMMs).

Visual Question Answering

PreDiff: Precipitation Nowcasting with Latent Diffusion Models

1 code implementation NeurIPS 2023 Zhihan Gao, Xingjian Shi, Boran Han, Hao Wang, Xiaoyong Jin, Danielle Maddix, Yi Zhu, Mu Li, Yuyang Wang

We conduct empirical studies on two datasets: N-body MNIST, a synthetic dataset with chaotic behavior, and SEVIR, a real-world precipitation nowcasting dataset.

Denoising Earth Observation +1

Wrapped Cauchy Distributed Angular Softmax for Long-Tailed Visual Recognition

1 code implementation30 May 2023 Boran Han

The class-wise distribution of angular representation becomes a sum of these kernels.

Towards Geospatial Foundation Models via Continual Pretraining

2 code implementations ICCV 2023 Matias Mendieta, Boran Han, Xingjian Shi, Yi Zhu, Chen Chen

Geospatial technologies are becoming increasingly essential in our world for a wide range of applications, including agriculture, urban planning, and disaster response.

Change Detection Continual Pretraining +6

Cannot find the paper you are looking for? You can Submit a new open access paper.