Search Results for author: Yangxinyu Xie

Found 11 papers, 9 papers with code

GeoGrid-Bench: Can Foundation Models Understand Multimodal Gridded Geo-Spatial Data?

no code implementations15 May 2025 Bowen Jiang, Yangxinyu Xie, Xiaomeng Wang, Jiashu He, Joshua Bergerson, John K Hutchison, Jordan Branham, Camillo J Taylor, Tanwi Mallick

We present GeoGrid-Bench, a benchmark designed to evaluate the ability of foundation models to understand geo-spatial data in the grid structure.

A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and Adaptation

1 code implementation24 Apr 2025 Yangxinyu Xie, Bowen Jiang, Tanwi Mallick, Joshua David Bergerson, John K. Hutchison, Duane R. Verner, Jordan Branham, M. Ross Alexander, Robert B. Ross, Yan Feng, Leslie-Anne Levy, Weijie Su, Camillo J. Taylor

In this work we propose a retrieval-augmented generation (RAG)-based multi-agent LLM system to support analysis and decision-making in the context of natural hazards and extreme weather events.

Decision Making RAG

Debiasing Watermarks for Large Language Models via Maximal Coupling

1 code implementation17 Nov 2024 Yangxinyu Xie, Xiang Li, Tanwi Mallick, Weijie J. Su, Ruixun Zhang

Watermarking language models is essential for distinguishing between human and machine-generated text and thus maintaining the integrity and trustworthiness of digital communication.

A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners

1 code implementation16 Jun 2024 Bowen Jiang, Yangxinyu Xie, Zhuoqun Hao, Xiaomeng Wang, Tanwi Mallick, Weijie J. Su, Camillo J. Taylor, Dan Roth

This study introduces a hypothesis-testing framework to assess whether large language models (LLMs) possess genuine reasoning abilities or primarily depend on token bias.

Logical Reasoning

Towards Rationality in Language and Multimodal Agents: A Survey

1 code implementation1 Jun 2024 Bowen Jiang, Yangxinyu Xie, Xiaomeng Wang, Yuan Yuan, Zhuoqun Hao, Xinyi Bai, Weijie J. Su, Camillo J. Taylor, Tanwi Mallick

This work discusses how to build more rational language and multimodal agents and what criteria define rationality in intelligent systems.

Decision Making Survey

A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and Adaptation

1 code implementation12 Feb 2024 Yangxinyu Xie, Bowen Jiang, Tanwi Mallick, Joshua David Bergerson, John K. Hutchison, Duane R. Verner, Jordan Branham, M. Ross Alexander, Robert B. Ross, Yan Feng, Leslie-Anne Levy, Weijie Su, Camillo J. Taylor

Large language models (LLMs) are a transformational capability at the frontier of artificial intelligence and machine learning that can support decision-makers in addressing pressing societal challenges such as extreme natural hazard events.

Decision Making Language Modeling +3

A Comparative Study of Loss Functions: Traffic Predictions in Regular and Congestion Scenarios

1 code implementation29 Aug 2023 Yangxinyu Xie, Tanwi Mallick

While accurate forecasting of regular traffic conditions is crucial, a reliable AI system must also accurately forecast congestion scenarios to maintain safe and efficient transportation.

imbalanced classification Management

Improving random walk rankings with feature selection and imputation

1 code implementation29 Nov 2021 Ngoc Mai Tran, Yangxinyu Xie

The Science4cast Competition consists of predicting new links in a semantic network, with each node representing a concept and each edge representing a link proposed by a paper relating two concepts.

feature selection Imputation

Cannot find the paper you are looking for? You can Submit a new open access paper.