Search Results for author: Sizhe Wang

Found 20 papers, 4 papers with code

CodeFlowBench: A Multi-turn, Iterative Benchmark for Complex Code Generation

no code implementations30 Apr 2025 Sizhe Wang, Zhengren Wang, Dongsheng Ma, Yongan Yu, Rui Ling, Zhiyu Li, Feiyu Xiong, Wentao Zhang

We observe models' poor performance on CodeFlowBench, with a substantial performance drop in the iterative codeflow scenario.

Code Generation

Bootstrapped Model Predictive Control

2 code implementations24 Mar 2025 Yuhang Wang, Hanwei Guo, Sizhe Wang, Long Qian, Xuguang Lan

In this work, we introduce Bootstrapped Model Predictive Control (BMPC), a novel algorithm that performs policy learning in a bootstrapped manner.

continuous-control Continuous Control +3

Enhancing GeoAI and location encoding with spatial point pattern statistics: A Case Study of Terrain Feature Classification

no code implementations21 Nov 2024 Sizhe Wang, Wenwen Li

This study introduces a novel approach to terrain feature classification by incorporating spatial point pattern statistics into deep learning models.

Decision Making

BPO: Towards Balanced Preference Optimization between Knowledge Breadth and Depth in Alignment

no code implementations16 Nov 2024 Sizhe Wang, Yongqi Tong, Hengyuan Zhang, Dawei Li, Xin Zhang, Tianlong Chen

Building on this, we further propose Balanced Preference Optimization (BPO), designed to dynamically augment the knowledge depth of each sample.

Informativeness

ShifCon: Enhancing Non-Dominant Language Capabilities with a Shift-based Contrastive Framework

1 code implementation25 Oct 2024 Hengyuan Zhang, Chenming Shang, Sizhe Wang, Dongdong Zhang, Feng Yao, Renliang Sun, Yiyao Yu, Yujiu Yang, Furu Wei

Although fine-tuning Large Language Models (LLMs) with multilingual data can rapidly enhance the multilingual capabilities of LLMs, they still exhibit a performance gap between the dominant language (e. g., English) and non-dominant ones due to the imbalance of training data across languages.

Contrastive Learning

Prediction of Brent crude oil price based on LSTM model under the background of low-carbon transition

no code implementations19 Sep 2024 Yuwen Zhao, Baojun Hu, Sizhe Wang

In the field of global energy and environment, crude oil is an important strategic resource, and its price fluctuation has a far-reaching impact on the global economy, financial market and the process of low-carbon development.

Geospatial foundation models for image analysis: evaluating and enhancing NASA-IBM Prithvi's domain adaptability

no code implementations31 Aug 2024 Chia-Yu Hsu, Wenwen Li, Sizhe Wang

A series of experiments were designed to assess Prithvi's performance as compared to other pre-trained task-specific AI models in geospatial image analysis.

Domain Adaptation

Optimizing Language Model's Reasoning Abilities with Weak Supervision

no code implementations7 May 2024 Yongqi Tong, Sizhe Wang, Dawei Li, Yifan Wang, Simeng Han, Zi Lin, Chengsong Huang, Jiaxin Huang, Jingbo Shang

Therefore, we present \textsc{PuzzleBen}, a weakly supervised benchmark that comprises 25, 147 complex questions, answers, and human-generated rationales across various domains, such as brainteasers, puzzles, riddles, parajumbles, and critical reasoning tasks.

GeoAI Reproducibility and Replicability: a computational and spatial perspective

no code implementations15 Apr 2024 Wenwen Li, Chia-Yu Hsu, Sizhe Wang, Peter Kedron

We then discuss the factors that may cause the lack of R&R in GeoAI research, with an emphasis on (1) the selection and use of training data; (2) the uncertainty that resides in the GeoAI model design, training, deployment, and inference processes; and more importantly (3) the inherent spatial heterogeneity of geospatial data and processes.

Can LLMs Learn from Previous Mistakes? Investigating LLMs' Errors to Boost for Reasoning

no code implementations29 Mar 2024 Yongqi Tong, Dawei Li, Sizhe Wang, Yujia Wang, Fei Teng, Jingbo Shang

We conduct a series of experiments to prove LLMs can obtain benefits from mistakes in both directions.

Segment Anything Model Can Not Segment Anything: Assessing AI Foundation Model's Generalizability in Permafrost Mapping

no code implementations16 Jan 2024 Wenwen Li, Chia-Yu Hsu, Sizhe Wang, Yezhou Yang, Hyunho Lee, Anna Liljedahl, Chandi Witharana, Yili Yang, Brendan M. Rogers, Samantha T. Arundel, Matthew B. Jones, Kenton McHenry, Patricia Solis

To evaluate the performance of large AI vision models, especially Meta's Segment Anything Model (SAM), we implemented different instance segmentation pipelines that minimize the changes to SAM to leverage its power as a foundation model.

Instance Segmentation Semantic Segmentation

Eliminating Reasoning via Inferring with Planning: A New Framework to Guide LLMs' Non-linear Thinking

no code implementations18 Oct 2023 Yongqi Tong, Yifan Wang, Dawei Li, Sizhe Wang, Zi Lin, Simeng Han, Jingbo Shang

Chain-of-Thought(CoT) prompting and its variants explore equipping large language models (LLMs) with high-level reasoning abilities by emulating human-like linear cognition and logic.

Natural Language Inference

Assessment of a new GeoAI foundation model for flood inundation mapping

no code implementations25 Sep 2023 Wenwen Li, Hyunho Lee, Sizhe Wang, Chia-Yu Hsu, Samantha T. Arundel

Vision foundation models are a new frontier in Geospatial Artificial Intelligence (GeoAI), an interdisciplinary research area that applies and extends AI for geospatial problem solving and geographic knowledge discovery, because of their potential to enable powerful image analysis by learning and extracting important image features from vast amounts of geospatial data.

Flood Inundation Mapping Representation Learning

FinEval: A Chinese Financial Domain Knowledge Evaluation Benchmark for Large Language Models

1 code implementation19 Aug 2023 Xin Guo, Haotian Xia, Zhaowei Liu, Hanyang Cao, Zhi Yang, Zhiqiang Liu, Sizhe Wang, Jinyi Niu, Chuqi Wang, Yanhui Wang, Xiaolong Liang, Xiaoming Huang, Bing Zhu, Zhongyu Wei, Yun Chen, Weining Shen, Liwen Zhang

The dataset contains 8, 351 questions categorized into four different key areas: Financial Academic Knowledge, Financial Industry Knowledge, Financial Security Knowledge, and Financial Agent.

Multiple-choice

Real-time GeoAI for High-resolution Mapping and Segmentation of Arctic Permafrost Features

no code implementations8 Jun 2023 Wenwen Li, Chia-Yu Hsu, Sizhe Wang, Chandi Witharana, Anna Liljedahl

This paper introduces a real-time GeoAI workflow for large-scale image analysis and the segmentation of Arctic permafrost features at a fine-granularity.

Instance Segmentation Position +3

Semantic Similarity Measure of Natural Language Text through Machine Learning and a Keyword-Aware Cross-Encoder-Ranking Summarizer -- A Case Study Using UCGIS GIS&T Body of Knowledge

no code implementations17 May 2023 Yuanyuan Tian, Wenwen Li, Sizhe Wang, Zhining Gu

Initiated by the University Consortium of Geographic Information Science (UCGIS), GIS&T Body of Knowledge (BoK) is a community-driven endeavor to define, develop, and document geospatial topics related to geographic information science and technologies (GIS&T).

Semantic Similarity Semantic Textual Similarity +1

Cannot find the paper you are looking for? You can Submit a new open access paper.