Search Results for author: Chuan Xiao

Found 17 papers, 7 papers with code

Large Language Models as Urban Residents: An LLM Agent Framework for Personal Mobility Generation

no code implementations22 Feb 2024 Jiawei Wang, Renhe Jiang, Chuang Yang, Zengqing Wu, Makoto Onizuka, Ryosuke Shibasaki, Chuan Xiao

The key technical contribution is a novel LLM agent framework that accounts for individual activity patterns and motivations, including a self-consistency approach to align LLMs with real-world activity data and a retrieval-augmented strategy for interpretable activity generation.

Retrieval

Shall We Talk: Exploring Spontaneous Collaborations of Competing LLM Agents

1 code implementation19 Feb 2024 Zengqing Wu, Shuyuan Zheng, Qianying Liu, Xu Han, Brian Inhyuk Kwon, Makoto Onizuka, Shaojie Tang, Run Peng, Chuan Xiao

Recent advancements have shown that agents powered by large language models (LLMs) possess capabilities to simulate human behaviors and societal dynamics.

Probabilistic Routing for Graph-Based Approximate Nearest Neighbor Search

no code implementations17 Feb 2024 Kejing Lu, Chuan Xiao, Yoshiharu Ishikawa

Approximate nearest neighbor search (ANNS) in high-dimensional spaces is a pivotal challenge in the field of machine learning.

Jellyfish: A Large Language Model for Data Preprocessing

no code implementations4 Dec 2023 Haochen Zhang, Yuyang Dong, Chuan Xiao, Masafumi Oyamada

This paper explores the utilization of LLMs for data preprocessing (DP), a crucial step in the data mining pipeline that transforms raw data into a clean format conducive to easy processing.

Imputation Language Modelling +1

BClean: A Bayesian Data Cleaning System

1 code implementation11 Nov 2023 Jianbin Qin, Sifan Huang, Yaoshu Wang, Jing Zhu, Yifan Zhang, Yukai Miao, Rui Mao, Makoto Onizuka, Chuan Xiao

By evaluating on both real-world and synthetic datasets, we demonstrate that BClean is capable of achieving an F-measure of up to 0. 9 in data cleaning, outperforming existing Bayesian methods by 2% and other data cleaning methods by 15%.

Bayesian Inference graph partitioning

Smart Agent-Based Modeling: On the Use of Large Language Models in Computer Simulations

3 code implementations10 Nov 2023 Zengqing Wu, Run Peng, Xu Han, Shuyuan Zheng, Yixin Zhang, Chuan Xiao

ABM's strength lies in its bottom-up methodology, illuminating emergent phenomena by modeling the behaviors of individual components of a system.

Common Sense Reasoning

Large Language Models as Data Preprocessors

no code implementations30 Aug 2023 Haochen Zhang, Yuyang Dong, Chuan Xiao, Masafumi Oyamada

Large Language Models (LLMs), typified by OpenAI's GPT series and Meta's LLaMA variants, have marked a significant advancement in artificial intelligence.

feature selection Imputation +1

"Guinea Pig Trials" Utilizing GPT: A Novel Smart Agent-Based Modeling Approach for Studying Firm Competition and Collusion

2 code implementations21 Aug 2023 Xu Han, Zengqing Wu, Chuan Xiao

Our results demonstrate that, in the absence of communication, smart agents consistently reach tacit collusion, leading to prices converging at levels higher than the Bertrand equilibrium price but lower than monopoly or cartel prices.

Decision Making

Scardina: Scalable Join Cardinality Estimation by Multiple Density Estimators

1 code implementation31 Mar 2023 Ryuichi Ito, Yuya Sasaki, Chuan Xiao, Makoto Onizuka

In recent years, machine learning-based cardinality estimation methods are replacing traditional methods.

DeepJoin: Joinable Table Discovery with Pre-trained Language Models

no code implementations15 Dec 2022 Yuyang Dong, Chuan Xiao, Takuma Nozawa, Masafumi Enomoto, Masafumi Oyamada

They are either exact solutions whose running time is linear in the sizes of query column and target table repository or approximate solutions lacking precision.

Data Augmentation Language Modelling +1

An Empirical Study of Personalized Federated Learning

1 code implementation27 Jun 2022 Koji Matsuda, Yuya Sasaki, Chuan Xiao, Makoto Onizuka

Federated learning is a distributed machine learning approach in which a single server and multiple clients collaboratively build machine learning models without sharing datasets on clients.

BIG-bench Machine Learning Personalized Federated Learning

Similarity Search on Computational Notebooks

no code implementations30 Jan 2022 Misato Horiuchi, Yuya Sasaki, Chuan Xiao, Makoto Onizuka

In this paper, we propose a similarity search on computational notebooks and develop a new framework for the similarity search.

FedMe: Federated Learning via Model Exchange

no code implementations15 Oct 2021 Koji Matsuda, Yuya Sasaki, Chuan Xiao, Makoto Onizuka

First, to optimize the model architectures for local data, clients tune their own personalized models by comparing to exchanged models and picking the one that yields the best performance.

BIG-bench Machine Learning Federated Learning

Efficient Joinable Table Discovery in Data Lakes: A High-Dimensional Similarity-Based Approach

no code implementations26 Oct 2020 Yuyang Dong, Kunihiro Takeoka, Chuan Xiao, Masafumi Oyamada

Finding joinable tables in data lakes is key procedure in many applications such as data integration, data augmentation, data analysis, and data market.

Data Augmentation Data Integration

Monotonic Cardinality Estimation of Similarity Selection: A Deep Learning Approach

no code implementations15 Feb 2020 Yaoshu Wang, Chuan Xiao, Jianbin Qin, Xin Cao, Yifang Sun, Wei Wang, Makoto Onizuka

The feature extraction model transforms original data and threshold to a Hamming space, in which a deep learning-based regression model is utilized to exploit the incremental property of cardinality w. r. t.

Management regression

Pigeonring: A Principle for Faster Thresholded Similarity Search

no code implementations4 Apr 2018 Jianbin Qin, Chuan Xiao

Many solutions to these problems utilize the pigeonhole principle to find candidates that satisfy a filtering condition.

Management

Cannot find the paper you are looking for? You can Submit a new open access paper.