Search Results for author: Chuan Xiao

Found 17 papers, 7 papers with code

Large Language Models as Urban Residents: An LLM Agent Framework for Personal Mobility Generation

no code implementations • 22 Feb 2024 • Jiawei Wang, Renhe Jiang, Chuang Yang, Zengqing Wu, Makoto Onizuka, Ryosuke Shibasaki, Chuan Xiao

The key technical contribution is a novel LLM agent framework that accounts for individual activity patterns and motivations, including a self-consistency approach to align LLMs with real-world activity data and a retrieval-augmented strategy for interpretable activity generation.

Retrieval

Paper
Add Code

Shall We Talk: Exploring Spontaneous Collaborations of Competing LLM Agents

1 code implementation • 19 Feb 2024 • Zengqing Wu, Shuyuan Zheng, Qianying Liu, Xu Han, Brian Inhyuk Kwon, Makoto Onizuka, Shaojie Tang, Run Peng, Chuan Xiao

Recent advancements have shown that agents powered by large language models (LLMs) possess capabilities to simulate human behaviors and societal dynamics.

Paper
Code

Probabilistic Routing for Graph-Based Approximate Nearest Neighbor Search

no code implementations • 17 Feb 2024 • Kejing Lu, Chuan Xiao, Yoshiharu Ishikawa

Approximate nearest neighbor search (ANNS) in high-dimensional spaces is a pivotal challenge in the field of machine learning.

Paper
Add Code

Jellyfish: A Large Language Model for Data Preprocessing

no code implementations • 4 Dec 2023 • Haochen Zhang, Yuyang Dong, Chuan Xiao, Masafumi Oyamada

This paper explores the utilization of LLMs for data preprocessing (DP), a crucial step in the data mining pipeline that transforms raw data into a clean format conducive to easy processing.

Imputation Language Modelling +1

Paper
Add Code

BClean: A Bayesian Data Cleaning System

1 code implementation • 11 Nov 2023 • Jianbin Qin, Sifan Huang, Yaoshu Wang, Jing Zhu, Yifan Zhang, Yukai Miao, Rui Mao, Makoto Onizuka, Chuan Xiao

By evaluating on both real-world and synthetic datasets, we demonstrate that BClean is capable of achieving an F-measure of up to 0. 9 in data cleaning, outperforming existing Bayesian methods by 2% and other data cleaning methods by 15%.

Bayesian Inference graph partitioning

Paper
Code

Smart Agent-Based Modeling: On the Use of Large Language Models in Computer Simulations

3 code implementations • 10 Nov 2023 • Zengqing Wu, Run Peng, Xu Han, Shuyuan Zheng, Yixin Zhang, Chuan Xiao

ABM's strength lies in its bottom-up methodology, illuminating emergent phenomena by modeling the behaviors of individual components of a system.

Common Sense Reasoning

Paper
Code

Large Language Models as Data Preprocessors

no code implementations • 30 Aug 2023 • Haochen Zhang, Yuyang Dong, Chuan Xiao, Masafumi Oyamada

Large Language Models (LLMs), typified by OpenAI's GPT series and Meta's LLaMA variants, have marked a significant advancement in artificial intelligence.

feature selection Imputation +1

Paper
Add Code

"Guinea Pig Trials" Utilizing GPT: A Novel Smart Agent-Based Modeling Approach for Studying Firm Competition and Collusion

2 code implementations • 21 Aug 2023 • Xu Han, Zengqing Wu, Chuan Xiao

Our results demonstrate that, in the absence of communication, smart agents consistently reach tacit collusion, leading to prices converging at levels higher than the Bertrand equilibrium price but lower than monopoly or cartel prices.

Decision Making

Paper
Code

Scardina: Scalable Join Cardinality Estimation by Multiple Density Estimators

1 code implementation • 31 Mar 2023 • Ryuichi Ito, Yuya Sasaki, Chuan Xiao, Makoto Onizuka

In recent years, machine learning-based cardinality estimation methods are replacing traditional methods.

Paper
Code

DeepJoin: Joinable Table Discovery with Pre-trained Language Models

no code implementations • 15 Dec 2022 • Yuyang Dong, Chuan Xiao, Takuma Nozawa, Masafumi Enomoto, Masafumi Oyamada

They are either exact solutions whose running time is linear in the sizes of query column and target table repository or approximate solutions lacking precision.

Data Augmentation Language Modelling +1

Paper
Add Code

An Empirical Study of Personalized Federated Learning

1 code implementation • 27 Jun 2022 • Koji Matsuda, Yuya Sasaki, Chuan Xiao, Makoto Onizuka

Federated learning is a distributed machine learning approach in which a single server and multiple clients collaboratively build machine learning models without sharing datasets on clients.

BIG-bench Machine Learning Personalized Federated Learning

Paper
Code

Similarity Search on Computational Notebooks

no code implementations • 30 Jan 2022 • Misato Horiuchi, Yuya Sasaki, Chuan Xiao, Makoto Onizuka

In this paper, we propose a similarity search on computational notebooks and develop a new framework for the similarity search.

Paper
Add Code

FedMe: Federated Learning via Model Exchange

no code implementations • 15 Oct 2021 • Koji Matsuda, Yuya Sasaki, Chuan Xiao, Makoto Onizuka

First, to optimize the model architectures for local data, clients tune their own personalized models by comparing to exchanged models and picking the one that yields the best performance.

BIG-bench Machine Learning Federated Learning

Paper
Add Code

Efficient Joinable Table Discovery in Data Lakes: A High-Dimensional Similarity-Based Approach

no code implementations • 26 Oct 2020 • Yuyang Dong, Kunihiro Takeoka, Chuan Xiao, Masafumi Oyamada

Finding joinable tables in data lakes is key procedure in many applications such as data integration, data augmentation, data analysis, and data market.

Data Augmentation Data Integration

Paper
Add Code

Consistent and Flexible Selectivity Estimation for High-Dimensional Data

1 code implementation • 20 May 2020 • Yaoshu Wang, Chuan Xiao, Jianbin Qin, Rui Mao, Onizuka Makoto, Wei Wang, Rui Zhang, Yoshiharu Ishikawa

Selectivity estimation aims at estimating the number of database objects that satisfy a selection criterion.

Data Integration Density Estimation +2

Paper
Code

Monotonic Cardinality Estimation of Similarity Selection: A Deep Learning Approach

no code implementations • 15 Feb 2020 • Yaoshu Wang, Chuan Xiao, Jianbin Qin, Xin Cao, Yifang Sun, Wei Wang, Makoto Onizuka

The feature extraction model transforms original data and threshold to a Hamming space, in which a deep learning-based regression model is utilized to exploit the incremental property of cardinality w. r. t.

Management regression

Paper
Add Code

Pigeonring: A Principle for Faster Thresholded Similarity Search

no code implementations • 4 Apr 2018 • Jianbin Qin, Chuan Xiao

Many solutions to these problems utilize the pigeonhole principle to find candidates that satisfy a filtering condition.

Management

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.