no code implementations • 22 Feb 2024 • Jiawei Wang, Renhe Jiang, Chuang Yang, Zengqing Wu, Makoto Onizuka, Ryosuke Shibasaki, Chuan Xiao
The key technical contribution is a novel LLM agent framework that accounts for individual activity patterns and motivations, including a self-consistency approach to align LLMs with real-world activity data and a retrieval-augmented strategy for interpretable activity generation.
1 code implementation • 19 Feb 2024 • Zengqing Wu, Shuyuan Zheng, Qianying Liu, Xu Han, Brian Inhyuk Kwon, Makoto Onizuka, Shaojie Tang, Run Peng, Chuan Xiao
Recent advancements have shown that agents powered by large language models (LLMs) possess capabilities to simulate human behaviors and societal dynamics.
no code implementations • 17 Feb 2024 • Kejing Lu, Chuan Xiao, Yoshiharu Ishikawa
Approximate nearest neighbor search (ANNS) in high-dimensional spaces is a pivotal challenge in the field of machine learning.
no code implementations • 4 Dec 2023 • Haochen Zhang, Yuyang Dong, Chuan Xiao, Masafumi Oyamada
This paper explores the utilization of LLMs for data preprocessing (DP), a crucial step in the data mining pipeline that transforms raw data into a clean format conducive to easy processing.
1 code implementation • 11 Nov 2023 • Jianbin Qin, Sifan Huang, Yaoshu Wang, Jing Zhu, Yifan Zhang, Yukai Miao, Rui Mao, Makoto Onizuka, Chuan Xiao
By evaluating on both real-world and synthetic datasets, we demonstrate that BClean is capable of achieving an F-measure of up to 0. 9 in data cleaning, outperforming existing Bayesian methods by 2% and other data cleaning methods by 15%.
3 code implementations • 10 Nov 2023 • Zengqing Wu, Run Peng, Xu Han, Shuyuan Zheng, Yixin Zhang, Chuan Xiao
ABM's strength lies in its bottom-up methodology, illuminating emergent phenomena by modeling the behaviors of individual components of a system.
no code implementations • 30 Aug 2023 • Haochen Zhang, Yuyang Dong, Chuan Xiao, Masafumi Oyamada
Large Language Models (LLMs), typified by OpenAI's GPT series and Meta's LLaMA variants, have marked a significant advancement in artificial intelligence.
2 code implementations • 21 Aug 2023 • Xu Han, Zengqing Wu, Chuan Xiao
Our results demonstrate that, in the absence of communication, smart agents consistently reach tacit collusion, leading to prices converging at levels higher than the Bertrand equilibrium price but lower than monopoly or cartel prices.
1 code implementation • 31 Mar 2023 • Ryuichi Ito, Yuya Sasaki, Chuan Xiao, Makoto Onizuka
In recent years, machine learning-based cardinality estimation methods are replacing traditional methods.
no code implementations • 15 Dec 2022 • Yuyang Dong, Chuan Xiao, Takuma Nozawa, Masafumi Enomoto, Masafumi Oyamada
They are either exact solutions whose running time is linear in the sizes of query column and target table repository or approximate solutions lacking precision.
1 code implementation • 27 Jun 2022 • Koji Matsuda, Yuya Sasaki, Chuan Xiao, Makoto Onizuka
Federated learning is a distributed machine learning approach in which a single server and multiple clients collaboratively build machine learning models without sharing datasets on clients.
no code implementations • 30 Jan 2022 • Misato Horiuchi, Yuya Sasaki, Chuan Xiao, Makoto Onizuka
In this paper, we propose a similarity search on computational notebooks and develop a new framework for the similarity search.
no code implementations • 15 Oct 2021 • Koji Matsuda, Yuya Sasaki, Chuan Xiao, Makoto Onizuka
First, to optimize the model architectures for local data, clients tune their own personalized models by comparing to exchanged models and picking the one that yields the best performance.
no code implementations • 26 Oct 2020 • Yuyang Dong, Kunihiro Takeoka, Chuan Xiao, Masafumi Oyamada
Finding joinable tables in data lakes is key procedure in many applications such as data integration, data augmentation, data analysis, and data market.
1 code implementation • 20 May 2020 • Yaoshu Wang, Chuan Xiao, Jianbin Qin, Rui Mao, Onizuka Makoto, Wei Wang, Rui Zhang, Yoshiharu Ishikawa
Selectivity estimation aims at estimating the number of database objects that satisfy a selection criterion.
no code implementations • 15 Feb 2020 • Yaoshu Wang, Chuan Xiao, Jianbin Qin, Xin Cao, Yifang Sun, Wei Wang, Makoto Onizuka
The feature extraction model transforms original data and threshold to a Hamming space, in which a deep learning-based regression model is utilized to exploit the incremental property of cardinality w. r. t.
no code implementations • 4 Apr 2018 • Jianbin Qin, Chuan Xiao
Many solutions to these problems utilize the pigeonhole principle to find candidates that satisfy a filtering condition.