1 code implementation • 11 Nov 2023 • Jianbin Qin, Sifan Huang, Yaoshu Wang, Jing Zhu, Yifan Zhang, Yukai Miao, Rui Mao, Makoto Onizuka, Chuan Xiao
By evaluating on both real-world and synthetic datasets, we demonstrate that BClean is capable of achieving an F-measure of up to 0. 9 in data cleaning, outperforming existing Bayesian methods by 2% and other data cleaning methods by 15%.
1 code implementation • 20 May 2020 • Yaoshu Wang, Chuan Xiao, Jianbin Qin, Rui Mao, Onizuka Makoto, Wei Wang, Rui Zhang, Yoshiharu Ishikawa
Selectivity estimation aims at estimating the number of database objects that satisfy a selection criterion.
no code implementations • 15 Feb 2020 • Yaoshu Wang, Chuan Xiao, Jianbin Qin, Xin Cao, Yifang Sun, Wei Wang, Makoto Onizuka
The feature extraction model transforms original data and threshold to a Hamming space, in which a deep learning-based regression model is utilized to exploit the incremental property of cardinality w. r. t.