no code implementations • 27 Oct 2023 • Minbiao Han, Jonathan Light, Steven Xia, Sainyam Galhotra, Raul Castro Fernandez, Haifeng Xu
We envision that the synergy of our data and model discovery algorithm and pricing mechanism will be an important step towards building a new data-centric online market that serves ML users effectively.
1 code implementation • 5 Jun 2023 • Boxin Zhao, Boxiang Lyu, Raul Castro Fernandez, Mladen Kolar
Data markets help with identifying valuable training data: model consumers pay to train a model, the market uses that budget to identify data and train the model (the budget allocation problem), and finally the market compensates data providers according to their data contribution (revenue allocation problem).
no code implementations • 18 Apr 2023 • Sainyam Galhotra, Yue Gong, Raul Castro Fernandez
Data is a central component of machine learning and causal inference tasks.
2 code implementations • 9 Jan 2023 • Qiming Wang, Raul Castro Fernandez
All in all, the technique is a stepping stone towards building learned discovery systems.
1 code implementation • 21 Mar 2020 • Nadiia Chepurko, Ryan Marcus, Emanuel Zgraggen, Raul Castro Fernandez, Tim Kraska, David Karger
Our system has two distinct components: (1) a framework to search and join data with the input data, based on various attributes of the input, and (2) an efficient feature selection algorithm that prunes out noisy or irrelevant features from the resulting join.
no code implementations • 10 Jun 2018 • Guillaume Leclerc, Manasi Vartak, Raul Castro Fernandez, Tim Kraska, Samuel Madden
As neural networks become widely deployed in different applications and on different hardware, it has become increasingly important to optimize inference time and model size along with model accuracy.