1 code implementation • 26 Jun 2024 • Adam Stein, Aaditya Naik, Yinjun Wu, Mayur Naik, Eric Wong
To automatically discover compositional concept representations, we identify two salient properties of such representations, and propose Compositional Concept Extraction (CCE) for finding concepts which obey these properties.
1 code implementation • 2 Jun 2024 • Yinjun Wu, Mayank Keoliya, Kan Chen, Neelay Velingker, Ziyang Li, Emily J Getzen, Qi Long, Mayur Naik, Ravi B Parikh, Eric Wong
Designing faithful yet accurate AI models is challenging, particularly in the field of individual treatment effect estimation (ITE).
no code implementations • 13 Aug 2023 • Aaditya Naik, Adam Stein, Yinjun Wu, Mayur Naik, Eric Wong
Finding errors in machine learning applications requires a thorough exploration of their behavior over data.
no code implementations • 25 May 2023 • Adam Stein, Yinjun Wu, Eric Wong, Mayur Naik
It is well-known that real-world changes constituting distribution shift adversely affect model performance.
1 code implementation • 2 Mar 2023 • Aaditya Naik, Yinjun Wu, Mayur Naik, Eric Wong
Test-time adaptation reduces these violations by up to 68. 7% with relative performance improvement up to 32%.
1 code implementation • 9 Feb 2023 • Yinjun Wu, Adam Stein, Jacob Gardner, Mayur Naik
In this paper, we study how to learn to identify such a meta sample set from a large, imperfect training set, that is subsequently cleaned and used to optimize performance in the meta re-weighting setting.
1 code implementation • 19 Jul 2021 • Yinjun Wu, James Weimer, Susan B. Davidson
First, to reduce the cost of human annotators, we use Infl, which prioritizes the most influential training samples for cleaning and provides cleaned labels to save the cost of one human annotator.
1 code implementation • 3 Mar 2021 • Yinjun Wu, Jingchao Ni, Wei Cheng, Bo Zong, Dongjin Song, Zhengzhang Chen, Yanchi Liu, Xuchao Zhang, Haifeng Chen, Susan Davidson
Forecasting on sparse multivariate time series (MTS) aims to model the predictors of future values of time series given their incomplete past, which is important for many emerging applications.
1 code implementation • ICML 2020 • Yinjun Wu, Edgar Dobriban, Susan B. Davidson
Machine learning models are not static and may need to be retrained on slightly changed datasets, for instance, with the addition or deletion of a set of data points.
no code implementations • 26 Feb 2020 • Yinjun Wu, Val Tannen, Susan B. Davidson
The ubiquitous use of machine learning algorithms brings new challenges to traditional database problems such as incremental view update.