1 code implementation • 19 Aug 2024 • Ken Gu, Ruoxi Shang, Ruien Jiang, Keying Kuang, Richard-John Lin, Donghe Lyu, Yue Mao, Youran Pan, Teng Wu, Jiaqian Yu, Yikun Zhang, Tianmai M. Zhang, Lanyi Zhu, Mike A. Merrill, Jeffrey Heer, Tim Althoff
To address these challenges, we present BLADE, a benchmark to automatically evaluate agents' multifaceted approaches to open-ended research questions.
1 code implementation • 18 Apr 2024 • Michelle S. Lam, Janice Teoh, James Landay, Jeffrey Heer, Michael S. Bernstein
Data analysts have long sought to turn unstructured text data into meaningful concepts.
1 code implementation • 18 Dec 2023 • Madeleine Grunde-McLaughlin, Michelle S. Lam, Ranjay Krishna, Daniel S. Weld, Jeffrey Heer
The design space covers a designer's objectives and the tactics used to build workflows.
1 code implementation • IEEE VIS 2023 • Jeffrey Heer, Dominik Moritz
Mosaic is an architecture for greater scalability, extensibility, and interoperability of interactive data views.
no code implementations • 25 Oct 2023 • Eunice Jun, Edward Misback, Jeffrey Heer, René Just
We leverage these findings to develop rTisane, a DSL for expressing conceptual models augmented with an interactive disambiguation process.
1 code implementation • 14 Feb 2023 • Tongshuang Wu, Hua Shen, Daniel S. Weld, Jeffrey Heer, Marco Tulio Ribeiro
ScatterShot iteratively slices unlabeled data into task-specific patterns, samples informative inputs from underexplored or not-yet-saturated slices in an active learning manner, and helps users label more efficiently with the help of an LLM and the current example set.
1 code implementation • 7 Jan 2022 • Eunice Jun, Audrey Seo, Jeffrey Heer, René Just
Proper statistical modeling incorporates domain theory about how concepts relate and details of how data were measured.
1 code implementation • ACL 2021 • Tongshuang Wu, Marco Tulio Ribeiro, Jeffrey Heer, Daniel S. Weld
While counterfactual examples are useful for analysis and training of NLP models, current generation methods either rely on manual labor to create very few counterfactuals, or only instantiate limited types of perturbations such as paraphrases or word substitutions.
no code implementations • 28 Aug 2020 • Ge Zhang, Mike A. Merrill, Yang Liu, Jeffrey Heer, Tim Althoff
Large-scale analysis of source code, and in particular scientific source code, holds the promise of better understanding the data science process, identifying analytical best practices, and providing insights to the builders of scientific toolkits.
1 code implementation • 10 Jul 2020 • Yang Liu, Alex Kale, Tim Althoff, Jeffrey Heer
Multiverse analysis is an approach to data analysis in which all "reasonable" analytic decisions are evaluated in parallel and interpreted collectively, in order to foster robustness and transparency.
Human-Computer Interaction
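As a rough illustration of the multiverse idea described in the entry above, the sketch below enumerates every combination of a few analytic decisions and reports the resulting estimate for each. The toy data, the decision names (`outlier_cutoff`, `summary`), and the helper functions are assumptions made for this example and are not taken from the paper or its code. The combinations are run sequentially here rather than in parallel.

```python
from itertools import product
from statistics import mean, median

# Hypothetical toy data: (group, outcome) pairs, with one extreme value in group "a".
data = [
    ("a", 1.0), ("a", 2.0), ("a", 3.0), ("a", 50.0),
    ("b", 2.0), ("b", 3.0), ("b", 4.0), ("b", 5.0),
]

# Hypothetical analytic decisions, each with its "reasonable" options.
decisions = {
    "outlier_cutoff": [None, 10.0],   # keep all rows, or drop outcomes above 10
    "summary": ["mean", "median"],    # how to summarize each group
}


def run_universe(rows, outlier_cutoff, summary):
    """Run one analysis path and return the group difference (a minus b)."""
    if outlier_cutoff is not None:
        rows = [(g, y) for g, y in rows if y <= outlier_cutoff]
    stat = mean if summary == "mean" else median
    a = stat([y for g, y in rows if g == "a"])
    b = stat([y for g, y in rows if g == "b"])
    return a - b


def multiverse(rows, decisions):
    """Evaluate every combination of decision options (the full cross product)."""
    names = list(decisions)
    results = []
    for combo in product(*(decisions[n] for n in names)):
        spec = dict(zip(names, combo))
        results.append((spec, run_universe(rows, **spec)))
    return results


results = multiverse(data, decisions)
for spec, effect in results:
    print(spec, effect)
```

Reading the four estimates together, rather than picking one, shows that the sign of the difference in this toy data depends on how the single extreme value is handled.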
1 code implementation • ACL 2019 • Tongshuang Wu, Marco Tulio Ribeiro, Jeffrey Heer, Daniel Weld
Though error analysis is crucial to understanding and improving NLP models, the common practice of manual, subjective categorization of a small sample of errors can yield biased and incomplete conclusions.