Search Results for author: Tal August

Found 19 papers, 7 papers with code

Generating Scientific Definitions with Controllable Complexity

1 code implementation ACL 2022 Tal August, Katharina Reinecke, Noah Smith

Unfamiliar terminology and complex language can present barriers to understanding science.

Reranking

Research Borderlands: Analysing Writing Across Research Cultures

no code implementations1 Jun 2025 Shaily Bhatt, Tal August, Maria Antoniak

In this work, we take a human-centered approach to discover and measure language-based cultural norms, and cultural competence of LLMs.

Uncertainty in Action: Confidence Elicitation in Embodied Agents

no code implementations13 Mar 2025 Tianjiao Yu, Vedant Shah, Muntasir Wahed, Kiet A. Nguyen, Adheesh Juvekar, Tal August, Ismini Lourentzou

Expressing confidence is challenging for embodied agents navigating dynamic multimodal environments, where uncertainty arises from both perception and decision-making processes.

Decision Making Minecraft

Automatic Detection of Research Values from Scientific Abstracts Across Computer Science Subfields

1 code implementation23 Feb 2025 Hang Jiang, Tal August, Luca Soldaini, Kyle Lo, Maria Antoniak

Based on the scheme, we build value classifiers to scale up the analysis and present a systematic study over 226, 600 paper abstracts from 32 CS-related subfields and 86 popular publishing venues over ten years.

Tree-of-Debate: Multi-Persona Debate Trees Elicit Critical Thinking for Scientific Comparative Analysis

1 code implementation20 Feb 2025 Priyanka Kargupta, Ishika Agarwal, Tal August, Jiawei Han

With the exponential growth of research facilitated by modern technology and improved accessibility, scientific discoveries have become increasingly fragmented within and across fields.

Articles

Cocoa: Co-Planning and Co-Execution with AI Agents

no code implementations14 Dec 2024 K. J. Kevin Feng, Kevin Pu, Matt Latzke, Tal August, Pao Siangliulue, Jonathan Bragg, Daniel S. Weld, Amy X. Zhang, Joseph Chee Chang

Human collaboration benefits from continuous coordination -- planning, delegating tasks, sharing progress, and adjusting objectives -- to align on shared goals.

AI Agent

Mathfish: Evaluating Language Model Math Reasoning via Grounding in Educational Curricula

1 code implementation8 Aug 2024 Li Lucy, Tal August, Rose E. Wang, Luca Soldaini, Courtney Allison, Kyle Lo

To ensure that math curriculum is grade-appropriate and aligns with critical skills or concepts in accordance with educational standards, pedagogical experts can spend months carefully reviewing published math problems.

GSM8K Language Modeling +3

Personalized Jargon Identification for Enhanced Interdisciplinary Communication

no code implementations16 Nov 2023 Yue Guo, Joseph Chee Chang, Maria Antoniak, Erin Bransom, Trevor Cohen, Lucy Lu Wang, Tal August

We collect a dataset of over 10K term familiarity annotations from 11 computer science researchers for terms drawn from 100 paper abstracts.

APPLS: Evaluating Evaluation Metrics for Plain Language Summarization

1 code implementation23 May 2023 Yue Guo, Tal August, Gondy Leroy, Trevor Cohen, Lucy Lu Wang

We identify four PLS criteria from previous work -- informativeness, simplification, coherence, and faithfulness -- and define a set of perturbations corresponding to these criteria that sensitive metrics should be able to detect.

Informativeness Language Modelling +2

Beyond Summarization: Designing AI Support for Real-World Expository Writing Tasks

no code implementations5 Apr 2023 Zejiang Shen, Tal August, Pao Siangliulue, Kyle Lo, Jonathan Bragg, Jeff Hammerbacher, Doug Downey, Joseph Chee Chang, David Sontag

In this position paper, we argue that developing AI supports for expository writing has unique and exciting research challenges and can lead to high real-world impacts.

Paper Plain: Making Medical Research Papers Approachable to Healthcare Consumers with Natural Language Processing

1 code implementation28 Feb 2022 Tal August, Lucy Lu Wang, Jonathan Bragg, Marti A. Hearst, Andrew Head, Kyle Lo

When seeking information not covered in patient-friendly documents, like medical pamphlets, healthcare consumers may turn to the research literature.

All That's `Human' Is Not Gold: Evaluating Human Evaluation of Generated Text

no code implementations ACL 2021 Elizabeth Clark, Tal August, Sofia Serrano, Nikita Haduong, Suchin Gururangan, Noah A. Smith

Human evaluations are typically considered the gold standard in natural language generation, but as models{'} fluency improves, how well can evaluators detect and judge machine-generated text?

All Articles +2

All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated Text

no code implementations30 Jun 2021 Elizabeth Clark, Tal August, Sofia Serrano, Nikita Haduong, Suchin Gururangan, Noah A. Smith

Human evaluations are typically considered the gold standard in natural language generation, but as models' fluency improves, how well can evaluators detect and judge machine-generated text?

All Articles +2

Exploring the Effect of Author and Reader Identity in Online Story Writing: the STORIESINTHEWILD Corpus.

no code implementations WS 2020 Tal August, Maarten Sap, Elizabeth Clark, Katharina Reinecke, Noah A. Smith

We analyze the effect of author and reader characteristics and story writing setup on the quality of stories in a short storytelling task.

The Effect of Moderation on Online Mental Health Conversations

no code implementations19 May 2020 David Wadden, Tal August, Qisheng Li, Tim Althoff

We found that participation in group mental health discussions led to improvements in psychological perspective, and that these improvements were larger in moderated conversations.

Cannot find the paper you are looking for? You can Submit a new open access paper.