Search Results for author: Dorottya Demszky

Found 19 papers, 11 papers with code

Analyzing Polarization in Social Media: Method and Application to Tweets on 21 Mass Shootings

1 code implementation NAACL 2019 Dorottya Demszky, Nikhil Garg, Rob Voigt, James Zou, Matthew Gentzkow, Jesse Shapiro, Dan Jurafsky

We provide an NLP framework to uncover four linguistic dimensions of political polarization in social media: topic choice, framing, affect and illocutionary force.

Clustering

GoEmotions: A Dataset of Fine-Grained Emotions

8 code implementations ACL 2020 Dorottya Demszky, Dana Movshovitz-Attias, Jeongwoo Ko, Alan Cowen, Gaurav Nemade, Sujith Ravi

Understanding emotion expressed in language has a wide range of applications, from building empathetic chatbots to detecting harmful online behavior.

Emotion Classification Transfer Learning

Pártélet: A Hungarian Corpus of Propaganda Texts from the Hungarian Socialist Era

no code implementations LREC 2020 Zoltán Kmetty, Veronika Vincze, Dorottya Demszky, Orsolya Ring, Balázs Nagy, Martina Katalin Szabó

Pártélet was the official journal of the governing party during the Hungarian socialism from 1956 to 1989, hence it represents the direct political agitation and propaganda of the dictatorial system in question.

The Role of Verb Semantics in Hungarian Verb-Object Order

no code implementations 16 Jun 2020 Dorottya Demszky, László Kálmán, Dan Jurafsky, Beth Levin

We test the effect of lexical semantics on the ordering of verbs and their objects by grouping verbs into 11 semantic classes.

Object

Analyzing the Framing of 2020 Presidential Candidates in the News

no code implementations WS 2020 Audrey Acken, Dorottya Demszky

In this study, we apply NLP methods to learn about the framing of the 2020 Democratic Presidential candidates in news media.

Word Embeddings

Learning to Recognize Dialect Features

no code implementations NAACL 2021 Dorottya Demszky, Devyani Sharma, Jonathan H. Clark, Vinodkumar Prabhakaran, Jacob Eisenstein

Evaluation on a test set of 22 dialect features of Indian English demonstrates that these models learn to recognize many features with high accuracy, and that a few minimal pairs can be as effective for training as thousands of labeled examples.

Measuring Conversational Uptake: A Case Study on Student-Teacher Interactions

1 code implementation ACL 2021 Dorottya Demszky, Jing Liu, Zid Mancenido, Julie Cohen, Heather Hill, Dan Jurafsky, Tatsunori Hashimoto

In conversation, uptake happens when a speaker builds on the contribution of their interlocutor by, for example, acknowledging, repeating or reformulating what they have said.

Math Question Answering

The NCTE Transcripts: A Dataset of Elementary Math Classroom Transcripts

1 code implementation 21 Nov 2022 Dorottya Demszky, Heather Hill

Classroom discourse is a core medium of instruction - analyzing it can provide a window into teaching and learning as well as driving the development of new tools for improving instruction.

Elementary Mathematics Math

MD3: The Multi-Dialect Dataset of Dialogues

no code implementations 19 May 2023 Jacob Eisenstein, Vinodkumar Prabhakaran, Clara Rivera, Dorottya Demszky, Devyani Sharma

We introduce a new dataset of conversational speech representing English from India, Nigeria, and the United States.

Is ChatGPT a Good Teacher Coach? Measuring Zero-Shot Performance For Scoring and Providing Actionable Insights on Classroom Instruction

1 code implementation 5 Jun 2023 Rose E. Wang, Dorottya Demszky

In doing so, we propose three teacher coaching tasks for generative AI: (A) scoring transcript segments based on classroom observation instruments, (B) identifying highlights and missed opportunities for good instructional strategies, and (C) providing actionable suggestions for eliciting more student reasoning.

Math

SIGHT: A Large Annotated Dataset on Student Insights Gathered from Higher Education Transcripts

1 code implementation 15 Jun 2023 Rose E. Wang, Pawan Wirawarn, Noah Goodman, Dorottya Demszky

To overcome this challenge, we propose a set of best practices for using large language models (LLMs) to cheaply classify the comments at scale.

Math

Kid-Whisper: Towards Bridging the Performance Gap in Automatic Speech Recognition for Children VS. Adults

no code implementations 12 Sep 2023 Ahmed Adel Attia, Jing Liu, Wei Ai, Dorottya Demszky, Carol Espy-Wilson

Recent advancements in Automatic Speech Recognition (ASR) systems, exemplified by Whisper, have demonstrated the potential of these systems to approach human-level performance given sufficient data.

Automatic Speech Recognition (ASR) +1

"Mistakes Help Us Grow": Facilitating and Evaluating Growth Mindset Supportive Language in Classrooms

no code implementations 16 Oct 2023 Kunal Handa, Margaret Clapper, Jessica Boyle, Rose E Wang, Diyi Yang, David S Yeager, Dorottya Demszky

Teachers' growth mindset supportive language (GMSL)--rhetoric emphasizing that one's skills can be improved over time--has been shown to significantly reduce disparities in academic achievement and enhance students' learning outcomes.

Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistakes

2 code implementations 16 Oct 2023 Rose E. Wang, Qingyang Zhang, Carly Robinson, Susanna Loeb, Dorottya Demszky

We evaluate state-of-the-art LLMs on our dataset and find that the expert's decision-making model is critical for LLMs to close the gap: responses from GPT4 with expert decisions (e.g., "simplify the problem") are +76% more preferred than without.

Decision Making Math

Measuring Five Accountable Talk Moves to Improve Instruction at Scale

no code implementations 2 Nov 2023 Ashlee Kupor, Candice Morgan, Dorottya Demszky

To build scalable measures of instruction, we fine-tune RoBERTa and GPT models to identify five instructional talk moves inspired by accountable talk theory: adding on, connecting, eliciting, probing and revoicing students' ideas.

Edu-ConvoKit: An Open-Source Library for Education Conversation Data

1 code implementation 7 Feb 2024 Rose E. Wang, Dorottya Demszky

We introduce Edu-ConvoKit, an open-source library designed to handle pre-processing, annotation and analysis of conversation data in education.

Backtracing: Retrieving the Cause of the Query

1 code implementation 6 Mar 2024 Rose E. Wang, Pawan Wirawarn, Omar Khattab, Noah Goodman, Dorottya Demszky

While information retrieval (IR) systems may provide answers for such user queries, they do not directly assist content creators -- such as lecturers who want to improve their content -- identify segments that _caused_ a user to ask those questions.

Information Retrieval Language Modelling +2
