no code implementations • 17 Feb 2025 • Sowmya Vajjala, Shwetali Shimangaud
Large Language Models revolutionized NLP and showed dramatic performance improvements across several tasks.
no code implementations • 18 Sep 2024 • Bashar Alhafni, Sowmya Vajjala, Stefano Bannò, Kaushal Kumar Maurya, Ekaterina Kochmar
This COLING 2025 tutorial is designed for researchers and practitioners interested in the educational applications of NLP and the role LLMs have to play in this area.
no code implementations • 27 Jun 2024 • Gabriel Bernier-Colborne, Sowmya Vajjala
In terms of entity mentions, we corrected the span and/or type of ~8% of mentions in the dataset, while adding/deleting/splitting/merging a few more.
no code implementations • 20 Jun 2024 • Taraka Rama, Sowmya Vajjala
The Universal Dependencies (UD) project aims to create a cross-linguistically consistent dependency annotation for multiple languages, to facilitate multilingual NLP.
1 code implementation • 5 Apr 2024 • Gaurav Kamath, Sebastian Schuster, Sowmya Vajjala, Siva Reddy
Sentences containing multiple semantic operators with overlapping scope often create ambiguities in interpretation, known as scope ambiguities.
no code implementations • 30 May 2023 • Akshay Srinivasan, Sowmya Vajjala
Our results showed the NER models we explored across three languages (English, German and Hindi) are not very robust to such changes, as indicated by the fluctuations in the overall F1 score as well as in a more fine-grained evaluation.
no code implementations • 26 Dec 2022 • Diego Maupomé, Fanny Rancourt, Thomas Soulas, Alexandre Lachance, Marie-Jean Meurs, Desislava Aleksandrova, Olivier Brochu Dufour, Igor Pontes, Rémi Cardon, Michel Simard, Sowmya Vajjala
This report summarizes the work carried out by the authors during the Twelfth Montreal Industrial Problem Solving Workshop, held at Universit\'e de Montr\'eal in August 2022.
no code implementations • LREC 2022 • Sowmya Vajjala, Ramya Balasubramaniam
Named Entity Recognition (NER) is a well researched NLP task and is widely used in real world NLP scenarios.
1 code implementation • Findings (ACL) 2022 • Justin Lee, Sowmya Vajjala
Automatic Readability Assessment (ARA), the task of assigning a reading level to a text, is traditionally treated as a classification problem in NLP research.
no code implementations • LREC 2022 • Sowmya Vajjala
This article is a brief survey of contemporary research on developing computational models for readability assessment.
no code implementations • NAACL (TeachingNLP) 2021 • Sowmya Vajjala
NLP's sphere of influence went much beyond computer science research and the development of software applications in the past decade.
1 code implementation • 25 Feb 2021 • Taraka Rama, Sowmya Vajjala
Our results indicate that while fine-tuned embeddings are useful for multilingual proficiency modeling, none of the features achieve consistently best performance for all dimensions of language proficiency.
1 code implementation • WS 2019 • Sowmya Vajjala, Ivana Lucic
To address this gap, we conducted a user study in which over a 100 participants read texts of different reading levels and answered questions created to test three forms of comprehension.
no code implementations • WS 2018 • Sowmya Vajjala, Ivana Lu{\v{c}}i{\'c}
This paper describes the collection and compilation of the OneStopEnglish corpus of texts written at three reading levels, and demonstrates its usefulness for through two applications - automatic readability assessment and automatic text simplification.
1 code implementation • WS 2018 • Sowmya Vajjala, Taraka Rama
The Common European Framework of Reference (CEFR) guidelines describe language proficiency of learners on a scale of 6 levels.
no code implementations • 16 Apr 2018 • Sowmya Vajjala, Ziwei Zhou
This paper describes our experiments with automatically identifying native accents from speech samples of non-native English speakers using low level audio features, and n-gram features from manual transcriptions.
no code implementations • 24 Mar 2018 • Sowmya Vajjala
This entry introduces the topic of machine learning and provides an overview of its relevance for applied linguistics and language learning.
1 code implementation • WS 2017 • Sowmya Vajjala, Sagnik Banerjee
We report on our experiments with N-gram and embedding based feature representations for Native Language Identification (NLI) as a part of the NLI Shared Task 2017 (team name: NLI-ISU).
Ranked #2 on
Native Language Identification
on italki NLI
1 code implementation • 2 Dec 2016 • Sowmya Vajjala
While the results show that the feature set used results in good predictive models with both datasets, the question "what are the most predictive features?"
no code implementations • WS 2016 • Sowmya Vajjala, Detmar Meurers, Alex Eitel, er, Katharina Scheiter
Computational approaches to readability assessment are generally built and evaluated using gold standard corpora labeled by publishers or teachers rather than being grounded in observations about human performance.
no code implementations • 29 Mar 2016 • Ildikó Pilán, Sowmya Vajjala, Elena Volodina
Corpora and web texts can become a rich language learning resource if we have a means of assessing whether they are linguistically appropriate for learners at a given proficiency level.
1 code implementation • 18 Mar 2016 • Sowmya Vajjala, Detmar Meurers
We propose a new method for evaluating the readability of simplified sentences through pair-wise ranking.