no code implementations • EACL (WANLP) 2021 • Fatemah Husain, Ozlem Uzuner
Sarcasm detection is one of the top challenging tasks in text classification, particularly for informal Arabic with high syntactic and semantic ambiguity.
no code implementations • NAACL (BioNLP) 2021 • Jooyeon Lee, Huong Dang, Ozlem Uzuner, Sam Henry
This paper details a Consumer Health Question (CHQ) summarization model submitted to MEDIQA 2021 for shared task 1: Question Summarization.
no code implementations • 24 Oct 2024 • Yujuan Velvin Fu, Giridhar Kaushik Ramachandran, Namu Park, Kevin Lybarger, Fei Xia, Ozlem Uzuner, Meliha Yetisgen
Our experiments show that our BioMistral-NLU outperforms the original BioMistral, as well as the proprietary LLMs - ChatGPT and GPT-4.
no code implementations • 24 Oct 2024 • Yujuan Fu, Ozlem Uzuner, Meliha Yetisgen, Fei Xia
Large language models (LLMs) have demonstrated great performance across various benchmarks, showing potential as general-purpose task solvers.
1 code implementation • 31 Mar 2024 • Yujuan Fu, Giridhar Kaushik Ramachandran, Nicholas J Dobbins, Namu Park, Michael Leu, Abby R. Rosenberg, Kevin Lybarger, Fei Xia, Ozlem Uzuner, Meliha Yetisgen
In this work, we present a novel annotated corpus, the Pediatric Social History Annotation Corpus (PedSHAC), and evaluate the automatic extraction of detailed SDoH representations using fine-tuned and in-context learning methods with Large Language Models (LLMs).
1 code implementation • 27 Mar 2024 • Namu Park, Kevin Lybarger, Giridhar Kaushik Ramachandran, Spencer Lewis, Aashka Damani, Ozlem Uzuner, Martin Gunn, Meliha Yetisgen
Here, we introduce the Corpus of Annotated Medical Imaging Reports (CAMIR), which includes 609 annotated radiology reports from three imaging modality types: Computed Tomography, Magnetic Resonance Imaging, and Positron Emission Tomography-Computed Tomography.
no code implementations • 22 Mar 2024 • Sadiya Sayara Chowdhury Puspo, Md Nishat Raihan, Dhiman Goswami, Al Nahian Bin Emran, Amrita Ganguly, Ozlem Uzuner
This paper presents the MasonTigers entry to the SemEval-2024 Task 8 - Multigenerator, Multidomain, and Multilingual Black-Box Machine-Generated Text Detection.
no code implementations • 17 Oct 2023 • Fardin Ahsan Sakib, Ahnaf Atef Choudhury, Ozlem Uzuner
With this in mind, the eRisk 2023 Task 1 was designed to do exactly that: assess the relevance of different sentences to the symptoms of depression as outlined in the BDI questionnaire.
no code implementations • 26 Apr 2023 • Giridhar Kaushik Ramachandran, Haritha Gangavarapu, Kevin Lybarger, Ozlem Uzuner
The Reddit Health Online Talk (RedHot) corpus contains posts from medical condition-related subreddits with annotations characterizing the patient experience and medical conditions.
no code implementations • 13 Apr 2023 • Nicholas J Dobbins, Bin Han, Weipeng Zhou, Kristine Lan, H. Nina Kim, Robert Harrington, Ozlem Uzuner, Meliha Yetisgen
Conclusions: Our work contributes a state-of-the-art data model-agnostic query generation system capable of conditional reasoning using a knowledge base.
no code implementations • 14 Mar 2023 • Yanjun Gao, Dmitriy Dligach, Timothy Miller, Matthew M Churpek, Ozlem Uzuner, Majid Afshar
The goal of the task was to identify and prioritize diagnoses as the first steps in diagnostic decision support to find the most relevant information in long documents like daily progress notes.
1 code implementation • 27 Jul 2022 • Nicholas J Dobbins, Tony Mullen, Ozlem Uzuner, Meliha Yetisgen
In order to identify potential participants at scale, these criteria must first be translated into queries on clinical databases, which can be labor-intensive and error-prone.
no code implementations • 9 Mar 2022 • Maryam Heidari, James H Jr Jones, Ozlem Uzuner
The new proposed model for bot detection creates user profiles based on personal information such as age, personality, gender, education from users' online posts and introduces a machine learning model to detect social bots with high prediction accuracy based on personal information.
no code implementations • 7 Feb 2022 • Fatemah Husain, Ozlem Uzuner
The problem of online offensive language limits the health and security of online users.
no code implementations • 7 Dec 2021 • Yanjun Gao, Dmitriy Dligach, Leslie Christensen, Samuel Tesch, Ryan Laffin, Dongfang Xu, Timothy Miller, Ozlem Uzuner, Matthew M Churpek, Majid Afshar
Conclusions: The existing clinical NLP tasks cover a wide range of topics and the field will continue to grow and attract more attention from both general domain NLP and clinical informatics community.
no code implementations • 20 Aug 2021 • Kevin Lybarger, Aashka Damani, Martin Gunn, Ozlem Uzuner, Meliha Yetisgen
Medical imaging reports distill the findings and observations of radiologists, creating an unstructured textual representation of unstructured medical images.
no code implementations • 17 Feb 2021 • Paul Barry, Sam Henry, Meliha Yetisgen, Bridget McInnes, Ozlem Uzuner
We hypothesize that explicit integration of contextual information into an Multi-task Learning framework would emphasize the significance of context for boosting performance in jointly learning Named Entity Recognition (NER) and Relation Extraction (RE).
no code implementations • 17 Feb 2021 • Kahyun Lee, Nicholas J. Dobbins, Bridget McInnes, Meliha Yetisgen, Ozlem Uzuner
We measured: transferability from external sources; transferability across note types; the contribution of external source data when in-domain training data are available; and transferability across institutions.
no code implementations • 9 Feb 2021 • Fatemah Husain, Ozlem Uzuner
In our study, we apply the principles of transfer learning cross multiple Arabic offensive language datasets to compare the effects on system performance.
no code implementations • 20 Jan 2021 • Fatemah Husain, Ozlem Uzuner
This paper adding more insights towards resources and datasets used in Arabic offensive language research.
no code implementations • SEMEVAL 2020 • Fatemah Husain, Jooyeon Lee, Sam Henry, Ozlem Uzuner
This paper describes SalamNET, an Arabic offensive language detection system that has been submitted to SemEval 2020 shared task 12: Multilingual Offensive Language Identification in Social Media.
no code implementations • 28 Jul 2020 • Fatemah Husain, Jooyeon Lee, Samuel Henry, Ozlem Uzuner
This paper describes SalamNET, an Arabic offensive language detection system that has been submitted to SemEval 2020 shared task 12: Multilingual Offensive Language Identification in Social Media.
no code implementations • WS 2019 • Paul Barry, Ozlem Uzuner
Social Media Mining for Health Applications (SMM4H) Adverse Effect Mentions Shared Task challenges participants to accurately identify spans of text within a tweet that correspond to Adverse Effects (AEs) resulting from medication usage (Weissenbacher et al., 2019).
1 code implementation • 14 May 2019 • Wilson Lau, Thomas H Payne, Ozlem Uzuner, Meliha Yetisgen
Communication of follow-up recommendations when abnormalities are identified on imaging studies is prone to error.
no code implementations • WS 2016 • Ji Young Lee, Franck Dernoncourt, Ozlem Uzuner, Peter Szolovits
In this work, we explore a method to incorporate human-engineered features as well as features derived from EHRs to a neural-network-based de-identification system.
1 code implementation • 10 Jun 2016 • Franck Dernoncourt, Ji Young Lee, Ozlem Uzuner, Peter Szolovits
It yields an F1-score of 97. 85 on the i2b2 2014 dataset, with a recall 97. 38 and a precision of 97. 32, and an F1-score of 99. 23 on the MIMIC de-identification dataset, with a recall 99. 25 and a precision of 99. 06.
no code implementations • 16 Oct 2015 • Weiyi Sun, Anna Rumshisky, Ozlem Uzuner
We analyze the RI-TIMEXes in temporally annotated corpora and propose two hypotheses regarding the normalization of RI-TIMEXes in the clinical narrative domain: the anchor point hypothesis and the anchor relation hypothesis.