no code implementations • CL (ACL) 2021 • Lifeng Jin, Lane Schwartz, Finale Doshi-Velez, Timothy Miller, William Schuler
Abstract This article describes a simple PCFG induction model with a fixed category domain that predicts a large majority of attested constituent boundaries, and predicts labels consistent with nearly half of attested constituent labels on a standard evaluation data set of child-directed speech.
no code implementations • NAACL (BioNLP) 2021 • Chen Lin, Timothy Miller, Dmitriy Dligach, Steven Bethard, Guergana Savova
We propose a methodology to produce a model focused on the clinical domain: continued pretraining of a model with a broad representation of biomedical terminology (PubMedBERT) on a clinical corpus along with a novel entity-centric masking strategy to infuse domain knowledge in the learning process.
no code implementations • NAACL (ClinicalNLP) 2022 • Dmitriy Dligach, Steven Bethard, Timothy Miller, Guergana Savova
Sequence-to-sequence models are appealing because they allow both encoder and decoder to be shared across many tasks by formulating those tasks as text-to-text problems.
no code implementations • NAACL (ClinicalNLP) 2022 • Lijing Wang, Timothy Miller, Steven Bethard, Guergana Savova
In this paper, we investigate ensemble methods for fine-tuning transformer-based pretrained models for clinical natural language processing tasks, specifically temporal relation extraction from the clinical narrative.
no code implementations • EMNLP (Louhi) 2020 • Kristin Wright-Bettner, Chen Lin, Timothy Miller, Steven Bethard, Dmitriy Dligach, Martha Palmer, James H. Martin, Guergana Savova
We present refinements over existing temporal relation annotations in the Electronic Medical Record clinical narrative.
no code implementations • EMNLP (ClinicalNLP) 2020 • Xiyu Ding, Mei-Hua Hall, Timothy Miller
Reducing rates of early hospital readmission has been recognized and identified as a key to improve quality of care and reduce costs.
no code implementations • EMNLP (ClinicalNLP) 2020 • Danielle Bitterman, Timothy Miller, David Harris, Chen Lin, Sean Finan, Jeremy Warner, Raymond Mak, Guergana Savova
We present work on extraction of radiotherapy treatment information from the clinical narrative in the electronic medical records.
no code implementations • EACL (AdaptNLP) 2021 • Timothy Miller, Egoitz Laparra, Steven Bethard
Advances in transfer learning and domain adaptation have raised hopes that once-challenging NLP tasks are ready to be put to use for sophisticated information extraction needs.
no code implementations • COLING 2022 • Yanjun Gao, Dmitriy Dligach, Timothy Miller, Dongfang Xu, Matthew M. M. Churpek, Majid Afshar
In this work, we propose a new NLP task that aims to generate a list of problems in a patient’s daily care plan using input from the provider’s progress notes during hospitalization.
no code implementations • 16 Oct 2024 • Yingya Li, Timothy Miller, Steven Bethard, Guergana Savova
We propose a metric of task relatedness based on task difficulty measured by pointwise V-usable information (PVI).
1 code implementation • 28 Mar 2024 • Shan Chen, Jack Gallifant, Marco Guevara, Yanjun Gao, Majid Afshar, Timothy Miller, Dmitriy Dligach, Danielle S. Bitterman
Generative models have been showing potential for producing data in mass.
1 code implementation • 26 Oct 2023 • Shan Chen, Marco Guevara, Shalini Moningi, Frank Hoebers, Hesham Elhalawani, Benjamin H. Kann, Fallon E. Chipidza, Jonathan Leeman, Hugo J. W. L. Aerts, Timothy Miller, Guergana K. Savova, Raymond H. Mak, Maryam Lustberg, Majid Afshar, Danielle S. Bitterman
Results show promise for AI to improve clinician efficiency and patient care through assisting documentation, if used judiciously.
no code implementations • 28 Aug 2023 • Yanjun Gao, Ruizhe Li, John Caskey, Dmitriy Dligach, Timothy Miller, Matthew M. Churpek, Majid Afshar
In this paper, we outline an innovative approach for augmenting the proficiency of LLMs in the realm of automated diagnosis generation, achieved through the incorporation of a medical knowledge graph (KG) and a novel graph model: Dr. Knows, inspired by the clinical diagnostic reasoning process.
no code implementations • 8 Jun 2023 • Yanjun Gao, Dmitriy Dligach, Timothy Miller, Matthew M. Churpek, Majid Afshar
The BioNLP Workshop 2023 initiated the launch of a shared task on Problem List Summarization (ProbSum) in January 2023.
no code implementations • 7 Jun 2023 • Brihat Sharma, Yanjun Gao, Timothy Miller, Matthew M. Churpek, Majid Afshar, Dmitriy Dligach
Generative artificial intelligence (AI) is a promising direction for augmenting clinical diagnostic decision support and reducing diagnostic errors, a leading contributor to medical errors.
no code implementations • 14 Mar 2023 • Yanjun Gao, Dmitriy Dligach, Timothy Miller, Matthew M Churpek, Ozlem Uzuner, Majid Afshar
The goal of the task was to identify and prioritize diagnoses as the first steps in diagnostic decision support to find the most relevant information in long documents like daily progress notes.
no code implementations • 29 Sep 2022 • Yanjun Gao, Dmitriy Dligach, Timothy Miller, John Caskey, Brihat Sharma, Matthew M Churpek, Majid Afshar
The potential for clinical natural language processing (cNLP) to model diagnostic reasoning in humans with forward reasoning from data to diagnosis and potentially reduce the cognitive burden and medical error has not been investigated.
no code implementations • 17 Aug 2022 • Yanjun Gao, Dmitriy Dligach, Timothy Miller, Dongfang Xu, Matthew M. Churpek, Majid Afshar
In this work, we propose a new NLP task that aims to generate a list of problems in a patient's daily care plan using input from the provider's progress notes during hospitalization.
no code implementations • LREC 2022 • Yanjun Gao, Dmitriy Dligach, Timothy Miller, Samuel Tesch, Ryan Laffin, Matthew M. Churpek, Majid Afshar
This work introduces a hierarchical annotation schema with three stages to address clinical text understanding, clinical reasoning, and summarization.
no code implementations • 7 Dec 2021 • Yanjun Gao, Dmitriy Dligach, Leslie Christensen, Samuel Tesch, Ryan Laffin, Dongfang Xu, Timothy Miller, Ozlem Uzuner, Matthew M Churpek, Majid Afshar
Conclusions: The existing clinical NLP tasks cover a wide range of topics and the field will continue to grow and attract more attention from both general domain NLP and clinical informatics community.
1 code implementation • 26 Nov 2021 • Dongfang Xu, Shan Chen, Timothy Miller
In this paper, we present our work participating in the BioCreative VII Track 3 - automatic extraction of medication names in tweets, where we implemented a multi-task learning model that is jointly trained on text classification and sequence labelling.
1 code implementation • SEMEVAL 2021 • Egoitz Laparra, Xin Su, Yiyun Zhao, {\"O}zlem Uzuner, Timothy Miller, Steven Bethard
Participants are then tested on data representing a new (target) domain.
no code implementations • 14 May 2021 • Xin Su, Timothy Miller, Xiyu Ding, Majid Afshar, Dmitriy Dligach
Automatic phenotyping is a task of identifying cohorts of patients that match a predefined set of criteria.
no code implementations • 6 Oct 2020 • Yuqi Si, Jingcheng Du, Zhao Li, Xiaoqian Jiang, Timothy Miller, Fei Wang, W. Jim Zheng, Kirk Roberts
We show the importance and feasibility of learning comprehensive representations of patient EHR data through a systematic review.
no code implementations • WS 2020 • Chen Lin, Timothy Miller, Dmitriy Dligach, Farig Sadeque, Steven Bethard, Guergana Savova
Recently BERT has achieved a state-of-the-art performance in temporal relation extraction from clinical Electronic Medical Records text.
no code implementations • WS 2020 • Xiyu Ding, Michael Barnett, Ateev Mehrotra, Timothy Miller
Electronic consult (eConsult) systems allow specialists more flexibility to respond to referrals more efficiently, thereby increasing access in under-resourced healthcare settings like safety net systems.
no code implementations • WS 2019 • Kristin Wright-Bettner, Martha Palmer, Guergana Savova, Piet de Groen, Timothy Miller
This paper discusses a cross-document coreference annotation schema that was developed to further automatic extraction of timelines in the clinical domain.
no code implementations • WS 2019 • Dianbo Liu, Dmitriy Dligach, Timothy Miller
A large percentage of medical information is in unstructured text format in electronic medical record systems.
no code implementations • ACL 2019 • Lifeng Jin, Finale Doshi-Velez, Timothy Miller, Lane Schwartz, William Schuler
This paper describes a neural PCFG inducer which employs context embeddings (Peters et al., 2018) in a normalizing flow model (Dinh et al., 2015) to extend PCFG induction to use semantic and morphological information.
no code implementations • NAACL 2019 • Timothy Miller
Unsupervised domain adaptation (UDA) is the task of training a statistical model on labeled data from a source domain to achieve better performance on data from a target domain, with access to only unlabeled data in the target domain.
no code implementations • WS 2019 • Timothy Miller, Alon Geva, Dmitriy Dligach
In this paper we describe an evaluation of the potential of classical information extraction methods to extract drug-related attributes, including adverse drug events, and compare to more recently developed neural methods.
no code implementations • WS 2019 • Chen Lin, Timothy Miller, Dmitriy Dligach, Steven Bethard, Guergana Savova
Classic methods for clinical temporal relation extraction focus on relational candidates within a sentence.
no code implementations • 28 Nov 2018 • Dianbo Liu, Timothy Miller, Raheel Sayeed, Kenneth D. Mandl
Electronic health record (EHR) data is collected by individual institutions and often stored across locations in silos.
no code implementations • WS 2018 • Chen Lin, Timothy Miller, Dmitriy Dligach, Hadi Amiri, Steven Bethard, Guergana Savova
Neural network models are oftentimes restricted by limited labeled instances and resort to advanced architectures and features for cutting edge performance.
1 code implementation • EMNLP 2018 • Lifeng Jin, Finale Doshi-Velez, Timothy Miller, William Schuler, Lane Schwartz
There have been several recent attempts to improve the accuracy of grammar induction systems by bounding the recursive complexity of the induction model (Ponvert et al., 2011; Noji and Johnson, 2016; Shain et al., 2016; Jin et al., 2018).
no code implementations • NAACL 2018 • Hadi Amiri, Timothy Miller, Guergana Savova
Automatic identification of spurious instances (those with potentially wrong labels in datasets) can improve the quality of existing language resources, especially when annotations are obtained through crowdsourcing or automatically generated based on coded rankings.
1 code implementation • SEMEVAL 2018 • Dmitriy Dligach, Timothy Miller
Mining electronic health records for patients who satisfy a set of predefined criteria is known in medical informatics as phenotyping.
1 code implementation • TACL 2018 • Lifeng Jin, Finale Doshi-Velez, Timothy Miller, William Schuler, Lane Schwartz
There has been recent interest in applying cognitively or empirically motivated bounds on recursion depth to limit the search space of grammar induction models (Ponvert et al., 2011; Noji and Johnson, 2016; Shain et al., 2016).
no code implementations • EMNLP 2017 • Hadi Amiri, Timothy Miller, Guergana Savova
We present a novel approach for training artificial neural networks.
no code implementations • WS 2017 • Timothy Miller, Steven Bethard, Hadi Amiri, Guergana Savova
Detecting negated concepts in clinical texts is an important part of NLP information extraction systems.
no code implementations • WS 2017 • Chen Lin, Timothy Miller, Dmitriy Dligach, Steven Bethard, Guergana Savova
Token sequences are often used as the input for Convolutional Neural Networks (CNNs) in natural language processing.
no code implementations • EACL 2017 • Dmitriy Dligach, Timothy Miller, Chen Lin, Steven Bethard, Guergana Savova
We experiment with neural architectures for temporal relation extraction and establish a new state-of-the-art for several scenarios.
no code implementations • COLING 2016 • Cory Shain, William Bryce, Lifeng Jin, Victoria Krakovna, Finale Doshi-Velez, Timothy Miller, William Schuler, Lane Schwartz
This paper presents a new memory-bounded left-corner parsing model for unsupervised raw-text syntax induction, using unsupervised hierarchical hidden Markov models (UHHMM).
no code implementations • TACL 2014 • William F. Styler IV, Steven Bethard, Sean Finan, Martha Palmer, Sameer Pradhan, Piet C de Groen, Brad Erickson, Timothy Miller, Chen Lin, Guergana Savova, James Pustejovsky
The corpus is available to the community and has been proposed for use in a SemEval 2015 task.