1 code implementation • WMT (EMNLP) 2020 • Rachel Bawden, Giorgio Maria Di Nunzio, Cristian Grozea, Inigo Jauregi Unanue, Antonio Jimeno Yepes, Nancy Mah, David Martinez, Aurélie Névéol, Mariana Neves, Maite Oronoz, Olatz Perez-de-Viñaspre, Massimo Piccardi, Roland Roller, Amy Siu, Philippe Thomas, Federica Vezzani, Maika Vicente Navarro, Dina Wiemann, Lana Yeganova
Machine translation of scientific abstracts and terminologies has the potential to support health professionals and biomedical researchers in some of their activities.
no code implementations • WMT (EMNLP) 2021 • Lana Yeganova, Dina Wiemann, Mariana Neves, Federica Vezzani, Amy Siu, Inigo Jauregi Unanue, Maite Oronoz, Nancy Mah, Aurélie Névéol, David Martinez, Rachel Bawden, Giorgio Maria Di Nunzio, Roland Roller, Philippe Thomas, Cristian Grozea, Olatz Perez-de-Viñaspre, Maika Vicente Navarro, Antonio Jimeno Yepes
In the sixth edition of the WMT Biomedical Task, we addressed a total of eight language pairs, namely English/German, English/French, English/Spanish, English/Portuguese, English/Chinese, English/Russian, English/Italian, and English/Basque.
no code implementations • NAACL (BioNLP) 2021 • Lana Yeganova, Won Gyu Kim, Donald Comeau, W John Wilbur, Zhiyong Lu
In this work we establish the connection between the BM25 score of a query term appearing in a section of a full text document and the probability of that document being clicked or identified as relevant.
1 code implementation • 2 Jul 2023 • Qiao Jin, Won Kim, Qingyu Chen, Donald C. Comeau, Lana Yeganova, John Wilbur, Zhiyong Lu
Experimental results show that BioCPT sets new state-of-the-art performance on five biomedical IR tasks, outperforming various baselines including much larger models such as GPT-3-sized cpt-text-XL.
no code implementations • 15 Jun 2023 • Shubo Tian, Qiao Jin, Lana Yeganova, Po-Ting Lai, Qingqing Zhu, Xiuying Chen, Yifan Yang, Qingyu Chen, Won Kim, Donald C. Comeau, Rezarta Islamaj, Aadit Kapoor, Xin Gao, Zhiyong Lu
In this work, we examine the diverse applications of large language models (LLMs), such as ChatGPT, in biomedicine and health.
no code implementations • 7 Aug 2020 • Lana Yeganova, Rezarta Islamaj, Qingyu Chen, Robert Leaman, Alexis Allot, Chin-Hsuan Wei, Donald C. Comeau, Won Kim, Yifan Peng, W. John Wilbur, Zhiyong Lu
In this study we analyze the LitCovid collection, 13, 369 COVID-19 related articles found in PubMed as of May 15th, 2020 with the purpose of examining the landscape of literature and presenting it in a format that facilitates information navigation and understanding.
no code implementations • 4 Dec 2019 • Rezarta Islamaj, Lana Yeganova, Won Kim, Natalie Xie, W. John Wilbur
In this work, we present PDC (probabilistic distributional clustering), a novel algorithm that, given a document collection, computes disjoint term sets representing topics in the collection.
no code implementations • WS 2018 • Won Gyu Kim, Lana Yeganova, Donald Comeau, W. John Wilbur, Zhiyong Lu
Creating simulated search environments has been of a significant interest in infor-mation retrieval, in both general and bio-medical search domains.
no code implementations • WS 2018 • Lana Yeganova, Donald C. Comeau, Won Kim, W. John Wilbur, Zhiyong Lu
A search that is targeted at finding a specific document in databases is called a Single Citation search.
no code implementations • Journal of Biomedical Informatics 2015 • Sun Kim, Haibin Liu, Lana Yeganova, W. John Wilbur
In this work, we propose an efficient and scalable system using a linear kernel to identify DDI information.