no code implementations • 12 Sep 2018 • Liheng Chen, Yanru Qu, Zhenghui Wang, Lin Qiu, Wei-Nan Zhang, Ken Chen, Shaodian Zhang, Yong Yu
TGE-PS uses Pairs Sampling (PS) to improve the sampling strategy of RW, being able to reduce ~99% training samples while preserving competitive performance.
1 code implementation • COLING 2018 • Junjie Xing, Kenny Zhu, Shaodian Zhang
Chinese word segmentation (CWS) trained from open source corpus faces dramatic performance drop when dealing with domain text, especially for a domain with lots of special terms and diverse writing styles, such as the biomedical domain.
no code implementations • NAACL 2018 • Zhenghui Wang, Yanru Qu, Li-Heng Chen, Jian Shen, Wei-Nan Zhang, Shaodian Zhang, Yimei Gao, Gen Gu, Ken Chen, Yong Yu
We study the problem of named entity recognition (NER) from electronic medical records, which is one of the most fundamental and critical problems for medical text mining.
Medical Named Entity Recognition named-entity-recognition +3
no code implementations • 28 Mar 2016 • Shaodian Zhang, Edouard Grave, Elizabeth Sklar, Noemie Elhadad
Identifying topics of discussions in online health communities (OHC) is critical to various applications, but can be difficult because topics of OHC content are usually heterogeneous and domain-dependent.