Search Results for author: Marina Danilevsky

Found 12 papers, 2 papers with code

Active Learning for BERT: An Empirical Study

1 code implementation EMNLP 2020 Liat Ein-Dor, Alon Halfon, Ariel Gera, Eyal Shnarch, Lena Dankin, Leshem Choshen, Marina Danilevsky, Ranit Aharonov, Yoav Katz, Noam Slonim

Here, we present a large-scale empirical study on active learning techniques for BERT-based classification, addressing a diverse set of AL strategies and datasets.

Active Learning General Classification +1

Learning to Robustly Aggregate Labeling Functions for Semi-supervised Data Programming

no code implementations Findings (ACL) 2022 Ayush Maheshwari, KrishnaTeja Killamsetty, Ganesh Ramakrishnan, Rishabh Iyer, Marina Danilevsky, Lucian Popa

These LFs, in turn, have been used to generate a large amount of additional noisy labeled data, in a paradigm that is now commonly referred to as data programming.

Text Classification

Facilitating Knowledge Sharing from Domain Experts to Data Scientists for Building NLP Models

no code implementations29 Jan 2021 Soya Park, April Wang, Ban Kawas, Q. Vera Liao, David Piorkowski, Marina Danilevsky

Data scientists face a steep learning curve in understanding a new domain for which they want to build machine learning (ML) models.

A Survey of the State of Explainable AI for Natural Language Processing

no code implementations Asian Chapter of the Association for Computational Linguistics 2020 Marina Danilevsky, Kun Qian, Ranit Aharonov, Yannis Katsis, Ban Kawas, Prithviraj Sen

Recent years have seen important advances in the quality of state-of-the-art models, but this has come at the expense of models becoming less interpretable.

DIMSIM: An Accurate Chinese Phonetic Similarity Algorithm Based on Learned High Dimensional Encoding

1 code implementation CONLL 2018 Min Li, Marina Danilevsky, Sara Noeman, Yunyao Li

Phonetic similarity algorithms identify words and phrases with similar pronunciation which are used in many natural language processing tasks.

Spelling Correction

SystemT: Declarative Text Understanding for Enterprise

no code implementations NAACL 2018 Laura Chiticariu, Marina Danilevsky, Yunyao Li, Frederick Reiss, Huaiyu Zhu

The rise of enterprise applications over unstructured and semi-structured documents poses new challenges to text understanding systems across multiple dimensions.

Document Classification Entity Extraction using GAN +3

Multilingual Information Extraction with PolyglotIE

no code implementations COLING 2016 Alan Akbik, Laura Chiticariu, Marina Danilevsky, Yonas Kbrom, Yunyao Li, Huaiyu Zhu

We present PolyglotIE, a web-based tool for developing extractors that perform Information Extraction (IE) over multilingual data.

Semantic Parsing

Pagination: It's what you say, not how long it takes to say it

no code implementations11 Apr 2014 Joshua Hailpern, Niranjan Damera Venkata, Marina Danilevsky

Results from this experiment show that one of our approaches strongly outperforms the baselines and alternatives.

Language Modelling

Cannot find the paper you are looking for? You can Submit a new open access paper.