no code implementations • PANDL (COLING) 2022 • Enrique Noriega-Atala, Robert Vacareanu, Gus Hahn-Powell, Marco A. Valenzuela-Escárcega
We propose a neural-based approach for rule synthesis designed to help bridge the gap between the interpretability, precision and maintainability exhibited by rule-based information extraction systems with the scalability and convenience of statistical information extraction systems.
no code implementations • NAACL (ACL) 2022 • Robert Vacareanu, George C.G. Barbosa, Enrique Noriega-Atala, Gus Hahn-Powell, Rebecca Sharp, Marco A. Valenzuela-Escárcega, Mihai Surdeanu
We propose a system that assists a user in constructing transparent information extraction models, consisting of patterns (or rules) written in a declarative language, through program synthesis. Users of our system can specify their requirements through the use of examples, which are collected with a search interface. The rule-synthesis system proposes rule candidates and the results of applying them on a textual corpus; the user has the option to accept the candidate, request another option, or adjust the examples provided to the system. Through an interactive evaluation, we show that our approach generates high-precision rules even in a 1-shot setting.
no code implementations • EACL (WANLP) 2021 • Elsayed Issa, Mohammed AlShakhori1, Reda Al-Bahrani, Gus Hahn-Powell
This work investigates the value of augmenting recurrent neural networks with feature engineering for the Second Nuanced Arabic Dialect Identification (NADI) Subtask 1. 2: Country-level DA identification.
1 code implementation • PANDL (COLING) 2022 • Arie Pratama Sutiono, Gus Hahn-Powell
In low resource settings, data augmentation strategies are commonly leveraged to improve performance.
no code implementations • 11 Jul 2021 • Seethalakshmi Gopalakrishnan, Victor Chen, Gus Hahn-Powell, Bharadwaj Tirunagar
The number of research articles in business and management has dramatically increased along with terminology, constructs, and measures.
no code implementations • ACL 2020 • Zheng Tang, Gus Hahn-Powell, Mihai Surdeanu
Our approach uses an encoder-decoder architecture, which jointly trains a classifier for event extraction, and a rule decoder that generates syntactico-semantic rules that explain the decisions of the event classifier.
no code implementations • LREC 2020 • Marco A. Valenzuela-Esc{\'a}rcega, Gus Hahn-Powell, Dane Bell
We present Odinson, a rule-based information extraction framework, which couples a simple yet powerful pattern language that can operate over multiple representations of text, with a runtime system that operates in near real time.
no code implementations • NAACL 2019 • George C. G. Barbosa, Zechy Wong, Gus Hahn-Powell, Dane Bell, Rebecca Sharp, Marco A. Valenzuela-Esc{\'a}rcega, Mihai Surdeanu
Many of the most pressing current research problems (e. g., public health, food security, or climate change) require multi-disciplinary collaborations.
no code implementations • WS 2018 • Fan Luo, Marco A. Valenzuela-Esc{\'a}rcega, Gus Hahn-Powell, Mihai Surdeanu
We introduce a machine learning approach for the identification of {``}white spaces{''} in scientific knowledge.
1 code implementation • LREC 2018 • Angus G. Forbes, Kristine Lee, Gus Hahn-Powell, Marco A. Valenzuela-Escárcega, Mihai Surdeanu
Additionally, we include an approach to representing text annotations in which annotation subgraphs, or semantic summaries, are used to show relationships outside of the sequential context of the text itself.
no code implementations • WS 2016 • Marco A. Valenzuela-Escarcega, Gus Hahn-Powell, Dane Bell, Mihai Surdeanu
We propose an approach for biomedical information extraction that marries the advantages of machine learning models, e. g., learning directly from data, with the benefits of rule-based approaches, e. g., interpretability.
2 code implementations • WS 2016 • Gus Hahn-Powell, Dane Bell, Marco A. Valenzuela-Escárcega, Mihai Surdeanu
Causal precedence between biochemical interactions is crucial in the biomedical domain, because it transforms collections of individual interactions, e. g., bindings and phosphorylations, into the causal mechanisms needed to inform meaningful search and inference.
no code implementations • LREC 2016 • Marco A. Valenzuela-Esc{\'a}rcega, Gus Hahn-Powell, Mihai Surdeanu
Odin is an information extraction framework that applies cascades of finite state automata over both surface text and syntactic dependency graphs.
no code implementations • LREC 2016 • Dane Bell, Gus Hahn-Powell, Marco A. Valenzuela-Escárcega, Mihai Surdeanu
We describe challenges and advantages unique to coreference resolution in the biomedical domain, and a sieve-based architecture that leverages domain knowledge for both entity and event coreference resolution.
1 code implementation • 24 Sep 2015 • Marco A. Valenzuela-Escárcega, Gus Hahn-Powell, Mihai Surdeanu
Here we include a thorough definition of the Odin rule language, together with a description of the Odin API in the Scala language, which allows one to apply these rules to arbitrary texts.