no code implementations • ACL 2013 • Sebastian Sulger, Miriam Butt, Tracy Holloway King, Paul Meurer, Tibor Laczk{\'o}, Gy{\"o}rgy R{\'a}kosi, Cheikh Bamba Dione, Helge Dyvik, Victoria Ros{\'e}n, Koenraad De Smedt, Agnieszka Patejuk, {\"O}zlem {\c{C}}etino{\u{g}}lu, I Wayan Arka, Meladel Mistica
no code implementations • LREC 2014 • Saba Urooj, Sarmad Hussain, Asad Mustafa, Rahila Parveen, Farah Adeeba, Tafseer Ahmed Khan, Miriam Butt, Annette Hautli
The paper presents a design schema and details of a new Urdu POS tagset.
no code implementations • LREC 2012 • Tafseer Ahmed, Miriam Butt, Annette Hautli, Sebastian Sulger
When dealing with languages of South Asia from an NLP perspective, a problem that repeatedly crops up is the treatment of complex predicates.
no code implementations • ACL 2019 • Mennatallah El-Assady, Wolfgang Jentner, Fabian Sperrle, Rita Sevastjanova, Annette Hautli-Janisz, Miriam Butt, Daniel Keim
We present a modular framework for the rapid-prototyping of linguistic, web-based, visual analytics applications.
no code implementations • WS 2019 • Aikaterini-Lida Kalouli, Rebecca Kehlbeck, Rita Sevastjanova, Katharina Kaiser, Georg A. Kaiser, Miriam Butt
The study of language change through parallel corpora can be advantageous for the analysis of complex interactions between time, text domain and language.
no code implementations • WS 2019 • Christin Sch{\"a}tzle, Frederik L. Dennig, Michael Blumenschein, Daniel A. Keim, Miriam Butt
This paper presents a significant extension of HistoBankVis, a multilayer visualization system which allows a fast and interactive exploration of complex linguistic data.
no code implementations • WS 2019 • Kengatharaiyer Sarveswaran, Gihan Dias, Miriam Butt
This paper describes a new and larger coverage Finite-State Morphological Analyser (FSM) and Generator for the Dravidian language Tamil.
no code implementations • LREC 2020 • Toqeer Ehsan, Miriam Butt
This paper adds to the available resources for the under-resourced language Urdu by converting different types of existing treebanks for Urdu into a common format that is based on Universal Dependencies.
no code implementations • COLING (LAW) 2020 • Christin Beck, Hannah Booth, Mennatallah El-Assady, Miriam Butt
The development of linguistic corpora is fraught with various problems of annotation and representation.
1 code implementation • IWCS (ACL) 2021 • Aikaterini-Lida Kalouli, Rebecca Kehlbeck, Rita Sevastjanova, Oliver Deussen, Daniel Keim, Miriam Butt
Research in NLP has mainly focused on factoid questions, with the goal of finding quick and reliable ways of matching a query to an answer.
no code implementations • LaTeCHCLfL (COLING) 2022 • Wassiliki Siskou, Clara Giralt Mirón, Sarah Molina-Raith, Miriam Butt
While previous research has primarily looked at how calls-to-action (CTAs) were used in Twitter messages from non-profit organizations and protest mobilization, we are interested in identifying the linguistic cues used in CTAs found on Facebook and Twitter for an automatic identification of CTAs.