no code implementations • RANLP 2019 • Sanja {\v{S}}tajner, Maja Popovi{\'c}
We use the state-of-the-art automatic text simplification (ATS) system for lexically and syntactically simplifying source sentences, which are then translated with two state-of-the-art English-to-Serbian MT systems, the phrase-based MT (PBMT) and the neural MT (NMT).
no code implementations • ACL 2019 • Ioana Hulpu{\textcommabelow{s}}, Sanja {\v{S}}tajner, Heiner Stuckenschmidt
We propose an unsupervised approach for assessing conceptual complexity of texts, based on spreading activation.
no code implementations • SEMEVAL 2019 • Angelo Basile, Marc Franco-Salvador, Neha Pawar, Sanja {\v{S}}tajner, Mara Chinea Rios, Yassine Benajiba
In this paper, we present our participation to the EmoContext shared task on detecting emotions in English textual conversations between a human and a chatbot.
Ranked #2 on Emotion Recognition in Conversation on EC
no code implementations • COLING 2018 • Sanja {\v{S}}tajner, Ioana Hulpu{\c{s}}
Complexity of texts is usually assessed only at the lexical and syntactic levels.
no code implementations • WS 2018 • Christoph Kilian Theil, Sanja {\v{S}}tajner, Heiner Stuckenschmidt
In this paper, we use NLP techniques to detect linguistic uncertainty in financial disclosures.
no code implementations • IJCNLP 2017 • Seid Muhie Yimam, Sanja {\v{S}}tajner, Martin Riedl, Chris Biemann
Complex word identification (CWI) is an important task in text accessibility.
no code implementations • WS 2017 • Sanja {\v{S}}tajner, Victoria Yaneva, Ruslan Mitkov, Simone Paolo Ponzetto
Eye tracking studies from the past few decades have shaped the way we think of word complexity and cognitive load: words that are long, rare and ambiguous are more difficult to read.
no code implementations • RANLP 2017 • Seid Muhie Yimam, Sanja {\v{S}}tajner, Martin Riedl, Chris Biemann
Complex Word Identification (CWI) is an important task in lexical simplification and text accessibility.
1 code implementation • ACL 2017 • Sergiu Nisioi, Sanja {\v{S}}tajner, Simone Paolo Ponzetto, Liviu P. Dinu
Unlike the previously proposed automated TS systems, our neural text simplification (NTS) systems are able to simultaneously perform lexical simplification and content reduction.
Ranked #14 on Text Simplification on TurkCorpus
no code implementations • ACL 2017 • Sanja {\v{S}}tajner, Marc Franco-Salvador, Simone Paolo Ponzetto, Paolo Rosso, Heiner Stuckenschmidt
We provide several methods for sentence-alignment of texts with different complexity levels.
no code implementations • LREC 2016 • Sanja {\v{S}}tajner, Andreia Querido, Nuno Rendeiro, Jo{\~a}o Ant{\'o}nio Rodrigues, Ant{\'o}nio Branco
In this paper, we address the problem of Machine Translation (MT) for a specialised domain in a language pair for which only a very small domain-specific parallel corpus is available.
no code implementations • LREC 2016 • Jo{\~a}o Ant{\'o}nio Rodrigues, Nuno Rendeiro, Andreia Querido, Sanja {\v{S}}tajner, Ant{\'o}nio Branco
The usual concern when opting for a rule-based or a hybrid machine translation (MT) system is how much effort is required to adapt the system to a different language pair or a new domain.
no code implementations • LREC 2012 • Sanja {\v{S}}tajner, Ruslan Mitkov
In British English, we compared the complexity of texts published in 1931, 1961 and 1991, while in American English we compared the complexity of texts published in 1961 and 1992.