no code implementations • MTSummit 2021 • John E. Ortega, Richard Alexander Castro Mamani, Jaime Rafael Montoya Samame
Low-resource languages sometimes take on similar morphological and syntactic characteristics due to their geographic nearness and shared history.
no code implementations • TDLE (LREC) 2022 • Iria de-Dios-Flores, Carmen Magariños, Adina Ioana Vladu, John E. Ortega, José Ramom Pichel, Marcos García, Pablo Gamallo, Elisa Fernández Rei, Alberto Bugarín-Diz, Manuel González González, Senén Barro, Xosé Luis Regueira
The development of language technologies (LTs) such as machine translation, text analytics, and dialogue systems is essential in the current digital society, culture and economy.
Cultural Vocal Bursts Intensity Prediction
Machine Translation
+2
no code implementations • COLING 2022 • Nelsi Melgarejo, Rodolfo Zevallos, Hector Gomez, John E. Ortega
In the effort to minimize the risk of extinction of a language, linguistic resources are fundamental.
no code implementations • 20 Dec 2024 • Annika Marie Schoene, John E. Ortega, Rodolfo Joel Zevallos, Laura Haaber Ihle
Previous work has used English dictionaries related to suicide to translate into different target languages due to lack of other available resources.
no code implementations • 20 Dec 2024 • Rodolfo Zevallos, Annika Schoene, John E. Ortega
Suicidal ideation is a serious health problem affecting millions of people worldwide.
1 code implementation • 18 Dec 2024 • Kenneth Church, Raman Chandrasekar, John E. Ortega, Ibrahim Said Ahmad
How effective is peer-reviewing in identifying important papers?
no code implementations • 18 Dec 2024 • Adam Meyers, Advait Pravin Savant, John E. Ortega
This article is about Semantic Role Labeling for English partitive nouns (5%/REL of the price/ARG1; The price/ARG1 rose 5 percent/REL) in the NomBank annotated corpus.
no code implementations • 18 Dec 2024 • Lisa Wang, Adam Meyers, John E. Ortega, Rodolfo Zevallos
Translating between languages with drastically different grammatical conventions poses challenges, not just for human interpreters but also for machine translation systems.
no code implementations • 8 Oct 2024 • Jordan Miner, John E. Ortega
Our results show that through the use of advanced NLP techniques (both supervised and unsupervised) toxicity and other attributes about language before and after a conflict is predictable with a low error of nearly 1. 2 percent for both conflicts.
no code implementations • 26 Sep 2024 • Richard Yue, John E. Ortega, Kenneth Ward Church
The typical workflow for a professional translator to translate a document from its source language (SL) to a target language (TL) is not always focused on what many language models in natural language processing (NLP) do - predict the next word in a series of words.
no code implementations • 26 Sep 2024 • Richard Yue, John E. Ortega
Many CAT tools offer a fuzzy-match algorithm to locate segments (s) in the TM that are close in distance to s'.
no code implementations • 2 Jul 2024 • John E. Ortega, Ibrahim Said Ahmad, William Chen
Nollywood, based on the idea of Bollywood from India, is a series of outstanding movies that originate from Nigeria.
no code implementations • 5 Oct 2023 • Chih-Chen Chen, William Chen, Rodolfo Zevallos, John E. Ortega
The application of self-supervision to speech representation learning has garnered significant interest in recent years, due to its scalability to large amounts of unlabeled data.
1 code implementation • 17 Apr 2023 • Shubham Vatsal, Adam Meyers, John E. Ortega
We compare our results for two classification tasks: (1) a broad classification task with 15 categories and (2) a fine-grained classification task with 279 categories.
1 code implementation • 15 Feb 2023 • Abteen Ebrahimi, Arya D. McCarthy, Arturo Oncevay, Luis Chiruzzo, John E. Ortega, Gustavo A. Giménez-Lugo, Rolando Coto-Solano, Katharina Kann
However, the languages most in need of automatic alignment are low-resource and, thus, not typically included in the pretraining data.
no code implementations • 5 Dec 2022 • Ayush Singh, John E. Ortega
However, PLMs have been found to degrade in performance under distribution shift, a phenomenon that occurs when data at test-time does not come from the same distribution as the source training set.
no code implementations • 22 Jun 2022 • John E. Ortega
This work provides a survey of several networking cipher algorithms and proposes a method for integrating natural language processing (NLP) as a protective agent for them.