no code implementations • 9 Oct 2024 • Toby Simonds, Kemal Kurniawan, Jey Han Lau
We propose a novel approach to enhancing the performance and efficiency of large language models (LLMs) by combining domain prompt routing with domain-specialized models.
1 code implementation • 4 Oct 2024 • Raphael Merx, Ekaterina Vylomova, Kemal Kurniawan
Additionally, we explore the use of pre-trained language models for automated rating of examples, finding that sentence perplexity serves as a good proxy for typicality and intelligibility in higher-resourced languages.
1 code implementation • 5 Aug 2024 • Kemal Kurniawan, Meladel Mistica, Timothy Baldwin, Jey Han Lau
This paper explores the task of automatic prediction of text spans in a legal problem description that support a legal area label.
2 code implementations • 31 May 2022 • Genta Indra Winata, Alham Fikri Aji, Samuel Cahyawijaya, Rahmad Mahendra, Fajri Koto, Ade Romadhony, Kemal Kurniawan, David Moeljadi, Radityo Eko Prasojo, Pascale Fung, Timothy Baldwin, Jey Han Lau, Rico Sennrich, Sebastian Ruder
In this work, we focus on developing resources for languages in Indonesia.
no code implementations • ACL 2022 • Alham Fikri Aji, Genta Indra Winata, Fajri Koto, Samuel Cahyawijaya, Ade Romadhony, Rahmad Mahendra, Kemal Kurniawan, David Moeljadi, Radityo Eko Prasojo, Timothy Baldwin, Jey Han Lau, Sebastian Ruder
NLP research is impeded by a lack of resources and awareness of the challenges presented by underrepresented languages and dialects.
1 code implementation • NAACL 2022 • Kemal Kurniawan, Lea Frermann, Philip Schulz, Trevor Cohn
Providing technologies to communities or domains where training data is scarce or protected e. g., for privacy reasons, is becoming increasingly important.
no code implementations • SEMEVAL 2021 • Kemal Kurniawan, Lea Frermann, Philip Schulz, Trevor Cohn
This paper describes PTST, a source-free unsupervised domain adaptation technique for sequence tagging, and its application to the SemEval-2021 Task 10 on time expression recognition.
1 code implementation • EACL 2021 • Kemal Kurniawan, Lea Frermann, Philip Schulz, Trevor Cohn
Cross-lingual transfer is a leading technique for parsing low-resource languages in the absence of explicit supervision.
2 code implementations • 17 Jun 2019 • Kemal Kurniawan
We introduced KaWAT (Kata Word Analogy Task), a new word analogy task dataset for Indonesian.
1 code implementation • 12 Oct 2018 • Kemal Kurniawan, Samuel Louvan
Automatic text summarization is generally considered as a challenging task in the NLP community.
1 code implementation • 10 Sep 2018 • Kemal Kurniawan, Alham Fikri Aji
Previous work in Indonesian part-of-speech (POS) tagging are hard to compare as they are not evaluated on a common dataset.
no code implementations • WS 2018 • Fariz Ikhwantri, Samuel Louvan, Kemal Kurniawan, Bagas Abisena, Valdi Rachman, Alfan Farizki Wicaksono, Rahmad Mahendra
In this paper, we propose a Multi-Task Active Learning framework for Semantic Role Labeling with Entity Recognition (ER) as the auxiliary task to alleviate the need for extensive data and use additional information from ER to help SRL.
1 code implementation • WS 2018 • Kemal Kurniawan, Samuel Louvan
We report an empirical evaluation of neural sequence labeling models with character embedding to tackle NER task in Indonesian conversational texts.