Search Results for author: Kemal Kurniawan

Found 13 papers, 9 papers with code

MoDEM: Mixture of Domain Expert Models

no code implementations9 Oct 2024 Toby Simonds, Kemal Kurniawan, Jey Han Lau

We propose a novel approach to enhancing the performance and efficiency of large language models (LLMs) by combining domain prompt routing with domain-specialized models.

Generating bilingual example sentences with large language models as lexicography assistants

1 code implementation4 Oct 2024 Raphael Merx, Ekaterina Vylomova, Kemal Kurniawan

Additionally, we explore the use of pre-trained language models for automated rating of examples, finding that sentence perplexity serves as a good proxy for typicality and intelligibility in higher-resourced languages.

In-Context Learning Informativeness +1

To Aggregate or Not to Aggregate. That is the Question: A Case Study on Annotation Subjectivity in Span Prediction

1 code implementation5 Aug 2024 Kemal Kurniawan, Meladel Mistica, Timothy Baldwin, Jey Han Lau

This paper explores the task of automatic prediction of text spans in a legal problem description that support a legal area label.

Unsupervised Cross-Lingual Transfer of Structured Predictors without Source Data

1 code implementation NAACL 2022 Kemal Kurniawan, Lea Frermann, Philip Schulz, Trevor Cohn

Providing technologies to communities or domains where training data is scarce or protected e. g., for privacy reasons, is becoming increasingly important.

Cross-Lingual Transfer Dependency Parsing +1

PTST-UoM at SemEval-2021 Task 10: Parsimonious Transfer for Sequence Tagging

no code implementations SEMEVAL 2021 Kemal Kurniawan, Lea Frermann, Philip Schulz, Trevor Cohn

This paper describes PTST, a source-free unsupervised domain adaptation technique for sequence tagging, and its application to the SemEval-2021 Task 10 on time expression recognition.

Unsupervised Domain Adaptation

PPT: Parsimonious Parser Transfer for Unsupervised Cross-Lingual Adaptation

1 code implementation EACL 2021 Kemal Kurniawan, Lea Frermann, Philip Schulz, Trevor Cohn

Cross-lingual transfer is a leading technique for parsing low-resource languages in the absence of explicit supervision.

ARC Cross-Lingual Transfer

KaWAT: A Word Analogy Task Dataset for Indonesian

2 code implementations17 Jun 2019 Kemal Kurniawan

We introduced KaWAT (Kata Word Analogy Task), a new word analogy task dataset for Indonesian.

Word Embeddings

IndoSum: A New Benchmark Dataset for Indonesian Text Summarization

1 code implementation12 Oct 2018 Kemal Kurniawan, Samuel Louvan

Automatic text summarization is generally considered as a challenging task in the NLP community.

Extractive Summarization Text Summarization

Toward a Standardized and More Accurate Indonesian Part-of-Speech Tagging

1 code implementation10 Sep 2018 Kemal Kurniawan, Alham Fikri Aji

Previous work in Indonesian part-of-speech (POS) tagging are hard to compare as they are not evaluated on a common dataset.

Part-Of-Speech Tagging POS +1

Multi-Task Active Learning for Neural Semantic Role Labeling on Low Resource Conversational Corpus

no code implementations WS 2018 Fariz Ikhwantri, Samuel Louvan, Kemal Kurniawan, Bagas Abisena, Valdi Rachman, Alfan Farizki Wicaksono, Rahmad Mahendra

In this paper, we propose a Multi-Task Active Learning framework for Semantic Role Labeling with Entity Recognition (ER) as the auxiliary task to alleviate the need for extensive data and use additional information from ER to help SRL.

Active Learning Multi-Task Learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.