Search Results for author: Anand Kumar Sah

Found 3 papers, 1 paper with code

Can Perplexity Predict Fine-Tuning Performance? An Investigation of Tokenization Effects on Sequential Language Models for Nepali

no code implementations • 28 Apr 2024 • Nishant Luitel, Nirajan Bekoju, Anand Kumar Sah, Subarna Shakya

To reduce this gap, we used six different tokenization schemes to pretrain relatively small language models in Nepali and used the learned representations to fine-tune on several downstream tasks.

Language Modelling

Contextual Spelling Correction with Language Model for Low-resource Setting

no code implementations • 28 Apr 2024 • Nishant Luitel, Nirajan Bekoju, Anand Kumar Sah, Subarna Shakya

The task of Spell Correction (SC) in low-resource languages presents a significant challenge because only a limited corpus of data is available and there are no annotated spelling correction datasets.

Language Modelling, Spelling Correction
