Search Results for author: Nisansa de Silva

Found 38 papers, 7 papers with code

Legal Case Winning Party Prediction With Domain Specific Auxiliary Models

no code implementations ROCLING 2022 Sahan Jayasinghe, Lakith Rambukkanage, Ashan Silva, Nisansa de Silva, Amal Shehan Perera

The model is built with and experimented using legal domain specific sub-models to provide more visibility to the final model, along with other variations.

Sentence Sentence Embedding +1

M2DS: Multilingual Dataset for Multi-document Summarisation

no code implementations17 Jul 2024 Kushan Hewapathirana, Nisansa de Silva, C. D. Athuraliya

This paper introduces M2DS, emphasising its unique multilingual aspect, and includes baseline scores from state-of-the-art MDS models evaluated on our dataset.

SHADE: Semantic Hypernym Annotator for Domain-specific Entities -- DnD Domain Use Case

no code implementations29 Jun 2024 Akila Peiris, Nisansa de Silva

Such an app can ensure that the notations are consistent, and the labels can be pre-defined or restricted reducing the room for errors.

Shoulders of Giants: A Look at the Degree and Utility of Openness in NLP Research

no code implementations10 Jun 2024 Surangika Ranathunga, Nisansa de Silva, Dilith Jayakody, Aloka Fernando

We analysed a sample of NLP research papers archived in ACL Anthology as an attempt to quantify the degree of openness and the benefit of such an open culture in the NLP community.

Fine Tuning Named Entity Extraction Models for the Fantasy Domain

no code implementations16 Feb 2024 Aravinth Sivaganeshan, Nisansa de Silva

This work uses available lore of monsters in the D&D domain to fine-tune Trankit, which is a prolific NER framework that uses a pre-trained model for NER.

Miscellaneous named-entity-recognition +3

Quality Does Matter: A Detailed Look at the Quality and Utility of Web-Mined Parallel Corpora

1 code implementation12 Feb 2024 Surangika Ranathunga, Nisansa de Silva, Menan Velayuthan, Aloka Fernando, Charitha Rathnayake

We conducted a detailed analysis on the quality of web-mined corpora for two low-resource languages (making three language pairs, English-Sinhala, English-Tamil and Sinhala-Tamil).

Machine Translation NMT +1

Sinhala-English Word Embedding Alignment: Introducing Datasets and Benchmark for a Low Resource Language

no code implementations17 Nov 2023 Kasun Wickramasinghe, Nisansa de Silva

In this paper, we try to align Sinhala and English word embedding spaces based on available alignment techniques and introduce a benchmark for Sinhala language embedding alignment.

Multi-document Summarization: A Comparative Evaluation

no code implementations10 Sep 2023 Kushan Hewapathirana, Nisansa de Silva, C. D. Athuraliya

This work serves as a reference for future MDS research and contributes to the development of accurate and robust models which can be utilized on demanding datasets with academically and/or scientifically complex data as well as generalized, relatively simple datasets.

Document Summarization Multi-Document Summarization

Sinhala-English Parallel Word Dictionary Dataset

1 code implementation4 Aug 2023 Kasun Wickramasinghe, Nisansa de Silva

However, in the cases where one of the considered language pairs is a low-resource language, the existing top-down parallel data such as corpora are lacking in both tally and quality due to the dearth of human annotation.

Machine Translation Sentence

Synthesis and Evaluation of a Domain-specific Large Data Set for Dungeons & Dragons

1 code implementation18 Dec 2022 Akila Peiris, Nisansa de Silva

This paper introduces the Forgotten Realms Wiki (FRW) data set and domain specific natural language generation using FRW along with related analyses.

Text Generation

Some Languages are More Equal than Others: Probing Deeper into the Linguistic Disparity in the NLP World

1 code implementation16 Oct 2022 Surangika Ranathunga, Nisansa de Silva

Using an existing language categorisation based on speaker population and vitality, we analyse the distribution of language data resources, amount of NLP/CL research, inclusion in multilingual web-based platforms and the inclusion in pre-trained multilingual models.

Selecting Seed Words for Wordle using Character Statistics

no code implementations7 Feb 2022 Nisansa de Silva

Wordle, a word guessing game rose to global popularity in the January of 2022.

Seeking Sinhala Sentiment: Predicting Facebook Reactions of Sinhala Posts

no code implementations1 Dec 2021 Vihanga Jayawickrama, Gihan Weeraprameshwara, Nisansa de Silva, Yudhanjaya Wijeratne

This paper uses millions of such reactions, derived from a decade worth of Facebook post data centred around a Sri Lankan context, to model an eye of the beholder approach to sentiment detection for online Sinhala textual content.

Binary Classification Sentiment Analysis

Semantic Oppositeness Assisted Deep Contextual Modeling for Automatic Rumor Detection in Social Networks

no code implementations EACL 2021 Nisansa de Silva, Dejing Dou

Social networks face a major challenge in the form of rumors and fake news, due to their intrinsic nature of connecting users to millions of others, and of giving any individual the power to post anything.

Semantic Similarity Semantic Textual Similarity

Effective Approach to Develop a Sentiment Annotator For Legal Domain in a Low Resource Setting

no code implementations PACLIC 2020 Gathika Ratnayaka, Nisansa de Silva, Amal Shehan Perera, Ramesh Pathirana

Analyzing the sentiments of legal opinions available in Legal Opinion Texts can facilitate several use cases such as legal judgement prediction, contradictory statements identification and party-based sentiment analysis.

Sentiment Analysis

Sinhala Language Corpora and Stopwords from a Decade of Sri Lankan Facebook

1 code implementation15 Jul 2020 Yudhanjaya Wijeratne, Nisansa de Silva

This paper presents two colloquial Sinhala language corpora from the language efforts of the Data, Analysis and Policy team of LIRNEasia, as well as a list of algorithmically derived stopwords.

Shift-of-Perspective Identification Within Legal Cases

no code implementations6 Jun 2019 Gathika Ratnayaka, Thejan Rupasinghe, Nisansa de Silva, Viraj Salaka Gamage, Menuka Warushavithana, Amal Shehan Perera

Therefore, the process of automatic information extraction from documents containing legal opinions related to court cases can be considered to be of significant importance.

Open Information Extraction Sentiment Analysis

Survey on Publicly Available Sinhala Natural Language Processing Tools and Research

1 code implementation5 Jun 2019 Nisansa de Silva

Sinhala is the native language of the Sinhalese people who make up the largest ethnic group of Sri Lanka.

Logic Rules Powered Knowledge Graph Embedding

no code implementations9 Mar 2019 Pengwei Wang, Dejing Dou, Fangzhao Wu, Nisansa de Silva, Lianwen Jin

And then, to put both triples and mined logic rules within the same semantic space, all triples in the knowledge graph are represented as first-order logic.

Knowledge Graph Embedding Link Prediction +1

Identifying Relationships Among Sentences in Court Case Transcripts Using Discourse Relations

no code implementations10 Sep 2018 Gathika Ratnayaka, Thejan Rupasinghe, Nisansa de Silva, Menuka Warushavithana, Viraj Gamage, Amal Shehan Perera

To the best of our knowledge, this is the first study where discourse relationships between sentences have been used to determine relationships among sentences in legal court case transcripts.

Legal Document Retrieval using Document Vector Embeddings and Deep Learning

no code implementations27 May 2018 Keet Sugathadasa, Buddhi Ayesha, Nisansa de Silva, Amal Shehan Perera, Vindula Jayawardana, Dimuthu Lakmal, Madhavi Perera

The ensemble model built in this study, shows a significantly higher accuracy level, which indeed proves the need for incorporation of domain specific semantic similarity measures into the information retrieval process.

Deep Learning Information Retrieval +5

Semi-Supervised Instance Population of an Ontology using Word Vector Embeddings

no code implementations9 Sep 2017 Vindula Jayawardana, Dimuthu Lakmal, Nisansa de Silva, Amal Shehan Perera, Keet Sugathadasa, Buddhi Ayesha, Madhavi Perera

With the use of word embeddings in the field of natural language processing, it became a popular topic due to its ability to cope up with semantic sensitivity.

Management Word Embeddings

Cannot find the paper you are looking for? You can Submit a new open access paper.