Search Results for author: Ehsan Doostmohammadi

Found 11 papers, 4 papers with code

On the Effects of Video Grounding on Language Models

no code implementations MMMPIE (COLING) 2022 Ehsan Doostmohammadi, Marco Kuhlmann

The results show that the smaller model benefits from video grounding in predicting highly imageable words, while the results for the larger model seem harder to interpret. of lack of grounding, e. g., addressing issues like models’ insufficient commonsense knowledge.

Image Captioning Question Answering +2

How Reliable Are Automatic Evaluation Methods for Instruction-Tuned LLMs?

no code implementations16 Feb 2024 Ehsan Doostmohammadi, Oskar Holmström, Marco Kuhlmann

Work on instruction-tuned Large Language Models (LLMs) has used automatic methods based on text overlap and LLM judgments as cost-effective alternatives to human evaluation.

Cross-Lingual Transfer

Surface-Based Retrieval Reduces Perplexity of Retrieval-Augmented Language Models

1 code implementation25 May 2023 Ehsan Doostmohammadi, Tobias Norlund, Marco Kuhlmann, Richard Johansson

Inspired by this, we replace the semantic retrieval in Retro with a surface-level method based on BM25, obtaining a significant reduction in perplexity.

Re-Ranking Retrieval +1

On the Generalization Ability of Retrieval-Enhanced Transformers

no code implementations23 Feb 2023 Tobias Norlund, Ehsan Doostmohammadi, Richard Johansson, Marco Kuhlmann

Recent work on the Retrieval-Enhanced Transformer (RETRO) model has shown that off-loading memory from trainable weights to a retrieval database can significantly improve language modeling and match the performance of non-retrieval models that are an order of magnitude larger in size.

Language Modelling Retrieval

SINA-BERT: A pre-trained Language Model for Analysis of Medical Texts in Persian

no code implementations15 Apr 2021 Nasrin Taghizadeh, Ehsan Doostmohammadi, Elham Seifossadat, Hamid R. Rabiee, Maedeh S. Tahaei

We have released Sina-BERT, a language model pre-trained on BERT (Devlin et al., 2018) to address the lack of a high-quality Persian language model in the medical domain.

Language Modelling Retrieval +1

Joint Persian Word Segmentation Correction and Zero-Width Non-Joiner Recognition Using BERT

no code implementations COLING 2020 Ehsan Doostmohammadi, Minoo Nassajian, Adel Rahimi

Words are properly segmented in the Persian writing system; in practice, however, these writing rules are often neglected, resulting in single words being written disjointedly and multiple words written without any white spaces between them.

PerKey: A Persian News Corpus for Keyphrase Extraction and Generation

no code implementations25 Sep 2020 Ehsan Doostmohammadi, Mohammad Hadi Bokaei, Hossein Sameti

Since previous studies on Persian keyword or keyphrase extraction have not published their data, the field suffers from the lack of a human extracted keyphrase dataset.

Information Retrieval Keyphrase Extraction +2

Investigating Machine Learning Methods for Language and Dialect Identification of Cuneiform Texts

no code implementations WS 2019 Ehsan Doostmohammadi, Minoo Nassajian

Identification of the languages written using cuneiform symbols is a difficult task due to the lack of resources and the problem of tokenization.

BIG-bench Machine Learning Dialect Identification

Cannot find the paper you are looking for? You can Submit a new open access paper.