Search Results for author: Ehsan Shareghi

Found 33 papers, 20 papers with code

On the Effect of Isotropy on VAE Representations of Text

1 code implementation ACL 2022 Lan Zhang, Wray Buntine, Ehsan Shareghi

Injecting desired geometric properties into text representations has attracted a lot of attention.

Integrating Transformers and Knowledge Graphs for Twitter Stance Detection

no code implementations WNUT (ACL) 2021 Thomas Clark, Costanza Conforti, Fangyu Liu, Zaiqiao Meng, Ehsan Shareghi, Nigel Collier

Stance detection (SD) entails classifying the sentiment of a text towards a given target, and is a relevant sub-task for opinion mining and social media analysis.

Knowledge Graphs Knowledge Probing +2

Fire Burns, Sword Cuts: Commonsense Inductive Bias for Exploration in Text-based Games

1 code implementation ACL 2022 Dongwon Ryu, Ehsan Shareghi, Meng Fang, Yunqiu Xu, Shirui Pan, Reza Haf

Text-based games (TGs) are exciting testbeds for developing deep reinforcement learning techniques due to their partially observed environments and large action spaces.

Efficient Exploration Inductive Bias +2

Harnessing the Power of Large Language Models for Natural Language to First-Order Logic Translation

1 code implementation24 May 2023 Yuan Yang, Siheng Xiong, Ali Payani, Ehsan Shareghi, Faramarz Fekri

Translating natural language sentences to first-order logic (NL-FOL translation) is a longstanding challenge in the NLP and formal logic literature.

Formal Logic Translation

Koala: An Index for Quantifying Overlaps with Pre-training Corpora

no code implementations26 Mar 2023 Thuy-Trang Vu, Xuanli He, Gholamreza Haffari, Ehsan Shareghi

In very recent years more attention has been placed on probing the role of pre-training data in Large Language Models (LLMs) downstream behaviour.

Memorization

Plug-and-Play Recipe Generation with Content Planning

no code implementations9 Dec 2022 Yinhong Liu, Yixuan Su, Ehsan Shareghi, Nigel Collier

Specifically, it optimizes the joint distribution of the natural language sequence and the global content plan in a plug-and-play manner.

Recipe Generation Text Generation

Self-supervised Graph Masking Pre-training for Graph-to-Text Generation

1 code implementation19 Oct 2022 Jiuzhou Han, Ehsan Shareghi

Large-scale pre-trained language models (PLMs) have advanced Graph-to-Text (G2T) generation by processing the linearised version of a graph.

Text Generation

RedApt: An Adaptor for wav2vec 2 Encoding \\ Faster and Smaller Speech Translation without Quality Compromise

no code implementations16 Oct 2022 Jinming Zhao, Hao Yang, Gholamreza Haffari, Ehsan Shareghi

Pre-trained speech Transformers in speech translation (ST) have facilitated state-of-the-art (SotA) results; yet, using such encoders is computationally expensive.

Translation

Generating Synthetic Speech from SpokenVocab for Speech Translation

1 code implementation15 Oct 2022 Jinming Zhao, Gholamreza Haffar, Ehsan Shareghi

Training end-to-end speech translation (ST) systems requires sufficiently large-scale data, which is unavailable for most language pairs and domains.

Data Augmentation Machine Translation +1

On Reality and the Limits of Language Data: Aligning LLMs with Human Norms

no code implementations25 Aug 2022 Nigel H. Collier, Fangyu Liu, Ehsan Shareghi

Recent advancements in Large Language Models (LLMs) harness linguistic associations in vast natural language data for practical applications.

Common Sense Reasoning

M-Adapter: Modality Adaptation for End-to-End Speech-to-Text Translation

1 code implementation3 Jul 2022 Jinming Zhao, Hao Yang, Ehsan Shareghi, Gholamreza Haffari

End-to-end speech-to-text translation models are often initialized with pre-trained speech encoder and pre-trained text decoder.

Speech-to-Text Translation Translation

Rewire-then-Probe: A Contrastive Recipe for Probing Biomedical Knowledge of Pre-trained Language Models

1 code implementation ACL 2022 Zaiqiao Meng, Fangyu Liu, Ehsan Shareghi, Yixuan Su, Charlotte Collins, Nigel Collier

To catalyse the research in this direction, we release a well-curated biomedical knowledge probing benchmark, MedLAMA, which is constructed based on the Unified Medical Language System (UMLS) Metathesaurus.

Knowledge Probing Transfer Learning

The Neglected Sibling: Isotropic Gaussian Posterior for VAE

1 code implementation14 Oct 2021 Lan Zhang, Wray Buntine, Ehsan Shareghi

Deep generative models have been widely used in several areas of NLP, and various techniques have been proposed to augment them or address their training challenges.

Unsupervised Representation Disentanglement of Text: An Evaluation on Synthetic Datasets

1 code implementation ACL (RepL4NLP) 2021 Lan Zhang, Victor Prokhorov, Ehsan Shareghi

To highlight the challenges of achieving representation disentanglement for text domain in an unsupervised setting, in this paper we select a representative set of successfully applied models from the image domain.

Disentanglement Inductive Bias

Combining Deep Generative Models and Multi-lingual Pretraining for Semi-supervised Document Classification

1 code implementation EACL 2021 Yi Zhu, Ehsan Shareghi, Yingzhen Li, Roi Reichart, Anna Korhonen

Semi-supervised learning through deep generative models and multi-lingual pretraining techniques have orchestrated tremendous success across different areas of NLP.

Classification Document Classification +1

A Closer Look at Few-Shot Crosslingual Transfer: The Choice of Shots Matters

no code implementations ACL 2021 Mengjie Zhao, Yi Zhu, Ehsan Shareghi, Ivan Vulić, Roi Reichart, Anna Korhonen, Hinrich Schütze

Few-shot crosslingual transfer has been shown to outperform its zero-shot counterpart with pretrained encoders like multilingual BERT.

Few-Shot Learning

Self-Alignment Pretraining for Biomedical Entity Representations

1 code implementation NAACL 2021 Fangyu Liu, Ehsan Shareghi, Zaiqiao Meng, Marco Basaldella, Nigel Collier

Despite the widespread success of self-supervised learning via masked language models (MLM), accurately capturing fine-grained semantic relationships in the biomedical domain remains a challenge.

Benchmarking Entity Linking +2

COMETA: A Corpus for Medical Entity Linking in the Social Media

1 code implementation EMNLP 2020 Marco Basaldella, Fangyu Liu, Ehsan Shareghi, Nigel Collier

Whilst there has been growing progress in Entity Linking (EL) for general language, existing datasets fail to address the complex nature of health terminology in layman's language.

Entity Linking

Learning Sparse Sentence Encoding without Supervision: An Exploration of Sparsity in Variational Autoencoders

1 code implementation ACL (RepL4NLP) 2021 Victor Prokhorov, Yingzhen Li, Ehsan Shareghi, Nigel Collier

It has been long known that sparsity is an effective inductive bias for learning efficient representation of data in vectors with fixed dimensionality, and it has been explored in many areas of representation learning.

Inductive Bias Representation Learning +2

On the Importance of the Kullback-Leibler Divergence Term in Variational Autoencoders for Text Generation

1 code implementation WS 2019 Victor Prokhorov, Ehsan Shareghi, Yingzhen Li, Mohammad Taher Pilehvar, Nigel Collier

While the explicit constraint naturally avoids posterior collapse, we use it to further understand the significance of the KL term in controlling the information transmitted through the VAE channel.

Text Generation

Bayesian Learning for Neural Dependency Parsing

no code implementations NAACL 2019 Ehsan Shareghi, Yingzhen Li, Yi Zhu, Roi Reichart, Anna Korhonen

While neural dependency parsers provide state-of-the-art accuracy for several languages, they still rely on large amounts of costly labeled training data.

Dependency Parsing POS +1

Structured Prediction of Sequences and Trees using Infinite Contexts

no code implementations9 Mar 2015 Ehsan Shareghi, Gholamreza Haffari, Trevor Cohn, Ann Nicholson

Linguistic structures exhibit a rich array of global phenomena, however commonly used Markov models are unable to adequately describe these phenomena due to their strong locality assumptions.

Part-Of-Speech Tagging Structured Prediction

Cannot find the paper you are looking for? You can Submit a new open access paper.