1 code implementation • 18 Feb 2024 • Ikuya Yamada, Ryokan Ri
In this study, we introduce LEIA, a language adaptation tuning method that utilizes Wikipedia entity names aligned across languages.
1 code implementation • 23 May 2023 • Shohei Higashiyama, Hiroki Ouchi, Hiroki Teranishi, Hiroyuki Otomo, Yusuke Ide, Aitaro Yamamoto, Hiroyuki Shindo, Yuki Matsuda, Shoko Wakamiya, Naoya Inoue, Ikuya Yamada, Taro Watanabe
Geoparsing is a fundamental technique for analyzing geo-entity information in text.
no code implementations • NAACL (MIA) 2022 • Akari Asai, Shayne Longpre, Jungo Kasai, Chia-Hsuan Lee, Rui Zhang, Junjie Hu, Ikuya Yamada, Jonathan H. Clark, Eunsol Choi
We present the results of the Workshop on Multilingual Information Access (MIA) 2022 Shared Task, evaluating cross-lingual open-retrieval question answering (QA) systems in 16 typologically diverse languages.
1 code implementation • NAACL 2022 • Sosuke Nishikawa, Ryokan Ri, Ikuya Yamada, Yoshimasa Tsuruoka, Isao Echizen
We present EASE, a novel method for learning sentence embeddings via contrastive learning between sentences and their related entities.
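The entry above describes contrastive learning between sentences and their related entities. As a minimal illustration (not the EASE implementation; the function name, temperature value, and in-batch-negative setup are assumptions), an InfoNCE-style loss can treat each sentence's own related entity as the positive and the other entities in the batch as negatives:

```python
import numpy as np

def entity_contrastive_loss(sent_emb, ent_emb, temperature=0.05):
    """InfoNCE-style loss: each sentence should be closest to its own
    related entity's embedding; other in-batch entities act as negatives."""
    # L2-normalize so dot products are cosine similarities
    s = sent_emb / np.linalg.norm(sent_emb, axis=1, keepdims=True)
    e = ent_emb / np.linalg.norm(ent_emb, axis=1, keepdims=True)
    logits = s @ e.T / temperature  # (batch, batch) similarity matrix
    # cross-entropy with the diagonal (aligned pairs) as the gold labels
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -np.mean(np.diag(log_probs))
```

Correctly aligned sentence–entity pairs yield a lower loss than mismatched ones, which is what drives the embeddings together during training.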
2 code implementations • ACL 2022 • Ryokan Ri, Ikuya Yamada, Yoshimasa Tsuruoka
We train a multilingual language model in 24 languages with entity representations and show that it consistently outperforms word-based pretrained models on various cross-lingual transfer tasks.
Ranked #1 on Cross-Lingual Question Answering on XQuAD (Average F1 metric, using extra training data)
no code implementations • 15 Oct 2021 • Sosuke Nishikawa, Ikuya Yamada, Yoshimasa Tsuruoka, Isao Echizen
We present a multilingual bag-of-entities model that effectively boosts the performance of zero-shot cross-lingual text classification by extending a multilingual pre-trained language model (e.g., M-BERT).
1 code implementation • ACL 2021 • Ikuya Yamada, Akari Asai, Hannaneh Hajishirzi
Most state-of-the-art open-domain question answering systems use a neural retrieval model to encode passages into continuous vectors and extract them from a knowledge source.
Ranked #2 on Open-Domain Question Answering on TQA
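The retrieval setup described above scores passages against a question by inner product in a shared vector space; this paper's contribution is making that memory-efficient with binary codes. A minimal sketch of both pieces (generic illustration, not the paper's model; the sign-binarization rule here is an assumption):

```python
import numpy as np

def retrieve(query_vec, passage_vecs, k=2):
    """Rank passages by inner product with the query vector (the standard
    dense-retrieval scoring function) and return the top-k indices."""
    scores = passage_vecs @ query_vec
    return np.argsort(-scores)[:k]

def binarize(vecs):
    """Sign-binarize continuous embeddings into +/-1 codes; inner product
    between such codes is an affine function of Hamming distance, so
    ranking by it approximates the continuous-vector ranking cheaply."""
    return np.where(vecs >= 0, 1.0, -1.0)
```

In practice the binary codes would be packed into bits, cutting index memory by roughly 32x relative to float32 vectors.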
no code implementations • 1 Jan 2021 • Sewon Min, Jordan Boyd-Graber, Chris Alberti, Danqi Chen, Eunsol Choi, Michael Collins, Kelvin Guu, Hannaneh Hajishirzi, Kenton Lee, Jennimaria Palomaki, Colin Raffel, Adam Roberts, Tom Kwiatkowski, Patrick Lewis, Yuxiang Wu, Heinrich Küttler, Linqing Liu, Pasquale Minervini, Pontus Stenetorp, Sebastian Riedel, Sohee Yang, Minjoon Seo, Gautier Izacard, Fabio Petroni, Lucas Hosseini, Nicola De Cao, Edouard Grave, Ikuya Yamada, Sonse Shimaoka, Masatoshi Suzuki, Shumpei Miyawaki, Shun Sato, Ryo Takahashi, Jun Suzuki, Martin Fajcik, Martin Docekal, Karel Ondrej, Pavel Smrz, Hao Cheng, Yelong Shen, Xiaodong Liu, Pengcheng He, Weizhu Chen, Jianfeng Gao, Barlas Oguz, Xilun Chen, Vladimir Karpukhin, Stan Peshterliev, Dmytro Okhonko, Michael Schlichtkrull, Sonal Gupta, Yashar Mehdad, Wen-tau Yih
We review the EfficientQA competition from NeurIPS 2020.
8 code implementations • EMNLP 2020 • Ikuya Yamada, Akari Asai, Hiroyuki Shindo, Hideaki Takeda, Yuji Matsumoto
In this paper, we propose new pretrained contextualized representations of words and entities based on the bidirectional transformer.
Ranked #1 on Entity Typing on Open Entity
3 code implementations • CoNLL 2019 • Ikuya Yamada, Hiroyuki Shindo
This study proposes the Neural Attentive Bag-of-Entities model, a neural network that performs text classification using entities in a knowledge base.
Ranked #9 on Text Classification on 20NEWS
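The core of a bag-of-entities classifier like the one above is representing a document as a weighted sum of the vectors of entities detected in it, with attention deciding how much each entity contributes. A minimal sketch (illustrative only; the attention scores here are assumed to come from some upstream scorer):

```python
import numpy as np

def softmax(x):
    z = np.exp(x - x.max())
    return z / z.sum()

def doc_representation(entity_vecs, attention_scores):
    """Attention-weighted bag of entities: normalize per-entity relevance
    scores with softmax, then take the weighted sum of entity vectors."""
    weights = softmax(attention_scores)
    return weights @ entity_vecs  # (dim,) document vector
```

The resulting document vector would then be fed to a standard classification layer; attention lets the model down-weight entities that were detected but are irrelevant to the document's topic.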
1 code implementation • NAACL 2022 • Ikuya Yamada, Koki Washio, Hiroyuki Shindo, Yuji Matsumoto
We propose a global entity disambiguation (ED) model based on BERT.
Ranked #1 on Entity Disambiguation on MSNBC
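"Global" disambiguation means each mention's entity is chosen with reference to the entities assigned to the other mentions in the document. One common realization, sketched here with a stub scorer (this is a generic greedy scheme, not the paper's BERT model; mention names and scores are invented for illustration), resolves the most confident mention first and conditions later decisions on it:

```python
def disambiguate_globally(mentions, candidates, score):
    """Greedy global ED sketch: repeatedly commit to the (mention, candidate)
    pair the scorer is most confident about, passing already-resolved
    entities back into the scorer so later decisions can use them."""
    resolved = {}
    remaining = set(mentions)
    while remaining:
        mention, entity, _ = max(
            ((m, c, score(m, c, resolved))
             for m in remaining for c in candidates[m]),
            key=lambda t: t[2],
        )
        resolved[mention] = entity
        remaining.remove(mention)
    return resolved
```

With a purely local scorer an ambiguous mention like "Paris" might prefer the wrong entity; once "France" is resolved first, the global bonus flips the decision.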
no code implementations • EMNLP 2020 • Ikuya Yamada, Akari Asai, Jin Sakuma, Hiroyuki Shindo, Hideaki Takeda, Yoshiyasu Takefuji, Yuji Matsumoto
The embeddings of entities in a large knowledge base (e.g., Wikipedia) are highly beneficial for solving various natural language tasks that involve real world knowledge.
1 code implementation • TACL 2019 • Eric Wallace, Pedro Rodriguez, Shi Feng, Ikuya Yamada, Jordan Boyd-Graber
We propose human-in-the-loop adversarial generation, where human authors are guided to break models.
2 code implementations • COLING 2018 • Ikuya Yamada, Hiroyuki Shindo, Yoshiyasu Takefuji
In this paper, we describe TextEnt, a neural network model that learns distributed representations of entities and documents directly from a knowledge base (KB).
Ranked #1 on Entity Typing on Freebase FIGER
no code implementations • 23 Mar 2018 • Ikuya Yamada, Ryuji Tamaki, Hiroyuki Shindo, Yoshiyasu Takefuji
In this chapter, we describe our question answering system, which was the winning system at the Human-Computer Question Answering (HCQA) Competition at the Thirty-first Annual Conference on Neural Information Processing Systems (NIPS).
no code implementations • IJCNLP 2017 • Motoki Sato, Hiroyuki Shindo, Ikuya Yamada, Yuji Matsumoto
We present Segment-level Neural CRF, which combines neural networks with a linear chain CRF for segment-level sequence modeling tasks such as named entity recognition (NER) and syntactic chunking.
1 code implementation • CoNLL 2017 • Yotam Eshel, Noam Cohen, Kira Radinsky, Shaul Markovitch, Ikuya Yamada, Omer Levy
We address the task of Named Entity Disambiguation (NED) for noisy text.
1 code implementation • TACL 2017 • Ikuya Yamada, Hiroyuki Shindo, Hideaki Takeda, Yoshiyasu Takefuji
Given a text in the KB, we train our proposed model to predict entities that are relevant to the text.
Ranked #2 on Entity Disambiguation on TAC2010
1 code implementation • 15 Mar 2017 • Ikuya Yamada, Motoki Sato, Hiroyuki Shindo
This paper describes our approach for the triple scoring task at the WSDM Cup 2017.
1 code implementation • CoNLL 2016 • Ikuya Yamada, Hiroyuki Shindo, Hideaki Takeda, Yoshiyasu Takefuji
The KB graph model learns the relatedness of entities from the link structure of the KB. The anchor context model, in turn, leverages KB anchors and their context words to align the vector space so that similar words and entities lie close to one another.
Ranked #4 on Entity Disambiguation on TAC2010