no code implementations • 22 Oct 2024 • Aitaro Yamamoto, Hiroyuki Otomo, Hiroki Ouchi, Shohei Higashiyama, Hiroki Teranishi, Hiroyuki Shindo, Taro Watanabe
Previous studies on sequence-based extraction of human movement trajectories have an issue of inadequate trajectory representation.
no code implementations • 26 Mar 2024 • Jesse Atuhurra, Hiroyuki Shindo, Hidetaka Kamigaito, Taro Watanabe
Tokenization is one such technique because it allows for the words to be split based on characters or subwords, creating word embeddings that best represent the structure of the language.
no code implementations • 13 Mar 2024 • Jesse Atuhurra, Seiveright Cargill Dujohn, Hidetaka Kamigaito, Hiroyuki Shindo, Taro Watanabe
Natural language processing (NLP) practitioners are leveraging large language models (LLM) to create structured datasets from semi-structured and unstructured data sources such as patents, papers, and theses, without having domain-specific knowledge.
1 code implementation • Journal of Natural Language Processing 2023 • Van-Hien Tran, Hiroki Ouchi, Hiroyuki Shindo, Yuji Matsumoto, Taro Watanabe
This study argues that enhancing the semantic correlation between instances and relations is key to effectively solving the zero-shot relation extraction task.
Ranked #1 on
Zero-shot Relation Classification
on FewRel
1 code implementation • 23 May 2023 • Shohei Higashiyama, Hiroki Ouchi, Hiroki Teranishi, Hiroyuki Otomo, Yusuke Ide, Aitaro Yamamoto, Hiroyuki Shindo, Yuki Matsuda, Shoko Wakamiya, Naoya Inoue, Ikuya Yamada, Taro Watanabe
Geoparsing is a fundamental technique for analyzing geo-entity information in text.
no code implementations • 19 May 2023 • Hiroki Ouchi, Hiroyuki Shindo, Shoko Wakamiya, Yuki Matsuda, Naoya Inoue, Shohei Higashiyama, Satoshi Nakamura, Taro Watanabe
We have constructed Arukikata Travelogue Dataset and released it free of charge for academic research.
no code implementations • Journal of Natural Language Processing 2021 • Van-Hien Tran, Van-Thuy Phi, Akihiko Kato, Hiroyuki Shindo, Taro Watanabe, Yuji Matsumoto
A recent study (Yu et al. 2020) proposed a novel decomposition strategy that splits the task into two interrelated subtasks: detection of the head-entity (HE) and identification of the corresponding tail-entity and relation (TER) for each extracted head-entity.
1 code implementation • ACL 2021 • Yiran Wang, Hiroyuki Shindo, Yuji Matsumoto, Taro Watanabe
This paper presents a novel method for nested named entity recognition.
Ranked #13 on
Nested Named Entity Recognition
on ACE 2005
no code implementations • COLING 2020 • Yuya Sawada, Takashi Wada, Takayoshi Shibahara, Hiroki Teranishi, Shuhei Kondo, Hiroyuki Shindo, Taro Watanabe, Yuji Matsumoto
We propose a simple method for nominal coordination boundary identification.
9 code implementations • EMNLP 2020 • Ikuya Yamada, Akari Asai, Hiroyuki Shindo, Hideaki Takeda, Yuji Matsumoto
In this paper, we propose new pretrained contextualized representations of words and entities based on the bidirectional transformer.
Ranked #1 on
Entity Typing
on Open Entity
no code implementations • 21 Jan 2020 • Itsumi Saito, Kyosuke Nishida, Kosuke Nishida, Atsushi Otsuka, Hisako Asano, Junji Tomita, Hiroyuki Shindo, Yuji Matsumoto
Unlike the previous models, our length-controllable abstractive summarization model incorporates a word-level extractive module in the encoder-decoder model instead of length embeddings.
3 code implementations • CONLL 2019 • Ikuya Yamada, Hiroyuki Shindo
This study proposes a Neural Attentive Bag-of-Entities model, which is a neural network model that performs text classification using entities in a knowledge base.
Ranked #9 on
Text Classification
on 20NEWS
1 code implementation • NAACL 2022 • Ikuya Yamada, Koki Washio, Hiroyuki Shindo, Yuji Matsumoto
We propose a global entity disambiguation (ED) model based on BERT.
Ranked #1 on
Entity Disambiguation
on MSNBC
no code implementations • 31 Aug 2019 • Hiroyuki Shindo, Yuji Matsumoto
Molecule property prediction is a fundamental problem for computer-aided drug discovery and materials science.
no code implementations • 15 Aug 2019 • Hamada A. Nayel, H. L. Shashirekha, Hiroyuki Shindo, Yuji Matsumoto
The proposed SR model, FROBES, improves the representation of multi-word entities.
no code implementations • ACL 2019 • Tatsuya Hiraoka, Hiroyuki Shindo, Yuji Matsumoto
To make the model robust against infrequent tokens, we sampled segmentation for each sentence stochastically during training, which resulted in improved performance of text classification.
no code implementations • NAACL 2019 • Van-Hien Tran, Van-Thuy Phi, Hiroyuki Shindo, Yuji Matsumoto
Recently, relation classification has gained much success by exploiting deep neural networks.
no code implementations • NAACL 2019 • Hiroki Teranishi, Hiroyuki Shindo, Yuji Matsumoto
We propose a simple and accurate model for coordination boundary identification.
no code implementations • EMNLP 2020 • Ikuya Yamada, Akari Asai, Jin Sakuma, Hiroyuki Shindo, Hideaki Takeda, Yoshiyasu Takefuji, Yuji Matsumoto
The embeddings of entities in a large knowledge base (e. g., Wikipedia) are highly beneficial for solving various natural language tasks that involve real world knowledge.
1 code implementation • WS 2019 • Ronen Tamari, Hiroyuki Shindo, Dafna Shahaf, Yuji Matsumoto
Understanding procedural text requires tracking entities, actions and effects as the narrative unfolds.
2 code implementations • EMNLP 2018 • Hiroki Ouchi, Hiroyuki Shindo, Yuji Matsumoto
We present a simple and accurate span-based model for semantic role labeling (SRL).
Ranked #7 on
Semantic Role Labeling
on CoNLL 2005
no code implementations • COLING 2018 • Yuji Matsumoto, Akihiko Kato, Hiroyuki Shindo, Toshio Morita
Those two tools cooperate so that the words and multi-word expressions stored in Cradle are directly referred to by ChaKi in conducting corpus annotation, and the words and expressions annotated in ChaKi can be output as a list of lexical entities that are to be stored in Cradle.
no code implementations • ACL 2018 • Jun Liu, Hiroyuki Shindo, Yuji Matsumoto
We present a computer-assisted learning system, Jastudy, which is particularly designed for Chinese-speaking learners of Japanese as a second language (JSL) to learn Japanese functional expressions with suggestion of appropriate example sentences.
2 code implementations • COLING 2018 • Ikuya Yamada, Hiroyuki Shindo, Yoshiyasu Takefuji
In this paper, we describe TextEnt, a neural network model that learns distributed representations of entities and documents directly from a knowledge base (KB).
Ranked #1 on
Entity Typing
on Freebase FIGER
2 code implementations • 8 May 2018 • Motoki Sato, Jun Suzuki, Hiroyuki Shindo, Yuji Matsumoto
This paper restores interpretability to such methods by restricting the directions of perturbations toward the existing words in the input embedding space.
no code implementations • 23 Mar 2018 • Ikuya Yamada, Ryuji Tamaki, Hiroyuki Shindo, Yoshiyasu Takefuji
In this chapter, we describe our question answering system, which was the winning system at the Human-Computer Question Answering (HCQA) Competition at the Thirty-first Annual Conference on Neural Information Processing Systems (NIPS).
no code implementations • IJCNLP 2017 • Motoki Sato, Hiroyuki Shindo, Ikuya Yamada, Yuji Matsumoto
We present Segment-level Neural CRF, which combines neural networks with a linear chain CRF for segment-level sequence modeling tasks such as named entity recognition (NER) and syntactic chunking.
no code implementations • IJCNLP 2017 • Hiroki Teranishi, Hiroyuki Shindo, Yuji Matsumoto
We propose a neural network model for coordination boundary detection.
no code implementations • CONLL 2017 • Go Inoue, Hiroyuki Shindo, Yuji Matsumoto
One reason for this is that in the tagging scheme for such languages, a complete POS tag is formed by combining tags from multiple tag sets defined for each morphosyntactic category.
no code implementations • ACL 2017 • Hiroki Ouchi, Hiroyuki Shindo, Yuji Matsumoto
The performance of Japanese predicate argument structure (PAS) analysis has improved in recent years thanks to the joint modeling of interactions between multiple predicates.
no code implementations • ACL 2017 • Akihiko Kato, Hiroyuki Shindo, Yuji Matsumoto
Because syntactic structures and spans of multiword expressions (MWEs) are independently annotated in many English syntactic corpora, they are generally inconsistent with respect to one another, which is harmful to the implementation of an aggregate system.
1 code implementation • TACL 2017 • Ikuya Yamada, Hiroyuki Shindo, Hideaki Takeda, Yoshiyasu Takefuji
Given a text in the KB, we train our proposed model to predict entities that are relevant to the text.
Ranked #2 on
Entity Disambiguation
on TAC2010
1 code implementation • 15 Mar 2017 • Ikuya Yamada, Motoki Sato, Hiroyuki Shindo
This paper describes our approach for the triple scoring task at the WSDM Cup 2017.
no code implementations • WS 2016 • Taishi Ikeda, Hiroyuki Shindo, Yuji Matsumoto
Text normalization is the task of transforming lexical variants to their canonical forms.
no code implementations • WS 2016 • Ayaka Morimoto, Akifumi Yoshimoto, Akihiko Kato, Hiroyuki Shindo, Yuji Matsumoto
This paper presents our ongoing work on compilation of English multi-word expression (MWE) lexicon.
no code implementations • LREC 2016 • Akihiko Kato, Hiroyuki Shindo, Yuji Matsumoto
Nevertheless, this method often leads to the following problem: A node derived from an MWE could have multiple heads and the whole dependency structure including MWE might be cyclic.
1 code implementation • CONLL 2016 • Ikuya Yamada, Hiroyuki Shindo, Hideaki Takeda, Yoshiyasu Takefuji
The KB graph model learns the relatedness of entities using the link structure of the KB, whereas the anchor context model aims to align vectors such that similar words and entities occur close to one another in the vector space by leveraging KB anchors and their context words.
Ranked #4 on
Entity Disambiguation
on TAC2010