Search Results for author: Masayuki Asahara

Found 33 papers, 4 papers with code

CHJ-WLSP: Annotation of ‘Word List by Semantic Principles’ Labels for the Corpus of Historical Japanese

no code implementations LT4HALA (LREC) 2022 Masayuki Asahara, Nao Ikegami, Tai Suzuki, Taro Ichimura, Asuko Kondo, Sachi Kato, Makoto Yamazaki

This article presents a word-sense annotation for the Corpus of Historical Japanese: a mashed-up Japanese lexicon based on the ‘Word List by Semantic Principles’ (WLSP).

AcTED: Automatic Acquisition of Typical Event Duration for Semi-supervised Temporal Commonsense QA

no code implementations27 Mar 2024 Felix Virgo, Fei Cheng, Lis Kanashiro Pereira, Masayuki Asahara, Ichiro Kobayashi, Sadao Kurohashi

We propose a voting-driven semi-supervised approach to automatically acquire the typical duration of an event and use it as pseudo-labeled data.

Lower Perplexity is Not Always Human-Like

1 code implementation ACL 2021 Tatsuki Kuribayashi, Yohei Oseki, Takumi Ito, Ryo Yoshida, Masayuki Asahara, Kentaro Inui

Overall, our results suggest that a cross-lingual evaluation will be necessary to construct human-like computational models.

Language Modelling

A Gamification of Japanese Dependency Parsing

no code implementations9 Jan 2021 Masayuki Asahara

Gamification approaches have been used as a way for creating language resources for NLP.

Dependency Parsing

Automatic Creation of Correspondence Table of Meaning Tags from Two Dictionaries in One Language Using Bilingual Word Embedding

no code implementations LREC 2020 Teruo Hirabayashi, Kanako Komiya, Masayuki Asahara, Hiroyuki Shinnou

However, because our method utilized the embedding vectors of the word senses, the relations of the sense tags corresponding to concept tags could be examined by mapping the sense embeddings to the vector space of the concept tags.

TAG Word Embeddings

Design of BCCWJ-EEG: Balanced Corpus with Human Electroencephalography

no code implementations LREC 2020 Yohei Oseki, Masayuki Asahara

Importantly, this inter-fertilization between NLP, on one hand, and the cognitive (neuro)science of language, on the other, has been driven by the language resources annotated with human language processing data.

EEG

Word Familiarity Rate Estimation Using a Bayesian Linear Mixed Model

no code implementations WS 2019 Masayuki Asahara

This paper presents research on word familiarity rate estimation using the {`}Word List by Semantic Principles{'}.

UD-Japanese BCCWJ: Universal Dependencies Annotation for the Balanced Corpus of Contemporary Written Japanese

no code implementations WS 2018 Mai Omura, Masayuki Asahara

In this paper, we describe a corpus UD Japanese-BCCWJ that was created by converting the Balanced Corpus of Contemporary Written Japanese (BCCWJ), a Japanese language corpus, to adhere to the UD annotation schema.

Coordinate Structures in Universal Dependencies for Head-final Languages

no code implementations WS 2018 Hiroshi Kanayama, Na-Rae Han, Masayuki Asahara, Jena D. Hwang, Yusuke Miyao, Jinho D. Choi, Yuji Matsumoto

This paper discusses the representation of coordinate structures in the Universal Dependencies framework for two head-final languages, Japanese and Korean.

Predicting Japanese Word Order in Double Object Constructions

no code implementations WS 2018 Masayuki Asahara, Satoshi Nambu, Shin-Ichiro Sano

This paper presents a statistical model to predict Japanese word order in the double object constructions.

Object

Between Reading Time and Syntactic/Semantic Categories

no code implementations IJCNLP 2017 Masayuki Asahara, Sachi Kato

This article presents a contrastive analysis between reading time and syntactic/semantic categories in Japanese.

`BonTen' -- Corpus Concordance System for `NINJAL Web Japanese Corpus'

no code implementations COLING 2016 Masayuki Asahara, Kazuya Kawahara, Yuya Takei, Hideto Masuoka, Yasuko Ohba, Yuki Torii, Toru Morii, Yuki Tanaka, Kikuo Maekawa, Sachi Kato, Hikari Konishi

The National Institute for Japanese Language and Linguistics, Japan (NINJAL) has undertaken a corpus compilation project to construct a web corpus for linguistic research comprising ten billion words.

Morphological Analysis

Reading-Time Annotations for ``Balanced Corpus of Contemporary Written Japanese''

no code implementations COLING 2016 Masayuki Asahara, Hajime Ono, Edson T. Miyamoto

The Dundee Eyetracking Corpus contains eyetracking data collected while native speakers of English and French read newspaper editorial articles.

Universal Dependencies for Japanese

no code implementations LREC 2016 Takaaki Tanaka, Yusuke Miyao, Masayuki Asahara, Sumire Uematsu, Hiroshi Kanayama, Shinsuke Mori, Yuji Matsumoto

We present an attempt to port the international syntactic annotation scheme, Universal Dependencies, to the Japanese language in this paper.

Cannot find the paper you are looking for? You can Submit a new open access paper.