no code implementations • EMNLP 2021 • Shun Kiyono, Sosuke Kobayashi, Jun Suzuki, Kentaro Inui
Position representation is crucial for building position-aware representations in Transformers.
no code implementations • 28 Dec 2023 • Sho Takase, Shun Kiyono, Sosuke Kobayashi, Jun Suzuki
Loss spikes often occur during pre-training of large language models.
1 code implementation • 1 Jun 2022 • Sho Takase, Shun Kiyono, Sosuke Kobayashi, Jun Suzuki
Recent Transformers tend to use Pre-LN because, in Post-LN with deep Transformers (e.g., those with ten or more layers), training is often unstable, resulting in useless models.
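The Pre-LN / Post-LN distinction refers to where layer normalization sits relative to each residual sublayer. A minimal sketch of the two orderings (the sublayer is a stand-in for attention or a feed-forward block; names are illustrative, not from the paper):

```python
import numpy as np

def layer_norm(x, eps=1e-5):
    # normalize the last dimension to zero mean, unit variance
    mu = x.mean(-1, keepdims=True)
    var = x.var(-1, keepdims=True)
    return (x - mu) / np.sqrt(var + eps)

def sublayer(x):
    # stand-in for self-attention or the FFN: any shape-preserving function
    return np.tanh(x)

def post_ln_block(x):
    # Post-LN (original Transformer): residual add first, then normalize
    return layer_norm(x + sublayer(x))

def pre_ln_block(x):
    # Pre-LN: normalize the sublayer input; the residual path is left
    # untouched, which tends to keep gradients well-scaled in deep stacks
    return x + sublayer(layer_norm(x))

x = np.random.default_rng(0).standard_normal((4, 16))
print(post_ln_block(x).shape)  # (4, 16)
print(pre_ln_block(x).shape)   # (4, 16)
```

The paper's point is precisely that these two placements behave very differently as depth grows, with Post-LN being the less stable of the two.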
1 code implementation • 31 May 2022 • Sosuke Kobayashi, Eiichi Matsumoto, Vincent Sitzmann
Emerging neural radiance fields (NeRF) are a promising scene representation for computer graphics, enabling high-quality 3D reconstruction and novel view synthesis from image observations.
no code implementations • BigScience (ACL) 2022 • Sosuke Kobayashi, Shun Kiyono, Jun Suzuki, Kentaro Inui
Ensembling is a popular method used to improve performance as a last resort.
no code implementations • 28 Sep 2021 • Hiroki Ouchi, Jun Suzuki, Sosuke Kobayashi, Sho Yokoi, Tatsuki Kuribayashi, Masashi Yoshikawa, Kentaro Inui
Interpretable rationales for model predictions are crucial in practical applications.
1 code implementation • 13 Sep 2021 • Shun Kiyono, Sosuke Kobayashi, Jun Suzuki, Kentaro Inui
Position representation is crucial for building position-aware representations in Transformers.
no code implementations • EMNLP (sustainlp) 2020 • Sosuke Kobayashi, Sho Yokoi, Jun Suzuki, Kentaro Inui
Understanding the influence of a training instance on a neural network model leads to improving interpretability.
1 code implementation • ACL 2020 • Hiroki Ouchi, Jun Suzuki, Sosuke Kobayashi, Sho Yokoi, Tatsuki Kuribayashi, Ryuto Konno, Kentaro Inui
Interpretable rationales for model predictions play a critical role in practical applications.
1 code implementation • NeurIPS 2020 • Sho Takase, Sosuke Kobayashi
The proposed method, ALONE (all word embeddings from one), constructs the embedding of a word by modifying the shared embedding with a filter vector, which is word-specific but non-trainable.
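A rough sketch of the ALONE idea as described above: every word shares one trainable embedding, which is modified element-wise by a word-specific, non-trainable filter vector and then passed through a trainable transform. The hash-seeded filter construction here is only illustrative (the paper derives filters from codewords):

```python
import hashlib
import numpy as np

rng = np.random.default_rng(0)
D = 8
shared = rng.standard_normal(D)    # the single shared (trainable) embedding
W = rng.standard_normal((D, D))    # trainable transform (FFN stand-in)

def filter_vector(word, d=D):
    # word-specific but NON-trainable: derived deterministically from the
    # word via a hash-seeded random vector (illustrative construction only)
    seed = int(hashlib.md5(word.encode()).hexdigest(), 16) % (2**32)
    return np.random.default_rng(seed).standard_normal(d)

def alone_embedding(word):
    # filter the shared embedding element-wise, then transform (ReLU)
    return np.maximum(W @ (shared * filter_vector(word)), 0.0)

print(alone_embedding("cat").shape)  # (8,)
```

The memory saving comes from storing one shared vector plus cheap deterministic filters instead of a full |V| x D embedding matrix.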
Ranked #3 on Text Summarization on DUC 2004 Task 1
no code implementations • ICLR Workshop LLD 2019 • Takuya Shimada, Shoichiro Yamaguchi, Kohei Hayashi, Sosuke Kobayashi
Data augmentation by mixing samples, such as Mixup, has been widely used, typically for classification tasks.
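For reference, sample mixing in the Mixup style forms a convex combination of two training examples and their labels, with the mixing weight drawn from a Beta distribution:

```python
import numpy as np

def mixup(x1, y1, x2, y2, alpha=0.2, rng=None):
    # Mixup: convex combination of two examples and their one-hot labels;
    # the mixing weight lambda is drawn from Beta(alpha, alpha)
    rng = rng or np.random.default_rng()
    lam = rng.beta(alpha, alpha)
    return lam * x1 + (1 - lam) * x2, lam * y1 + (1 - lam) * y2

x1, y1 = np.array([1.0, 0.0]), np.array([1.0, 0.0])
x2, y2 = np.array([0.0, 1.0]), np.array([0.0, 1.0])
xm, ym = mixup(x1, y1, x2, y2, rng=np.random.default_rng(0))
print(ym.sum())  # mixed labels still sum to 1.0
```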
2 code implementations • 22 Nov 2018 • Masaki Saito, Shunta Saito, Masanori Koyama, Sosuke Kobayashi
Training a Generative Adversarial Network (GAN) on a video dataset is challenging because of the sheer size of the dataset and the complexity of each observation.
1 code implementation • 28 Oct 2018 • Riku Arakawa, Sosuke Kobayashi, Yuya Unno, Yuta Tsuboi, Shin-ichi Maeda
A remedy for this is to train an agent with real-time feedback from a human observer who immediately gives rewards for some actions.
no code implementations • EMNLP 2018 • Sho Yokoi, Sosuke Kobayashi, Kenji Fukumizu, Jun Suzuki, Kentaro Inui
Just as PMI is derived from mutual information, we derive this new measure from the Hilbert–Schmidt independence criterion (HSIC); thus, we call the new measure the pointwise HSIC (PHSIC).
2 code implementations • NAACL 2018 • Sosuke Kobayashi
We stochastically replace words with other words that are predicted by a bi-directional language model at the word positions.
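The replacement scheme described above can be sketched as follows; `predict_words` is a hypothetical stand-in for the bidirectional language model, returning candidate words and their probabilities at a position given both-side context:

```python
import numpy as np

def contextual_augment(tokens, predict_words, p=0.15, rng=None):
    # Stochastically replace each position with a word sampled from a
    # bidirectional LM's predictive distribution at that position.
    # `predict_words(tokens, i)` is an assumed interface: it returns
    # (candidate_words, probabilities) for position i.
    rng = rng or np.random.default_rng()
    out = list(tokens)
    for i in range(len(tokens)):
        if rng.random() < p:
            words, probs = predict_words(tokens, i)
            out[i] = rng.choice(words, p=probs)
    return out

# toy "LM" that proposes fixed alternatives regardless of context
def toy_lm(tokens, i):
    return ["good", "great", "nice"], [0.5, 0.3, 0.2]

print(contextual_augment(["the", "movie", "is", "good"], toy_lm, p=0.5,
                         rng=np.random.default_rng(1)))
```

In the paper the language model is additionally conditioned on the example's label, so that sampled replacements stay label-compatible; the toy model above ignores that detail.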
no code implementations • ACL 2018 • Reina Akama, Kento Watanabe, Sho Yokoi, Sosuke Kobayashi, Kentaro Inui
This paper presents the first study aimed at capturing stylistic similarity between words in an unsupervised manner.
no code implementations • IJCNLP 2017 • Reina Akama, Kazuaki Inada, Naoya Inoue, Sosuke Kobayashi, Kentaro Inui
We propose a novel, data-driven, and stylistically consistent dialog response generation system.
1 code implementation • 17 Oct 2017 • Jun Hatori, Yuta Kikuchi, Sosuke Kobayashi, Kuniyuki Takahashi, Yuta Tsuboi, Yuya Unno, Wilson Ko, Jethro Tan
In this paper, we propose the first comprehensive system that can handle unconstrained spoken language and is able to effectively resolve ambiguity in spoken instructions.
1 code implementation • IJCNLP 2017 • Sosuke Kobayashi, Naoaki Okazaki, Kentaro Inui
This study addresses the problem of identifying the meaning of unknown words or entities in a discourse with respect to the word embedding approaches used in neural language models.
no code implementations • WS 2017 • Melissa Roemmele, Sosuke Kobayashi, Naoya Inoue, Andrew Gordon
In this paper we present a system that performs this task using a supervised binary classifier on top of a recurrent neural network to predict the probability that a given story ending is correct.