Search Results for author: Yoshihiko Suhara

Found 15 papers, 9 papers with code

Deep Entity Matching with Pre-Trained Language Models

1 code implementation • 1 Apr 2020 • Yuliang Li, Jinfeng Li, Yoshihiko Suhara, AnHai Doan, Wang-Chiew Tan

Our experiments show that a straightforward application of language models such as BERT, DistilBERT, or RoBERTa pre-trained on large text corpora already significantly improves the matching quality and outperforms previous state-of-the-art (SOTA), by up to 29% of F1 score on benchmark datasets.

Ranked #2 on Entity Resolution on WDC Watches-xlarge

Data Augmentation Entity Resolution

239

Paper
Code

Sato: Contextual Semantic Type Detection in Tables

1 code implementation • 14 Nov 2019 • Dan Zhang, Yoshihiko Suhara, Jinfeng Li, Madelon Hulsebos, Çağatay Demiralp, Wang-Chiew Tan

Detecting the semantic types of data columns in relational tables is important for various data preparation and information retrieval tasks such as data cleaning, schema matching, data discovery, and semantic search.

Ranked #2 on Column Type Annotation on VizNet-Sato-MultiColumn

Column Type Annotation Information Retrieval +3

108

Paper
Code

OpinionDigest: A Simple Framework for Opinion Summarization

1 code implementation • ACL 2020 • Yoshihiko Suhara, Xiaolan Wang, Stefanos Angelidis, Wang-Chiew Tan

The framework uses an Aspect-based Sentiment Analysis model to extract opinion phrases from reviews, and trains a Transformer model to reconstruct the original reviews from these extractions.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +1

Paper
Code

Extractive Opinion Summarization in Quantized Transformer Spaces

2 code implementations • 8 Dec 2020 • Stefanos Angelidis, Reinald Kim Amplayo, Yoshihiko Suhara, Xiaolan Wang, Mirella Lapata

We present the Quantized Transformer (QT), an unsupervised system for extractive opinion summarization.

Clustering Extract Aspect +1

Paper
Code

Convex Aggregation for Opinion Summarization

1 code implementation • Findings (EMNLP) 2021 • Hayate Iso, Xiaolan Wang, Yoshihiko Suhara, Stefanos Angelidis, Wang-Chiew Tan

We found that text autoencoders tend to generate overly generic summaries from simply averaged latent vectors due to an unexpected $L_2$-norm shrinkage in the aggregated latent vectors, which we refer to as summary vector degeneration.

Ranked #1 on Unsupervised Opinion Summarization on Amazon

Opinion Summarization Unsupervised Opinion Summarization

Paper
Code

Annotating Columns with Pre-trained Language Models

1 code implementation • 5 Apr 2021 • Yoshihiko Suhara, Jinfeng Li, Yuliang Li, Dan Zhang, Çağatay Demiralp, Chen Chen, Wang-Chiew Tan

Inferring meta information about tables, such as column headers or relationships between columns, is an active research topic in data management as we find many tables are missing some of this information.

Ranked #1 on Column Type Annotation on VizNet-Sato-MultiColumn

Columns Property Annotation Column Type Annotation +3

Paper
Code

Comparative Opinion Summarization via Collaborative Decoding

1 code implementation • Findings (ACL) 2022 • Hayate Iso, Xiaolan Wang, Stefanos Angelidis, Yoshihiko Suhara

Opinion summarization focuses on generating summaries that reflect popular subjective information expressed in multiple online reviews.

Opinion Summarization

Paper
Code

Emu: Enhancing Multilingual Sentence Embeddings with Semantic Specialization

1 code implementation • 15 Sep 2019 • Wataru Hirota, Yoshihiko Suhara, Behzad Golshan, Wang-Chiew Tan

We present Emu, a system that semantically enhances multilingual sentence embeddings.

intent-classification Intent Classification +5

Paper
Code

HappyDB: A Corpus of 100,000 Crowdsourced Happy Moments

2 code implementations • LREC 2018 • Akari Asai, Sara Evensen, Behzad Golshan, Alon Halevy, Vivian Li, Andrei Lopatenko, Daniela Stepanov, Yoshihiko Suhara, Wang-Chiew Tan, Yinzhan Xu

The science of happiness is an area of positive psychology concerned with understanding what behaviors make people happy in a sustainable fashion.

Art Analysis

Paper
Code

A Lightweight Front-end Tool for Interactive Entity Population

no code implementations • 1 Aug 2017 • Hidekazu Oiwa, Yoshihiko Suhara, Jiyu Komiya, Andrei Lopatenko

Entity population, a task of collecting entities that belong to a particular category, has attracted attention from vertical domains.

Paper
Add Code

Open Information Extraction from Question-Answer Pairs

no code implementations • NAACL 2019 • Nikita Bhutani, Yoshihiko Suhara, Wang-Chiew Tan, Alon Halevy, H. V. Jagadish

We describe NeurON, a system for extracting tuples from question-answer pairs.

Open Information Extraction Sentence

Paper
Add Code

Happiness Entailment: Automating Suggestions for Well-Being

no code implementations • 23 Jul 2019 • Sara Evensen, Yoshihiko Suhara, Alon Halevy, Vivian Li, Wang-Chiew Tan, Saran Mumick

We prototype one necessary component of such a system, the Happiness Entailment Recognition (HER) module, which takes as input a short text describing an event, a candidate suggestion, and outputs a determination about whether the suggestion is more likely to be good for this user based on the event described.

Paper
Add Code

Understanding Human Judgments of Causality

no code implementations • 19 Dec 2019 • Masahiro Kazama, Yoshihiko Suhara, Andrey Bogomolov, Alex `Sandy' Pentland

We also analyzed the differences between the expert and non-expert machine algorithms based on their neural representations to evaluate the performances, providing insight into the human experts' and non-experts' cognitive abilities.

Attribute BIG-bench Machine Learning

Paper
Add Code

Enhancing Review Comprehension with Domain-Specific Commonsense

no code implementations • 6 Apr 2020 • Aaron Traylor, Chen Chen, Behzad Golshan, Xiaolan Wang, Yuliang Li, Yoshihiko Suhara, Jinfeng Li, Cagatay Demiralp, Wang-Chiew Tan

In this paper, we introduce xSense, an effective system for review comprehension using domain-specific commonsense knowledge bases (xSense KBs).

Aspect Extraction Knowledge Distillation +3

Paper
Add Code

Constructing Explainable Opinion Graphs from Review

no code implementations • 29 May 2020 • Nofar Carmeli, Xiaolan Wang, Yoshihiko Suhara, Stefanos Angelidis, Yuliang Li, Jinfeng Li, Wang-Chiew Tan

The Web is a major resource of both factual and subjective information.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.