1 code implementation • 1 Apr 2020 • Yuliang Li, Jinfeng Li, Yoshihiko Suhara, AnHai Doan, Wang-Chiew Tan
Our experiments show that a straightforward application of language models such as BERT, DistilBERT, or RoBERTa pre-trained on large text corpora already significantly improves matching quality and outperforms the previous state of the art (SOTA) by up to 29% in F1 score on benchmark datasets.
Ranked #2 on Entity Resolution on WDC Watches-xlarge
1 code implementation • 14 Nov 2019 • Dan Zhang, Yoshihiko Suhara, Jinfeng Li, Madelon Hulsebos, Çağatay Demiralp, Wang-Chiew Tan
Detecting the semantic types of data columns in relational tables is important for various data preparation and information retrieval tasks such as data cleaning, schema matching, data discovery, and semantic search.
Ranked #2 on Column Type Annotation on VizNet-Sato-MultiColumn
1 code implementation • ACL 2020 • Yoshihiko Suhara, Xiaolan Wang, Stefanos Angelidis, Wang-Chiew Tan
The framework uses an Aspect-based Sentiment Analysis model to extract opinion phrases from reviews, and trains a Transformer model to reconstruct the original reviews from these extractions.
2 code implementations • 8 Dec 2020 • Stefanos Angelidis, Reinald Kim Amplayo, Yoshihiko Suhara, Xiaolan Wang, Mirella Lapata
We present the Quantized Transformer (QT), an unsupervised system for extractive opinion summarization.
1 code implementation • Findings (EMNLP) 2021 • Hayate Iso, Xiaolan Wang, Yoshihiko Suhara, Stefanos Angelidis, Wang-Chiew Tan
We found that text autoencoders tend to generate overly generic summaries from simply averaged latent vectors due to an unexpected $L_2$-norm shrinkage in the aggregated latent vectors, which we refer to as summary vector degeneration.
Ranked #1 on Unsupervised Opinion Summarization on Amazon
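The $L_2$-norm shrinkage mentioned in the entry above can be illustrated with a small numerical sketch. This is not the authors' code: random Gaussian vectors stand in for autoencoder latent vectors as an assumption, and the helper names (`l2_norm`, `mean_vector`, `shrinkage_demo`) are hypothetical. It only shows the general geometric fact that the simple average of several non-parallel vectors has a smaller norm than the vectors being averaged.

```python
import math
import random


def l2_norm(v):
    """Euclidean (L2) norm of a vector given as a list of floats."""
    return math.sqrt(sum(x * x for x in v))


def mean_vector(vectors):
    """Element-wise simple average of equally sized vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def shrinkage_demo(num_vectors=8, dim=64, seed=0):
    """Compare the average input norm with the norm of the averaged vector.

    Assumption: i.i.d. Gaussian vectors are a stand-in for the latent
    vectors of individual reviews; real latent vectors will differ.
    """
    rng = random.Random(seed)
    vectors = [
        [rng.gauss(0.0, 1.0) for _ in range(dim)]
        for _ in range(num_vectors)
    ]
    avg_input_norm = sum(l2_norm(v) for v in vectors) / num_vectors
    mean_norm = l2_norm(mean_vector(vectors))
    return avg_input_norm, mean_norm


if __name__ == "__main__":
    avg_input_norm, mean_norm = shrinkage_demo()
    print(f"average norm of inputs:  {avg_input_norm:.3f}")
    print(f"norm of averaged vector: {mean_norm:.3f}")
```

By the triangle inequality the norm of the average never exceeds the average of the norms, and for roughly independent directions the gap grows with the number of vectors averaged.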
1 code implementation • 5 Apr 2021 • Yoshihiko Suhara, Jinfeng Li, Yuliang Li, Dan Zhang, Çağatay Demiralp, Chen Chen, Wang-Chiew Tan
Inferring meta information about tables, such as column headers or relationships between columns, is an active research topic in data management as we find many tables are missing some of this information.
Ranked #1 on Column Type Annotation on VizNet-Sato-MultiColumn
1 code implementation • Findings (ACL) 2022 • Hayate Iso, Xiaolan Wang, Stefanos Angelidis, Yoshihiko Suhara
Opinion summarization focuses on generating summaries that reflect popular subjective information expressed in multiple online reviews.
1 code implementation • 15 Sep 2019 • Wataru Hirota, Yoshihiko Suhara, Behzad Golshan, Wang-Chiew Tan
We present Emu, a system that semantically enhances multilingual sentence embeddings.
2 code implementations • LREC 2018 • Akari Asai, Sara Evensen, Behzad Golshan, Alon Halevy, Vivian Li, Andrei Lopatenko, Daniela Stepanov, Yoshihiko Suhara, Wang-Chiew Tan, Yinzhan Xu
The science of happiness is an area of positive psychology concerned with understanding what behaviors make people happy in a sustainable fashion.
no code implementations • 1 Aug 2017 • Hidekazu Oiwa, Yoshihiko Suhara, Jiyu Komiya, Andrei Lopatenko
Entity population, the task of collecting entities that belong to a particular category, has attracted attention from vertical domains.
no code implementations • NAACL 2019 • Nikita Bhutani, Yoshihiko Suhara, Wang-Chiew Tan, Alon Halevy, H. V. Jagadish
We describe NeurON, a system for extracting tuples from question-answer pairs.
no code implementations • 23 Jul 2019 • Sara Evensen, Yoshihiko Suhara, Alon Halevy, Vivian Li, Wang-Chiew Tan, Saran Mumick
We prototype one necessary component of such a system, the Happiness Entailment Recognition (HER) module, which takes as input a short text describing an event and a candidate suggestion, and outputs a determination of whether the suggestion is more likely to be good for this user based on the event described.
no code implementations • 19 Dec 2019 • Masahiro Kazama, Yoshihiko Suhara, Andrey Bogomolov, Alex "Sandy" Pentland
We also analyzed the differences between the expert and non-expert machine algorithms based on their neural representations to evaluate the performances, providing insight into the human experts' and non-experts' cognitive abilities.
no code implementations • 6 Apr 2020 • Aaron Traylor, Chen Chen, Behzad Golshan, Xiaolan Wang, Yuliang Li, Yoshihiko Suhara, Jinfeng Li, Çağatay Demiralp, Wang-Chiew Tan
In this paper, we introduce xSense, an effective system for review comprehension using domain-specific commonsense knowledge bases (xSense KBs).
no code implementations • 29 May 2020 • Nofar Carmeli, Xiaolan Wang, Yoshihiko Suhara, Stefanos Angelidis, Yuliang Li, Jinfeng Li, Wang-Chiew Tan
The Web is a major resource of both factual and subjective information.