no code implementations • 21 Oct 2020 • H. Bai, L. Tan, K. Xiong, M. Li, J. Lin
In this paper, we demonstrate under a Bayesian framework that distance between primitive statistics such as the mean of word embeddings are fundamentally flawed for capturing sentence-level semantic similarity.