1 code implementation • LREC 2022 • Hrishikesh Kulkarni, Sean MacAvaney, Nazli Goharian, Ophir Frieder
To complement this evaluation, we propose a dynamic thresholding technique that adjusts the classifier’s sensitivity as a function of the number of posts a user has.
1 code implementation • 25 Aug 2024 • Hrishikesh Kulkarni, Nazli Goharian, Ophir Frieder, Sean MacAvaney
We address hallucination by adapting an existing genetic generation approach with a new 'balanced fitness function' consisting of a cross-encoder model for relevance and an n-gram overlap metric to promote grounding.
1 code implementation • 25 Aug 2024 • Hrishikesh Kulkarni, Nazli Goharian, Ophir Frieder, Sean MacAvaney
For efficiency, approximation methods like HNSW are frequently used to approximate exhaustive dense retrieval.
1 code implementation • 31 Jul 2023 • Hrishikesh Kulkarni, Sean MacAvaney, Nazli Goharian, Ophir Frieder
We introduce 'LADR' (Lexically-Accelerated Dense Retrieval), a simple-yet-effective approach that improves the efficiency of existing dense retrieval models without compromising on retrieval effectiveness.
no code implementations • 25 Nov 2022 • Ophir Frieder, Ida Mele, Cristina Ioana Muntean, Franco Maria Nardini, Raffaele Perego, Nicola Tonellotto
Our achieved high cache hit rates significantly improve the responsiveness of conversational systems while likewise reducing the number of queries managed on the search back-end.
no code implementations • 1 Sep 2022 • Hao-Ren Yao, Nairen Cao, Katina Russell, Der-Chen Chang, Ophir Frieder, Jeremy Fineman
We propose Graph Kernel Infomax, a self-supervised graph kernel learning approach on the graphical representation of EHR, to overcome the previous problems.
1 code implementation • 29 Aug 2021 • Eugene Yang, David D. Lewis, Ophir Frieder
Content moderation (removing or limiting the distribution of posts based on their contents) is one tool social networks use to fight problems such as harassment and disinformation.
no code implementations • 29 Aug 2021 • David D. Lewis, Eugene Yang, Ophir Frieder
Technology-assisted review (TAR) workflows based on iterative active learning are widely used in document review applications.
1 code implementation • 18 Jun 2021 • Eugene Yang, David D. Lewis, Ophir Frieder
Technology-assisted review (TAR) refers to human-in-the-loop machine learning workflows for document review in legal discovery and other high recall review tasks.
no code implementations • 18 Jun 2021 • Eugene Yang, David D. Lewis, Ophir Frieder
Technology-assisted review (TAR) refers to human-in-the-loop active learning workflows for finding relevant documents in large collections.
no code implementations • 3 May 2021 • Eugene Yang, Sean MacAvaney, David D. Lewis, Ophir Frieder
We indeed find that the pre-trained BERT model reduces review cost by 10% to 15% in TAR workflows simulated on the RCV1-v2 newswire collection.
no code implementations • 4 Feb 2021 • Der-Chen Chang, Ophir Frieder, Chi-Feng Hung, Hao-Ren Yao
Distance metrics and their nonlinear variant play a crucial role in machine learning based real-world problem solving.
no code implementations • 4 Aug 2020 • Hao-Ren Yao, Der-Chen Chang, Ophir Frieder, Wendy Huang, I-Chia Liang, Chi-Feng Hung
We present an end-to-end, interpretable, deep-learning architecture to learn a graph kernel that predicts the outcome of chronic disease drug prescription.
no code implementations • SEMEVAL 2020 • Sajad Sotudeh, Tong Xiang, Hao-Ren Yao, Sean MacAvaney, Eugene Yang, Nazli Goharian, Ophir Frieder
Offensive language detection is an important and challenging task in natural language processing.
no code implementations • 18 May 2020 • Sean MacAvaney, Franck Dernoncourt, Walter Chang, Nazli Goharian, Ophir Frieder
We present an elegant and effective approach for addressing limitations in existing multi-label classification models by incorporating interaction matching, a concept shown to be useful for ad-hoc search result ranking.
1 code implementation • 29 Apr 2020 • Sean MacAvaney, Franco Maria Nardini, Raffaele Perego, Nicola Tonellotto, Nazli Goharian, Ophir Frieder
Deep pretrained transformer networks are effective at various ranking tasks, such as question answering and ad-hoc document ranking.
1 code implementation • 29 Apr 2020 • Sean MacAvaney, Franco Maria Nardini, Raffaele Perego, Nicola Tonellotto, Nazli Goharian, Ophir Frieder
We show that the proposed heuristics can be used to build a training curriculum that down-weights difficult samples early in the training process.
1 code implementation • 29 Apr 2020 • Sean MacAvaney, Franco Maria Nardini, Raffaele Perego, Nicola Tonellotto, Nazli Goharian, Ophir Frieder
We also observe that the performance is additive with the current leading first-stage retrieval methods, further narrowing the gap between inexpensive and cost-prohibitive passage ranking approaches.
no code implementations • 9 Jan 2020 • Ida Mele, Nicola Tonellotto, Ophir Frieder, Raffaele Perego
The results of queries characterized by a topic are kept in the fraction of the cache dedicated to it.
1 code implementation • 1 Jul 2017 • Sean MacAvaney, Andrew Yates, Kai Hui, Ophir Frieder
One challenge with neural ranking is the need for a large amount of manually-labeled relevance judgments for training.
no code implementations • LREC 2016 • Andrew Yates, Alek Kolcz, Nazli Goharian, Ophir Frieder
In this work we use a larger feed to investigate the effects of sampling on Twitter trend detection.
no code implementations • LREC 2014 • Andrew Yates, Jon Parker, Nazli Goharian, Ophir Frieder
With the rapid growth of social media, there is increasing potential to augment traditional public health surveillance methods with data from social media.