Search Results for author: Robby Wagner

Found 1 papers, 0 papers with code

The Impact of Main Content Extraction on Near-Duplicate Detection

no code implementations21 Nov 2021 Maik Fröbe, Matthias Hagen, Janek Bevendorff, Michael Völske, Benno Stein, Christopher Schröder, Robby Wagner, Lukas Gienapp, Martin Potthast

Commercial web search engines employ near-duplicate detection to ensure that users see each relevant result only once, albeit the underlying web crawls typically include (near-)duplicates of many web pages.

Information Retrieval Retrieval

Cannot find the paper you are looking for? You can Submit a new open access paper.