no code implementations • 14 Jun 2023 • Tarannum Zaki, Michael L. Nelson, Michele C. Weigle
Screenshots are prevalent on social media as a common approach for information sharing.
1 code implementation • 30 Apr 2023 • Lesley Frew, Michael L. Nelson, Michele C. Weigle
We present a change text search engine that allows users to find changes in webpages.
no code implementations • 17 Nov 2022 • Caleb Bradford, Michael L. Nelson
We developed software that automatically makes search queries utilizing the body of alleged tweets to a variety of services (Google, Snopes built-in search, and Reuters built-in search) in an effort to find fact-check articles and other evidence of supposedly made tweets.
no code implementations • 6 Aug 2021 • Sawood Alam, Michele C. Weigle, Michael L. Nelson
Prior work on web archive profiling were focused on Archival Holdings to describe what is present in an archive.
no code implementations • 8 Mar 2021 • Shawn M. Jones, Michele C. Weigle, Martin Klein, Michael L. Nelson
With these observations, we are motivated to quantify the levels of inclusion of required metadata in web resources, its evolution over time for archived resources, and create and evaluate an algorithm to automatically select a striking image for social cards.
Digital Libraries Human-Computer Interaction
no code implementations • 7 Dec 2020 • Yasith Jayawardana, Alexander C. Nwala, Gavindya Jayawardena, Jian Wu, Sampath Jayarathna, Michael L. Nelson, C. Lee Giles
The vastness of the web imposes a prohibitive cost on building large-scale search engines with limited resources.
no code implementations • 1 Aug 2020 • Shawn M. Jones, Martin Klein, Michele C. Weigle, Michael L. Nelson
Search engines and social media platforms often represent web pages as cards consisting of text snippets, titles, and images.
no code implementations • 1 Aug 2020 • Shawn M. Jones, Alexander C. Nwala, Martin Klein, Michele C. Weigle, Michael L. Nelson
StoryGraph clusters news articles together to identify a common news story.
1 code implementation • 3 Jun 2020 • Abigail Mabe, Dhruv Patel, Maheedhar Gunnam, Surbhi Shankar, Mat Kelly, Sawood Alam, Michael L. Nelson, Michele C. Weigle
Embed codes for the image grid and image slider can be produced to include these on separate webpages.
Digital Libraries
no code implementations • 22 Mar 2020 • Alexander C. Nwala, Michele C. Weigle, Michael L. Nelson
From these articles, StoryGraph extracts named entities (PEOPLE, LOCATIONS, ORGANIZATIONS, etc.)
no code implementations • 7 Aug 2019 • Lulwah M. Alkwai, Michael L. Nelson, Michele C. Weigle
This is because web archives are typically accessed by URI lookup, and the response is binary: the archive either has the page or it does not, and the user will not know of other archived web pages that exist and are potentially similar to the requested web page.
no code implementations • 17 Jun 2019 • Sawood Alam, Plinio Vargas, Michele C. Weigle, Michael L. Nelson
Certain HTTP Cookies on certain sites can be a source of content bias in archival crawls.
Digital Libraries
1 code implementation • 29 May 2019 • Alexander C. Nwala, Michele C. Weigle, Michael L. Nelson
In this work, we studied three social media platforms in order to provide insight on the characteristics of seeds generated from different sources.
1 code implementation • 29 May 2019 • Mohamed Aturban, Sawood Alam, Michael L. Nelson, Michele C. Weigle
In the Atomic approach, the fixity information of each archived web page is stored in a JSON file (or a manifest), and published in a well-known web location (an Archival Fixity server) before it is disseminated to several on-demand web archives.
Digital Libraries
1 code implementation • 9 May 2019 • Mohamed Aturban, Michael L. Nelson, Michele C. Weigle, Martin Klein, Herbert Van de Sompel
First, we used the Los Alamos National Laboratory (LANL) Memento Aggregator to collect mementos of an initial set of URIs obtained from four sources: (a) the Moz Top 500, (b) the dataset used in our previous study, (c) the HTTP Archive, and (d) the Web Archives for Historical Research group.
Digital Libraries
1 code implementation • 18 Jun 2018 • Shawn M. Jones, Michele C. Weigle, Michael L. Nelson
We document the implementation of each of these similarity measures.
no code implementations • 8 Dec 2017 • Mohamed Aturban, Michael L. Nelson, Michele C. Weigle
We show that state-of-the-art services for creating trusted timestamps in blockchain-based networks do not adequately allow for timestamping of web pages.
Digital Libraries
2 code implementations • 20 Jun 2015 • Shawn M. Jones, Michael L. Nelson
We find that when accessing fan wiki pages in the Internet Archive there is as much as a 66% chance of encountering a spoiler.
Digital Libraries H.3.7
2 code implementations • 16 Jun 2014 • Shawn M. Jones, Michael L. Nelson, Harihar Shankar, Herbert Van de Sompel
In addition to implementing Memento, Version 2. 0 allows administrators to choose the optional 200-style datetime negotiation Pattern 1. 2 instead of Pattern 2. 1.
Digital Libraries H.3.7
2 code implementations • 5 Feb 2014 • Scott G. Ainsworth, Michael L. Nelson, Herbert Van de Sompel
Most archived HTML pages embed other web resources, such as images and stylesheets.
Digital Libraries H.3.7
no code implementations • 23 Jul 2008 • Martin Klein, Michael L. Nelson
Intuitively this value is different from document frequency (DF), the number of documents (e. g., web pages) a certain term occurs in.
Information Retrieval Digital Libraries H.3.0
no code implementations • 24 Mar 2005 • Michael L. Nelson
Reverse engineering has been a standard practice in the hardware community for some time.
Software Engineering