no code implementations • 14 Jun 2023 • Tarannum Zaki, Michael L. Nelson, Michele C. Weigle
Screenshots are prevalent on social media as a common approach for information sharing.
1 code implementation • 30 Apr 2023 • Lesley Frew, Michael L. Nelson, Michele C. Weigle
We present a change text search engine that allows users to find changes in webpages.
no code implementations • 6 Aug 2021 • Sawood Alam, Michele C. Weigle, Michael L. Nelson
Prior work on web archive profiling were focused on Archival Holdings to describe what is present in an archive.
no code implementations • 8 Mar 2021 • Shawn M. Jones, Michele C. Weigle, Martin Klein, Michael L. Nelson
With these observations, we are motivated to quantify the levels of inclusion of required metadata in web resources, its evolution over time for archived resources, and create and evaluate an algorithm to automatically select a striking image for social cards.
Digital Libraries Human-Computer Interaction
no code implementations • 1 Aug 2020 • Shawn M. Jones, Martin Klein, Michele C. Weigle, Michael L. Nelson
Search engines and social media platforms often represent web pages as cards consisting of text snippets, titles, and images.
no code implementations • 1 Aug 2020 • Shawn M. Jones, Alexander C. Nwala, Martin Klein, Michele C. Weigle, Michael L. Nelson
StoryGraph clusters news articles together to identify a common news story.
1 code implementation • 3 Jun 2020 • Abigail Mabe, Dhruv Patel, Maheedhar Gunnam, Surbhi Shankar, Mat Kelly, Sawood Alam, Michael L. Nelson, Michele C. Weigle
Embed codes for the image grid and image slider can be produced to include these on separate webpages.
Digital Libraries
no code implementations • 22 Mar 2020 • Alexander C. Nwala, Michele C. Weigle, Michael L. Nelson
From these articles, StoryGraph extracts named entities (PEOPLE, LOCATIONS, ORGANIZATIONS, etc.)
no code implementations • 7 Aug 2019 • Lulwah M. Alkwai, Michael L. Nelson, Michele C. Weigle
This is because web archives are typically accessed by URI lookup, and the response is binary: the archive either has the page or it does not, and the user will not know of other archived web pages that exist and are potentially similar to the requested web page.
no code implementations • 17 Jun 2019 • Sawood Alam, Plinio Vargas, Michele C. Weigle, Michael L. Nelson
Certain HTTP Cookies on certain sites can be a source of content bias in archival crawls.
Digital Libraries
1 code implementation • 29 May 2019 • Alexander C. Nwala, Michele C. Weigle, Michael L. Nelson
In this work, we studied three social media platforms in order to provide insight on the characteristics of seeds generated from different sources.
1 code implementation • 29 May 2019 • Mohamed Aturban, Sawood Alam, Michael L. Nelson, Michele C. Weigle
In the Atomic approach, the fixity information of each archived web page is stored in a JSON file (or a manifest), and published in a well-known web location (an Archival Fixity server) before it is disseminated to several on-demand web archives.
Digital Libraries
1 code implementation • 9 May 2019 • Mohamed Aturban, Michael L. Nelson, Michele C. Weigle, Martin Klein, Herbert Van de Sompel
First, we used the Los Alamos National Laboratory (LANL) Memento Aggregator to collect mementos of an initial set of URIs obtained from four sources: (a) the Moz Top 500, (b) the dataset used in our previous study, (c) the HTTP Archive, and (d) the Web Archives for Historical Research group.
Digital Libraries
1 code implementation • 18 Jun 2018 • Shawn M. Jones, Michele C. Weigle, Michael L. Nelson
We document the implementation of each of these similarity measures.
no code implementations • 8 Dec 2017 • Mohamed Aturban, Michael L. Nelson, Michele C. Weigle
We show that state-of-the-art services for creating trusted timestamps in blockchain-based networks do not adequately allow for timestamping of web pages.
Digital Libraries