Search Results for author: Michael L. Nelson

Found 22 papers, 9 papers with code

Extracting Information from Twitter Screenshots

no code implementations 14 Jun 2023 Tarannum Zaki, Michael L. Nelson, Michele C. Weigle

Screenshots are prevalent on social media as a common approach for information sharing.

Misinformation

Making Changes in Webpages Discoverable: A Change-Text Search Interface for Web Archives

1 code implementation 30 Apr 2023 Lesley Frew, Michael L. Nelson, Michele C. Weigle

We present a change text search engine that allows users to find changes in webpages.

Did They Really Tweet That? Querying Fact-Checking Sites and Politwoops to Determine Tweet Misattribution

no code implementations 17 Nov 2022 Caleb Bradford, Michael L. Nelson

We developed software that automatically makes search queries utilizing the body of alleged tweets to a variety of services (Google, Snopes built-in search, and Reuters built-in search) in an effort to find fact-check articles and other evidence of supposedly made tweets.

Fact Checking

Profiling Web Archival Voids for Memento Routing

no code implementations 6 Aug 2021 Sawood Alam, Michele C. Weigle, Michael L. Nelson

Prior work on web archive profiling focused on Archival Holdings to describe what is present in an archive.

Automatically Selecting Striking Images for Social Cards

no code implementations 8 Mar 2021 Shawn M. Jones, Michele C. Weigle, Martin Klein, Michael L. Nelson

With these observations, we are motivated to quantify the levels of inclusion of required metadata in web resources, its evolution over time for archived resources, and create and evaluate an algorithm to automatically select a striking image for social cards.

Digital Libraries, Human-Computer Interaction

Modeling Updates of Scholarly Webpages Using Archived Data

no code implementations 7 Dec 2020 Yasith Jayawardana, Alexander C. Nwala, Gavindya Jayawardena, Jian Wu, Sampath Jayarathna, Michael L. Nelson, C. Lee Giles

The vastness of the web imposes a prohibitive cost on building large-scale search engines with limited resources.

MementoEmbed and Raintale for Web Archive Storytelling

no code implementations 1 Aug 2020 Shawn M. Jones, Martin Klein, Michele C. Weigle, Michael L. Nelson

Search engines and social media platforms often represent web pages as cards consisting of text snippets, titles, and images.

Visualizing Webpage Changes Over Time

1 code implementation 3 Jun 2020 Abigail Mabe, Dhruv Patel, Maheedhar Gunnam, Surbhi Shankar, Mat Kelly, Sawood Alam, Michael L. Nelson, Michele C. Weigle

Embed codes for the image grid and image slider can be produced to include these on separate webpages.

Digital Libraries

365 Dots in 2019: Quantifying Attention of News Sources

no code implementations 22 Mar 2020 Alexander C. Nwala, Michele C. Weigle, Michael L. Nelson

From these articles, StoryGraph extracts named entities (PEOPLE, LOCATIONS, ORGANIZATIONS, etc.)

Making Recommendations from Web Archives for "Lost" Web Pages

no code implementations 7 Aug 2019 Lulwah M. Alkwai, Michael L. Nelson, Michele C. Weigle

This is because web archives are typically accessed by URI lookup, and the response is binary: the archive either has the page or it does not, and the user will not know of other archived web pages that exist and are potentially similar to the requested web page.

Impact of HTTP Cookie Violations in Web Archives

no code implementations 17 Jun 2019 Sawood Alam, Plinio Vargas, Michele C. Weigle, Michael L. Nelson

Certain HTTP Cookies on certain sites can be a source of content bias in archival crawls.

Digital Libraries

Using Micro-collections in Social Media to Generate Seeds for Web Archive Collections

1 code implementation 29 May 2019 Alexander C. Nwala, Michele C. Weigle, Michael L. Nelson

In this work, we studied three social media platforms in order to provide insight on the characteristics of seeds generated from different sources.

Archive Assisted Archival Fixity Verification Framework

1 code implementation 29 May 2019 Mohamed Aturban, Sawood Alam, Michael L. Nelson, Michele C. Weigle

In the Atomic approach, the fixity information of each archived web page is stored in a JSON file (or a manifest), and published in a well-known web location (an Archival Fixity server) before it is disseminated to several on-demand web archives.

Digital Libraries

Collecting 16K archived web pages from 17 public web archives

1 code implementation 9 May 2019 Mohamed Aturban, Michael L. Nelson, Michele C. Weigle, Martin Klein, Herbert Van de Sompel

First, we used the Los Alamos National Laboratory (LANL) Memento Aggregator to collect mementos of an initial set of URIs obtained from four sources: (a) the Moz Top 500, (b) the dataset used in our previous study, (c) the HTTP Archive, and (d) the Web Archives for Historical Research group.

Digital Libraries

The Off-Topic Memento Toolkit

1 code implementation 18 Jun 2018 Shawn M. Jones, Michele C. Weigle, Michael L. Nelson

We document the implementation of each of these similarity measures.

Difficulties of Timestamping Archived Web Pages

no code implementations 8 Dec 2017 Mohamed Aturban, Michael L. Nelson, Michele C. Weigle

We show that state-of-the-art services for creating trusted timestamps in blockchain-based networks do not adequately allow for timestamping of web pages.

Digital Libraries

Avoiding Spoilers in Fan Wikis of Episodic Fiction

2 code implementations 20 Jun 2015 Shawn M. Jones, Michael L. Nelson

We find that when accessing fan wiki pages in the Internet Archive there is as much as a 66% chance of encountering a spoiler.

Digital Libraries, H.3.7

Bringing Web Time Travel to MediaWiki: An Assessment of the Memento MediaWiki Extension

2 code implementations 16 Jun 2014 Shawn M. Jones, Michael L. Nelson, Harihar Shankar, Herbert Van de Sompel

In addition to implementing Memento, Version 2.0 allows administrators to choose the optional 200-style datetime negotiation Pattern 1.2 instead of Pattern 2.1.

Digital Libraries, H.3.7

A Framework for Evaluation of Composite Memento Temporal Coherence

2 code implementations 5 Feb 2014 Scott G. Ainsworth, Michael L. Nelson, Herbert Van de Sompel

Most archived HTML pages embed other web resources, such as images and stylesheets.

Digital Libraries, H.3.7

Approximating Document Frequency with Term Count Values

no code implementations 23 Jul 2008 Martin Klein, Michael L. Nelson

Intuitively, this value is different from document frequency (DF), the number of documents (e.g., web pages) a certain term occurs in.

Information Retrieval, Digital Libraries, H.3.0

A Survey of Reverse Engineering and Program Comprehension

no code implementations 24 Mar 2005 Michael L. Nelson

Reverse engineering has been a standard practice in the hardware community for some time.

Software Engineering
