Search Results for author: Christopher Thomas

Found 17 papers, 3 papers with code

Predicting the Politics of an Image Using Webly Supervised Data

1 code implementation NeurIPS 2019 Christopher Thomas, Adriana Kovashka

We collect a dataset of over one million unique images and associated news articles from left- and right-leaning news sources, and develop a method to predict the image's political leaning.

Weakly-Supervised Temporal Article Grounding

1 code implementation22 Oct 2022 Long Chen, Yulei Niu, Brian Chen, Xudong Lin, Guangxing Han, Christopher Thomas, Hammad Ayyubi, Heng Ji, Shih-Fu Chang

Specifically, given an article and a relevant video, WSAG aims to localize all ``groundable'' sentences to the video, and these sentences are possibly at different semantic scales.

Natural Language Queries Sentence +1

Fine-Grained Visual Entailment

1 code implementation29 Mar 2022 Christopher Thomas, YiPeng Zhang, Shih-Fu Chang

In this paper, we propose an extension of this task, where the goal is to predict the logical relationship of fine-grained knowledge elements within a piece of text to an image.

Multimodal Reasoning Visual Entailment

Automatic Understanding of Image and Video Advertisements

no code implementations CVPR 2017 Zaeem Hussain, Mingda Zhang, Xiaozhong Zhang, Keren Ye, Christopher Thomas, Zuha Agha, Nathan Ong, Adriana Kovashka

There is more to images than their objective physical content: for example, advertisements are created to persuade a viewer to take a certain action.

OpenSalicon: An Open Source Implementation of the Salicon Saliency Model

no code implementations1 Jun 2016 Christopher Thomas

In this technical report, we present our publicly downloadable implementation of the SALICON saliency model.

Seeing Behind the Camera: Identifying the Authorship of a Photograph

no code implementations CVPR 2016 Christopher Thomas, Adriana Kovashka

To explore the feasibility of current computer vision techniques to address this problem, we created a new dataset of over 180, 000 images taken by 41 well-known photographers.

Persuasive Faces: Generating Faces in Advertisements

no code implementations25 Jul 2018 Christopher Thomas, Adriana Kovashka

We show how our model can be used to produce visually distinct faces which appear to be from a fixed ad topic category.

Face Generation Generative Adversarial Network

Artistic Object Recognition by Unsupervised Style Adaptation

no code implementations28 Dec 2018 Christopher Thomas, Adriana Kovashka

To do so, we introduce a complementary training modality constructed to be similar in artistic style to the target domain, and enforce that the network learns features that are invariant between the two training modalities.

Domain Adaptation Object +3

A Scalable Real-Time Architecture for Neural Oscillation Detection and Phase-Specific Stimulation

no code implementations15 Sep 2020 Christopher Thomas, Thilo Womelsdorf

Oscillations in the local field potential (LFP) of the brain are key signatures of neural information processing.

Learning to Transfer Visual Effects from Videos to Images

no code implementations3 Dec 2020 Christopher Thomas, Yale Song, Adriana Kovashka

We study the problem of animating images by transferring spatio-temporal visual effects (such as melting) from a collection of videos.

Optical Flow Estimation

Enhanced Chart Understanding in Vision and Language Task via Cross-modal Pre-training on Plot Table Pairs

no code implementations29 May 2023 Mingyang Zhou, Yi R. Fung, Long Chen, Christopher Thomas, Heng Ji, Shih-Fu Chang

Building cross-model intelligence that can understand charts and communicate the salient information hidden behind them is an appealing challenge in the vision and language(V+L) community.

Chart Question Answering Question Answering +1

Cannot find the paper you are looking for? You can Submit a new open access paper.