1 code implementation • EMNLP 2021 • Xuewen Yang, Svebor Karaman, Joel Tetreault, Alex Jaimes
The task of news article image captioning aims to generate descriptive and informative captions for news article images.
1 code implementation • CVPR 2020 • Alireza Zareian, Svebor Karaman, Shih-Fu Chang
Scene Graph Generation (SGG) aims to extract entities, predicates and their semantic structure from images, enabling deep understanding of visual content, with many applications such as visual reasoning and image retrieval.
1 code implementation • ECCV 2020 • Alireza Zareian, Svebor Karaman, Shih-Fu Chang
Scene graphs are powerful representations that parse images into their abstract semantic elements, i. e., objects and their interactions, which facilitates visual comprehension and explainable reasoning.
no code implementations • 10 Dec 2019 • Shiyuan Huang, Xudong Lin, Svebor Karaman, Shih-Fu Chang
Recent works instead use modern compressed video modalities as an alternative to the RGB spatial stream and improve the inference speed by orders of magnitudes.
1 code implementation • 15 Jul 2019 • Xu Zhang, Svebor Karaman, Shih-Fu Chang
By using the simulated images to train a spectrum based classifier, even without seeing the fake images produced by the targeted GAN model during training, our approach achieves state-of-the-art performances on detecting fake images generated by popular GAN models such as CycleGAN.
no code implementations • 4 Mar 2019 • Svebor Karaman, Xudong Lin, Xuefeng Hu, Shih-Fu Chang
We propose an unsupervised hashing method which aims to produce binary codes that preserve the ranking induced by a real-valued representation.
1 code implementation • CVPR 2019 • Hassan Akbari, Svebor Karaman, Surabhi Bhargava, Brian Chen, Carl Vondrick, Shih-Fu Chang
Following dedicated non-linear mappings for visual features at each level, word, and sentence embeddings, we obtain multiple instantiations of our common semantic space in which comparisons between any target text and the visual content is performed with cosine similarity.
Ranked #1 on Phrase Grounding on ReferIt
1 code implementation • ICLR 2019 • Xu Zhang, Felix Xinnan Yu, Svebor Karaman, Wei zhang, Shih-Fu Chang
Metric learning aims at learning a distance which is consistent with the semantic meaning of the samples.
no code implementations • 23 Jul 2018 • Philipp Blandfort, Desmond Patton, William R. Frey, Svebor Karaman, Surabhi Bhargava, Fei-Tzin Lee, Siddharth Varia, Chris Kedzie, Michael B. Gaskell, Rossano Schifanella, Kathleen McKeown, Shih-Fu Chang
In this paper we partnered computer scientists with social work researchers, who have domain expertise in gang violence, to analyze how public tweets with images posted by youth who mention gang associations on Twitter can be leveraged to automatically detect psychosocial factors and conditions that could potentially assist social workers and violence outreach workers in prevention and early intervention programs.
1 code implementation • CVPR 2017 • Xu Zhang, Felix X. Yu, Svebor Karaman, Shih-Fu Chang
Specifically, we extend the covariant constraint proposed by Lenc and Vedaldi by defining the concepts of "standard patch" and "canonical feature" and leverage these to train a novel robust covariant detector.
1 code implementation • Computer Vision and Pattern Recognition 2017 • Xu Zhang, Felix X. Yu, Svebor Karaman, Shih-Fu Chang
Specifically, we extend the covariant constraint proposed by Lenc and Vedaldi [8] by defining the concepts of “standard patch” and “canonical feature” and leverage these to train a novel robust covariant detector.
no code implementations • 8 Jul 2016 • Giuseppe Lisanti, Svebor Karaman, Iacopo Masi
In this paper we introduce a method to overcome one of the main challenges of person re-identification in multi-camera networks, namely cross-view appearance changes.
no code implementations • 16 Jun 2016 • Jie Feng, Svebor Karaman, I-Hong Jhuo, Shih-Fu Chang
Learning-based hashing is often used in large scale image retrieval as they provide a compact representation of each sample and the Hamming distance can be used to efficiently compare two samples.
no code implementations • 28 Jul 2015 • Giuseppe Lisanti, Svebor Karaman, Daniele Pezzatini, Alberto del Bimbo
In this paper we present a machine vision system to efficiently monitor, analyze and present visual data acquired with a railway overhead gantry equipped with multiple cameras.
no code implementations • 14 Jun 2011 • Svebor Karaman, Jenny Benois-Pineau, Rémi Mégret
In this paper, we propose a new, scalable approach for the task of object based image search or object recognition.