Search Results for author: Srikrishna Karanam

Found 33 papers, 5 papers with code

A Systematic Evaluation and Benchmark for Person Re-Identification: Features, Metrics, and Datasets

3 code implementations31 May 2016 Srikrishna Karanam, Mengran Gou, Ziyan Wu, Angels Rates-Borras, Octavia Camps, Richard J. Radke

To ensure a fair comparison, all of the approaches were implemented using a unified code library that includes 11 feature extraction algorithms and 22 metric learning and ranking techniques.

Metric Learning Person Re-Identification

Learning Local RGB-to-CAD Correspondences for Object Pose Estimation

1 code implementation ICCV 2019 Georgios Georgakis, Srikrishna Karanam, Ziyan Wu, Jana Kosecka

In this paper, we solve this key problem of existing methods requiring expensive 3D pose annotations by proposing a new method that matches RGB images to CAD models for object pose estimation.

Object Pose Estimation

Towards Visually Explaining Variational Autoencoders

2 code implementations CVPR 2020 Wenqian Liu, Runze Li, Meng Zheng, Srikrishna Karanam, Ziyan Wu, Bir Bhanu, Richard J. Radke, Octavia Camps

We present methods to generate visual attention from the learned latent space, and also demonstrate such attention explanations serve more than just explaining VAE predictions.

Disentanglement

Learning Hierarchical Attention for Weakly-supervised Chest X-Ray Abnormality Localization and Diagnosis

1 code implementation23 Dec 2021 Xi Ouyang, Srikrishna Karanam, Ziyan Wu, Terrence Chen, Jiayu Huo, Xiang Sean Zhou, Qian Wang, Jie-Zhi Cheng

However, doing this accurately will require a large amount of disease localization annotations by clinical experts, a task that is prohibitively expensive to accomplish for most applications.

Decision Making

Sharpen Focus: Learning with Attention Separability and Consistency

1 code implementation ICCV 2019 Lezi Wang, Ziyan Wu, Srikrishna Karanam, Kuan-Chuan Peng, Rajat Vikram Singh, Bo Liu, Dimitris N. Metaxas

Recent developments in gradient-based attention modeling have seen attention maps emerge as a powerful tool for interpreting convolutional neural networks.

General Classification Image Classification

End-to-end learning of keypoint detector and descriptor for pose invariant 3D matching

no code implementations CVPR 2018 Georgios Georgakis, Srikrishna Karanam, Ziyan Wu, Jan Ernst, Jana Kosecka

Finding correspondences between images or 3D scans is at the heart of many computer vision and image retrieval applications and is often enabled by matching local keypoint descriptors.

Image Retrieval Keypoint Detection +2

Learning Compositional Visual Concepts with Mutual Consistency

no code implementations CVPR 2018 Yunye Gong, Srikrishna Karanam, Ziyan Wu, Kuan-Chuan Peng, Jan Ernst, Peter C. Doerschuk

Compositionality of semantic concepts in image synthesis and analysis is appealing as it can help in decomposing known and generatively recomposing unknown data.

Data Augmentation Face Verification +1

Rank Persistence: Assessing the Temporal Performance of Real-World Person Re-Identification

no code implementations2 Jun 2017 Srikrishna Karanam, Eric Lam, Richard J. Radke

Designing useful person re-identification systems for real-world applications requires attention to operational aspects not typically considered in academic research.

Person Re-Identification

Measuring the Temporal Behavior of Real-World Person Re-Identification

no code implementations16 Aug 2018 Meng Zheng, Srikrishna Karanam, Richard J. Radke

Designing real-world person re-identification (re-id) systems requires attention to operational aspects not typically considered in academic research.

Person Re-Identification

Incremental Scene Synthesis

no code implementations NeurIPS 2019 Benjamin Planche, Xuejian Rong, Ziyan Wu, Srikrishna Karanam, Harald Kosch, YingLi Tian, Jan Ernst, Andreas Hutter

We present a method to incrementally generate complete 2D or 3D scenes with the following properties: (a) it is globally consistent at each step according to a learned scene prior, (b) real observations of a scene can be incorporated while observing global consistency, (c) unobserved regions can be hallucinated locally in consistence with previous observations, hallucinations and global priors, and (d) hallucinations are statistical in nature, i. e., different scenes can be generated from the same observations.

Autonomous Navigation Hallucination

Visual Similarity Attention

no code implementations18 Nov 2019 Meng Zheng, Srikrishna Karanam, Terrence Chen, Richard J. Radke, Ziyan Wu

While there has been substantial progress in learning suitable distance metrics, these techniques in general lack transparency and decision reasoning, i. e., explaining why the input set of images is similar or dissimilar.

Image Retrieval Person Re-Identification +2

Hierarchical Kinematic Human Mesh Recovery

no code implementations ECCV 2020 Georgios Georgakis, Ren Li, Srikrishna Karanam, Terrence Chen, Jana Kosecka, Ziyan Wu

In this work, we address this gap by proposing a new technique for regression of human parametric model that is explicitly informed by the known hierarchical structure, including joint interdependencies of the model.

Human Mesh Recovery regression

Towards Visually Explaining Similarity Models

no code implementations13 Aug 2020 Meng Zheng, Srikrishna Karanam, Terrence Chen, Richard J. Radke, Ziyan Wu

We show that the resulting similarity models perform, and can be visually explained, better than the corresponding baseline models trained without these constraints.

Image Retrieval Metric Learning +3

Everybody Is Unique: Towards Unbiased Human Mesh Recovery

no code implementations13 Jul 2021 Ren Li, Meng Zheng, Srikrishna Karanam, Terrence Chen, Ziyan Wu

Next, we present a simple baseline to address this problem that is scalable and can be easily used in conjunction with existing algorithms to improve their performance.

 Ranked #1 on 3D Human Shape Estimation on SSP-3D (PVE-T metric)

3D Human Pose Estimation 3D Human Shape Estimation +1

Spatio-Temporal Representation Factorization for Video-based Person Re-Identification

no code implementations ICCV 2021 Abhishek Aich, Meng Zheng, Srikrishna Karanam, Terrence Chen, Amit K. Roy-Chowdhury, Ziyan Wu

To alleviate these problems, we propose Spatio-Temporal Representation Factorization (STRF), a flexible new computational unit that can be used in conjunction with most existing 3D convolutional neural network architectures for re-ID.

Video-Based Person Re-Identification

Learning Local Recurrent Models for Human Mesh Recovery

no code implementations27 Jul 2021 Runze Li, Srikrishna Karanam, Ren Li, Terrence Chen, Bir Bhanu, Ziyan Wu

We conduct a variety of experiments on standard video mesh recovery benchmark datasets such as Human3. 6M, MPI-INF-3DHP, and 3DPW, demonstrating the efficacy of our design of modeling local dynamics as well as establishing state-of-the-art results based on standard evaluation metrics.

3D Human Pose Estimation 3D Human Shape Estimation +1

Ensemble Attention Distillation for Privacy-Preserving Federated Learning

no code implementations ICCV 2021 Xuan Gong, Abhishek Sharma, Srikrishna Karanam, Ziyan Wu, Terrence Chen, David Doermann, Arun Innanje

Such decentralized training naturally leads to issues of imbalanced or differing data distributions among the local models and challenges in fusing them into a central model.

Federated Learning Privacy Preserving

SMPL-A: Modeling Person-Specific Deformable Anatomy

no code implementations CVPR 2022 Hengtao Guo, Benjamin Planche, Meng Zheng, Srikrishna Karanam, Terrence Chen, Ziyan Wu

In order to obtain accurate target location information, clinicians have to either conduct frequent intraoperative scans, resulting in higher exposition of patients to radiations, or adopt proxy procedures (e. g., creating and using custom molds to keep patients in the exact same pose during both preoperative organ scanning and subsequent treatment.

Anatomy Human Mesh Recovery

PseudoClick: Interactive Image Segmentation with Click Imitation

no code implementations12 Jul 2022 Qin Liu, Meng Zheng, Benjamin Planche, Srikrishna Karanam, Terrence Chen, Marc Niethammer, Ziyan Wu

The goal of click-based interactive image segmentation is to obtain precise object segmentation masks with limited user interaction, i. e., by a minimal number of user clicks.

Image Segmentation Segmentation +1

Self-supervised Human Mesh Recovery with Cross-Representation Alignment

no code implementations10 Sep 2022 Xuan Gong, Meng Zheng, Benjamin Planche, Srikrishna Karanam, Terrence Chen, David Doermann, Ziyan Wu

However, on synthetic dense correspondence maps (i. e., IUV) few have been explored since the domain gap between synthetic training data and real testing data is hard to address for 2D dense representation.

Human Mesh Recovery

Preserving Privacy in Federated Learning with Ensemble Cross-Domain Knowledge Distillation

no code implementations10 Sep 2022 Xuan Gong, Abhishek Sharma, Srikrishna Karanam, Ziyan Wu, Terrence Chen, David Doermann, Arun Innanje

Federated Learning (FL) is a machine learning paradigm where local nodes collaboratively train a central model while the training data remains decentralized.

Federated Learning Image Classification +4

Audio Retrieval for Multimodal Design Documents: A New Dataset and Algorithms

no code implementations28 Feb 2023 Prachi Singh, Srikrishna Karanam, Sumit Shekhar

We consider and propose a new problem of retrieving audio files relevant to multimodal design document inputs comprising both textual elements and visual imagery, e. g., birthday/greeting cards.

Retrieval

Learning with Difference Attention for Visually Grounded Self-supervised Representations

no code implementations26 Jun 2023 Aishwarya Agarwal, Srikrishna Karanam, Balaji Vasan Srinivasan

Recent works in self-supervised learning have shown impressive results on single-object images, but they struggle to perform well on complex multi-object images as evidenced by their poor visual grounding.

Self-Supervised Learning Visual Grounding

A-STAR: Test-time Attention Segregation and Retention for Text-to-image Synthesis

no code implementations ICCV 2023 Aishwarya Agarwal, Srikrishna Karanam, K J Joseph, Apoorv Saxena, Koustava Goswami, Balaji Vasan Srinivasan

First, our attention segregation loss reduces the cross-attention overlap between attention maps of different concepts in the text prompt, thereby reducing the confusion/conflict among various concepts and the eventual capture of all concepts in the generated output.

Denoising Image Generation

CoPL: Contextual Prompt Learning for Vision-Language Understanding

no code implementations3 Jul 2023 Koustava Goswami, Srikrishna Karanam, Prateksha Udhayanan, K J Joseph, Balaji Vasan Srinivasan

Our key innovations over earlier works include using local image features as part of the prompt learning process, and more crucially, learning to weight these prompts based on local features that are appropriate for the task at hand.

Learning with Multi-modal Gradient Attention for Explainable Composed Image Retrieval

no code implementations31 Aug 2023 Prateksha Udhayanan, Srikrishna Karanam, Balaji Vasan Srinivasan

To this end, our key novelty is a new gradient-attention-based learning objective that explicitly forces the model to focus on the local regions of interest being modified in each retrieval step.

Image Retrieval Retrieval

An Image is Worth Multiple Words: Multi-attribute Inversion for Constrained Text-to-Image Synthesis

no code implementations20 Nov 2023 Aishwarya Agarwal, Srikrishna Karanam, Tripti Shukla, Balaji Vasan Srinivasan

Another line of techniques expand the inversion space to learn multiple embeddings but they do this only along the layer dimension (e. g., one per layer of the DDPM model) or the timestep dimension (one for a set of timesteps in the denoising process), leading to suboptimal attribute disentanglement.

Attribute Denoising +2

Approximate Caching for Efficiently Serving Diffusion Models

no code implementations7 Dec 2023 Shubham Agarwal, Subrata Mitra, Sarthak Chakraborty, Srikrishna Karanam, Koyel Mukherjee, Shiv Saini

Text-to-image generation using diffusion models has seen explosive popularity owing to their ability in producing high quality images adhering to text prompts.

Denoising Management +1

Cannot find the paper you are looking for? You can Submit a new open access paper.