Search Results for author: Srikrishna Karanam

Found 33 papers, 5 papers with code

A Systematic Evaluation and Benchmark for Person Re-Identification: Features, Metrics, and Datasets

3 code implementations • 31 May 2016 • Srikrishna Karanam, Mengran Gou, Ziyan Wu, Angels Rates-Borras, Octavia Camps, Richard J. Radke

To ensure a fair comparison, all of the approaches were implemented using a unified code library that includes 11 feature extraction algorithms and 22 metric learning and ranking techniques.

Metric Learning Person Re-Identification

195

Paper
Code

Learning Local RGB-to-CAD Correspondences for Object Pose Estimation

1 code implementation • ICCV 2019 • Georgios Georgakis, Srikrishna Karanam, Ziyan Wu, Jana Kosecka

In this paper, we solve this key problem of existing methods requiring expensive 3D pose annotations by proposing a new method that matches RGB images to CAD models for object pose estimation.

Object Pose Estimation

169

Paper
Code

Towards Visually Explaining Variational Autoencoders

2 code implementations • CVPR 2020 • Wenqian Liu, Runze Li, Meng Zheng, Srikrishna Karanam, Ziyan Wu, Bir Bhanu, Richard J. Radke, Octavia Camps

We present methods to generate visual attention from the learned latent space, and also demonstrate such attention explanations serve more than just explaining VAE predictions.

Disentanglement

Paper
Code

Learning Hierarchical Attention for Weakly-supervised Chest X-Ray Abnormality Localization and Diagnosis

1 code implementation • 23 Dec 2021 • Xi Ouyang, Srikrishna Karanam, Ziyan Wu, Terrence Chen, Jiayu Huo, Xiang Sean Zhou, Qian Wang, Jie-Zhi Cheng

However, doing this accurately will require a large amount of disease localization annotations by clinical experts, a task that is prohibitively expensive to accomplish for most applications.

Decision Making

Paper
Code

Sharpen Focus: Learning with Attention Separability and Consistency

1 code implementation • ICCV 2019 • Lezi Wang, Ziyan Wu, Srikrishna Karanam, Kuan-Chuan Peng, Rajat Vikram Singh, Bo Liu, Dimitris N. Metaxas

Recent developments in gradient-based attention modeling have seen attention maps emerge as a powerful tool for interpreting convolutional neural networks.

General Classification Image Classification

Paper
Code

End-to-end learning of keypoint detector and descriptor for pose invariant 3D matching

no code implementations • CVPR 2018 • Georgios Georgakis, Srikrishna Karanam, Ziyan Wu, Jan Ernst, Jana Kosecka

Finding correspondences between images or 3D scans is at the heart of many computer vision and image retrieval applications and is often enabled by matching local keypoint descriptors.

Image Retrieval Keypoint Detection +2

Paper
Add Code

Learning Compositional Visual Concepts with Mutual Consistency

no code implementations • CVPR 2018 • Yunye Gong, Srikrishna Karanam, Ziyan Wu, Kuan-Chuan Peng, Jan Ernst, Peter C. Doerschuk

Compositionality of semantic concepts in image synthesis and analysis is appealing as it can help in decomposing known and generatively recomposing unknown data.

Data Augmentation Face Verification +1

Paper
Add Code

Rank Persistence: Assessing the Temporal Performance of Real-World Person Re-Identification

no code implementations • 2 Jun 2017 • Srikrishna Karanam, Eric Lam, Richard J. Radke

Designing useful person re-identification systems for real-world applications requires attention to operational aspects not typically considered in academic research.

Person Re-Identification

Paper
Add Code

Measuring the Temporal Behavior of Real-World Person Re-Identification

no code implementations • 16 Aug 2018 • Meng Zheng, Srikrishna Karanam, Richard J. Radke

Designing real-world person re-identification (re-id) systems requires attention to operational aspects not typically considered in academic research.

Person Re-Identification

Paper
Add Code

Re-Identification with Consistent Attentive Siamese Networks

no code implementations • CVPR 2019 • Meng Zheng, Srikrishna Karanam, Ziyan Wu, Richard J. Radke

We propose a new deep architecture for person re-identification (re-id).

Person Re-Identification Representation Learning

Paper
Add Code

Incremental Scene Synthesis

no code implementations • NeurIPS 2019 • Benjamin Planche, Xuejian Rong, Ziyan Wu, Srikrishna Karanam, Harald Kosch, YingLi Tian, Jan Ernst, Andreas Hutter

We present a method to incrementally generate complete 2D or 3D scenes with the following properties: (a) it is globally consistent at each step according to a learned scene prior, (b) real observations of a scene can be incorporated while observing global consistency, (c) unobserved regions can be hallucinated locally in consistence with previous observations, hallucinations and global priors, and (d) hallucinations are statistical in nature, i. e., different scenes can be generated from the same observations.

Autonomous Navigation Hallucination

Paper
Add Code

Person Re-Identification With Discriminatively Trained Viewpoint Invariant Dictionaries

no code implementations • ICCV 2015 • Srikrishna Karanam, Yang Li, Richard J. Radke

This paper introduces a new approach to address the person re-identification problem in cameras with non-overlapping fields of view.

Dictionary Learning Metric Learning +1

Paper
Add Code

Towards Robust RGB-D Human Mesh Recovery

no code implementations • 18 Nov 2019 • Ren Li, Changjiang Cai, Georgios Georgakis, Srikrishna Karanam, Terrence Chen, Ziyan Wu

We consider the problem of human pose estimation.

Human Mesh Recovery Pose Estimation

Paper
Add Code

Visual Similarity Attention

no code implementations • 18 Nov 2019 • Meng Zheng, Srikrishna Karanam, Terrence Chen, Richard J. Radke, Ziyan Wu

While there has been substantial progress in learning suitable distance metrics, these techniques in general lack transparency and decision reasoning, i. e., explaining why the input set of images is similar or dissimilar.

Image Retrieval Person Re-Identification +2

Paper
Add Code

Hierarchical Kinematic Human Mesh Recovery

no code implementations • ECCV 2020 • Georgios Georgakis, Ren Li, Srikrishna Karanam, Terrence Chen, Jana Kosecka, Ziyan Wu

In this work, we address this gap by proposing a new technique for regression of human parametric model that is explicitly informed by the known hierarchical structure, including joint interdependencies of the model.

Human Mesh Recovery regression

Paper
Add Code

Towards Visually Explaining Similarity Models

no code implementations • 13 Aug 2020 • Meng Zheng, Srikrishna Karanam, Terrence Chen, Richard J. Radke, Ziyan Wu

We show that the resulting similarity models perform, and can be visually explained, better than the corresponding baseline models trained without these constraints.

Image Retrieval Metric Learning +3

Paper
Add Code

A Peek Into the Reasoning of Neural Networks: Interpreting with Structural Visual Concepts

no code implementations • CVPR 2021 • Yunhao Ge, Yao Xiao, Zhi Xu, Meng Zheng, Srikrishna Karanam, Terrence Chen, Laurent Itti, Ziyan Wu

Despite substantial progress in applying neural networks (NN) to a wide variety of areas, they still largely suffer from a lack of transparency and interpretability.

Explainable artificial intelligence Knowledge Distillation

Paper
Add Code

Everybody Is Unique: Towards Unbiased Human Mesh Recovery

no code implementations • 13 Jul 2021 • Ren Li, Meng Zheng, Srikrishna Karanam, Terrence Chen, Ziyan Wu

Next, we present a simple baseline to address this problem that is scalable and can be easily used in conjunction with existing algorithms to improve their performance.

Ranked #1 on 3D Human Shape Estimation on SSP-3D (PVE-T metric)

3D Human Pose Estimation 3D Human Shape Estimation +1

Paper
Add Code

Spatio-Temporal Representation Factorization for Video-based Person Re-Identification

no code implementations • ICCV 2021 • Abhishek Aich, Meng Zheng, Srikrishna Karanam, Terrence Chen, Amit K. Roy-Chowdhury, Ziyan Wu

To alleviate these problems, we propose Spatio-Temporal Representation Factorization (STRF), a flexible new computational unit that can be used in conjunction with most existing 3D convolutional neural network architectures for re-ID.

Ranked #2 on Person Re-Identification on DukeMTMC-VideoReID

Video-Based Person Re-Identification

Paper
Add Code

Learning Local Recurrent Models for Human Mesh Recovery

no code implementations • 27 Jul 2021 • Runze Li, Srikrishna Karanam, Ren Li, Terrence Chen, Bir Bhanu, Ziyan Wu

We conduct a variety of experiments on standard video mesh recovery benchmark datasets such as Human3. 6M, MPI-INF-3DHP, and 3DPW, demonstrating the efficacy of our design of modeling local dynamics as well as establishing state-of-the-art results based on standard evaluation metrics.

Ranked #44 on 3D Human Pose Estimation on 3DPW

3D Human Pose Estimation 3D Human Shape Estimation +1

Paper
Add Code

Ensemble Attention Distillation for Privacy-Preserving Federated Learning

no code implementations • ICCV 2021 • Xuan Gong, Abhishek Sharma, Srikrishna Karanam, Ziyan Wu, Terrence Chen, David Doermann, Arun Innanje

Such decentralized training naturally leads to issues of imbalanced or differing data distributions among the local models and challenges in fusing them into a central model.

Federated Learning Privacy Preserving

Paper
Add Code

SMPL-A: Modeling Person-Specific Deformable Anatomy

no code implementations • CVPR 2022 • Hengtao Guo, Benjamin Planche, Meng Zheng, Srikrishna Karanam, Terrence Chen, Ziyan Wu

In order to obtain accurate target location information, clinicians have to either conduct frequent intraoperative scans, resulting in higher exposition of patients to radiations, or adopt proxy procedures (e. g., creating and using custom molds to keep patients in the exact same pose during both preoperative organ scanning and subsequent treatment.

Anatomy Human Mesh Recovery

Paper
Add Code

PseudoClick: Interactive Image Segmentation with Click Imitation

no code implementations • 12 Jul 2022 • Qin Liu, Meng Zheng, Benjamin Planche, Srikrishna Karanam, Terrence Chen, Marc Niethammer, Ziyan Wu

The goal of click-based interactive image segmentation is to obtain precise object segmentation masks with limited user interaction, i. e., by a minimal number of user clicks.

Image Segmentation Segmentation +1

Paper
Add Code

Self-supervised Human Mesh Recovery with Cross-Representation Alignment

no code implementations • 10 Sep 2022 • Xuan Gong, Meng Zheng, Benjamin Planche, Srikrishna Karanam, Terrence Chen, David Doermann, Ziyan Wu

However, on synthetic dense correspondence maps (i. e., IUV) few have been explored since the domain gap between synthetic training data and real testing data is hard to address for 2D dense representation.

Human Mesh Recovery

Paper
Add Code

Preserving Privacy in Federated Learning with Ensemble Cross-Domain Knowledge Distillation

no code implementations • 10 Sep 2022 • Xuan Gong, Abhishek Sharma, Srikrishna Karanam, Ziyan Wu, Terrence Chen, David Doermann, Arun Innanje

Federated Learning (FL) is a machine learning paradigm where local nodes collaboratively train a central model while the training data remains decentralized.

Federated Learning Image Classification +4

Paper
Add Code

Audio Retrieval for Multimodal Design Documents: A New Dataset and Algorithms

no code implementations • 28 Feb 2023 • Prachi Singh, Srikrishna Karanam, Sumit Shekhar

We consider and propose a new problem of retrieving audio files relevant to multimodal design document inputs comprising both textual elements and visual imagery, e. g., birthday/greeting cards.

Retrieval

Paper
Add Code

Learning with Difference Attention for Visually Grounded Self-supervised Representations

no code implementations • 26 Jun 2023 • Aishwarya Agarwal, Srikrishna Karanam, Balaji Vasan Srinivasan

Recent works in self-supervised learning have shown impressive results on single-object images, but they struggle to perform well on complex multi-object images as evidenced by their poor visual grounding.

Self-Supervised Learning Visual Grounding

Paper
Add Code

A-STAR: Test-time Attention Segregation and Retention for Text-to-image Synthesis

no code implementations • ICCV 2023 • Aishwarya Agarwal, Srikrishna Karanam, K J Joseph, Apoorv Saxena, Koustava Goswami, Balaji Vasan Srinivasan

First, our attention segregation loss reduces the cross-attention overlap between attention maps of different concepts in the text prompt, thereby reducing the confusion/conflict among various concepts and the eventual capture of all concepts in the generated output.

Denoising Image Generation

Paper
Add Code

CoPL: Contextual Prompt Learning for Vision-Language Understanding

no code implementations • 3 Jul 2023 • Koustava Goswami, Srikrishna Karanam, Prateksha Udhayanan, K J Joseph, Balaji Vasan Srinivasan

Our key innovations over earlier works include using local image features as part of the prompt learning process, and more crucially, learning to weight these prompts based on local features that are appropriate for the task at hand.

Paper
Add Code

Learning with Multi-modal Gradient Attention for Explainable Composed Image Retrieval

no code implementations • 31 Aug 2023 • Prateksha Udhayanan, Srikrishna Karanam, Balaji Vasan Srinivasan

To this end, our key novelty is a new gradient-attention-based learning objective that explicitly forces the model to focus on the local regions of interest being modified in each retrieval step.

Image Retrieval Retrieval

Paper
Add Code

Iterative Multi-granular Image Editing using Diffusion Models

no code implementations • 1 Sep 2023 • K J Joseph, Prateksha Udhayanan, Tripti Shukla, Aishwarya Agarwal, Srikrishna Karanam, Koustava Goswami, Balaji Vasan Srinivasan

We hope our work would attract attention to this newly identified, pragmatic problem setting.

Image Generation

Paper
Add Code

An Image is Worth Multiple Words: Multi-attribute Inversion for Constrained Text-to-Image Synthesis

no code implementations • 20 Nov 2023 • Aishwarya Agarwal, Srikrishna Karanam, Tripti Shukla, Balaji Vasan Srinivasan

Another line of techniques expand the inversion space to learn multiple embeddings but they do this only along the layer dimension (e. g., one per layer of the DDPM model) or the timestep dimension (one for a set of timesteps in the denoising process), leading to suboptimal attribute disentanglement.

Attribute Denoising +2

Paper
Add Code

Approximate Caching for Efficiently Serving Diffusion Models

no code implementations • 7 Dec 2023 • Shubham Agarwal, Subrata Mitra, Sarthak Chakraborty, Srikrishna Karanam, Koyel Mukherjee, Shiv Saini

Text-to-image generation using diffusion models has seen explosive popularity owing to their ability in producing high quality images adhering to text prompts.

Denoising Management +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.