no code implementations • 27 Jul 2024 • Qun Li, Baoquan Sun, Fu Xiao, Yonggang Qi, Bir Bhanu
We propose Sym-Net, a novel framework for Few-Shot Segmentation (FSS) that addresses the critical issue of intra-class variation by jointly learning both query and support prototypes in a symmetrical manner.
no code implementations • 6 Sep 2023 • Hengyue Liu, Bir Bhanu
As each representation's cardinality has different trade-offs between performance and computation overhead, extracting highly representative features efficiently and dynamically is both challenging and crucial for SGG.
no code implementations • 12 Apr 2023 • Runze Li, Dahun Kim, Bir Bhanu, Weicheng Kuo
We present RECLIP (Resource-efficient CLIP), a simple method that minimizes computational resource footprint for CLIP (Contrastive Language Image Pretraining).
no code implementations • 18 Jul 2022 • Runze Li, Pan Ji, Yi Xu, Bir Bhanu
As compared to outdoor environments, estimating depth of monocular videos for indoor environments, using self-supervised methods, results in two additional challenges: (i) the depth range of indoor video sequences varies a lot across different frames, making it difficult for the depth network to induce consistent depth cues for training; (ii) the indoor sequences recorded with handheld devices often contain much more rotational motions, which cause difficulties for the pose network to predict accurate relative camera poses.
1 code implementation • 22 Apr 2022 • Qun Li, Ziyi Zhang, Fu Xiao, Feng Zhang, Bir Bhanu
A high-resolution network exhibits remarkable capability in extracting multi-scale features for human pose estimation, but fails to capture long-range interactions between joints and has high computational complexity.
Ranked #30 on
Pose Estimation
on COCO test-dev
no code implementations • 17 Aug 2021 • Runze Li, Tomaso Fontanini, Luca Donati, Andrea Prati, Bir Bhanu
Gradient-based attention modeling has been used widely as a way to visualize and understand convolutional neural networks.
no code implementations • 5 Aug 2021 • Akash Gupta, Padmaja Jonnalagedda, Bir Bhanu, Amit K. Roy-Chowdhury
Specifically, meta-learning is employed to obtain adaptive parameters, using a large-scale external dataset, that can adapt quickly to the novel condition (degradation model) of the given test video during the internal learning task, thereby exploiting external and internal information of a video for super-resolution.
no code implementations • 27 Jul 2021 • Runze Li, Srikrishna Karanam, Ren Li, Terrence Chen, Bir Bhanu, Ziyan Wu
We conduct a variety of experiments on standard video mesh recovery benchmark datasets such as Human3. 6M, MPI-INF-3DHP, and 3DPW, demonstrating the efficacy of our design of modeling local dynamics as well as establishing state-of-the-art results based on standard evaluation metrics.
Ranked #54 on
3D Human Pose Estimation
on 3DPW
no code implementations • ICCV 2021 • Pan Ji, Runze Li, Bir Bhanu, Yi Xu
The effectiveness of each module is shown through a carefully conducted ablation study and the demonstration of the state-of-the-art performance on three indoor datasets, \ie, EuRoC, NYUv2, and 7-scenes.
1 code implementation • CVPR 2021 • Hengyue Liu, Ning Yan, Masood S. Mortazavi, Bir Bhanu
This paper presents a fully convolutional scene graph generation (FCSGG) model that detects objects and relations simultaneously.
no code implementations • 1 Nov 2020 • Hengyue Liu, Samyak Parajuli, Jesse Hostetler, Sek Chai, Bir Bhanu
Conditional computation for Deep Neural Networks (DNNs) reduce overall computational load and improve model accuracy by running a subset of the network.
no code implementations • 14 May 2020 • Padmaja Jonnalagedda, Brent Weinberg, Jason Allen, Taejin L. Min, Shiv Bhanu, Bir Bhanu
While deep learning approaches have shown remarkable performance in many imaging tasks, most of these methods rely on availability of large quantities of data.
no code implementations • MIDL 2019 • Padmaja Jonnalagedda, Brent Weinberg, Jason Allen, Bir Bhanu
Various mutations have been shown to correlate with prognosis of High-Grade Glioma (Glioblastoma).
2 code implementations • CVPR 2020 • Wenqian Liu, Runze Li, Meng Zheng, Srikrishna Karanam, Ziyan Wu, Bir Bhanu, Richard J. Radke, Octavia Camps
We present methods to generate visual attention from the learned latent space, and also demonstrate such attention explanations serve more than just explaining VAE predictions.
no code implementations • CVPR 2014 • Xiaojing Chen, Zhen Qin, Le An, Bir Bhanu
We introduce an online approach to learn possible elementary groups (groups that contain only two targets) for inferring high level context that can be used to improve multi-target tracking in a data-association based framework.