no code implementations • 26 Aug 2024 • Meng Zheng, Benjamin Planche, Zhongpai Gao, Terrence Chen, Richard J. Radke, Ziyan Wu
Conventional 3D medical image segmentation methods typically require learning heavy 3D networks (e. g., 3D-UNet), as well as large amounts of in-domain data with accurate pixel/voxel-level labels to avoid overfitting.
no code implementations • 16 Jun 2024 • Zhuoxu Duan, Zhengye Yang, Samuel Westby, Christoph Riedl, Brooke Foucault Welles, Richard J. Radke
Large language models like GPT have proven widely successful on natural language understanding tasks based on written text documents.
no code implementations • 10 Jul 2022 • Ashraful Islam, Ben Lundell, Harpreet Sawhney, Sudipta Sinha, Peter Morales, Richard J. Radke
We evaluate our SSL approach on two downstream tasks -- object detection and semantic segmentation, using COCO, PASCAL VOC, and CityScapes datasets.
1 code implementation • NeurIPS 2021 • Ashraful Islam, Chun-Fu Chen, Rameswar Panda, Leonid Karlinsky, Rogerio Feris, Richard J. Radke
As the base dataset and unlabeled dataset are from different domains, projecting the target images in the class-domain of the base dataset with a fixed pretrained model might be sub-optimal.
no code implementations • 13 Aug 2020 • Meng Zheng, Srikrishna Karanam, Terrence Chen, Richard J. Radke, Ziyan Wu
We show that the resulting similarity models perform, and can be visually explained, better than the corresponding baseline models trained without these constraints.
1 code implementation • 21 Jan 2020 • Ashraful Islam, Richard J. Radke
We propose a classification module to generate action labels for each segment in the video, and a deep metric learning module to learn the similarity between different action instances.
Ranked #1 on Temporal Action Localization on ActivityNet-1.2
no code implementations • 26 Nov 2019 • Gyanendra Sharma, Richard J. Radke
In this paper, we present a multi-user interaction interface for a large immersive space that supports simultaneous screen interactions by combining (1) user input via personal smartphones and Bluetooth microphones, (2) spatial tracking via an overhead array of Kinect sensors, and (3) WebSocket interfaces to a webpage running on the large screen.
2 code implementations • CVPR 2020 • Wenqian Liu, Runze Li, Meng Zheng, Srikrishna Karanam, Ziyan Wu, Bir Bhanu, Richard J. Radke, Octavia Camps
We present methods to generate visual attention from the learned latent space, and also demonstrate such attention explanations serve more than just explaining VAE predictions.
no code implementations • 18 Nov 2019 • Meng Zheng, Srikrishna Karanam, Terrence Chen, Richard J. Radke, Ziyan Wu
While there has been substantial progress in learning suitable distance metrics, these techniques in general lack transparency and decision reasoning, i. e., explaining why the input set of images is similar or dissimilar.
no code implementations • ACL 2019 • Manling Li, Lingyu Zhang, Heng Ji, Richard J. Radke
Transcripts of natural, multi-person meetings differ significantly from documents like news articles, which can make Natural Language Generation models for generating summaries unfocused.
no code implementations • CVPR 2019 • Meng Zheng, Srikrishna Karanam, Ziyan Wu, Richard J. Radke
We propose a new deep architecture for person re-identification (re-id).
no code implementations • 16 Aug 2018 • Meng Zheng, Srikrishna Karanam, Richard J. Radke
Designing real-world person re-identification (re-id) systems requires attention to operational aspects not typically considered in academic research.
no code implementations • 2 Jun 2017 • Srikrishna Karanam, Eric Lam, Richard J. Radke
Designing useful person re-identification systems for real-world applications requires attention to operational aspects not typically considered in academic research.
2 code implementations • 31 May 2016 • Srikrishna Karanam, Mengran Gou, Ziyan Wu, Angels Rates-Borras, Octavia Camps, Richard J. Radke
To ensure a fair comparison, all of the approaches were implemented using a unified code library that includes 11 feature extraction algorithms and 22 metric learning and ranking techniques.
no code implementations • ICCV 2015 • Srikrishna Karanam, Yang Li, Richard J. Radke
This paper introduces a new approach to address the person re-identification problem in cameras with non-overlapping fields of view.