Search Results for author: Sinisa Todorovic

Found 38 papers, 9 papers with code

Markov Game Video Augmentation for Action Segmentation

no code implementations ICCV 2023 Nicolas Aziere, Sinisa Todorovic

Our key novelty is that we augment the original training videos in the deep feature space, not in the visual spatiotemporal domain as done by previous work.

Action Segmentation Data Augmentation

Bidirectional Alignment for Domain Adaptive Detection with Transformers

1 code implementation ICCV 2023 Liqiang He, Wei Wang, Albert Chen, Min Sun, Cheng-Hao Kuo, Sinisa Todorovic

We propose a Bidirectional Alignment for domain adaptive Detection with Transformers (BiADT) to improve cross domain object detection performance.

Object object-detection +1

iFS-RCNN: An Incremental Few-shot Instance Segmenter

1 code implementation CVPR 2022 Khoi Nguyen, Sinisa Todorovic

This paper addresses incremental few-shot instance segmentation, where a few examples of new object classes arrive when access to training examples of old classes is not available anymore, and the goal is to perform well on both old and new classes.

Instance Segmentation Object +1

DESTR: Object Detection With Split Transformer

no code implementations CVPR 2022 Liqiang He, Sinisa Todorovic

Second, we use a mini-detector to initialize the content queries in the decoder with classification and regression embeddings of the respective heads in the mini-detector.

Classification Inductive Bias +4

A Weakly Supervised Amodal Segmenter with Boundary Uncertainty Estimation

1 code implementation ICCV 2021 Khoi Nguyen, Sinisa Todorovic

The resulting predictions on training images are taken as the pseudo-ground truth for the standard training of Mask-RCNN, which we use for amodal instance segmentation of test images.

Amodal Instance Segmentation Segmentation +1

Action Shuffle Alternating Learning for Unsupervised Action Segmentation

no code implementations CVPR 2021 Jun Li, Sinisa Todorovic

Our SSL trains an RNN to recognize positive and negative action sequences, and the RNN's hidden layer is taken as our new action-level feature embedding.

Action Segmentation Segmentation +1

Anchor-Constrained Viterbi for Set-Supervised Action Segmentation

no code implementations CVPR 2021 Jun Li, Sinisa Todorovic

This paper is about action segmentation under weak supervision in training, where the ground truth provides only a set of actions present, but neither their temporal ordering nor when they occur in a training video.

Action Segmentation Segmentation +1

FAPIS: A Few-shot Anchor-free Part-based Instance Segmenter

1 code implementation CVPR 2021 Khoi Nguyen, Sinisa Todorovic

This paper is about few-shot instance segmentation, where training and test image sets do not share the same object classes.

Few-Shot Learning Instance Segmentation +3

A Self-supervised GAN for Unsupervised Few-shot Object Recognition

no code implementations16 Aug 2020 Khoi Nguyen, Sinisa Todorovic

This paper addresses unsupervised few-shot object recognition, where all training images are unlabeled, and test images are divided into queries and a few labeled support images per object class of interest.

Object Object Recognition +1

Set-Constrained Viterbi for Set-Supervised Action Segmentation

no code implementations CVPR 2020 Jun Li, Sinisa Todorovic

This paper is about weakly supervised action segmentation, where the ground truth specifies only a set of actions present in a training video, but not their true temporal ordering.

Action Segmentation Multiple Instance Learning +1

Feature Weighting and Boosting for Few-Shot Segmentation

1 code implementation ICCV 2019 Khoi Nguyen, Sinisa Todorovic

Finally, the target object is segmented in the query image by using a cosine similarity between the class feature vector and the query's feature map.

Few-Shot Semantic Segmentation Segmentation

Weakly Supervised Energy-Based Learning for Action Segmentation

1 code implementation ICCV 2019 Jun Li, Peng Lei, Sinisa Todorovic

This paper is about labeling video frames with action classes under weak supervision in training, where we have access to a temporal ordering of actions, but their start and end frames in training videos are unknown.

Segmentation valid +3

Fine-Grained Recognition as HSnet Search for Informative Image Parts

no code implementations CVPR 2017 Michael Lam, Behrooz Mahasseni, Sinisa Todorovic

This motivates us to formulate our problem as a sequential search for informative parts over a deep feature map produced by a deep Convolutional Neural Network (CNN).

Fine-Grained Image Classification Informativeness

Unsupervised Video Summarization With Adversarial LSTM Networks

1 code implementation CVPR 2017 Behrooz Mahasseni, Michael Lam, Sinisa Todorovic

The summarizer is the autoencoder long short-term memory network (LSTM) aimed at, first, selecting video frames, and then decoding the obtained summarization for reconstructing the input video.

Unsupervised Video Summarization

Budget-Aware Deep Semantic Video Segmentation

no code implementations CVPR 2017 Behrooz Mahasseni, Sinisa Todorovic, Alan Fern

In this work, we study a poorly understood trade-off between accuracy and runtime costs for deep semantic video segmentation.

Action Detection Activity Detection +4

CERN: Confidence-Energy Recurrent Network for Group Activity Recognition

no code implementations CVPR 2017 Tianmin Shu, Sinisa Todorovic, Song-Chun Zhu

This work is about recognizing human activities occurring in videos at distinct semantic levels, including individual actions, interactions, and group activities.

Group Activity Recognition

Boundary Flow: A Siamese Network that Predicts Boundary Motion without Training on Motion

no code implementations CVPR 2018 Peng Lei, Fuxin Li, Sinisa Todorovic

Using deep learning, this paper addresses the problem of joint object boundary detection and boundary motion estimation in videos, which we named boundary flow estimation.

Boundary Detection Motion Estimation +2

Approximate Policy Iteration for Budgeted Semantic Video Segmentation

no code implementations26 Jul 2016 Behrooz Mahasseni, Sinisa Todorovic, Alan Fern

Our second contribution is the algorithm for learning a policy for the sparse selection of supervoxels and their descriptors for budgeted CRF inference.

Video Segmentation Video Semantic Segmentation

Modeling and Inferring Human Intents and Latent Functional Objects for Trajectory Prediction

no code implementations24 Jun 2016 Dan Xie, Tianmin Shu, Sinisa Todorovic, Song-Chun Zhu

This paper is about detecting functional objects and inferring human intentions in surveillance videos of public spaces.

Clustering Trajectory Prediction

Regularizing Long Short Term Memory With 3D Human-Skeleton Sequences for Action Recognition

no code implementations CVPR 2016 Behrooz Mahasseni, Sinisa Todorovic

This paper argues that large-scale action recognition in video can be greatly improved by providing an additional modality in training data -- namely, 3D human-skeleton sequences -- aimed at complementing poorly represented or missing features of human actions in the training videos.

Action Recognition Temporal Action Localization

Monocular Depth Estimation Using Neural Regression Forest

no code implementations CVPR 2016 Anirban Roy, Sinisa Todorovic

This paper presents a novel deep architecture, called neural regression forest (NRF), for depth estimation from a single image.

Monocular Depth Estimation regression

Tree-Cut for Probabilistic Image Segmentation

no code implementations11 Jun 2015 Shell X. Hu, Christopher K. I. Williams, Sinisa Todorovic

This paper presents a new probabilistic generative model for image segmentation, i. e. the task of partitioning an image into homogeneous regions.

Image Segmentation Segmentation +2

HC-Search for Structured Prediction in Computer Vision

no code implementations CVPR 2015 Michael Lam, Janardhan Rao Doppa, Sinisa Todorovic, Thomas G. Dietterich

The mainstream approach to structured prediction problems in computer vision is to learn an energy function such that the solution minimizes that function.

Monocular Depth Estimation object-detection +3

Latent Trees for Estimating Intensity of Facial Action Units

no code implementations CVPR 2015 Sebastian Kaltwang, Sinisa Todorovic, Maja Pantic

Our model is a latent tree (LT) that represents input features of facial landmark points and FAU intensities as leaf nodes, and encodes their higher-order dependencies with latent nodes at tree levels closer to the root.

Person Count Localization in Videos From Noisy Foreground and Detections

no code implementations CVPR 2015 Sheng Chen, Alan Fern, Sinisa Todorovic

This problem is a middle-ground between frame-level person counting, which does not localize counts, and person detection aimed at perfectly localizing people with count-one detections.

Foreground Segmentation Human Detection +1

Joint Inference of Groups, Events and Human Roles in Aerial Videos

no code implementations CVPR 2015 Tianmin Shu, Dan Xie, Brandon Rothrock, Sinisa Todorovic, Song-Chun Zhu

This paper addresses a new problem of parsing low-resolution aerial videos of large spatial areas, in terms of 1) grouping, 2) recognizing events and 3) assigning roles to people engaged in events.

Multi-Object Tracking via Constrained Sequential Labeling

no code implementations CVPR 2014 Sheng Chen, Alan Fern, Sinisa Todorovic

This paper presents a new approach to tracking people in crowded scenes, where people are subject to long-term (partial) occlusions and may assume varying postures and articulations.

Multi-Object Tracking Object

(RF)^2 -- Random Forest Random Field

no code implementations NeurIPS 2010 Nadia Payet, Sinisa Todorovic

We use these class histograms for a non-parametric estimation of the distribution ratios.

Object Recognition

Segmentation as Maximum-Weight Independent Set

no code implementations NeurIPS 2010 William Brendel, Sinisa Todorovic

The algorithm seeks a solution directly in the discrete domain, instead of relaxing MWIS to a continuous problem, as common in previous work.

Segmentation

Cannot find the paper you are looking for? You can Submit a new open access paper.