Search Results for author: Bugra Tekin

Found 18 papers, 6 papers with code

X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization

1 code implementation28 Mar 2024 Anna Kukleva, Fadime Sener, Edoardo Remelli, Bugra Tekin, Eric Sauser, Bernt Schiele, Shugao Ma

Lately, there has been growing interest in adapting vision-language models (VLMs) to image and third-person video classification due to their success in zero-shot recognition.

Video Classification Zero-Shot Learning

DiffH2O: Diffusion-Based Synthesis of Hand-Object Interactions from Textual Descriptions

no code implementations26 Mar 2024 Sammy Christen, Shreyas Hampali, Fadime Sener, Edoardo Remelli, Tomas Hodan, Eric Sauser, Shugao Ma, Bugra Tekin

In the grasping stage, the model only generates hand motions, whereas in the interaction phase both hand and object poses are synthesized.

Object

FoundPose: Unseen Object Pose Estimation with Foundation Features

no code implementations30 Nov 2023 Evin Pınar Örnek, Yann Labbé, Bugra Tekin, Lingni Ma, Cem Keskin, Christian Forster, Tomas Hodan

Pose hypotheses are then generated from 2D-3D correspondences established by matching DINOv2 patch features between the query image and a retrieved template, and finally optimized by featuremetric refinement.

6D Pose Estimation Object +1

Context-Aware Sequence Alignment using 4D Skeletal Augmentation

1 code implementation CVPR 2022 Taein Kwon, Bugra Tekin, Siyu Tang, Marc Pollefeys

Temporal alignment of fine-grained human actions in videos is important for numerous applications in computer vision, robotics, and mixed reality.

Hand Pose Estimation Mixed Reality +1

Learning to Align Sequential Actions in the Wild

no code implementations CVPR 2022 Weizhe Liu, Bugra Tekin, Huseyin Coskun, Vibhav Vineet, Pascal Fua, Marc Pollefeys

To this end, we propose an approach to enforce temporal priors on the optimal transport matrix, which leverages temporal consistency, while allowing for variations in the order of actions.

Representation Learning

Reconstructing and grounding narrated instructional videos in 3D

no code implementations9 Sep 2021 Dimitri Zhukov, Ignacio Rocco, Ivan Laptev, Josef Sivic, Johannes L. Schönberger, Bugra Tekin, Marc Pollefeys

Contrary to the standard scenario of instance-level 3D reconstruction, where identical objects or scenes are present in all views, objects in different instructional videos may have large appearance variations given varying conditions and versions of the same product.

3D Reconstruction

HoloLens 2 Research Mode as a Tool for Computer Vision Research

1 code implementation25 Aug 2020 Dorin Ungureanu, Federica Bogo, Silvano Galliani, Pooja Sama, Xin Duan, Casey Meekhof, Jan Stühmer, Thomas J. Cashman, Bugra Tekin, Johannes L. Schönberger, Pawel Olszta, Marc Pollefeys

Mixed reality headsets, such as the Microsoft HoloLens 2, are powerful sensing devices with integrated compute capabilities, which makes it an ideal platform for computer vision research.

Mixed Reality

H+O: Unified Egocentric Recognition of 3D Hand-Object Poses and Interactions

1 code implementation CVPR 2019 Bugra Tekin, Federica Bogo, Marc Pollefeys

Given a single RGB image, our model jointly estimates the 3D hand and object poses, models their interactions, and recognizes the object and action classes with a single feed-forward pass through a neural network.

Object

Real-Time Seamless Single Shot 6D Object Pose Prediction

5 code implementations CVPR 2018 Bugra Tekin, Sudipta N. Sinha, Pascal Fua

For single object and multiple object pose estimation on the LINEMOD and OCCLUSION datasets, our approach substantially outperforms other recent CNN-based approaches when they are all used without post-processing.

6D Pose Estimation using RGB Drone Pose Estimation +2

Direct Prediction of 3D Body Poses from Motion Compensated Sequences

no code implementations CVPR 2016 Bugra Tekin, Artem Rozantsev, Vincent Lepetit, Pascal Fua

We propose an efficient approach to exploiting motion information from consecutive frames of a video sequence to recover the 3D pose of people.

3D Human Pose Estimation

Predicting People's 3D Poses from Short Sequences

no code implementations30 Apr 2015 Bugra Tekin, Xiaolu Sun, Xinchao Wang, Vincent Lepetit, Pascal Fua

We propose an efficient approach to exploiting motion information from consecutive frames of a video sequence to recover the 3D pose of people.

Cannot find the paper you are looking for? You can Submit a new open access paper.