Search Results for author: Huseyin Coskun

Found 15 papers, 4 papers with code

Slicing Vision Transformer for Flexible Inference

1 code implementation6 Dec 2024 Yitian Zhang, Huseyin Coskun, Xu Ma, Huan Wang, Ke Ma, Xi, Chen, Derek Hao Hu, Yun Fu

Thus, we propose a general framework, named Scala, to enable a single network to represent multiple smaller ViTs with flexible inference capability, which aligns with the inherent design of ViT to vary from widths.

Scalable Ranked Preference Optimization for Text-to-Image Generation

no code implementations23 Oct 2024 Shyamgopal Karthik, Huseyin Coskun, Zeynep Akata, Sergey Tulyakov, Jian Ren, Anil Kag

In this work, we investigate a scalable approach for collecting large-scale and fully synthetic datasets for DPO training.

Text-to-Image Generation

GOCA: Guided Online Cluster Assignment for Self-Supervised Video Representation Learning

1 code implementation20 Jul 2022 Huseyin Coskun, Alireza Zareian, Joshua L. Moore, Federico Tombari, Chen Wang

Specifically, we outperform the state of the art by 7% on UCF and 4% on HMDB for video retrieval, and 5% on UCF and 6% on HMDB for video classification

Action Recognition Clustering +6

Transformers in Action: Weakly Supervised Action Segmentation

no code implementations14 Jan 2022 John Ridley, Huseyin Coskun, David Joseph Tan, Nassir Navab, Federico Tombari

The video action segmentation task is regularly explored under weaker forms of supervision, such as transcript supervision, where a list of actions is easier to obtain than dense frame-wise labels.

Action Segmentation

Learning to Align Sequential Actions in the Wild

no code implementations CVPR 2022 Weizhe Liu, Bugra Tekin, Huseyin Coskun, Vibhav Vineet, Pascal Fua, Marc Pollefeys

To this end, we propose an approach to enforce temporal priors on the optimal transport matrix, which leverages temporal consistency, while allowing for variations in the order of actions.

Representation Learning

Learning by Aligning Videos in Time

no code implementations CVPR 2021 Sanjay Haresh, Sateesh Kumar, Huseyin Coskun, Shahram Najam Syed, Andrey Konin, Muhammad Zeeshan Zia, Quoc-Huy Tran

To overcome this problem, we propose a temporal regularization term (i. e., Contrastive-IDM) which encourages different frames to be mapped to different points in the embedding space.

Representation Learning Retrieval +1

Direct and indirect transactions and requirements

no code implementations23 Nov 2019 Husna Betul Coskun, Huseyin Coskun

The indirect transactions between sectors of an economic system has been a long-standing open problem.

Novel Concepts

Human Motion Analysis with Deep Metric Learning

2 code implementations ECCV 2018 Huseyin Coskun, David Joseph Tan, Sailesh Conjeti, Nassir Navab, Federico Tombari

Nevertheless, we believe that traditional approaches such as L2 distance or Dynamic Time Warping based on hand-crafted local pose metrics fail to appropriately capture the semantic relationship across motions and, as such, are not suitable for being employed as metrics within these tasks.

Dynamic Time Warping Metric Learning +2

Long Short-Term Memory Kalman Filters: Recurrent Neural Estimators for Pose Regularization

no code implementations ICCV 2017 Huseyin Coskun, Felix Achilles, Robert DiPietro, Nassir Navab, Federico Tombari

One-shot pose estimation for tasks such as body joint localization, camera pose estimation, and object tracking are generally noisy, and temporal filters have been extensively used for regularization.

Camera Pose Estimation Object Tracking +1

Long Short-Term Memory Kalman Filters:Recurrent Neural Estimators for Pose Regularization

no code implementations6 Aug 2017 Huseyin Coskun, Felix Achilles, Robert DiPietro, Nassir Navab, Federico Tombari

One-shot pose estimation for tasks such as body joint localization, camera pose estimation, and object tracking are generally noisy, and temporal filters have been extensively used for regularization.

Camera Pose Estimation Object Tracking +1

Cannot find the paper you are looking for? You can Submit a new open access paper.