Search Results for author: Urun Dogan

Found 16 papers, 5 papers with code

Pearl: A Production-ready Reinforcement Learning Agent

1 code implementation • 6 Dec 2023 • Zheqing Zhu, Rodrigo de Salvo Braz, Jalaj Bhandari, Daniel Jiang, Yi Wan, Yonathan Efroni, Liyuan Wang, Ruiyang Xu, Hongbo Guo, Alex Nikulkov, Dmytro Korenkevych, Urun Dogan, Frank Cheng, Zheng Wu, Wanqiao Xu

Reinforcement Learning (RL) offers a versatile framework for achieving long-term goals.

reinforcement-learning Reinforcement Learning (RL)

2,368

Paper
Code

IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control

no code implementations • 1 Jun 2023 • Rohan Chitnis, Yingchen Xu, Bobak Hashemi, Lucas Lehnert, Urun Dogan, Zheqing Zhu, Olivier Delalleau

Model-based reinforcement learning (RL) has shown great promise due to its sample efficiency, but still struggles with long-horizon sparse-reward tasks, especially in offline settings where the agent learns from a fixed dataset.

D4RL Model-based Reinforcement Learning +4

Paper
Add Code

Offline RL With Resource Constrained Online Deployment

no code implementations • 7 Oct 2021 • Jayanth Reddy Regatti, Aniket Anand Deshmukh, Frank Cheng, Young Hun Jung, Abhishek Gupta, Urun Dogan

We address this performance gap with a policy transfer algorithm which first trains a teacher agent using the offline dataset where features are fully available, and then transfers this knowledge to a student agent that only uses the resource-constrained features.

D4RL Offline RL

Paper
Add Code

Offline Reinforcement Learning with Resource Constrained Online Deployment

no code implementations • 29 Sep 2021 • Jayanth Reddy Regatti, Aniket Anand Deshmukh, Young Hun Jung, Frank Cheng, Abhishek Gupta, Urun Dogan

D4RL Offline RL +2

Paper
Add Code

Representation Learning for Clustering via Building Consensus

1 code implementation • 4 May 2021 • Aniket Anand Deshmukh, Jayanth Reddy Regatti, Eren Manavoglu, Urun Dogan

Recent advances in deep clustering and unsupervised representation learning are based on the idea that different views of an input image (generated through data augmentation techniques) must be close in the representation space (exemplar consistency), and/or similar images must have similar cluster assignments (population consistency).

Ranked #3 on Image Clustering on ImageNet-10

Clustering Data Augmentation +3

Paper
Code

Semantic Hashing with Locality Sensitive Embeddings

no code implementations • 1 Jan 2021 • Levi Boyles, Aniket Anand Deshmukh, Urun Dogan, Rajesh Koduru, Charles Denis, Eren Manavoglu

Semantic hashing methods have been explored for learning transformations into binary vector spaces.

Retrieval

Paper
Add Code

Consensus Clustering With Unsupervised Representation Learning

no code implementations • 3 Oct 2020 • Jayanth Reddy Regatti, Aniket Anand Deshmukh, Eren Manavoglu, Urun Dogan

Recent advances in deep clustering and unsupervised representation learning are based on the idea that different views of an input image (generated through data augmentation techniques) must either be closer in the representation space, or have a similar cluster assignment.

Clustering Data Augmentation +4

Paper
Add Code

Zero Shot Domain Generalization

1 code implementation • 17 Aug 2020 • Udit Maniyar, Joseph K J, Aniket Anand Deshmukh, Urun Dogan, Vineeth N. Balasubramanian

Domain generalization (DG) methods try to learn a model that when trained on data from multiple domains, would generalize to a new unseen domain.

Domain Generalization

Paper
Code

Self-Supervised Contextual Bandits in Computer Vision

no code implementations • 18 Mar 2020 • Aniket Anand Deshmukh, Abhimanu Kumar, Levi Boyles, Denis Charles, Eren Manavoglu, Urun Dogan

In the usual self-supervision, we learn implicit labels from the training data for a secondary task.

Clustering Colorization +3

Paper
Add Code

Data Transformation Insights in Self-supervision with Clustering Tasks

no code implementations • 18 Feb 2020 • Abhimanu Kumar, Aniket Anand Deshmukh, Urun Dogan, Denis Charles, Eren Manavoglu

We show faster convergence rate with valid transformations for convex as well as certain family of non-convex objectives along with the proof of convergence to the original set of optima.

Clustering valid

Paper
Add Code

Label-similarity Curriculum Learning

no code implementations • ECCV 2020 • Urun Dogan, Aniket Anand Deshmukh, Marcin Machura, Christian Igel

We propose a novel curriculum learning approach for image classification that adapts the loss function by changing the label representation.

Classification General Classification +1

Paper
Add Code

A Generalization Error Bound for Multi-class Domain Generalization

no code implementations • 24 May 2019 • Aniket Anand Deshmukh, Yunwen Lei, Srinagesh Sharma, Urun Dogan, James W. Cutler, Clayton Scott

Domain generalization is the problem of assigning labels to an unlabeled data set, given several similar data sets for which labels have been provided.

Classification Domain Generalization +2

Paper
Add Code

Domain Generalization by Marginal Transfer Learning

2 code implementations • 21 Nov 2017 • Gilles Blanchard, Aniket Anand Deshmukh, Urun Dogan, Gyemin Lee, Clayton Scott

In the problem of domain generalization (DG), there are labeled training data sets from several related prediction problems, and the goal is to make accurate predictions on future unlabeled data sets that are not known to the learner.

Domain Generalization General Classification +1

1,332

Paper
Code

Data-dependent Generalization Bounds for Multi-class Classification

no code implementations • 29 Jun 2017 • Yunwen Lei, Urun Dogan, Ding-Xuan Zhou, Marius Kloft

In this paper, we study data-dependent generalization error bounds exhibiting a mild dependency on the number of classes, making them suitable for multi-class learning with a large number of label classes.

Classification General Classification +2

Paper
Add Code

Multi-Task Learning for Contextual Bandits

no code implementations • NeurIPS 2017 • Aniket Anand Deshmukh, Urun Dogan, Clayton Scott

Contextual bandits are a form of multi-armed bandit in which the agent has access to predictive side information (known as the context) for each arm at each time step, and have been used to model personalized news recommendation, ad placement, and other applications.

Multi-Armed Bandits Multi-Task Learning +1

Paper
Add Code

Distributed Optimization of Multi-Class SVMs

1 code implementation • 25 Nov 2016 • Maximilian Alber, Julian Zimmert, Urun Dogan, Marius Kloft

Training of one-vs.-rest SVMs can be parallelized over the number of classes in a straight forward way.

Distributed Optimization General Classification +2

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.