no code implementations • 13 Jun 2024 • Arda Sarp Yenicesu, Furkan B. Mutlu, Suleyman S. Kozat, Ozgur S. Oguz
The utilization of the experience replay mechanism enables agents to effectively leverage their experiences on several occasions.
1 code implementation • 23 Jan 2024 • Mehmet E. Lorasdagi, Mehmet Y. Turali, Ali T. Koc, Suleyman S. Kozat
However, no study has introduced a training-free framework for a generic ML model to select features while considering the importance of the feature subsets as a whole, instead of focusing on the individual features.
1 code implementation • 20 Jan 2024 • Mehmet Y. Turali, Mehmet E. Lorasdagi, Ali T. Koc, Suleyman S. Kozat
In particular, we do the joint optimization and binary masking to continuously adapt the set of features and model parameters during the training process.
no code implementations • 26 Oct 2023 • Aysin Tumay, Mustafa E. Aydın, Ali T. Koc, Suleyman S. Kozat
This hierarchical structure allows for flexible depth and feature selection.
no code implementations • 19 Sep 2023 • Mustafa E. Aydın, Arda Fazla, Suleyman S. Kozat
We achieve this by introducing novel state space representations for the base models, which are then combined to provide a full state space representation of the hybrid or the ensemble.
2 code implementations • 10 Oct 2022 • Baturay Saglam, Doga Gurgunoglu, Suleyman S. Kozat
We introduce a novel deep reinforcement learning (DRL) approach to jointly optimize transmit beamforming and reconfigurable intelligent surface (RIS) phase shifts in a multiuser multiple input single output (MU-MISO) system to maximize the sum downlink rate under the phase-dependent reflection amplitude model.
2 code implementations • 1 Oct 2022 • Baturay Saglam, Suleyman S. Kozat
In continuous control, exploration is often performed through undirected strategies in which parameters of the networks or selected actions are perturbed by random noise.
1 code implementation • 1 Sep 2022 • Baturay Saglam, Furkan B. Mutlu, Dogan C. Cicek, Suleyman S. Kozat
A widely-studied deep reinforcement learning (RL) technique known as Prioritized Experience Replay (PER) allows agents to learn from transitions sampled with non-uniform probability proportional to their temporal-difference (TD) error.
no code implementations • 7 Aug 2022 • Hakan Gokcesu, Suleyman S. Kozat
We study the prediction with expert advice setting, where the aim is to produce a decision by combining the decisions generated by a set of experts, e. g., independently running algorithms.
1 code implementation • 1 Aug 2022 • Baturay Saglam, Dogan C. Cicek, Furkan B. Mutlu, Suleyman S. Kozat
Compared to on-policy counterparts, off-policy model-free deep reinforcement learning can improve data efficiency by repeatedly using the previously gathered data.
1 code implementation • 27 Jul 2022 • Baturay Saglam, Dogan C. Cicek, Furkan B. Mutlu, Suleyman S. Kozat
Learning in high dimensional continuous tasks is challenging, mainly when the experience replay memory is very limited.
no code implementations • 25 Mar 2022 • Mustafa E. Aydın, Suleyman S. Kozat
We investigate nonlinear prediction in an online setting and introduce a hybrid model that effectively mitigates, via an end-to-end architecture, the need for hand-designed features and manual model selection issues of conventional nonlinear prediction/regression methods.
no code implementations • 12 Nov 2021 • Dogan C. Cicek, Enes Duran, Baturay Saglam, Kagan Kaya, Furkan B. Mutlu, Suleyman S. Kozat
We show through continuous control environments of OpenAI gym that our algorithm matches or outperforms the state-of-the-art off-policy policy gradient learning algorithms.
no code implementations • 2 Nov 2021 • Dogan C. Cicek, Enes Duran, Baturay Saglam, Furkan B. Mutlu, Suleyman S. Kozat
In addition, experience replay stores the transitions are generated by the previous policies of the agent that may significantly deviate from the most recent policy of the agent.
1 code implementation • 22 Sep 2021 • Baturay Saglam, Enes Duran, Dogan C. Cicek, Furkan B. Mutlu, Suleyman S. Kozat
We show that in deep actor-critic methods that aim to overcome the overestimation bias, if the reinforcement signals received by the agent have a high variance, a significant underestimation bias arises.
1 code implementation • 5 Sep 2020 • Selim F. Yilmaz, Suleyman S. Kozat
PySAD is an open-source python framework for anomaly detection on streaming data.
1 code implementation • 26 Aug 2020 • Selim F. Yilmaz, E. Batuhan Kaynak, Aykut Koç, Hamdi Dibeklioğlu, Suleyman S. Kozat
We investigate cross-lingual sentiment analysis, which has attracted significant attention due to its applications in various areas including market research, politics and social sciences.
no code implementations • 25 Jun 2020 • Oguzhan Karaahmetoglu, Suleyman S. Kozat
We study the spatio-temporal prediction problem and introduce a novel point-process-based prediction algorithm.
no code implementations • 22 May 2020 • S. Onur Sahin, Suleyman S. Kozat
In our architecture, we employ a variable number of LSTM networks, which use only the existing inputs in the sequence, in a tree-like architecture without any statistical assumptions or imputations on the missing data, unlike all the previous approaches.
no code implementations • 16 May 2020 • N. Mert Vural, Fatih Ilhan, Selim F. Yilmaz, Salih Ergüt, Suleyman S. Kozat
Recurrent Neural Networks (RNNs) are widely used for online regression due to their ability to generalize nonlinear temporal dependencies.
1 code implementation • 12 May 2020 • Selim F. Yilmaz, Suleyman S. Kozat
We investigate unsupervised anomaly detection for high-dimensional data and introduce a deep metric learning (DML) based framework.
no code implementations • 7 Mar 2020 • N. Mert Vural, Selim F. Yilmaz, Fatih Ilhan, Suleyman S. Kozat
We investigate online nonlinear regression with continually running recurrent neural network networks (RNNs), i. e., RNN-based online learning.
no code implementations • 25 Nov 2019 • Nuri Mert Vural, Fatih Ilhan, Suleyman S. Kozat
We investigate the convergence and stability properties of the decoupled extended Kalman filter learning algorithm (DEKF) within the long-short term memory network (LSTM) based online learning framework.
no code implementations • 25 Nov 2019 • N. Mert Vural, Hakan Gokcesu, Kaan Gokcesu, Suleyman S. Kozat
To construct our algorithm, we introduce a new expert advice algorithm for the multiple-play setting.
1 code implementation • 22 Oct 2019 • N. Mert Vural, Salih Ergüt, Suleyman S. Kozat
We study adaptive (or online) nonlinear regression with Long-Short-Term-Memory (LSTM) based networks, i. e., LSTM-based adaptive learning.
no code implementations • 30 Jun 2019 • Hakan Gokcesu, Suleyman S. Kozat
Our approach can compete against all comparator sequences simultaneously (universally) in a minimax optimal manner, i. e. each regret guarantee depends on the respective comparator path variation.
no code implementations • 19 Apr 2019 • Hakan Gokcesu, Suleyman S. Kozat
We specifically study scenarios where our sub-gradient observations can be noisy or even completely missing in a stochastic manner.
no code implementations • 18 Jan 2017 • Burak C. Civek, Ibrahim Delibalta, Suleyman S. Kozat
We introduce highly efficient online nonlinear regression algorithms that are suitable for real life applications.
no code implementations • 5 Dec 2016 • Mohammadreza Mohaghegh Neyshabouri, Kaan Gokcesu, Huseyin Ozkan, Suleyman S. Kozat
Therefore, we design our algorithms based on the optimal adaptive combination and asymptotically achieve the performance of the best mapping as well as the best arm selection policy.
no code implementations • 3 Oct 2016 • Muhammed O. Sayin, Suleyman S. Kozat, Tamer Başar
Finally, in the numerical examples, we demonstrate the superior performance of the introduced algorithms in the finite-horizon MSE sense due to optimal estimation.
no code implementations • 30 Sep 2014 • Huseyin Ozkan, Ozgun S. Pelvan, Suleyman S. Kozat
We introduce a comprehensive and statistical framework in a model free setting for a complete treatment of localized data corruptions due to severe noise sources, e. g., an occluder in the case of a visual recording.
no code implementations • 23 Jan 2014 • N. Denizcan Vanli, Muhammed O. Sayin, Suleyman S. Kozat
We study nonlinear regression of real valued data in an individual sequence manner, where we provide results that are guaranteed to hold without any statistical assumptions.
no code implementations • 26 Nov 2013 • Muhammed O. Sayin, N. Denizcan Vanli, Suleyman S. Kozat
We introduce important members of this family of algorithms such as the least mean logarithmic square (LMLS) and least logarithmic absolute difference (LLAD) algorithms that improve the convergence performance of the conventional algorithms.
no code implementations • 25 Nov 2013 • N. Denizcan Vanli, Suleyman S. Kozat
We first introduce the lower bounds on this relative performance in the mixture of experts framework, where we show that for any sequential algorithm, there always exists a sequence for which the performance of the sequential algorithm is lower bounded by zero.
no code implementations • 25 Nov 2013 • N. Denizcan Vanli, Suleyman S. Kozat
In this paper, we investigate adaptive nonlinear regression and introduce tree based piecewise linear regression algorithms that are highly efficient and provide significantly improved performance with guaranteed upper bounds in an individual sequence manner.