Search Results for author: Marius-Constantin Dinu

Found 6 papers, 6 papers with code

SymbolicAI: A framework for logic-based approaches combining generative models and solvers

2 code implementations • 1 Feb 2024 • Marius-Constantin Dinu, Claudiu Leoveanu-Condrei, Markus Holzleitner, Werner Zellinger, Sepp Hochreiter

We conclude by introducing a quality measure and its empirical score for evaluating these computational graphs, and propose a benchmark that compares various state-of-the-art LLMs across a set of complex workflows.

Few-Shot Learning Probabilistic Programming

875

Paper
Code

Addressing Parameter Choice Issues in Unsupervised Domain Adaptation by Aggregation

1 code implementation • 2 May 2023 • Marius-Constantin Dinu, Markus Holzleitner, Maximilian Beck, Hoan Duc Nguyen, Andrea Huber, Hamid Eghbal-zadeh, Bernhard A. Moser, Sergei Pereverzyev, Sepp Hochreiter, Werner Zellinger

Our method outperforms deep embedded validation (DEV) and importance weighted validation (IWV) on all datasets, setting a new state-of-the-art performance for solving parameter choice issues in unsupervised domain adaptation with theoretical error guarantees.

Unsupervised Domain Adaptation

Paper
Code

Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning

1 code implementation • 12 Jul 2022 • Christian Steinparz, Thomas Schmied, Fabian Paischer, Marius-Constantin Dinu, Vihang Patil, Angela Bitto-Nemling, Hamid Eghbal-zadeh, Sepp Hochreiter

Therefore, exploration strategies and learning methods are required that are capable of tracking the steady domain shifts, and adapting to them.

Policy Gradient Methods Q-Learning +2

Paper
Code

The balancing principle for parameter choice in distance-regularized domain adaptation

1 code implementation • NeurIPS 2021 • Werner Zellinger, Natalia Shepeleva, Marius-Constantin Dinu, Hamid Eghbal-zadeh, Hoan Nguyen, Bernhard Nessler, Sergei Pereverzyev, Bernhard A. Moser

Our approach starts with the observation that the widely-used method of minimizing the source error, penalized by a distance measure between source and target feature representations, shares characteristics with regularized ill-posed inverse problems.

Unsupervised Domain Adaptation

Paper
Code

A Dataset Perspective on Offline Reinforcement Learning

2 code implementations • 8 Nov 2021 • Kajetan Schweighofer, Andreas Radler, Marius-Constantin Dinu, Markus Hofmarcher, Vihang Patil, Angela Bitto-Nemling, Hamid Eghbal-zadeh, Sepp Hochreiter

The dataset characteristics are determined by the behavioral policy that samples this dataset.

Offline RL reinforcement-learning +1

Paper
Code

Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution

1 code implementation • 29 Sep 2020 • Vihang P. Patil, Markus Hofmarcher, Marius-Constantin Dinu, Matthias Dorfer, Patrick M. Blies, Johannes Brandstetter, Jose A. Arjona-Medina, Sepp Hochreiter

For such complex tasks, the recently proposed RUDDER uses reward redistribution to leverage steps in the Q-function that are associated with accomplishing sub-tasks.

General Reinforcement Learning Multiple Sequence Alignment +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.