Search Results for author: Juliusz Ziomek

Found 6 papers, 1 papers with code

Beyond Lengthscales: No-regret Bayesian Optimisation With Unknown Hyperparameters Of Any Type

no code implementations • 2 Feb 2024 • Juliusz Ziomek, Masaki Adachi, Michael A. Osborne

Previously proposed algorithms with the no-regret property were only able to handle the special case of unknown lengthscales, reproducing kernel Hilbert space norm and applied only to the frequentist case.

Bayesian Optimisation

Paper
Add Code

Why Can Large Language Models Generate Correct Chain-of-Thoughts?

no code implementations • 20 Oct 2023 • Rasul Tutunov, Antoine Grosnit, Juliusz Ziomek, Jun Wang, Haitham Bou-Ammar

This paper delves into the capabilities of large language models (LLMs), specifically focusing on advancing the theoretical comprehension of chain-of-thought prompting.

Text Generation

Paper
Add Code

Are Random Decompositions all we need in High Dimensional Bayesian Optimisation?

2 code implementations • 30 Jan 2023 • Juliusz Ziomek, Haitham Bou-Ammar

Learning decompositions of expensive-to-evaluate black-box functions promises to scale Bayesian optimisation (BO) to high-dimensional problems.

Bayesian Optimisation

2,938

Paper
Code

Taming Multi-Agent Reinforcement Learning with Estimator Variance Reduction

no code implementations • 2 Sep 2022 • Taher Jafferjee, Juliusz Ziomek, Tianpei Yang, Zipeng Dai, Jianhong Wang, Matthew Taylor, Kun Shao, Jun Wang, David Mguni

Centralised training with decentralised execution (CT-DE) serves as the foundation of many leading multi-agent reinforcement learning (MARL) algorithms.

Multi-agent Reinforcement Learning reinforcement-learning +3

Paper
Add Code

Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints

no code implementations • 31 May 2022 • David Mguni, Aivar Sootla, Juliusz Ziomek, Oliver Slumbers, Zipeng Dai, Kun Shao, Jun Wang

In this paper, we introduce a reinforcement learning (RL) framework named \textbf{L}earnable \textbf{I}mpulse \textbf{C}ontrol \textbf{R}einforcement \textbf{A}lgorithm (LICRA), for learning to optimally select both when to act and which actions to take when actions incur costs.

Reinforcement Learning (RL)

Paper
Add Code

Modelling nonlinear dependencies in the latent space of inverse scattering

no code implementations • 19 Mar 2022 • Juliusz Ziomek, Katayoun Farrahi

The conducted meta-analysis also shows a clear practical advantage of such constructed generative models in terms of the efficiency of their training process compared to existing generative models for images.

Generative Adversarial Network

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.