Search Results for author: Michał Koziarski

Found 18 papers, 11 papers with code

Learning Decision Trees as Amortized Structure Inference

1 code implementation10 Mar 2025 Mohammed Mahfoud, Ghait Boukachab, Michał Koziarski, Alex Hernandez-Garcia, Stefan Bauer, Yoshua Bengio, Nikolay Malkin

Building predictive models for tabular data presents fundamental challenges, notably in scaling consistently, i. e., more resources translating to better performance, and generalizing systematically beyond the training data distribution.

Anomaly Detection Deep Reinforcement Learning

Action abstractions for amortized sampling

no code implementations19 Oct 2024 Oussama Boussif, Léna Néhale Ezzine, Joseph D Viviano, Michał Koziarski, Moksh Jain, Nikolay Malkin, Emmanuel Bengio, Rim Assouel, Yoshua Bengio

As trajectories sampled by policies used by reinforcement learning (RL) and generative flow networks (GFlowNets) grow longer, credit assignment and exploration become more challenging, and the long planning horizon hinders mode discovery and generalization.

Chunking Reinforcement Learning (RL)

Cell Morphology-Guided Small Molecule Generation with GFlowNets

1 code implementation9 Aug 2024 Stephen Zhewen Lu, Ziqing Lu, Ehsan Hajiramezanali, Tommaso Biancalani, Yoshua Bengio, Gabriele Scalia, Michał Koziarski

We consider an alternative approach in which we leverage an unsupervised multimodal joint embedding to define a latent similarity as a reward for GFlowNets.

Drug Discovery

RetroGFN: Diverse and Feasible Retrosynthesis using GFlowNets

2 code implementations26 Jun 2024 Piotr Gaiński, Michał Koziarski, Krzysztof Maziarz, Marwin Segler, Jacek Tabor, Marek Śmieja

Single-step retrosynthesis aims to predict a set of reactions that lead to the creation of a target molecule, which is a crucial task in molecular discovery.

Retrosynthesis Single-step retrosynthesis

RGFN: Synthesizable Molecular Generation Using GFlowNets

1 code implementation1 Jun 2024 Michał Koziarski, Andrei Rekesh, Dmytro Shevchuk, Almer van der Sloot, Piotr Gaiński, Yoshua Bengio, Cheng-Hao Liu, Mike Tyers, Robert A. Batey

Generative models hold great promise for small molecule discovery, significantly increasing the size of search space compared to traditional in silico screening libraries.

Towards equilibrium molecular conformation generation with GFlowNets

no code implementations20 Oct 2023 Alexandra Volokhova, Michał Koziarski, Alex Hernández-García, Cheng-Hao Liu, Santiago Miret, Pablo Lemos, Luca Thiede, Zichao Yan, Alán Aspuru-Guzik, Yoshua Bengio

Sampling diverse, thermodynamically feasible molecular conformations plays a crucial role in predicting properties of a molecule.

ChiENN: Embracing Molecular Chirality with Graph Neural Networks

1 code implementation5 Jul 2023 Piotr Gaiński, Michał Koziarski, Jacek Tabor, Marek Śmieja

Graph Neural Networks (GNNs) play a fundamental role in many deep learning problems, in particular in cheminformatics.

Drug Discovery Molecular Property Prediction +1

Imbalanced data preprocessing techniques utilizing local data characteristics

no code implementations28 Nov 2021 Michał Koziarski

The focus of this thesis is development of novel data resampling strategies natively utilizing the information about the distribution of both minority and majority class.

RB-CCR: Radial-Based Combined Cleaning and Resampling algorithm for imbalanced data classification

1 code implementation9 May 2021 Michał Koziarski, Colin Bellinger, Michał Woźniak

Our $5\times2$ cross-validated results on 57 benchmark binary datasets with 9 classifiers show that RB-CCR achieves a better precision-recall trade-off than CCR and generally out-performs the state-of-the-art resampling methods in terms of AUC and G-mean.

General Classification

Potential Anchoring for imbalanced data classification

1 code implementation17 Apr 2021 Michał Koziarski

The results of the experiments conducted on 60 imbalanced datasets show outperformance of Potential Anchoring over state-of-the-art resampling algorithms, including previously proposed methods that utilize radial basis functions to model class potential.

Classification General Classification

CSMOUTE: Combined Synthetic Oversampling and Undersampling Technique for Imbalanced Data Classification

no code implementations7 Apr 2020 Michał Koziarski

Furthermore, we combine both in the Combined Synthetic Oversampling and Undersampling Technique (CSMOUTE), which integrates SMOTE oversampling with SMUTE undersampling.

General Classification

Combined Cleaning and Resampling Algorithm for Multi-Class Imbalanced Data with Label Noise

no code implementations7 Apr 2020 Michał Koziarski, Michał Woźniak, Bartosz Krawczyk

The proposed method utilizes an energy-based approach to modeling the regions suitable for oversampling, less affected by small disjuncts and outliers than SMOTE.

Binary Classification General Classification

Radial-Based Undersampling for Imbalanced Data Classification

1 code implementation2 Jun 2019 Michał Koziarski

Aforementioned difficulty factors can also limit the applicability of some of the methods of dealing with data imbalance, in particular the neighborhood-based oversampling algorithms based on SMOTE.

Classification General Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.