2 code implementations • 2 Oct 2024 • Leo Feng, Frederick Tung, Mohamed Osama Ahmed, Yoshua Bengio, Hossein Hajimirsadeghi
The introduction of Transformers in 2017 reshaped the landscape of deep learning.
1 code implementation • 22 May 2024 • Leo Feng, Frederick Tung, Hossein Hajimirsadeghi, Mohamed Osama Ahmed, Yoshua Bengio, Greg Mori
To address this, we introduce a new, efficient method of computing attention's many-to-many RNN output based on the parallel prefix scan algorithm (a minimal sketch follows this entry).
Ranked #66 on Time Series Forecasting on ETTh1 (336) Multivariate
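As a hedged illustration of the prefix-scan view mentioned in this entry, the sketch below computes attention outputs over every prefix of a sequence using numerically stable running statistics; the function name and the single-query setup are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def prefix_attention(q, K, V):
    """q: (d,) query; K: (N, d) keys; V: (N, d_v) values.
    Returns (N, d_v): the attention output over every prefix of the sequence."""
    scores = K @ q                        # s_i = <k_i, q>
    m = -np.inf                           # running max score (numerical stability)
    u = 0.0                               # running sum of exp(s_i - m)
    w = np.zeros(V.shape[1])              # running sum of exp(s_i - m) * v_i
    outs = np.zeros((len(V), V.shape[1]))
    for i, s in enumerate(scores):
        m_new = max(m, s)
        scale = np.exp(m - m_new)         # rescale old statistics to the new max
        u = u * scale + np.exp(s - m_new)
        w = w * scale + np.exp(s - m_new) * V[i]
        m = m_new
        outs[i] = w / u                   # softmax-weighted average of v_1..v_i
    return outs
```

Because combining the (max, normaliser, weighted-sum) statistics is associative, the same per-prefix outputs can be produced with a parallel prefix scan in logarithmic depth rather than this sequential loop, which is the efficiency the entry refers to.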
1 code implementation • 29 Sep 2023 • Leo Feng, Frederick Tung, Hossein Hajimirsadeghi, Yoshua Bengio, Mohamed Osama Ahmed
In this work, we propose Tree Cross Attention (TCA), a module based on Cross Attention that retrieves information from only a logarithmic, O(log N), number of tokens when performing inference.
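A rough sketch of the general retrieval idea, not TCA itself: build a balanced binary tree over the tokens, descend from the root keeping one node summary per level, and attend only over those O(log N) summaries. Mean aggregation and greedy dot-product descent below are illustrative stand-ins for the paper's learned aggregation and retrieval policy.

```python
import numpy as np

def build_tree(tokens):
    """tokens: (N, d) array. Returns a list of levels; level 0 is the leaves,
    the last level is the root, and node i aggregates children 2i and 2i+1."""
    levels = [tokens]
    while len(levels[-1]) > 1:
        cur = levels[-1]
        nxt = [(cur[i] + cur[i + 1]) / 2 if i + 1 < len(cur) else cur[i]
               for i in range(0, len(cur), 2)]
        levels.append(np.stack(nxt))
    return levels

def tree_retrieve(query, levels):
    """Greedily descend from the root to a leaf, keeping the sibling summary at
    each level, so only O(log N) vectors are returned for cross-attention."""
    selected, idx = [], 0
    for level in reversed(levels[:-1]):           # walk from just below the root to the leaves
        left, right = 2 * idx, 2 * idx + 1
        if right >= len(level) or level[left] @ query >= level[right] @ query:
            idx, sibling = left, right
        else:
            idx, sibling = right, left
        if sibling < len(level):
            selected.append(level[sibling])       # summary of the unexplored branch
    selected.append(levels[0][idx])               # plus the chosen leaf token
    return np.stack(selected)
```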
no code implementations • 21 Jun 2023 • Leo Feng, Frederick Tung, Hossein Hajimirsadeghi, Yoshua Bengio, Mohamed Osama Ahmed
Modern foundation model architectures rely on attention mechanisms to effectively capture context.
1 code implementation • 23 May 2023 • Leo Feng, Frederick Tung, Hossein Hajimirsadeghi, Yoshua Bengio, Mohamed Osama Ahmed
Leveraging the update operation, we propose Constant Memory Attention Block (CMAB), a novel attention block that (i) is permutation invariant, (ii) computes its output in constant memory, and (iii) performs constant computation updates.
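A hedged sketch of what constant-memory cross-attention can look like, assuming per-latent running softmax statistics; this illustrates the property, not the paper's CMAB block. Memory stays fixed at the number of latent vectors no matter how much data has been absorbed, and the result does not depend on how the inputs are grouped into batches.

```python
import numpy as np

class ConstantMemoryCrossAttention:
    def __init__(self, latents, d_v):
        self.Q = latents                            # (L, d) fixed latent queries
        self.m = np.full(len(latents), -np.inf)     # running max score per latent
        self.u = np.zeros(len(latents))             # running softmax normaliser
        self.w = np.zeros((len(latents), d_v))      # running weighted value sum

    def update(self, K, V):
        """Absorb a new batch of keys/values; cost is O(L * batch), memory stays O(L)."""
        s = self.Q @ K.T                            # (L, batch) attention scores
        m_new = np.maximum(self.m, s.max(axis=1))
        scale = np.exp(self.m - m_new)              # rescale old statistics to the new max
        self.u = self.u * scale + np.exp(s - m_new[:, None]).sum(axis=1)
        self.w = self.w * scale[:, None] + np.exp(s - m_new[:, None]) @ V
        self.m = m_new

    def read(self):
        return self.w / self.u[:, None]             # (L, d_v) attention output per latent
```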
1 code implementation • 27 Jan 2023 • Wonho Bae, Mohamed Osama Ahmed, Frederick Tung, Gabriel L. Oliveira
In this work, we propose to train TPPs in a meta learning framework, where each sequence is treated as a different task, via a novel framing of TPPs as neural processes (NPs).
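Purely illustrative, assuming a simple prefix split: the meta-learning framing treats each event sequence as its own task, with part of the sequence serving as observed context and the rest as targets. How the paper actually forms context and target sets may differ.

```python
import numpy as np

def sequence_to_task(event_times, context_frac=0.5):
    """Treat one event sequence as a task: earlier events form the observed
    context, later events are the targets the model must predict."""
    event_times = np.sort(np.asarray(event_times))
    n_ctx = int(len(event_times) * context_frac)
    return event_times[:n_ctx], event_times[n_ctx:]
```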
no code implementations • 19 Nov 2022 • Mahmoud Salem, Mohamed Osama Ahmed, Frederick Tung, Gabriel Oliveira
This commonly encountered operational context calls for principled techniques for training ML models with the option to abstain from predicting when uncertain.
1 code implementation • 15 Nov 2022 • Leo Feng, Hossein Hajimirsadeghi, Yoshua Bengio, Mohamed Osama Ahmed
We demonstrate that LBANPs can trade off computational cost and performance according to the number of latent vectors.
1 code implementation • 17 Jun 2022 • Leo Feng, Mohamed Osama Ahmed, Hossein Hajimirsadeghi, Amir Abdi
We tackle the problem of Selective Classification where the objective is to achieve the best performance on a predetermined ratio (coverage) of the dataset.
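A minimal post-hoc sketch of selection at a fixed coverage (not the paper's training method): choose a confidence threshold on held-out data so the model answers on exactly the target fraction of inputs and abstains on the rest.

```python
import numpy as np

def selective_predict(confidences, predictions, coverage=0.8):
    """Keep the most confident `coverage` fraction of predictions; abstain on the rest."""
    threshold = np.quantile(confidences, 1.0 - coverage)
    accept = confidences >= threshold
    return np.where(accept, predictions, -1), accept   # -1 marks abstention
```

The paper's contribution concerns how to train for a predetermined coverage; this snippet only shows the evaluation-time selection step.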
no code implementations • 17 May 2022 • Joao Monteiro, Mohamed Osama Ahmed, Hossein Hajimirsadeghi, Greg Mori
We study settings where gradient penalties are used alongside risk minimization with the goal of obtaining predictors satisfying different notions of monotonicity.
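A hedged sketch of one generic gradient penalty for monotonicity (not necessarily the exact formulations studied here): penalise negative partial derivatives of the prediction along inputs that should be monotonically increasing, and add the penalty to the usual risk term.

```python
import torch

def monotonicity_penalty(model, x, monotone_dims):
    """Mean squared hinge on negative gradients along the monotone input dimensions."""
    x = x.clone().requires_grad_(True)
    y = model(x).sum()
    grads = torch.autograd.grad(y, x, create_graph=True)[0]    # (batch, d)
    violation = torch.relu(-grads[:, monotone_dims])            # > 0 where monotonicity is violated
    return (violation ** 2).mean()

# illustrative total objective: task_loss + lam * monotonicity_penalty(model, x, dims)
```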
no code implementations • 29 Sep 2021 • Joao Monteiro, Mohamed Osama Ahmed, Hossein Hajimirsadeghi, Greg Mori
We study the setting where risk minimization is performed over general classes of models and consider two cases where monotonicity is treated as either a requirement to be satisfied everywhere or a useful property.
no code implementations • 18 Oct 2019 • Nazanin Mehrasa, Ruizhi Deng, Mohamed Osama Ahmed, Bo Chang, JiaWei He, Thibaut Durand, Marcus Brubaker, Greg Mori
Event sequences can be modeled by temporal point processes (TPPs) to capture their asynchronous and probabilistic nature.
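For context, the textbook log-likelihood that TPP models maximise (a standard identity, not specific to this work), with conditional intensity \lambda^*(t) and events t_1 < ... < t_n observed on [0, T]:

```latex
\log p\bigl(\{t_i\}_{i=1}^{n}\bigr)
  = \sum_{i=1}^{n} \log \lambda^{*}(t_i) - \int_{0}^{T} \lambda^{*}(t)\,\mathrm{d}t
```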
no code implementations • 7 Apr 2019 • Ramy Hussein, Mohamed Osama Ahmed, Rabab Ward, Z. Jane Wang, Levin Kuhlmann, Yi Guo
Traditional PCA is not a reliable method for iEEG data reduction in seizure prediction.
no code implementations • 10 Oct 2018 • Mohamed Osama Ahmed, Sharan Vaswani, Mark Schmidt
Indeed, in a particular setting, we prove that using the Lipschitz information yields the same or a better bound on the regret compared to using Bayesian optimization on its own.
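A hedged sketch of the general idea of combining the two (not the paper's exact algorithm): with a known Lipschitz constant L, every candidate satisfies f(x) <= min_i f(x_i) + L ||x - x_i||, so a GP upper confidence bound can be clipped by this deterministic bound before choosing the next evaluation.

```python
import numpy as np

def lipschitz_clipped_ucb(candidates, X_obs, y_obs, gp_mean, gp_std, L, beta=2.0):
    """candidates: (M, d); X_obs, y_obs: past evaluations; gp_mean, gp_std: (M,) GP posterior."""
    ucb = gp_mean + beta * gp_std                                     # usual GP-UCB score
    dists = np.linalg.norm(candidates[:, None, :] - X_obs[None, :, :], axis=-1)   # (M, n)
    lip_upper = np.min(y_obs[None, :] + L * dists, axis=1)            # Lipschitz upper bound on f
    return np.minimum(ucb, lip_upper)                                 # acquisition = tighter of the two
```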
no code implementations • NeurIPS 2015 • Reza Harikandeh, Mohamed Osama Ahmed, Alim Virani, Mark Schmidt, Jakub Konečný, Scott Sallinen
We present and analyze several strategies for improving the performance of stochastic variance-reduced gradient (SVRG) methods.
no code implementations • 5 Nov 2015 • Reza Babanezhad, Mohamed Osama Ahmed, Alim Virani, Mark Schmidt, Jakub Konečný, Scott Sallinen
We present and analyze several strategies for improving the performance of stochastic variance-reduced gradient (SVRG) methods.
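For reference, a textbook SVRG sketch (the generic baseline, not the specific practical strategies analysed in the paper): periodically compute a full gradient at a snapshot and use it as a control variate for the stochastic steps.

```python
import numpy as np

def svrg(grad_i, w0, n, step, epochs=10, inner_steps=None, rng=None):
    """grad_i(w, i): gradient of the i-th loss term; n: number of terms."""
    rng = np.random.default_rng() if rng is None else rng
    inner_steps = n if inner_steps is None else inner_steps
    w = w0.copy()
    for _ in range(epochs):
        w_snap = w.copy()
        full_grad = np.mean([grad_i(w_snap, i) for i in range(n)], axis=0)  # snapshot gradient
        for _ in range(inner_steps):
            i = rng.integers(n)
            # variance-reduced direction: unbiased, with variance shrinking as w -> w_snap
            g = grad_i(w, i) - grad_i(w_snap, i) + full_grad
            w -= step * g
    return w
```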
no code implementations • 16 Apr 2015 • Mark Schmidt, Reza Babanezhad, Mohamed Osama Ahmed, Aaron Defazio, Ann Clifton, Anoop Sarkar
We apply stochastic average gradient (SAG) algorithms for training conditional random fields (CRFs).
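A generic SAG sketch; the per-example gradient oracle would be the gradient of one training sequence's CRF negative log-likelihood, which is omitted here. SAG stores the last gradient seen for every example and steps with the average of the stored gradients.

```python
import numpy as np

def sag(grad_i, w0, n, step, iters, rng=None):
    """grad_i(w, i): gradient of the i-th example's loss; n: number of examples."""
    rng = np.random.default_rng() if rng is None else rng
    w = w0.copy()
    memory = np.zeros((n,) + w.shape)      # last stored gradient per example
    grad_sum = np.zeros_like(w)            # running sum of stored gradients
    for _ in range(iters):
        i = rng.integers(n)
        g = grad_i(w, i)
        grad_sum += g - memory[i]          # replace example i's stored contribution
        memory[i] = g
        w -= step * grad_sum / n           # step with the average stored gradient
    return w
```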