Search Results for author: Ahmed Imtiaz Humayun

Found 20 papers, 13 papers with code

Deep Networks Always Grok and Here is Why

no code implementations23 Feb 2024 Ahmed Imtiaz Humayun, Randall Balestriero, Richard Baraniuk

Our local complexity measures the density of the so-called 'linear regions' (aka, spline partition regions) that tile the DNN input space, and serves as a utile progress measure for training.

Training Dynamics of Deep Network Linear Regions

no code implementations19 Oct 2023 Ahmed Imtiaz Humayun, Randall Balestriero, Richard Baraniuk

First, we present a novel statistic that encompasses the local complexity (LC) of the DN based on the concentration of linear regions inside arbitrary dimensional neighborhoods around data points.

Memorization

Self-Consuming Generative Models Go MAD

no code implementations4 Jul 2023 Sina AlEMohammad, Josue Casco-Rodriguez, Lorenzo Luzi, Ahmed Imtiaz Humayun, Hossein Babaei, Daniel LeJeune, Ali Siahkoohi, Richard G. Baraniuk

Seismic advances in generative AI algorithms for imagery, text, and other data types has led to the temptation to use synthetic data to train next-generation models.

Abugida Normalizer and Parser for Unicode texts

1 code implementation11 May 2023 Nazmuddoha Ansary, Quazi Adibur Rahman Adib, Tahsin Reasat, Sazia Mehnaz, Asif Shahriyar Sushmit, Ahmed Imtiaz Humayun, Mohammad Mamun Or Rashid, Farig Sadeque

This paper proposes two libraries to address common and uncommon issues with Unicode-based writing schemes for Indic languages.

Language Modelling

BaDLAD: A Large Multi-Domain Bengali Document Layout Analysis Dataset

1 code implementation9 Mar 2023 Md. Istiak Hossain Shihab, Md. Rakibul Hasan, Mahfuzur Rahman Emon, Syed Mobassir Hossen, MD. Nazmuddoha Ansary, Intesur Ahmed, Fazle Rabbi Rakib, Shahriar Elahi Dhruvo, Souhardya Saha Dip, Akib Hasan Pavel, Marsia Haque Meghla, Md. Rezwanul Haque, Sayma Sultana Chowdhury, Farig Sadeque, Tahsin Reasat, Ahmed Imtiaz Humayun, Asif Shahriyar Sushmit

While strides have been made in deep learning based Bengali Optical Character Recognition (OCR) in the past decade, the absence of large Document Layout Analysis (DLA) datasets has hindered the application of OCR in document transcription, e. g., transcribing historical documents and newspapers.

Benchmarking Document Layout Analysis +2

SplineCam: Exact Visualization and Characterization of Deep Network Geometry and Decision Boundaries

1 code implementation CVPR 2023 Ahmed Imtiaz Humayun, Randall Balestriero, Guha Balakrishnan, Richard Baraniuk

In this paper, we go one step further by developing the first provably exact method for computing the geometry of a DN's mapping - including its decision boundary - over a specified region of the data space.

No More Than 6ft Apart: Robust K-Means via Radius Upper Bounds

1 code implementation4 Mar 2022 Ahmed Imtiaz Humayun, Randall Balestriero, Anastasios Kyrillidis, Richard Baraniuk

We propose to remedy such a scenario by introducing a maximal radius constraint $r$ on the clusters formed by the centroids, i. e., samples from the same cluster should not be more than $2r$ apart in terms of $\ell_2$ distance.

Clustering

Polarity Sampling: Quality and Diversity Control of Pre-Trained Generative Networks via Singular Values

1 code implementation CVPR 2022 Ahmed Imtiaz Humayun, Randall Balestriero, Richard Baraniuk

We present Polarity Sampling, a theoretically justified plug-and-play method for controlling the generation quality and diversity of pre-trained deep generative networks DGNs).

Image Generation Unconditional Image Generation

MaGNET: Uniform Sampling from Deep Generative Network Manifolds Without Retraining

1 code implementation ICLR 2022 Ahmed Imtiaz Humayun, Randall Balestriero, Richard Baraniuk

Deep Generative Networks (DGNs) are extensively employed in Generative Adversarial Networks (GANs), Variational Autoencoders (VAEs), and their variants to approximate the data manifold and distribution.

Data Augmentation Domain Adaptation +2

A Large Multi-Target Dataset of Common Bengali Handwritten Graphemes

2 code implementations1 Oct 2020 Samiul Alam, Tahsin Reasat, Asif Shahriyar Sushmit, Sadi Mohammad Siddiquee, Fuad Rahman, Mahady Hasan, Ahmed Imtiaz Humayun

We propose a labeling scheme based on graphemes (linguistic segments of word formation) that makes segmentation in-side alpha-syllabary words linear and present the first dataset of Bengali handwritten graphemes that are commonly used in an everyday context.

Multi-Label Classification Optical Character Recognition +1

Towards Domain Invariant Heart Sound Abnormality Detection using Learnable Filterbanks

1 code implementation29 Sep 2019 Ahmed Imtiaz Humayun, Shabnam Ghaffarzadegan, Md. Istiaq Ansari, Zhe Feng, Taufiq Hasan

Cardiac auscultation is the most practiced non-invasive and cost-effective procedure for the early diagnosis of heart diseases.

Signal Processing

X-Ray Image Compression Using Convolutional Recurrent Neural Networks

no code implementations28 Apr 2019 Asif Shahriyar Sushmit, Shakib Uz Zaman, Ahmed Imtiaz Humayun, Taufiq Hasan, Mohammed Imamul Hassan Bhuiyan

To the best of our knowledge, this is the first reported evaluation on using a deep convolutional RNN for medical image compression.

Image Compression Retrieval +2

End-to-end Sleep Staging with Raw Single Channel EEG using Deep Residual ConvNets

1 code implementation23 Apr 2019 Ahmed Imtiaz Humayun, Asif Shahriyar Sushmit, Taufiq Hasan, Mohammed Imamul Hassan Bhuiyan

The experimental results demonstrate the superiority of the proposed network compared to the best existing method, providing a relative improvement in epoch-wise average accuracy of 6. 8% and 6. 3% on the household data and multi-source data, respectively.

EEG Sleep Staging

An Ensemble of Transfer, Semi-supervised and Supervised Learning Methods for Pathological Heart Sound Classification

1 code implementation18 Jun 2018 Ahmed Imtiaz Humayun, Md. Tauhiduzzaman Khan, Shabnam Ghaffarzadegan, Zhe Feng, Taufiq Hasan

In this work, we propose an ensemble of classifiers to distinguish between various degrees of abnormalities of the heart using Phonocardiogram (PCG) signals acquired using digital stethoscopes in a clinical setting, for the INTERSPEECH 2018 Computational Paralinguistics (ComParE) Heart Beats SubChallenge.

General Classification Representation Learning +1

Learning Front-end Filter-bank Parameters using Convolutional Neural Networks for Abnormal Heart Sound Detection

1 code implementation15 Jun 2018 Ahmed Imtiaz Humayun, Shabnam Ghaffarzadegan, Zhe Feng, Taufiq Hasan

In this work, we propound a novel CNN architecture that integrates the front-end bandpass filters within the network using time-convolution (tConv) layers, which enables the FIR filter-bank parameters to become learnable.

Anomaly Detection

NumtaDB - Assembled Bengali Handwritten Digits

2 code implementations6 Jun 2018 Samiul Alam, Tahsin Reasat, Rashed Mohammad Doha, Ahmed Imtiaz Humayun

To benchmark Bengali digit recognition algorithms, a large publicly available dataset is required which is free from biases originating from geographical location, gender, and age.

Cannot find the paper you are looking for? You can Submit a new open access paper.