Search Results for author: Md Rifat Arefin

Found 12 papers, 8 papers with code

Layer by Layer: Uncovering Hidden Representations in Language Models

no code implementations4 Feb 2025 Oscar Skean, Md Rifat Arefin, Dan Zhao, Niket Patel, Jalal Naghiyev, Yann Lecun, Ravid Shwartz-Ziv

From extracting features to generating text, the outputs of large language models (LLMs) typically rely on their final layers, following the conventional wisdom that earlier layers capture only low-level cues.

State Space Models

Does Representation Matter? Exploring Intermediate Layers in Large Language Models

no code implementations12 Dec 2024 Oscar Skean, Md Rifat Arefin, Yann Lecun, Ravid Shwartz-Ziv

Understanding what defines a good representation in large language models (LLMs) is fundamental to both theoretical understanding and practical applications.

State Space Models

Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning

1 code implementation4 Nov 2024 Md Rifat Arefin, Gopeshh Subbaraj, Nicolas Gontier, Yann Lecun, Irina Rish, Ravid Shwartz-Ziv, Christopher Pal

To address this, we propose Sequential Variance-Covariance Regularization (Seq-VCR), which enhances the entropy of intermediate representations and prevents collapse.

Arithmetic Reasoning Decoder

VFA: Vision Frequency Analysis of Foundation Models and Human

1 code implementation9 Sep 2024 Mohammad-Javad Darvishi-Bayazi, Md Rifat Arefin, Jocelyn Faubert, Irina Rish

Machine learning models often struggle with distribution shifts in real-world scenarios, whereas humans exhibit robust adaptation.

Models Alignment Out-of-Distribution Generalization

Unsupervised Concept Discovery Mitigates Spurious Correlations

1 code implementation20 Feb 2024 Md Rifat Arefin, Yan Zhang, Aristide Baratin, Francesco Locatello, Irina Rish, Dianbo Liu, Kenji Kawaguchi

Models prone to spurious correlations in training data often produce brittle predictions and introduce unintended biases.

Representation Learning

Amplifying Pathological Detection in EEG Signaling Pathways through Cross-Dataset Transfer Learning

2 code implementations19 Sep 2023 Mohammad-Javad Darvishi-Bayazi, Mohammad Sajjad Ghaemi, Timothee Lesort, Md Rifat Arefin, Jocelyn Faubert, Irina Rish

We see improvement in the performance of the target model on the target (NMT) datasets by using the knowledge from the source dataset (TUAB) when a low amount of labelled data was available.

EEG NMT +1

Challenging Common Assumptions about Catastrophic Forgetting

no code implementations10 Jul 2022 Timothée Lesort, Oleksiy Ostapenko, Diganta Misra, Md Rifat Arefin, Pau Rodríguez, Laurent Charlin, Irina Rish

In this paper, we study the progressive knowledge accumulation (KA) in DNNs trained with gradient-based algorithms in long sequences of tasks with data re-occurrence.

Continual Learning Memorization

Continual Learning with Foundation Models: An Empirical Study of Latent Replay

1 code implementation30 Apr 2022 Oleksiy Ostapenko, Timothee Lesort, Pau Rodríguez, Md Rifat Arefin, Arthur Douillard, Irina Rish, Laurent Charlin

Motivated by this, we study the efficacy of pre-trained vision models as a foundation for downstream continual learning (CL) scenarios.

Benchmarking Continual Learning

A Statistical Real-Time Prediction Model for Recommender System

no code implementations1 Dec 2020 Md Rifat Arefin, Minhas Kamal, Kishan Kumar Ganguly, Tarek Salah Uddin Mahmud

Recommender system has become an inseparable part of online shopping and its usability is increasing with the advancement of these e-commerce sites.

Prediction Recommendation Systems

Cannot find the paper you are looking for? You can Submit a new open access paper.