Search Results for author: Nicolae-Catalin Ristea

Found 20 papers, 14 papers with code

Cascaded Cross-Modal Transformer for Audio-Textual Classification

1 code implementation15 Jan 2024 Nicolae-Catalin Ristea, Andrei Anghel, Radu Tudor Ionescu

Subsequently, we combine language-specific Bidirectional Encoder Representations from Transformers (BERT) with Wav2Vec2. 0 audio features via a novel cascaded cross-modal transformer (CCMT).

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

ICASSP 2023 Acoustic Echo Cancellation Challenge

1 code implementation22 Sep 2023 Ross Cutler, Ando Saabas, Tanel Parnamaa, Marju Purin, Evgenii Indenbom, Nicolae-Catalin Ristea, Jegor Gužvin, Hannes Gamper, Sebastian Braun, Robert Aichner

This is the fourth AEC challenge and it is enhanced by adding a second track for personalized acoustic echo cancellation, reducing the algorithmic + buffering latency to 20ms, as well as including a full-band version of AECMOS.

Acoustic echo cancellation Speech Enhancement

CL-MAE: Curriculum-Learned Masked Autoencoders

1 code implementation31 Aug 2023 Neelu Madan, Nicolae-Catalin Ristea, Kamal Nasrollahi, Thomas B. Moeslund, Radu Tudor Ionescu

In this paper, we propose a curriculum learning approach that updates the masking strategy to continually increase the complexity of the self-supervised reconstruction task.

Representation Learning

Cascaded Cross-Modal Transformer for Request and Complaint Detection

no code implementations27 Jul 2023 Nicolae-Catalin Ristea, Radu Tudor Ionescu

We propose a novel cascaded cross-modal transformer (CCMT) that combines speech and text transcripts to detect customer requests and complaints in phone conversations.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Sea Ice Segmentation From SAR Data by Convolutional Transformer Networks

no code implementations13 Jun 2023 Nicolae-Catalin Ristea, Andrei Anghel, Mihai Datcu

Sea ice is a crucial component of the Earth's climate system and is highly sensitive to changes in temperature and atmospheric conditions.

Lightning Fast Video Anomaly Detection via Adversarial Knowledge Distillation

1 code implementation28 Nov 2022 Florinel-Alin Croitoru, Nicolae-Catalin Ristea, Dana Dascalescu, Radu Tudor Ionescu, Fahad Shahbaz Khan, Mubarak Shah

We propose a very fast frame-level model for anomaly detection in video, which learns to detect anomalies by distilling knowledge from multiple highly accurate object-level teacher models.

Anomaly Detection Knowledge Distillation +1

Self-Supervised Masked Convolutional Transformer Block for Anomaly Detection

1 code implementation25 Sep 2022 Neelu Madan, Nicolae-Catalin Ristea, Radu Tudor Ionescu, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, Mubarak Shah

In this work, we extend our previous self-supervised predictive convolutional attentive block (SSPCAB) with a 3D masked convolutional layer, a transformer for channel-wise attention, as well as a novel self-supervised objective based on Huber loss.

Event Detection Fault Detection +1

Learning Rate Curriculum

1 code implementation18 May 2022 Florinel-Alin Croitoru, Nicolae-Catalin Ristea, Radu Tudor Ionescu, Nicu Sebe

In this work, we propose a novel curriculum learning approach termed Learning Rate Curriculum (LeRaC), which leverages the use of a different learning rate for each layer of a neural network to create a data-agnostic curriculum during the initial training epochs.

Audio Classification QNLI +2

Guided deep learning by subaperture decomposition: ocean patterns from SAR imagery

no code implementations9 Apr 2022 Nicolae-Catalin Ristea, Andrei Anghel, Mihai Datcu, Bertrand Chapron

Overall, we encourage the development of data centring approaches, showing that, data preprocessing could bring significant performance improvements over existing deep learning models.

Multimodal Multi-Head Convolutional Attention with Various Kernel Sizes for Medical Image Super-Resolution

1 code implementation8 Apr 2022 Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Andreea-Iuliana Miron, Olivian Savencu, Nicolae-Catalin Ristea, Nicolae Verga, Fahad Shahbaz Khan

Our attention module uses the convolution operation to perform joint spatial-channel attention on multiple concatenated input tensors, where the kernel (receptive field) size controls the reduction rate of the spatial attention, and the number of convolutional filters controls the reduction rate of the channel attention, respectively.

Computed Tomography (CT) Image Super-Resolution

SepTr: Separable Transformer for Audio Spectrogram Processing

1 code implementation17 Mar 2022 Nicolae-Catalin Ristea, Radu Tudor Ionescu, Fahad Shahbaz Khan

Following the successful application of vision transformers in multiple computer vision tasks, these models have drawn the attention of the signal processing community.

Audio Classification Speech Emotion Recognition +1

Self-paced ensemble learning for speech and audio classification

no code implementations22 Mar 2021 Nicolae-Catalin Ristea, Radu Tudor Ionescu

Instead of just combining the models, we propose a self-paced ensemble learning scheme in which models learn from each other over several iterations.

Audio Classification Ensemble Learning +2

Emotion Recognition System from Speech and Visual Information based on Convolutional Neural Networks

no code implementations29 Feb 2020 Nicolae-Catalin Ristea, Liviu Cristian Dutu, Anamaria Radoi

In order to increase the accuracy of the recognition system, we analyze also the speech data and fuse the information coming from both sources, i. e., visual and audio.

Emotion Recognition

Non-linear Neurons with Human-like Apical Dendrite Activations

1 code implementation2 Feb 2020 Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Nicolae-Catalin Ristea, Nicu Sebe

In order to classify linearly non-separable data, neurons are typically organized into multi-layer neural networks that are equipped with at least one hidden layer.

Speech Emotion Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.