Search Results for author: Sageev Oore

Found 20 papers, 7 papers with code

Detecting Out-of-Distribution Examples with Gram Matrices

no code implementations ICML 2020 Chandramouli Shama Sastry, Sageev Oore

We find that characterizing activity patterns by Gram matrices and identifying anomalies in Gram matrix values can yield high OOD detection rates.

Out of Distribution (OOD) Detection

VISLA Benchmark: Evaluating Embedding Sensitivity to Semantic and Lexical Alterations

no code implementations25 Apr 2024 Sri Harsha Dumpala, Aman Jaiswal, Chandramouli Sastry, Evangelos Milios, Sageev Oore, Hassan Sajjad

This paper introduces the VISLA (Variance and Invariance to Semantic and Lexical Alterations) benchmark, designed to evaluate the semantic and lexical understanding of language models.

Test-Time Training for Depression Detection

no code implementations7 Apr 2024 Sri Harsha Dumpala, Chandramouli Shama Sastry, Rudolf Uher, Sageev Oore

Previous works on depression detection use datasets collected in similar environments to train and test the models.

Depression Detection

Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion

1 code implementation22 Feb 2024 Yujia Huang, Adishree Ghatare, Yuanzhe Liu, Ziniu Hu, Qinsheng Zhang, Chandramouli S Sastry, Siddharth Gururani, Sageev Oore, Yisong Yue

We propose Stochastic Control Guidance (SCG), a novel guidance method that only requires forward evaluation of rule functions that can work with pre-trained diffusion models in a plug-and-play way, thus achieving training-free guidance for non-differentiable rules for the first time.

Music Generation

SynthScribe: Deep Multimodal Tools for Synthesizer Sound Retrieval and Exploration

no code implementations7 Dec 2023 Stephen Brade, Bryan Wang, Mauricio Sousa, Gregory Lee Newsome, Sageev Oore, Tovi Grossman

This is achieved with three main features: a multimodal search engine for a large library of synthesizer sounds; a user centered genetic algorithm by which completely new sounds can be created and selected given the users preferences; a sound editing support feature which highlights and gives examples for key control parameters with respect to a text or audio based query.

Multimodal Deep Learning Retrieval

Test-Time Training for Speech

no code implementations19 Sep 2023 Sri Harsha Dumpala, Chandramouli Sastry, Sageev Oore

In this paper, we study the application of Test-Time Training (TTT) as a solution to handling distribution shifts in speech applications.

Speaker Identification

Training Diffusion Classifiers with Denoising Assistance

no code implementations15 Jun 2023 Chandramouli Sastry, Sri Harsha Dumpala, Sageev Oore

Score-matching and diffusion models have emerged as state-of-the-art generative models for both conditional and unconditional generation.

Denoising

Promptify: Text-to-Image Generation through Interactive Prompt Exploration with Large Language Models

no code implementations18 Apr 2023 Stephen Brade, Bryan Wang, Mauricio Sousa, Sageev Oore, Tovi Grossman

Text-to-image generative models have demonstrated remarkable capabilities in generating high-quality images based on textual prompts.

Text-to-Image Generation

Echofilter: A Deep Learning Segmentation Model Improves the Automation, Standardization, and Timeliness for Post-Processing Echosounder Data in Tidal Energy Streams

1 code implementation19 Feb 2022 Scott C. Lowe, Louise P. McGarry, Jessica Douglas, Jason Newport, Sageev Oore, Christopher Whidden, Daniel J. Hasselman

Application of a single conventional algorithm to identify the depth-of-penetration of entrained air is insufficient for a boundary that is discontinuous, depth-dynamic, porous, and varies with tidal flow speed.

LogAvgExp Provides a Principled and Performant Global Pooling Operator

no code implementations2 Nov 2021 Scott C. Lowe, Thomas Trappenberg, Sageev Oore

We seek to improve the pooling operation in neural networks, by applying a more theoretically justified operator.

Logical Activation Functions: Logit-space equivalents of Probabilistic Boolean Operators

no code implementations22 Oct 2021 Scott C. Lowe, Robert Earle, Jason d'Eon, Thomas Trappenberg, Sageev Oore

Consequently, we construct efficient approximations named $\text{AND}_\text{AIL}$ (the AND operator Approximate for Independent Logits), $\text{OR}_\text{AIL}$, and $\text{XNOR}_\text{AIL}$, which utilize only comparison and addition operations, have well-behaved gradients, and can be deployed as activation functions in neural networks.

Compositional Zero-Shot Learning Image Classification +1

Musical Speech: A Transformer-based Composition Tool

no code implementations2 Aug 2021 Jason d'Eon, Sri Harsha Dumpala, Chandramouli Shama Sastry, Dani Oore, Sageev Oore

In this paper, we propose a new compositional tool that will generate a musical outline of speech recorded/provided by the user for use as a musical building block in their compositions.

Significance of Speaker Embeddings and Temporal Context for Depression Detection

no code implementations24 Jul 2021 Sri Harsha Dumpala, Sebastian Rodriguez, Sheri Rempel, Rudolf Uher, Sageev Oore

In this work, we analyze the significance of speaker embeddings for the task of depression detection from speech.

Depression Detection

Zero-Shot Out-of-Distribution Detection with Feature Correlations

1 code implementation25 Sep 2019 Chandramouli S Sastry, Sageev Oore

We find that characterizing activity patterns by feature correlations and identifying anomalies in pairwise feature correlation values can yield high OOD detection rates.

Feature Correlation Out-of-Distribution Detection +1

Exploring Conditioning for Generative Music Systems with Human-Interpretable Controls

no code implementations9 Jul 2019 Nicholas Meade, Nicholas Barreyre, Scott C. Lowe, Sageev Oore

Performance RNN is a machine-learning system designed primarily for the generation of solo piano performances using an event-based (rather than audio) representation.

Language Modelling

TimbreTron: A WaveNet(CycleGAN(CQT(Audio))) Pipeline for Musical Timbre Transfer

1 code implementation ICLR 2019 Sicong Huang, Qiyang Li, Cem Anil, Xuchan Bao, Sageev Oore, Roger B. Grosse

In this work, we address the problem of musical timbre transfer, where the goal is to manipulate the timbre of a sound sample from one instrument to match another instrument while preserving other musical content, such as pitch, rhythm, and loudness.

Style Transfer

This Time with Feeling: Learning Expressive Musical Performance

5 code implementations10 Aug 2018 Sageev Oore, Ian Simon, Sander Dieleman, Douglas Eck, Karen Simonyan

Music generation has generally been focused on either creating scores or interpreting them.

Music Generation

Onsets and Frames: Dual-Objective Piano Transcription

1 code implementation30 Oct 2017 Curtis Hawthorne, Erich Elsen, Jialin Song, Adam Roberts, Ian Simon, Colin Raffel, Jesse Engel, Sageev Oore, Douglas Eck

We advance the state of the art in polyphonic piano music transcription by using a deep convolutional and recurrent neural network which is trained to jointly predict onsets and frames.

Music Transcription

Cannot find the paper you are looking for? You can Submit a new open access paper.