1 code implementation • 14 Sep 2023 • Gašper Beguš, Thomas Lu, Alan Zhou, Peter Wu, Gopala K. Anumanchipalli
This paper introduces CiwaGAN, a model of human spoken language acquisition that combines unsupervised articulatory modeling with an unsupervised model of information exchange through the auditory modality.
no code implementations • 12 Jun 2023 • Maksymilian Dąbkowski, Gašper Beguš
Recursion is one of the hallmarks of human language.
no code implementations • 2 May 2023 • Gašper Beguš, Thomas Lu, Zili Wang
Computational models of syntax are predominantly text-based.
no code implementations • 1 May 2023 • Gašper Beguš, Maksymilian Dąbkowski, Ryan Rhodes
We show, for the first time, that the models can also generate coherent and valid formal analyses of linguistic data, illustrating the vast potential of large language models for the analysis of their metalinguistic abilities.
1 code implementation • 25 Apr 2023 • Russell A. Poldrack, Thomas Lu, Gašper Beguš
We report several experiments using GPT-4 to generate computer code.
1 code implementation • 20 Mar 2023 • Gašper Beguš, Andrej Leban, Shane Gero
This paper suggests that interpreting the outputs of deep neural networks with causal inference methodology can be a viable strategy for approaching data about which little is known, and presents another case of how deep learning can limit the hypothesis space.
1 code implementation • 27 Oct 2022 • Gašper Beguš, Alan Zhou, Peter Wu, Gopala K. Anumanchipalli
Articulatory analysis suggests that the network learns to control articulators in a similar manner to humans during speech production.
no code implementations • 22 Mar 2022 • Gašper Beguš, Alan Zhou
Here, we test how encoding and decoding of lexical semantic information can emerge automatically from raw speech in unsupervised generative deep convolutional networks that combine the production and perception principles of speech.
no code implementations • 5 Oct 2021 • Gašper Beguš, Alan Zhou
We propose a technique to visualize individual convolutional layers in the classifier that yields highly informative time-series data for each convolutional layer and apply it to unobserved test data.
no code implementations • 19 Apr 2021 • Gašper Beguš, Alan Zhou
This technique allows for acoustic analysis of intermediate layers that parallels the acoustic analysis of human speech data: we can extract F0, intensity, duration, formants, and other acoustic properties from intermediate layers in order to test where and how CNNs encode various types of information.
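The idea of running standard acoustic measurements on intermediate-layer outputs can be sketched in plain NumPy: treat one channel of a convolutional layer's output as a waveform-like time series and estimate F0 (via autocorrelation) and intensity (via RMS). This is a minimal illustrative sketch, not the paper's actual pipeline; the function names, parameters, and the synthetic sinusoid standing in for a layer output are all assumptions.

```python
import numpy as np

def estimate_f0_autocorr(signal, sr, fmin=60.0, fmax=400.0):
    """Estimate fundamental frequency of a 1-D signal via autocorrelation."""
    signal = signal - signal.mean()
    ac = np.correlate(signal, signal, mode="full")[len(signal) - 1:]
    lag_min = int(sr / fmax)                      # shortest period considered
    lag_max = min(int(sr / fmin), len(ac) - 1)    # longest period considered
    if lag_max <= lag_min:
        return None
    peak = lag_min + np.argmax(ac[lag_min:lag_max + 1])
    return sr / peak

def rms_intensity(signal):
    """Root-mean-square amplitude as a simple proxy for intensity."""
    return float(np.sqrt(np.mean(np.square(signal))))

# A synthetic "intermediate layer" time series: a 220 Hz sinusoid at 16 kHz,
# standing in (hypothetically) for one channel of a convolutional layer's output.
sr = 16000
t = np.arange(sr) / sr
layer_output = 0.5 * np.sin(2 * np.pi * 220.0 * t)

f0 = estimate_f0_autocorr(layer_output, sr)
intensity = rms_intensity(layer_output)
print(f0)         # close to 220 Hz
print(intensity)  # close to 0.5 / sqrt(2)
```

The same measurements could be applied per channel and per layer to test where pitch- and amplitude-like information is encoded; formant estimation would additionally require an LPC step, omitted here for brevity.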
no code implementations • 17 Apr 2021 • Jacob Andreas, Gašper Beguš, Michael M. Bronstein, Roee Diamant, Denley Delaney, Shane Gero, Shafi Goldwasser, David F. Gruber, Sarah de Haas, Peter Malkin, Roger Payne, Giovanni Petri, Daniela Rus, Pratyusha Sharma, Dan Tchernov, Pernille Tønnesen, Antonio Torralba, Daniel Vogt, Robert J. Wood
We posit that machine learning will be the cornerstone of future collection, processing, and analysis of multimodal streams of data in animal communication studies, including bioacoustic, behavioral, biological, and environmental data.
no code implementations • 10 Nov 2020 • Gašper Beguš
This paper proposes a framework for modeling sound change that combines deep learning and iterative learning.
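The iterative-learning side of such a framework can be sketched as a loop in which each generation learns from the previous generation's outputs, with small production noise allowing the distribution to drift over time. In this toy sketch the deep generative model is abstracted away to a simple frequency learner; the `train`/`generate` interface and all parameters are hypothetical placeholders, not the paper's implementation.

```python
import random

def train(data):
    """'Learn' a production probability from observed tokens (1 = innovative variant)."""
    return sum(data) / len(data)

def generate(p, n, noise=0.02, rng=random):
    """Produce n tokens; small production noise nudges the rate between generations."""
    p = min(1.0, max(0.0, p + rng.uniform(-noise, noise)))
    return [1 if rng.random() < p else 0 for _ in range(n)]

rng = random.Random(0)
data = generate(0.1, 500, rng=rng)   # generation 0 output
rates = []
for generation in range(10):         # each generation learns from the last
    p = train(data)
    rates.append(p)
    data = generate(p, 500, rng=rng)

print(rates)  # the variant's rate drifts across generations
```

In the full framework each `train`/`generate` pair would be a deep generative model trained on raw speech rather than a frequency count, but the generational transmission loop has the same shape.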
no code implementations • 27 Sep 2020 • Gašper Beguš
The proposed technique for retrieving learning representations has general implications for our understanding of how GANs discretize continuous speech data and suggests that rule-like generalizations in the training data are represented as an interaction between variables in the network's latent space.
1 code implementation • 13 Sep 2020 • Gašper Beguš
This paper models unsupervised learning of an identity-based pattern (or copying) in speech called reduplication from raw continuous data with deep convolutional neural networks.
no code implementations • 6 Jun 2020 • Gašper Beguš
A Generative Adversarial Network was trained on an allophonic distribution in English.
1 code implementation • 4 Jun 2020 • Gašper Beguš
The networks trained on lexical items from TIMIT learn to encode unique information corresponding to lexical items in the form of categorical variables in their latent space.