Search Results for author: Devin Schumacher

Found 2 papers, 2 papers with code

Enhancing Suno's Bark Text-to-Speech Model: Addressing Limitations Through Meta's Encodec and Pre-Trained Hubert

2 code implementations Social Science Research Network (SSRN) 2023 Devin Schumacher, Francis LaBounty Jr.

Keywords: Bark, ai voice cloning, Suno, text-to-speech, artificial intelligence, audio generation, Meta's encodec, audio codebooks, semantic tokens, HuBert, transformer-based model, multilingual speech, wav2vec, linear projection head, embedding space, generative capabilities, pretrained model checkpoints

Expressive Speech Synthesis Text-To-Speech Synthesis +1

V3CTRON | Data Retrieval & Access System For Flexible Semantic Search & Retrieval Of Proprietary Document Collections Using Natural Language Queries.

3 code implementations Social Science Research Network (SSRN) 2023 Devin Schumacher

V3CTRON is an open source vector database that allows users to upload text based documents & document collections, which are automatically embedded for super-accurate semantic search & retrieval using natural language queries.

Conversational Search Information Retrieval +2

Cannot find the paper you are looking for? You can Submit a new open access paper.