Search Results for author: Jason Li

Found 20 papers, 11 papers with code

Exploring Embeddings for Measuring Text Relatedness: Unveiling Sentiments and Relationships in Online Comments

no code implementations15 Sep 2023 Anthony Olakangil, Cindy Wang, Justin Nguyen, Qunbo Zhou, Kaavya Jethwa, Jason Li, Aryan Narendra, Nishk Patel, Arjun Rajaram

This paper investigates sentiment and semantic relationships among comments across various social media platforms, as well as discusses the importance of shared opinions across these different media platforms, using word embeddings to analyze components in sentences and documents.

Word Embeddings

zkDL: Efficient Zero-Knowledge Proofs of Deep Learning Training

1 code implementation30 Jul 2023 Haochen Sun, Tonghe Bai, Jason Li, Hongyang Zhang

In response to this challenge, we present zero-knowledge deep learning (zkDL), an efficient zero-knowledge proof for deep learning training.

NutritionVerse-Thin: An Optimized Strategy for Enabling Improved Rendering of 3D Thin Food Models

no code implementations12 Apr 2023 Chi-en Amy Tai, Jason Li, Sriram Kumar, Saeejith Nair, Yuhao Chen, Pengcheng Xi, Alexander Wong

With the growth in capabilities of generative models, there has been growing interest in using photo-realistic renders of common 3D food items to improve downstream tasks such as food printing, nutrition prediction, or management of food wastage.

Management Nutrition

Modeling Human Eye Movements with Neural Networks in a Maze-Solving Task

1 code implementation20 Dec 2022 Jason Li, Nicholas Watters, Yingting, Wang, Hansem Sohn, Mehrdad Jazayeri

This not only provides a generative model of eye movements in this task but also suggests a computational theory for how humans solve the task, namely that humans use mental simulation.

Adapting TTS models For New Speakers using Transfer Learning

no code implementations12 Oct 2021 Paarth Neekhara, Jason Li, Boris Ginsburg

We address this challenge by proposing transfer-learning guidelines for adapting high quality single-speaker TTS models for a new speaker, using only a few minutes of speech data.

Transfer Learning Voice Cloning

Offensive Language and Hate Speech Detection with Deep Learning and Transfer Learning

no code implementations6 Aug 2021 Bencheng Wei, Jason Li, Ajay Gupta, Hafiza Umair, Atsu Vovor, Natalie Durzynski

Differentiating if a text message belongs to hate speech and offensive language is a key challenge in automatic detection of toxic text content.

Data Augmentation Hate Speech Detection +4

A Lightweight Algorithm to Uncover Deep Relationships in Data Tables

no code implementations7 Sep 2020 Jin Cao, Yibo Zhao, Linjun Zhang, Jason Li

The key to our approach is a computationally lightweight forward addition algorithm that we developed to recursively extract the functional dependencies between table columns that are scalable to tables with many columns.

Cycle Text-To-Image GAN with BERT

4 code implementations26 Mar 2020 Trevor Tsue, Samir Sen, Jason Li

We explore novel approaches to the task of image generation from their respective captions, building on state-of-the-art GAN architectures.

Image Generation Word Embeddings

UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation

2 code implementations15 Feb 2020 Huaishao Luo, Lei Ji, Botian Shi, Haoyang Huang, Nan Duan, Tianrui Li, Jason Li, Taroon Bharti, Ming Zhou

However, most of the existing multimodal models are pre-trained for understanding tasks, leading to a pretrain-finetune discrepancy for generation tasks.

Ranked #2 on Action Segmentation on COIN (using extra training data)

Action Segmentation Language Modelling +2

Mellotron: Multispeaker expressive voice synthesis by conditioning on rhythm, pitch and global style tokens

4 code implementations26 Oct 2019 Rafael Valle, Jason Li, Ryan Prenger, Bryan Catanzaro

Mellotron is a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data.

Style Transfer

Jasper: An End-to-End Convolutional Neural Acoustic Model

10 code implementations5 Apr 2019 Jason Li, Vitaly Lavrukhin, Boris Ginsburg, Ryan Leary, Oleksii Kuchaiev, Jonathan M. Cohen, Huyen Nguyen, Ravi Teja Gadde

In this paper, we report state-of-the-art results on LibriSpeech among end-to-end speech recognition models without any external training data.

Language Modelling Speech Recognition

Training Neural Speech Recognition Systems with Synthetic Speech Augmentation

no code implementations2 Nov 2018 Jason Li, Ravi Gadde, Boris Ginsburg, Vitaly Lavrukhin

Building an accurate automatic speech recognition (ASR) system requires a large dataset that contains many hours of labeled speech samples produced by a diverse set of speakers.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Cannot find the paper you are looking for? You can Submit a new open access paper.