Search Results for author: Fei Jia

Found 8 papers, 4 papers with code

RULER: What's the Real Context Size of Your Long-Context Language Models?

1 code implementation9 Apr 2024 Cheng-Ping Hsieh, Simeng Sun, Samuel Kriman, Shantanu Acharya, Dima Rekesh, Fei Jia, Yang Zhang, Boris Ginsburg

Despite achieving nearly perfect accuracy in the vanilla NIAH test, all models exhibit large performance drops as the context length increases.

Long-Context Understanding

OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset

1 code implementation15 Feb 2024 Shubham Toshniwal, Ivan Moshkov, Sean Narenthiran, Daria Gitman, Fei Jia, Igor Gitman

Building on the recent progress in open-source LLMs, our proposed prompting novelty, and some brute-force scaling, we construct OpenMathInstruct-1, a math instruction tuning dataset with 1. 8M problem-solution pairs.

 Ranked #1 on Math Word Problem Solving on MAWPS (using extra training data)

Arithmetic Reasoning GSM8K +2

Accidental Learners: Spoken Language Identification in Multilingual Self-Supervised Models

no code implementations9 Nov 2022 Travis M. Bartley, Fei Jia, Krishna C. Puvvada, Samuel Kriman, Boris Ginsburg

In this paper, we extend previous self-supervised approaches for language identification by experimenting with Conformer based architecture in a multilingual pre-training paradigm.

Language Identification Spoken language identification

A Compact End-to-End Model with Local and Global Context for Spoken Language Identification

no code implementations27 Oct 2022 Fei Jia, Nithin Rao Koluguri, Jagadeesh Balam, Boris Ginsburg

We introduce TitaNet-LID, a compact end-to-end neural network for Spoken Language Identification (LID) that is based on the ContextNet architecture.

Language Identification Spoken language identification

Lessons from the AdKDD'21 Privacy-Preserving ML Challenge

no code implementations31 Jan 2022 Eustache Diemert, Romain Fabre, Alexandre Gilotte, Fei Jia, Basile Leparmentier, Jérémie Mary, Zhonghua Qu, Ugo Tanielian, Hui Yang

Designing data sharing mechanisms providing performance and strong privacy guarantees is a hot topic for the Online Advertising industry.

Privacy Preserving

Cannot find the paper you are looking for? You can Submit a new open access paper.