Search Results for author: Dujian Ding

Found 3 papers, 1 papers with code

Hybrid LLM: Cost-Efficient and Quality-Aware Query Routing

no code implementations22 Apr 2024 Dujian Ding, Ankur Mallick, Chi Wang, Robert Sim, Subhabrata Mukherjee, Victor Ruhle, Laks V. S. Lakshmanan, Ahmed Hassan Awadallah

Large language models (LLMs) excel in most NLP tasks but also require expensive cloud servers for deployment due to their size, while smaller models that can be deployed on lower cost (e. g., edge) devices, tend to lag behind in terms of response quality.

LLM Performance Predictors are good initializers for Architecture Search

no code implementations25 Oct 2023 Ganesh Jawahar, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan, Dujian Ding

We show that HS-NAS performs very similar to SOTA NAS across benchmarks, reduces search hours by 50% roughly, and in some cases, improves latency, GFLOPs, and model size.

Machine Translation Neural Architecture Search

On Efficient Approximate Queries over Machine Learning Models

1 code implementation6 Jun 2022 Dujian Ding, Sihem Amer-Yahia, Laks VS Lakshmanan

Alternatively, under the Core Set Closure assumption, we develop two algorithms: CSC that efficiently returns high quality answers with high probability and minimal oracle usage, and CSE, which extends it to more general settings.

BIG-bench Machine Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.