Search Results

Analyzing the Effectiveness of Large Language Models on Text-to-SQL Synthesis

1 code implementation22 Jan 2024

This study investigates various approaches to using Large Language Models (LLMs) for Text-to-SQL program synthesis, focusing on the outcomes and insights derived.

16k SQL Synthesis +1

Natural SQL: Making SQL Easier to Infer from Natural Language Specifications

3 code implementations Findings (EMNLP) 2021

Addressing the mismatch between natural language descriptions and the corresponding SQL queries is a key challenge for text-to-SQL translation.

Text-To-SQL Translation

An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models

1 code implementation18 Sep 2023

We find that scaling LMM consistently enhances model performance and improves language capabilities, and performance of LoRA/QLoRA tuning of LMM are comparable to the performance of full-model fine-tuning.

Visual Question Answering

FinGPT: Democratizing Internet-scale Data for Financial Large Language Models

1 code implementation19 Jul 2023

In light of this, we aim to democratize Internet-scale financial data for LLMs, which is an open challenge due to diverse data sources, low signal-to-noise ratio, and high time-validity.

Algorithmic Trading Sentiment Analysis

Extending Llama-3's Context Ten-Fold Overnight

1 code implementation30 Apr 2024

We extend the context length of Llama-3-8B-Instruct from 8K to 80K via QLoRA fine-tuning.

8k Retrieval

Chinese-Vicuna: A Chinese Instruction-following Llama-based Model

1 code implementation17 Apr 2025

Chinese-Vicuna is an open-source, resource-efficient language model designed to bridge the gap in Chinese instruction-following capabilities by fine-tuning Meta's LLaMA architecture using Low-Rank Adaptation (LoRA).

Code Generation Instruction Following +3

Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients

2 code implementations11 Jul 2024

To address these limitations, we introduce Q-Galore, a novel approach that substantially reduces memory usage by combining quantization and low-rank projection, surpassing the benefits of GaLore.

Quantization

PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models

1 code implementation3 Apr 2024

PiSSA shares the same architecture as LoRA, but initializes the adaptor matrices $A$ and $B$ with the principal components of the original matrix $W$, and put the remaining components into a residual matrix $W^{res} \in \mathbb{R}^{m \times n}$ which is frozen during fine-tuning.

GSM8K Quantization

CoLLaVO: Crayon Large Language and Vision mOdel

1 code implementation17 Feb 2024

Our findings reveal that the image understanding capabilities of current VLMs are strongly correlated with their zero-shot performance on vision language (VL) tasks.

Large Language Model model +4

Spectrum: Targeted Training on Signal to Noise Ratio

1 code implementation7 Jun 2024

Efficiently post-training large language models remains a challenging task due to the vast computational resources required.