Search Results for author: Bhaskarjit Sarmah

Found 7 papers, 0 papers with code

A Comparative Study of DSPy Teleprompter Algorithms for Aligning Large Language Models Evaluation Metrics to Human Evaluation

no code implementations19 Dec 2024 Bhaskarjit Sarmah, Kriti Dutta, Anna Grigoryan, Sachin Tiwari, Stefano Pasquali, Dhagash Mehta

We argue that the Declarative Self-improving Python (DSPy) optimizers are a way to align the large language model (LLM) prompts and their evaluations to the human annotations.

Hallucination Language Modeling +2

How to Choose a Threshold for an Evaluation Metric for Large Language Models

no code implementations10 Dec 2024 Bhaskarjit Sarmah, Mingshu Li, Jingrao Lyu, Sebastian Frank, Nathalia Castellanos, Stefano Pasquali, Dhagash Mehta

We then propose concrete and statistically rigorous procedures to determine a threshold for the given LLM evaluation metric using available ground-truth data.

HybridRAG: Integrating Knowledge Graphs and Vector Retrieval Augmented Generation for Efficient Information Extraction

no code implementations9 Aug 2024 Bhaskarjit Sarmah, Benika Hall, Rohan Rao, Sunil Patel, Stefano Pasquali, Dhagash Mehta

Extraction and interpretation of intricate information from unstructured text data arising in financial applications, such as earnings call transcripts, present substantial challenges to large language models (LLMs) even using the current best practices to use Retrieval Augmented Generation (RAG) (referred to as VectorRAG techniques which utilize vector databases for information retrieval) due to challenges such as domain specific terminology and complex formats of the documents.

Answer Generation Information Retrieval +3

Quantile Regression using Random Forest Proximities

no code implementations5 Aug 2024 Mingshu Li, Bhaskarjit Sarmah, Dhruv Desai, Joshua Rosaler, Snigdha Bhagat, Philip Sommer, Dhagash Mehta

Recently, quantile regression forests (QRF) have emerged as a promising solution: Unlike most basic quantile regression methods that need separate models for each quantile, quantile regression forests estimate the entire conditional distribution of the target variable with a single model, while retaining all the salient features of a typical random forest.

Prediction Intervals quantile regression

Enhanced Local Explainability and Trust Scores with Random Forest Proximities

no code implementations19 Oct 2023 Joshua Rosaler, Dhruv Desai, Bhaskarjit Sarmah, Dimitrios Vamvourellis, Deran Onay, Dhagash Mehta, Stefano Pasquali

We initiate a novel approach to explain the predictions and out of sample performance of random forest (RF) regression and classification models by exploiting the fact that any RF can be mathematically formulated as an adaptive weighted K nearest-neighbors model.

Prediction regression

Towards reducing hallucination in extracting information from financial reports using Large Language Models

no code implementations16 Oct 2023 Bhaskarjit Sarmah, Tianjie Zhu, Dhagash Mehta, Stefano Pasquali

For a financial analyst, the question and answer (Q\&A) segment of the company financial report is a crucial piece of information for various analysis and investment decisions.

Hallucination Optical Character Recognition +2

Learning Embedded Representation of the Stock Correlation Matrix using Graph Machine Learning

no code implementations14 Jul 2022 Bhaskarjit Sarmah, Nayana Nair, Dhagash Mehta, Stefano Pasquali

In particular, the algorithm compresses the network into a lower dimensional continuous space, called an embedding, where pairs of nodes that are identified as similar by the algorithm are placed closer to each other.

BIG-bench Machine Learning Management

Cannot find the paper you are looking for? You can Submit a new open access paper.