Search Results for author: Orna Raz

Found 16 papers, 1 papers with code

Can You Trust Your Metric? Automatic Concatenation-Based Tests for Metric Validity

no code implementations22 Aug 2024 Ora Nova Fandina, Leshem Choshen, Eitan Farchi, George Kour, Yotam Perlitz, Orna Raz

We applied these tests in a model safety scenario to assess the reliability of harmfulness detection metrics, uncovering a number of inconsistencies.

Language Modelling Large Language Model +1

Generating Unseen Code Tests In Infinitum

no code implementations29 Jul 2024 Marcel Zalmanovici, Orna Raz, Eitan Farchi, Iftach Freund

Large Language Models (LLMs) are used for many tasks, including those related to coding.

Code Generation regression +1

Using Combinatorial Optimization to Design a High quality LLM Solution

no code implementations15 May 2024 Samuel Ackerman, Eitan Farchi, Rami Katan, Orna Raz

Next, a set of interactions between the factors are defined and combinatorial optimization is used to create a small subset $P$ that ensures all desired interactions occur in $P$.

AutoML Combinatorial Optimization

Predicting Question-Answering Performance of Large Language Models through Semantic Consistency

no code implementations2 Nov 2023 Ella Rabinovich, Samuel Ackerman, Orna Raz, Eitan Farchi, Ateret Anaby-Tavor

Semantic consistency of a language model is broadly defined as the model's ability to produce semantically-equivalent outputs, given semantically-equivalent inputs.

Language Modelling Question Answering

Automatic Generation of Attention Rules For Containment of Machine Learning Model Errors

no code implementations14 May 2023 Samuel Ackerman, Axel Bendavid, Eitan Farchi, Orna Raz

The approach we propose is to separate the observations that are the most likely to be predicted incorrectly into 'attention sets'.

Decision Making

Measuring the Measuring Tools: An Automatic Evaluation of Semantic Metrics for Text Corpora

2 code implementations29 Nov 2022 George Kour, Samuel Ackerman, Orna Raz, Eitan Farchi, Boaz Carmeli, Ateret Anaby-Tavor

The ability to compare the semantic similarity between text corpora is important in a variety of natural language processing applications.

Semantic Similarity Semantic Textual Similarity

Theory and Practice of Quality Assurance for Machine Learning Systems An Experiment Driven Approach

no code implementations2 Jan 2022 Samuel Ackerman, Guy Barash, Eitan Farchi, Orna Raz, Onn Shehory

The crafting of machine learning (ML) based systems requires statistical control throughout its life cycle.

Classifier Data Quality: A Geometric Complexity Based Method for Automated Baseline And Insights Generation

no code implementations22 Dec 2021 George Kour, Marcel Zalmanovici, Orna Raz, Samuel Ackerman, Ateret Anaby-Tavor

Testing Machine Learning (ML) models and AI-Infused Applications (AIIAs), or systems that contain ML models, is highly challenging.

Chatbot

Automatically detecting data drift in machine learning classifiers

no code implementations10 Nov 2021 Samuel Ackerman, Orna Raz, Marcel Zalmanovici, Aviad Zlotnick

The assumption underlying statistical ML resulting in theoretical or empirical performance guarantees is that the distribution of the training data is representative of the production data distribution.

BIG-bench Machine Learning

Detecting model drift using polynomial relations

no code implementations24 Oct 2021 Eliran Roffe, Samuel Ackerman, Orna Raz, Eitan Farchi

We thus use a set of learned strong polynomial relations to identify drift.

Relation

Density-based interpretable hypercube region partitioning for mixed numeric and categorical data

no code implementations11 Oct 2021 Samuel Ackerman, Eitan Farchi, Orna Raz, Marcel Zalmanovici, Maya Zohar

A user may want to know where in the feature space observations are concentrated, and where it is sparse or empty.

Causal Inference

FreaAI: Automated extraction of data slices to test machine learning models

no code implementations12 Aug 2021 Samuel Ackerman, Orna Raz, Marcel Zalmanovici

In this paper we show the feasibility of automatically extracting feature models that result in explainable data slices over which the ML solution under-performs.

BIG-bench Machine Learning

Detection of data drift and outliers affecting machine learning model performance over time

no code implementations16 Dec 2020 Samuel Ackerman, Eitan Farchi, Orna Raz, Marcel Zalmanovici, Parijat Dube

Drift is distribution change between the training and deployment data, which is concerning if model performance changes.

BIG-bench Machine Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.