Search Results

Assessing Language Model Deployment with Risk Cards

3 code implementations31 Mar 2023

However, there is no risk-centric framework for documenting the complexity of a landscape in which some risks are shared across models and contexts, while others are specific, and where certain conditions may be required for risks to manifest as harms.

Language Modeling Language Modelling +2

On the Opportunities and Risks of Foundation Models

2 code implementations16 Aug 2021

AI is undergoing a paradigm shift with the rise of models (e. g., BERT, DALL-E, GPT-3) that are trained on broad data at scale and are adaptable to a wide range of downstream tasks.

Transfer Learning

Deduplicating Training Data Mitigates Privacy Risks in Language Models

3 code implementations14 Feb 2022

Past work has shown that large language models are susceptible to privacy attacks, where adversaries generate sequences from a trained model and detect which sequences are memorized from the training set.

Probabilistic Traversability Model for Risk-Aware Motion Planning in Off-Road Environments

1 code implementation1 Oct 2022

A key challenge in off-road navigation is that even visually similar terrains or ones from the same semantic class may have substantially different traction properties.

Robotics Systems and Control Systems and Control

A hierarchical spatio-temporal model to analyze relative risk variations of COVID-19: a focus on Spain, Italy and Germany

1 code implementation28 Sep 2020

The novel coronavirus disease (COVID-19) has spread rapidly across the world in a short period of time and with a heterogeneous pattern.

Applications

ML-Doctor: Holistic Risk Assessment of Inference Attacks Against Machine Learning Models

1 code implementation4 Feb 2021

As a result, we lack a comprehensive picture of the risks caused by the attacks, e. g., the different scenarios they can be applied to, the common factors that influence their performance, the relationship among them, or the effectiveness of possible defenses.

Attribute BIG-bench Machine Learning +3

Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risks of Language Models

3 code implementations15 Aug 2024

To evaluate agent capabilities, we construct a cybersecurity agent and evaluate 8 models: GPT-4o, OpenAI o1-preview, Claude 3 Opus, Claude 3. 5 Sonnet, Mixtral 8x22b Instruct, Gemini 1. 5 Pro, Llama 3 70B Chat, and Llama 3. 1 405B Instruct.

Ranked #3 on on Cybench

Risk-mediated dynamic regulation of effective contacts de-synchronizes outbreaks in metapopulation epidemic models

1 code implementation20 Feb 2025

In the event of an epidemic, an important research question is, to what degree spatial information (i. e., regional or national) is relevant for mitigation and (local) policymakers.

Privacy Risks of Securing Machine Learning Models against Adversarial Examples

1 code implementation24 May 2019

To perform the membership inference attacks, we leverage the existing inference methods that exploit model predictions.

Adversarial Defense BIG-bench Machine Learning +1

KoSBi: A Dataset for Mitigating Social Bias Risks Towards Safer Large Language Model Application

1 code implementation28 May 2023

Large language models (LLMs) learn not only natural text generation abilities but also social biases against different demographic groups from real-world data.

Language Modeling Language Modelling +2