Search Results for author: Pierre Baldi

Found 61 papers, 17 papers with code

Neural Erosion: Emulating Controlled Neurodegeneration and Aging in AI Systems

no code implementations15 Mar 2024 Antonios Alexos, Yu-Dai Tsai, Ian Domingo, Maryam Pishgar, Pierre Baldi

Creating controlled methods to simulate neurodegeneration in artificial intelligence (AI) is crucial for applications that emulate brain function decline and cognitive disorders.

Unraveling the Molecular Magic: AI Insights on the Formation of Extraordinarily Stretchable Hydrogels

no code implementations8 Mar 2024 Shahriar Hojjati Emmami, Ali Pilehvar Meibody, Lobat Tayebi, Mohammadamin Tavakoli, Pierre Baldi

The deliberate manipulation of ammonium persulfate, methylenebisacrylamide, dimethyleacrylamide, and polyethylene oxide concentrations resulted in the development of a hydrogel with an exceptional stretchability, capable of extending up to 260 times its original length.

AttentionStitch: How Attention Solves the Speech Editing Problem

no code implementations5 Mar 2024 Antonios Alexos, Pierre Baldi

In addition to speech generation, speech editing is also a crucial task, which requires the seamless and unnoticeable integration of edited speech into synthesized speech.

Evaluating the Performance of Large Language Models for Spanish Language in Undergraduate Admissions Exams

no code implementations28 Dec 2023 Sabino Miranda, Obdulia Pichardo-Lagunas, Bella Martínez-Seis, Pierre Baldi

This study evaluates the performance of large language models, specifically GPT-3. 5 and BARD (supported by Gemini Pro model), in undergraduate admissions exams proposed by the National Polytechnic Institute in Mexico.

Machine Learning-Enhanced Prediction of Surface Smoothness for Inertial Confinement Fusion Target Polishing Using Limited Data

no code implementations16 Dec 2023 Antonios Alexos, Junze Liu, Akash Tiwari, Kshitij Bhardwaj, Sean Hayes, Pierre Baldi, Satish Bukkapatnam, Suhas Bhandarkar

In Inertial Confinement Fusion (ICF) process, roughly a 2mm spherical shell made of high density carbon is used as target for laser beams, which compress and heat it to energy levels needed for high fusion yield.

Reconstruction of Unstable Heavy Particles Using Deep Symmetry-Preserving Attention Networks

no code implementations5 Sep 2023 Michael James Fenton, Alexander Shmakov, Hideki Okawa, Yuji Li, Ko-Yang Hsiao, Shih-Chieh Hsu, Daniel Whiteson, Pierre Baldi

We explore the performance of the extended capability of Spa-Net in the context of semi-leptonic decays of top quark pairs as well as top quark pairs produced in association with a Higgs boson.

Single Particle Analysis

Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors

no code implementations21 Jul 2023 Kolby Nottingham, Yasaman Razeghi, KyungMin Kim, JB Lanier, Pierre Baldi, Roy Fox, Sameer Singh

Large language models (LLMs) are being applied as actors for sequential decision making tasks in domains such as robotics and games, utilizing their general world knowledge and planning abilities.

Decision Making Language Modelling +2

Generalizing to new geometries with Geometry-Aware Autoregressive Models (GAAMs) for fast calorimeter simulation

no code implementations19 May 2023 Junze Liu, Aishik Ghosh, Dylan Smith, Pierre Baldi, Daniel Whiteson

Generation of simulated detector response to collision products is crucial to data analysis in particle physics, but computationally very expensive.

Language Models can Solve Computer Tasks

1 code implementation NeurIPS 2023 Geunwoo Kim, Pierre Baldi, Stephen Mcaleer

We compare multiple LLMs and find that RCI with the InstructGPT-3+RLHF LLM is state-of-the-art on MiniWoB++, using only a handful of demonstrations per task rather than tens of thousands, and without a task-specific reward function.

Language Modelling Large Language Model +1

Interpretable Joint Event-Particle Reconstruction for Neutrino Physics at NOvA with Sparse CNNs and Transformers

no code implementations10 Mar 2023 Alexander Shmakov, Alejandro Yankelevich, Jianming Bian, Pierre Baldi

TransformerCVN classifies events with 90\% accuracy and improves the reconstruction of individual particles by 6\% over baseline methods which lack the integrated architecture of TransformerCVN.

Geometry-aware Autoregressive Models for Calorimeter Shower Simulations

no code implementations16 Dec 2022 Junze Liu, Aishik Ghosh, Dylan Smith, Pierre Baldi, Daniel Whiteson

Calorimeter shower simulations are often the bottleneck in simulation time for particle physics detectors.

Position

Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments

no code implementations19 Jul 2022 JB Lanier, Stephen Mcaleer, Pierre Baldi, Roy Fox

In this paper, we propose Feasible Adversarial Robust RL (FARR), a novel problem formulation and objective for automatically determining the set of environment parameter values over which to be robust.

reinforcement-learning Reinforcement Learning (RL)

Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games

no code implementations13 Jul 2022 Stephen Mcaleer, JB Lanier, Kevin Wang, Pierre Baldi, Roy Fox, Tuomas Sandholm

Instead of adding only deterministic best responses to the opponent's least exploitable population mixture, SP-PSRO also learns an approximately optimal stochastic policy and adds it to the population as well.

Reinforcement Learning (RL)

Deep Learning Models of the Discrete Component of the Galactic Interstellar Gamma-Ray Emission

no code implementations6 Jun 2022 Alexander Shmakov, Mohammadamin Tavakoli, Pierre Baldi, Christopher M. Karwin, Alex Broughton, Simona Murgia

A significant point-like component from the small scale (or discrete) structure in the H2 interstellar gas might be present in the Fermi-LAT data, but modeling this emission relies on observations of rare gas tracers only available in limited regions of the sky.

The Quarks of Attention

no code implementations15 Feb 2022 Pierre Baldi, Roman Vershynin

The gating mechanisms correspond to multiplicative extensions of the standard model and are used across all current attention-based deep learning architectures.

Anytime PSRO for Two-Player Zero-Sum Games

no code implementations19 Jan 2022 Stephen Mcaleer, Kevin Wang, John Lanier, Marc Lanctot, Pierre Baldi, Tuomas Sandholm, Roy Fox

PSRO is based on the tabular double oracle (DO) method, an algorithm that is guaranteed to converge to a Nash equilibrium, but may increase exploitability from one iteration to the next.

Multi-agent Reinforcement Learning reinforcement-learning +2

Rxn Hypergraph: a Hypergraph Attention Model for Chemical Reaction Representation

no code implementations2 Jan 2022 Mohammadamin Tavakoli, Alexander Shmakov, Francesco Ceccarelli, Pierre Baldi

To achieve such skills, it is important to develop good representations of chemical reactions, or good deep learning architectures that can learn such representations automatically from the data.

Property Prediction

Tourbillon: a Physically Plausible Neural Architecture

no code implementations13 Jul 2021 Mohammadamin Tavakoli, Peter Sadowski, Pierre Baldi

The circular autoencoders are trained in self-supervised mode by recirculation algorithms and the top layer in supervised mode by stochastic gradient descent, with the option of propagating error information through the entire stack using non-symmetric connections.

Improving Social Welfare While Preserving Autonomy via a Pareto Mediator

no code implementations7 Jun 2021 Stephen Mcaleer, John Lanier, Michael Dennis, Pierre Baldi, Roy Fox

Machine learning algorithms often make decisions on behalf of agents with varied and sometimes conflicting interests.

Open-Ended Question Answering

SPANet: Generalized Permutationless Set Assignment for Particle Physics using Symmetry Preserving Attention

1 code implementation7 Jun 2021 Alexander Shmakov, Michael James Fenton, Ta-Wei Ho, Shih-Chieh Hsu, Daniel Whiteson, Pierre Baldi

The creation of unstable heavy particles at the Large Hadron Collider is the most direct way to address some of the deepest open questions in physics.

XDO: A Double Oracle Algorithm for Extensive-Form Games

1 code implementation NeurIPS 2021 Stephen Mcaleer, John Lanier, Kevin Wang, Pierre Baldi, Roy Fox

NXDO is the first deep RL method that can find an approximate Nash equilibrium in high-dimensional continuous-action sequential games.

Reinforcement Learning (RL)

A theory of capacity and sparse neural encoding

no code implementations19 Feb 2021 Pierre Baldi, Roman Vershynin

Motivated by biological considerations, we study sparse neural maps from an input layer to a target layer with sparse activity, and specifically the problem of storing $K$ input-target associations $(x, y)$, or memories, when the target vectors $y$ are sparse.

A* Search Without Expansions: Learning Heuristic Functions with Deep Q-Networks

no code implementations8 Feb 2021 Forest Agostinelli, Alexander Shmakov, Stephen Mcaleer, Roy Fox, Pierre Baldi

We use Q* search to solve the Rubik's cube when formulated with a large action space that includes 1872 meta-actions and find that this 157-fold increase in the size of the action space incurs less than a 4-fold increase in computation time and less than a 3-fold increase in number of nodes generated when performing Q* search.

Rubik's Cube

Detecting Pulmonary Coccidioidomycosis (Valley fever) with Deep Convolutional Neural Networks

no code implementations30 Jan 2021 Jordan Ott, David Bruyette, Cody Arbuckle, Dylan Balsz, Silke Hecht, Lisa Shubitz, Pierre Baldi

We also use the classification model to identify regions of interest and localize the disease in the radiographic images, as illustrated through visual heatmaps.

Deep-Learning-Based Kinematic Reconstruction for DUNE

no code implementations11 Dec 2020 Junze Liu, Jordan Ott, Julian Collado, Benjamin Jargowsky, Wenjie Wu, Jianming Bian, Pierre Baldi

To precisely reconstruct these kinematic characteristics of detected interactions at DUNE, we have developed and will present two CNN-based methods, 2-D and 3-D, for the reconstruction of final state particle direction and energy, as well as neutrino energy.

Learning to Identify Electrons

1 code implementation3 Nov 2020 Julian Collado, Jessica N. Howard, Taylor Faucett, Tony Tong, Pierre Baldi, Daniel Whiteson

We investigate whether state-of-the-art classification features commonly used to distinguish electrons from jet backgrounds in collider experiments are overlooking valuable information.

Data Analysis, Statistics and Probability High Energy Physics - Experiment High Energy Physics - Phenomenology

Permutationless Many-Jet Event Reconstruction with Symmetry Preserving Attention Networks

1 code implementation19 Oct 2020 Michael James Fenton, Alexander Shmakov, Ta-Wei Ho, Shih-Chieh Hsu, Daniel Whiteson, Pierre Baldi

Top quarks, produced in large numbers at the Large Hadron Collider, have a complex detector signature and require special reconstruction techniques.

Single Particle Analysis

Quantity vs. Quality: On Hyperparameter Optimization for Deep Reinforcement Learning

no code implementations29 Jul 2020 Lars Hertel, Pierre Baldi, Daniel L. Gillen

Reinforcement learning algorithms can show strong variation in performance between training runs with different random seeds.

Bayesian Optimization Hyperparameter Optimization +2

SPLASH: Learnable Activation Functions for Improving Accuracy and Adversarial Robustness

no code implementations16 Jun 2020 Mohammadamin Tavakoli, Forest Agostinelli, Pierre Baldi

Furthermore, we show that SPLASH units significantly increase the robustness of deep neural networks to adversarial attacks.

Adversarial Robustness

Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games

2 code implementations NeurIPS 2020 Stephen McAleer, John Lanier, Roy Fox, Pierre Baldi

We also introduce an open-source environment for Barrage Stratego, a variant of Stratego with an approximate game tree complexity of $10^{50}$.

reinforcement-learning Reinforcement Learning (RL)

Continuous Representation of Molecules Using Graph Variational Autoencoder

no code implementations17 Apr 2020 Mohammadamin Tavakoli, Pierre Baldi

In order to continuously represent molecules, we propose a generative model in the form of a VAE which is operating on the 2D-graph structure of molecules.

Property Prediction

A Fortran-Keras Deep Learning Bridge for Scientific Computing

2 code implementations14 Apr 2020 Jordan Ott, Mike Pritchard, Natalie Best, Erik Linstead, Milan Curcic, Pierre Baldi

Implementing artificial neural networks is commonly achieved via high-level programming languages like Python and easy-to-use deep learning libraries like Keras.

ColosseumRL: A Framework for Multiagent Reinforcement Learning in $N$-Player Games

no code implementations10 Dec 2019 Alexander Shmakov, John Lanier, Stephen Mcaleer, Rohan Achar, Cristina Lopes, Pierre Baldi

Much of recent success in multiagent reinforcement learning has been in two-player zero-sum games.

Multiagent Systems

Symmetric-APL Activations: Training Insights and Robustness to Adversarial Attacks

no code implementations25 Sep 2019 Mohammadamin Tavakoli, Forest Agostinelli, Pierre Baldi

Finally, we show that the use of Symmetric-APL activations can significantly increase the robustness of deep neural networks to adversarial attacks.

Learning in the Machine: To Share or Not to Share?

1 code implementation23 Sep 2019 Jordan Ott, Erik Linstead, Nicholas LaHaye, Pierre Baldi

Weight-sharing is one of the pillars behind Convolutional Neural Networks and their successes.

Enforcing Analytic Constraints in Neural-Networks Emulating Physical Systems

4 code implementations3 Sep 2019 Tom Beucler, Michael Pritchard, Stephan Rasp, Jordan Ott, Pierre Baldi, Pierre Gentine

Neural networks can emulate nonlinear physical systems with high accuracy, yet they may produce physically-inconsistent results when violating fundamental constraints.

Computational Physics Atmospheric and Oceanic Physics

Neural Network Regression with Beta, Dirichlet, and Dirichlet-Multinomial Outputs

no code implementations ICLR 2019 Peter Sadowski, Pierre Baldi

We show that each target can be modeled as a sample from a Dirichlet distribution, where the parameters of the Dirichlet are provided by the output of a neural network, and that the combined model can be trained using the gradient of the data likelihood.

Decision Making regression

Solving the Rubik's Cube with Approximate Policy Iteration

no code implementations ICLR 2019 Stephen McAleer, Forest Agostinelli, Alexander Shmakov, Pierre Baldi

Autodidactic Iteration is able to learn how to solve the Rubik’s Cube and the 15-puzzle without relying on human data.

Rubik's Cube

The capacity of feedforward neural networks

no code implementations2 Jan 2019 Pierre Baldi, Roman Vershynin

Here we define the capacity of an architecture by the binary logarithm of the number of functions it can compute, as the synaptic weights are varied.

On Neuronal Capacity

no code implementations NeurIPS 2018 Pierre Baldi, Roman Vershynin

We define the capacity of a learning machine to be the logarithm of the number (or volume) of the functions it can implement.

Gaussian Process Accelerated Feldman-Cousins Approach for Physical Parameter Inference

no code implementations16 Nov 2018 Lingge Li, Nitish Nayak, Jianming Bian, Pierre Baldi

The unified approach of Feldman and Cousins allows for exact statistical inference of small signals that commonly arise in high energy physics.

Gaussian Processes

Solving the Rubik's Cube Without Human Knowledge

9 code implementations18 May 2018 Stephen McAleer, Forest Agostinelli, Alexander Shmakov, Pierre Baldi

A generally intelligent agent must be able to teach itself how to solve problems in complex domains with minimal human supervision.

Combinatorial Optimization reinforcement-learning +2

Learning in the Machine: the Symmetries of the Deep Learning Channel

no code implementations22 Dec 2017 Pierre Baldi, Peter Sadowski, Zhiqin Lu

Specifically, random backpropagation and its variations can be performed with the same non-linear neurons used in the main input-output forward channel, and the connections in the learning channel can be adapted using the same algorithm used in the forward channel, removing the need for any specialized hardware in the learning channel.

Efficient Antihydrogen Detection in Antimatter Physics by Deep Learning

1 code implementation6 Jun 2017 Peter Sadowski, Balint Radics, Ananya, Yasunori Yamazaki, Pierre Baldi

Antihydrogen is at the forefront of antimatter research at the CERN Antiproton Decelerator.

Decorrelated Jet Substructure Tagging using Adversarial Neural Networks

no code implementations10 Mar 2017 Chase Shimmin, Peter Sadowski, Pierre Baldi, Edison Weik, Daniel Whiteson, Edward Goul, Andreas Søgaard

We describe a strategy for constructing a neural network jet substructure tagger which powerfully discriminates boosted decay signals while remaining largely uncorrelated with the jet mass.

Jet Tagging

Learning in the Machine: Random Backpropagation and the Deep Learning Channel

no code implementations8 Dec 2016 Pierre Baldi, Peter Sadowski, Zhiqin Lu

It is remarkable both because of its effectiveness, in spite of using random matrices to communicate error information, and because it completely removes the taxing requirement of maintaining symmetric weights in a physical neural system.

Parameterized Machine Learning for High-Energy Physics

2 code implementations28 Jan 2016 Pierre Baldi, Kyle Cranmer, Taylor Faucett, Peter Sadowski, Daniel Whiteson

We investigate a new structure for machine learning classifiers applied to problems in high-energy physics by expanding the inputs to include not only measured features but also physics parameters.

BIG-bench Machine Learning Vocal Bursts Intensity Prediction

A Theory of Local Learning, the Learning Channel, and the Optimality of Backpropagation

no code implementations22 Jun 2015 Pierre Baldi, Peter Sadowski

The nature of the communicated information about the targets and the structure of the learning channel partition the space of learning algorithms.

Searching for Higgs Boson Decay Modes with Deep Learning

no code implementations NeurIPS 2014 Peter J. Sadowski, Daniel Whiteson, Pierre Baldi

Particle colliders enable us to probe the fundamental nature of matter by observing exotic particles produced by high-energy collisions.

BIG-bench Machine Learning

Enhanced Higgs to $τ^+τ^-$ Searches with Deep Learning

no code implementations13 Oct 2014 Pierre Baldi, Peter Sadowski, Daniel Whiteson

The Higgs boson is thought to provide the interaction that imparts mass to the fundamental fermions, but while measurements at the Large Hadron Collider (LHC) are consistent with this hypothesis, current analysis techniques lack the statistical power to cross the traditional 5$\sigma$ significance barrier without more data.

Bayesian Optimization

Searching for Exotic Particles in High-Energy Physics with Deep Learning

2 code implementations19 Feb 2014 Pierre Baldi, Peter Sadowski, Daniel Whiteson

Standard approaches have relied on `shallow' machine learning models that have a limited capacity to learn complex non-linear functions of the inputs, and rely on a pain-staking search through manually constructed non-linear features.

High Energy Physics - Phenomenology High Energy Physics - Experiment

Understanding Dropout

no code implementations NeurIPS 2013 Pierre Baldi, Peter J. Sadowski

Dropout is a relatively new algorithm for training neural networks which relies on stochastically dropping out'' neurons during training in order to avoid the co-adaptation of feature detectors.

Complex-Valued Autoencoders

no code implementations20 Aug 2011 Pierre Baldi, Zhiqin Lu

The general framework described here is useful to classify autoencoders and identify general common properties that ought to be investigated for each class, illuminating some of the connections between information theory, unsupervised learning, clustering, Hebbian learning, and autoencoders.

Clustering

Cannot find the paper you are looking for? You can Submit a new open access paper.