1 code implementation • 22 Oct 2024 • Isamu Isozaki, Manil Shrestha, Rick Console, Edward Kim
We first evaluate the performance of LLMs, including GPT-4o and Llama 3. 1-405B, using the state-of-the-art PentestGPT tool.
no code implementations • 27 Sep 2024 • Manil Shrestha, Yashodha Ravichandran, Edward Kim
In our research, we present a secure and private methodology for generative artificial intelligence that does not expose sensitive data or models to third-party AI providers.
no code implementations • 22 Sep 2024 • Hongchen Wang, Kangming Li, Scott Ramsay, Yao Fehlis, Edward Kim, Jason Hattrick-Simpers
Large Language Models (LLMs) have the potential to revolutionize scientific research, yet their robustness and reliability in domain-specific applications remain insufficiently explored.
no code implementations • 1 Jul 2024 • Ximing Wen, Rosina O. Weber, Anik Sen, Darryl Hannan, Steven C. Nesbit, Vincent Chan, Alberto Goffi, Michael Morris, John C. Hunninghake, Nicholas E. Villalobos, Edward Kim, Christopher J. MacLellan
Point-of-Care Ultrasound (POCUS) is the practice of clinicians conducting and interpreting ultrasound scans right at the patient's bedside.
1 code implementation • 12 Apr 2024 • Aref Azizpour, Tai D. Nguyen, Manil Shrestha, Kaidi Xu, Edward Kim, Matthew C. Stamm
To address these issues, we introduce the Ensemble of Expert Embedders (E3), a novel continual learning framework for updating synthetic image detectors.
no code implementations • 4 Mar 2024 • Darryl Hannan, Steven C. Nesbit, Ximing Wen, Glen Smith, Qiao Zhang, Alberto Goffi, Vincent Chan, Michael J. Morris, John C. Hunninghake, Nicholas E. Villalobos, Edward Kim, Rosina O. Weber, Christopher J. MacLellan
Detecting elevated intracranial pressure (ICP) is crucial in diagnosing and managing various neurological conditions.
no code implementations • 9 Feb 2024 • Darryl Hannan, Ragib Arnab, Gavin Parpart, Garrett T. Kenyon, Edward Kim, Yijing Watkins
In this paper, we investigate the viability of event streams for overhead object detection.
no code implementations • 5 Feb 2024 • Edward Kim
Given the impressive capabilities of recent Large Language Models (LLMs), we investigate and benchmark the most popular proprietary and different sized open source models on the task of explicit instruction following in conflicting situations, e. g. overrides.
no code implementations • 29 Nov 2023 • Meriem Boubdir, Edward Kim, Beyza Ermis, Sara Hooker, Marzieh Fadaee
In Natural Language Processing (NLP), the Elo rating system, originally designed for ranking players in dynamic games such as chess, is increasingly being used to evaluate Large Language Models (LLMs) through "A vs B" paired comparisons.
no code implementations • 22 Oct 2023 • Meriem Boubdir, Edward Kim, Beyza Ermis, Marzieh Fadaee, Sara Hooker
Human evaluation is increasingly critical for assessing large language models, capturing linguistic nuances, and reflecting user preferences more accurately than traditional automated metrics.
1 code implementation • 14 Sep 2023 • Chenan Wang, Jinhao Duan, Chaowei Xiao, Edward Kim, Matthew Stamm, Kaidi Xu
Then there are two variants of this framework: 1) the Semantic Transformation (ST) approach fine-tunes the latent space of the generated image and/or the diffusion model itself; 2) the Latent Masking (LM) approach masks the latent space with another target image and local backpropagation-based interpretation methods.
1 code implementation • 1 Sep 2023 • Michael Shenoda, Edward Kim
Generating high-quality labeled image datasets is crucial for training accurate and robust machine learning models in the field of computer vision.
no code implementations • 28 Jul 2023 • Xiangyun Lei, Edward Kim, Viktoriia Baibakova, Shijing Sun
In summary, our study appreciates the benchmark set by these seminal papers while advocating for further enhancements in research reproducibility practices in the field of NLP for materials science.
no code implementations • 29 Dec 2022 • Andrew O'Brien, Rosina Weber, Edward Kim
The SINDy algorithm has been successfully used to identify the governing equations of dynamical systems from time series data.
no code implementations • 6 Dec 2022 • Darryl Hannan, Steven C. Nesbit, Ximing Wen, Glen Smith, Qiao Zhang, Alberto Goffi, Vincent Chan, Michael J. Morris, John C. Hunninghake, Nicholas E. Villalobos, Edward Kim, Rosina O. Weber, Christopher J. MacLellan
Point-of-Care Ultrasound (POCUS) refers to clinician-performed and interpreted ultrasonography at the patient's bedside.
no code implementations • 30 May 2022 • Gavin Parpart, Carlos Gonzalez, Terrence C. Stewart, Edward Kim, Jocelyn Rego, Andrew O'Brien, Steven Nesbit, Garrett T. Kenyon, Yijing Watkins
The Locally Competitive Algorithm (LCA) uses local competition between non-spiking leaky integrator neurons to infer sparse representations, allowing for potentially real-time execution on massively parallel neuromorphic architectures such as Intel's Loihi processor.
no code implementations • 14 Apr 2022 • Ethan Jacob Moyer, Alisha Isabelle Augustin, Satvik Tripathi, Ansh Aashish Dholakia, Andy Nguyen, Isamu Mclean Isozaki, Daniel Schwartz, Edward Kim
In each generation of our evolutionary algorithm, a set number of children with the same initial weights are spawned.
no code implementations • 11 Mar 2022 • Maryam Daniali, Edward Kim
Next, we demonstrate how our novel visual perception framework can utilize this information "over time" using a biologically plausible algorithm with recurrent units, and as a result, significantly improving its accuracy and robustness over standard CNNs.
no code implementations • 1 Dec 2021 • Edward Kim, Jay Shenoy, Sebastian Junges, Daniel Fremont, Alberto Sangiovanni-Vincentelli, Sanjit Seshia
Simulation-based testing of autonomous vehicles (AVs) has become an essential complement to road testing to ensure safety.
no code implementations • 28 Oct 2021 • Francis Indaheng, Edward Kim, Kesav Viswanadha, Jay Shenoy, Jinkyu Kim, Daniel J. Fremont, Sanjit A. Seshia
Hence, it is important that these prediction models are extensively tested in various test scenarios involving interactive behaviors prior to deployment.
no code implementations • 1 Oct 2021 • Andrew O'Brien, Edward Kim
The recent adoption of machine learning as a tool in real world decision making has spurred interest in understanding how these decisions are being made.
no code implementations • 20 Aug 2021 • Kesav Viswanadha, Francis Indaheng, Justin Wong, Edward Kim, Ellen Kalvan, Yash Pant, Daniel J. Fremont, Sanjit A. Seshia
Sampling from an abstract scenario yields many different concrete scenarios which can be run as test cases for the AV.
1 code implementation • 9 Jul 2021 • Kesav Viswanadha, Edward Kim, Francis Indaheng, Daniel J. Fremont, Sanjit A. Seshia
Falsification has emerged as an important tool for simulation-based verification of autonomous systems.
no code implementations • 18 Jun 2021 • Abdus Salam Azad, Edward Kim, Qiancheng Wu, Kimin Lee, Ion Stoica, Pieter Abbeel, Sanjit A. Seshia
To showcase the benefits, we interfaced SCENIC to an existing RTS environment Google Research Football(GRF) simulator and introduced a benchmark consisting of 32 realistic scenarios, encoded in SCENIC, to train RL agents and testing their generalization capabilities.
no code implementations • 13 Jun 2021 • Yigit Alparslan, Edward Kim
In this paper, we explore the effect of architecture completeness on adversarial robustness.
no code implementations • 25 May 2021 • Edward Kim, Stanley Bak, Parasara Sridhar Duggirala
To this end, we investigate two techniques for generating the template directions.
no code implementations • 18 Apr 2021 • Ethan Moyer, Jeff Winchell, Isamu Isozaki, Yigit Alparslan, Mali Halac, Edward Kim
Identifying novel functional protein structures is at the heart of molecular engineering and molecular biology, requiring an often computationally exhaustive search.
1 code implementation • 1 Mar 2021 • Yigit Alparslan, Edward Kim
We compare both strategies to buying and holding one single share for the period that we picked as a benchmark.
1 code implementation • 24 Feb 2021 • Yigit Alparslan, Edward Kim
In this study, we investigate eye closedness detection to prevent vehicle accidents related to driver disengagements and driver drowsiness.
1 code implementation • 16 Jan 2021 • Yigit Alparslan, Ethan Jacob Moyer, Isamu Mclean Isozaki, Daniel Schwartz, Adam Dunlop, Shesh Dave, Edward Kim
Architecture size for a neural network contributes significantly to the success of any neural network.
1 code implementation • 16 Jan 2021 • Yigit Alparslan, Ethan Jacob Moyer, Edward Kim
In this paper, we study compact neural network architectures for binary classification and investigate improvements in speed and accuracy when favoring overcomplete architecture candidates that have a very high-dimensional representation of the input.
no code implementations • 30 Nov 2020 • Jay Shenoy, Edward Kim, Xiangyu Yue, Taesung Park, Daniel Fremont, Alberto Sangiovanni-Vincentelli, Sanjit Seshia
In this paper, we present a platform to model dynamic and interactive scenarios, generate the scenarios in simulation with different modalities of labeled sensor data, and collect this information for data augmentation.
no code implementations • 24 Nov 2020 • Edward Kim, Connor Onweller, Andrew O'Brien, Kathleen Mccoy
Artificial neural networks (ANNs), specifically deep learning networks, have often been labeled as black boxes due to the fact that the internal representation of the data is not easily interpretable.
no code implementations • 23 Nov 2020 • Edward Kim, Maryam Daniali, Jocelyn Rego, Garrett T. Kenyon
Research has shown that neurons within the brain are selective to certain stimuli.
2 code implementations • 13 Oct 2020 • Daniel J. Fremont, Edward Kim, Tommaso Dreossi, Shromona Ghosh, Xiangyu Yue, Alberto L. Sangiovanni-Vincentelli, Sanjit A. Seshia
We design a domain-specific language, Scenic, for describing scenarios that are distributions over scenes and the behaviors of their agents over time.
no code implementations • CVPR 2020 • Edward Kim, Divya Gopinath, Corina Pasareanu, Sanjit A. Seshia
Our approach is semantic in that it employs a high-level representation of the distribution of environment scenarios that the detector is intended to work on.
no code implementations • CVPR 2020 • Edward Kim, Jocelyn Rego, Yijing Watkins, Garrett T. Kenyon
These exploits, or adversarial examples, are a type of signal attack that can change the output class of a classifier by perturbing the stimulus signal by an imperceptible amount.
no code implementations • 17 Mar 2020 • Daniel J. Fremont, Edward Kim, Yash Vardhan Pant, Sanjit A. Seshia, Atul Acharya, Xantha Bruso, Paul Wells, Steve Lemke, Qiang Lu, Shalin Mehta
We present a new approach to automated scenario-based testing of the safety of autonomous vehicles, especially those using advanced artificial intelligence-based components, spanning both simulation-based evaluation as well as testing in the real world.
no code implementations • 1 Dec 2019 • Edward Kim, Divya Gopinath, Corina Pasareanu, Sanjit Seshia
It is programmatic in that scenario representation is a program in a domain-specific probabilistic programming language which can be used to generate synthetic data to test a given perception module.
no code implementations • 25 Nov 2019 • Yoolhee Kim, Edward Kim, Erin Antono, Bryce Meredig, Julia Ling
Materials discovery is often compared to the challenge of finding a needle in a haystack.
no code implementations • WS 2019 • Sheshera Mysore, Zach Jensen, Edward Kim, Kevin Huang, Haw-Shiuan Chang, Emma Strubell, Jeffrey Flanigan, Andrew McCallum, Elsa Olivetti
Materials science literature contains millions of materials synthesis procedures described in unstructured natural language text.
1 code implementation • 12 Feb 2019 • Tommaso Dreossi, Daniel J. Fremont, Shromona Ghosh, Edward Kim, Hadi Ravanbakhsh, Marcell Vazquez-Chanlatte, Sanjit A. Seshia
We present VERIFAI, a software toolkit for the formal design and analysis of systems that include artificial intelligence (AI) and machine learning (ML) components.
1 code implementation • 31 Dec 2018 • Edward Kim, Zach Jensen, Alexander van Grootel, Kevin Huang, Matthew Staib, Sheshera Mysore, Haw-Shiuan Chang, Emma Strubell, Andrew McCallum, Stefanie Jegelka, Elsa Olivetti
Leveraging new data sources is a key step in accelerating the pace of materials design and discovery.
no code implementations • 17 Nov 2018 • Jacob M. Springer, Charles S. Strauss, Austin M. Thresher, Edward Kim, Garrett T. Kenyon
Although deep learning has shown great success in recent years, researchers have discovered a critical flaw where small, imperceptible changes in the input to the system can drastically change the output classification.
no code implementations • CVPR 2018 • Edward Kim, Darryl Hannan, Garrett Kenyon
The brain does not work solely in a feed-forward fashion, but rather all of the neurons are in competition with each other; neurons are integrating information in a bottom up and top down fashion and incorporating expectation and feedback in the modeling process.
no code implementations • 18 Nov 2017 • Sheshera Mysore, Edward Kim, Emma Strubell, Ao Liu, Haw-Shiuan Chang, Srikrishna Kompella, Kevin Huang, Andrew McCallum, Elsa Olivetti
In this work, we present a system for automatically extracting structured representations of synthesis procedures from the texts of materials science journal articles that describe explicit, experimental syntheses of inorganic compounds.