Search Results for author: Stefano Markidis

Found 9 papers, 2 papers with code

FFTc: An MLIR Dialect for Developing HPC Fast Fourier Transform Libraries

no code implementations14 Jul 2022 Yifei He, Artur Podobas, Måns I. Andersson, Stefano Markidis

Discrete Fourier Transform (DFT) libraries are one of the most critical software components for scientific computing.

Exploring Techniques for the Analysis of Spontaneous Asynchronicity in MPI-Parallel Applications

no code implementations27 May 2022 Ayesha Afzal, Georg Hager, Gerhard Wellein, Stefano Markidis

This paper studies the utility of using data analytics and machine learning techniques for identifying, classifying, and characterizing the dynamics of large-scale parallel (MPI) programs.

Clustering General Classification

Higgs Boson Classification: Brain-inspired BCPNN Learning with StreamBrain

1 code implementation14 Jul 2021 Martin Svedin, Artur Podobas, Steven W. D. Chien, Stefano Markidis

One of the most promising approaches for data analysis and exploration of large data sets is Machine Learning techniques that are inspired by brain models.

Classification

A Deep Learning-Based Particle-in-Cell Method for Plasma Simulations

no code implementations5 Jul 2021 Xavier Aguilar, Stefano Markidis

We design and develop a new Particle-in-Cell (PIC) method for plasma simulations using Deep-Learning (DL) to calculate the electric field from the electron phase space.

Total Energy

Automatic Particle Trajectory Classification in Plasma Simulations

no code implementations11 Oct 2020 Stefano Markidis, Ivy Peng, Artur Podobas, Itthinat Jongsuebchoke, Gabriel Bengtsson, Pawel Herman

Our overall goal is to provide a general workflow for exploring particle trajectory space and automatically classifying particle trajectories from plasma simulations in an unsupervised manner.

Classification Clustering +1

Characterizing Deep-Learning I/O Workloads in TensorFlow

no code implementations6 Oct 2018 Steven W. D. Chien, Stefano Markidis, Chaitanya Prasad Sishtla, Luis Santos, Pawel Herman, Sai Narasimhamurthy, Erwin Laure

To measure TensorFlow I/O performance, we first design a micro-benchmark to measure TensorFlow reads, and then use a TensorFlow mini-application based on AlexNet to measure the performance cost of I/O and checkpointing in TensorFlow.

Distributed, Parallel, and Cluster Computing

NVIDIA Tensor Core Programmability, Performance & Precision

no code implementations11 Mar 2018 Stefano Markidis, Steven Wei Der Chien, Erwin Laure, Ivy Bo Peng, Jeffrey S. Vetter

After experimenting with different approaches, we found that NVIDIA Tensor Cores can deliver up to 83 Tflops/s in mixed precision on a Tesla V100 GPU, seven and three times the performance in single and half precision respectively.

Distributed, Parallel, and Cluster Computing Performance

Cannot find the paper you are looking for? You can Submit a new open access paper.