Search Results for author: Geoffrey Fox

Found 20 papers, 3 papers with code

RINAS: Training with Dataset Shuffling Can Be General and Fast

no code implementations4 Dec 2023 Tianle Zhong, Jiechen Zhao, Xindi Guo, Qiang Su, Geoffrey Fox

However, loading shuffled data for large datasets incurs significant overhead in the deep learning pipeline and severely impacts the end-to-end training throughput.

Language Modelling

RTP: Rethinking Tensor Parallelism with Memory Deduplication

1 code implementation2 Nov 2023 Cheng Luo, Tianle Zhong, Geoffrey Fox

In the evolving landscape of neural network models, one prominent challenge stand out: the significant memory overheads associated with training expansive models.

Analyzing the Performance of Deep Encoder-Decoder Networks as Surrogates for a Diffusion Equation

no code implementations7 Feb 2023 J. Quetzalcoatl Toledo-Marin, James A. Glazier, Geoffrey Fox

Our results indicate that increasing the size of the training set has a substantial effect on reducing performance fluctuations and overall error.

GTrans: Spatiotemporal Autoregressive Transformer with Graph Embeddings for Nowcasting Extreme Events

no code implementations18 Jan 2022 Bo Feng, Geoffrey Fox

In contrast, applications in social networks, road traffic, physics, and chemical property prediction where data features can be organized with nodes and edges of graphs.

Property Prediction Time Series +1

Earthquake Nowcasting with Deep Learning

no code implementations18 Dec 2021 Geoffrey Fox, John Rundle, Andrea Donnellan, Bo Feng

We review previous approaches to nowcasting earthquakes and introduce new approaches based on deep learning using three distinct models based on recurrent neural networks and transformers.

Scientific Machine Learning Benchmarks

no code implementations25 Oct 2021 Jeyan Thiyagalingam, Mallikarjun Shankar, Geoffrey Fox, Tony Hey

In this paper, we describe our approach to the development of scientific machine learning benchmarks and review other approaches to benchmarking scientific machine learning.

Benchmarking BIG-bench Machine Learning

Multidimensional Scaling for Gene Sequence Data with Autoencoders

no code implementations19 Apr 2021 Pulasthi Wickramasinghe, Geoffrey Fox

Multidimensional scaling of gene sequence data has long played a vital role in analysing gene sequence data to identify clusters and patterns.

Dimensionality Reduction

Deep learning approaches to surrogates for solving the diffusion equation for mechanistic real-world simulations

1 code implementation10 Feb 2021 J. Quetzalcóatl Toledo-Marín, Geoffrey Fox, James P. Sluka, James A. Glazier

To improve convergence during training, we apply a training approach that uses roll-back to reject stochastic changes to the network that increase the loss function.

A Fast, Scalable, Universal Approach For Distributed Data Aggregations

no code implementations27 Oct 2020 Niranda Perera, Vibhatha Abeykoon, Chathura Widanage, Supun Kamburugamuve, Thejaka Amila Kanewala, Pulasthi Wickramasinghe, Ahmet Uyar, Hasara Maithree, Damitha Lenadora, Geoffrey Fox

But, we believe that there is an essential requirement for a data analytics tool that can universally integrate with existing frameworks, and thereby increase the productivity and efficiency of the entire data analytics pipeline.

Deep Tiered Image Segmentation For Detecting Internal Ice Layers in Radar Imagery

no code implementations8 Oct 2020 Yuchen Wang, Mingze Xu, John Paden, Lora Koenig, Geoffrey Fox, David Crandall

Understanding the structure of Earth's polar ice sheets is important for modeling how global warming will impact polar ice and, in turn, the Earth's climate.

Image Segmentation Semantic Segmentation

High Performance Data Engineering Everywhere

no code implementations19 Jul 2020 Chathura Widanage, Niranda Perera, Vibhatha Abeykoon, Supun Kamburugamuve, Thejaka Amila Kanewala, Hasara Maithree, Pulasthi Wickramasinghe, Ahmet Uyar, Gurhan Gunduz, Geoffrey Fox

In this paper we present Cylon, an open-source high performance distributed data processing library that can be seamlessly integrated with existing Big Data and AI/ML frameworks.

Distributed, Parallel, and Cluster Computing Databases

Scientific Image Restoration Anywhere

2 code implementations12 Nov 2019 Vibhatha Abeykoon, Zhengchun Liu, Rajkumar Kettimuthu, Geoffrey Fox, Ian Foster

We explore this question by evaluating the performance and accuracy of a scientific image restoration model, for which both model input and output are images, on edge computing devices.

Edge-computing Image Denoising +2

Learning Everywhere: A Taxonomy for the Integration of Machine Learning and Simulations

no code implementations29 Sep 2019 Geoffrey Fox, Shantenu Jha

We present a taxonomy of research on Machine Learning (ML) applied to enhance simulations together with a catalog of some activities.

BIG-bench Machine Learning

Understanding ML driven HPC: Applications and Infrastructure

no code implementations5 Sep 2019 Geoffrey Fox, Shantenu Jha

We recently outlined the vision of "Learning Everywhere" which captures the possibility and impact of how learning methods and traditional HPC methods can be coupled together.

Performance Optimization on Model Synchronization in Parallel Stochastic Gradient Descent Based SVM

no code implementations3 May 2019 Vibhatha Abeykoon, Geoffrey Fox, Minje Kim

In this research, we identify the bottlenecks in model synchronization in parallel stochastic gradient descent (PSGD)-based SVM algorithm with respect to the training model synchronization frequency (MSF).

Model Optimization

Cannot find the paper you are looking for? You can Submit a new open access paper.