Search Results for author: Patrick Valduriez

Found 10 papers, 1 papers with code

StreamEnsemble: Predictive Queries over Spatiotemporal Streaming Data

no code implementations30 Sep 2024 Anderson Chaves, Eduardo Ogasawara, Patrick Valduriez, Fabio Porto

Predictive queries over spatiotemporal (ST) stream data pose significant data processing and analysis challenges.

Time Series

Fisher Information-based Efficient Curriculum Federated Learning with Large Language Models

no code implementations30 Sep 2024 Ji Liu, Jiaxiang Ren, Ruoming Jin, Zijie Zhang, Yang Zhou, Patrick Valduriez, Dejing Dou

First, we propose a fisher information-based method to adaptively sample data within each device to improve the effectiveness of the FL fine-tuning process.

Federated Learning

AEDFL: Efficient Asynchronous Decentralized Federated Learning with Heterogeneous Devices

no code implementations18 Dec 2023 Ji Liu, Tianshi Che, Yang Zhou, Ruoming Jin, Huaiyu Dai, Dejing Dou, Patrick Valduriez

First, we propose an asynchronous FL system model with an efficient model aggregation method for improving the FL convergence.

Federated Learning

Large-scale Knowledge Distillation with Elastic Heterogeneous Computing Resources

1 code implementation14 Jul 2022 Ji Liu, daxiang dong, Xi Wang, An Qin, Xingjian Li, Patrick Valduriez, Dejing Dou, dianhai yu

Although more layers and more parameters generally improve the accuracy of the models, such big models generally have high computational complexity and require big memory, which exceed the capacity of small devices for inference and incurs long training time.

Knowledge Distillation

Distributed intelligence on the Edge-to-Cloud Continuum: A systematic literature review

no code implementations29 Apr 2022 Daniel Rosendo, Alexandru Costan, Patrick Valduriez, Gabriel Antoniu

Today, to balance various trade-offs, ML-based analytics tends to increasingly leverage an interconnected ecosystem that allows complex applications to be executed on hybrid infrastructures where IoT Edge devices are interconnected to Cloud/HPC systems in what is called the Computing Continuum, the Digital Continuum, or the Transcontinuum. Enabling learning-based analytics on such complex infrastructures is challenging.

Systematic Literature Review

Hyperspherical embedding for novel class classification

no code implementations5 Feb 2021 Rafael S. Pereira, Alexis Joly, Patrick Valduriez, Fabio Porto

However, this traditional approach is not useful for identifying classes unseen on the training set, known as the open set problem.

Classification Few-Shot Learning +3

Workflow Provenance in the Lifecycle of Scientific Machine Learning

no code implementations30 Sep 2020 Renan Souza, Leonardo G. Azevedo, Vítor Lourenço, Elton Soares, Raphael Thiago, Rafael Brandão, Daniel Civitarese, Emilio Vital Brazil, Marcio Moreno, Patrick Valduriez, Marta Mattoso, Renato Cerqueira, Marco A. S. Netto

We contribute with (i) characterization of the lifecycle and taxonomy for data analyses; (ii) design principles to build this view, with a W3C PROV compliant data representation and a reference system architecture; and (iii) lessons learned after an evaluation in an Oil & Gas case using an HPC cluster with 393 nodes and 946 GPUs.

BIG-bench Machine Learning

Provenance Data in the Machine Learning Lifecycle in Computational Science and Engineering

no code implementations9 Oct 2019 Renan Souza, Leonardo Azevedo, Vítor Lourenço, Elton Soares, Raphael Thiago, Rafael Brandão, Daniel Civitarese, Emilio Vital Brazil, Marcio Moreno, Patrick Valduriez, Marta Mattoso, Renato Cerqueira, Marco A. S. Netto

To handle this problem, in this paper we contribute with a detailed characterization of provenance data in the ML lifecycle in CSE; a new provenance data representation, called PROV-ML, built on top of W3C PROV and ML Schema; and extensions to a system that tracks provenance from multiple workflows to address the characteristics of ML and CSE, and to allow for provenance queries with a standard vocabulary.

BIG-bench Machine Learning

Parallel Computation of PDFs on Big Spatial Data Using Spark

no code implementations8 May 2018 Ji Liu, Noel Moreno Lemus, Esther Pacitti, Fabio Porto, Patrick Valduriez

We consider big spatial data, which is typically produced in scientific areas such as geological or seismic interpretation.

Seismic Interpretation

Cannot find the paper you are looking for? You can Submit a new open access paper.