Talks | Parallel Computing

2024

Gianluca Mittone, Alberto Mulone, Iacopo Colonnelli, Robert Birke, Marco Aldinucci

Enabling Cross-Facility LLMs Pre-Training Miscellaneous

Accelerating the Development and Use of Generative AI for Science and Engineering: The Trillion Parameter Consortium (TPC), 2024.

Abstract | Links | BibTeX | Tags: eupilot, fl, icsc, space, streamflow

Gianluca Mittone

Benchmarking HPC Performance for State-of-the-Art AI Workloads Miscellaneous

2024.

Abstract | Links | BibTeX | Tags: ai, eupilot, icsc

@misc{24:mittone:itadata:shpcpee,

title = {Benchmarking HPC Performance for State-of-the-Art AI Workloads},

author = {Gianluca Mittone},

url = {https://datacloud.di.unito.it/index.php/s/5Ep3W7cPW5baZfr},

year  = {2024},

date = {2024-09-01},

address = {Pisa, Italy},

abstract = {Benchmarking the performance of modern High-Performance Computing (HPC) infrastructure on Artificial Intelligence (AI) workloads is a hot topic in the supercomputing community. While research communities and big-tech companies actively invest in larger, more powerful data centres to support AI research, the standard computational performance benchmarking tools (e.g., LINPACK) are increasingly becoming outdated since they are not specifically tailored for AI workloads. Some tools, such as MLPerf, are trying to bridge this gap, but the HPC community still has not adopted them as standards. Since this trend became particularly evident with the advent of Large Language Models (LLMs), this work will delve into LLM training at scale as a way to benchmark Top500 HPC infrastructures on current AI workloads. The scalability performances of a major LLM model (i.e., Meta's LLaMA) on different HPCs (Leonardo, LUMI, MeluXina, Karolina) are exposed and discussed along with their Top500 positioning. 

However, it should be noted that state-of-the-art LLM models are not trained on thousands of computing nodes but on hundreds. This choice is due to multiple factors, such as the influence of the training scaling on the model's convergence and the instability of large-scale deployments due to hardware/software failure. A benchmarking approach based on the next-generation LLM training approach is proposed to bypass all these issues. State-of-the-art LLMs are not monolithic structures but Mixture-of-Experts (MoE) models; this design implies innovative frontiers for the distributed training of such models due to the experts' training being potentially more parallelisable than a single monolithic model. We thus propose to create an AI-oriented HPC benchmark suite based on the parallel training of MoE models to measure the throughput performance of HPC systems on state-of-the-art AI workloads.},

keywords = {ai, eupilot, icsc},

pubstate = {published},

tppubtype = {misc}

}

Gianluca Mittone

Pushing Federated Learning Boundaries: Three Innovative Distributed Intelligence Approaches Miscellaneous

2024.

Abstract | Links | BibTeX | Tags: eupilot, fl, icsc

Giulio Malenza

Preliminary analysis of model parallelism applications on a 64-core RV64 Server CPU Miscellaneous

2024.

Abstract | Links | BibTeX | Tags: eupilot, icsc

Robert Birke

The impact of the advances in generative models on applications and systems Miscellaneous

8th GDR RSD / ASF Winter School on Distributed Systems & Networks 2024, 2024, (Keynote talk).

Abstract | Links | BibTeX | Tags: ai, eupilot, textarossa

Gianluca Mittone

RISC-V for AI Miscellaneous

High Performance, Edge And Cloud computing Conference 2024 (HiPEAC 2024), 2024.

Abstract | Links | BibTeX | Tags: eupilot, icsc

2023

Marco Aldinucci

Federated Learning: A Distributed System Viewpoint Miscellaneous

Bicocca University seminars, Milan, Italy, 2023, (Invited talk).

Abstract | Links | BibTeX | Tags: eupilot, icsc, textarossa

Iacopo Colonnelli, Robert Birke, Giulio Malenza, Gianluca Mittone, Alberto Mulone, Marco Aldinucci, Valerio Basile, Marco Antonio Stranisci, Viviana Patti, Jeroen Galjaard, Lydia Y. Chen, Sanzio Bassini, Massimiliano Guarrasi, Gabriella Scipione, Jan Martinovič, Vit Vondrák

Cross-Facility Federated Learning Miscellaneous

1st EuroHPC User Day, 2023.

Links | BibTeX | Tags: across, ai, eupex, eupilot, HPC

Gianluca Mittone, Giulio Malenza, Marco Aldinucci, Robert Birke

Distributed Edge Inference: an Experimental Study on Multiview Detection Miscellaneous

The 16th IEEE/ACM International Conference on Utility and Cloud Computing (UCC 2023), 2023.

Abstract | Links | BibTeX | Tags: ai, eupilot, icsc

Samuele Fonio

Benchmarking Federated Learning Frameworks for Medical Imaging Tasks Miscellaneous

Image Analysis and Processing - ICIAP 2023 - 22th International Conference - FedMed, 2023.

Abstract | Links | BibTeX | Tags: ai, eupilot, fl, icsc

Gianluca Mittone, Samuele Fonio

Benchmarking Federated Learning Scalability Miscellaneous

2nd Italian Conference on Big Data and Data Science (ITADATA 2023), 2023.

Abstract | Links | BibTeX | Tags: ai, eupilot, fl, icsc

Gianluca Mittone, Walter Riviera, Iacopo Colonnelli, Robert Birke, Marco Aldinucci

Model-Agnostic Federated Learning Miscellaneous

29th International European Conference on Parallel and Distributed Computing (Euro-Par '23), 2023.

Abstract | Links | BibTeX | Tags: ai, eupilot, icsc

Gianluca Mittone, Robert Birke, Marco Aldinucci

Model-Agnostic Federated Learning Miscellaneous

29th International European Conference on Parallel and Distributed Computing (Euro-Par '23), 2023.

Abstract | Links | BibTeX | Tags: eupilot, icsc

Gianluca Mittone, Nicolò Tonci, Robert Birke, Iacopo Colonnelli, Doriana Medić, Andrea Bartolini, Roberto Esposito, Emanuele Parisi, Francesco Beneventi, Mirko Polato, Massimo Torquati, Luca Benini, Marco Aldinucci

Experimenting with Emerging RISC-V Systems for Decentralised Machine Learning Miscellaneous

20th ACM international conference on computing frontiers (CF '23), 2023, (Invited talk).

Abstract | Links | BibTeX | Tags: ai, eupilot, icsc

Gianluca Mittone, Filip Svoboda, Marco Aldinucci, Nicholas D. Lane, Pietro Lio'

A Federated Learning Benchmark for Drug-Target Interaction Miscellaneous

2023 ACM international Web Conference (WWW '23), 2023, (Invited talk).

Abstract | Links | BibTeX | Tags: eupilot, icsc

Marco Aldinucci

Experimenting with Systems for Decentralized Machine Learning Miscellaneous

NVidia GTC 2023, 2023.

Abstract | Links | BibTeX | Tags: across, admire, epi, eumaster4hpc, eupex, eupilot, hpc4ai, space, textarossa

Marco Aldinucci

HPC4AI: The Research on AI beyond the public cloud Miscellaneous

CENTAI kick-off meeting, 2023.

Links | BibTeX | Tags: across, admire, brainteaser, epi, eumaster4hpc, eupex, eupilot, hpc4ai, space, textarossa

Marco Aldinucci

From HPC4AI to ICSC living lab: Where systems are the research Miscellaneous

Dell Advanced Computing Workshop 2023: HPC and Beyond, 2023.

Links | BibTeX | Tags: admire, eupex, eupilot, hpc4ai, textarossa

Gianluca Mittone

Paving the way to innovative tools for Federated Learning Miscellaneous

2023 HiPEAC Conference, 2023, (Invited talk).

Abstract | Links | BibTeX | Tags: eupilot

2022

Bruno Casella

Benchmarking FedAvg and FedCurv for Image Classification Tasks Miscellaneous

ITADATA, 2022.

Abstract | Links | BibTeX | Tags: eupilot

Marco Aldinucci

Il calcolo parallelo: una storia di metodi e algoritmi raccontata dalle macchine Miscellaneous

Olimpiadi di Informatica, 2022, (Invited talk).

Abstract | Links | BibTeX | Tags: across, admire, eumaster4hpc, eupex, eupilot, textarossa

Marco Aldinucci

La convergenza HPC-cloud è l'anello mancante tra il calcolo scientifico e l'IA applicata Miscellaneous

Intelligenza Artificiale e Business Applications, 2022, (Invited talk).

Abstract | Links | BibTeX | Tags: across, admire, eumaster4hpc, eupex, eupilot, textarossa

@misc{22:soiel:ai,

title = {La convergenza HPC-cloud è l'anello mancante tra il calcolo scientifico e l'IA applicata},

author = {Marco Aldinucci},

url = {https://datacloud.di.unito.it/index.php/s/xCQSqJ8bCKCXMK9},

year  = {2022},

date = {2022-09-01},

address = {Virtual event},

abstract = {Innanzitutto, le infrastrutture HPC stanno adottando le GPU per il loro rapporto prestazioni per watt superiore rispetto ai multicore generici. In secondo luogo, i flussi di lavoro scientifici di prossima generazione stanno integrando passaggi basati sull'intelligenza artificiale per la loro precisione nell'approssimazione e nell'analisi di fenomeni complessi. In terzo luogo, l'IA e in particolare il Machine Learning (ML) rappresentano un carico di lavoro perfetto per le GPU in termini di prestazioni e tempo di sviluppo. Oggi non possiamo ancora chiudere il cerchio eseguendo senza problemi carichi di lavoro scientifici abilitati all'intelligenza artificiale nelle infrastrutture HPC perché il loro software di sistema e gli strumenti di sviluppo non sono progettati per i carichi di lavoro moderni, come i framework ML progettati per il cloud. È probabile che la convergenza HPC-cloud colmi il divario. Nel talk verranno presentate le infrastrutture e gli strumenti sviluppati all'Università di Torino per la convergenza HPC-cloud (es. HPC4AI, StreamFlow, CAPIO, Jupyter-workflow) e come sono stati utilizzati per le applicazioni di intelligenza artificiale, come la diagnosi spiegabile di polmonite COVID-19 e la tutela della privacy AI. L'esperienza maturata nella progettazione e gestione di HPC4AI costituisce il cuore della progettazione del laboratorio di contaminazione del "FutureHPC" di Torino secondo il Centro Nazionale "HPC, BigData e Quantum Computing" finanziato dal PNRR con 320M€ che dovrebbe essere operativo dal 1 settembre 2022. L'obiettivo finale del laboratorio di contaminazione è sviluppare relazioni e collaborazioni tra industria e università.},

howpublished = {Intelligenza Artificiale e Business Applications},

note = {Invited talk},

keywords = {across, admire, eumaster4hpc, eupex, eupilot, textarossa},

pubstate = {published},

tppubtype = {misc}

}