Talks | Parallel Computing

2024

Gianluca Mittone, Alberto Mulone, Iacopo Colonnelli, Robert Birke, Marco Aldinucci

Enabling Cross-Facility LLMs Pre-Training Miscellaneous

Accelerating the Development and Use of Generative AI for Science and Engineering: The Trillion Parameter Consortium (TPC), 2024.

Abstract | Links | BibTeX | Tags: eupilot, fl, icsc, space, streamflow

Giulio Malenza

Performance portability via C++ PSTL, SYCL, OpenMP, and HIP: the Gaia AVU-GSR case study Miscellaneous

IEEE, 2024.

Links | BibTeX | Tags: eupex, icsc

Lorenzo Brescia

Secure workflow computation Miscellaneous

Presentation of previous works and future directions on securing workflows, 2024.

Links | BibTeX | Tags: confidential, icsc

Roberto Esposito Mirko Polato Samuele Fonio

FedHP: Federated Learning with Hyperspherical Prototypical Regularization Miscellaneous

32nd European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, (ESANN), 2024.

Abstract | Links | BibTeX | Tags: ai, fl, icsc

Adriano Marques Garcia

Assessing Large Language Models Inference Performance on a 64-core RISC-V CPU with Silicon-Enabled Vectors Miscellaneous

Proceedings of BigHPC2024: Special Track on Big Data and High-Performance Computing, co-located with the 3textsuperscriptrd Italian Conference on Big Data and Data Science, ITADATA2024, 2024.

Links | BibTeX | Tags: confidential, icsc

Lorenzo Brescia

Performance Analysis on DNA Alignment Workload with Intel SGX Multithreading Miscellaneous

Proceedings of BigHPC2024: Special Track on Big Data and High-Performance Computing, co-located with the 3textsuperscriptrd Italian Conference on Big Data and Data Science, ITADATA2024, 2024.

Links | BibTeX | Tags: confidential, icsc

Iacopo Colonnelli

Scientific Workflows in the Heterogeneous Computing Era Miscellaneous

2024.

Links | BibTeX | Tags: icsc

Giulio Malenza

Exploiting C++ Parallel Algorithms through FastFlow Miscellaneous

2024.

Abstract | Links | BibTeX | Tags: icsc

Giulio Malenza

Exploring energy consumption of AI frameworks on a 64-core RV64 Server CPU Miscellaneous

2024.

Abstract | Links | BibTeX | Tags: ai, DYMAN, icsc

Marco Edoardo Santimaria, Iacopo Colonnelli, Marco Aldinucci

Releasing the CAPIO middleware from MPI derived constraints Miscellaneous

2024.

Abstract | Links | BibTeX | Tags: across, admire, capio, capiocl, eupex, icsc

Marco Edoardo Santimaria, Iacopo Colonnelli, Massimo Torquati, Marco Aldinucci

CAPIO: Cross Application Programamble IO Miscellaneous

2024.

Abstract | Links | BibTeX | Tags: across, admire, capio, capiocl, eupex, icsc

Gianluca Mittone

Benchmarking HPC Performance for State-of-the-Art AI Workloads Miscellaneous

2024.

Abstract | Links | BibTeX | Tags: ai, eupilot, icsc

@misc{24:mittone:itadata:shpcpee,

title = {Benchmarking HPC Performance for State-of-the-Art AI Workloads},

author = {Gianluca Mittone},

url = {https://datacloud.di.unito.it/index.php/s/5Ep3W7cPW5baZfr},

year  = {2024},

date = {2024-09-01},

address = {Pisa, Italy},

abstract = {Benchmarking the performance of modern High-Performance Computing (HPC) infrastructure on Artificial Intelligence (AI) workloads is a hot topic in the supercomputing community. While research communities and big-tech companies actively invest in larger, more powerful data centres to support AI research, the standard computational performance benchmarking tools (e.g., LINPACK) are increasingly becoming outdated since they are not specifically tailored for AI workloads. Some tools, such as MLPerf, are trying to bridge this gap, but the HPC community still has not adopted them as standards. Since this trend became particularly evident with the advent of Large Language Models (LLMs), this work will delve into LLM training at scale as a way to benchmark Top500 HPC infrastructures on current AI workloads. The scalability performances of a major LLM model (i.e., Meta's LLaMA) on different HPCs (Leonardo, LUMI, MeluXina, Karolina) are exposed and discussed along with their Top500 positioning. 

However, it should be noted that state-of-the-art LLM models are not trained on thousands of computing nodes but on hundreds. This choice is due to multiple factors, such as the influence of the training scaling on the model's convergence and the instability of large-scale deployments due to hardware/software failure. A benchmarking approach based on the next-generation LLM training approach is proposed to bypass all these issues. State-of-the-art LLMs are not monolithic structures but Mixture-of-Experts (MoE) models; this design implies innovative frontiers for the distributed training of such models due to the experts' training being potentially more parallelisable than a single monolithic model. We thus propose to create an AI-oriented HPC benchmark suite based on the parallel training of MoE models to measure the throughput performance of HPC systems on state-of-the-art AI workloads.},

keywords = {ai, eupilot, icsc},

pubstate = {published},

tppubtype = {misc}

}

Gianluca Mittone

Pushing Federated Learning Boundaries: Three Innovative Distributed Intelligence Approaches Miscellaneous

2024.

Abstract | Links | BibTeX | Tags: eupilot, fl, icsc

Samuele Fonio Bruno Casella Oussama Harrak

Federated Adaboost for Survival Analysis Miscellaneous

European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2nd Workshop on Advancements in Federated Learning (WAFL), 2024.

Abstract | Links | BibTeX | Tags: ai, epi, fl, icsc

Alberto Mulone, Doriana Medić, Marco Aldinucci

A Fault Tolerance mechanism for Hybrid Scientific Workflows Miscellaneous

1st workshop about High-Performance e-Science (HiPES), 2024.

Abstract | Links | BibTeX | Tags: eupex, icsc, streamflow

Iacopo Colonnelli

Scientific Workflows in the Continuum Era Miscellaneous

2024, (Keynote Talk).

Abstract | Links | BibTeX | Tags: icsc

Giulio Malenza

Preliminary analysis of model parallelism applications on a 64-core RV64 Server CPU Miscellaneous

2024.

Abstract | Links | BibTeX | Tags: eupilot, icsc

Marco Edoardo Santimaria

CAPIO-CL: Cross Application Programmable IO - Coordination Language Miscellaneous

2024.

Abstract | Links | BibTeX | Tags: across, admire, capio, eupex, icsc

Lorenzo Brescia

Towards Secure WMS with TEEs Miscellaneous

Mentoring session: HPC Summer School, university of Trento, 2024.

Links | BibTeX | Tags: confidential, icsc

Iacopo Colonnelli, Robert Birke, Giulio Malenza, Gianluca Mittone, Alberto Mulone, Marco Aldinucci

Cross-Facility Federated Learning - Part II Miscellaneous

2024, (Invited talk).

Links | BibTeX | Tags: eupex, icsc, space

Marco Aldinucci

From HPC4AI to Software & Integration living lab to innovation Miscellaneous

HPC as an enabling platform for AI event, 2024.

Abstract | Links | BibTeX | Tags: ai, HPC, icsc

Marco Aldinucci

Software & Integration lab of FutureHPC spoke Miscellaneous

HPC as an enabling platform for AI event, 2024.

Abstract | Links | BibTeX | Tags: ai, HPC, icsc

Iacopo Colonnelli

Dynamic hybrid workflows for Deep Learning on HPC infrastructure Miscellaneous

2024.

Abstract | Links | BibTeX | Tags: icsc, jupyter-workflow, streamflow

Gianluca Mittone

Into to Federated Learning Miscellaneous

2024.

Abstract | Links | BibTeX | Tags: fl, icsc

@misc{24:mittone:ictp,

title = {Into to Federated Learning},

author = {Gianluca Mittone},

url = {https://datacloud.di.unito.it/index.php/s/nSwwmedjqe2jbWJ},

year  = {2024},

date = {2024-05-01},

address = {Trieste, Italy},

abstract = {Machine Learning (ML) is the branch of Artificial Intelligence focused on developing algorithms capable of adapting and improving their predictive or generative performance by feeding on data. Adapting or improving the system’s behaviour based on the provided data is called learning since it is similar to the human learning process in many aspects. The same ML algorithm, usually referred to as a model, trained on different data will thus expose different capabilities and can, therefore, solve different tasks. FL is a relatively recent distributed ML methodology aiming to bridge the gap between the need to train ever bigger ML models on ever larger datasets and the individual and companies’ will to protect and not share their private data. From another point of view, FL is also a way to distribute the training of an ML model even more than before. However, it should be considered that the learning performance of FL is usually lower than that of traditional centralised learning. This course will start from Kairouz ad McMahan’s definition of FL: ”Federated learning is a machine learning setting where multiple entities (clients) collaborate in solving a machine learning problem, under the coordination of a central server or service provider. Each client’s raw data is stored locally and not exchanged or transferred; instead, focused updates intended for immediate aggregation are used to achieve the learning objective.” From this starting point, the most significant aspects of FL will be exposed and discussed. This tutorial will particularly explore FL from both the learning and computational [5] performance perspectives, investigating its pros and cons in a distributed ML setting. Since FL natively targets data privacy, some insights on how the FL process can be attacked and protected will also be discussed from a high-level perspective. Finally, a hands-on session will guide the participants in building a basic FL system, providing a better understanding of the major implementational difficulties of such a technique.},

keywords = {fl, icsc},

pubstate = {published},

tppubtype = {misc}

}

Machine Learning (ML) is the branch of Artificial Intelligence focused on developing algorithms capable of adapting and improving their predictive or generative performance by feeding on data. Adapting or improving the system’s behaviour based on the provided data is called learning since it is similar to the human learning process in many aspects. The same ML algorithm, usually referred to as a model, trained on different data will thus expose different capabilities and can, therefore, solve different tasks. FL is a relatively recent distributed ML methodology aiming to bridge the gap between the need to train ever bigger ML models on ever larger datasets and the individual and companies’ will to protect and not share their private data. From another point of view, FL is also a way to distribute the training of an ML model even more than before. However, it should be considered that the learning performance of FL is usually lower than that of traditional centralised learning. This course will start from Kairouz ad McMahan’s definition of FL: ”Federated learning is a machine learning setting where multiple entities (clients) collaborate in solving a machine learning problem, under the coordination of a central server or service provider. Each client’s raw data is stored locally and not exchanged or transferred; instead, focused updates intended for immediate aggregation are used to achieve the learning objective.” From this starting point, the most significant aspects of FL will be exposed and discussed. This tutorial will particularly explore FL from both the learning and computational [5] performance perspectives, investigating its pros and cons in a distributed ML setting. Since FL natively targets data privacy, some insights on how the FL process can be attacked and protected will also be discussed from a high-level perspective. Finally, a hands-on session will guide the participants in building a basic FL system, providing a better understanding of the major implementational difficulties of such a technique.

Iacopo Colonnelli

CWL Working Groups Miscellaneous

2024 CWL Conference, 2024.

Abstract | Links | BibTeX | Tags: icsc

Lorenzo Brescia

Secure Generic Remote Workflow Execution with TEEs Miscellaneous

Proceedings of the 2nd Workshop on Workflows in Distributed Environments, 2024.

Abstract | Links | BibTeX | Tags: confidential, icsc

Alberto Mulone

Workflows for future High-Performance Computing Miscellaneous

COMETE PhD Workshop, 2024.

Links | BibTeX | Tags: icsc

Iacopo Colonnelli

CWL in the HPC Ecosystem Miscellaneous

Workshop on workflow languages for HEP analysis, 2024.

Links | BibTeX | Tags: across, eupex, icsc, space, streamflow

Giulio Malenza, Marco Edoardo Santimaria

Benchmarking Parallelization Models through Karmarkar`s algorithm Miscellaneous

2024.

Abstract | Links | BibTeX | Tags: HPC, icsc

Robert Birke

FLaaS: Federated Learning as a Service Miscellaneous

ICSC - Spoke 1 meeting, 2024.

Abstract | Links | BibTeX | Tags: ai, icsc

Alberto Mulone

Cross-Platform Full Waveform Inversion Miscellaneous

ICSC - Spoke 1 meeting, 2024.

Abstract | Links | BibTeX | Tags: icsc, streamflow

Gianluca Mittone

RISC-V for AI Miscellaneous

High Performance, Edge And Cloud computing Conference 2024 (HiPEAC 2024), 2024.

Abstract | Links | BibTeX | Tags: eupilot, icsc

2023

Lorenzo Brescia, Iacopo Colonnelli

Trusted Computing at Scale Miscellaneous

CN HPC Flagship 4 Working Day, 2023.

Links | BibTeX | Tags: confidential, icsc

Marco Aldinucci

Federated Learning: A Distributed System Viewpoint Miscellaneous

Bicocca University seminars, Milan, Italy, 2023, (Invited talk).

Abstract | Links | BibTeX | Tags: eupilot, icsc, textarossa

Gianluca Mittone, Giulio Malenza, Marco Aldinucci, Robert Birke

Distributed Edge Inference: an Experimental Study on Multiview Detection Miscellaneous

The 16th IEEE/ACM International Conference on Utility and Cloud Computing (UCC 2023), 2023.

Abstract | Links | BibTeX | Tags: ai, eupilot, icsc

Marco Aldinucci, Elena Baralis, Valeria Cardellini, Iacopo Colonnelli, Marco Danelutto, Sergio Decherchi, Giuseppe Di Modica, Luca Ferrucci, Marco Gribaudo, Francesco Iannone, Marco Lapegna, Doriana Medić, Giuseppa Muscianisi, Francesca Righetti, Eva Sciacca, Nicola Tonellotto, Mauro Tortonesi, Paolo Trunfio, Tullio Vardanega

A Systematic Mapping Study of Italian Research on Workflows Miscellaneous

18th Workshop on Workflows in Support of Large-Scale Science (WORKS 2023), 2023.

Abstract | Links | BibTeX | Tags: icsc

Iacopo Colonnelli, Doriana Medić, Barbara Cantalupo, Marco Aldinucci

Università degli Studi di Torino: Alpha parallel research group Miscellaneous

HaMMon Kick-Off meeting, 2023.

Links | BibTeX | Tags: icsc, streamflow

Giulio Malenza, Valentina Cesare, Marco Aldinucci

Performance portability in HPC: the Gaia use-case. Miscellaneous

2nd Italian Conference on Big Data and Data Science (ITADATA 2023), 2023.

Links | BibTeX | Tags: icsc

Samuele Fonio

Benchmarking Federated Learning Frameworks for Medical Imaging Tasks Miscellaneous

Image Analysis and Processing - ICIAP 2023 - 22th International Conference - FedMed, 2023.

Abstract | Links | BibTeX | Tags: ai, eupilot, fl, icsc

Gianluca Mittone, Samuele Fonio

Benchmarking Federated Learning Scalability Miscellaneous

2nd Italian Conference on Big Data and Data Science (ITADATA 2023), 2023.

Abstract | Links | BibTeX | Tags: ai, eupilot, fl, icsc

Gianluca Mittone, Walter Riviera, Iacopo Colonnelli, Robert Birke, Marco Aldinucci

Model-Agnostic Federated Learning Miscellaneous

29th International European Conference on Parallel and Distributed Computing (Euro-Par '23), 2023.

Abstract | Links | BibTeX | Tags: ai, eupilot, icsc

Gianluca Mittone, Robert Birke, Marco Aldinucci

Model-Agnostic Federated Learning Miscellaneous

29th International European Conference on Parallel and Distributed Computing (Euro-Par '23), 2023.

Abstract | Links | BibTeX | Tags: eupilot, icsc

Giulio Malenza

Building an accelerated OpenFOAM Proof-of-Concept application using Modern C++. Miscellaneous

18th OpenFOAM Workshop 2023, Genova, 2023.

Links | BibTeX | Tags: icsc

Alberto Mulone, Sherine Awad, Davide Chiarugi, Marco Aldinucci

Porting the Variant Calling Pipeline for NGS data in cloud-HPC environment Miscellaneous

47th IEEE Annual Computers, Software, and Applications Conference, COMPSAC 2023, 2023.

Abstract | Links | BibTeX | Tags: across, icsc, streamflow

Gianluca Mittone, Nicolò Tonci, Robert Birke, Iacopo Colonnelli, Doriana Medić, Andrea Bartolini, Roberto Esposito, Emanuele Parisi, Francesco Beneventi, Mirko Polato, Massimo Torquati, Luca Benini, Marco Aldinucci

Experimenting with Emerging RISC-V Systems for Decentralised Machine Learning Miscellaneous

20th ACM international conference on computing frontiers (CF '23), 2023, (Invited talk).

Abstract | Links | BibTeX | Tags: ai, eupilot, icsc

Gianluca Mittone, Filip Svoboda, Marco Aldinucci, Nicholas D. Lane, Pietro Lio'

A Federated Learning Benchmark for Drug-Target Interaction Miscellaneous

2023 ACM international Web Conference (WWW '23), 2023, (Invited talk).

Abstract | Links | BibTeX | Tags: eupilot, icsc

Bruno Casella, Samuele Fonio

Architecture-Based FedAvg for Vertical Federated Learning Miscellaneous

2023.

Abstract | Links | BibTeX | Tags: ai, epi, fl, icsc

@misc{23:casella:architecturalfedavgtalk,

title = {Architecture-Based FedAvg for Vertical Federated Learning},

author = {Bruno Casella and Samuele Fonio},

url = {https://datacloud.di.unito.it/index.php/s/kJQxnqG4d2ZSicK},

year  = {2023},

date = {2023-01-01},

booktitle = {Proceedings of the 3rd Workshop on Distributed Machine Learning for the Intelligent Computing Continuum (DML-ICC), IEEE/ACM UCC 2023, Taormina, Italy, 4 December 2023},

abstract = {Federated Learning (FL) has emerged as a promising solution to address privacy concerns by collaboratively training Deep Learning (DL) models across distributed parties. This work proposes an architecture-based aggregation strategy in Vertical FL, where parties hold data with different attributes but shared instances. Our approach leverages the identical architectural parts, i.e. neural network layers, of different models to selectively aggregate weights, which is particularly relevant when collaborating with institutions holding different types of datasets, i.e., image, text, or tabular datasets. In a scenario where two entities train DL models, such as a Convolutional Neural Network (CNN) and a Multi-Layer Perceptron (MLP), our strategy computes the average only for architecturally identical segments. This preserves data-specific features learned from demographic and clinical data. We tested our approach on two clinical datasets, i.e., the COVID-CXR dataset and the ADNI study. Results show that our method achieves comparable results with the centralized scenario, in which all the data are collected in a single data lake, and benefits from FL generalizability. In particular, compared to the non-federated models, our proposed proof-of-concept model exhibits a slight performance loss on the COVID-CXR dataset (less than 8%), but outperforms ADNI models by up to 12%. Moreover, communication costs between training rounds are minimized by exchanging only the dense layer parameters.},

keywords = {ai, epi, fl, icsc},

pubstate = {published},

tppubtype = {misc}

}