Papers | Parallel Computing

2024

Bruno Casella, Iacopo Colonnelli, Gianluca Mittone, Robert Birke, Walter Riviera, Antonio Sciarappa, Carlo Cavazzoni, Marco Aldinucci

A Performance Model for Confidential Federated Learning Proceedings Article

In: Proceedings of the 2024 Deep Learning Security and Privacy Workshop, IEEE Symposium on Security and Privacy 2024, San Francisco, CA, 2024.

Abstract | Links | BibTeX | Tags: confidential, epi, icsc

Bruno Casella, Walter Riviera, Marco Aldinucci, Gloria Menegaz

Protocol for training MERGE: A federated multi-input neural network for COVID-19 prognosis Journal Article

In: STAR Protocols, 2024, (https://prod-shared-star-protocols.s3.amazonaws.com/protocols/3225.pdf).

Abstract | Links | BibTeX | Tags: confidential, epi, icsc

2023

Samuele Fonio

Benchmarking Federated Learning Frameworks for Medical Imaging Tasks Proceedings Article

In: Image Analysis and Processing - ICIAP 2023 - 22th International Conference - FedMed, Springer LNCS, Udine, Italy, 2023, (In print).

Abstract | Links | BibTeX | Tags: ai, confidential, eupilot, icsc

Gianluca Mittone, Samuele Fonio

Benchmarking Federated Learning Scalability Proceedings Article

In: Proceedings of the 2nd Italian Conference on Big Data and Data Science, ITADATA 2023, September 11-13, 2023, CEUR, Naples, Italy, 2023.

Abstract | Links | BibTeX | Tags: confidential, eupilot, HPC, icsc

Gianluca Mittone, Walter Riviera, Iacopo Colonnelli, Robert Birke, Marco Aldinucci

Model-Agnostic Federated Learning Proceedings Article

In: Euro-Par 2023: Parallel Processing, pp. 383–396, Springer, Limassol, Cyprus, 2023.

Abstract | Links | BibTeX | Tags: confidential, eupilot, icsc, riscv

Gianluca Mittone, Nicolò Tonci, Robert Birke, Iacopo Colonnelli, Doriana Medić, Andrea Bartolini, Roberto Esposito, Emanuele Parisi, Francesco Beneventi, Mirko Polato, Massimo Torquati, Luca Benini, Marco Aldinucci

Experimenting with Emerging RISC-V Systems for Decentralised Machine Learning Proceedings Article

In: 20th ACM International Conference on Computing Frontiers (CF '23), ACM, Bologna, Italy, 2023, ISBN: 979-8-4007-0140-5/23/05, (https://arxiv.org/abs/2302.07946).

Abstract | Links | BibTeX | Tags: confidential, eupilot, HPC, icsc, riscv

Gianluca Mittone, Filip Svoboda, Marco Aldinucci, Nicholas D. Lane, Pietro Lio

A Federated Learning Benchmark for Drug-Target Interaction Proceedings Article

In: Companion Proceedings of the ACM Web Conference 2023 (WWW '23 Companion), ACM, Austin, Texas, 2023, ISBN: 978-1-4503-9419-2/23/04, (https://arxiv.org/abs/2302.07684).

Abstract | Links | BibTeX | Tags: confidential, eupilot, icsc

Matteo Pennisi, Federica Proietto Salanitri, Giovanni Bellitto, Bruno Casella, Marco Aldinucci, Simone Palazzo, Concetto Spampinato

Experience Replay as an Effective Strategy for Optimizing Decentralized Federated Learning Proceedings Article

In: Proceedings of the 1st Workshop on Visual Continual Learning, ICCV 2023, Paris, France, 2 October 2023, 2023, (https://ieeexplore.ieee.org/document/10350429).

Abstract | Links | BibTeX | Tags: ai, confidential

@inproceedings{23:casella:ERGANs,

title = {Experience Replay as an Effective Strategy for Optimizing Decentralized Federated Learning},

author = {Matteo Pennisi and Federica Proietto Salanitri and Giovanni Bellitto and Bruno Casella and Marco Aldinucci and Simone Palazzo and Concetto Spampinato},

url = {https://openaccess.thecvf.com/content/ICCV2023W/VCL/papers/Pennisi_Experience_Replay_as_an_Effective_Strategy_for_Optimizing_Decentralized_Federated_ICCVW_2023_paper.pdf},

doi = {10.1109/ICCVW60793.2023.00362},

year  = {2023},

date = {2023-01-01},

booktitle = {Proceedings of the 1st Workshop on Visual Continual Learning, ICCV 2023, Paris, France, 2 October 2023},

abstract = {Federated and continual learning are training paradigms addressing data distribution shift in space and time. More specifically, federated learning tackles non-i.i.d data in space as information is distributed in multiple nodes, while continual learning faces with temporal aspect of training as it deals with continuous streams of data. Distribution shifts over space and time is what it happens in real federated learning scenarios that show multiple challenges. First, the federated model needs to learn sequentially while retaining knowledge from the past training rounds. Second, the model has also to deal with concept drift from the distributed data distributions. To address these complexities, we attempt to combine continual and federated learning strategies by proposing a solution inspired by experience replay and generative adversarial concepts for supporting decentralized distributed training. In particular, our approach relies on using limited memory buffers of synthetic privacy-preserving samples and interleaving training on local data and on buffer data. By translating the CL formulation into the task of integrating distributed knowledge with local knowledge, our method enables models to effectively integrate learned representation from local nodes, providing models the capability to generalize across multiple datasets.We test our integrated strategy on two realistic medical image analysis tasks — tuberculosis and melanoma classification — using multiple datasets in order to simulate realistic non-i.i.d. medical data scenarios. Results show that our approach achieves performance comparable to standard (non-federated) learning and significantly outperforms state-of-the-art federated methods in their centralized (thus, more favourable) formulation.},

note = {https://ieeexplore.ieee.org/document/10350429},

keywords = {ai, confidential},

pubstate = {published},

tppubtype = {inproceedings}

}

Matteo Pennisi, Federica Proietto Salanitri, Giovanni Bellitto, Bruno Casella, Marco Aldinucci, Simone Palazzo, Concetto Spampinato

FedER: Federated Learning through Experience Replay and Privacy-Preserving Data Synthesis Journal Article

In: Computer Vision and Image Understanding, vol. 238, pp. 103882, 2023.

Abstract | Links | BibTeX | Tags: ai, confidential

@article{23:casella:FedER,

title = {FedER: Federated Learning through Experience Replay and Privacy-Preserving Data Synthesis},

author = {Matteo Pennisi and Federica Proietto Salanitri and Giovanni Bellitto and Bruno Casella and Marco Aldinucci and Simone Palazzo and Concetto Spampinato},

url = {https://www.sciencedirect.com/science/article/pii/S107731422300262X?via%3Dihub},

doi = {10.1016/j.cviu.2023.103882},

year  = {2023},

date = {2023-01-01},

journal = {Computer Vision and Image Understanding},

volume = {238},

pages = {103882},

institution = {Computer Science Department, University of Torino},

abstract = {In the medical field, multi-center collaborations are often sought to yield more generalizable findings by leveraging the heterogeneity of patient and clinical data. However, recent privacy regulations hinder the possibility to share data, and consequently, to come up with machine learning-based solutions that support diagnosis and prognosis. Federated learning (FL) aims at sidestepping this limitation by bringing AI-based solutions to data owners and only sharing local AI models, or parts thereof, that need then to be aggregated. However, most of the existing federated learning solutions are still at their infancy and show several shortcomings, from the lack of a reliable and effective aggregation scheme able to retain the knowledge learned locally to weak privacy preservation as real data may be reconstructed from model updates. Furthermore, the majority of these approaches, especially those dealing with medical data, relies on a centralized distributed learning strategy that poses robustness, scalability and trust issues. In this paper we present a federated and decentralized learning strategy, FedER, that, exploiting experience replay and generative adversarial concepts, effectively integrates features from local nodes, providing models able to generalize across multiple datasets while maintaining privacy. FedER is tested on two tasks — tuberculosis and melanoma classification — using multiple datasets in order to simulate realistic non-i.i.d. medical data scenarios. Results show that our approach achieves performance comparable to standard (non-federated) learning and significantly outperforms state-of-the-art federated methods in their centralized (thus, more favourable) formulation. Code is available at https://github.com/perceivelab/FedER},

keywords = {ai, confidential},

pubstate = {published},

tppubtype = {article}

}

Bruno Casella, Walter Riviera, Marco Aldinucci, Gloria Menegaz

MERGE: A model for multi-input biomedical federated learning Journal Article

In: Patterns, pp. 100856, 2023, ISSN: 2666-3899.

Abstract | Links | BibTeX | Tags: ai, confidential, epi, icsc

Bruno Casella, Roberto Esposito, Antonio Sciarappa, Carlo Cavazzoni, Marco Aldinucci

Experimenting with Normalization Layers in Federated Learning on non-IID scenarios Technical Report

Computer Science Department, University of Torino 2023.

Abstract | Links | BibTeX | Tags: confidential, epi, icsc

Yasir Arfat, Gianluca Mittone, Iacopo Colonnelli, Fabrizio D'Ascenzo, Roberto Esposito, Marco Aldinucci

Pooling critical datasets with Federated Learning Proceedings Article

In: 31st Euromicro International Conference on Parallel, Distributed and Network-Based Processing, PDP 2023, pp. 329–337, IEEE, Napoli, Italy, 2023.

Abstract | Links | BibTeX | Tags: admire, confidential, hpc4ai

2022

Bruno Casella, Roberto Esposito, Carlo Cavazzoni, Marco Aldinucci

Benchmarking FedAvg and FedCurv for Image Classification Tasks Proceedings Article

In: Anisetti, Marco, Bonifati, Angela, Bena, Nicola, Ardagna, Claudio, Malerba, Donato (Ed.): Proceedings of the 1st Italian Conference on Big Data and Data Science, ITADATA 2022, September 20-21, 2022, CEUR-WS.org, 2022.

Abstract | Links | BibTeX | Tags: confidential, eupilot

@inproceedings{casella2022benchmarking,

title = {Benchmarking FedAvg and FedCurv for Image Classification Tasks},

author = {Bruno Casella and Roberto Esposito and Carlo Cavazzoni and Marco Aldinucci},

editor = {Marco Anisetti and Angela Bonifati and Nicola Bena and Claudio Ardagna and Donato Malerba},

url = {https://ceur-ws.org/Vol-3340/paper40.pdf},

year  = {2022},

date = {2022-01-01},

booktitle = {Proceedings of the 1st Italian Conference on Big Data and Data Science, ITADATA 2022, September 20-21, 2022},

volume = {3340},

publisher = {CEUR-WS.org},

series = {CEUR Workshop Proceedings},

abstract = {Classic Machine Learning (ML) techniques require training on data available in a single data lake (either centralized or distributed). However, aggregating data from different owners is not always convenient for different reasons, including security, privacy and secrecy. Data carry a value that might vanish when shared with others; the ability to avoid sharing the data enables industrial applications where security and privacy are of paramount importance, making it possible to train global models by implementing only local policies which can be run independently and even on air-gapped data centres. Federated Learning (FL) is a distributed machine learning approach which has emerged as an effective way to address privacy concerns by only sharing local AI models while keeping the data decentralized. Two critical challenges of Federated Learning are managing the heterogeneous systems in the same federated network and dealing with real data, which are often not independently and identically distributed (non-IID) among the clients. In this paper, we focus on the second problem, i.e., the problem of statistical heterogeneity of the data in the same federated network. In this setting, local models might be strayed far from the local optimum of the complete dataset, thus possibly hindering the convergence of the federated model. Several Federated Learning algorithms, such as FedAvg, FedProx and Federated Curvature (FedCurv), aiming at tackling the non-IID setting, have already been proposed. This work provides an empirical assessment of the behaviour of FedAvg and FedCurv in common non-IID scenarios. Results show that the number of epochs per round is an important hyper-parameter that, when tuned appropriately, can lead to significant performance gains while reducing the communication cost. As a side product of this work, we release the non-IID version of the datasets we used so to facilitate further comparisons from the FL community.},

keywords = {confidential, eupilot},

pubstate = {published},

tppubtype = {inproceedings}

}

Bruno Casella, Alessio Chisari, Sebastiano Battiato, Mario Giuffrida.

Transfer Learning via Test-time Neural Networks Aggregation Proceedings Article

In: Farinella, Giovanni Maria, Radeva, Petia, Bouatouch, Kadi (Ed.): Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, VISIGRAPP 2022, Volume 5: VISAPP, Online Streaming, February 6-8, 2022, pp. 642–649, INSTICC SciTePress, 2022, ISBN: 978-989-758-555-5.

Abstract | Links | BibTeX | Tags: confidential

@inproceedings{22:VISAPP:transferlearning,

title = {Transfer Learning via Test-time Neural Networks Aggregation},

author = {Bruno Casella and Alessio Chisari and Sebastiano Battiato and Mario Giuffrida.},

editor = {Giovanni Maria Farinella and Petia Radeva and Kadi Bouatouch},

url = {https://iris.unito.it/retrieve/handle/2318/1844159/947123/TRANSFER_LEARNING_VIA_TEST_TIME_NEURAL_NETWORKS_AGGREGATION.pdf},

doi = {10.5220/0010907900003124},

isbn = {978-989-758-555-5},

year  = {2022},

date = {2022-01-01},

booktitle = {Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, VISIGRAPP 2022, Volume 5: VISAPP, Online Streaming, February 6-8, 2022},

pages = {642–649},

publisher = {SciTePress},

organization = {INSTICC},

abstract = {It has been demonstrated that deep neural networks outperform traditional machine learning. However, deep networks lack generalisability, that is, they will not perform as good as in a new (testing) set drawn from a different distribution due to the domain shift. In order to tackle this known issue, several transfer learning approaches have been proposed, where the knowledge of a trained model is transferred into another to improve performance with different data. However, most of these approaches require additional training steps, or they suffer from catastrophic forgetting that occurs when a trained model has overwritten previously learnt knowledge. We address both problems with a novel transfer learning approach that uses network aggregation. We train dataset-specific networks together with an aggregation network in a unified framework. The loss function includes two main components: a task-specific loss (such as cross-entropy) and an aggregation loss. The proposed aggregation loss allows our model to learn how trained deep network parameters can be aggregated with an aggregation operator. We demonstrate that the proposed approach learns model aggregation at test time without any further training step, reducing the burden of transfer learning to a simple arithmetical operation. The proposed approach achieves comparable performance w.r.t. the baseline. Besides, if the aggregation operator has an inverse, we will show that our model also inherently allows for selective forgetting, i.e., the aggregated model can forget one of the datasets it was trained on, retaining information on the others.},

keywords = {confidential},

pubstate = {published},

tppubtype = {inproceedings}

}

WE ARE HIRING! If you are Research Engineers, Ph.D. Candidates and Post-Doctoral Researchers send your CV to alpha@di.unito.it

WE ARE HIRING! If you are Research Engineers, Ph.D. Candidates and Post-Doctoral Researchers send your CV to alpha@di.unito.it

Papers | Parallel Computing

2024

2023

2022