Papers | Parallel Computing

2024

Alessia Antelmi, Massimo Torquati, Giacomo Corridori, Daniele Gregori, Francesco Polzella, Gianmarco Spinatelli, Marco Aldinucci

Analyzing FOSS license usage in publicly available software at scale via the SWH-analytics framework Journal Article

In: The Journal of Supercomputing, vol. 80, no. 11, pp. 15799-15833, 2024, ISSN: 1573-0484.

Abstract | Links | BibTeX | Tags: analytics, icsc

Miruna Bețianu, Abele Mălan, Marco Aldinucci, Robert Birke, Lydia Chen

DALLMi: Domain Adaption for LLM-based Multi-label Classifier Proceedings Article

In: Yang, De-Nian, Xie, Xing, Tseng, Vincent S., Pei, Jian, Huang, Jen-Wei, Lin, Jerry Chun-Wei (Ed.): Proceedings of the 28th Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 277–289, Springer, Taipei, Taiwan, 2024.

Abstract | Links | BibTeX | Tags: ai, eupilot, icsc

@inproceedings{24:betianu:llm,

title = {DALLMi: Domain Adaption for LLM-based Multi-label Classifier},

author = {Miruna Bețianu and Abele Mălan and Marco Aldinucci and Robert Birke and Lydia Chen},

editor = {De-Nian Yang and Xing Xie and Vincent S. Tseng and Jian Pei and Jen-Wei Huang and Jerry Chun-Wei Lin},

url = {https://hdl.handle.net/2318/1976672},

doi = {10.1007/978-981-97-2259-4_21},

year  = {2024},

date = {2024-05-01},

booktitle = {Proceedings of the 28th Pacific-Asia Conference on Knowledge Discovery and Data Mining},

volume = {14647},

pages = {277–289},

publisher = {Springer},

address = {Taipei, Taiwan},

series = {Lecture Notes in Computer Science},

abstract = {Large language models (LLMs) increasingly serve as the backbone for classifying text associated with distinct domains and simultaneously several labels (classes). When encountering domain shifts, e.g., classifier of movie reviews from IMDb to Rotten Tomatoes, adapting such an LLM-based multi-label classifier is challenging due to incomplete label sets at the target domain and daunting training overhead. The existing domain adaptation methods address either image multi-label classifiers or text binary classifiers. In this paper, we design DALLMi, Domain Adaptation Large Language Model interpolator, a first-of-its-kind semi-supervised domain adaptation method for text data models based on LLMs, specifically BERT. The core of DALLMi is the novel variation loss and MixUp regularization, which jointly leverage the limited positively labeled and large quantity of unlabeled text and, importantly, their interpolation from the BERT word embeddings. DALLMi also introduces a label-balanced sampling strategy to overcome the imbalance between labeled and unlabeled data. We evaluate DALLMi against the partial-supervised and unsupervised approach on three datasets under different scenarios of label availability for the target domain. Our results show that DALLMi achieves higher mAP than unsupervised and partially-supervised approaches by 19.9% and 52.2%, respectively.},

keywords = {ai, eupilot, icsc},

pubstate = {published},

tppubtype = {inproceedings}

}

Chi Hong, Robert Birke, Pin-Yu Chen, Lydia Chen

On Dark Knowledge for Distilling Generators Proceedings Article

Abstract | Links | BibTeX | Tags: ai, epi, icsc

@inproceedings{24:chen:llm,

title = {On Dark Knowledge for Distilling Generators},

author = {Chi Hong and Robert Birke and Pin-Yu Chen and Lydia Chen},

editor = {De-Nian Yang and Xing Xie and Vincent S. Tseng and Jian Pei and Jen-Wei Huang and Jerry Chun-Wei Lin},

url = {https://hdl.handle.net/2318/1976671},

doi = {10.1007/978-981-97-2253-2_19},

year  = {2024},

date = {2024-05-01},

booktitle = {Proceedings of the 28th Pacific-Asia Conference on Knowledge Discovery and Data Mining},

volume = {14646},

pages = {235–247},

publisher = {Springer},

address = {Taipei, Taiwan},

series = {Lecture Notes in Computer Science},

abstract = {Knowledge distillation has been applied on generative models, such as Variational Autoencoder (VAE) and Generative Adversarial Networks (GANs). To distill the knowledge, the synthetic outputs of a teacher generator are used to train a student model. While the dark knowledge, i.e., the probabilistic output, is well explored in distilling classifiers, little is known about the existence of an equivalent dark knowledge for generative models and its extractability. In this paper, we derive the first kind of empirical risk bound for distilling generative models from a Bayesian perspective. Through our analysis, we show the existence of the dark knowledge for generative models, i.e., Bayes probability distribution of a synthetic output from a given input, which achieves lower empirical risk bound than merely using the synthetic output of the generators. Furthermore, we propose a Dark Knowledge based Distillation , DKtill, which trains the student generator based on the (approximate) dark knowledge. Our extensive evaluation on distilling VAE, conditional GANs, and translation GANs on Facades and CelebA datasets show that the FID of student generators trained by DKtill combining dark knowledge are lower than student generators trained only by the synthetic outputs by up to 42.66%, and 78.99%, respectively.},

keywords = {ai, epi, icsc},

pubstate = {published},

tppubtype = {inproceedings}

}

Bruno Casella, Iacopo Colonnelli, Gianluca Mittone, Robert Birke, Walter Riviera, Antonio Sciarappa, Carlo Cavazzoni, Marco Aldinucci

A Performance Analysis for Confidential Federated Learning Proceedings Article

In: Proceedings of the 2024 Deep Learning Security and Privacy Workshop, IEEE Symposium on Security and Privacy 2024, San Francisco, CA, 2024.

Abstract | Links | BibTeX | Tags: confidential, epi, icsc

@inproceedings{24:casella:sgx,

title = {A Performance Analysis for Confidential Federated Learning},

author = {Bruno Casella and Iacopo Colonnelli and Gianluca Mittone and Robert Birke and Walter Riviera and Antonio Sciarappa and Carlo Cavazzoni and Marco Aldinucci},

url = {https://iris.unito.it/retrieve/b5877a97-2d8d-4e95-8791-0aa4a1b953b3/DLSP___CONFIDENTIAL_FL.pdf},

doi = {10.1109/SPW63631.2024.00009},

year  = {2024},

date = {2024-05-01},

booktitle = {Proceedings of the 2024 Deep Learning Security and Privacy Workshop, IEEE Symposium on Security and Privacy 2024},

address = {San Francisco, CA},

abstract = {Federated Learning (FL) has emerged as a solution to preserve data privacy by keeping the data locally on each participant's device. However, FL alone is still vulnerable to attacks that can cause privacy leaks. Therefore, it becomes necessary to take additional security measures at the cost of increasing runtimes. The Trusted Execution Environment (TEE) approach promises to offer the highest degree of security during execution. However, TEEs suffer from memory limits which prevent safe end-to-end FL training of modern deep models. State-of- the-art approaches limit secure training to selected layers, failing to avert the full spectrum of attacks or adopt layer-wise training affecting model performance. We benchmark the usage of a library OS (LibOS) to run the full, unmodified end-to-end FL training inside the TEE. We extensively evaluate and model the overhead of the different security mechanisms needed to protect the data and model during computation (TEE), communication (TLS), and storage (disk encryption). The obtained results across three datasets and two models demonstrate that LibOSes are a viable way to seamlessly inject security into FL with limited overhead (at most 2x), offering valuable guidance for researchers and developers aiming to apply FL in data-security-focused contexts.},

keywords = {confidential, epi, icsc},

pubstate = {published},

tppubtype = {inproceedings}

}

Giulio Malenza, Valentina Cesare, Marco Aldinucci, Ugo Becciani, Alberto Vecchiato

Toward HPC application portability via C++ PSTL: the Gaia AVU-GSR code assessment Journal Article

In: The Journal of Supercomputing, 2024, ISSN: 09208542.

Abstract | Links | BibTeX | Tags: eupex, HPC, icsc

Marco Edoardo Santimaria, Samuele Fonio, Giulio Malenza, Iacopo Colonnelli, Marco Aldinucci

Benchmarking Parallelization Models through Karmarkar Interior-point method Proceedings Article

In: Chis, Horacio González-Vélez Adriana E. (Ed.): 2024 32nd Euromicro International Conference on Parallel, Distributed and Network-based Processing, pp. 1–8, IEEE, Dublin, Ireland, 2024, ISSN: 2377-5750.

Abstract | Links | BibTeX | Tags: HPC, icsc

Bruno Casella, Roberto Esposito, Antonio Sciarappa, Carlo Cavazzoni, Marco Aldinucci

Experimenting With Normalization Layers in Federated Learning on Non-IID Scenarios Journal Article

In: IEEE Access, vol. 12, pp. 47961-47971, 2024.

Links | BibTeX | Tags: epi, icsc

Lorenzo Brescia, Marco Aldinucci

Secure Generic Remote Workflow Execution with TEEs Proceedings Article

In: Proceedings of the 2nd Workshop on Workflows in Distributed Environments, pp. 8-13, ACM, Athens, Greece, 2024.

Abstract | Links | BibTeX | Tags: confidential, icsc

Chi Hong, Robert Birke, Pin-Yu Chen, Lydia Y. Chen

On Dark Knowledge for Distilling Generators Proceedings Article

In: Yang, De-Nian, Xie, Xing, Tseng, Vincent S., Pei, Jian, Huang, Jen-Wei, Lin, Jerry Chun-Wei (Ed.): 28th Pacific-Asia Conference on Knowledge Discovery and Data Mining PAKDD, pp. 235–247, Springer, 2024.

Links | BibTeX | Tags: ai, eupilot, icsc

Miruna Betianu, Abele Malan, Marco Aldinucci, Robert Birke, Lydia Y. Chen

DALLMi: Domain Adaption for LLM-Based Multi-label Classifier Proceedings Article

In: Yang, De-Nian, Xie, Xing, Tseng, Vincent S., Pei, Jian, Huang, Jen-Wei, Lin, Jerry Chun-Wei (Ed.): Advances in Knowledge Discovery and Data Mining - 28th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2024, Taipei, Taiwan, May 7-10, 2024, Proceedings, Part III, pp. 277–289, Springer, 2024.

Links | BibTeX | Tags: ai, eupilot, icsc

Simon Queyrut, Robert Birke, Pascal Felber, Valerio Schiavon

CLUES: Collusive Theft of Conditional Generative Adversarial Networks Proceedings Article

In: 43rd International Symposium on Reliable Distributed Systems SRDS, 2024.

BibTeX | Tags:

Daniele De Vinco, Alessia Antelmi, Carmine Spagnuolo, Luca Maria Aiello

Deciphering Conversational Networks: Stance Detection via Hypergraphs and LLMs Proceedings Article

In: Companion Publication of the 16th ACM Web Science Conference, pp. 3–4, Association for Computing Machinery, Stuttgart, Germany, 2024, ISBN: 9798400704536.

Abstract | Links | BibTeX | Tags: analytics, icsc

Alessia Antelmi, Daniele De Vinco, Carmine Spagnuolo

HypergraphRepository: A Community-Driven and Interactive Hypernetwork Data Collection Proceedings Article

In: Dewar, Megan, Kamiński, Bogumił, Kaszyński, Daniel, Kraiński, Łukasz, Prałat, Paweł, Théberge, François, Wrzosek, Małgorzata (Ed.): Modelling and Mining Networks, pp. 159–173, Springer Nature Switzerland, Cham, 2024, ISBN: 978-3-031-59205-8.

Abstract | Links | BibTeX | Tags: analytics, icsc

Alessia Antelmi, Pasquale Caramante, Gennaro Cordasco, Giuseppe D'Ambrosio, Daniele De Vinco, Francesco Foglia, Luca Postiglione, Carmine Spagnuolo

Reliable and Efficient Agent-Based Modeling and Simulation Journal Article

In: Journal of Artificial Societies and Social Simulation, vol. 27, no. 2, pp. 4, 2024, ISSN: 1460-7425.

Abstract | Links | BibTeX | Tags: analytics, icsc

Bruno Casella, Walter Riviera, Marco Aldinucci, Gloria Menegaz

Protocol for training MERGE: A federated multi-input neural network for COVID-19 prognosis Journal Article

In: STAR Protocols, 2024, (https://prod-shared-star-protocols.s3.amazonaws.com/protocols/3225.pdf).

Abstract | Links | BibTeX | Tags: epi, icsc

Adriano Marques Garcia, Dalvan Griebler, Claudio Schepke, José Daniel García, Javier Fernández Muñoz, Luiz Gustavo Fernandes

Performance and programmability of GrPPI for parallel stream processing on multi-cores Journal Article

In: The Journal of Supercomputing, vol. In press, no. In press, pp. 1-35, 2024, ISBN: 1573-0484.

Abstract | Links | BibTeX | Tags: admire

2023

Alberto Riccardo Martinelli, Massimo Torquati, Marco Aldinucci, Iacopo Colonnelli, Barbara Cantalupo

CAPIO: a Middleware for Transparent I/O Streaming in Data-Intensive Workflows Proceedings Article

In: 2023 IEEE 30th International Conference on High Performance Computing, Data, and Analytics (HiPC), IEEE, Goa, India, 2023.

Abstract | Links | BibTeX | Tags: admire, capio, eupex, icsc

Gianluca Mittone, Giulio Malenza, Marco Aldinucci, Robert Birke

Distributed Edge Inference: an Experimental Study on Multiview Detection Proceedings Article

In: UCC '23: Proceedings of the 16th IEEE/ACM International Conference on Utility and Cloud Computing Companion, Taormina, Italy, 2023, (eupilot, icsc, In press).

Abstract | Links | BibTeX | Tags: ai

Marco Aldinucci, Elena Maria Baralis, Valeria Cardellini, Iacopo Colonnelli, Marco Danelutto, Sergio Decherchi, Giuseppe Di Modica, Luca Ferrucci, Marco Gribaudo, Francesco Iannone, Marco Lapegna, Doriana Medic, Giuseppa Muscianisi, Francesca Righetti, Eva Sciacca, Nicola Tonellotto, Mauro Tortonesi, Paolo Trunfio, Tullio Vardanega

A Systematic Mapping Study of Italian Research on Workflows Proceedings Article

In: Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis, SC-W 2023, pp. 2065–2076, ACM, Denver, CO, USA, 2023.

Abstract | Links | BibTeX | Tags: icsc, jupyter-workflow, streamflow

@inproceedings{WORKS2023,

title = {A Systematic Mapping Study of Italian Research on Workflows},

author = {Marco Aldinucci and Elena Maria Baralis and Valeria Cardellini and Iacopo Colonnelli and Marco Danelutto and Sergio Decherchi and Giuseppe Di Modica and Luca Ferrucci and Marco Gribaudo and Francesco Iannone and Marco Lapegna and Doriana Medic and Giuseppa Muscianisi and Francesca Righetti and Eva Sciacca and Nicola Tonellotto and Mauro Tortonesi and Paolo Trunfio and Tullio Vardanega},

url = {https://doi.org/10.1145/3624062.3624285},

doi = {10.1145/3624062.3624285},

year  = {2023},

date = {2023-11-01},

booktitle = {Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis, SC-W 2023},

pages = {2065–2076},

publisher = {ACM},

address = {Denver, CO, USA},

abstract = {An entire ecosystem of methodologies and tools revolves around scientific workflow management. They cover crucial non-functional requirements that standard workflow models fail to target, such as interactive execution, energy efficiency, performance portability, Big Data management, and intelligent orchestration in the Computing Continuum. Characterizing and monitoring this ecosystem is crucial to develop an informed view of current and future research directions. This work conducts a systematic mapping study of the Italian workflow research community, collecting and analyzing 25 tools and 10 applications from several scientific domains in the context of the ``National Research Centre for HPC, Big Data, and Quantum Computing'' (ICSC). The study aims to outline the main current research directions and determine how they address the critical needs of modern scientific applications. The findings highlight a variegated research ecosystem of tools, with a prominent interest in advanced workflow orchestration and still immature but promising efforts toward energy efficiency.},

keywords = {icsc, jupyter-workflow, streamflow},

pubstate = {published},

tppubtype = {inproceedings}

}

Zilong Zhao, Robert Birke, Lydia Y. Chen

FCT-GAN: Enhancing Global Correlation of Table Synthesis via Fourier Transform Proceedings Article

In: 32nd ACM International Conference on Information and Knowledge Management (CIKM '23), ACM, Birmingham, United Kingdom, 2023.

Abstract | Links | BibTeX | Tags: icsc

@inproceedings{23:zhao:fctgan,

title = {FCT-GAN: Enhancing Global Correlation of Table Synthesis via Fourier Transform},

author = {Zilong Zhao and Robert Birke and Lydia Y. Chen},

url = {https://iris.unito.it/retrieve/966ba767-dbbd-41e1-b4e3-7ab7ba09303f/FCT-GAN.pdf},

doi = {10.1145/3583780.3615202},

year  = {2023},

date = {2023-10-01},

booktitle = {32nd ACM International Conference on Information and Knowledge Management (CIKM '23)},

publisher = {ACM},

address = {Birmingham, United Kingdom},

abstract = {An alternative method for sharing knowledge while complying with strict data access regulations, such as the European General Data Protection Regulation (GDPR), is the emergence of synthetic tabular data. Mainstream table synthesizers utilize methodologies derived from Generative Adversarial Networks (GAN). Although several state-of-the-art (SOTA) tabular GAN algorithms inherit Convolutional Neural Network (CNN)-based architectures, which have proven effective for images, they tend to overlook two critical properties of tabular data: (i) the global correlation across columns, and (ii) the semantic invariance to the column order. Permuting columns in a table does not alter the semantic meaning of the data, but features extracted by CNNs can change significantly due to their limited convolution filter kernel size. To address the above problems, we propose FCT-GAN– the first conditional tabular GAN to adopt Fourier networks into table synthesis. FCT-GAN enhances permutation invariant GAN training by strengthening the learning of global correlations via Fourier layers. Extensive evaluation on benchmarks and real-world datasets show that FCT-GAN can synthesize tabular data with better (up to 27.8%) machine learning utility (i.e. a proxy of global correlations) and higher (up to 26.5%) statistical similarity to real data. FCT-GAN also has the least variation on synthetic data quality among 7 SOTA baselines on 3 different training-data column orders.},

keywords = {icsc},

pubstate = {published},

tppubtype = {inproceedings}

}

Samuele Fonio, Lorenzo Paletto, Mattia Cerrato, Dino Ienco, Roberto Esposito

Hierarchical priors for Hyperspherical Prototypical Networks Proceedings Article

In: 31th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, ESANN, Bruges, Belgium, 2023, (In print).

Abstract | Links | BibTeX | Tags: ai, icsc

Samuele Fonio

Benchmarking Federated Learning Frameworks for Medical Imaging Tasks Proceedings Article

In: Foresti, Fusiello G. L. (Ed.): Image Analysis and Processing - ICIAP 2023 Workshops. ICIAP 2023, Springer, Cham, Udine, Italy, 2023, (In print).

Abstract | Links | BibTeX | Tags: ai, eupilot, icsc

Gianluca Mittone, Samuele Fonio

Benchmarking Federated Learning Scalability Proceedings Article

In: Proceedings of the 2nd Italian Conference on Big Data and Data Science, ITADATA 2023, September 11-13, 2023, CEUR, Naples, Italy, 2023.

Abstract | Links | BibTeX | Tags: eupilot, HPC, icsc

Chi Hong, Jiyue Huang, Robert Birke, Lydia Y. Chen

Exploring and Exploiting Data-Free Model Stealing Proceedings Article

In: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD), Turin, Italy, 2023.

Abstract | Links | BibTeX | Tags: eupilot, icsc

@inproceedings{23:hong:datafree,

title = {Exploring and Exploiting Data-Free Model Stealing},

author = {Chi Hong and Jiyue Huang and Robert Birke and Lydia Y. Chen},

url = {https://iris.unito.it/retrieve/ce44dec6-12c9-443d-99e7-f1141e50aa3a/Data-free%20Model%20Stealing.pdf},

doi = {10.1007/978-3-031-43424-2_2},

year  = {2023},

date = {2023-09-01},

booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD)},

address = {Turin, Italy},

abstract = {Deep machine learning models, e.g., image classifier, are increasingly deployed in the wild to provide services to users. Adversaries are shown capable of stealing the knowledge of these models by sending inference queries and then training substitute models based on query results. The availability and quality of adversarial query inputs are undoubtedly crucial in the stealing process. The recent prior art demonstrates the feasibility of replacing real data by exploring the synthetic adversarial queries, so called data-free attacks, under strong adversarial assumptions, i.e., the deployed classier returns not only class labels but also class probabilities. In this paper, we consider a general adversarial model and propose an effective data-free stealing algorithm, Tandem-GAN, which not only explores synthetic queries but also explicitly exploits the high quality ones. The core of TandemGAN is composed of (i) substitute model which imitates the target model through synthetic queries and their inferred labels; and (ii) a tandem generator consisting of two networks, Gx and Ge, which first explores the synthetic data space via Gx and then exploits high-quality examples via Ge to maximize the knowledge transfer from the target to the substitute model. Our results on four datasets show that the accuracy of our trained substitute model ranges between 96-67% of the target model and outperforms the existing state-of-the-art data-free model stealing approach by up to 2.5X.},

keywords = {eupilot, icsc},

pubstate = {published},

tppubtype = {inproceedings}

}

Valentina Cesare, Ugo Becciani, Alberto Vecchiato, Mario Gilberto Lattanzi, Fabio Pitari, Marco Aldinucci, Beatrice Bucciarelli

The MPI + CUDA Gaia AVU–GSR Parallel Solver Toward Next-generation Exascale Infrastructures Journal Article

In: Publications of the Astronomical Society of the Pacific, vol. 135, no. 1049, 2023.

Abstract | Links | BibTeX | Tags: HPC

@article{23:GAIAMPI_PASP,

title = {The MPI + CUDA Gaia AVU–GSR Parallel Solver Toward Next-generation Exascale Infrastructures},

author = {Valentina Cesare and Ugo Becciani and Alberto Vecchiato and Mario Gilberto Lattanzi and Fabio Pitari and Marco Aldinucci and Beatrice Bucciarelli},

url = {https://iopscience.iop.org/article/10.1088/1538-3873/acdf1e/pdf},

doi = {10.1088/1538-3873/acdf1e},

year  = {2023},

date = {2023-08-01},

journal = {Publications of the Astronomical Society of the Pacific},

volume = {135},

number = {1049},

abstract = {We ported to the GPU with CUDA the Astrometric Verification Unit–Global Sphere Reconstruction (AVU–GSR) Parallel Solver developed for the ESA Gaia mission, by optimizing a previous OpenACC porting of this application. The code aims to find, with a [10, 100] μarcsec precision, the astrometric parameters of about 10^8 stars, the attitude and instrumental settings of the Gaia satellite, and the global parameter γ of the parametrized Post-Newtonian formalism, by solving a system of linear equations, A × x = b, with the LSQR iterative algorithm. The coefficient matrix A of the final Gaia data set is large, with ∼1011 × 108 elements, and sparse, reaching a size of ∼10–100 TB, typical for the Big Data analysis, which requires an efficient parallelization to obtain scientific results in reasonable timescales. The speedup of the CUDA code over the original AVU–GSR solver, parallelized on the CPU with MPI + OpenMP, increases with the system size and the number of resources, reaching a maximum of ∼14×, >9× over the OpenACC application. This result is obtained by comparing the two codes on the CINECA cluster Marconi100, with 4 V100 GPUs per node. After verifying the agreement between the solutions of a set of systems with different sizes computed with the CUDA and the OpenMP codes and that the solutions showed the required precision, the CUDA code was put in production on Marconi100, essential for an optimal AVU–GSR pipeline and the successive Gaia Data Releases. This analysis represents a first step to understand the (pre-)Exascale behavior of a class of applications that follow the same structure of this code. In the next months, we plan to run this code on the pre-Exascale platform Leonardo of CINECA, with 4 next-generation A200 GPUs per node, toward a porting on this infrastructure, where we expect to obtain even higher performances.},

key = {icsc, eupex},

keywords = {HPC},

pubstate = {published},

tppubtype = {article}

}

Gianluca Mittone, Walter Riviera, Iacopo Colonnelli, Robert Birke, Marco Aldinucci

Model-Agnostic Federated Learning Proceedings Article

In: Euro-Par 2023: Parallel Processing, pp. 383–396, Springer, Limassol, Cyprus, 2023.

Abstract | Links | BibTeX | Tags: confidential, eupilot, icsc, riscv

Zilong Zhao, Robert Birke, Lydia Y. Chen

GDTS: GAN-based Distributed Tabular Synthesizer Proceedings Article

In: 16th IEEE International Conference on Cloud Computing (CLOUD), IEEE, Chicago, USA, 2023.

Abstract | Links | BibTeX | Tags: ai

Iacopo Colonnelli, Robert Birke, Marco Aldinucci

Experimenting with PyTorch on RISC-V Proceedings Article

In: RISC-V Summit Europe 2023, Barcelona, Spain, 2023, (Poster).

Abstract | Links | BibTeX | Tags: eupilot, icsc, riscv

Alessia Antelmi, Gennaro Cordasco, Mirko Polato, Vittorio Scarano, Carmine Spagnuolo, Dingqi Yang

A Survey on Hypergraph Representation Learning Journal Article

In: ACM Comput. Surv., 2023, ISSN: 0360-0300.

Abstract | Links | BibTeX | Tags: analytics

Marco Aldinucci, Robert Birke, Antonio Brogi, Emanuele Carlini, Massimo Coppola, Marco Danelutto, Patrizio Dazzi, Luca Ferrucci, Forti Stefano, Hanna Kavalionak, Gabriele Mencagli, Matteo Mordacchin, Marcelo Pasin, Federica Paganelli, Massimo Torquati

A Proposal for a Continuum-aware Programming Model: From Workflows to Services Autonomously Interacting in the Compute Continuum Proceedings Article

In: 2023 IEEE 47th Annual Computers, Software, and Applications Conference (COMPSAC), IEEE, Turin, Italy, 2023.

Abstract | Links | BibTeX | Tags: icsc

@inproceedings{23:aldinucci:continuum,

title = {A Proposal for a Continuum-aware Programming Model: From Workflows to Services Autonomously Interacting in the Compute Continuum},

author = {Marco Aldinucci and Robert Birke and Antonio Brogi and Emanuele Carlini and Massimo Coppola and Marco Danelutto and Patrizio Dazzi and Luca Ferrucci and Forti Stefano and Hanna Kavalionak and Gabriele Mencagli and Matteo Mordacchin and Marcelo Pasin and Federica Paganelli and Massimo Torquati},

url = {https://iris.unito.it/retrieve/2ae13a33-5814-43da-8ea6-2d3e8b122384/Continuum-aware-PM.pdf},

doi = {10.1109/COMPSAC57700.2023.00287},

year  = {2023},

date = {2023-06-01},

booktitle = {2023 IEEE 47th Annual Computers, Software, and Applications Conference (COMPSAC)},

publisher = {IEEE},

address = {Turin, Italy},

abstract = {This paper proposes a continuum-aware programming model enabling the execution of application workflows across the compute continuum: cloud, fog and edge resources. It simplifies the management of heterogeneous nodes while alleviating the burden of programmers and unleashing innovation. This model optimizes the continuum through advanced development experiences by transforming workflows into autonomous service collaborations. It reduces complexity in positioning/interconnecting services across the continuum. A meta-model introduces high-level workflow descriptions as service networks with defined contracts and quality of service, thus enabling the deployment/management of workflows as first-class entities. It also provides automation based on policies, monitoring and heuristics. Tailored mechanisms orchestrate/manage services across the continuum, optimizing performance, cost, data protection and sustainability while managing risks. This model facilitates incremental development with visibility of design impacts and seamless evolution of applications and infrastructures. In this work, we explore this new computing paradigm showing how it can trigger the development of a new generation of tools to support the compute continuum progress.},

keywords = {icsc},

pubstate = {published},

tppubtype = {inproceedings}

}

Jani Valtari, Anna Kulmala, Sandro Schönborn, David Khozaya, Robert Birke, Reikko Jyrki

Real-life Pilot of Virtual Protection and Control - Experiences and Performance Analysis Proceedings Article

In: 27th International Conference on Electricity Distribution (CIRED), Rome, Italy, 2023.

Abstract | Links | BibTeX | Tags:

Sandro Schönborn, Robert Birke, David Kozhaya, Thanikesavan Sivanthi

Real-Time Performance of Virtualised Protection and Control Software Proceedings Article

In: 27th International Conference on Electricity Distribution (CIRED), Rome, Italy, 2023.

Abstract | Links | BibTeX | Tags:

Jesus Carretero, Javier Garcia-Blas, Marco Aldinucci, Jean Baptiste Besnard Besnard, Jean-Thomas Acquaviva, André Brinkmann, Marc-André Vef, Emmanuel Jeannot, Alberto Miranda, Ramon Nou, Morris Riedel, Massimo Torquati, Felix Wolf

Adaptive multi-tier intelligent data manager for Exascale Proceedings Article

In: 20th ACM International Conference on Computing Frontiers (CF '23), ACM, Bologna, Italy, 2023.

Abstract | Links | BibTeX | Tags: admire

Gianluca Mittone, Nicolò Tonci, Robert Birke, Iacopo Colonnelli, Doriana Medić, Andrea Bartolini, Roberto Esposito, Emanuele Parisi, Francesco Beneventi, Mirko Polato, Massimo Torquati, Luca Benini, Marco Aldinucci

Experimenting with Emerging RISC-V Systems for Decentralised Machine Learning Proceedings Article

In: 20th ACM International Conference on Computing Frontiers (CF '23), ACM, Bologna, Italy, 2023, ISBN: 979-8-4007-0140-5/23/05, (https://arxiv.org/abs/2302.07946).

Abstract | Links | BibTeX | Tags: confidential, eupilot, HPC, icsc, riscv

Gianluca Mittone, Filip Svoboda, Marco Aldinucci, Nicholas D. Lane, Pietro Lio

A Federated Learning Benchmark for Drug-Target Interaction Proceedings Article

In: Companion Proceedings of the ACM Web Conference 2023 (WWW '23 Companion), ACM, Austin, Texas, 2023, ISBN: 978-1-4503-9419-2/23/04, (https://arxiv.org/abs/2302.07684).

Abstract | Links | BibTeX | Tags: confidential, eupilot, icsc

Adriano Marques Garcia, Dalvan Griebler, Claudio Schepke, André Sacilotto Santos, José Daniel García, Javier Fernández Muñoz, Luiz Gustavo Fernandes

A Latency, Throughput, and Programmability Perspective of GrPPI for Streaming on Multi-cores Proceedings Article

In: 31st Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP), pp. 164-168, IEEE, Naples, Italy, 2023.

Abstract | Links | BibTeX | Tags: admire

Alberto Mulone, Sherine Awad, Davide Chiarugi, Marco Aldinucci

Porting the Variant Calling Pipeline for NGS data in cloud-HPC environment Proceedings Article

In: Shahriar, Hossain, Teranishi, Yuuichi, Cuzzocrea, Alfredo, Sharmin, Moushumi, Towey, Dave, Majumder, A. K. M. Jahangir Alam, Kashiwazaki, Hiroki, Yang, Ji-Jiang, Takemoto, Michiharu, Sakib, Nazmus, Banno, Ryohei, Ahamed, Sheikh Iqbal (Ed.): 47th IEEE Annual Computers, Software, and Applications Conference, COMPSAC 2023, pp. 1858–1863, IEEE, Torino, Italy, 2023.

Abstract | Links | BibTeX | Tags: across, icsc, streamflow

Iacopo Colonnelli, Bruno Casella, Gianluca Mittone, Yasir Arfat, Barbara Cantalupo, Roberto Esposito, Alberto Riccardo Martinelli, Doriana Medić, Marco Aldinucci

Federated Learning meets HPC and cloud Proceedings Article

In: Bufano, Filomena, Riggi, Simone, Sciacca, Eva, Schilliro, Francesco (Ed.): Astrophysics and Space Science Proceedings, pp. 193–199, Springer, Catania, Italy, 2023, ISBN: 978-3-031-34167-0, (Keynote talk).

Abstract | Links | BibTeX | Tags: across, eupilot, streamflow

Alessia Antelmi, Luca La Cava, Arianna Pera

Tell Me Who You Are and I Will Predict Your Vulnerability to Political Persuasion Techniques Proceedings Article

In: The 12th International Conference on Complex Networks and their Applications-Book of Abstracts, 2023.

Abstract | Links | BibTeX | Tags: analytics, icsc

Alessia Antelmi, Luca La Cava, Arianna Pera

Finding Hidden Swingers in the 2022 Italian Elections Twitter Discourse Proceedings Article

In: The 12th International Conference on Complex Networks and their Applications-Book of Abstracts, 2023.

Abstract | Links | BibTeX | Tags: analytics, icsc

Alessia Antelmi, Massimo Torquati, Daniele Gregori, Francesco Polzella, Gianmarco Spinatelli, Marco Aldinucci

The SWH-Analytics Framework Proceedings Article

In: Bena, Nicola, Martino, Beniamino Di, Maratea, Antonio, Sperduti, Alessandro, Nardo, Emanuel Di, Ciaramella, Angelo, Montella, Raffaele, Ardagna, Claudio A. (Ed.): Proceedings of the 2nd Italian Conference on Big Data and Data Science (ITADATA 2023), Naples, Italy, September 11-13, 2023, CEUR-WS.org, 2023.

Abstract | Links | BibTeX | Tags: admire, analytics, icsc

Iacopo Colonnelli

Workflow Models for Heterogeneous Distributed Systems Proceedings Article

Abstract | Links | BibTeX | Tags: across, eupex, icsc, jupyter-workflow, streamflow

Bruno Casella, Lorenzo Paletto

Predicting Cryptocurrencies Market Phases through On-Chain Data Long-Term Forecasting Proceedings Article

In: Proceedings of the 2023 IEEE International Conference on Blockchain and Cryptocurrency (ICBC), 1-5 May 2023, Dubai, 2023, (https://ieeexplore.ieee.org/document/10174989).

Abstract | Links | BibTeX | Tags: epi, icsc

Bruno Casella, Samuele Fonio

Architecture-Based FedAvg for Vertical Federated Learning Proceedings Article

In: Proceedings of the 3rd Workshop on Distributed Machine Learning for the Intelligent Computing Continuum (DML-ICC), IEEE/ACM UCC 2023, Taormina, Italy, 4 December 2023, 2023, (https://iris.unito.it/bitstream/2318/1949730/1/HALF_HVL_for_DML_ICC23___Taormina-2.pdf).

Abstract | Links | BibTeX | Tags: ai, epi, icsc

@inproceedings{23:casella:architecturalfedavg,

title = {Architecture-Based FedAvg for Vertical Federated Learning},

author = {Bruno Casella and Samuele Fonio},

url = {https://iris.unito.it/retrieve/173d9960-8531-419d-9bd5-5acce6694c4e/Aggregation%20Based%20VFL.pdf},

doi = {10.1145/3603166.3632559},

year  = {2023},

date = {2023-01-01},

booktitle = {Proceedings of the 3rd Workshop on Distributed Machine Learning for the Intelligent Computing Continuum (DML-ICC), IEEE/ACM UCC 2023, Taormina, Italy, 4 December 2023},

abstract = {Federated Learning (FL) has emerged as a promising solution to address privacy concerns by collaboratively training Deep Learning (DL) models across distributed parties. This work proposes an architecture-based aggregation strategy in Vertical FL, where parties hold data with different attributes but shared instances. Our approach leverages the identical architectural parts, i.e. neural network layers, of different models to selectively aggregate weights, which is particularly relevant when collaborating with institutions holding different types of datasets, i.e., image, text, or tabular datasets. In a scenario where two entities train DL models, such as a Convolutional Neural Network (CNN) and a Multi-Layer Perceptron (MLP), our strategy computes the average only for architecturally identical segments. This preserves data-specific features learned from demographic and clinical data. We tested our approach on two clinical datasets, i.e., the COVID-CXR dataset and the ADNI study. Results show that our method achieves comparable results with the centralized scenario, in which all the data are collected in a single data lake, and benefits from FL generalizability. In particular, compared to the non-federated models, our proposed proof-of-concept model exhibits a slight performance loss on the COVID-CXR dataset (less than 8%), but outperforms ADNI models by up to 12%. Moreover, communication costs between training rounds are minimized by exchanging only the dense layer parameters.},

note = {https://iris.unito.it/bitstream/2318/1949730/1/HALF_HVL_for_DML_ICC23___Taormina-2.pdf},

keywords = {ai, epi, icsc},

pubstate = {published},

tppubtype = {inproceedings}

}

Matteo Pennisi, Federica Proietto Salanitri, Giovanni Bellitto, Bruno Casella, Marco Aldinucci, Simone Palazzo, Concetto Spampinato

Experience Replay as an Effective Strategy for Optimizing Decentralized Federated Learning Proceedings Article

In: Proceedings of the 1st Workshop on Visual Continual Learning, ICCV 2023, Paris, France, 2 October 2023, 2023, (https://ieeexplore.ieee.org/document/10350429).

Abstract | Links | BibTeX | Tags: ai

@inproceedings{23:casella:ERGANs,

title = {Experience Replay as an Effective Strategy for Optimizing Decentralized Federated Learning},

author = {Matteo Pennisi and Federica Proietto Salanitri and Giovanni Bellitto and Bruno Casella and Marco Aldinucci and Simone Palazzo and Concetto Spampinato},

url = {https://openaccess.thecvf.com/content/ICCV2023W/VCL/papers/Pennisi_Experience_Replay_as_an_Effective_Strategy_for_Optimizing_Decentralized_Federated_ICCVW_2023_paper.pdf},

doi = {10.1109/ICCVW60793.2023.00362},

year  = {2023},

date = {2023-01-01},

booktitle = {Proceedings of the 1st Workshop on Visual Continual Learning, ICCV 2023, Paris, France, 2 October 2023},

abstract = {Federated and continual learning are training paradigms addressing data distribution shift in space and time. More specifically, federated learning tackles non-i.i.d data in space as information is distributed in multiple nodes, while continual learning faces with temporal aspect of training as it deals with continuous streams of data. Distribution shifts over space and time is what it happens in real federated learning scenarios that show multiple challenges. First, the federated model needs to learn sequentially while retaining knowledge from the past training rounds. Second, the model has also to deal with concept drift from the distributed data distributions. To address these complexities, we attempt to combine continual and federated learning strategies by proposing a solution inspired by experience replay and generative adversarial concepts for supporting decentralized distributed training. In particular, our approach relies on using limited memory buffers of synthetic privacy-preserving samples and interleaving training on local data and on buffer data. By translating the CL formulation into the task of integrating distributed knowledge with local knowledge, our method enables models to effectively integrate learned representation from local nodes, providing models the capability to generalize across multiple datasets.We test our integrated strategy on two realistic medical image analysis tasks — tuberculosis and melanoma classification — using multiple datasets in order to simulate realistic non-i.i.d. medical data scenarios. Results show that our approach achieves performance comparable to standard (non-federated) learning and significantly outperforms state-of-the-art federated methods in their centralized (thus, more favourable) formulation.},

note = {https://ieeexplore.ieee.org/document/10350429},

keywords = {ai},

pubstate = {published},

tppubtype = {inproceedings}

}

Giorgio Audrito, Alberto Riccardo Martinelli, Gianluca Torta

Parallelising an Aggregate Programming Framework with Message-Passing Interface Proceedings Article

In: 2023 IEEE International Conference on Autonomic Computing and Self-Organizing Systems Companion (ACSOS-C), pp. 140–145, 2023.

Links | BibTeX | Tags: HPC

Matteo Pennisi, Federica Proietto Salanitri, Giovanni Bellitto, Bruno Casella, Marco Aldinucci, Simone Palazzo, Concetto Spampinato

FedER: Federated Learning through Experience Replay and Privacy-Preserving Data Synthesis Journal Article

In: Computer Vision and Image Understanding, vol. 238, pp. 103882, 2023.

Abstract | Links | BibTeX | Tags: ai

@article{23:casella:FedER,

title = {FedER: Federated Learning through Experience Replay and Privacy-Preserving Data Synthesis},

author = {Matteo Pennisi and Federica Proietto Salanitri and Giovanni Bellitto and Bruno Casella and Marco Aldinucci and Simone Palazzo and Concetto Spampinato},

url = {https://www.sciencedirect.com/science/article/pii/S107731422300262X?via%3Dihub},

doi = {10.1016/j.cviu.2023.103882},

year  = {2023},

date = {2023-01-01},

journal = {Computer Vision and Image Understanding},

volume = {238},

pages = {103882},

institution = {Computer Science Department, University of Torino},

abstract = {In the medical field, multi-center collaborations are often sought to yield more generalizable findings by leveraging the heterogeneity of patient and clinical data. However, recent privacy regulations hinder the possibility to share data, and consequently, to come up with machine learning-based solutions that support diagnosis and prognosis. Federated learning (FL) aims at sidestepping this limitation by bringing AI-based solutions to data owners and only sharing local AI models, or parts thereof, that need then to be aggregated. However, most of the existing federated learning solutions are still at their infancy and show several shortcomings, from the lack of a reliable and effective aggregation scheme able to retain the knowledge learned locally to weak privacy preservation as real data may be reconstructed from model updates. Furthermore, the majority of these approaches, especially those dealing with medical data, relies on a centralized distributed learning strategy that poses robustness, scalability and trust issues. In this paper we present a federated and decentralized learning strategy, FedER, that, exploiting experience replay and generative adversarial concepts, effectively integrates features from local nodes, providing models able to generalize across multiple datasets while maintaining privacy. FedER is tested on two tasks — tuberculosis and melanoma classification — using multiple datasets in order to simulate realistic non-i.i.d. medical data scenarios. Results show that our approach achieves performance comparable to standard (non-federated) learning and significantly outperforms state-of-the-art federated methods in their centralized (thus, more favourable) formulation. Code is available at https://github.com/perceivelab/FedER},

keywords = {ai},

pubstate = {published},

tppubtype = {article}

}

Bruno Casella, Walter Riviera, Marco Aldinucci, Gloria Menegaz

MERGE: A model for multi-input biomedical federated learning Journal Article

In: Patterns, pp. 100856, 2023, ISSN: 2666-3899.

Abstract | Links | BibTeX | Tags: ai, epi, icsc

Javier Garcia-Blas, Genaro Sanchez-Gallegos, Cosmin Petre, Alberto Riccardo Martinelli, Marco Aldinucci, Jesus Carretero

Hercules: Scalable and Network Portable In-Memory Ad-Hoc File System for Data-Centric and High-Performance Applications Proceedings Article

In: Cano, José, Dikaiakos, Marios D., Papadopoulos, George A., Pericàs, Miquel, Sakellariou, Rizos (Ed.): Euro-Par 2023: Parallel Processing, pp. 679–693, Springer Nature Switzerland, Cham, 2023, ISBN: 978-3-031-39698-4.

Abstract | BibTeX | Tags: admire, HPC

Marco Aldinucci Mirko Polato Roberto Esposito

Boosting Methods for Federated Learning Proceedings Article

In: Calvanese, Diego, Diamantini, Claudia, Ferro, Nicola, Marchesin, Stefano, Silvello, Gianmaria, Tanca, Letizia (Ed.): Proc. of the 31th Italian Symposium on Advanced Database Systems,SEBD 2023, pp. 439–448, CEUR-WS.org, 2023.

Abstract | Links | BibTeX | Tags: eupilot

@inproceedings{DBLP:conf/sebd/Esposito23,

title = {Boosting Methods for Federated Learning},

author = {Marco Aldinucci Mirko Polato Roberto Esposito},

editor = {Diego Calvanese and Claudia Diamantini and Nicola Ferro and Stefano Marchesin and Gianmaria Silvello and Letizia Tanca},

url = {https://ceur-ws.org/Vol-3478/paper48.pdf},

year  = {2023},

date = {2023-01-01},

booktitle = {Proc. of the 31th Italian Symposium on Advanced Database Systems,SEBD 2023},

pages = {439–448},

publisher = {CEUR-WS.org},

series = {CEUR Workshop Proceedings},

abstract = {Federated Learning (FL) has been proposed to develop better AI systems without compromising the privacy of final users and the legitimate interests of private companies. Initially deployed by Google to predict text input on mobile devices, FL has been deployed in many other industries. Since its introduction, Federated Learning mainly exploited the inner working of neural networks and other gradient descent-based algorithms by either exchanging the weights of the model or the gradients computed during learning. While this approach has been very successful, it rules out applying FL in contexts where other models are preferred, e.g., easier to interpret or known to work better. This paper proposes to leverage distributed versions of the AdaBoost algorithm to acquire strong federated models. In contrast with previous approaches, our proposal does not put any constraint on the client-side learning models and does not rely on inner workings of the learning algorithms used in the clients. We perform a large set of experiments on ten UCI datasets, comparing the algorithms in six non-iidness settings. Results show that the approach is effective, in the case of an IID setting, results are often near to the theoretical optimum (i.e., the performances of AdaBoost on the complete dataset). In case of non-IID settings, results very much depend on the severity of the non-IIDness.},

keywords = {eupilot},

pubstate = {published},

tppubtype = {inproceedings}

}

Pedro Ângelo, Viviana Bono, Mariangiola Dezani-Ciancaglini, Mário Florido

Gradual Guarantee for FJ with lambda-Expressions Proceedings Article

In: Tomb, Aaron (Ed.): Proceedings of the 25th ACM International Workshop on Formal Techniques for Java-like Programs, FTfJP 2023, Seattle, WA, USA, 18 July 2023, pp. 32–38, ACM, 2023.

Links | BibTeX | Tags: admire, icsc

William Fornaciari, Federico Reghenzani, Federico Terraneo, Davide Baroffio, Cecilia Metra, Martin Omana, Josie E. Rodriguez Condia, Matteo Sonza Reorda, Robert Birke, Iacopo Colonnelli, Gianluca Mittone, Marco Aldinucci, Gabriele Mencagli, Francesco Iannone, Filippo Palombi, Giuseppe Zummo, Daniele Cesarini, Federico Tesser

RISC-V-based Platforms for HPC: Analyzing Non-functional Properties for Future HPC and Big-Data Clusters Proceedings Article

In: Embedded Computer Systems: Architectures, Modeling, and Simulation - 23rd International Conference, SAMOS 2023, Samos, Greece, 2023, (icsc).

Abstract | Links | BibTeX | Tags: icsc, riscv

Alessia Antelmi, Daniele De Vinco, Gennaro Cordasco, Carmine Spagnuolo

Towards Unraveling Developers Communities in Stack Overflow and Reddit Proceedings Article

In: International Conference on Computational Social Science 2023, 2023.

Abstract | Links | BibTeX | Tags: analytics, icsc

Alessia Antelmi

Engagement in Open Data Workshops: The dark side of remote settings Proceedings Article

In: Methodologies and Intelligent Systems for Technology Enhanced Learning, 12th International Conference, Springer International Publishing, Cham, 2023.

Abstract | Links | BibTeX | Tags: analytics, icsc

Doriana Medić, Marco Aldinucci

Towards formal model for location aware workflows Proceedings Article

Abstract | Links | BibTeX | Tags: eupex, icsc, semantics

Ovidio Filippo, Francesco Bruno, Tineke H. Pinxterhuis, Mariusz Gasior, Leor Perl, Luca Gaido, Domenico Tuttolomondo, Antonio Greco, Roberto Verardi, Gianluca Lo Martire, Mario Iannaccone, Attilio Leone, Gaetano Liccardo, Serena Caglioni, Rocio González Ferreiro, Giulio Rodinò, Giuseppe Musumeci, Giuseppe Patti, Irene Borzillo, Giuseppe Tarantini, Wojciech Wańha, Bruno Casella, Eline H Ploumen, Lukasz Pyka, Ran Kornowski, Andrea Gagnor, Raffaele Piccolo, Sergio Raposeiras Roubin, Davide Capodanno, Paolo Zocca, Federico Conrotto, Gaetano M De Ferrari, Clemens Birgelen, Fabrizio D'Ascenzo

Predictors of target lesion failure after treatment of left main, bifurcation, or chronic total occlusion lesions with ultrathin-strut drug-eluting coronary stents in the ULTRA registry Journal Article

In: Catheterization and Cardiovascular Interventions, 2023.

Abstract | Links | BibTeX | Tags: ai

@article{23:casella:ultra,

title = {Predictors of target lesion failure after treatment of left main, bifurcation, or chronic total occlusion lesions with ultrathin-strut drug-eluting coronary stents in the ULTRA registry},

author = {Ovidio Filippo and Francesco Bruno and Tineke H. Pinxterhuis and Mariusz Gasior and Leor Perl and Luca Gaido and Domenico Tuttolomondo and Antonio Greco and Roberto Verardi and Gianluca Lo Martire and Mario Iannaccone and Attilio Leone and Gaetano Liccardo and Serena Caglioni and Rocio González Ferreiro and Giulio Rodinò and Giuseppe Musumeci and Giuseppe Patti and Irene Borzillo and Giuseppe Tarantini and Wojciech Wańha and Bruno Casella and Eline H Ploumen and Lukasz Pyka and Ran Kornowski and Andrea Gagnor and Raffaele Piccolo and Sergio Raposeiras Roubin and Davide Capodanno and Paolo Zocca and Federico Conrotto and Gaetano M De Ferrari and Clemens Birgelen and Fabrizio D'Ascenzo},

url = {https://onlinelibrary.wiley.com/doi/full/10.1002/ccd.30696},

doi = {10.1002/ccd.30696},

year  = {2023},

date = {2023-01-01},

journal = {Catheterization and Cardiovascular Interventions},

abstract = {Background: Data about the long-term performance of new-generation ultrathin-strut drug-eluting stents (DES) in challenging coronary lesions, such as left main (LM), bifurcation, and chronic total occlusion (CTO) lesions are scant. Methods: The international multicenter retrospective observational ULTRA study included consecutive patients treated from September 2016 to August 2021 with ultrathin-strut (<70µm) DES in challenging de novo lesions. Primary endpoint was target lesion failure (TLF): composite of cardiac death, target-lesion revascularization (TLR), target-vessel myocardial infarction (TVMI), or definite stent thrombosis (ST). Secondary endpoints included all-cause death, acute myocardial infarction (AMI), target vessel revascularization, and TLF components. TLF predictors were assessed with Cox multivariable analysis. Results: Of 1801 patients (age: 66.6$±$11.2 years; male: 1410 [78.3%]), 170 (9.4%) experienced TLF during follow-up of 3.1$±$1.4 years. In patients with LM, CTO, and bifurcation lesions, TLF rates were 13.5%, 9.9%, and 8.9%, respectively. Overall, 160 (8.9%) patients died (74 [4.1%] from cardiac causes). AMI and TVMI rates were 6.0% and 3.2%, respectively. ST occurred in 11 (1.1%) patients while 77 (4.3%) underwent TLR. Multivariable analysis identified the following predictors of TLF: age, STEMI with cardiogenic shock, impaired left ventricular ejection fraction, diabetes, and renal dysfunction. Among the procedural variables, total stent length increased TLF risk (HR: 1.01, 95% CI: 1-1.02 per mm increase), while intracoronary imaging reduced the risk substantially (HR: 0.35, 95% CI: 0.12-0.82). Conclusions: Ultrathin-strut DES showed high efficacy and satisfactory safety, even in patients with challenging coronary lesions. Yet, despite using contemporary gold-standard DES, the association persisted between established patient- and procedure-related features of risk and impaired 3-year clinical outcome.},

keywords = {ai},

pubstate = {published},

tppubtype = {article}

}

Bruno Casella, Roberto Esposito, Antonio Sciarappa, Carlo Cavazzoni, Marco Aldinucci

Experimenting with Normalization Layers in Federated Learning on non-IID scenarios Technical Report

Computer Science Department, University of Torino 2023.

Abstract | Links | BibTeX | Tags: confidential, epi, icsc

Yasir Arfat, Gianluca Mittone, Iacopo Colonnelli, Fabrizio D'Ascenzo, Roberto Esposito, Marco Aldinucci

Pooling critical datasets with Federated Learning Proceedings Article

In: 31st Euromicro International Conference on Parallel, Distributed and Network-Based Processing, PDP 2023, pp. 329–337, IEEE, Napoli, Italy, 2023.

Abstract | Links | BibTeX | Tags: admire, confidential, hpc4ai

Sandro Gepiro Contaldo, Luca Alessandri, Iacopo Colonnelli, Marco Beccuti, Marco Aldinucci

Bringing Cell Subpopulation Discovery on a Cloud-HPC Using rCASC and StreamFlow Book Chapter

In: Calogero, Raffaele Adolfo, Benes, Vladimir (Ed.): Single Cell Transcriptomics: Methods and Protocols, pp. 337–345, Springer US, New York, NY, 2023, ISBN: 978-1-0716-2756-3.

Abstract | Links | BibTeX | Tags: streamflow

Amirmasoud Ghiassi, Robert Birke, Lydia Chen

Robust Learning via Golden Symmetric Loss of (un)Trusted Labels Proceedings Article

In: SDM '23: SIAM International Conference on Data Mining, pp. 568–576, 2023.

Abstract | Links | BibTeX | Tags: textarossa

Adriano Marques Garcia, Dalvan Griebler, Claudio Schepke, Luiz Gustavo Fernandes

Micro-batch and data frequency for stream processing on multi-cores Journal Article

In: The Journal of Supercomputing, vol. 79, no. 8, pp. 9206-9244, 2023, ISBN: 1573-0484.

Abstract | Links | BibTeX | Tags:

@article{GARCIA:JSuper:23,

title = {Micro-batch and data frequency for stream processing on multi-cores},

author = {Adriano Marques Garcia and Dalvan Griebler and Claudio Schepke and Luiz Gustavo Fernandes},

url = {https://iris.unito.it/retrieve/9328dbca-98ae-4ac5-b856-57c72db4444a/s11227-022-05024-y_preprint.pdf},

doi = {10.1007/s11227-022-05024-y},

isbn = {1573-0484},

year  = {2023},

date = {2023-01-01},

journal = {The Journal of Supercomputing},

volume = {79},

number = {8},

pages = {9206-9244},

publisher = {Springer},

abstract = {Latency or throughput is often critical performance metrics in stream processing. Applicationsâ€™ performance can fluctuate depending on the input stream. This unpredictability is due to the variety in data arrival frequency and size, complexity, and other factors. Researchers are constantly investigating new ways to mitigate the impact of these variations on performance with self-adaptive techniques involving elasticity or micro-batching. However, there is a lack of benchmarks capable of creating test scenarios to further evaluate these techniques. This work extends and improves the SPBench benchmarking framework to support dynamic micro-batching and data stream frequency management. We also propose a set of algorithms that generates the most commonly used frequency patterns for benchmarking stream processing in related work. It allows the creation of a wide variety of test scenarios. To validate our solution, we use SPBench to create custom benchmarks and evaluate the impact of micro-batching and data stream frequency on the performance of Intel TBB and FastFlow. These are two libraries that leverage stream parallelism for multi-core architectures. Our results demonstrated that our test cases did not benefit from micro-batches on multi-cores. For different data stream frequency configurations, TBB ensured the lowest latency, while FastFlow assured higher throughput in shorter pipelines.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

2022

Yujin Zhu, Zilong Zhao, Robert Birke, Lydia Y. Chen

Permutation-Invariant Tabular Data Synthesis Proceedings Article

In: Tsumoto, Shusaku, Ohsawa, Yukio, Chen, Lei, Poel, Dirk Van, Hu, Xiaohua, Motomura, Yoichi, Takagi, Takuya, Wu, Lingfei, Xie, Ying, Abe, Akihiro, Raghavan, Vijay (Ed.): IEEE International Conference on Big Data (Big Data), pp. 5855–5864, IEEE, 2022.

Abstract | Links | BibTeX | Tags:

@inproceedings{bigdata-zhu22,

title = {Permutation-Invariant Tabular Data Synthesis},

author = {Yujin Zhu and Zilong Zhao and Robert Birke and Lydia Y. Chen},

editor = {Shusaku Tsumoto and Yukio Ohsawa and Lei Chen and Dirk Van Poel and Xiaohua Hu and Yoichi Motomura and Takuya Takagi and Lingfei Wu and Ying Xie and Akihiro Abe and Vijay Raghavan},

url = {https://datacloud.di.unito.it/index.php/s/b6z3moNLxnNiCxz},

doi = {10.1109/BigData55660.2022.10020639},

year  = {2022},

date = {2022-12-01},

booktitle = {IEEE International Conference on Big Data (Big Data)},

pages = {5855–5864},

publisher = {IEEE},

abstract = {Tabular data synthesis is an emerging approach to circumvent strict regulations on data privacy while discovering knowledge through big data. Although state-of-the-art AI-based tabular data synthesizers, e.g., table-GAN, CTGAN, TVAE, and CTAB-GAN, are effective at generating synthetic tabular data, their training is sensitive to column permutations of input data. In this paper, we first c onduct a n e xtensive e mpirical s tudy to disclose such a property of permutation invariance and an in-depth analysis of the existing synthesizers. We show that changing the input column order worsens the statistical difference between real and synthetic data by up to 38.67% due to the encoding of tabular data and the network architectures. To fully unleash the potential of big synthetic tabular data, we propose two solutions: (i) AE-GAN, a synthesizer that uses an autoencoder network to represent the tabular data and GAN networks to synthesize the latent representation, and (ii) a feature sorting algorithm to find t he s uitable c olumn o rder o f i nput d ata f or CNN-based synthesizers. We evaluate the proposed solutions on five datasets in terms of the sensitivity to the column permutation, the quality of synthetic data, and the utility in downstream analyses. Our results show that we enhance the property of permutation-invariance when training synthesizers and further improve the quality and utility of synthetic data, up to 22%, compared to the existing synthesizers.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Emilio Sulis, Ilaria Angela Amantea, Marco Aldinucci, Guido Boella, Renata Marinello, Marco Grosso, Paolo Platter, Serena Ambrosini

An ambient assisted living architecture for hospital at home coupled with a process-oriented perspective Journal Article

In: Journal of Ambient Intelligence and Humanized Computing, 2022, ISBN: 1868-5145.

Abstract | Links | BibTeX | Tags: ai

Iacopo Colonnelli, Marco Aldinucci

Hybrid Workflows For Large - Scale Scientific Applications Proceedings Article

In: Sixth EAGE High Performance Computing Workshop, pp. 1–5, European Association of Geoscientists & Engineers , Milano, Italy, 2022, ISSN: 2214-4609.

Abstract | Links | BibTeX | Tags: across, eupex

@inproceedings{22:eage-hpc-workshop,

title = {Hybrid Workflows For Large - Scale Scientific Applications},

author = {Iacopo Colonnelli and Marco Aldinucci},

url = {https://iris.unito.it/retrieve/d79ddabb-f9d7-4a55-9f84-1528b1533ba3/Extended_Abstract.pdf},

doi = {10.3997/2214-4609.2022615029},

issn = {2214-4609},

year  = {2022},

date = {2022-09-01},

booktitle = {Sixth EAGE High Performance Computing Workshop},

pages = {1–5},

publisher = {European Association of Geoscientists & Engineers },

address = {Milano, Italy},

abstract = {Large-scale scientific applications are facing an irrevrsible transition from monolithic, high-performance oriented codes to modular and polyglot deployments of specialised (micro-)services. The reasons behind this transition are many: coupling of standard solvers with Deep Learning techniques, offloading of data analysis and visualisation to Cloud, and the advent of specialised hardware accelerators. Topology-aware Workflow Management Systems (WMSs) play a crucial role. In particular, topology-awareness allows an explicit mapping of workflow steps onto heterogeneous locations, allowing automated executions on top of hybrid architectures (e.g., cloud+HPC or classical+quantum). Plus, topology-aware WMSs can offer nonfunctional requirements OOTB, e.g. components' life-cycle orchestration, secure and efficient data transfers, fault tolerance, and cross-cluster execution of urgent workloads. Augmenting interactive Jupyter Notebooks with distributed workflow capabilities allows domain experts to prototype and scale applications using the same technological stack, while relying on a feature-rich and user-friendly web interface. This abstract will showcase how these general methodologies can be applied to a typical geoscience simulation pipeline based on the Full Wavefront Inversion (FWI) technique. In particular, a prototypical Jupyter Notebook will be executed interactively on Cloud. Preliminary data analyses and post-processing will be executed locally, while the computationally demanding optimisation loop will be scheduled on a remote HPC cluster.},

keywords = {across, eupex},

pubstate = {published},

tppubtype = {inproceedings}

}

Christopher Stewart, Nathaniel Morris, Lydia Y. Chen, Robert Birke

Performance Modeling for Short-Term Cache Allocation Proceedings Article

In: Proceedings of the 51st International Conference on Parallel Processing (ICPP), pp. 31:1–31:11, ACM, 2022.

Abstract | Links | BibTeX | Tags:

@inproceedings{icpp-stewart22,

title = {Performance Modeling for Short-Term Cache Allocation},

author = {Christopher Stewart and Nathaniel Morris and Lydia Y. Chen and Robert Birke},

url = {https://doi.org/10.1145/3545008.3545094},

doi = {10.1145/3545008.3545094},

year  = {2022},

date = {2022-08-01},

booktitle = {Proceedings of the 51st International Conference on Parallel Processing (ICPP)},

pages = {31:1–31:11},

publisher = {ACM},

abstract = {Short-term cache allocation grants and then revokes access to processor cache lines dynamically. For online services, short-term allocation can speed up targeted query executions and free up cache lines reserved, but normally not needed, for performance. However, in collocated settings, short-term allocation can increase cache contention, slowing down collocated query executions. To offset slowdowns, collocated services may request short-term allocation more often, making the problem worse. Short-term allocation policies manage which queries receive cache allocations and when. In collocated settings, these policies should balance targeted query speedups against slowdowns caused by recurring cache contention. We present a model-driven approach that (1) predicts response time under a given policy, (2) explores competing policies and (3) chooses policies that yield low response time for all collocated services. Our approach profiles cache usage offline, characterizes the effects of cache allocation policies using deep learning techniques and devises novel performance models for short-term allocation with online services. We tested our approach using data processing, cloud, and high-performance computing benchmarks collocated on Intel processors equipped with Cache Allocation Technology. Our models predicted median response time with 11% absolute percent error. Short-term allocation policies found using our approach out performed state-of-the-art shared cache allocation policies by 1.2-2.3X.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Mirko Polato, Roberto Esposito, Marco Aldinucci

Boosting the Federation: Cross-Silo Federated Learning without Gradient Descent Proceedings Article

In: Intl. Joint Conference on Neural Networks (IJCNN), pp. 1–10, IEEE, Padua, Italy, 2022.

Abstract | Links | BibTeX | Tags: eupilot, hpc4ai

Adriano Marques Garcia, Dalvan Griebler, Claudio Schepke, Luiz Gustavo Fernandes

Evaluating Micro-batch and Data Frequency for Stream Processing Applications on Multi-cores Proceedings Article

In: 30th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP), pp. 10-17, IEEE, Valladolid, Spain, 2022.

Abstract | Links | BibTeX | Tags:

@inproceedings{GARCIA:PDP:22,

title = {Evaluating Micro-batch and Data Frequency for Stream Processing Applications on Multi-cores},

author = {Adriano Marques Garcia and Dalvan Griebler and Claudio Schepke and Luiz Gustavo Fernandes},

url = {https://iris.unito.it/retrieve/f6d113e5-789b-4f8b-924d-8ca3d38e8d62/PDP_2022__SPBench_with_Batch_and_Data_Frequency_.pdf},

doi = {10.1109/PDP55904.2022.00011},

year  = {2022},

date = {2022-04-01},

booktitle = {30th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP)},

pages = {10-17},

publisher = {IEEE},

address = {Valladolid, Spain},

series = {PDP'22},

abstract = {In stream processing, data arrives constantly and is often unpredictable. It can show large fluctuations in arrival frequency, size, complexity, and other factors. These fluctuations can strongly impact application latency and throughput, which are critical factors in this domain. Therefore, there is a significant amount of research on self-adaptive techniques involving elasticity or micro-batching as a way to mitigate this impact. However, there is a lack of benchmarks and tools for helping researchers to investigate micro-batching and data stream frequency implications. In this paper, we extend a benchmarking framework to support dynamic micro-batching and data stream frequency management. We used it to create custom benchmarks and compare latency and throughput aspects from two different parallel libraries. We validate our solution through an extensive analysis of the impact of micro-batching and data stream frequency on stream processing applications using Intel TBB and FastFlow, which are two libraries that leverage stream parallelism on multi-core architectures. Our results demonstrated up to 33% throughput gain over latency using micro-batches. Additionally, while TBB ensures lower latency, FastFlow ensures higher throughput in the parallel applications for different data stream frequency configurations.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Amirmasoud Ghiassi, Robert Birke, Lydia Y. Chen

LABNET: A Collaborative Method for DNN Training and Label Aggregation Proceedings Article

In: Rocha, Ana Paula, Steels, Luc, Herik, H. Jaap (Ed.): 14th International Conference on Agents and Artificial Intelligence (ICAART), pp. 56–66, SCITEPRESS, 2022.

Abstract | Links | BibTeX | Tags:

@inproceedings{ghiassi/iccart22,

title = {LABNET: A Collaborative Method for DNN Training and Label Aggregation},

author = {Amirmasoud Ghiassi and Robert Birke and Lydia Y. Chen},

editor = {Ana Paula Rocha and Luc Steels and H. Jaap Herik},

url = {https://www.scitepress.org/Link.aspx?doi=10.5220/0010770400003116},

doi = {10.5220/0010770400003116},

year  = {2022},

date = {2022-02-01},

booktitle = {14th International Conference on Agents and Artificial Intelligence (ICAART)},

pages = {56–66},

publisher = {SCITEPRESS},

abstract = {Today, to label the massive datasets needed to train Deep Neural Networks (DNNs), cheap and error-prone methods such as crowdsourcing are used. Label aggregation methods aim to infer the true labels from noisy labels annotated by crowdsourcing workers via labels statistics features. Aggregated labels are the main data source to train deep neural networks, and their accuracy directly affects the deep neural network performance. In this paper, we argue that training DNN and aggregating labels are not two separate tasks. Incorporation between DNN training and label aggregation connects data features, noisy labels, and aggregated labels. Since each image contains valuable knowledge about its label, the data features help aggregation methods enhance their performance. We propose LABNET an iterative two-step method. Step one: the label aggregation algorithm provides labels to train the DNN. Step two: the DNN shares a representation of the data features with the label aggregation algorithm. These steps are repeated until the converging label aggregation error rate. To evaluate LABNET we conduct an extensive empirical comparison on CIFAR-10 and CIFAR-100 under different noise and worker statistics. Our evaluation results show that LABNET achieves the highest mean accuracy with an increase of at least 8% to 0.6% and lowest error rate with a reduction of 7.5% to 0.25% against existing aggregation and training methods in most cases.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Federica Proietto Salanitri, Giovanni Bellitto, Simone Palazzo, Ismail Irmakci, Michael B. Wallace, Candice W. Bolan, Megan Engels, Sanne Hoogenboom, Marco Aldinucci, Ulas Bagci, Daniela Giordano, Concetto Spampinato

Neural Transformers for Intraductal Papillary Mucosal Neoplasms (IPMN) Classification in MRI images Proceedings Article

In: 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, EMBC 2022, Glasgow, Scotland, United Kingdom, July 11-15, 2022, pp. 475–479, IEEE, 2022.

Links | BibTeX | Tags: hpc4ai

Valentina Cesare, Ugo Becciani, Alberto Vecchiato, Mario Gilberto Lattanzi, Fabio Pitari, Mario Raciti, Giuseppe Tudisco, Marco Aldinucci, Beatrice Bucciarelli

The Gaia AVU-GSR parallel solver: Preliminary studies of a LSQR-based application in perspective of exascale systems Journal Article

In: Astronomy and Computing, pp. 100660, 2022, ISSN: 2213-1337.

Abstract | Links | BibTeX | Tags: eupex

@article{CESARE2022100660,

title = {The Gaia AVU-GSR parallel solver: Preliminary studies of a LSQR-based application in perspective of exascale systems},

author = {Valentina Cesare and Ugo Becciani and Alberto Vecchiato and Mario Gilberto Lattanzi and Fabio Pitari and Mario Raciti and Giuseppe Tudisco and Marco Aldinucci and Beatrice Bucciarelli},

url = {https://openaccess.inaf.it/handle/20.500.12386/32451},

doi = {10.1016/j.ascom.2022.100660},

issn = {2213-1337},

year  = {2022},

date = {2022-01-01},

journal = {Astronomy and Computing},

pages = {100660},

abstract = {The Gaia Astrometric Verification Unit–Global Sphere Reconstruction (AVU–GSR) Parallel Solver aims to find the astrometric parameters for circa 10^8 stars in the Milky Way, the attitude and the instrumental specifications of the Gaia satellite, and the global parameter γ of the post Newtonian formalism. The code iteratively solves a system of linear equations, A×x=b, where the coefficient matrix A is large (circa 10^11×10^8 elements) and sparse. To solve this system of equations, the code exploits a hybrid implementation of the iterative PC-LSQR algorithm, where the computation related to different horizontal portions of the coefficient matrix is assigned to separate MPI processes. In the original code, each matrix portion is further parallelized over the OpenMP threads. To further improve the code performance, we ported the application to the GPU, replacing the OpenMP parallelization language with OpenACC. In this port, ∼95% of the data is copied from the host to the device at the beginning of the entire cycle of iterations, making the code compute bound rather than data-transfer bound. The OpenACC code presents a speedup of circa 1.5 over the OpenMP version but further optimizations are in progress to obtain higher gains. The code runs on multiple GPUs and it was tested on the CINECA supercomputer Marconi100, in anticipation of a port to the pre-exascale system Leonardo, that will be installed at CINECA in 2022.},

keywords = {eupex},

pubstate = {published},

tppubtype = {article}

}

Giovanni Agosta, Marco Aldinucci, Carlos Alvarez, Roberto Ammendola, Yasir Arfat, Olivier Beaumont, Massimo Bernaschi, Andrea Biagioni, Tommaso Boccali, Berenger Bramas, Carlo Brandolese, Barbara Cantalupo, Mauro Carrozzo, Daniele Cattaneo, Alessandro Celestini, Massimo Celino, Iacopo Colonnelli, Paolo Cretaro, Pasqua D'Ambra, Marco Danelutto, Roberto Esposito, Lionel Eyraud-Dubois, Antonio Filgueras, William Fornaciari, Ottorino Frezza, Andrea Galimberti, Francesco Giacomini, Brice Goglin, Daniele Gregori, Abdou Guermouche, Francesco Iannone, Michal Kulczewski, Francesca Lo Cicero, Alessandro Lonardo, Alberto R. Martinelli, Michele Martinelli, Xavier Martorell, Giuseppe Massari, Simone Montangero, Gianluca Mittone, Raymond Namyst, Ariel Oleksiak, Paolo Palazzari, Pier Stanislao Paolucci, Federico Reghenzani, Cristian Rossi, Sergio Saponara, Francesco Simula, Federico Terraneo, Samuel Thibault, Massimo Torquati, Matteo Turisini, Piero Vicini, Miquel Vidal, Davide Zoni, Giuseppe Zummo

Towards EXtreme scale technologies and accelerators for euROhpc hw/Sw supercomputing applications for exascale: The TEXTAROSSA approach Journal Article

In: Microprocessors and Microsystems, vol. 95, pp. 104679, 2022, ISSN: 0141-9331.

Abstract | Links | BibTeX | Tags: textarossa

@article{textarossa2022micpro:,

title = {Towards EXtreme scale technologies and accelerators for euROhpc hw/Sw supercomputing applications for exascale: The TEXTAROSSA approach},

author = {Giovanni Agosta and Marco Aldinucci and Carlos Alvarez and Roberto Ammendola and Yasir Arfat and Olivier Beaumont and Massimo Bernaschi and Andrea Biagioni and Tommaso Boccali and Berenger Bramas and Carlo Brandolese and Barbara Cantalupo and Mauro Carrozzo and Daniele Cattaneo and Alessandro Celestini and Massimo Celino and Iacopo Colonnelli and Paolo Cretaro and Pasqua D'Ambra and Marco Danelutto and Roberto Esposito and Lionel Eyraud-Dubois and Antonio Filgueras and William Fornaciari and Ottorino Frezza and Andrea Galimberti and Francesco Giacomini and Brice Goglin and Daniele Gregori and Abdou Guermouche and Francesco Iannone and Michal Kulczewski and Francesca Lo Cicero and Alessandro Lonardo and Alberto R. Martinelli and Michele Martinelli and Xavier Martorell and Giuseppe Massari and Simone Montangero and Gianluca Mittone and Raymond Namyst and Ariel Oleksiak and Paolo Palazzari and Pier Stanislao Paolucci and Federico Reghenzani and Cristian Rossi and Sergio Saponara and Francesco Simula and Federico Terraneo and Samuel Thibault and Massimo Torquati and Matteo Turisini and Piero Vicini and Miquel Vidal and Davide Zoni and Giuseppe Zummo},

doi = {10.1016/j.micpro.2022.104679},

issn = {0141-9331},

year  = {2022},

date = {2022-01-01},

journal = {Microprocessors and Microsystems},

volume = {95},

pages = {104679},

abstract = {In the near future, Exascale systems will need to bridge three technology gaps to achieve high performance while remaining under tight power constraints: energy efficiency and thermal control; extreme computation efficiency via HW acceleration and new arithmetic; methods and tools for seamless integration of reconfigurable accelerators in heterogeneous HPC multi-node platforms. TEXTAROSSA addresses these gaps through a co-design approach to heterogeneous HPC solutions, supported by the integration and extension of HW and SW IPs, programming models, and tools derived from European research.},

keywords = {textarossa},

pubstate = {published},

tppubtype = {article}

}

Bruno Casella, Roberto Esposito, Carlo Cavazzoni, Marco Aldinucci

Benchmarking FedAvg and FedCurv for Image Classification Tasks Proceedings Article

In: Anisetti, Marco, Bonifati, Angela, Bena, Nicola, Ardagna, Claudio, Malerba, Donato (Ed.): Proceedings of the 1st Italian Conference on Big Data and Data Science, ITADATA 2022, September 20-21, 2022, CEUR-WS.org, 2022.

Abstract | Links | BibTeX | Tags: eupilot

@inproceedings{casella2022benchmarking,

title = {Benchmarking FedAvg and FedCurv for Image Classification Tasks},

author = {Bruno Casella and Roberto Esposito and Carlo Cavazzoni and Marco Aldinucci},

editor = {Marco Anisetti and Angela Bonifati and Nicola Bena and Claudio Ardagna and Donato Malerba},

url = {https://ceur-ws.org/Vol-3340/paper40.pdf},

year  = {2022},

date = {2022-01-01},

booktitle = {Proceedings of the 1st Italian Conference on Big Data and Data Science, ITADATA 2022, September 20-21, 2022},

volume = {3340},

publisher = {CEUR-WS.org},

series = {CEUR Workshop Proceedings},

abstract = {Classic Machine Learning (ML) techniques require training on data available in a single data lake (either centralized or distributed). However, aggregating data from different owners is not always convenient for different reasons, including security, privacy and secrecy. Data carry a value that might vanish when shared with others; the ability to avoid sharing the data enables industrial applications where security and privacy are of paramount importance, making it possible to train global models by implementing only local policies which can be run independently and even on air-gapped data centres. Federated Learning (FL) is a distributed machine learning approach which has emerged as an effective way to address privacy concerns by only sharing local AI models while keeping the data decentralized. Two critical challenges of Federated Learning are managing the heterogeneous systems in the same federated network and dealing with real data, which are often not independently and identically distributed (non-IID) among the clients. In this paper, we focus on the second problem, i.e., the problem of statistical heterogeneity of the data in the same federated network. In this setting, local models might be strayed far from the local optimum of the complete dataset, thus possibly hindering the convergence of the federated model. Several Federated Learning algorithms, such as FedAvg, FedProx and Federated Curvature (FedCurv), aiming at tackling the non-IID setting, have already been proposed. This work provides an empirical assessment of the behaviour of FedAvg and FedCurv in common non-IID scenarios. Results show that the number of epochs per round is an important hyper-parameter that, when tuned appropriately, can lead to significant performance gains while reducing the communication cost. As a side product of this work, we release the non-IID version of the datasets we used so to facilitate further comparisons from the FL community.},

keywords = {eupilot},

pubstate = {published},

tppubtype = {inproceedings}

}

Marco Aldinucci, David Atienza, Federico Bolelli, Mónica Caballero, Iacopo Colonnelli, José Flich, Jon Ander Gómez, David González, Costantino Grana, Marco Grangetto, Simone Leo, Pedro López, Dana Oniga, Roberto Paredes, Luca Pireddu, Eduardo Quiñones, Tatiana Silva, Enzo Tartaglione, Marina Zapater

The DeepHealth Toolkit: A Key European Free and Open-Source Software for Deep Learning and Computer Vision Ready to Exploit Heterogeneous HPC and Cloud Architectures Book Section

In: Curry, Edward, Auer, Sören, Berre, Arne J., Metzger, Andreas, Perez, Maria S., Zillner, Sonja (Ed.): Technologies and Applications for Big Data Value, pp. 183–202, Springer International Publishing, Cham, 2022, ISBN: 978-3-030-78307-5.

Abstract | Links | BibTeX | Tags: deephealth, streamflow

@incollection{22:TABDV,

title = {The DeepHealth Toolkit: A Key European Free and Open-Source Software for Deep Learning and Computer Vision Ready to Exploit Heterogeneous HPC and Cloud Architectures},

author = {Marco Aldinucci and David Atienza and Federico Bolelli and Mónica Caballero and Iacopo Colonnelli and José Flich and Jon Ander Gómez and David González and Costantino Grana and Marco Grangetto and Simone Leo and Pedro López and Dana Oniga and Roberto Paredes and Luca Pireddu and Eduardo Quiñones and Tatiana Silva and Enzo Tartaglione and Marina Zapater},

editor = {Edward Curry and Sören Auer and Arne J. Berre and Andreas Metzger and Maria S. Perez and Sonja Zillner},

url = {https://link.springer.com/content/pdf/10.1007/978-3-030-78307-5_9.pdf},

doi = {10.1007/978-3-030-78307-5_9},

isbn = {978-3-030-78307-5},

year  = {2022},

date = {2022-01-01},

booktitle = {Technologies and Applications for Big Data Value},

pages = {183–202},

publisher = {Springer International Publishing},

address = {Cham},

chapter = {9},

abstract = {At the present time, we are immersed in the convergence between Big Data, High-Performance Computing and Artificial Intelligence. Technological progress in these three areas has accelerated in recent years, forcing different players like software companies and stakeholders to move quickly. The European Union is dedicating a lot of resources to maintain its relevant position in this scenario, funding projects to implement large-scale pilot testbeds that combine the latest advances in Artificial Intelligence, High-Performance Computing, Cloud and Big Data technologies. The DeepHealth project is an example focused on the health sector whose main outcome is the DeepHealth toolkit, a European unified framework that offers deep learning and computer vision capabilities, completely adapted to exploit underlying heterogeneous High-Performance Computing, Big Data and cloud architectures, and ready to be integrated into any software platform to facilitate the development and deployment of new applications for specific problems in any sector. This toolkit is intended to be one of the European contributions to the field of AI. This chapter introduces the toolkit with its main components and complementary tools, providing a clear view to facilitate and encourage its adoption and wide use by the European community of developers of AI-based solutions and data scientists working in the healthcare sector and others.},

keywords = {deephealth, streamflow},

pubstate = {published},

tppubtype = {incollection}

}

Bruno Casella, Alessio Chisari, Sebastiano Battiato, Mario Giuffrida

Transfer Learning via Test-time Neural Networks Aggregation Proceedings Article

In: Farinella, Giovanni Maria, Radeva, Petia, Bouatouch, Kadi (Ed.): Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, VISIGRAPP 2022, Volume 5: VISAPP, Online Streaming, February 6-8, 2022, pp. 642–649, INSTICC SciTePress, 2022, ISBN: 978-989-758-555-5.

Abstract | Links | BibTeX | Tags: ai

@inproceedings{22:VISAPP:transferlearning,

title = {Transfer Learning via Test-time Neural Networks Aggregation},

author = {Bruno Casella and Alessio Chisari and Sebastiano Battiato and Mario Giuffrida},

editor = {Giovanni Maria Farinella and Petia Radeva and Kadi Bouatouch},

url = {https://iris.unito.it/retrieve/handle/2318/1844159/947123/TRANSFER_LEARNING_VIA_TEST_TIME_NEURAL_NETWORKS_AGGREGATION.pdf},

doi = {10.5220/0010907900003124},

isbn = {978-989-758-555-5},

year  = {2022},

date = {2022-01-01},

booktitle = {Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, VISIGRAPP 2022, Volume 5: VISAPP, Online Streaming, February 6-8, 2022},

pages = {642–649},

publisher = {SciTePress},

organization = {INSTICC},

abstract = {It has been demonstrated that deep neural networks outperform traditional machine learning. However, deep networks lack generalisability, that is, they will not perform as good as in a new (testing) set drawn from a different distribution due to the domain shift. In order to tackle this known issue, several transfer learning approaches have been proposed, where the knowledge of a trained model is transferred into another to improve performance with different data. However, most of these approaches require additional training steps, or they suffer from catastrophic forgetting that occurs when a trained model has overwritten previously learnt knowledge. We address both problems with a novel transfer learning approach that uses network aggregation. We train dataset-specific networks together with an aggregation network in a unified framework. The loss function includes two main components: a task-specific loss (such as cross-entropy) and an aggregation loss. The proposed aggregation loss allows our model to learn how trained deep network parameters can be aggregated with an aggregation operator. We demonstrate that the proposed approach learns model aggregation at test time without any further training step, reducing the burden of transfer learning to a simple arithmetical operation. The proposed approach achieves comparable performance w.r.t. the baseline. Besides, if the aggregation operator has an inverse, we will show that our model also inherently allows for selective forgetting, i.e., the aggregated model can forget one of the datasets it was trained on, retaining information on the others.},

keywords = {ai},

pubstate = {published},

tppubtype = {inproceedings}

}

Eduardo Quiñones, Jesus Perales, Jorge Ejarque, Asaf Badouh, Santiago Marco, Fabrice Auzanneau, François Galea, David González, José Ramón Hervás, Tatiana Silva, Iacopo Colonnelli, Barbara Cantalupo, Marco Aldinucci, Enzo Tartaglione, Rafael Tornero, José Flich, Jose Maria Martinez, David Rodriguez, Izan Catalán, Jorge Garcia, Carles Hernández

The DeepHealth HPC Infrastructure: Leveraging Heterogenous HPC and Cloud Computing Infrastructures for IA-based Medical Solutions Book Section

In: Terzo, Olivier, Martinovič, Jan (Ed.): HPC, Big Data, and AI Convergence Towards Exascale: Challenge and Vision, pp. 191–216, CRC Press, Boca Raton, Florida, 2022, ISBN: 978-1-0320-0984-1.

Abstract | Links | BibTeX | Tags: deephealth, streamflow

Martin Golasowski, Jan Martinovič, Marc Levrier, Stephan Hachinger, Sophia Karagiorgou, Aikaterini Papapostolou, Spiros Mouzakitis, Ioannis Tsapelas, Monica Caballero, Marco Aldinucci, Jon Ander Gómez, Antony Chazapis, Jean-Thomas Acquaviva

Toward the Convergence of High-Performance Computing, Cloud, and Big Data Domains Book Section

In: Terzo, Olivier, Martinovič, Jan (Ed.): HPC, Big Data, and AI Convergence Towards Exascale: Challenge and Vision, pp. 1–16, CRC Press, Boca Raton, Florida, 2022, ISBN: 978-1-0320-0984-1.

Abstract | Links | BibTeX | Tags: deephealth, streamflow

Dana Oniga, Barbara Cantalupo, Enzo Tartaglione, Daniele Perlo, Marco Grangetto, Marco Aldinucci, Federico Bolelli, Federico Pollastri, Michele Cancilla, Laura Canalini, Costantino Grana, Cristina Muñoz Alcalde, Franco Alberto Cardillo, Monica Florea

Applications of AI and HPC in the Health Domain Book Section

In: Terzo, Olivier, Martinovič, Jan (Ed.): HPC, Big Data, and AI Convergence Towards Exascale: Challenge and Vision, pp. 217–239, CRC Press, Boca Raton, Florida, 2022, ISBN: 978-1-0320-0984-1.

Abstract | Links | BibTeX | Tags: deephealth, streamflow

Iacopo Colonnelli, Marco Aldinucci, Barbara Cantalupo, Luca Padovani, Sergio Rabellino, Concetto Spampinato, Roberto Morelli, Rosario Di Carlo, Nicolò Magini, Carlo Cavazzoni

Distributed workflows with Jupyter Journal Article

In: Future Generation Computer Systems, vol. 128, pp. 282–298, 2022, ISSN: 0167-739X.

Abstract | Links | BibTeX | Tags: across, deephealth, jupyter-workflow, streamflow

@article{21:FGCS:jupyflow,

title = {Distributed workflows with Jupyter},

author = {Iacopo Colonnelli and Marco Aldinucci and Barbara Cantalupo and Luca Padovani and Sergio Rabellino and Concetto Spampinato and Roberto Morelli and Rosario Di Carlo and Nicolò Magini and Carlo Cavazzoni},

url = {https://www.sciencedirect.com/science/article/pii/S0167739X21003976},

doi = {10.1016/j.future.2021.10.007},

issn = {0167-739X},

year  = {2022},

date = {2022-01-01},

journal = {Future Generation Computer Systems},

volume = {128},

pages = {282–298},

abstract = {The designers of a new coordination interface enacting complex workflows have to tackle a dichotomy: choosing a language-independent or language-dependent approach. Language-independent approaches decouple workflow models from the host code's business logic and advocate portability. Language-dependent approaches foster flexibility and performance by adopting the same host language for business and coordination code. Jupyter Notebooks, with their capability to describe both imperative and declarative code in a unique format, allow taking the best of the two approaches, maintaining a clear separation between application and coordination layers but still providing a unified interface to both aspects. We advocate the Jupyter Notebooks' potential to express complex distributed workflows, identifying the general requirements for a Jupyter-based Workflow Management System (WMS) and introducing a proof-of-concept portable implementation working on hybrid Cloud-HPC infrastructures. As a byproduct, we extended the vanilla IPython kernel with workflow-based parallel and distributed execution capabilities. The proposed Jupyter-workflow (Jw) system is evaluated on common scenarios for High Performance Computing (HPC) and Cloud, showing its potential in lowering the barriers between prototypical Notebooks and production-ready implementations.},

keywords = {across, deephealth, jupyter-workflow, streamflow},

pubstate = {published},

tppubtype = {article}

}

Bart Cox, Robert Birke, Lydia Y. Chen

Memory-aware and context-aware multi-DNN inference on the edge Journal Article

In: Pervasive and Mobile Computing, vol. 83, pp. 1–16, 2022, ISSN: 1574-1192.

Abstract | Links | BibTeX | Tags: ai

@article{COX2022101594,

title = {Memory-aware and context-aware multi-DNN inference on the edge},

author = {Bart Cox and Robert Birke and Lydia Y. Chen},

url = {https://www.sciencedirect.com/science/article/pii/S1574119222000372},

doi = {https://doi.org/10.1016/j.pmcj.2022.101594},

issn = {1574-1192},

year  = {2022},

date = {2022-01-01},

journal = {Pervasive and Mobile Computing},

volume = {83},

pages = {1–16},

abstract = {Deep neural networks (DNNs) are becoming the core components of many applications running on edge devices, especially for real time image-based analysis. Increasingly, multi-faced knowledge is extracted by executing multiple DNNs inference models, e.g., identifying objects, faces, and genders from images. It is of paramount importance to guarantee low response times of such multi-DNN executions as it affects not only users quality of experience but also safety. The challenge, largely unaddressed by the state of the art, is how to overcome the memory limitation of edge devices without altering the DNN models. In this paper, we design and implement Masa, a responsive memory-aware multi-DNN execution and scheduling framework, which requires no modification of DNN models. The aim of Masa is to consistently ensure the average response time when deterministically and stochastically executing multiple DNN-based image analyses. The enabling features of Masa are (i) modeling inter- and intra-network dependency, (ii) leveraging complimentary memory usage of each layer, and (iii) exploring the context dependency of DNNs. We verify the correctness and scheduling optimality via mixed integer programming. We extensively evaluate two versions of Masa, context-oblivious and context-aware, on three configurations of Raspberry Pi and a large set of popular DNN models triggered by different generation patterns of images. Our evaluation results show that Masa can achieve lower average response times by up to 90% on devices with small memory, i.e., 512 MB to 1 GB, compared to the state of the art multi-DNN scheduling solutions.},

keywords = {ai},

pubstate = {published},

tppubtype = {article}

}

Guglielmo Gallone, Jeehoon Kang, Francesco Bruno, Jung-Kyu Han, Ovidio De Filippo, Han-Mo Yang, Mattia Doronzo, Kyung-Woo Park, Gianluca Mittone, Hyun-Jae Kang, Radoslaw Parma, Hyeon-Cheol Gwon, Enrico Cerrato, Woo Jung Chun, Grzegorz Smolka, Seung-Ho Hur, Gerard Helft, Seung Hwan Han, Saverio Muscoli, Young Bin Song, Filippo Figini, Ki Hong Choi, Giacomo Boccuzzi, Soon-Jun Hong, Daniela Trabattoni, Chang-Wook Nam, Massimo Giammaria, Hyo-Soo Kim, Federico Conrotto, Javier Escaned, Carlo Di Mario, Fabrizio D'Ascenzo, Bon-Kwon Koo, Gaetano Maria Ferrari

Impact of Left Ventricular Ejection Fraction on Procedural and Long-Term Outcomes of Bifurcation Percutaneous Coronary Intervention Journal Article

In: The American Journal of Cardiology, vol. 172, pp. 18–25, 2022, ISSN: 0002-9149.

Abstract | Links | BibTeX | Tags:

@article{GALLONE202218,

title = {Impact of Left Ventricular Ejection Fraction on Procedural and Long-Term Outcomes of Bifurcation Percutaneous Coronary Intervention},

author = {Guglielmo Gallone and Jeehoon Kang and Francesco Bruno and Jung-Kyu Han and Ovidio De Filippo and Han-Mo Yang and Mattia Doronzo and Kyung-Woo Park and Gianluca Mittone and Hyun-Jae Kang and Radoslaw Parma and Hyeon-Cheol Gwon and Enrico Cerrato and Woo Jung Chun and Grzegorz Smolka and Seung-Ho Hur and Gerard Helft and Seung Hwan Han and Saverio Muscoli and Young Bin Song and Filippo Figini and Ki Hong Choi and Giacomo Boccuzzi and Soon-Jun Hong and Daniela Trabattoni and Chang-Wook Nam and Massimo Giammaria and Hyo-Soo Kim and Federico Conrotto and Javier Escaned and Carlo Di Mario and Fabrizio D'Ascenzo and Bon-Kwon Koo and Gaetano Maria Ferrari},

url = {https://www.sciencedirect.com/science/article/pii/S0002914922001692},

doi = {https://doi.org/10.1016/j.amjcard.2022.02.015},

issn = {0002-9149},

year  = {2022},

date = {2022-01-01},

journal = {The American Journal of Cardiology},

volume = {172},

pages = {18–25},

abstract = {The association of left ventricular ejection fraction (LVEF) with procedural and long-term outcomes after state-of-the-art percutaneous coronary intervention (PCI) of bifurcation lesions remains unsettled. A total of 5,333 patients who underwent contemporary coronary bifurcation PCI were included in the intercontinental retrospective combined insights from the unified RAIN (veRy thin stents for patients with left mAIn or bifurcatioN in real life) and COBIS (COronary BIfurcation Stenting) III bifurcation registries. Of 5,003 patients (93.8%) with known baseline LVEF, 244 (4.9%) had LVEF <40% (bifurcation with reduced ejection fraction [BIFrEF] group), 430 (8.6%) had LVEF 40% to 49% (bifurcation with mildly reduced ejection fraction [BIFmEF] group) and 4,329 (86.5%) had ejection fraction (EF) ≥50% (bifurcation with preserved ejection fraction [BIFpEF] group). The primary end point was the Kaplan-Meier estimate of major adverse cardiac events (MACEs) (a composite of all-cause death, myocardial infarction, and target vessel revascularization). Patients with BIFrEF had a more complex clinical profile and coronary anatomy. No difference in procedural (30 days) MACE was observed across EF categories, also after adjustment for in-study outcome predictors (BIFrEF vs BIFmEF: adjusted hazard ratio [adj-HR] 1.39, 95% confidence interval [CI] 0.37 to 5.21},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Adriano Marques Garcia, Dalvan Griebler, Claudio Schepke, Luiz Gustavo Fernandes

SPBench: a framework for creating benchmarks of stream processing applications Journal Article

In: Computing, vol. 105, no. 5, pp. 1077-1099, 2022, ISBN: 1436-5057.

Abstract | Links | BibTeX | Tags:

2021

Amirmasoud Ghiassi, Robert Birke, Lydia Y. Chen

TrustNet: Learning from Trusted Data Against (A)symmetric Label Noise Proceedings Article

In: 8th IEEE/ACM International Conference on Big Data Computing, Applications and Technologies (BDCAT), pp. 52–62, ACM, 2021.

Abstract | Links | BibTeX | Tags:

Zilong Zhao, Aditya Kunar, Robert Birke, Lydia Y. Chen

CTAB-GAN: Effective Table Data Synthesizing Proceedings Article

In: Balasubramanian, Vineeth N., Tsang, Ivor (Ed.): Proceedings of The 13th Asian Conference on Machine Learning, pp. 97–112, PMLR, 2021.

Abstract | Links | BibTeX | Tags:

@inproceedings{pmlr-v157-zhao21a,

title = {CTAB-GAN: Effective Table Data Synthesizing},

author = {Zilong Zhao and Aditya Kunar and Robert Birke and Lydia Y. Chen},

editor = {Vineeth N. Balasubramanian and Ivor Tsang},

url = {https://proceedings.mlr.press/v157/zhao21a.html},

year  = {2021},

date = {2021-11-01},

booktitle = {Proceedings of The 13th Asian Conference on Machine Learning},

volume = {157},

pages = {97–112},

publisher = {PMLR},

series = {Proceedings of Machine Learning Research},

abstract = {While data sharing is crucial for knowledge development, privacy concerns and strict regulation (e.g., European General Data Protection Regulation (GDPR)) unfortunately limit its full effectiveness. Synthetic tabular data emerges as an alternative to enable data sharing while fulfilling regulatory and privacy constraints. The state-of-the-art tabular data synthesizers draw methodologies from Generative Adversarial Networks (GAN) and address two main data types in industry, i.e., continuous and categorical. In this paper, we develop CTAB-GAN, a novel conditional table GAN architecture that can effectively model diverse data types, including a mix of continuous and categorical variables. Moreover, we address data imbalance and long tail issues, i.e., certain variables have drastic frequency differences across large values. To achieve those aims, we first introduce the information loss, classification loss and generator loss to the conditional GAN. Secondly, we design a novel conditional vector, which efficiently encodes the mixed data type and skewed distribution of data variable. We extensively evaluate CTAB-GAN with the state of the art GANs that generate synthetic tables, in terms of data similarity and analysis utility. The results on five datasets show that the synthetic data of CTAB-GAN remarkably resembles the real data for all three types of variables and results into higher accuracy for five machine learning algorithms, by up to 17%.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Taraneh Younesian, Zilong Zhao, Amirmasoud Ghiassi, Robert Birke, Lydia Y Chen

QActor: Active Learning on Noisy Labels Proceedings Article

In: Balasubramanian, Vineeth N., Tsang, Ivor (Ed.): Proceedings of The 13th Asian Conference on Machine Learning, pp. 548–563, PMLR, 2021.

Abstract | Links | BibTeX | Tags:

Giuliano Albanese, Robert Birke, Georgia Giannopoulou, Sandro Schönborn, Thanikesavan Sivanthi

Evaluation of Networking Options for Containerized Deployment of Real-Time Applications Proceedings Article

In: 26th IEEE International Conference on Emerging Technologies and Factory Automation (ETFA), pp. 1–8, IEEE, 2021.

Abstract | Links | BibTeX | Tags:

Giovanni Agosta, William Fornaciari, Andrea Galimberti, Giuseppe Massari, Federico Reghenzani, Federico Terraneo, Davide Zoni, Carlo Brandolese, Massimo Celino, Francesco Iannone, Paolo Palazzari, Giuseppe Zummo, Massimo Bernaschi, Pasqua D'Ambra, Sergio Saponara, Marco Danelutto, Massimo Torquati, Marco Aldinucci, Yasir Arfat, Barbara Cantalupo, Iacopo Colonnelli, Roberto Esposito, Alberto Riccardo Martinelli, Gianluca Mittone, Olivier Beaumont, Berenger Bramas, Lionel Eyraud-Dubois, Brice Goglin, Abdou Guermouche, Raymond Namyst, Samuel Thibault, Antonio Filgueras, Miquel Vidal, Carlos Alvarez, Xavier Martorell, Ariel Oleksiak, Michal Kulczewski, Alessandro Lonardo, Piero Vicini, Francesco Lo Cicero, Francesco Simula, Andrea Biagioni, Paolo Cretaro, Ottorino Frezza, Pier Stanislao Paolucci, Matteo Turisini, Francesco Giacomini, Tommaso Boccali, Simone Montangero, Roberto Ammendola

TEXTAROSSA: Towards EXtreme scale Technologies and Accelerators for euROhpc hw/Sw Supercomputing Applications for exascale Proceedings Article

In: Proc. of the 24th Euromicro Conference on Digital System Design (DSD), IEEE, Palermo, Italy, 2021.

Abstract | Links | BibTeX | Tags: streamflow, textarossa

@inproceedings{21:DSD:textarossa,

title = {TEXTAROSSA: Towards EXtreme scale Technologies and Accelerators for euROhpc hw/Sw Supercomputing Applications for exascale},

author = {Giovanni Agosta and William Fornaciari and Andrea Galimberti and Giuseppe Massari and Federico Reghenzani and Federico Terraneo and Davide Zoni and Carlo Brandolese and Massimo Celino and Francesco Iannone and Paolo Palazzari and Giuseppe Zummo and Massimo Bernaschi and Pasqua D'Ambra and Sergio Saponara and Marco Danelutto and Massimo Torquati and Marco Aldinucci and Yasir Arfat and Barbara Cantalupo and Iacopo Colonnelli and Roberto Esposito and Alberto Riccardo Martinelli and Gianluca Mittone and Olivier Beaumont and Berenger Bramas and Lionel Eyraud-Dubois and Brice Goglin and Abdou Guermouche and Raymond Namyst and Samuel Thibault and Antonio Filgueras and Miquel Vidal and Carlos Alvarez and Xavier Martorell and Ariel Oleksiak and Michal Kulczewski and Alessandro Lonardo and Piero Vicini and Francesco Lo Cicero and Francesco Simula and Andrea Biagioni and Paolo Cretaro and Ottorino Frezza and Pier Stanislao Paolucci and Matteo Turisini and Francesco Giacomini and Tommaso Boccali and Simone Montangero and Roberto Ammendola},

doi = {10.1109/DSD53832.2021.00051},

year  = {2021},

date = {2021-08-01},

booktitle = {Proc. of the 24th Euromicro Conference on Digital System Design (DSD)},

publisher = {IEEE},

address = {Palermo, Italy},

abstract = {To achieve high performance and high energy effi- ciency on near-future exascale computing systems, three key technology gaps needs to be bridged. These gaps include: en- ergy efficiency and thermal control; extreme computation effi- ciency via HW acceleration and new arithmetics; methods and tools for seamless integration of reconfigurable accelerators in heterogeneous HPC multi-node platforms. TEXTAROSSA aims at tackling this gap through a co-design approach to heterogeneous HPC solutions, supported by the integration and extension of HW and SW IPs, programming models and tools derived from European research.},

keywords = {streamflow, textarossa},

pubstate = {published},

tppubtype = {inproceedings}

}

Amirmasoud Ghiassi, Robert Birke, Rui Han, Lydia Y. Chen

LABELNET: Recovering Noisy Labels Proceedings Article

In: International Joint Conference on Neural Networks (IJCNN), pp. 1–8, IEEE, 2021.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Giovanni Agosta, Antonio Andreini, Claudio A. Ardagna, Andrea Bartolini, Alessandro Cilardo, Biagio Cosenza, Marco Danelutto, Roberto Esposito, William Fornaciari, Roberto Giorgi, Davide Lengani, Raffaele Montella, Mauro Olivieri, Sergio Saponara, Daniele Simoni, Massimo Torquati

The Italian research on HPC key technologies across EuroHPC Proceedings Article

In: ACM Computing Frontiers, pp. 279–286, ACM, Virtual Conference, Italy, 2021.

Abstract | Links | BibTeX | Tags: admire, eupex, eupilot, textarossa

@inproceedings{21:CINI_acm_CF,

title = {The Italian research on HPC key technologies across EuroHPC},

author = {Marco Aldinucci and Giovanni Agosta and Antonio Andreini and Claudio A. Ardagna and Andrea Bartolini and Alessandro Cilardo and Biagio Cosenza and Marco Danelutto and Roberto Esposito and William Fornaciari and Roberto Giorgi and Davide Lengani and Raffaele Montella and Mauro Olivieri and Sergio Saponara and Daniele Simoni and Massimo Torquati},

url = {https://iris.unito.it/retrieve/handle/2318/1783118/744641/preprint.pdf},

doi = {10.1145/3457388.3458508},

year  = {2021},

date = {2021-05-01},

booktitle = {ACM Computing Frontiers},

pages = {279–286},

publisher = {ACM},

address = {Virtual Conference, Italy},

abstract = {High-Performance Computing (HPC) is one of the strategic priorities for research and innovation worldwide due to its relevance for industrial and scientific applications. We envision HPC as composed of three pillars: infrastructures, applications, and key technologies and tools. While infrastructures are by construction centralized in large-scale HPC centers, and applications are generally within the purview of domain-specific organizations, key technologies fall in an intermediate case where coordination is needed, but design and development are often decentralized. A large group of Italian researchers has started a dedicated laboratory within the National Interuniversity Consortium for Informatics (CINI) to address this challenge. The laboratory, albeit young, has managed to succeed in its first attempts to propose a coordinated approach to HPC research within the EuroHPC Joint Undertaking, participating in the calls 2019-20 to five successful proposals for an aggregate total cost of 95M Euro. In this paper, we outline the working group's scope and goals and provide an overview of the five funded projects, which become fully operational in March 2021, and cover a selection of key technologies provided by the working group partners, highlighting their usage development within the projects.},

keywords = {admire, eupex, eupilot, textarossa},

pubstate = {published},

tppubtype = {inproceedings}

}

Carmelo Pino, Simone Palazzo, Francesca Trenta, Francesca Cordero, Ulas Bagci, Francesco Rundo, Sebastiano Battiato, Daniela Giordano, Marco Aldinucci, Concetto Spampinato

Interpretable Deep Model for Predicting Gene-Addicted Non-Small-Cell Lung Cancer in CT Scans Proceedings Article

In: 18th IEEE Intl. Symposium on Biomedical Imaging (ISBI), IEEE, Nice, France, 2021.

Abstract | Links | BibTeX | Tags: deephealth

@inproceedings{21:ct:isbi,

title = {Interpretable Deep Model for Predicting Gene-Addicted Non-Small-Cell Lung Cancer in CT Scans},

author = {Carmelo Pino and Simone Palazzo and Francesca Trenta and Francesca Cordero and Ulas Bagci and Francesco Rundo and Sebastiano Battiato and Daniela Giordano and Marco Aldinucci and Concetto Spampinato},

url = {https://iris.unito.it/retrieve/handle/2318/1790376/764762/21_ISBI_smallcell.pdf},

doi = {10.1109/ISBI48211.2021.9433832},

year  = {2021},

date = {2021-04-01},

booktitle = {18th IEEE Intl. Symposium on Biomedical Imaging (ISBI)},

publisher = {IEEE},

address = {Nice, France},

abstract = {Genetic profiling and characterization of lung cancers have recently emerged as a new technique for targeted therapeutic treatment based on immunotherapy or molecular drugs. However, the most effective way to discover specific gene mutations through tissue biopsy has several limitations, from invasiveness to being a risky procedure. Recently, quantitative assessment of visual features from CT data has been demonstrated to be a valid alternative to biopsy for the diagnosis of gene-addicted tumors. In this paper, we present a deep model for automated lesion segmentation and classification as gene-addicted or not. The segmentation approach extends the 2D Tiramisu architecture for 3D segmentation through dense blocks and squeeze-and-excitation layers, while a multi-scale 3D CNN is used for lesion classification. We also train our model with adversarial samples, and show that this approach acts as a gradient regularizer and enhances model interpretability. We also built a dataset, the first of its nature, consisting of 73 CT scans annotated with the presence of a specific genomics profile. We test our approach on this dataset achieving a segmentation accuracy of 93.11% (Dice score) and a classification accuracy in identifying oncogene-addicted lung tumors of 82.00%.},

keywords = {deephealth},

pubstate = {published},

tppubtype = {inproceedings}

}

Chi Hong, Amirmasoud Ghiassi, Yichi Zhou, Robert Birke, Lydia Y. Chen

Online Label Aggregation: A Variational Bayesian Approach Proceedings Article

In: Leskovec, Jure, Grobelnik, Marko, Najork, Marc, Tang, Jie, Zia, Leila (Ed.): WWW '21: The Web Conference 2021, pp. 1904–1915, ACM / IW3C2, 2021.

Abstract | Links | BibTeX | Tags: ai

Adriano Marques Garcia, Dalvan Griebler, Claudio Schepke, Luiz Gustavo Fernandes

Introducing a Stream Processing Framework for Assessing Parallel Programming Interfaces Proceedings Article

In: 29th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP), pp. 84-88, IEEE, Valladolid, Spain, 2021.

Abstract | Links | BibTeX | Tags:

Bart Cox, Jeroen Galjaard, Amirmasoud Ghiassi, Robert Birke, Lydia Y. Chen

Masa: Responsive Multi-DNN Inference on the Edge Proceedings Article

In: 19th IEEE International Conference on Pervasive Computing and Communications (PerCom), pp. 1–10, IEEE, 2021.

Abstract | Links | BibTeX | Tags:

Jeroen Galjaard, Bart Cox, Amirmasoud Ghiassi, Lydia Y. Chen, Robert Birke

MemA: Fast Inference of Multiple Deep Models Proceedings Article

In: 19th IEEE International Conference on Pervasive Computing and Communications Workshops and other Affiliated Events, pp. 281–286, IEEE, 2021.

Abstract | Links | BibTeX | Tags:

@inproceedings{percom-galjaard21,

title = {MemA: Fast Inference of Multiple Deep Models},

author = {Jeroen Galjaard and Bart Cox and Amirmasoud Ghiassi and Lydia Y. Chen and Robert Birke},

url = {https://doi.org/10.1109/PerComWorkshops51409.2021.9430952},

doi = {10.1109/PerComWorkshops51409.2021.9430952},

year  = {2021},

date = {2021-03-01},

booktitle = {19th IEEE International Conference on Pervasive Computing and Communications Workshops and other Affiliated Events},

pages = {281–286},

publisher = {IEEE},

abstract = {The execution of deep neural network (DNN) inference jobs on edge devices has become increasingly popular. Multiple of such inference models can concurrently analyse the on-device data, e.g. images, to extract valuable insights. Prior art focuses on low-power accelerators, compressed neural network architectures, and specialized frameworks to reduce execution time of single inference jobs on edge devices which are resource constrained. However, it is little known how different scheduling policies can further improve the runtime performance of multi-inference jobs without additional edge resources. To enable the exploration of scheduling policies, we first develop an execution framework, EdgeCaffe, which splits the DNN inference jobs by loading and execution of each network layer. We empirically characterize the impact of loading and scheduling policies on the execution time of multi-inference jobs and point out their dependency on the available memory space. We propose a novel memory-aware scheduling policy, MemA, which opportunistically interleaves the executions of different types of DNN layers based on their estimated run-time memory demands. Our evaluation on exhaustive combinations of five networks, data inputs, and memory configurations show that MemA can alleviate the degradation of execution times of multi-inference (up to 5×) under severely constrained memory compared to standard scheduling policies without affecting accuracy.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Gianluca Bontempi, Ricardo Chavarriaga, Hans De Canck, Emanuela Girardi, Holger Hoos, Iarla Kilbane‐Dawe, Tonio Ball, Ann Nowé, Jose Sousa, Davide Bacciu, Marco Aldinucci, Manlio De Domenico, Alessandro Saffiotti, Marco Maratea

The CLAIRE COVID-19 initiative: approach, experiences and recommendations Journal Article

In: Ethics and Information Technology, 2021.

Abstract | Links | BibTeX | Tags: deephealth

Ivan Lanese, Doriana Medić, Claudio Antares Mezzina

Static versus dynamic reversibility in CCS Journal Article

In: Acta Informatica, vol. 58, pp. 1–34, 2021.

Abstract | Links | BibTeX | Tags: semantics

Clément Aubert, Doriana Medić

Explicit Identifiers and Contexts in Reversible Concurrent Calculus Proceedings Article

In: Reversible Computation - 13th International Conference, RC 2021, Virtual Event, July 7-8, 2021, Proceedings, Springer, 2021.

Abstract | Links | BibTeX | Tags: semantics

C. Pino, G. Vecchio, Marco Fronda, Marco Calandri, Marco Aldinucci, Concetto Spampinato

TwinLiverNet: Predicting TACE Treatment Outcome from CT scans for Hepatocellular Carcinoma using Deep Capsule Networks Proceedings Article

In: 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, EMBC 2021, Mexico, November 1-5, 2021, pp. 3039–3043, IEEE, 2021.

Abstract | Links | BibTeX | Tags: ai

@inproceedings{21:DBLP:conf/embc/PinoVFCAS21,

title = {TwinLiverNet: Predicting TACE Treatment Outcome from CT scans for Hepatocellular Carcinoma using Deep Capsule Networks},

author = {C. Pino and G. Vecchio and Marco Fronda and Marco Calandri and Marco Aldinucci and Concetto Spampinato},

url = {https://doi.org/10.1109/EMBC46164.2021.9630913},

doi = {10.1109/EMBC46164.2021.9630913},

year  = {2021},

date = {2021-01-01},

booktitle = {43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, EMBC 2021, Mexico, November 1-5, 2021},

pages = {3039–3043},

publisher = {IEEE},

abstract = {Predicting response to treatment plays a key role to assist radiologists in hepato-cellular carcinoma (HCC) therapy planning. The most widely used treatment for unresectable HCC is the trans-arterial chemoembolization (TACE). A complete radiological response after the first TACE is a reliable predictor of treatment favourable outcome. However, visual inspection of contrast-enhanced CT scans is time-consuming, error prone and too operator-dependent. Thus, in this paper we propose TwinLiverNet: a deep neural network that is able to predict TACE treatment outcome through learning visual cue from CT scans. TwinLiverNet, specifically, integrates 3D convolutions and capsule networks and is designed to process simultaneously late arterial and delayed phases from contrast-enhanced CTs. Experimental results carried out on a dataset consisting of 126 HCC lesions show that TwinLiverNet reaches an average accuracy of 82% in predicting complete response to TACE treatment. Furthermore, combining multiple CT phases (specifically, late arterial and delayed ones) yields a performance increase of over 12 percent points. Finally, the introduction of capsule layers into the model avoids the model to overfit, while enhancing accuracy.Clinical relevance— TwinLiverNet supports radiologists in visual inspection of CT scans to assess TACE treatment outcome, while reducing inter-operator variability.},

keywords = {ai},

pubstate = {published},

tppubtype = {inproceedings}

}

Marco Aldinucci, Valentina Cesare, Iacopo Colonnelli, Alberto Riccardo Martinelli, Gianluca Mittone, Barbara Cantalupo

Practical Parallelizazion of a Laplace Solver with MPI Proceedings Article

In: Iannone, Francesco (Ed.): ENEA CRESCO in the fight against COVID-19, pp. 21–24, ENEA, 2021.

Abstract | BibTeX | Tags: hpc4ai

Iacopo Colonnelli, Barbara Cantalupo, Concetto Spampinato, Matteo Pennisi, Marco Aldinucci

Bringing AI pipelines onto cloud-HPC: setting a baseline for accuracy of COVID-19 diagnosis Proceedings Article

In: Iannone, Francesco (Ed.): ENEA CRESCO in the fight against COVID-19, ENEA, 2021.

Abstract | Links | BibTeX | Tags: streamflow

Ovidio De Filippo, Jeehoon Kang, Francesco Bruno, Jung-Kyu Han, Andrea Saglietto, Han-Mo Yang, Giuseppe Patti, Kyung-Woo Park, Radoslaw Parma, Hyo-Soo Kim, Leonardo De Luca, Hyeon-Cheol Gwon, Mario Iannaccone, Woo Jung Chun, Grzegorz Smolka, Seung-Ho Hur, Enrico Cerrato, Seung Hwan Han, Carlo Mario, Young Bin Song, Javier Escaned, Ki Hong Choi, Gerard Helft, Joon-Hyung Doh, Alessandra Truffa Giachet, Soon-Jun Hong, Saverio Muscoli, Chang-Wook Nam, Guglielmo Gallone, Davide Capodanno, Daniela Trabattoni, Yoichi Imori, Veronica Dusi, Bernardo Cortese, Antonio Montefusco, Federico Conrotto, Iacopo Colonnelli, Imad Sheiban, Gaetano Maria Ferrari, Bon-Kwon Koo, Fabrizio D'Ascenzo

Benefit of Extended Dual Antiplatelet Therapy Duration in Acute Coronary Syndrome Patients Treated with Drug Eluting Stents for Coronary Bifurcation Lesions (from the BIFURCAT Registry) Journal Article

In: The American Journal of Cardiology, 2021, ISSN: 0002-9149.

Abstract | Links | BibTeX | Tags: ai

@article{21:ajc:bifurcat,

title = {Benefit of Extended Dual Antiplatelet Therapy Duration in Acute Coronary Syndrome Patients Treated with Drug Eluting Stents for Coronary Bifurcation Lesions (from the BIFURCAT Registry)},

author = {Ovidio De Filippo and Jeehoon Kang and Francesco Bruno and Jung-Kyu Han and Andrea Saglietto and Han-Mo Yang and Giuseppe Patti and Kyung-Woo Park and Radoslaw Parma and Hyo-Soo Kim and Leonardo De Luca and Hyeon-Cheol Gwon and Mario Iannaccone and Woo Jung Chun and Grzegorz Smolka and Seung-Ho Hur and Enrico Cerrato and Seung Hwan Han and Carlo Mario and Young Bin Song and Javier Escaned and Ki Hong Choi and Gerard Helft and Joon-Hyung Doh and Alessandra Truffa Giachet and Soon-Jun Hong and Saverio Muscoli and Chang-Wook Nam and Guglielmo Gallone and Davide Capodanno and Daniela Trabattoni and Yoichi Imori and Veronica Dusi and Bernardo Cortese and Antonio Montefusco and Federico Conrotto and Iacopo Colonnelli and Imad Sheiban and Gaetano Maria Ferrari and Bon-Kwon Koo and Fabrizio D'Ascenzo},

url = {https://www.sciencedirect.com/science/article/pii/S0002914921006354},

doi = {10.1016/j.amjcard.2021.07.005},

issn = {0002-9149},

year  = {2021},

date = {2021-01-01},

journal = {The American Journal of Cardiology},

abstract = {Optimal dual antiplatelet therapy (DAPT) duration for patients undergoing percutaneous coronary intervention (PCI) for coronary bifurcations is an unmet issue. The BIFURCAT registry was obtained by merging two registries on coronary bifurcations. Three groups were compared in a two-by-two fashion: short-term DAPT (≤ 6 months), intermediate-term DAPT (6-12 months) and extended DAPT (>12 months). Major adverse cardiac events (MACE) (a composite of all-cause death, myocardial infarction (MI), target-lesion revascularization and stent thrombosis) were the primary endpoint. Single components of MACE were the secondary endpoints. Events were appraised according to the clinical presentation: chronic coronary syndrome (CCS) versus acute coronary syndrome (ACS). 5537 patients (3231 ACS, 2306 CCS) were included. After a median follow-up of 2.1 years (IQR 0.9-2.2), extended DAPT was associated with a lower incidence of MACE compared with intermediate-term DAPT (2.8% versus 3.4%, adjusted HR 0.23 [0.1-0.54], p <0.001), driven by a reduction of all-cause death in the ACS cohort. In the CCS cohort, an extended DAPT strategy was not associated with a reduced risk of MACE. In conclusion, among real-world patients receiving PCI for coronary bifurcation, an extended DAPT strategy was associated with a reduction of MACE in ACS but not in CCS patients.},

keywords = {ai},

pubstate = {published},

tppubtype = {article}

}

Yasir Arfat, Gianluca Mittone, Roberto Esposito, Barbara Cantalupo, Gaetano Maria De Ferrari, Marco Aldinucci

A Review of Machine Learning for Cardiology Journal Article

In: Minerva cardiology and angiology, 2021.

Abstract | Links | BibTeX | Tags: deephealth, hpc4ai

@article{21:ai4numbers:minerva,

title = {A Review of Machine Learning for Cardiology},

author = {Yasir Arfat and Gianluca Mittone and Roberto Esposito and Barbara Cantalupo and Gaetano Maria De Ferrari and Marco Aldinucci},

url = {https://iris.unito.it/retrieve/handle/2318/1796298/780512/21_AI4numbers-preprint.pdf},

doi = {10.23736/s2724-5683.21.05709-4},

year  = {2021},

date = {2021-01-01},

journal = {Minerva cardiology and angiology},

abstract = {This paper reviews recent cardiology literature and reports how Artificial Intelligence Tools (specifically, Machine Learning techniques) are being used by physicians in the field. Each technique is introduced with enough details to allow the understanding of how it works and its intent, but without delving into details that do not add immediate benefits and require expertise in the field. We specifically focus on the principal Machine Learning based risk scores used in cardiovascular research. After introducing them and summarizing their assumptions and biases, we discuss their merits and shortcomings. We report on how frequently they are adopted in the field and suggest why this is the case based on our expertise in Machine Learning. We complete the analysis by reviewing how corresponding statistical approaches compare with them. Finally, we discuss the main open issues in applying Machine Learning tools to cardiology tasks, also drafting possible future directions. Despite the growing interest in these tools, we argue that there are many still underutilized techniques: while Neural Networks are slowly being incorporated in cardiovascular research, other important techniques such as Semi-Supervised Learning and Federated Learning are still underutilized. The former would allow practitioners to harness the information contained in large datasets that are only partially labeled, while the latter would foster collaboration between institutions allowing building larger and better models.},

keywords = {deephealth, hpc4ai},

pubstate = {published},

tppubtype = {article}

}

Marco Aldinucci, Valentina Cesare, Iacopo Colonnelli, Alberto Riccardo Martinelli, Gianluca Mittone, Barbara Cantalupo, Carlo Cavazzoni, Maurizio Drocco

Practical Parallelization of Scientific Applications with OpenMP, OpenACC and MPI Journal Article

In: Journal of Parallel and Distributed Computing, vol. 157, pp. 13–29, 2021.

Abstract | Links | BibTeX | Tags: HPC

Daniele D'Agostino, Ivan Merelli, Marco Aldinucci, Daniele Cesini

Hardware and Software Solutions for Energy-Efficient Computing in Scientific Programming Journal Article

In: Scientific Programming, vol. 2021, pp. 5514284, 2021, ISBN: 1058-9244.

Abstract | Links | BibTeX | Tags: HPC

Daniele D'Agostino, Pietro Liò, Marco Aldinucci, Ivan Merelli

Advantages of using graph databases to explore chromatin conformation capture experiments Journal Article

In: BMC Bioinformatics, vol. 22, no. 2, pp. 43–58, 2021, ISBN: 1471-2105.

Abstract | Links | BibTeX | Tags: deephealth, hpc4ai

Marco Aldinucci

L'infrastruttura necessaria per creare interoperabilità tra pubbliche amministrazioni Book Section

In: Perin, Roberto Cavallo (Ed.): L'amministrazione pubblica con i big data: da Torino un dibattito sull'intelligenza artificiale, pp. 225–232, 2021, ISBN: 9788875901806.

Abstract | Links | BibTeX | Tags:

Matteo Pennisi, Isaak Kavasidis, Concetto Spampinato, Vincenzo Schinina, Simone Palazzo, Federica Proietto Salanitri, Giovanni Bellitto, Francesco Rundo, Marco Aldinucci, Massimo Cristofaro, others

An Explainable AI System for Automated COVID-19 Assessment and Lesion Categorization from CT-scans Journal Article

In: Artificial Intelligence in Medicine, pp. 102114, 2021.

Abstract | Links | BibTeX | Tags: ai

@article{pennisi2021explainable,

title = {An Explainable AI System for Automated COVID-19 Assessment and Lesion Categorization from CT-scans},

author = {Matteo Pennisi and Isaak Kavasidis and Concetto Spampinato and Vincenzo Schinina and Simone Palazzo and Federica Proietto Salanitri and Giovanni Bellitto and Francesco Rundo and Marco Aldinucci and Massimo Cristofaro and others},

url = {https://iris.unito.it/retrieve/handle/2318/1792619/770952/2021_COVID_AIM_preprint.pdf},

doi = {10.1016/j.artmed.2021.102114},

year  = {2021},

date = {2021-01-01},

journal = {Artificial Intelligence in Medicine},

pages = {102114},

publisher = {Elsevier},

abstract = {COVID-19 infection caused by SARS-CoV-2 pathogen has been a catastrophic pandemic outbreak all over the world, with exponential increasing of confirmed cases and, unfortunately, deaths. In this work we propose an AI-powered pipeline, based on the deep-learning paradigm, for automated COVID-19 detection and lesion categorization from CT scans. We first propose a new segmentation module aimed at automatically identifying lung parenchyma and lobes. Next, we combine the segmentation network with classification networks for COVID-19 identification and lesion categorization. We compare the model's classification results with those obtained by three expert radiologists on a dataset of 166 CT scans. Results showed a sensitivity of 90.3% and a specificity of 93.5% for COVID-19 detection, at least on par with those yielded by the expert radiologists, and an average lesion categorization accuracy of about 84%. Moreover, a significant role is played by prior lung and lobe segmentation, that allowed us to enhance classification performance by over 6 percent points. The interpretation of the trained AI models reveals that the most significant areas for supporting the decision on COVID-19 identification are consistent with the lesions clinically associated to the virus, i.e., crazy paving, consolidation and ground glass. This means that the artificial models are able to discriminate a positive patient from a negative one (both controls and patients with interstitial pneumonia tested negative to COVID) by evaluating the presence of those lesions into CT scans. Finally, the AI models are integrated into a user-friendly GUI to support AI explainability for radiologists, which is publicly available at http://perceivelab.com/covid-ai. The whole AI system is unique since, to the best of our knowledge, it is the first AI-based software, publicly available, that attempts to explain to radiologists what information is used by AI methods for making decisions and that proactively involves them in the decision loop to further improve the COVID-19 understanding.},

keywords = {ai},

pubstate = {published},

tppubtype = {article}

}

COVID-19 infection caused by SARS-CoV-2 pathogen has been a catastrophic pandemic outbreak all over the world, with exponential increasing of confirmed cases and, unfortunately, deaths. In this work we propose an AI-powered pipeline, based on the deep-learning paradigm, for automated COVID-19 detection and lesion categorization from CT scans. We first propose a new segmentation module aimed at automatically identifying lung parenchyma and lobes. Next, we combine the segmentation network with classification networks for COVID-19 identification and lesion categorization. We compare the model's classification results with those obtained by three expert radiologists on a dataset of 166 CT scans. Results showed a sensitivity of 90.3% and a specificity of 93.5% for COVID-19 detection, at least on par with those yielded by the expert radiologists, and an average lesion categorization accuracy of about 84%. Moreover, a significant role is played by prior lung and lobe segmentation, that allowed us to enhance classification performance by over 6 percent points. The interpretation of the trained AI models reveals that the most significant areas for supporting the decision on COVID-19 identification are consistent with the lesions clinically associated to the virus, i.e., crazy paving, consolidation and ground glass. This means that the artificial models are able to discriminate a positive patient from a negative one (both controls and patients with interstitial pneumonia tested negative to COVID) by evaluating the presence of those lesions into CT scans. Finally, the AI models are integrated into a user-friendly GUI to support AI explainability for radiologists, which is publicly available at http://perceivelab.com/covid-ai. The whole AI system is unique since, to the best of our knowledge, it is the first AI-based software, publicly available, that attempts to explain to radiologists what information is used by AI methods for making decisions and that proactively involves them in the decision loop to further improve the COVID-19 understanding.

Zilong Zhao, Robert Birke, Rui Han, Bogdan Robu, Sara Bouchenak, Sonia Ben Mokhtar, Lydia Y. Chen

Enhancing Robustness of On-Line Learning Models on Highly Noisy Data Journal Article

In: IEEE Trans. Dependable Secur. Comput., vol. 18, no. 5, pp. 2177–2192, 2021.

Abstract | Links | BibTeX | Tags: ai

@article{ZhaoBHRBMC21,

title = {Enhancing Robustness of On-Line Learning Models on Highly Noisy Data},

author = {Zilong Zhao and Robert Birke and Rui Han and Bogdan Robu and Sara Bouchenak and Sonia Ben Mokhtar and Lydia Y. Chen},

url = {https://doi.org/10.1109/TDSC.2021.3063947},

doi = {10.1109/TDSC.2021.3063947},

year  = {2021},

date = {2021-01-01},

journal = {IEEE Trans. Dependable Secur. Comput.},

volume = {18},

number = {5},

pages = {2177–2192},

abstract = {Classification algorithms have been widely adopted to detect anomalies for various systems, e.g., IoT, cloud and face recognition, under the common assumption that the data source is clean, i.e., features and labels are correctly set. However, data collected from the wild can be unreliable due to careless annotations or malicious data transformation for incorrect anomaly detection. In this article, we extend a two-layer on-line data selection framework: Robust Anomaly Detector (RAD) with a newly designed ensemble prediction where both layers contribute to the final anomaly detection decision. To adapt to the on-line nature of anomaly detection, we consider additional features of conflicting opinions of classifiers, repetitive cleaning, and oracle knowledge. We on-line learn from incoming data streams and continuously cleanse the data, so as to adapt to the increasing learning capacity from the larger accumulated data set. Moreover, we explore the concept of oracle learning that provides additional information of true labels for difficult data points. We specifically focus on three use cases, (i) detecting 10 classes of IoT attacks, (ii) predicting 4 classes of task failures of big data jobs, and (iii) recognising 100 celebrities faces. Our evaluation results show that RAD can robustly improve the accuracy of anomaly detection, to reach up to 98.95 percent for IoT device attacks (i.e., +7%), up to 85.03 percent for cloud task failures (i.e., +14%) under 40 percent label noise, and for its extension, it can reach up to 77.51 percent for face recognition (i.e., +39%) under 30 percent label noise. The proposed RAD and its extensions are general and can be applied to different anomaly detection algorithms.},

keywords = {ai},

pubstate = {published},

tppubtype = {article}

}

Robert Birke, Juan F. Pérez, Zhan Qiu, Mathias Björkqvist, Lydia Y. Chen

sPARE: Partial Replication for Multi-Tier Applications in the Cloud Journal Article

In: IEEE Trans. Serv. Comput., vol. 14, no. 2, pp. 574–588, 2021.

Abstract | Links | BibTeX | Tags: parallel

Iacopo Colonnelli, Barbara Cantalupo, Roberto Esposito, Matteo Pennisi, Concetto Spampinato, Marco Aldinucci

HPC Application Cloudification: The StreamFlow Toolkit Proceedings Article

In: Bispo, João, Cherubin, Stefano, Flich, José (Ed.): 12th Workshop on Parallel Programming and Run-Time Management Techniques for Many-core Architectures and 10th Workshop on Design Tools and Architectures for Multicore Embedded Computing Platforms (PARMA-DITAM 2021), pp. 5:1–5:13, Schloss Dagstuhl – Leibniz-Zentrum für Informatik, Dagstuhl, Germany, 2021, ISSN: 2190-6807.

Abstract | Links | BibTeX | Tags: deephealth, hpc4ai, streamflow

@inproceedings{colonnelli_et_al:OASIcs.PARMA-DITAM.2021.5,

title = {HPC Application Cloudification: The StreamFlow Toolkit},

author = {Iacopo Colonnelli and Barbara Cantalupo and Roberto Esposito and Matteo Pennisi and Concetto Spampinato and Marco Aldinucci},

editor = {João Bispo and Stefano Cherubin and José Flich},

url = {https://drops.dagstuhl.de/opus/volltexte/2021/13641/pdf/OASIcs-PARMA-DITAM-2021-5.pdf},

doi = {10.4230/OASIcs.PARMA-DITAM.2021.5},

issn = {2190-6807},

year  = {2021},

date = {2021-01-01},

booktitle = {12th Workshop on Parallel Programming and Run-Time Management Techniques for Many-core Architectures and 10th Workshop on Design Tools and Architectures for Multicore Embedded Computing Platforms (PARMA-DITAM 2021)},

volume = {88},

pages = {5:1–5:13},

publisher = {Schloss Dagstuhl – Leibniz-Zentrum für Informatik},

address = {Dagstuhl, Germany},

series = {Open Access Series in Informatics (OASIcs)},

abstract = {Finding an effective way to improve accessibility to High-Performance Computing facilities, still anchored to SSH-based remote shells and queue-based job submission mechanisms, is an open problem in computer science. This work advocates a cloudification of HPC applications through a cluster-as-accelerator pattern, where computationally demanding portions of the main execution flow hosted on a Cloud Finding an effective way to improve accessibility to High-Performance Computing facilities, still anchored to SSH-based remote shells and queue-based job submission mechanisms, is an open problem in computer science. This work advocates a cloudification of HPC applications through a cluster-as-accelerator pattern, where computationally demanding portions of the main execution flow hosted on a Cloud infrastructure can be offloaded to HPC environments to speed them up. We introduce StreamFlow, a novel Workflow Management System that supports such a design pattern and makes it possible to run the steps of a standard workflow model on independent processing elements with no shared storage. We validated the proposed approach's effectiveness on the CLAIRE COVID-19 universal pipeline, i.e. a reproducible workflow capable of automating the comparison of (possibly all) state-of-the-art pipelines for the diagnosis of COVID-19 interstitial pneumonia from CT scans images based on Deep Neural Networks (DNNs).},

keywords = {deephealth, hpc4ai, streamflow},

pubstate = {published},

tppubtype = {inproceedings}

}

Fabrizio D'Ascenzo, Ovidio De Filippo, Guglielmo Gallone, Gianluca Mittone, Marco Agostino Deriu, Mario Iannaccone, Albert Ariza-Solé, Christoph Liebetrau, Sergio Manzano-Fernández, Giorgio Quadri, Tim Kinnaird, Gianluca Campo, Jose Paulo Simao Henriques, James M Hughes, Alberto Dominguez-Rodriguez, Marco Aldinucci, Umberto Morbiducci, Giuseppe Patti, Sergio Raposeiras-Roubin, Emad Abu-Assi, Gaetano Maria De Ferrari, Francesco Piroli, Andrea Saglietto, Federico Conrotto, Pierluigi Omedé, Antonio Montefusco, Mauro Pennone, Francesco Bruno, Pier Paolo Bocchino, Giacomo Boccuzzi, Enrico Cerrato, Ferdinando Varbella, Michela Sperti, Stephen B. Wilton, Lazar Velicki, Ioanna Xanthopoulou, Angel Cequier, Andres Iniguez-Romo, Isabel Munoz Pousa, Maria Cespon Fernandez, Berenice Caneiro Queija, Rafael Cobas-Paz, Angel Lopez-Cuenca, Alberto Garay, Pedro Flores Blanco, Andrea Rognoni, Giuseppe Biondi Zoccai, Simone Biscaglia, Ivan Nunez-Gil, Toshiharu Fujii, Alessandro Durante, Xiantao Song, Tetsuma Kawaji, Dimitrios Alexopoulos, Zenon Huczek, Jose Ramon Gonzalez Juanatey, Shao-Ping Nie, Masa-aki Kawashiri, Iacopo Colonnelli, Barbara Cantalupo, Roberto Esposito, Sergio Leonardi, Walter Grosso Marra, Alaide Chieffo, Umberto Michelucci, Dario Piga, Marta Malavolta, Sebastiano Gili, Marco Mennuni, Claudio Montalto, Luigi Oltrona Visconti, Yasir Arfat

Machine learning-based prediction of adverse events following an acute coronary syndrome (PRAISE): a modelling study of pooled datasets Journal Article

In: The Lancet, vol. 397, no. 10270, pp. 199–207, 2021, ISSN: 0140-6736.

Abstract | Links | BibTeX | Tags: deephealth, hpc4ai

@article{21:lancet,

title = {Machine learning-based prediction of adverse events following an acute coronary syndrome (PRAISE): a modelling study of pooled datasets},

author = {Fabrizio D'Ascenzo and Ovidio De Filippo and Guglielmo Gallone and Gianluca Mittone and Marco Agostino Deriu and Mario Iannaccone and Albert Ariza-Solé and Christoph Liebetrau and Sergio Manzano-Fernández and Giorgio Quadri and Tim Kinnaird and Gianluca Campo and Jose Paulo Simao Henriques and James M Hughes and Alberto Dominguez-Rodriguez and Marco Aldinucci and Umberto Morbiducci and Giuseppe Patti and Sergio Raposeiras-Roubin and Emad Abu-Assi and Gaetano Maria De Ferrari and Francesco Piroli and Andrea Saglietto and Federico Conrotto and Pierluigi Omedé and Antonio Montefusco and Mauro Pennone and Francesco Bruno and Pier Paolo Bocchino and Giacomo Boccuzzi and Enrico Cerrato and Ferdinando Varbella and Michela Sperti and Stephen B. Wilton and Lazar Velicki and Ioanna Xanthopoulou and Angel Cequier and Andres Iniguez-Romo and Isabel Munoz Pousa and Maria Cespon Fernandez and Berenice Caneiro Queija and Rafael Cobas-Paz and Angel Lopez-Cuenca and Alberto Garay and Pedro Flores Blanco and Andrea Rognoni and Giuseppe Biondi Zoccai and Simone Biscaglia and Ivan Nunez-Gil and Toshiharu Fujii and Alessandro Durante and Xiantao Song and Tetsuma Kawaji and Dimitrios Alexopoulos and Zenon Huczek and Jose Ramon Gonzalez Juanatey and Shao-Ping Nie and Masa-aki Kawashiri and Iacopo Colonnelli and Barbara Cantalupo and Roberto Esposito and Sergio Leonardi and Walter Grosso Marra and Alaide Chieffo and Umberto Michelucci and Dario Piga and Marta Malavolta and Sebastiano Gili and Marco Mennuni and Claudio Montalto and Luigi Oltrona Visconti and Yasir Arfat},

url = {https://www.researchgate.net/profile/James_Hughes3/publication/348501148_Machine_learning-based_prediction_of_adverse_events_following_an_acute_coronary_syndrome_PRAISE_a_modelling_study_of_pooled_datasets/links/6002a81ba6fdccdcb858b6c2/Machine-learning-based-prediction-of-adverse-events-following-an-acute-coronary-syndrome-PRAISE-a-modelling-study-of-pooled-datasets.pdf},

doi = {10.1016/S0140-6736(20)32519-8},

issn = {0140-6736},

year  = {2021},

date = {2021-01-01},

journal = {The Lancet},

volume = {397},

number = {10270},

pages = {199–207},

abstract = {Background The accuracy of current prediction tools for ischaemic and bleeding events after an acute coronary syndrome (ACS) remains insufficient for individualised patient management strategies. We developed a machine learning-based risk stratification model to predict all-cause death, recurrent acute myocardial infarction, and major bleeding after ACS. Methods Different machine learning models for the prediction of 1-year post-discharge all-cause death, myocardial infarction, and major bleeding (defined as Bleeding Academic Research Consortium type 3 or 5) were trained on a cohort of 19826 adult patients with ACS (split into a training cohort [80%] and internal validation cohort [20%]) from the BleeMACS and RENAMI registries, which included patients across several continents. 25 clinical features routinely assessed at discharge were used to inform the models. The best-performing model for each study outcome (the PRAISE score) was tested in an external validation cohort of 3444 patients with ACS pooled from a randomised controlled trial and three prospective registries. Model performance was assessed according to a range of learning metrics including area under the receiver operating characteristic curve (AUC). Findings The PRAISE score showed an AUC of 0.82 (95% CI 0.78-0.85) in the internal validation cohort and 0.92 (0.90-0.93) in the external validation cohort for 1-year all-cause death; an AUC of 0.74 (0.70-0.78) in the internal validation cohort and 0.81 (0.76-0.85) in the external validation cohort for 1-year myocardial infarction; and an AUC of 0.70 (0.66-0.75) in the internal validation cohort and 0.86 (0.82-0.89) in the external validation cohort for 1-year major bleeding. Interpretation A machine learning-based approach for the identification of predictors of events after an ACS is feasible and effective. The PRAISE score showed accurate discriminative capabilities for the prediction of all-cause death, myocardial infarction, and major bleeding, and might be useful to guide clinical decision making.},

keywords = {deephealth, hpc4ai},

pubstate = {published},

tppubtype = {article}

}

Background The accuracy of current prediction tools for ischaemic and bleeding events after an acute coronary syndrome (ACS) remains insufficient for individualised patient management strategies. We developed a machine learning-based risk stratification model to predict all-cause death, recurrent acute myocardial infarction, and major bleeding after ACS. Methods Different machine learning models for the prediction of 1-year post-discharge all-cause death, myocardial infarction, and major bleeding (defined as Bleeding Academic Research Consortium type 3 or 5) were trained on a cohort of 19826 adult patients with ACS (split into a training cohort [80%] and internal validation cohort [20%]) from the BleeMACS and RENAMI registries, which included patients across several continents. 25 clinical features routinely assessed at discharge were used to inform the models. The best-performing model for each study outcome (the PRAISE score) was tested in an external validation cohort of 3444 patients with ACS pooled from a randomised controlled trial and three prospective registries. Model performance was assessed according to a range of learning metrics including area under the receiver operating characteristic curve (AUC). Findings The PRAISE score showed an AUC of 0.82 (95% CI 0.78-0.85) in the internal validation cohort and 0.92 (0.90-0.93) in the external validation cohort for 1-year all-cause death; an AUC of 0.74 (0.70-0.78) in the internal validation cohort and 0.81 (0.76-0.85) in the external validation cohort for 1-year myocardial infarction; and an AUC of 0.70 (0.66-0.75) in the internal validation cohort and 0.86 (0.82-0.89) in the external validation cohort for 1-year major bleeding. Interpretation A machine learning-based approach for the identification of predictors of events after an ACS is feasible and effective. The PRAISE score showed accurate discriminative capabilities for the prediction of all-cause death, myocardial infarction, and major bleeding, and might be useful to guide clinical decision making.

Iacopo Colonnelli, Barbara Cantalupo, Ivan Merelli, Marco Aldinucci

StreamFlow: cross-breeding cloud with HPC Journal Article

In: IEEE Transactions on Emerging Topics in Computing, vol. 9, no. 4, pp. 1723–1737, 2021.

Abstract | Links | BibTeX | Tags: deephealth, hpc4ai, streamflow

2020

Adriano Marques Garcia, Matheus Serpa, Dalvan Griebler, Claudio Schepke, Luiz Gustavo Fernandes, Philippe O. A. Navaux

The Impact of CPU Frequency Scaling on Power Consumption of Computing Infrastructures Proceedings Article

In: International Conference on Computational Science and its Applications (ICCSA), pp. 142-157, Springer, Cagliari, Italy, 2020.

Abstract | Links | BibTeX | Tags:

@inproceedings{GARCIA:ICCSA:20,

title = {The Impact of CPU Frequency Scaling on Power Consumption of Computing Infrastructures},

author = {Adriano Marques Garcia and Matheus Serpa and Dalvan Griebler and Claudio Schepke and Luiz Gustavo Fernandes and Philippe O. A. Navaux},

url = {https://iris.unito.it/retrieve/3b8f3dc0-cd4d-4f36-801d-9e8c613ea2e8/ICCSA_Energy_governors_preprint.pdf},

doi = {10.1007/978-3-030-58817-5_12},

year  = {2020},

date = {2020-07-01},

booktitle = {International Conference on Computational Science and its Applications (ICCSA)},

volume = {12254},

pages = {142-157},

publisher = {Springer},

address = {Cagliari, Italy},

series = {ICCSA'20},

abstract = {Since the demand for computing power increases, new architectures emerged to obtain better performance. Reducing the power and energy consumption of these architectures is one of the main challenges to achieving high-performance computing. Current research trends aim at developing new software and hardware techniques to achieve the best performance and energy trade-offs. In this work, we investigate the impact of different CPU frequency scaling techniques such as ondemand, performance, and powersave on the power and energy consumption of multi-core based computer infrastructure. We apply these techniques in PAMPAR, a parallel benchmark suite implemented in PThreads, OpenMP, MPI-1, and MPI-2 (spawn). We measure the energy and execution time of 10 benchmarks, varying the number of threads. Our results show that although powersave consumes up to 43.1% less power than performance and ondemand governors, it consumes the triple of energy due to the high execution time. Our experiments also show that the performance governor consumes up to 9.8% more energy than ondemand for CPU-bound benchmarks. Finally, our results show that PThreads has the lowest power consumption, consuming less than the sequential version for memory-bound benchmarks. Regarding performance, the performance governor achieved 3% of performance over the ondemand.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Doriana Medić, Claudio Antares Mezzina, Iain Phillips, Nobuko Yoshida

A parametric framework for reversible emph(pi)-calculi Journal Article

In: Information and Computation, vol. 275, pp. 104644, 2020.

Abstract | Links | BibTeX | Tags: semantics

Ivan Lanese, Doriana Medić

A General Approach to Derive Uncontrolled Reversible Semantics Proceedings Article

In: 31st International Conference on Concurrency Theory, CONCUR 2020, September 1-4, 2020, Vienna, Austria (Virtual Conference), pp. 33:1–33:24, Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2020.

Abstract | Links | BibTeX | Tags: semantics

@inproceedings{20:concur:LaneseM20,

title = {A General Approach to Derive Uncontrolled Reversible Semantics},

author = {Ivan Lanese and Doriana Medić},

url = {https://doi.org/10.4230/LIPIcs.CONCUR.2020.33},

doi = {10.4230/LIPIcs.CONCUR.2020.33},

year  = {2020},

date = {2020-01-01},

booktitle = {31st International Conference on Concurrency Theory, CONCUR 2020, September 1-4, 2020, Vienna, Austria (Virtual Conference)},

volume = {171},

pages = {33:1–33:24},

publisher = {Schloss Dagstuhl - Leibniz-Zentrum für Informatik},

series = {LIPIcs},

abstract = {Reversible computing is a paradigm where programs can execute backward as well as in the usual forward direction. Reversible computing is attracting interest due to its applications in areas as different as biochemical modelling, simulation, robotics and debugging, among others. In concurrent systems the main notion of reversible computing is called causal-consistent reversibility, and it allows one to undo an action if and only if its consequences, if any, have already been undone. This paper presents a general and automatic technique to define a causal-consistent reversible extension for given forward models. We support models defined using a reduction semantics in a specific format and consider a causality relation based on resources consumed and produced. The considered format is general enough to fit many formalisms studied in the literature on causal-consistent reversibility, notably Higher-Order ?-calculus and Core Erlang, an intermediate language in the Erlang compilation. Reversible extensions of these models in the literature are ad hoc, while we build them using the same general technique. This also allows us to show in a uniform way that a number of relevant properties, causal-consistency in particular, hold in the reversible extensions we build. Our technique also allows us to go beyond the reversible models in the literature: we cover a larger fragment of Core Erlang, including remote error handling based on links, which has never been considered in the reversibility literature.},

keywords = {semantics},

pubstate = {published},

tppubtype = {inproceedings}

}

Doriana Medić, Claudio Antares Mezzina, Iain Phillips, Nobuko Yoshida

Towards a Formal Account for Software Transactional Memory Proceedings Article

In: Reversible Computation - 12th International Conference, RC 2020, Oslo, Norway, July 9-10, 2020, Proceedings, pp. 255–263, Springer, 2020.

Abstract | Links | BibTeX | Tags: semantics

Vasco Amaral, Beatriz Norberto, Miguel Goulão, Marco Aldinucci, Siegfried Benkner, Andrea Bracciali, Paulo Carreira, Edgars Celms, Luís Correia, Clemens Grelck, Helen Karatza, Christoph Kessler, Peter Kilpatrick, Hugo Martiniano, Ilias Mavridis, Sabri Pllana, Ana Respício, José Simão, Luís Veiga, Ari Visa

Programming languages for data-Intensive HPC applications: A systematic mapping study Journal Article

In: Parallel Computing, pp. 102584, 2020, ISSN: 0167-8191.

Abstract | Links | BibTeX | Tags: HPC

@article{20:sms:chipset,

title = {Programming languages for data-Intensive HPC applications: A systematic mapping study},

author = {Vasco Amaral and Beatriz Norberto and Miguel Goulão and Marco Aldinucci and Siegfried Benkner and Andrea Bracciali and Paulo Carreira and Edgars Celms and Luís Correia and Clemens Grelck and Helen Karatza and Christoph Kessler and Peter Kilpatrick and Hugo Martiniano and Ilias Mavridis and Sabri Pllana and Ana Respício and José Simão and Luís Veiga and Ari Visa},

url = {https://iris.unito.it/retrieve/689605/1-s2.0-S0167819119301759-main.pdf},

doi = {https://doi.org/10.1016/j.parco.2019.102584},

issn = {0167-8191},

year  = {2020},

date = {2020-01-01},

journal = {Parallel Computing},

pages = {102584},

abstract = {A major challenge in modelling and simulation is the need to combine expertise in both software technologies and a given scientific domain. When High-Performance Computing (HPC) is required to solve a scientific problem, software development becomes a problematic issue. Considering the complexity of the software for HPC, it is useful to identify programming languages that can be used to alleviate this issue. Because the existing literature on the topic of HPC is very dispersed, we performed a Systematic Mapping Study (SMS) in the context of the European COST Action cHiPSet. This literature study maps characteristics of various programming languages for data-intensive HPC applications, including category, typical user profiles, effectiveness, and type of articles. We organised the SMS in two phases. In the first phase, relevant articles are identified employing an automated keyword-based search in eight digital libraries. This lead to an initial sample of 420 papers, which was then narrowed down in a second phase by human inspection of article abstracts, titles and projects to 152 relevant articles published in the period 2006–2018. The analysis of these articles enabled us to identify 26 programming languages referred to in 33 of relevant articles. We compared the outcome of the mapping study with results of our questionnaire-based survey that involved 57 HPC experts. The mapping study and the survey revealed that the desired features of programming languages for data-intensive HPC applications are portability, performance and usability. Furthermore, we observed that the majority of the programming languages used in the context of data-intensive HPC applications are text-based general-purpose programming languages. Typically these have a steep learning curve, which makes them difficult to adopt. We believe that the outcome of this study will inspire future research and development in programming languages for data-intensive HPC applications.},

keywords = {HPC},

pubstate = {published},

tppubtype = {article}

}

Daniele D'Agostino, Pietro Liò, Marco Aldinucci, Ivan Merelli

NeoHiC: A web application for the analysis of Hi-C data Proceedings Article

In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), pp. 98–107, 2020, ISBN: 978-3-030-63061-4.

Abstract | Links | BibTeX | Tags:

Paul Metzger, Murray Cole, Christian Fensch, Marco Aldinucci, Enrico Bini

Enforcing Deadlines for Skeleton-based Parallel Programming Proceedings Article

In: 26th IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS), Sydney, Australia, 2020.

Abstract | Links | BibTeX | Tags:

@inproceedings{20:farm:rtas,

title = {Enforcing Deadlines for Skeleton-based Parallel Programming},

author = {Paul Metzger and Murray Cole and Christian Fensch and Marco Aldinucci and Enrico Bini},

url = {https://iris.unito.it/retrieve/handle/2318/1741320/616056/20_ske_RTAS.pdf},

doi = {10.1109/RTAS48715.2020.000-7},

year  = {2020},

date = {2020-01-01},

booktitle = {26th IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS)},

address = {Sydney, Australia},

abstract = {High throughput applications with real-time guar- antees are increasingly relevant. For these applications, parallelism must be exposed to meet deadlines. Directed Acyclic Graphs (DAGs) are a popular and very general application model that can capture any possible interaction among threads. However, we argue that by constraining the application structure to a set of composable ``skeletons'', at the price of losing some generality w.r.t. DAGs, the following advantages are gained: (i) a finer model of the application enables tighter analysis, (ii) specialised scheduling policies are applicable, (iii) programming is simplified, (iv) specialised implementation techniques can be exploited transparently, and (v) the program can be automatically tuned to minimise resource usage while still meeting its hard deadlines. As a first step towards a set of real-time skeletons we conduct a case study with the job farm skeleton and the hard real- time XMOS xCore-200 microcontroller. We present an analytical framework for job farms that reduces the number of required cores by scheduling jobs in batches, while ensuring that deadlines are still met. Our experimental results demonstrate that batching reduces the minimum sustainable period by up to 22%, leading to a reduced number of required cores. The framework chooses the best parameters in 83% of cases and never selects parameters that cause deadline misses. Finally, we show that the overheads introduced by the skeleton abstraction layer are negligible.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Vincent Reniers, Yuan Gao, Ren Zhang, Paolo Viviani, Akash Madhusudan, Bert Lagaisse, Svetla Nikova, Dimitri Van Landuyt, Riccardo Lombardi, Bart Preneel, Wouter Joosen

Authenticated and Auditable Data Sharing via Smart Contract Proceedings Article

In: Proceedings of the 35th ACM/SIGAPP Symposium on Applied Computing, pp. 1–8, ACM, Brno, Czech Republic, 2020, ISBN: xxx-x-xxxx-xxxx-x.

Abstract | BibTeX | Tags:

Valentina Cesare, Iacopo Colonnelli, Marco Aldinucci

Practical Parallelization of Scientific Applications Proceedings Article

In: Proc. of 28th Euromicro Intl. Conference on Parallel Distributed and network-based Processing (PDP), pp. 376–384, IEEE, Västerås, Sweden, 2020.

Abstract | Links | BibTeX | Tags: c3s, hpc4ai

Jose Daniel Garcia, Jose Daniel Rio, Marco Aldinucci, Fabio Tordini, Marco Danelutto, Gabriele Mencagli, Massimo Torquati

Challenging the abstraction penalty in parallel patterns libraries: Adding FastFlow support to GrPPI Journal Article

In: The Journal of Supercomputing, vol. 76, no. 7, pp. 5139–5159, 2020.

Abstract | Links | BibTeX | Tags: fastflow, rephrase

@article{19:jsupe:grppi,

title = {Challenging the abstraction penalty in parallel patterns libraries: Adding FastFlow support to GrPPI},

author = {Jose Daniel Garcia and Jose Daniel Rio and Marco Aldinucci and Fabio Tordini and Marco Danelutto and Gabriele Mencagli and Massimo Torquati},

url = {https://iris.unito.it/retrieve/handle/2318/1762686/744894/2020-js-grppi-postprint.pdf},

doi = {10.1007/s11227-019-02826-5},

year  = {2020},

date = {2020-01-01},

journal = {The Journal of Supercomputing},

volume = {76},

number = {7},

pages = {5139–5159},

abstract = {In the last years, pattern-based programming has been recognized as a good practice for efficiently exploiting parallel hardware resources. Following this approach, multiple libraries have been designed for providing such high-level abstractions to ease the parallel programming. However, those libraries do not share a common interface. To pave the way, GrPPI has been designed for providing an intermediate abstraction layer between application developers and existing parallel programming frameworks like OpenMP, Intel TBB or ISO C++ threads. On the other hand, FastFlow has been adopted as an efficient object-based programming framework that may benefit from being supported as an additional GrPPI backend. However, the object-based approach presents some major challenges to be incorporated under the GrPPI type safe functional programming style. In this paper, we present the integration of FastFlow as a new GrPPI backend to demonstrate that structured parallel programming frameworks perfectly fit the GrPPI design. Additionally, we also demonstrate that GrPPI does not incur in additional overheads for providing its abstraction layer, and we study the programmability in terms of lines of code and cyclomatic complexity. In general, the presented work acts as reciprocal validation of both FastFlow (as an efficient, native structured parallel programming framework) and GrPPI (as an efficient abstraction layer on top of existing parallel programming frameworks).},

keywords = {fastflow, rephrase},

pubstate = {published},

tppubtype = {article}

}

2019

Adriano Marques Garcia, Claudio Schepke, Alessandro Gonçalves Girardi

PAMPAR: A new parallel benchmark for performance and energy consumption evaluation Journal Article

In: Concurrency and Computation: Practice and Experience, vol. 32, no. 20, pp. 1-21, 2019.

Abstract | Links | BibTeX | Tags:

Paolo Viviani

Deep Learning at Scale with Nearest Neighbours Communications PhD Thesis

Computer Science Department, University of Torino, 2019.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Stefano Bagnasco, Matteo Concas, Stefano Lusso, Sergio Rabellino, Danilo Demarchi, Sara Vallero

Managing a heterogeneous scientific computing cluster with cloud-like tools: ideas and experience Proceedings Article

In: European Physical Journal Web of Conferences, pp. 07030, 2019.

Abstract | Links | BibTeX | Tags:

@inproceedings{2019EPJWC.21407030A,

title = {Managing a heterogeneous scientific computing cluster with cloud-like tools: ideas and experience},

author = {Marco Aldinucci and Stefano Bagnasco and Matteo Concas and Stefano Lusso and Sergio Rabellino and Danilo Demarchi and Sara Vallero},

url = {https://iris.unito.it/retrieve/533279/epjconf_chep2018_07030.pdf},

doi = {10.1051/epjconf/201921407030},

year  = {2019},

date = {2019-07-01},

booktitle = {European Physical Journal Web of Conferences},

volume = {214},

pages = {07030},

series = {European Physical Journal Web of Conferences},

abstract = {Obtaining CPU cycles on an HPC cluster is nowadays relatively simple and sometimes even cheap for academic institutions. However, in most of the cases providers of HPC services would not allow changes on the configuration, implementation of special features or a lower-level control on the computing infrastructure, for example for testing experimental configurations. The variety of use cases proposed by several departments of the University of Torino, including ones from solid-state chemistry, computational biology, genomics and many others, called for different and sometimes conflicting configurations; furthermore, several R&D activities in the field of scientific computing, with topics ranging from GPU acceleration to Cloud Computing technologies, needed a platform to be carried out on. The Open Computing Cluster for Advanced data Manipulation (OCCAM) is a multi-purpose flexible HPC cluster designed and operated by a collaboration between the University of Torino and the Torino branch of the Istituto Nazionale di Fisica Nucleare. It is aimed at providing a flexible and reconfigurable infrastructure to cater to a wide range of different scientific computing needs, as well as a platform for R&D activities on computational technologies themselves. We describe some of the use cases that prompted the design and construction of the system, its architecture and a first characterisation of its performance by some synthetic benchmark tools and a few realistic use-case tests.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Adriano Marques Garcia, Claudio Schepke, Alessandro Gonçalves Girardi, Sherlon Almeida Silva

A New Parallel Benchmark for Performance Evaluation and Energy Consumption Proceedings Article

In: High Performance Computing for Computational Science – VECPAR 2018, pp. 188-201, Springer International Publishing, Cham, 2019, ISBN: 978-3-030-15996-2.

Abstract | Links | BibTeX | Tags:

Massimo Torquati, Daniele De Sensi, Gabriele Mencagli, Marco Aldinucci, Marco Danelutto

Power-Aware Pipelining with Automatic Concurrency Control Journal Article

In: Concurrency and Computation: Practice and Experience, vol. 31, no. 5, 2019.

Abstract | Links | BibTeX | Tags: rephrase

Marco Aldinucci, Maurizio Drocco, Claudia Misale, Guy Tremblay

Languages for Big Data analysis Book Chapter

In: Sakr, Sherif, Zomaya, Albert (Ed.): Encyclopedia of Big Data Technologies, Springer International Publishing, Cham, 2019, ISBN: 978-3-319-63962-8.

Abstract | Links | BibTeX | Tags: parallel

Marco Danelutto, Tiziano De Matteis, Daniele De Sensi, Gabriele Mencagli, Massimo Torquati, Marco Aldinucci, Peter Kilpatrick

The RePhrase Extended Pattern Set for Data Intensive Parallel Computing Journal Article

In: International Journal of Parallel Programming, vol. 47, no. 1, pp. 74–93, 2019.

Abstract | Links | BibTeX | Tags: fastflow, rephrase

Massimo Torquati, Gabriele Mencagli, Maurizio Drocco, Marco Aldinucci, Tiziano De Matteis, Marco Danelutto

On Dynamic Memory Allocation in Sliding-Window Parallel Patterns for Streaming Analytics Journal Article

In: The Journal of Supercomputing, vol. 75, no. 8, pp. 4114–4131, 2019.

Abstract | Links | BibTeX | Tags: fastflow, rephrase

Doriana Medić

Relative expressiveness of calculi for reversible concurrency Journal Article

In: Bull. EATCS, vol. 129, 2019.

Abstract | Links | BibTeX | Tags: semantics

Paolo Viviani, Maurizio Drocco, Daniele Baccega, Iacopo Colonnelli, Marco Aldinucci

Deep Learning at Scale Proceedings Article

In: Proc. of 27th Euromicro Intl. Conference on Parallel Distributed and network-based Processing (PDP), pp. 124–131, IEEE, Pavia, Italy, 2019.

Abstract | Links | BibTeX | Tags: ai

Ivan Merelli, Federico Fornari, Fabio Tordini, Daniele D'Agostino, Marco Aldinucci, Daniele Cesini

Exploiting Docker containers over Grid computing for a comprehensive study of chromatin conformation in different cell types Journal Article

In: Journal of Parallel and Distributed Computing, vol. 134, pp. 116–127, 2019, ISSN: 0743-7315.

Abstract | Links | BibTeX | Tags: bioinformatics

Clemens Grelck, Ewa Niewiadomska-Szynkiewicz, Marco Aldinucci, Andrea Bracciali, Elisabeth Larsson

Why High-Performance Modelling and Simulation for Big Data Applications Matters Book Chapter

In: Kołodziej, Joanna, González-Vélez, Horacio (Ed.): High-Performance Modelling and Simulation for Big Data Applications: Selected Results of the COST Action IC1406 cHiPSet, no. 11400, pp. 1–35, Springer International Publishing, Cham, 2019, ISBN: 978-3-030-16272-6.

Abstract | Links | BibTeX | Tags: HPC

Maurizio Drocco, Paolo Viviani, Iacopo Colonnelli, Marco Aldinucci, Marco Grangetto

Accelerating spectral graph analysis through wavefronts of linear algebra operations Proceedings Article

In: Proc. of 27th Euromicro Intl. Conference on Parallel Distributed and network-based Processing (PDP), pp. 9–16, IEEE, Pavia, Italy, 2019.

Abstract | Links | BibTeX | Tags:

Vincent Reniers, Dimitri Van Landuyt, Paolo Viviani, Bert Lagaisse, Riccardo Lombardi, Wouter Joosen

Analysis of Architectural Variants for Auditable Blockchain-based Private Data Sharing Proceedings Article

In: Proceedings of the 34th ACM/SIGAPP Symposium on Applied Computing, pp. 346–354, ACM, Limassol, Cyprus, 2019, ISBN: 978-1-4503-5933-7.

Abstract | Links | BibTeX | Tags:

@inproceedings{19:sac:blockchain,

title = {Analysis of Architectural Variants for Auditable Blockchain-based Private Data Sharing},

author = {Vincent Reniers and Dimitri Van Landuyt and Paolo Viviani and Bert Lagaisse and Riccardo Lombardi and Wouter Joosen},

url = {https://doi.acm.org/10.1145/3297280.3297316},

doi = {10.1145/3297280.3297316},

isbn = {978-1-4503-5933-7},

year  = {2019},

date = {2019-01-01},

booktitle = {Proceedings of the 34th ACM/SIGAPP Symposium on Applied Computing},

pages = {346–354},

publisher = {ACM},

address = {Limassol, Cyprus},

series = {SAC '19},

abstract = {Many applications by design depend on costly trusted third-party auditors. One such example is the industrial application case of federated multi-disciplinary optimization (MDO), in which different organizations contribute to a complex engineering design effort. Although blockchain and distributed ledger technology (DLT) has strong potential in reducing the dependence on such intermediaries, the architectural complexity involved in designing a solution is daunting. In this paper, we analyze the architectural variants for decentralized private data sharing while guaranteeing auditability in terms of data access operations. Non-repudiation of actions taken by each party is a key requirement, as is availability of the shared data. % through storage governed by the chain. The architectural variants analyzed focus on attaining:~(i)~confidential data exchange, (ii)~maintaining and governing access to the shared data, (iii)~providing data access auditability, (iv)~data validation or conflict resolution, and to a lesser degree (v)~transaction and identity privacy. We systematically enumerate architectural decisions at the levels of:~storage, policy-based file access control, data encryption methods, and auditability mechanisms for private data. This analysis is based on extensive assessment of the state of the art on decentralized private data access management using static or dynamic policies, and private data validation without exposing confidential information. The main contribution of this work is a comprehensive overview of architectural variants for decentralized control of private, encrypted data, and the involved trade-offs in terms of performance, auditable trust and security. These findings are validated in the context on the aforementioned industry case that involves federated multi-disciplinary optimization (MDO).},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

2018

Adriano Marques Garcia, Claudio Schepke, Alessandro Gonçalves Girardi, Sherlon Almeida Silva

Power Consumption of Parallel Programming Interfaces in Multicore Architectures: A Case Study Proceedings Article

In: 2018 Symposium on High Performance Computing Systems (WSCAD), pp. 77-83, 2018.

Abstract | Links | BibTeX | Tags:

Claudia Misale, Maurizio Drocco, Guy Tremblay, Marco Aldinucci

PiCo: a Novel Approach to Stream Data Analytics Proceedings Article

In: Proc. of Euro-Par Workshops: 1st Intl. Workshop on Autonomic Solutions for Parallel and Distributed Data Stream Processing (Auto-DaSP 2017), Springer, Santiago de Compostela, Spain, 2018.

Abstract | Links | BibTeX | Tags: rephrase, toreador

Gabriele Mencagli, Massimo Torquati, Fabio Lucattini, Salvatore Cuomo, Marco Aldinucci

Harnessing sliding-window execution semantics for parallel stream processing Journal Article

In: Journal of Parallel and Distributed Computing, vol. 116, pp. 74–88, 2018, ISSN: 0743-7315.

Abstract | Links | BibTeX | Tags: fastflow, rephrase

Marco Aldinucci, Sergio Rabellino, Marco Pironti, Filippo Spiga, Paolo Viviani, Maurizio Drocco, Marco Guerzoni, Guido Boella, Marco Mellia, Paolo Margara, Idillio Drago, Roberto Marturano, Guido Marchetto, Elio Piccolo, Stefano Bagnasco, Stefano Lusso, Sara Vallero, Giuseppe Attardi, Alex Barchiesi, Alberto Colla, Fulvio Galeazzi

HPC4AI, an AI-on-demand federated platform endeavour Proceedings Article

In: ACM Computing Frontiers, Ischia, Italy, 2018.

Abstract | Links | BibTeX | Tags: hpc4ai, rephrase, toreador

Paolo Viviani, Maurizio Drocco, Marco Aldinucci

Scaling Dense Linear Algebra on Multicore and Beyond: a Survey Proceedings Article

In: Proc. of 26th Euromicro Intl. Conference on Parallel Distributed and network-based Processing (PDP), IEEE, Cambridge, United Kingdom, 2018.

Abstract | Links | BibTeX | Tags:

Fabio Tordini, Marco Aldinucci, Paolo Viviani, Ivan Merelli, Pietro Liò

Scientific Workflows on Clouds with Heterogeneous and Preemptible Instances Proceedings Article

In: Proc. of the Intl. Conference on Parallel Computing, ParCo 2017, 12-15 September 2017, Bologna, Italy, IOS Press, 2018.

Abstract | Links | BibTeX | Tags: rephrase

Paolo Viviani, Marco Aldinucci, Roberto d'Ippolito, Jan Lemeire, Dean Vucinic

A Flexible Numerical Framework for Engineering—A Response Surface Modelling Application Book Chapter

In: Improved Performance of Materials: Design and Experimental Approaches, pp. 93–106, Springer International Publishing, Cham, 2018, ISBN: 978-3-319-59590-0.

Abstract | Links | BibTeX | Tags: repara, rephrase

@inbook{17:viviani:advstruct,

title = {A Flexible Numerical Framework for Engineering—A Response Surface Modelling Application},

author = {Paolo Viviani and Marco Aldinucci and Roberto d'Ippolito and Jan Lemeire and Dean Vucinic},

doi = {10.1007/978-3-319-59590-0_9},

isbn = {978-3-319-59590-0},

year  = {2018},

date = {2018-01-01},

booktitle = {Improved Performance of Materials: Design and Experimental Approaches},

pages = {93–106},

publisher = {Springer International Publishing},

address = {Cham},

abstract = {This work presents an innovative approach adopted for the development of a new numerical software framework for accelerating dense linear algebra calculations and its application within an engineering context. In particular, response surface models (RSM) are a key tool to reduce the computational effort involved in engineering design processes like design optimization. However, RSMs may prove to be too expensive to be computed when the dimensionality of the system and/or the size of the dataset to be synthesized is significantly high or when a large number of different response surfaces has to be calculated in order to improve the overall accuracy (e.g. like when using ensemble modelling techniques). On the other hand, the potential of modern hybrid hardware (e.g. multicore, GPUs) is not exploited by current engineering tools, while they can lead to a significant performance improvement. To fill this gap, a software framework is being developed that enables the hybrid and scalable acceleration of the linear algebra core for engineering applications and especially of RSMs calculations with a user-friendly syntax that allows good portability between different hardware architectures, with no need of specific expertise in parallel programming and accelerator technology. The effectiveness of this framework is shown by comparing an accelerated code to a single-core calculation of a radial basis function RSM on some benchmark datasets. This approach is then validated within a real-life engineering application and the achievements are presented and discussed.},

keywords = {repara, rephrase},

pubstate = {published},

tppubtype = {inbook}

}

Marco Aldinucci, Marco Danelutto, Maurizio Drocco, Peter Kilpatrick, Claudia Misale, Guilherme Peretti Pezzi, Massimo Torquati

A Parallel Pattern for Iterative Stencil + Reduce Journal Article

In: Journal of Supercomputing, vol. 74, no. 11, pp. 5690–5705, 2018.

Abstract | Links | BibTeX | Tags: HPC, repara, rephrase

Paolo Viviani, Maurizio Drocco, Marco Aldinucci

Pushing the boundaries of parallel Deep Learning - A practical approach Journal Article

In: CoRR, vol. abs/1806.09528, 2018.

Abstract | Links | BibTeX | Tags:

Claudia Misale, Maurizio Drocco, Guy Tremblay, Alberto R. Martinelli, Marco Aldinucci

PiCo: High-performance data analytics pipelines in modern C++ Journal Article

In: Future Generation Computer Systems, vol. 87, pp. 392–403, 2018.

Abstract | Links | BibTeX | Tags: fastflow, HPC, toreador

2017

Salvatore Cuomo, Marco Aldinucci, Massimo Torquati

Guest Editorial for Programming Models and Algorithms for Data Analysis in HPC Systems Journal Article

In: International Journal of Parallel Programming, pp. 1–3, 2017, ISSN: 0885-7458, (Editorial).

Abstract | Links | BibTeX | Tags: HPC

Maurizio Drocco

Parallel Programming with Global Asynchronous Memory: Models, C++ APIs and Implementations PhD Thesis

Computer Science Department, University of Torino, 2017.

Abstract | Links | BibTeX | Tags: fastflow, paraphrase, repara, rephrase, toreador

@phdthesis{17:gam:drocco:thesis,

title = {Parallel Programming with Global Asynchronous Memory: Models, C++ APIs and Implementations},

author = {Maurizio Drocco},

url = {https://zenodo.org/record/1037585/files/Drocco_phd_thesis.pdf},

doi = {10.5281/zenodo.1037585},

year  = {2017},

date = {2017-10-01},

school = {Computer Science Department, University of Torino},

abstract = {In the realm of High Performance Computing (HPC), message passing has been the programming paradigm of choice for over twenty years. The durable MPI (Message Passing Interface) standard, with send/receive communication, broadcast, gather/scatter, and reduction collectives is still used to construct parallel programs where each communication is orchestrated by the de-vel-oper-based precise knowledge of data distribution and overheads; collective communications simplify the orchestration but might induce excessive synchronization. Early attempts to bring shared-memory programming model—with its programming adv-antages—to distributed computing, referred as the Distributed Shared Memory (DSM) model, faded away; one of the main issue was to combine performance and programmability with the memory consistency model. The recently proposed Partitioned Global Address Space (PGAS) model is a modern revamp of DSM that exposes data placement to enable optimizations based on locality, but it still addresses (simple) data-parallelism only and it relies on expensive sharing protocols. We advocate an alternative programming model for distributed computing based on a Global Asynchronous Memory (GAM), aiming to emphavoid coherency and consistency problems rather than solving them. We materialize GAM by designing and implementing a emphdistributed smart pointers library, inspired by C++ smart pointers. In this model, public and private pointers (resembling C++ shared and unique pointers, respectively) are moved around instead of messages (i.e., data), thus alleviating the user from the burden of minimizing transfers. On top of smart pointers, we propose a high-level C++ template library for writing applications in terms of dataflow-like networks, namely GAM nets, consisting of stateful processors exchanging pointers in fully asynchronous fashion. We demonstrate the validity of the proposed approach, from the expressiveness perspective, by showing how GAM nets can be exploited to implement higher-level parallel programming models, such as data and task parallelism. As for the performance perspective, the execution of two non-toy benchmarks on a number of different small-scale HPC clusters exhibits both close-to-ideal scalability and negligible overhead with respect to state-of-the-art benchmark implementations. For instance, the GAM implementation of a high-quality video restoration filter sustains a 100 fps throughput over 70%-noisy high-quality video streams on a 4-node cluster of Graphics Processing Units (GPUs), with minimal programming effort.},

keywords = {fastflow, paraphrase, repara, rephrase, toreador},

pubstate = {published},

tppubtype = {phdthesis}

}

In the realm of High Performance Computing (HPC), message passing has been the programming paradigm of choice for over twenty years. The durable MPI (Message Passing Interface) standard, with send/receive communication, broadcast, gather/scatter, and reduction collectives is still used to construct parallel programs where each communication is orchestrated by the de-vel-oper-based precise knowledge of data distribution and overheads; collective communications simplify the orchestration but might induce excessive synchronization. Early attempts to bring shared-memory programming model—with its programming adv-antages—to distributed computing, referred as the Distributed Shared Memory (DSM) model, faded away; one of the main issue was to combine performance and programmability with the memory consistency model. The recently proposed Partitioned Global Address Space (PGAS) model is a modern revamp of DSM that exposes data placement to enable optimizations based on locality, but it still addresses (simple) data-parallelism only and it relies on expensive sharing protocols. We advocate an alternative programming model for distributed computing based on a Global Asynchronous Memory (GAM), aiming to emphavoid coherency and consistency problems rather than solving them. We materialize GAM by designing and implementing a emphdistributed smart pointers library, inspired by C++ smart pointers. In this model, public and private pointers (resembling C++ shared and unique pointers, respectively) are moved around instead of messages (i.e., data), thus alleviating the user from the burden of minimizing transfers. On top of smart pointers, we propose a high-level C++ template library for writing applications in terms of dataflow-like networks, namely GAM nets, consisting of stateful processors exchanging pointers in fully asynchronous fashion. We demonstrate the validity of the proposed approach, from the expressiveness perspective, by showing how GAM nets can be exploited to implement higher-level parallel programming models, such as data and task parallelism. As for the performance perspective, the execution of two non-toy benchmarks on a number of different small-scale HPC clusters exhibits both close-to-ideal scalability and negligible overhead with respect to state-of-the-art benchmark implementations. For instance, the GAM implementation of a high-quality video restoration filter sustains a 100 fps throughput over 70%-noisy high-quality video streams on a 4-node cluster of Graphics Processing Units (GPUs), with minimal programming effort.

Paula Severi, Luca Padovani, Emilio Tuosto, Mariangiola Dezani-Ciancaglini

On Sessions and Infinite Data Journal Article

In: Logical Methods in Computer Science, vol. Volume 13, Issue 2, 2017.

Links | BibTeX | Tags: rephrase, semantics

Maurizio Drocco, Claudia Misale, Guy Tremblay, Marco Aldinucci

A Formal Semantics for Data Analytics Pipelines Technical Report

Computer Science Department, University of Torino 2017, (https://arxiv.org/abs/1705.01629).

Links | BibTeX | Tags: rephrase, toreador

Claudia Misale

PiCo: A Domain-Specific Language for Data Analytics Pipelines PhD Thesis

Computer Science Department, University of Torino, 2017.

Abstract | Links | BibTeX | Tags: fastflow, paraphrase, repara, rephrase, toreador

@phdthesis{17:pico:misale:thesis,

title = {PiCo: A Domain-Specific Language for Data Analytics Pipelines},

author = {Claudia Misale},

url = {https://iris.unito.it/retrieve/handle/2318/1633743/320170/Misale_thesis.pdf},

doi = {10.5281/zenodo.579753},

year  = {2017},

date = {2017-05-01},

school = {Computer Science Department, University of Torino},

abstract = {In the world of Big Data analytics, there is a series of tools aiming at simplifying programming applications to be executed on clusters. Although each tool claims to provide better programming, data and execution models—for which only informal (and often confusing) semantics is generally provided—all share a common under- lying model, namely, the Dataflow model. Using this model as a starting point, it is possible to categorize and analyze almost all aspects about Big Data analytics tools from a high level perspective. This analysis can be considered as a first step toward a formal model to be exploited in the design of a (new) framework for Big Data analytics. By putting clear separations between all levels of abstraction (i.e., from the runtime to the user API), it is easier for a programmer or software designer to avoid mixing low level with high level aspects, as we are often used to see in state-of-the-art Big Data analytics frameworks. 

 From the user-level perspective, we think that a clearer and simple semantics is preferable, together with a strong separation of concerns. For this reason, we use the Dataflow model as a starting point to build a programming environment with a simplified programming model implemented as a Domain-Specific Language, that is on top of a stack of layers that build a prototypical framework for Big Data analytics. 

 The contribution of this thesis is twofold: first, we show that the proposed model is (at least) as general as existing batch and streaming frameworks (e.g., Spark, Flink, Storm, Google Dataflow), thus making it easier to understand high-level data-processing applications written in such frameworks. As result of this analysis, we provide a layered model that can represent tools and applications following the Dataflow paradigm and we show how the analyzed tools fit in each level. 

 Second, we propose a programming environment based on such layered model in the form of a Domain-Specific Language (DSL) for processing data collections, called PiCo (Pipeline Composition). The main entity of this programming model is the Pipeline, basically a DAG-composition of processing elements. This model is intended to give the user an unique interface for both stream and batch processing, hiding completely data management and focusing only on operations, which are represented by Pipeline stages. Our DSL will be built on top of the FastFlow library, exploiting both shared and distributed parallelism, and implemented in C++11/14 with the aim of porting C++ into the Big Data world.},

keywords = {fastflow, paraphrase, repara, rephrase, toreador},

pubstate = {published},

tppubtype = {phdthesis}

}

In the world of Big Data analytics, there is a series of tools aiming at simplifying programming applications to be executed on clusters. Although each tool claims to provide better programming, data and execution models—for which only informal (and often confusing) semantics is generally provided—all share a common under- lying model, namely, the Dataflow model. Using this model as a starting point, it is possible to categorize and analyze almost all aspects about Big Data analytics tools from a high level perspective. This analysis can be considered as a first step toward a formal model to be exploited in the design of a (new) framework for Big Data analytics. By putting clear separations between all levels of abstraction (i.e., from the runtime to the user API), it is easier for a programmer or software designer to avoid mixing low level with high level aspects, as we are often used to see in state-of-the-art Big Data analytics frameworks.
From the user-level perspective, we think that a clearer and simple semantics is preferable, together with a strong separation of concerns. For this reason, we use the Dataflow model as a starting point to build a programming environment with a simplified programming model implemented as a Domain-Specific Language, that is on top of a stack of layers that build a prototypical framework for Big Data analytics.
The contribution of this thesis is twofold: first, we show that the proposed model is (at least) as general as existing batch and streaming frameworks (e.g., Spark, Flink, Storm, Google Dataflow), thus making it easier to understand high-level data-processing applications written in such frameworks. As result of this analysis, we provide a layered model that can represent tools and applications following the Dataflow paradigm and we show how the analyzed tools fit in each level.
Second, we propose a programming environment based on such layered model in the form of a Domain-Specific Language (DSL) for processing data collections, called PiCo (Pipeline Composition). The main entity of this programming model is the Pipeline, basically a DAG-composition of processing elements. This model is intended to give the user an unique interface for both stream and batch processing, hiding completely data management and focusing only on operations, which are represented by Pipeline stages. Our DSL will be built on top of the FastFlow library, exploiting both shared and distributed parallelism, and implemented in C++11/14 with the aim of porting C++ into the Big Data world.

Paolo Viviani, Massimo Torquati, Marco Aldinucci, Roberto d'Ippolito

Multiple back-end support for the Armadillo linear algebra interface Proceedings Article

In: In proc. of the 32nd ACM Symposium on Applied Computing (SAC), pp. 1566–1573, Marrakesh, Morocco, 2017.

Abstract | Links | BibTeX | Tags: HPC, repara, rephrase

Marco Aldinucci, Marco Danelutto, Daniele De Sensi, Gabriele Mencagli, Massimo Torquati

Towards Power-Aware Data Pipelining on Multicores Proceedings Article

In: Proceedings of the 10th International Symposium on High-Level Parallel Programming and Applications, Valladolid, Spain, 2017.

Abstract | Links | BibTeX | Tags: fastflow, rephrase

Claudia Misale, Maurizio Drocco, Marco Aldinucci, Guy Tremblay

A Comparison of Big Data Frameworks on a Layered Dataflow Model Journal Article

In: Parallel Processing Letters, vol. 27, no. 01, pp. 1–20, 2017.

Abstract | Links | BibTeX | Tags: rephrase, toreador

Marco Aldinucci, Marco Danelutto, Peter Kilpatrick, Massimo Torquati

FastFlow: high-level and efficient streaming on multi-core Book Chapter

In: Pllana, Sabri, Xhafa, Fatos (Ed.): Programming Multi-core and Many-core Computing Systems, Chapter 13, pp. 261–280, John Wiley & Sons, Ltd, 2017, ISBN: 9781119332015.

Abstract | Links | BibTeX | Tags: fastflow

Fabio Tordini, Maurizio Drocco, Claudia Misale, Luciano Milanesi, Pietro Liò, Ivan Merelli, Massimo Torquati, Marco Aldinucci

NuChart-II: the road to a fast and scalable tool for Hi-C data analysis Journal Article

In: International Journal of High Performance Computing Applications, vol. 31, no. 3, pp. 196–211, 2017.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow, repara, rephrase

@article{16:ijhpca:nuchart,

title = {NuChart-II: the road to a fast and scalable tool for Hi-C data analysis},

author = {Fabio Tordini and Maurizio Drocco and Claudia Misale and Luciano Milanesi and Pietro Liò and Ivan Merelli and Massimo Torquati and Marco Aldinucci},

url = {https://iris.unito.it/retrieve/handle/2318/1607126/238747/main.pdf},

doi = {10.1177/1094342016668567},

year  = {2017},

date = {2017-01-01},

journal = {International Journal of High Performance Computing Applications},

volume = {31},

number = {3},

pages = {196–211},

abstract = {Recent advances in molecular biology and bioinformatics techniques brought to an explosion of the information about the spatial organisation of the DNA in the nucleus of a cell. High-throughput molecular biology techniques provide a genome-wide capture of the spatial organization of chromosomes at unprecedented scales, which permit to identify physical interactions between genetic elements located throughout a genome. Recent results have shown that there is a large correlation between co-localization and co-regulation of genes, but these important information are hampered by the lack of biologists-friendly analysis and visualisation software. In this work we present NuChart-II, an efficient and highly optimized tool for genomic data analysis that provides a gene-centric, graph-based representation of genomic information. While designing NuChart-II we addressed several common issues in the parallelisation of memory bound algorithms for shared-memory systems. With performance and usability in mind, NuChart-II is a R package that embeds a C++ engine: computing capabilities and memory hierarchy of multi-core architectures are fully exploited, while the versatile R environment for statistical analysis and data visualisation rises the level of abstraction and permits to orchestrate analysis and visualisation of genomic data.},

keywords = {bioinformatics, fastflow, repara, rephrase},

pubstate = {published},

tppubtype = {article}

}

Marco Aldinucci, Stefano Bagnasco, Stefano Lusso, Paolo Pasteris, Sergio Rabellino

OCCAM: a flexible, multi-purpose and extendable HPC cluster Proceedings Article

In: Journal of Physics: Conf. Series (CHEP 2016), pp. 082039, San Francisco, USA, 2017.

Abstract | Links | BibTeX | Tags: c3s, HPC

@inproceedings{16:occam:chep,

title = {OCCAM: a flexible, multi-purpose and extendable HPC cluster},

author = {Marco Aldinucci and Stefano Bagnasco and Stefano Lusso and Paolo Pasteris and Sergio Rabellino},

url = {http://iopscience.iop.org/article/10.1088/1742-6596/898/8/082039/meta},

doi = {10.1088/1742-6596/898/8/082039},

year  = {2017},

date = {2017-01-01},

booktitle = {Journal of Physics: Conf. Series (CHEP 2016)},

volume = {898},

number = {8},

pages = {082039},

address = {San Francisco, USA},

abstract = {Obtaining CPU cycles on an HPC cluster is nowadays relatively simple and sometimes even cheap for academic institutions. However, in most of the cases providers of HPC services would not allow changes on the configuration, implementation of special features or a lower-level control on the computing infrastructure and networks, for example for testing new computing patterns or conducting research on HPC itself. The variety of use cases proposed by several departments of the University of Torino, including ones from solid-state chemistry, high-energy physics, computer science, big data analytics, computational biology, genomics and many others, called for different and sometimes conflicting configurations; furthermore, several R&D activities in the field of scientific computing, with topics ranging from GPU acceleration to Cloud Computing technologies, needed a platform to be carried out on. The Open Computing Cluster for Advanced data Manipulation (OCCAM) is a multi-purpose flexible HPC cluster designed and operated by a collaboration between the University of Torino and the Torino branch of the Istituto Nazionale di Fisica Nucleare. It is aimed at providing a flexible, reconfigurable and extendable infrastructure to cater to a wide range of different scientific computing needs, as well as a platform for R&D activities on computational technologies themselves. Extending it with novel architecture CPU, accelerator or hybrid microarchitecture (such as forthcoming Intel Xeon Phi Knights Landing) should be as a simple as plugging a node in a rack. The initial system counts slightly more than 1100 cpu cores and includes different types of computing nodes (standard dual-socket nodes, large quad-sockets nodes with 768 GB RAM, and multi-GPU nodes) and two separate disk storage subsystems: a smaller high-performance scratch area, based on the Lustre file system, intended for direct computational I/O and a larger one, of the order of 1PB, to archive near-line data for archival purposes. All the components of the system are interconnected through a 10Gb/s Ethernet layer with one-level topology and an InfiniBand FDR 56Gbps layer in fat-tree topology. A system of this kind, heterogeneous and reconfigurable by design, poses a number of challenges related to the frequency at which heterogeneous hardware resources might change their availability and shareability status, which in turn affect methods and means to allocate, manage, optimize, bill, monitor VMs, virtual farms, jobs, interactive bare-metal sessions, etc. This poster describes some of the use cases that prompted the design ad construction of the HPC cluster, its architecture and a first characterization of its performance by some synthetic benchmark tools and a few realistic use-case tests.},

keywords = {c3s, HPC},

pubstate = {published},

tppubtype = {inproceedings}

}

Obtaining CPU cycles on an HPC cluster is nowadays relatively simple and sometimes even cheap for academic institutions. However, in most of the cases providers of HPC services would not allow changes on the configuration, implementation of special features or a lower-level control on the computing infrastructure and networks, for example for testing new computing patterns or conducting research on HPC itself. The variety of use cases proposed by several departments of the University of Torino, including ones from solid-state chemistry, high-energy physics, computer science, big data analytics, computational biology, genomics and many others, called for different and sometimes conflicting configurations; furthermore, several R&D activities in the field of scientific computing, with topics ranging from GPU acceleration to Cloud Computing technologies, needed a platform to be carried out on. The Open Computing Cluster for Advanced data Manipulation (OCCAM) is a multi-purpose flexible HPC cluster designed and operated by a collaboration between the University of Torino and the Torino branch of the Istituto Nazionale di Fisica Nucleare. It is aimed at providing a flexible, reconfigurable and extendable infrastructure to cater to a wide range of different scientific computing needs, as well as a platform for R&D activities on computational technologies themselves. Extending it with novel architecture CPU, accelerator or hybrid microarchitecture (such as forthcoming Intel Xeon Phi Knights Landing) should be as a simple as plugging a node in a rack. The initial system counts slightly more than 1100 cpu cores and includes different types of computing nodes (standard dual-socket nodes, large quad-sockets nodes with 768 GB RAM, and multi-GPU nodes) and two separate disk storage subsystems: a smaller high-performance scratch area, based on the Lustre file system, intended for direct computational I/O and a larger one, of the order of 1PB, to archive near-line data for archival purposes. All the components of the system are interconnected through a 10Gb/s Ethernet layer with one-level topology and an InfiniBand FDR 56Gbps layer in fat-tree topology. A system of this kind, heterogeneous and reconfigurable by design, poses a number of challenges related to the frequency at which heterogeneous hardware resources might change their availability and shareability status, which in turn affect methods and means to allocate, manage, optimize, bill, monitor VMs, virtual farms, jobs, interactive bare-metal sessions, etc. This poster describes some of the use cases that prompted the design ad construction of the HPC cluster, its architecture and a first characterization of its performance by some synthetic benchmark tools and a few realistic use-case tests.

Concetto Spampinato, Simone Palazzo, Daniela Giordano, Marco Aldinucci, Rosalia Leonardi

Deep learning for automated skeletal bone age assessment in X-ray images Journal Article

In: Medical Image Analysis, vol. 36, pp. 41–51, 2017.

Abstract | Links | BibTeX | Tags: HPC

@article{17:deepx:conce,

title = {Deep learning for automated skeletal bone age assessment in X-ray images},

author = {Concetto Spampinato and Simone Palazzo and Daniela Giordano and Marco Aldinucci and Rosalia Leonardi},

url = {https://iris.unito.it/retrieve/e27ce42b-5743-2581-e053-d805fe0acbaa/main.pdf},

doi = {10.1016/j.media.2016.10.010},

year  = {2017},

date = {2017-01-01},

journal = {Medical Image Analysis},

volume = {36},

pages = {41–51},

abstract = {Skeletal bone age assessment is a common clinical practice to investigate endocrinology, genetic and growth disorders in children. It is generally performed by radiological examination of the left hand by using either the Greulich and Pyle (G&P) method or the Tanner–Whitehouse (TW) one. However, both clinical procedures show several limitations, from the examination effort of radiologists to (most importantly) significant intra- and inter-operator variability. To address these problems, several automated approaches (especially relying on the TW method) have been proposed; nevertheless, none of them has been proved able to generalize to different races, age ranges and genders. In this paper, we propose and test several deep learning approaches to assess skeletal bone age automatically; the results showed an average discrepancy between manual and automatic evaluation of about 0.8 years, which is state-of-the-art performance. Furthermore, this is the first automated skeletal bone age assessment work tested on a public dataset and for all age ranges, races and genders, for which the source code is available, thus representing an exhaustive baseline for future research in the field. Beside the specific application scenario, this paper aims at providing answers to more general questions about deep learning on medical images: from the comparison between deep-learned features and manually-crafted ones, to the usage of deep-learning methods trained on general imagery for medical problems, to how to train a CNN with few images.},

keywords = {HPC},

pubstate = {published},

tppubtype = {article}

}

Mario Coppo, Mariangiola Dezani-Ciancaglini, Alejandro D'ıaz-Caro, Ines Margaria, Maddalena Zacchi

Retractions in Intersection Types Proceedings Article

In: Kobayashi, Naoki (Ed.): ITRS'16, pp. 31–47, 2017.

Links | BibTeX | Tags: rephrase, semantics

Joshua Romero, Everett H. Phillips, Gregory Ruetsch, Massimiliano Fatica, Filippo Spiga, Paolo Giannozzi

A Performance Study of Quantum ESPRESSO's PWscf Code on Multi-core and GPU Systems Proceedings Article

In: High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation - 8th International Workshop, PMBS 2017, Denver, CO, USA, November 13, 2017, Proceedings, pp. 67–87, 2017.

Links | BibTeX | Tags:

Wissam Abu Ahmad, Andrea Bartolini, Francesco Beneventi, Luca Benini, Andrea Borghesi, Marco Cicala, Privato Forestieri, Cosimo Gianfreda, Daniele Gregori, Antonio Libri, Filippo Spiga, Simone Tinti

Design of an Energy Aware Petaflops Class High Performance Cluster Based on Power Architecture Proceedings Article

In: 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPS Workshops 2017, Orlando / Buena Vista, FL, USA, May 29 - June 2, 2017, pp. 964–973, 2017.

Links | BibTeX | Tags: HPC

2016

Paolo Viviani, Marco Aldinucci, Roberto d'Ippolito

An hybrid linear algebra framework for engineering Proceedings Article

In: Advanced Computer Architecture and Compilation for High-Performance and Embedded Systems (ACACES) – Poster Abstracts, Fiuggi, Italy, 2016.

Abstract | Links | BibTeX | Tags: HPC, repara

Claudia Misale, Maurizio Drocco, Marco Aldinucci, Guy Tremblay

A Comparison of Big Data Frameworks on a Layered Dataflow Model Proceedings Article

In: Proc. of Intl. Workshop on High-Level Parallel Programming (HLPP), pp. 1–19, arXiv.org, Muenster, Germany, 2016.

Abstract | Links | BibTeX | Tags: rephrase, toreador

Fabio Tordini

The road towards a Cloud-based High-Performance solution for genomic data analysis PhD Thesis

Computer Science Department, University of Torino, Italy, 2016.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow

@phdthesis{tordiniThesis16,

title = {The road towards a Cloud-based High-Performance solution for genomic data analysis},

author = {Fabio Tordini},

url = {http://calvados.di.unipi.it/storage/paper_files/2016_tordini_phdthesis.pdf},

year  = {2016},

date = {2016-04-01},

school = {Computer Science Department, University of Torino, Italy},

abstract = {Nowadays, molecular biology laboratories are delivering more and more data about DNA organisation, at increasing resolution and in a large number of samples. So much that genomic research is now facing many of the scale-out issues that high-performance computing has been addressing for years: they require powerful infrastructures with fast computing and storage capabilities, with substantial challenges in terms of data processing, statistical analysis and data representation. With this thesis we propose a high-performance pipeline for the analysis and interpretation of heterogeneous genomic information: beside performance, usability and availability are two essential requirements that novel Bioinformatics tools should satisfy. In this perspective, we propose and discuss our efforts towards a solid infrastructure for data processing and storage, where software that operates over data is exposed as a service, and is accessible by users through the Internet. We begin by presenting NuChart-II, a tool for the analysis and interpretation of spatial genomic information. With NuChart-II we propose a graph-based representation of genomic data, which can provide insights on the disposition of genomic elements in the DNA. We also discuss our approach for the normalisation of biases that affect raw sequenced data. We believe that many currently available tools for genomic data analysis are perceived as tricky and troublesome applications, that require highly specialised skills to obtain the desired outcomes. Concerning usability, we want to rise the level of abstraction perceived by the user, but maintain high performance and correctness while providing an exhaustive solution for data visualisation. We also intend to foster the availability of novel tools: in this work we also discuss a cloud solution that delivers computation and storage as dynamically allocated virtual resources via the Internet, while needed software is provided as a service. In this way, the computational demand of genomic research can be satisfied more economically by using lab-scale and enterprise-oriented technologies. Here we discuss our idea of a task farm for the integration of heterogeneous data resulting from different sequencing experiments: we believe that the integration of multi-omic features on a nuclear map can be a valuable mean for studying the interactions among genetic elements. This can reveal insights on biological mechanisms, such as genes regulation, translocations and epigenetic patterns.},

keywords = {bioinformatics, fastflow},

pubstate = {published},

tppubtype = {phdthesis}

}

Nowadays, molecular biology laboratories are delivering more and more data about DNA organisation, at increasing resolution and in a large number of samples. So much that genomic research is now facing many of the scale-out issues that high-performance computing has been addressing for years: they require powerful infrastructures with fast computing and storage capabilities, with substantial challenges in terms of data processing, statistical analysis and data representation. With this thesis we propose a high-performance pipeline for the analysis and interpretation of heterogeneous genomic information: beside performance, usability and availability are two essential requirements that novel Bioinformatics tools should satisfy. In this perspective, we propose and discuss our efforts towards a solid infrastructure for data processing and storage, where software that operates over data is exposed as a service, and is accessible by users through the Internet. We begin by presenting NuChart-II, a tool for the analysis and interpretation of spatial genomic information. With NuChart-II we propose a graph-based representation of genomic data, which can provide insights on the disposition of genomic elements in the DNA. We also discuss our approach for the normalisation of biases that affect raw sequenced data. We believe that many currently available tools for genomic data analysis are perceived as tricky and troublesome applications, that require highly specialised skills to obtain the desired outcomes. Concerning usability, we want to rise the level of abstraction perceived by the user, but maintain high performance and correctness while providing an exhaustive solution for data visualisation. We also intend to foster the availability of novel tools: in this work we also discuss a cloud solution that delivers computation and storage as dynamically allocated virtual resources via the Internet, while needed software is provided as a service. In this way, the computational demand of genomic research can be satisfied more economically by using lab-scale and enterprise-oriented technologies. Here we discuss our idea of a task farm for the integration of heterogeneous data resulting from different sequencing experiments: we believe that the integration of multi-omic features on a nuclear map can be a valuable mean for studying the interactions among genetic elements. This can reveal insights on biological mechanisms, such as genes regulation, translocations and epigenetic patterns.

Bogdan Nicolae, Carlos H. A. Costa, Claudia Misale, Kostas Katrinis, Yoonho Park

Towards Memory-Optimized Data Shuffling Patterns for Big Data Analytics Proceedings Article

In: IEEE/ACM 16th Intl. Symposium on Cluster, Cloud and Grid Computing, CCGrid 2016, IEEE, Cartagena, Colombia, 2016.

Abstract | Links | BibTeX | Tags: HPC

Paolo Viviani, Marco Aldinucci, Roberto d'Ippolito, Jean Lemeire, Dean Vucinic

A flexible numerical framework for engineering - a Response Surface Modelling application Unpublished

2016.

Abstract | BibTeX | Tags: HPC, repara, rephrase

@unpublished{16:acex:armadillo,

title = {A flexible numerical framework for engineering - a Response Surface Modelling application},

author = {Paolo Viviani and Marco Aldinucci and Roberto d'Ippolito and Jean Lemeire and Dean Vucinic},

year  = {2016},

date = {2016-01-01},

booktitle = {10th Intl. Conference on Advanced Computational Engineering and Experimenting (ACE-X)},

abstract = {This work presents the innovative approach adopted for the development of a new numerical software framework for accelerating Dense Linear Algebra calculations and its application within an engineering context. In particular, Response Surface Models (RSM) are a key tool to reduce the computational effort involved in engineering design processes like design optimization. However, RSMs may prove to be too expensive to be computed when the dimensionality of the system and/or the size of the dataset to be synthesized is significantly high or when a large number of different Response Surfaces has to be calculated in order to improve the overall accuracy (e.g. like when using Ensemble Modelling techniques). On the other hand, it is a known challenge that the potential of modern hybrid hardware (e.g. multicore, GPUs) is not exploited by current engineering tools, while they can lead to a significant performance improvement. To fill this gap, a software framework is being developed that enables the hybrid and scalable acceleration of the linear algebra core for engineering applications and especially of RSMs calculations with a user-friendly syntax that allows good portability between different hardware architectures, with no need of specific expertise in parallel programming and accelerator technology. The effectiveness of this framework is shown by comparing an accelerated code to a single-core calculation of a Radial Basis Function RSM on some benchmark datasets. This approach is then validated within a real-life engineering application and the achievements are presented and discussed.},

keywords = {HPC, repara, rephrase},

pubstate = {published},

tppubtype = {unpublished}

}

Maurizio Drocco, Claudia Misale, Marco Aldinucci

A Cluster-As-Accelerator approach for SPMD-free Data Parallelism Proceedings Article

In: Proc. of 24th Euromicro Intl. Conference on Parallel Distributed and network-based Processing (PDP), pp. 350–353, IEEE, Crete, Greece, 2016.

Abstract | Links | BibTeX | Tags: fastflow, rephrase

Vladimir Janjic, Christopher Brown, Kenneth MacKenzie, Kevin Hammond, Marco Danelutto, Marco Aldinucci, Jose Daniel Garcia

RPL: A Domain-Specific Language for Designing and Implementing Parallel C++ Applications Proceedings Article

In: Proc. of Intl. Euromicro PDP 2016: Parallel Distributed and network-based Processing, IEEE, Crete, Greece, 2016.

Abstract | Links | BibTeX | Tags: fastflow, rephrase

@inproceedings{rpl:pdp:16,

title = {RPL: A Domain-Specific Language for Designing and Implementing Parallel C++ Applications},

author = {Vladimir Janjic and Christopher Brown and Kenneth MacKenzie and Kevin Hammond and Marco Danelutto and Marco Aldinucci and Jose Daniel Garcia},

url = {https://iris.unito.it/retrieve/handle/2318/1597172/299237/2016_jsupe_stencil_pp_4aperto.pdf},

doi = {10.1109/PDP.2016.122},

year  = {2016},

date = {2016-01-01},

booktitle = {Proc. of Intl. Euromicro PDP 2016: Parallel Distributed and network-based Processing},

publisher = {IEEE},

address = {Crete, Greece},

abstract = {Parallelising sequential applications is usually a very hard job, due to many different ways in which an application can be parallelised and a large number of programming models (each with its own advantages and disadvantages) that can be used. In this paper, we describe a method to semi- automatically generate and evaluate different parallelisations of the same application, allowing programmers to find the best parallelisation without significant manual reengineering of the code. We describe a novel, high-level domain-specific language, Refactoring Pattern Language (RPL), that is used to represent the parallel structure of an application and to capture its extra-functional properties (such as service time). We then describe a set of RPL rewrite rules that can be used to generate alternative, but semantically equivalent, parallel structures (parallelisations) of the same application. We also describe the RPL Shell that can be used to evaluate these parallelisations, in terms of the desired extra-functional properties. Finally, we describe a set of C++ refactorings, targeting OpenMP, Intel TBB and FastFlow parallel programming models, that semi-automatically apply the desired parallelisation to the application's source code, therefore giving a parallel version of the code. We demonstrate how the RPL and the refactoring rules can be used to derive efficient parallelisations of two realistic C++ use cases (Image Convolution and Ant Colony Optimisation).},

keywords = {fastflow, rephrase},

pubstate = {published},

tppubtype = {inproceedings}

}

Marco Aldinucci, Sonia Campa, Marco Danelutto, Peter Kilpatrick, Massimo Torquati

Pool Evolution: A Parallel Pattern for Evolutionary and Symbolic Computing Journal Article

In: International Journal of Parallel Programming, vol. 44, no. 3, pp. 531–551, 2016, ISSN: 0885-7458.

Abstract | Links | BibTeX | Tags: fastflow, paraphrase, repara

Fabio Tordini, Ivan Merelli, Pietro Liò, Luciano Milanesi, Marco Aldinucci

NuchaRt: embedding high-level parallel computing in R for augmented Hi-C data analysis Book Section

In: Publishing, Springer International (Ed.): Computational Intelligence Methods for Bioinformatics and Biostatistics, vol. 9874, pp. 259–272, Springer International Publishing, Cham (ZG), 2016, ISBN: 978-3-319-44331-7.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow, repara

Fabio Tordini

A cloud solution for multi-omics data integration Proceedings Article

In: Proceedings of the 16th IEEE International Conference on Scalable Computing and Communication, pp. 559–566, IEEE Computer Society, 2016, (Best paper award).

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow, rephrase

Manuel F. Dolz, David Rio Astorga, Javier Fernández, J. Daniel Garc'ıa, Félix Garc'ıa-Carballeira, Marco Danelutto, Massimo Torquati

Embedding Semantics of the Single-Producer/Single-Consumer Lock-Free Queue into a Race Detection Tool Proceedings Article

In: Proceedings of the 7th International Workshop on Programming Models and Applications for Multicores and Manycores, pp. 20–29, ACM, Barcelona, Spain, 2016, ISBN: 978-1-4503-4196-7.

Links | BibTeX | Tags: fastflow, repara

Andrea Bracciali, Marco Aldinucci, Murray Patterson, Tobias Marschall, Nadia Pisanti, Ivan Merelli, Massimo Torquati

pWhatsHap: efficient haplotyping for future generation sequencing Journal Article

In: BMC Bioinformatics, vol. 17, no. Suppl 11, pp. 342, 2016.

Abstract | Links | BibTeX | Tags: fastflow, paraphrase, rephrase

@article{16:pwhatshap:bmc,

title = {pWhatsHap: efficient haplotyping for future generation sequencing},

author = {Andrea Bracciali and Marco Aldinucci and Murray Patterson and Tobias Marschall and Nadia Pisanti and Ivan Merelli and Massimo Torquati},

url = {http://bmcbioinformatics.biomedcentral.com/track/pdf/10.1186/s12859-016-1170-y?site=bmcbioinformatics.biomedcentral.com},

doi = {10.1186/s12859-016-1170-y},

year  = {2016},

date = {2016-01-01},

journal = {BMC Bioinformatics},

volume = {17},

number = {Suppl 11},

pages = {342},

abstract = {Background: Haplotype phasing is an important problem in the analysis of genomics information. Given a set of DNA fragments of an individual, it consists of determining which one of the possible alleles (alternative forms of a gene) each fragment comes from. Haplotype information is relevant to gene regulation, epigenetics, genome-wide association studies, evolutionary and population studies, and the study of mutations. Haplotyping is currently addressed as an optimisation problem aiming at solutions that minimise, for instance, error correction costs, where costs are a measure of the confidence in the accuracy of the information acquired from DNA sequencing. Solutions have typically an exponential computational complexity. WhatsHap is a recent optimal approach which moves computational complexity from DNA fragment length to fragment overlap, i.e., coverage, and is hence of particular interest when considering sequencing technology's current trends that are producing longer fragments. Results: Given the potential relevance of efficient haplotyping in several analysis pipelines, we have designed and engineered pWhatsHap, a parallel, high-performance version of WhatsHap. pWhatsHap is embedded in a toolkit developed in Python and supports genomics datasets in standard file formats. Building on WhatsHap, pWhatsHap exhibits the same complexity exploring a number of possible solutions which is exponential in the coverage of the dataset. The parallel implementation on multi-core architectures allows for a relevant reduction of the execution time for haplotyping, while the provided results enjoy the same high accuracy as that provided by WhatsHap, which increases with coverage. Conclusions: Due to its structure and management of the large datasets, the parallelisation of WhatsHap posed demanding technical challenges, which have been addressed exploiting a high-level parallel programming framework. The result, pWhatsHap, is a freely available toolkit that improves the efficiency of the analysis of genomics information.},

keywords = {fastflow, paraphrase, rephrase},

pubstate = {published},

tppubtype = {article}

}

Background: Haplotype phasing is an important problem in the analysis of genomics information. Given a set of DNA fragments of an individual, it consists of determining which one of the possible alleles (alternative forms of a gene) each fragment comes from. Haplotype information is relevant to gene regulation, epigenetics, genome-wide association studies, evolutionary and population studies, and the study of mutations. Haplotyping is currently addressed as an optimisation problem aiming at solutions that minimise, for instance, error correction costs, where costs are a measure of the confidence in the accuracy of the information acquired from DNA sequencing. Solutions have typically an exponential computational complexity. WhatsHap is a recent optimal approach which moves computational complexity from DNA fragment length to fragment overlap, i.e., coverage, and is hence of particular interest when considering sequencing technology's current trends that are producing longer fragments. Results: Given the potential relevance of efficient haplotyping in several analysis pipelines, we have designed and engineered pWhatsHap, a parallel, high-performance version of WhatsHap. pWhatsHap is embedded in a toolkit developed in Python and supports genomics datasets in standard file formats. Building on WhatsHap, pWhatsHap exhibits the same complexity exploring a number of possible solutions which is exponential in the coverage of the dataset. The parallel implementation on multi-core architectures allows for a relevant reduction of the execution time for haplotyping, while the provided results enjoy the same high accuracy as that provided by WhatsHap, which increases with coverage. Conclusions: Due to its structure and management of the large datasets, the parallelisation of WhatsHap posed demanding technical challenges, which have been addressed exploiting a high-level parallel programming framework. The result, pWhatsHap, is a freely available toolkit that improves the efficiency of the analysis of genomics information.

Fabio Tordini, Marco Aldinucci, Luciano Milanesi, Pietro Liò, Ivan Merelli

The Genome Conformation as an Integrator of Multi-Omic Data: The Example of Damage Spreading in Cancer Journal Article

In: Frontiers in Genetics, vol. 7, no. 194, pp. 1–17, 2016.

Abstract | Links | BibTeX | Tags:

@article{2016_omics_fgenetics,

title = {The Genome Conformation as an Integrator of Multi-Omic Data: The Example of Damage Spreading in Cancer},

author = {Fabio Tordini and Marco Aldinucci and Luciano Milanesi and Pietro Liò and Ivan Merelli},

url = {http://journal.frontiersin.org/article/10.3389/fgene.2016.00194},

doi = {10.3389/fgene.2016.00194},

year  = {2016},

date = {2016-01-01},

journal = {Frontiers in Genetics},

volume = {7},

number = {194},

pages = {1–17},

abstract = {Publicly available multi-omic databases, in particular if associated with medical annotations, are rich resources with the potential to lead a rapid transition from high-throughput molecular biology experiments to better clinical outcomes for patients. In this work, we propose a model for multi-omic data integration (i.e. genetic variations, gene expression, genome conformation and epigenetic patterns), which exploits a multi-layer network approach to analyse, visualize and obtain insights from such biological information, in order to use achieved results at a macroscopic level. Using this representation, we can describe how driver and passenger mutations accumulate during the development of diseases providing, for example, a tool able to characterise the evolution of cancer. Indeed, our test case concerns the MCF-7 breast cancer cell line, before and after the stimulation with estrogen, since many datasets are available for this case study. In particular, the integration of data about cancer mutations, gene functional annotations, genome conformation, epigenetic patterns, gene expression and metabolic pathways in our multi-layer representation will allow a better interpretation of the mechanisms behind a complex disease such as cancer. Thanks to this multi-layer approach, we focus on the interplay of chromatin conformation and cancer mutations in different pathways, such as metabolic processes, that are very important for tumour development. Working on this model, a variance analysis can be implemented to identify normal variations within each omics and to characterize, by contrast, variations that can be accounted to pathological samples compared to normal ones. This integrative model can be used to identify novel biomarkers and to provide innovative omic-based guidelines for treating many diseases, improving the efficacy of decision trees currently used in clinic.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Bogdan Nicolae, Carlos H. A. Costa, Claudia Misale, Kostas Katrinis, Yoonho Park

Leveraging Adaptive I/O to Optimize Collective Data Shuffling Patterns for Big Data Analytics Journal Article

In: IEEE Transactions on Parallel and Distributed Systems, vol. PP, no. 99, 2016.

Abstract | Links | BibTeX | Tags:

@article{16:shuffle:tpds:misale,

title = {Leveraging Adaptive I/O to Optimize Collective Data Shuffling Patterns for Big Data Analytics},

author = {Bogdan Nicolae and Carlos H. A. Costa and Claudia Misale and Kostas Katrinis and Yoonho Park},

url = {https://iris.unito.it/retrieve/handle/2318/1624908/295954/tpds_4aperto.pdf},

doi = {10.1109/TPDS.2016.2627558},

year  = {2016},

date = {2016-01-01},

journal = {IEEE Transactions on Parallel and Distributed Systems},

volume = {PP},

number = {99},

abstract = {Big data analytics is an indispensable tool in transforming science, engineering, medicine, health-care, finance and ultimately business itself. With the explosion of data sizes and need for shorter time-to-solution, in-memory platforms such as Apache Spark gain increasing popularity. In this context, data shuffling, a particularly difficult transformation pattern, introduces important challenges. Specifically, data shuffling is a key component of complex computations that has a major impact on the overall performance and scalability. Thus, speeding up data shuffling is a critical goal. To this end, state-of-the-art solutions often rely on overlapping the data transfers with the shuffling phase. However, they employ simple mechanisms to decide how much data and where to fetch it from, which leads to sub-optimal performance and excessive auxiliary memory utilization for the purpose of prefetching. The latter aspect is a growing concern, given evidence that memory per computation unit is continuously decreasing while interconnect bandwidth is increasing. This paper contributes a novel shuffle data transfer strategy that addresses the two aforementioned dimensions by dynamically adapting the prefetching to the computation. We implemented this novel strategy in Spark, a popular in-memory data analytics framework. To demonstrate the benefits of our proposal, we run extensive experiments on an HPC cluster with large core count per node. Compared with the default Spark shuffle strategy, our proposal shows: up to 40% better performance with 50% less memory utilization for buffering and excellent weak scalability.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Ilaria Castellani, Mariangiola Dezani-Ciancaglini, Ugo Liguoro

Secure Multiparty Sessions with Topics Proceedings Article

In: PLACES'16, pp. 1–12, 2016.

Links | BibTeX | Tags: rephrase, semantics

Ilaria Castellani, Mariangiola Dezani-Ciancaglini, Jorge A. Pérez

Self-Adaptation and Secure Information Flow in Multiparty Communications Journal Article

In: Formal Aspects of Computing, vol. 28, no. 4, pp. 669–696, 2016.

Links | BibTeX | Tags: rephrase, semantics

Mario Coppo, Mariangiola Dezani-Ciancaglini, Betti Venneri

Parallel Monitors for Self-adaptive Sessions Proceedings Article

In: PLACES'16, pp. 25–36, 2016.

Links | BibTeX | Tags: rephrase, semantics

Mariangiola Dezani-Ciancaglini, Silvia Ghilezan, Svetlana Jaksic, Jovanka Pantovic, Nobuko Yoshida

Denotational and Operational Preciseness of Subtyping: A Roadmap Proceedings Article

In: Theory and Practice of Formal Methods, pp. 155–172, 2016.

Links | BibTeX | Tags: rephrase, semantics

Mariangiola Dezani-Ciancaglini, Paola Giannini

Reversible Multiparty Sessions with Checkpoints Proceedings Article

In: EXPRESS/SOS'16, pp. 60–74, 2016.

Links | BibTeX | Tags: rephrase, semantics

2015

Paolo Inaudi

Progettazione e sviluppo di un provider libfabric per la rete ad alte prestazioni Ronniee/A3Cube Masters Thesis

Computer Science Department, University of Torino, 2015.

BibTeX | Tags: fastflow

Marco Aldinucci, Marco Danelutto, Maurizio Drocco, Peter Kilpatrick, Guilherme Peretti Pezzi, Massimo Torquati

The Loop-of-Stencil-Reduce paradigm Proceedings Article

In: Proc. of Intl. Workshop on Reengineering for Parallelism in Heterogeneous Parallel Platforms (RePara), pp. 172–177, IEEE, Helsinki, Finland, 2015.

Abstract | Links | BibTeX | Tags: fastflow, HPC, repara

Fabio Tordini, Maurizio Drocco, Ivan Merelli, Luciano Milanesi, Pietro Liò, Marco Aldinucci

NuChart-II: a graph-based approach for the analysis and interpretation of Hi-C data Proceedings Article

In: Serio, Clelia Di, Liò, Pietro, Nonis, Alessandro, Tagliaferri, Roberto (Ed.): Proc. of 11th Intl. Meeting on Computational Intelligence Methods for Bioinformatics and Biostatistics (CIBB), pp. 298–311, Springer, Cambridge, UK, 2015, ISBN: 978-3-319-24461-7.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow, paraphrase, repara

@inproceedings{14:ff:nuchart:cibb,

title = {NuChart-II: a graph-based approach for the analysis and interpretation of Hi-C data},

author = {Fabio Tordini and Maurizio Drocco and Ivan Merelli and Luciano Milanesi and Pietro Liò and Marco Aldinucci},

editor = {Clelia Di Serio and Pietro Liò and Alessandro Nonis and Roberto Tagliaferri},

url = {http://calvados.di.unipi.it/storage/paper_files/2014_nuchart_cibb.pdf},

doi = {10.1007/978-3-319-24462-4_25},

isbn = {978-3-319-24461-7},

year  = {2015},

date = {2015-06-01},

booktitle = {Proc. of 11th Intl. Meeting on Computational Intelligence Methods for Bioinformatics and Biostatistics (CIBB)},

volume = {8623},

pages = {298–311},

publisher = {Springer},

address = {Cambridge, UK},

series = {LNCS},

abstract = {Long-range chromosomal associations between genomic regions, and their repositioning in the 3D space of the nucleus, are now considered to be key contributors to the regulation of gene expressions, and important links have been highlighted with other genomic features involved in DNA rearrangements. Recent Chromosome Conformation Capture (3C) measurements performed with high throughput sequencing (Hi-C) and molecular dynamics studies show that there is a large correlation between co-localization and co-regulation of genes, but these important researches are hampered by the lack of biologists-friendly analysis and visualisation software. In this work we present NuChart-II, a software that allows the user to annotate and visualize a list of input genes with information relying on Hi-C data, integrating knowledge data about genomic features that are involved in the chromosome spatial organization. This software works directly with sequenced reads to identify related Hi-C fragments, with the aim of creating gene-centric neighbourhood graphs on which multi-omics features can be mapped. NuChart-II is a highly optimized implementation of a previous prototype package developed in R, in which the graph-based representation of Hi-C data was tested. The prototype showed inevitable problems of scalability while working genome-wide on large datasets: particular attention has been paid in optimizing the data structures employed while constructing the neighbourhood graph, so as to foster an efficient parallel implementation of the software. The normalization of Hi-C data has been modified and improved, in order to provide a reliable estimation of proximity likelihood for the genes.},

keywords = {bioinformatics, fastflow, paraphrase, repara},

pubstate = {published},

tppubtype = {inproceedings}

}

Maurizio Drocco, Claudia Misale, Guilherme Peretti Pezzi, Fabio Tordini, Marco Aldinucci

Memory-Optimised Parallel Processing of Hi-C Data Proceedings Article

In: Proc. of 23rd Euromicro Intl. Conference on Parallel Distributed and network-based Processing (PDP), pp. 1–8, IEEE, 2015.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow, impact, paraphrase, repara

Fabio Tordini, Maurizio Drocco, Claudia Misale, Luciano Milanesi, Pietro Liò, Ivan Merelli, Marco Aldinucci

Parallel Exploration of the Nuclear Chromosome Conformation with NuChart-II Proceedings Article

In: Proc. of 23rd Euromicro Intl. Conference on Parallel Distributed and network-based Processing (PDP), IEEE, 2015.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow, impact, paraphrase, repara

Paolo Viviani

Parallel Computing Techniques for High Energy Physics Masters Thesis

Physics Department, University of Torino, 2015.

Abstract | BibTeX | Tags: fastflow, impact

Ivan Merelli, Fabio Tordini, Maurizio Drocco, Marco Aldinucci, Pietro Liò, Luciano Milanesi

Integrating Multi-omic features exploiting Chromosome Conformation Capture data Journal Article

In: Frontiers in Genetics, vol. 6, no. 40, 2015, ISSN: 1664-8021.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow

@article{nuchart:frontiers:15,

title = {Integrating Multi-omic features exploiting Chromosome Conformation Capture data},

author = {Ivan Merelli and Fabio Tordini and Maurizio Drocco and Marco Aldinucci and Pietro Liò and Luciano Milanesi},

url = {http://journal.frontiersin.org/Journal/10.3389/fgene.2015.00040/pdf},

doi = {10.3389/fgene.2015.00040},

issn = {1664-8021},

year  = {2015},

date = {2015-01-01},

journal = {Frontiers in Genetics},

volume = {6},

number = {40},

abstract = {The representation, integration and interpretation of omic data is a complex task, in particular considering the huge amount of information that is daily produced in molecular biology laboratories all around the world. The reason is that sequencing data regarding expression profiles, methylation patterns, and chromatin domains is difficult to harmonize in a systems biology view, since genome browsers only allow coordinate-based representations, discarding functional clusters created by the spatial conformation of the DNA in the nucleus. In this context, recent progresses in high throughput molecular biology techniques and bioinformatics have provided insights into chromatin interactions on a larger scale and offer a formidable support for the interpretation of multi-omic data. In particular, a novel sequencing technique called Chromosome Conformation Capture (3C) allows the analysis of the chromosome organization in the cell's natural state. While performed genome wide, this technique is usually called Hi-C. Inspired by service applications such as Google Maps, we developed NuChart, an R package that integrates Hi-C data to describe the chromosomal neighbourhood starting from the information about gene positions, with the possibility of mapping on the achieved graphs genomic features such as methylation patterns and histone modifications, along with expression profiles. In this paper we show the importance of the NuChart application for the integration of multi-omic data in a systems biology fashion, with particular interest in cytogenetic applications of these techniques. Moreover, we demonstrate how the integration of multi-omic data can provide useful information in understanding why genes are in certain specific positions inside the nucleus and how epigenetic patterns correlate with their expression.},

keywords = {bioinformatics, fastflow},

pubstate = {published},

tppubtype = {article}

}

Marco Aldinucci, Andrea Bracciali, Tobias Marschall, Murray Patterson, Nadia Pisanti, Massimo Torquati

High-Performance Haplotype Assembly Proceedings Article

In: Serio, Clelia Di, Liò, Pietro, Nonis, Alessandro, Tagliaferri, Roberto (Ed.): Computational Intelligence Methods for Bioinformatics and Biostatistics - 11th International Meeting, CIBB 2014, Cambridge, UK, June 26-28, 2014, Revised Selected Papers, pp. 245–258, Springer, Cambridge, UK, 2015.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow

Marco Aldinucci, Guilherme Peretti Pezzi, Maurizio Drocco, Concetto Spampinato, Massimo Torquati

Parallel Visual Data Restoration on Multi-GPGPUs using Stencil-Reduce Pattern Journal Article

In: International Journal of High Performance Computing Applications, vol. 29, no. 4, pp. 461–472, 2015.

Abstract | Links | BibTeX | Tags: fastflow, HPC, impact, paraphrase

Guilherme Peretti Pezzi, Evelyne Vaissié, Yann Viala, Denis Caromel, Philippe Gourbesville

Parallel profiling of water distribution networks using the Clément formula Journal Article

In: Applied Mathematics and Computation, vol. 267, pp. 83–95, 2015, ISSN: 0096-3003, (The Fourth European Seminar on Computing (ESCO 2014)).

Abstract | Links | BibTeX | Tags: impact

Mohammed Sourouri, Johannes Langguth, Filippo Spiga, Scott B. Baden, Xing Cai

CPU+GPU Programming of Stencil Computations for Resource-Efficient Use of GPU Clusters Proceedings Article

In: 18th IEEE International Conference on Computational Science and Engineering, CSE 2015, Porto, Portugal, October 21-23, 2015, pp. 17–26, 2015.

Links | BibTeX | Tags:

Jens Glaser, Trung Dac Nguyen, Joshua A. Anderson, Pak Lui, Filippo Spiga, Jaime A. Millan, David C. Morse, Sharon C. Glotzer

Strong scaling of general-purpose molecular dynamics simulations on GPUs Journal Article

In: Computer Physics Communications, vol. 192, pp. 97–107, 2015.

Links | BibTeX | Tags:

2014

Michael G. Epitropakis, Andrea Bracciali, Marco Aldinucci, Emily Potts, Edmund K. Burke

Predictive scheduling for optimal cloud configuration Proceedings Article

In: Özcan, Ender, Burke, Edmund K., MCCollum, Barry (Ed.): Proc. of 10th Intl. Conference on the Practice and Theory of Automated Timetabling, PATAT, York, United Kingdom, 2014.

Links | BibTeX | Tags:

Marco Aldinucci, Sonia Campa, Marco Danelutto, Peter Kilpatrick, Massimo Torquati

Pool evolution: a domain specific parallel pattern Proceedings Article

In: Proc.of the 7th Intl. Symposium on High-level Parallel Programming and Applications (HLPP), Amsterdam, The Netherlands, 2014.

Abstract | Links | BibTeX | Tags: fastflow, paraphrase, repara

Marco Aldinucci, Massimo Torquati, Maurizio Drocco, Guilherme Peretti Pezzi, Concetto Spampinato

FastFlow: Combining Pattern-Level Abstraction and Efficiency in GPGPUs Proceedings Article

In: GPU Technology Conference (GTC), San Jose, CA, USA, 2014.

Abstract | Links | BibTeX | Tags: fastflow, HPC, impact, paraphrase

Marco Aldinucci, Massimo Torquati, Maurizio Drocco, Guilherme Peretti Pezzi, Concetto Spampinato

An Overview of FastFlow: Combining Pattern-Level Abstraction and Efficiency in GPGPUs Proceedings Article

In: GPU Technology Conference (GTC), San Jose, CA, USA, 2014.

Abstract | Links | BibTeX | Tags: fastflow, HPC, impact, paraphrase

Daniele Buono, Marco Danelutto, Tiziano De Matteis, Gabriele Mencagli, Massimo Torquati

A Lightweight Run-Time Support For Fast Dense Linear Algebra on Multi-Core Proceedings Article

In: Proc. of the 12th International Conference on Parallel and Distributed Computing and Networks (PDCN 2014), IASTED, ACTA press, 2014.

BibTeX | Tags: fastflow

Marco Aldinucci, Massimo Torquati, Concetto Spampinato, Maurizio Drocco, Claudia Misale, Cristina Calcagno, Mario Coppo

Parallel stochastic systems biology in the cloud Journal Article

In: Briefings in Bioinformatics, vol. 15, no. 5, pp. 798–813, 2014, ISSN: 1467-5463.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow, impact, paraphrase

Concetto Spampinato, Isaak Kavasidis, Marco Aldinucci, Carmelo Pino, Daniela Giordano, Alberto Faro

Discovering Biological Knowledge by Integrating High Throughput Data and Scientific Literature on the Cloud Journal Article

In: Concurrency and Computation: Practice and Experience, vol. 26, no. 10, pp. 1771–1786, 2014.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Sonia Campa, Marco Danelutto, Peter Kilpatrick, Massimo Torquati

Design patterns percolating to parallel programming framework implementation Journal Article

In: International Journal of Parallel Programming, vol. 42, no. 6, pp. 1012–1031, 2014, ISSN: 0885-7458.

Abstract | Links | BibTeX | Tags: fastflow, paraphrase

Marco Aldinucci, Salvatore Ruggieri, Massimo Torquati

Decision Tree Building on Multi-Core using FastFlow Journal Article

In: Concurrency and Computation: Practice and Experience, vol. 26, no. 3, pp. 800–820, 2014.

Abstract | Links | BibTeX | Tags: fastflow, paraphrase

Guilherme Peretti Pezzi, Evelyne Vaissié, Yann Viala, Denis Caromel, Philippe Gourbesville

Parallel Profiling of Water Distribution Networks Using the Clément Formula Proceedings Article

In: 4th European Seminar on Computing, 2014.

Abstract | BibTeX | Tags: impact

Marco Aldinucci, Cristina Calcagno, Mario Coppo, Ferruccio Damiani, Maurizio Drocco, Eva Sciacca, Salvatore Spinella, Massimo Torquati, Angelo Troina

On designing multicore-aware simulators for systems biology endowed with on-line statistics Journal Article

In: BioMed Research International, 2014.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow, paraphrase

Marco Aldinucci, Maurizio Drocco, Guilherme Peretti Pezzi, Claudia Misale, Fabio Tordini, Massimo Torquati

Exercising high-level parallel programming on streams: a systems biology use case Proceedings Article

In: Proc. of 34th IEEE Intl. Conference on Distributed Computing Systems Workshops (ICDCSW), IEEE, Madrid, Spain, 2014.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow, impact, paraphrase

@inproceedings{cwc:gpu:dcperf:14,

title = {Exercising high-level parallel programming on streams: a systems biology use case},

author = {Marco Aldinucci and Maurizio Drocco and Guilherme Peretti Pezzi and Claudia Misale and Fabio Tordini and Massimo Torquati},

url = {https://iris.unito.it/retrieve/handle/2318/154516/26657/2014_dcperf_cwc_gpu.pdf},

doi = {10.1109/ICDCSW.2014.38},

year  = {2014},

date = {2014-01-01},

booktitle = {Proc. of 34th IEEE Intl. Conference on Distributed Computing Systems Workshops (ICDCSW)},

publisher = {IEEE},

address = {Madrid, Spain},

abstract = {The stochastic modelling of biological systems, cou- pled with Monte Carlo simulation of models, is an increasingly popular technique in Bioinformatics. The simulation-analysis workflow may result into a computationally expensive task reducing the interactivity required in the model tuning. In this work, we advocate high-level software design as a vehicle for building efficient and portable parallel simulators for a variety of platforms, ranging from multi-core platforms to GPGPUs to cloud. In particular, the Calculus of Wrapped Compartments (CWC) parallel simulator for systems biology equipped with on- line mining of results, which is designed according to the FastFlow pattern-based approach, is discussed as a running example. In this work, the CWC simulator is used as a paradigmatic example of a complex C++ application where the quality of results is correlated with both computation and I/O bounds, and where high-quality results might turn into big data. The FastFlow parallel programming framework, which advocates C++ pattern- based parallel programming makes it possible to develop portable parallel code without relinquish neither run-time efficiency nor performance tuning opportunities. Performance and effectiveness of the approach are validated on a variety of platforms, inter-alia cache-coherent multi-cores, cluster of multi-core (Ethernet and Infiniband) and the Amazon Elastic Compute Cloud.},

keywords = {bioinformatics, fastflow, impact, paraphrase},

pubstate = {published},

tppubtype = {inproceedings}

}

Marco Aldinucci, Guilherme Peretti Pezzi, Maurizio Drocco, Fabio Tordini, Peter Kilpatrick, Massimo Torquati

Parallel video denoising on heterogeneous platforms Proceedings Article

In: Proc. of Intl. Workshop on High-level Programming for Heterogeneous and Hierarchical Parallel Systems (HLPGPU), 2014.

Abstract | Links | BibTeX | Tags: fastflow, impact, paraphrase

Claudia Misale

Accelerating Bowtie2 with a lock-less concurrency approach and memory affinity Proceedings Article

In: Aldinucci, Marco, D'Agostino, Daniele, Kilpatrick, Peter (Ed.): Proc. of Intl. Euromicro PDP 2014: Parallel Distributed and network-based Processing, IEEE, Torino, Italy, 2014, ((Best paper award)).

Abstract | Links | BibTeX | Tags: fastflow, paraphrase

@inproceedings{ff:bowtie2:pdp:14,

title = {Accelerating Bowtie2 with a lock-less concurrency approach and memory affinity},

author = {Claudia Misale},

editor = {Marco Aldinucci and Daniele D'Agostino and Peter Kilpatrick},

url = {http://calvados.di.unipi.it/storage/paper_files/2014_pdp_bowtieff.pdf},

doi = {10.1109/PDP.2014.50},

year  = {2014},

date = {2014-01-01},

booktitle = {Proc. of Intl. Euromicro PDP 2014: Parallel Distributed and network-based Processing},

publisher = {IEEE},

address = {Torino, Italy},

abstract = {The implementation of DNA alignment tools for Bioinformatics lead to face different problems that dip into performances. A single alignment takes an amount of time that is not predictable and there are different factors that can affect performances, for instance the length of sequences can determine the computational grain of the task and mismatches or insertion/deletion (indels) increase time needed to complete an alignment. Moreover, an alignment is a strong memory- bound problem because of the irregular memory access pat- terns and limitations in memory-bandwidth. Over the years, many alignment tools were implemented. A concrete example is Bowtie2, one of the fastest (concurrent, Pthread-based) and state of the art not GPU-based alignment tool. Bowtie2 exploits concurrency by instantiating a pool of threads, which have access to a global input dataset, share the reference genome and have access to different objects for collecting alignment results. In this paper a modified implementation of Bowtie2 is presented, in which the concurrency structure has been changed. The proposed implementation exploits the task-farm skeleton pattern implemented as a Master-Worker. The Master-Worker pattern permits to delegate only to the Master thread dataset reading and to make private to each Worker data structures that are shared in the original version. Only the reference genome is left shared. As a further optimisation, the Master and each Worker were pinned on cores and the reference genome was allocated interleaved among memory nodes. The proposed implementation is able to gain up to 10 speedup points over the original implementation.},

note = {(Best paper award)},

keywords = {fastflow, paraphrase},

pubstate = {published},

tppubtype = {inproceedings}

}

Alessandro Secco, Irfan Uddin, Guilherme Peretti Pezzi, Massimo Torquati

Message passing on InfiniBand RDMA for parallel run-time supports Proceedings Article

In: Aldinucci, Marco, D'Agostino, Daniele, Kilpatrick, Peter (Ed.): Proc. of Intl. Euromicro PDP 2014: Parallel Distributed and network-based Processing, IEEE, Torino, Italy, 2014.

Abstract | Links | BibTeX | Tags: fastflow, impact, paraphrase

Maurizio Drocco, Marco Aldinucci, Massimo Torquati

A Dynamic Memory Allocator for heterogeneous platforms Proceedings Article

In: Advanced Computer Architecture and Compilation for High-Performance and Embedded Systems (ACACES) – Poster Abstracts, HiPEAC, Fiuggi, Italy, 2014.

Abstract | Links | BibTeX | Tags: fastflow, HPC

Claudia Misale, Giulio Ferrero, Massimo Torquati, Marco Aldinucci

Sequence alignment tools: one parallel pattern to rule them all? Journal Article

In: BioMed Research International, 2014.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow, paraphrase, repara

2013

Maurizio Drocco

Parallel stochastic simulators in systems biology: the evolution of the species Masters Thesis

Computer Science Department, University of Torino, Italy, 2013.

Abstract | Links | BibTeX | Tags: fastflow

Marco Aldinucci, Fabio Tordini, Maurizio Drocco, Massimo Torquati, Mario Coppo

Parallel stochastic simulators in system biology: the evolution of the species Proceedings Article

In: Proc. of 21st Euromicro Intl. Conference on Parallel Distributed and network-based Processing (PDP), IEEE, Belfast, Nothern Ireland, U.K., 2013.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow

Claudia Misale, Marco Aldinucci, Massimo Torquati

Memory affinity in multi-threading: the Bowtie2 case study Proceedings Article

In: Advanced Computer Architecture and Compilation for High-Performance and Embedded Systems (ACACES) – Poster Abstracts, HiPEAC, Fiuggi, Italy, 2013, ISBN: 9789038221908.

Abstract | Links | BibTeX | Tags: fastflow

@inproceedings{ff:acaces:13,

title = {Memory affinity in multi-threading: the Bowtie2 case study},

author = {Claudia Misale and Marco Aldinucci and Massimo Torquati},

url = {https://iris.unito.it/retrieve/handle/2318/143005/23874/2013_ACACES_ex-abstract.pdf},

isbn = {9789038221908},

year  = {2013},

date = {2013-01-01},

booktitle = {Advanced Computer Architecture and Compilation for High-Performance and Embedded Systems (ACACES) – Poster Abstracts},

publisher = {HiPEAC},

address = {Fiuggi, Italy},

abstract = {The diffusion of the Next Generation Sequencing (NGS) has increased the amount of data obtainable by genomic experiments. From a DNA sample a NGS run is able to produce millions of short sequences (called reads), which should be mapped into a reference genome. In this paper, we analyse the performance of Bowtie2, a fast and popular DNA mapping tool. Bowtie2 exhibits a multithreading implementation on top of pthreads, spin-locks and SSE2 SIMD extension. From parallel computing viewpoint, is a paradigmatic example of a software requiring to address three fundamental problems in shared-memory programming for cache-coherent multi-core platforms: synchronisation efficiency at very fine grain (due to short reads), load-balancing (due to long reads), and efficient usage of memory subsystem (due to SSE2 memory pressure). We compare the original implementation against an alternative implementation on top of the FastFlow pattern-based programming framework. The proposed design exploits the high-level farm pattern of FastFlow, which is implemented top of nonblocking multi-threading and lock-less (CAS-free) queues, and provides the programmer with high-level mechanism to tune task scheduling to achieve both load-balancing and memory affinity. The proposed design, despite the high-level design, is always faster and more scalable with respect to the original one. The design of both original and alternative version will be presented along with their experimental evaluation on real-world data sets.},

keywords = {fastflow},

pubstate = {published},

tppubtype = {inproceedings}

}

Marco Aldinucci, Marco Danelutto, Peter Kilpatrick, Carlo Montangero, Laura Semini

Managing Adaptivity in Parallel Systems Book Section

In: Beckert, Bernhard, Damiani, Ferruccio, Boer, Frank S., Bonsangue, Marcello M. (Ed.): Formal Methods for Components and Objects: Intl. Symposium, FMCO 2011, Torino, Italy, October 3-5, 2011, Revised Invited Lectures, vol. 7542, pp. 199–217, Springer, 2013, ISBN: 978-3-642-35886-9.

Abstract | Links | BibTeX | Tags: paraphrase

Marco Aldinucci, Sonia Campa, Peter Kilpatrick, Massimo Torquati

Structured Data Access Annotations for Massively Parallel Computations Proceedings Article

In: Euro-Par 2012 Workshops, Proc. of the ParaPhrase Workshop on Parallel Processing, pp. 381–390, Springer, 2013.

Abstract | Links | BibTeX | Tags: fastflow, paraphrase

Marco Aldinucci, Sonia Campa, Marco Danelutto, Peter Kilpatrick, Massimo Torquati

Targeting Distributed Systems in FastFlow Proceedings Article

In: Euro-Par 2012 Workshops, Proc. of the CoreGrid Workshop on Grids, Clouds and P2P Computing, pp. 47–56, Springer, 2013.

Abstract | Links | BibTeX | Tags: fastflow, paraphrase

Kevin Hammond, Marco Aldinucci, Chris Brown, Francesco Cesarini, Marco Danelutto, Horacio González-Vélez, Peter Kilpatrick, Rainer Keller, Michael Rossbory, Gilad Shainer

The ParaPhrase Project: Parallel Patterns for Adaptive Heterogeneous Multicore Systems Book Section

In: Beckert, Bernhard, Damiani, Ferruccio, Boer, Frank S., Bonsangue, Marcello M. (Ed.): Formal Methods for Components and Objects: Intl. Symposium, FMCO 2011, Torino, Italy, October 3-5, 2011, Revised Invited Lectures, vol. 7542, pp. 218–236, Springer, 2013, ISBN: 978-3-642-35886-9.

Abstract | Links | BibTeX | Tags: paraphrase

@incollection{paraphrase:fmco:11,

title = {The ParaPhrase Project: Parallel Patterns for Adaptive Heterogeneous Multicore Systems},

author = {Kevin Hammond and Marco Aldinucci and Chris Brown and Francesco Cesarini and Marco Danelutto and Horacio González-Vélez and Peter Kilpatrick and Rainer Keller and Michael Rossbory and Gilad Shainer},

editor = {Bernhard Beckert and Ferruccio Damiani and Frank S. Boer and Marcello M. Bonsangue},

url = {http://calvados.di.unipi.it/storage/paper_files/2013_fmco11_paraphrase.pdf},

doi = {10.1007/978-3-642-35887-6_12},

isbn = {978-3-642-35886-9},

year  = {2013},

date = {2013-01-01},

booktitle = {Formal Methods for Components and Objects: Intl. Symposium, FMCO 2011, Torino, Italy, October 3-5, 2011, Revised Invited Lectures},

volume = {7542},

pages = {218–236},

publisher = {Springer},

series = {LNCS},

abstract = {This paper describes the ParaPhrase project, a new 3-year targeted research project funded under EU Framework 7 Objective 3.4 (Computer Systems), starting in October 2011. ParaPhrase aims to follow a new approach to introducing parallelism using advanced refactoring techniques coupled with high-level parallel design patterns. The refactoring approach will use these design patterns to restructure programs defined as networks of software components into other forms that are more suited to parallel execution. The programmer will be aided by high-level cost information that will be integrated into the refactoring tools. The implementation of these patterns will then use a well-understood algorithmic skeleton approach to achieve good parallelism. A key ParaPhrase design goal is that parallel components are intended to match heterogeneous architectures, defined in terms of CPU/GPU combinations, for example. In order to achieve this, the ParaPhrase approach will map components at link time to the available hardware, and will then re-map them during program execution, taking account of multiple applications, changes in hardware resource availability, the desire to reduce communication costs etc. In this way, we aim to develop a new approach to programming that will be able to produce software that can adapt to dynamic changes in the system environment. Moreover, by using a strong component basis for parallelism, we can achieve potentially significant gains in terms of reducing sharing at a high level of abstraction, and so in reducing or even eliminating the costs that are usually associated with cache management, locking, and synchronisation.},

keywords = {paraphrase},

pubstate = {published},

tppubtype = {incollection}

}

Marco Aldinucci, Sonia Campa, Fabio Tordini, Massimo Torquati, Peter Kilpatrick

An abstract annotation model for skeletons Book Section

In: Beckert, Bernhard, Damiani, Ferruccio, Boer, Frank S., Bonsangue, Marcello M. (Ed.): Formal Methods for Components and Objects: Intl. Symposium, FMCO 2011, Torino, Italy, October 3-5, 2011, Revised Invited Lectures, vol. 7542, pp. 257–276, Springer, 2013, ISBN: 978-3-642-35886-9.

Abstract | Links | BibTeX | Tags: fastflow, paraphrase

2012

Marco Aldinucci, Concetto Spampinato, Maurizio Drocco, Massimo Torquati, Simone Palazzo

A Parallel Edge Preserving Algorithm for Salt and Pepper Image Denoising Proceedings Article

In: Djemal, K., Deriche, M., Puech, W., Ucan, Osman N. (Ed.): Proc. of 2nd Intl. Conference on Image Processing Theory Tools and Applications (IPTA), pp. 97–102, IEEE, Istambul, Turkey, 2012, ISBN: 978-1-4673-2582-0.

Abstract | Links | BibTeX | Tags: fastflow, impact

Marco Aldinucci, Marco Danelutto, Peter Kilpatrick, Massimiliano Meneghin, Massimo Torquati

An Efficient Unbounded Lock-Free Queue for Multi-core Systems Proceedings Article

In: Proc. of 18th Intl. Euro-Par 2012 Parallel Processing, pp. 662–673, Springer, Rhodes Island, Greece, 2012.

Abstract | Links | BibTeX | Tags: fastflow, paraphrase

Marco Aldinucci, Marco Danelutto, Peter Kilpatrick, Massimo Torquati

Targeting heterogeneous architectures via macro data flow Journal Article

In: Parallel Processing Letters, vol. 22, no. 2, 2012, ISSN: 0129-6264.

Abstract | Links | BibTeX | Tags: fastflow, paraphrase

Marco Aldinucci, Marco Danelutto, Massimo Torquati

FastFlow tutorial Technical Report

Università di Pisa, Dipartimento di Informatica, Italy no. TR-12-04, 2012.

Links | BibTeX | Tags: fastflow

Massimo Torquati, Marco Vanneschi, Mehdi Amini, S. Guelton, Ronan Keryell, Vincent Lanore, F. -X. Pasquier, Michel Barreteau, R. Barrere, T. Petrisor, Eric Lenormand, C. Cantini, F. De Stefani

An innovative compilation tool-chain for embedded multi-core architectures Proceedings Article

In: Embedded World Conference, Nuremberg, Germany, 2012.

BibTeX | Tags:

Marco Aldinucci, Marco Danelutto, Lorenzo Anardu, Massimo Torquati, Peter Kilpatrick

Parallel patterns + Macro Data Flow for multi-core programming Proceedings Article

In: Proc. of Intl. Euromicro PDP 2012: Parallel Distributed and network-based Processing, pp. 27–36, IEEE, Garching, Germany, 2012.

Abstract | Links | BibTeX | Tags: fastflow

Mario Coppo, Ferruccio Damiani, Maurizio Drocco, Elena Grassi, Eva Sciacca, Salvatore Spinella, Angelo Troina

Simulation techniques for the calculus of wrapped compartments Journal Article

In: Theoretical Computer Science, vol. 431, pp. 75–95, 2012.

Abstract | Links | BibTeX | Tags: semantics

@article{DBLP:journals/tcs/CoppoDDGSST12,

title = {Simulation techniques for the calculus of wrapped compartments},

author = {Mario Coppo and Ferruccio Damiani and Maurizio Drocco and Elena Grassi and Eva Sciacca and Salvatore Spinella and Angelo Troina},

doi = {10.1016/j.tcs.2011.12.063},

year  = {2012},

date = {2012-01-01},

journal = {Theoretical Computer Science},

volume = {431},

pages = {75–95},

abstract = {The modelling and analysis of biological systems has deep roots in Mathematics, specifically in the field of Ordinary Differential Equations (ODEs). Alternative approaches based on formal calculi, often derived from process algebras or term rewriting systems, provide a quite complementary way to analyse the behaviour of biological systems. These calculi allow to cope in a natural way with notions like compartments and membranes, which are not easy (sometimes impossible) to handle with purely numerical approaches, and are often based on stochastic simulation methods. Recently, it has also become evident that stochastic effects in regulatory networks play a crucial role in the analysis of such systems. Actually, in many situations it is necessary to use stochastic models. For example when the system to be described is based on the interaction of few molecules, when we are at the presence of a chemical instability, or when we want to simulate the functioning of a pool of entities whose compartmentalised structure evolves dynamically. In contrast, stable metabolic networks, involving a large number of reagents, for which the computational cost of a stochastic simulation becomes an insurmountable obstacle, are efficiently modelled with ODEs. In this paper we define a hybrid simulation method, combining the stochastic approach with ODEs, for systems described in the Calculus of Wrapped Compartments (CWC), a calculus on which we can express the compartmentalisation of a biological system whose evolution is defined by a set of rewrite rules.},

keywords = {semantics},

pubstate = {published},

tppubtype = {article}

}

Marco Aldinucci, Mario Coppo, Ferruccio Damiani, Maurizio Drocco, Eva Sciacca, Salvatore Spinella, Massimo Torquati, Angelo Troina

On Parallelizing On-Line Statistics for Stochastic Biological Simulations Proceedings Article

In: Alexander, Michael, D'Ambra, Pasqua, Belloum, Adam, Bosilca, George, Cannataro, Mario, Danelutto, Marco, Martino, Beniamino Di, Gerndt, Michael, Jeannot, Emmanuel, Namyst, Raymond, Roman, Jean, Scott, Stephen L., Träff, Jesper Larsson, Vallée, Geoffroy, Weidendorfer, Josef (Ed.): Proc. of Euro-Par Workshops: 2nd Workshop on High Performance Bioinformatics and Biomedicine (HiBB), pp. 3–12, Springer, Bordeaux, France, 2012.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow

Fabio Tordini, Marco Aldinucci, Massimo Torquati

High-level lock-less programming for multicore Proceedings Article

In: Advanced Computer Architecture and Compilation for High-Performance and Embedded Systems (ACACES) – Poster Abstracts, HiPEAC, Fiuggi, Italy, 2012, ISBN: 9789038219875.

Abstract | Links | BibTeX | Tags: fastflow

Thomas Weigold, Marco Aldinucci, Marco Danelutto, Vladimir Getov

Process-Driven Biometric Identification by means of Autonomic Grid Components Journal Article

In: Int. J. of Autonomous and Adaptive Communications Systems, vol. 5, no. 3, pp. 274–291, 2012, ISSN: 1754-8632.

Abstract | Links | BibTeX | Tags:

Filippo Spiga, Ivan Girotto

phiGEMM: A CPU-GPU Library for Porting Quantum ESPRESSO on Hybrid Systems Proceedings Article

In: Proceedings of the 20th Euromicro International Conference on Parallel, Distributed and Network-Based Processing, PDP 2012, Munich, Germany, February 15-17, 2012, pp. 368–375, 2012.

Links | BibTeX | Tags:

2011

C Grandi, B Bockelman, D Bonacorsi, I Fisk, Isidro González Caballero, Fabio Farina, M Hernández, Sanjay Padhi, Sabyasachi Sarkar, Andrea Sciabà, I Sfiligoi, F Spiga, M Úbeda García, D C Van Der Ster, Marian Zvada

CMS Distributed Computing Integration in the LHC sustained operations era Journal Article

In: vol. 331, pp. 062032, 2011.

BibTeX | Tags:

Cristina Calcagno, Mario Coppo, Ferruccio Damiani, Maurizio Drocco, Eva Sciacca, Salvatore Spinella, Angelo Troina

Modelling Spatial Interactions in the Arbuscular Mycorrhizal Symbiosis using the Calculus of Wrapped Compartments Proceedings Article

In: Petre, Ion, Vink, Erik P. (Ed.): Proc. of 3rd Intl. Workshop on Computational Models for Cell Processes (CompMod), pp. 3–18, Aachen, Germany, 2011.

Abstract | BibTeX | Tags:

Marco Aldinucci, Marco Danelutto, Peter Kilpatrick, Vamir Xhagjika

LIBERO: a framework for autonomic management of multiple non-functional concerns Proceedings Article

In: Guarracino, M. R., Vivien, F., Träff, J. L., Cannataro, M., Danelutto, M., Hast, A., Perla, F., Knüpfer, A., Martino, B. Di, Alexander, M. (Ed.): Euro-Par 2010 Workshops, Proc. of the CoreGrid Workshop on Grids, Clouds and P2P Computing, pp. 237–245, Springer, Ischia, Italy, 2011.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Lorenzo Anardu, Marco Danelutto, Peter Kilpatrick, Massimo Torquati

Targeting multi cores by structured programming and data flow Technical Report

Università di Pisa, Dipartimento di Informatica, Italy no. TR-11-13, 2011.

Links | BibTeX | Tags:

Marco Aldinucci, Marco Danelutto, Peter Kilpatrick, Massimiliano Meneghin, Massimo Torquati

Accelerating code on multi-cores with FastFlow Proceedings Article

In: Jeannot, E., Namyst, R., Roman, J. (Ed.): Proc. of 17th Intl. Euro-Par 2011 Parallel Processing, pp. 170–181, Springer, Bordeaux, France, 2011.

Abstract | Links | BibTeX | Tags: fastflow

Marco Aldinucci, Andrea Bracciali, Pietro Liò, Anil Sorathiya, Massimo Torquati

StochKit-FF: Efficient Systems Biology on Multicore Architectures Proceedings Article

In: Guarracino, M. R., Vivien, F., Träff, J. L., Cannataro, M., Danelutto, M., Hast, A., Perla, F., Knüpfer, A., Martino, B. Di, Alexander, M. (Ed.): Euro-Par 2010 Workshops, Proc. of the 1st Workshop on High Performance Bioinformatics and Biomedicine (HiBB), pp. 167–175, Springer, Ischia, Italy, 2011.

Abstract | Links | BibTeX | Tags: bioinformatics

Marco Aldinucci, Maurizio Drocco, Daniela Giordano, Concetto Spampinato, Massimo Torquati

A Parallel Edge Preserving Algorithm for Salt and Pepper Image Denoising Technical Report

Università degli Studi di Torino, Dip. di Informatica, Italy no. 138/2011, 2011.

Links | BibTeX | Tags: fastflow

Marco Aldinucci, Salvatore Ruggieri, Massimo Torquati

Porting Decision Tree Building and Pruning Algorithms to Multicore using FastFlow Technical Report

Università di Pisa, Dipartimento di Informatica, Italy no. TR-11-06, 2011.

Links | BibTeX | Tags: fastflow

Marco Aldinucci, Mario Coppo, Ferruccio Damiani, Maurizio Drocco, Massimo Torquati, Angelo Troina

On Designing Multicore-Aware Simulators for Biological Systems Proceedings Article

In: Cotronis, Yiannis, Danelutto, Marco, Papadopoulos, George Angelos (Ed.): Proc. of 19th Euromicro Intl. Conference on Parallel Distributed and network-based Processing (PDP), pp. 318–325, IEEE, Ayia Napa, Cyprus, 2011.

Abstract | Links | BibTeX | Tags: fastflow

Mario Coppo, Ferruccio Damiani, Maurizio Drocco, Elena Grassi, Mike Guether, Angelo Troina

Modelling Ammonium Transporters in Arbuscular Mycorrhiza Symbiosis Journal Article

In: Transactions on Computational Systems Biology, vol. 6575, no. 13, pp. 85–109, 2011.

Abstract | Links | BibTeX | Tags: semantics

2010

Marco Aldinucci, Mario Coppo, Ferruccio Damiani, Maurizio Drocco, Massimo Torquati, Angelo Troina

On Designing Multicore-Aware Simulators for Biological Systems Technical Report

Università degli Studi di Torino, Dipartimento di Informatica, Italy no. 131/2010, 2010.

BibTeX | Tags: fastflow

Marco Aldinucci, Salvatore Ruggieri, Massimo Torquati

Porting Decision Tree Algorithms to Multicore using FastFlow Proceedings Article

In: Balcázar, José L., Bonchi, Francesco, Gionis, Aristides, Sebag, Michèle (Ed.): Proc. of European Conference in Machine Learning and Knowledge Discovery in Databases (ECML PKDD), pp. 7–23, Springer, Barcelona, Spain, 2010.

Abstract | Links | BibTeX | Tags: fastflow

Mario Coppo, Ferruccio Damiani, Maurizio Drocco, Elena Grassi, Eva Sciacca, Salvatore Spinella, Angelo Troina

Hybrid Calculus of Wrapped Compartments Proceedings Article

In: Ciobanu, Gabriel, Koutny, Maciej (Ed.): Proc. of 4th Workshop on Membrane Computing and Biologically Inspired Process Calculi (MeCBIC), pp. 102–120, Jena, Germany, 2010.

Abstract | BibTeX | Tags:

@inproceedings{DBLP:journals/corr/abs-1011-0494,

title = {Hybrid Calculus of Wrapped Compartments},

author = {Mario Coppo and Ferruccio Damiani and Maurizio Drocco and Elena Grassi and Eva Sciacca and Salvatore Spinella and Angelo Troina},

editor = {Gabriel Ciobanu and Maciej Koutny},

year  = {2010},

date = {2010-08-01},

booktitle = {Proc. of 4th Workshop on Membrane Computing and Biologically Inspired Process Calculi (MeCBIC)},

volume = {40},

pages = {102–120},

address = {Jena, Germany},

series = {EPTCS},

abstract = {The modelling and analysis of biological systems has deep roots in Mathematics, specifically in the field of ordinary differential equations (ODEs). Alternative approaches based on formal calculi, often derived from process algebras or term rewriting systems, provide a quite complementary way to analyze the behaviour of biological systems. These calculi allow to cope in a natural way with notions like compartments and membranes, which are not easy (sometimes impossible) to handle with purely numerical approaches, and are often based on stochastic simulation methods. Recently, it has also become evident that stochastic effects in regulatory networks play a crucial role in the analysis of such systems. Actually, in many situations it is necessary to use stochastic models. For example when the system to be described is based on the interaction of few molecules, when we are at the presence of a chemical instability, or when we want to simulate the functioning of a pool of entities whose compartmentalised structure evolves dynamically. In contrast, stable metabolic networks, involving a large number of reagents, for which the computational cost of a stochastic simulation becomes an insurmountable obstacle, are efficiently modelled with ODEs. In this paper we define a hybrid simulation method, combining the stochastic approach with ODEs, for systems described in CWC, a calculus on which we can express the compartmentalisation of a biological system whose evolution is defined by a set of rewrite rules.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Marco Aldinucci, Marco Danelutto, Peter Kilpatrick

Autonomic Management of Multiple Non-Functional Concerns in Behavioural Skeletons Book Section

In: Desprez, Frédéric, Getov, Vladimir, Priol, Thierry, Yahyapour, Ramin (Ed.): Grids, P2P and Services Computing, pp. 89–103, Springer, 2010.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Andrea Bracciali, Pietro Liò, Anil Sorathiya, Massimo Torquati

StochKit-FF: Efficient Systems Biology on Multicore Architectures Technical Report

Università di Pisa, Dipartimento di Informatica, Italy no. TR-10-12, 2010.

Abstract | Links | BibTeX | Tags: fastflow

Marco Aldinucci, Andrea Bracciali, Pietro Liò

Formal Synthetic Immunology Journal Article

In: ERCIM News, vol. 82, pp. 40–41, 2010, ISSN: 0926-4981.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow

Marco Aldinucci, Salvatore Ruggieri, Massimo Torquati

Porting Decision Tree Algorithms to Multicore using FastFlow Technical Report

Università di Pisa, Dipartimento di Informatica, Italy no. TR-10-11, 2010.

Abstract | Links | BibTeX | Tags: fastflow

Mario Coppo, Ferruccio Damiani, Maurizio Drocco, Elena Grassi, Angelo Troina

Stochastic Calculus of Wrapped Compartments Proceedings Article

In: Pierro, Alessandra Di, Norman, Gethin (Ed.): Proc. of 8th Workshop on Quantitative Aspects of Programming Languages (QAPL), pp. 82–98, Paphos, Cyprus, 2010.

Abstract | BibTeX | Tags:

Marco Aldinucci, Massimiliano Meneghin, Massimo Torquati

Efficient Smith-Waterman on multi-core with FastFlow Proceedings Article

In: Danelutto, Marco, Gross, Tom, Bourgeois, Julien (Ed.): Proc. of Intl. Euromicro PDP 2010: Parallel Distributed and network-based Processing, pp. 195–199, IEEE, Pisa, Italy, 2010.

Abstract | Links | BibTeX | Tags: fastflow

Marco Aldinucci, Marco Danelutto, Peter Kilpatrick, Massimiliano Meneghin, Massimo Torquati

Accelerating sequential programs using FastFlow and self-offloading Technical Report

Università di Pisa, Dipartimento di Informatica, Italy no. TR-10-03, 2010.

Abstract | BibTeX | Tags: fastflow

CWC Simulator project Technical Manual

Sourceforge website 2010, (urlhttp://sourceforge.net/projects/cwcsimulator/).

BibTeX | Tags:

Marco Aldinucci

Efficient Parallel MonteCarlo with FastFlow Book Section

In: HPC-Europa2: Science and Supercomputing in Europe, research highlights 2010, Cineca, 2010.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow

Thomas Weigold, Marco Aldinucci, Marco Danelutto, Vladimir Getov

Integrating Autonomic Grid Components and Process-Driven Business Applications Proceedings Article

In: Vasilakos, Athanasios V., Beraldi, Roberto, Friedman, Roy, Mamei, Marco (Ed.): Autonomic Computing and Communications Systems Third International ICST Conference, Autonomics 2009, Limassol, Cyprus, September 9-11, 2009, Revised Selected Papers, pp. 98–113, Springer, Limassol, Cyprus, 2010, ISSN: 1867-8211.

Abstract | Links | BibTeX | Tags:

@inproceedings{ibm:autonomics:09,

title = {Integrating Autonomic Grid Components and Process-Driven Business Applications},

author = {Thomas Weigold and Marco Aldinucci and Marco Danelutto and Vladimir Getov},

editor = {Athanasios V. Vasilakos and Roberto Beraldi and Roy Friedman and Marco Mamei},

url = {http://calvados.di.unipi.it/storage/paper_files/2010_BS_autonomics09.pdf},

doi = {10.1007/978-3-642-11482-3_7},

issn = {1867-8211},

year  = {2010},

date = {2010-01-01},

booktitle = {Autonomic Computing and Communications Systems Third International ICST Conference, Autonomics 2009, Limassol, Cyprus, September 9-11, 2009, Revised Selected Papers},

volume = {23},

pages = {98–113},

publisher = {Springer},

address = {Limassol, Cyprus},

series = {Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering (LNICST)},

abstract = {Today's business applications are increasingly process driven, meaning that the main application logic is executed by a dedicate process engine. In addition, component-oriented software development has been attracting attention for building complex distributed applications. In this paper we present the experiences gained from building a process-driven biometric identification application which makes use of Grid infrastructures via the Grid Component Model (GCM). GCM, besides guaranteeing access to Grid resources, supports autonomic management of notable parallel composite components. This feature is exploited within our biometric identification application to ensure real time identification of fingerprints. Therefore, we briefly introduce the GCM framework and the process engine used, and we describe the implementation of the application using autonomic GCM components. Finally, we summarize the results, experiences, and lessons learned focusing on the integration of autonomic GCM components and the process-driven approach.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Marco Aldinucci, Marco Danelutto, Peter Kilpatrick

Skeletons for multi/many-core systems Proceedings Article

In: Chapman, Barbara, Desprez, Frédéric, Joubert, Gerhard R., Lichnewsky, Alain, Peters, Frans, Priol, Thierry (Ed.): Parallel Computing: From Multicores and GPU's to Petascale (Proc. of PARCO 2009, Lyon, France), pp. 265–272, IOS press, Lyon, France, 2010.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Marco Danelutto, Massimiliano Meneghin, Massimo Torquati, Peter Kilpatrick

Efficient streaming applications on multi-core with FastFlow: The biosequence alignment test-bed Book Chapter

In: vol. 19, pp. 273–280, Elsevier, 2010.

Abstract | Links | BibTeX | Tags: fastflow

2009

Marco Aldinucci, Massimo Torquati, Massimiliano Meneghin

FastFlow: Efficient Parallel Streaming Applications on Multi-core Technical Report

Università di Pisa, Dipartimento di Informatica, Italy no. TR-09-12, 2009.

Abstract | Links | BibTeX | Tags: fastflow

Marco Aldinucci, Hinde Lilia Bouziane, Marco Danelutto, Christian Pérez

STKM on SCA: a Unified Framework with Components, Workflows and Algorithmic Skeletons Proceedings Article

In: Proc. of 15th Intl. Euro-Par 2009 Parallel Processing, pp. 678–690, Springer, Delft, The Netherlands, 2009.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Marco Danelutto, Peter Kilpatrick

Autonomic management of multiple non-functional concerns in behavioural skeletons Technical Report

Università di Pisa, Dipartimento di Informatica, Italy no. TR-09-10, 2009.

Links | BibTeX | Tags:

Marco Aldinucci, Marco Danelutto, Peter Kilpatrick

Autonomic management of non-functional concerns in distributed and parallel application programming Proceedings Article

In: Proc. of Intl. Parallel & Distributed Processing Symposium (IPDPS), pp. 1–12, IEEE, Rome, Italy, 2009.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Marco Danelutto, Peter Kilpatrick

Co-design of distributed systems using skeletons and autonomic management abstractions Proceedings Article

In: César, E., Alexander, M., Streit, A., Träff, J. L., Cérin, C., Knüpfer, A., Kranzlmüller, D., Jha, S. (Ed.): Euro-Par 2008 Workshops - Parallel Processing, Selected Papers, pp. 403–414, Springer, Las Palmas, Spain, 2009, ISBN: 978-3-642-00954-9.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Marco Danelutto, Peter Kilpatrick

Towards hierarchical management of autonomic components: a case study Proceedings Article

In: Baz, Didier El, Gross, Tom, Spies, Francois (Ed.): Proc. of Intl. Euromicro PDP 2009: Parallel Distributed and network-based Processing, pp. 3–10, IEEE, Weimar, Germany, 2009.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Massimo Torquati

FastFlow website Technical Manual

2009, (urlhttp://mc-fastflow.sourceforge.net/).

BibTeX | Tags:

Marco Aldinucci, Sonia Campa, Patrizio Dazzi, Nicola Tonellotto

GridComp website Technical Manual

2009, (urlhttp://gridcomp.ercim.org/).

Links | BibTeX | Tags:

Marco Aldinucci, Marco Danelutto, Peter Kilpatrick

Semi-formal models to support program development: autonomic management within component based parallel and distributed programming Proceedings Article

In: Boer, Frank S., Bonsangue, Marcello M., Madelaine, Eric (Ed.): Formal Methods for Components and Objects: 7th Intl. Symposium, FMCO 2008, Sophia-Antipolis, France, October 20 - 24, 2008, Revised Lectures, pp. 204–225, Springer, 2009.

Abstract | Links | BibTeX | Tags:

2008

Marco Aldinucci, Gabriel Antoniu, Marco Danelutto, Mathieu Jan

Fault-Tolerant Data Sharing for High-level Grid Programming: A Hierarchical Storage Architecture Book Section

In: Bubak, Marian, Gorlatch, Sergei, Priol, Thierry (Ed.): Achievements in European Research on Grid Systems, pp. 67–81, Springer, Kraków, Poland, 2008, ISBN: 978-0-387-72811-7.

Abstract | Links | BibTeX | Tags: parallel

Marco Aldinucci, Marco Danelutto, Hinde Lilia Bouziane, Christian Pérez

Towards Software Component Assembly Language Enhanced with Workflows and Skeletons Proceedings Article

In: Proc. of the ACM SIGPLAN Component-Based High Performance Computing (CBHPC), pp. 1–11, ACM, Karlsruhe, Germany, 2008, ISBN: 978-1-60558-311-2.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Marco Danelutto, Hinde Lilia Bouziane, Christian Pérez

Towards a Spatio-Temporal sKeleton Model implementation on top of SCA Technical Report

Institute on Programming Model, CoreGRID - Network of Excellence no. TR-0171, 2008.

Links | BibTeX | Tags:

Marco Aldinucci, Marco Danelutto

Securing skeletal systems with limited performance penalty: the Muskel experience Journal Article

In: Journal of Systems Architecture, vol. 54, no. 9, pp. 868–876, 2008.

Abstract | Links | BibTeX | Tags: parallel

Marco Aldinucci, Sonia Campa, Marco Danelutto, Patrizio Dazzi, Peter Kilpatrick, Domenico Laforenza, Nicola Tonellotto

Behavioural skeletons for component autonomic management on grids Book Section

In: Danelutto, Marco, Frangopoulou, Paraskevi, Getov, Vladimir (Ed.): Making Grids Work, pp. 3–16, Springer, 2008, ISBN: 978-0-387-78447-2.

Abstract | Links | BibTeX | Tags: parallel

Marco Aldinucci, Marco Danelutto, Giorgio Zoppi, Peter Kilpatrick

Advances in Autonomic Components & Services Proceedings Article

In: Priol, Thierry, Vanneschi, Marco (Ed.): From Grids To Service and Pervasive Computing (Proc. of the CoreGRID Symposium 2008), pp. 3–18, Springer, Las Palmas, Spain, 2008, ISBN: 978-0-387-09454-0.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Emilio Tuosto

Towards a Formal Semantics for Autonomic Components Proceedings Article

In: Priol, Thierry, Vanneschi, Marco (Ed.): From Grids To Service and Pervasive Computing (Proc. of the CoreGRID Symposium 2008), pp. 31–45, Springer, Las Palmas, Spain, 2008, ISBN: 978-0-387-09454-0.

Abstract | Links | BibTeX | Tags:

Ana-Maria Oprescu, Thilo Kielmann, Marco Danelutto, Marco Aldinucci

Autonomic Behavior of Grid Applications using Component Platforms Technical Report

Institute on Programming Model, CoreGRID - Network of Excellence no. TR-0156, 2008.

Links | BibTeX | Tags:

Marco Aldinucci, Marco Danelutto, Hinde Lilia Bouziane, Christian Pérez

Towards Software Component Assembly Language Enhanced with Workflows and Skeletons Technical Report

Institute on Programming Model, CoreGRID - Network of Excellence no. TR-0153, 2008.

Links | BibTeX | Tags:

Marco Aldinucci, Sonia Campa, Massimo Coppola, Marco Danelutto, Giorgio Zoppi, Alessandro Basso, Alexander Bolotov, Francoise Baude, Hinde Lilia Bouziane, Denis Caromel, Ludovic Henrio, Christian Pérez, Jose Cunha, Classen Michael, Philipp Classen, Christian Lengauer, J. Cohen, S. Mc Gough, Natalia Currle-Linde, Patrizio Dazzi, Nicola Tonellotto, Jan Dünnweber, Sergei Gorlatch, Peter Kilpatrick, Nadia Ranaldo, Eugenio Zimeo

Proceedings of the Programming Model Institute Technical meeting 2008 Technical Report

Institute of Programming Model, CoreGRID - Network of Excellence no. TR-0138, 2008.

Links | BibTeX | Tags:

Marco Aldinucci, Marco Danelutto, Peter Kilpatrick, Patrizio Dazzi

From Orc Models to Distributed Grid Java code Proceedings Article

In: Gorlatch, Sergei, Fragopoulou, Paraskevi, Priol, Thierry (Ed.): Proc. of the Integrated Research in Grid Computing Workshop, pp. 2–13, Hersonissos, Crete, Greece, 2008.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Emilio Tuosto

Toward a Formal Semantics for Autonomic Components Technical Report

Università di Pisa, Dipartimento di Informatica, Italy no. TR-08-08, 2008.

Links | BibTeX | Tags:

Marco Aldinucci, Anne Benoit

Automatic mapping of ASSIST applications using process algebra Journal Article

In: Parallel Processing Letters, vol. 18, no. 1, pp. 175–188, 2008, ISSN: 0129-6264.

Abstract | Links | BibTeX | Tags: parallel

Marco Aldinucci, Marco Danelutto, Peter Kilpatrick

Hierarchical autonomic management: a case study with skeletal systems Technical Report

Institute on Programming Model, CoreGRID - Network of Excellence no. TR-0127, 2008.

Links | BibTeX | Tags:

Marco Aldinucci, Sonia Campa, Marco Danelutto, Marco Vanneschi, Patrizio Dazzi, Domenico Laforenza, Nicola Tonellotto, Peter Kilpatrick

Behavioural skeletons in GCM: autonomic management of grid components Proceedings Article

In: Baz, Didier El, Bourgeois, Julien, Spies, Francois (Ed.): Proc. of Intl. Euromicro PDP 2008: Parallel Distributed and network-based Processing, pp. 54–63, IEEE, Toulouse, France, 2008.

Abstract | Links | BibTeX | Tags:

@inproceedings{orc:pdp:08,

title = {Behavioural skeletons in GCM: autonomic management of grid components},

author = {Marco Aldinucci and Sonia Campa and Marco Danelutto and Marco Vanneschi and Patrizio Dazzi and Domenico Laforenza and Nicola Tonellotto and Peter Kilpatrick},

editor = {Didier El Baz and Julien Bourgeois and Francois Spies},

url = {http://calvados.di.unipi.it/storage/paper_files/2008_orc_PDP.pdf},

doi = {10.1109/PDP.2008.46},

year  = {2008},

date = {2008-02-01},

booktitle = {Proc. of Intl. Euromicro PDP 2008: Parallel Distributed and network-based Processing},

pages = {54–63},

publisher = {IEEE},

address = {Toulouse, France},

abstract = {Autonomic management can be used to improve the QoS provided by parallel/distributed applications. We discuss behavioural skeletons introduced in earlier work: rather than relying on programmer ability to design "from scratch" efficient autonomic policies, we encapsulate general autonomic controller features into algorithmic skeletons. Then we leave to the programmer the duty of specifying the parameters needed to specialise the skeletons to the needs of the particular application at hand. This results in the programmer having the ability to fast prototype and tune distributed/parallel applications with non-trivial autonomic management capabilities. We discuss how behavioural skeletons have been implemented in the framework of GCM (the grid component model developed within the CoreGRID NoE and currently being implemented within the GridCOMP STREP project). We present results evaluating the overhead introduced by autonomic management activities as well as the overall behaviour of the skeletons. We also present results achieved with a long running application subject to autonomic management and dynamically adapting to changing features of the target architecture. Overall the results demonstrate both the feasibility of implementing autonomic control via behavioural skeletons and the effectiveness of our sample behavioural skeletons in managing the "functional replication" pattern(s).},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Marco Aldinucci, Massimo Torquati, Marco Vanneschi, Pierfrancesco Zuccato

The VirtuaLinux Storage Abstraction Layer for Efficient Virtual Clustering Proceedings Article

In: Baz, Didier El, Bourgeois, Julien, Spies, Francois (Ed.): Proc. of Intl. Euromicro PDP 2008: Parallel Distributed and network-based Processing, pp. 619–627, IEEE, Toulouse, France, 2008.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Marco Danelutto, Peter Kilpatrick, Patrizio Dazzi

From Orc Models to Distributed Grid Java code Book Section

In: Gorlatch, Sergei, Fragopoulou, Paraskevi, Priol, Thierry (Ed.): Grid Computing: Achievements and Prospects, pp. 13–24, Springer, 2008, ISBN: 978-0-387-09456-4.

Abstract | Links | BibTeX | Tags: parallel

Marco Aldinucci, Marco Danelutto, Peter Kilpatrick

A framework for prototyping and reasoning about grid systems Proceedings Article

In: Bischof, C., Bücker, M., Gibbon, P., Joubert, G. R., Lippert, T., Mohr, B., Peters, F. J. (Ed.): Parallel Computing: Architectures, Algorithms and Applications, pp. 235–242, IOS press, Germany, 2008, ISBN: 9781586037963.

Abstract | Links | BibTeX | Tags:

Carlo Bertolli, Romano Fantacci, Gabriele Mencagli, Daniele Tarchi, Marco Vanneschi

Next generation grids and wireless communication networks: towards a novel integrated approach Journal Article

In: Wireless Communications and Mobile Computing, 2008, (To appear).

BibTeX | Tags:

Marco Aldinucci, Marco Danelutto, Massimo Torquati, Francesco Polzella, Gianmarco Spinatelli, Marco Vanneschi, Alessandro Gervaso, Manuel Cacitti, Pierfrancesco Zuccato

VirtuaLinux: virtualized high-density clusters with no single point of failure Proceedings Article

In: Bischof, C., Bücker, M., Gibbon, P., Joubert, G. R., Lippert, T., Mohr, B., Peters, F. J. (Ed.): Parallel Computing: Architectures, Algorithms and Applications, pp. 355–362, IOS press, The Netherlands, 2008.

Abstract | Links | BibTeX | Tags:

2007

Marco Aldinucci, Marco Danelutto, Patrizio Dazzi

MUSKEL: an expandable skeleton environment Journal Article

In: Scalable Computing: Practice and Experience, vol. 8, no. 4, pp. 325–341, 2007.

Abstract | Links | BibTeX | Tags: parallel

Marco Danelutto, Marcelo Pasin, Marco Vanneschi, Patrizio Dazzi, Luigi Presti, Domenico Laforenza

PAL: Exploiting Java Annotations for Parallelism Book Section

In: Bubak, Marian, Gorlatch, Sergei, Priol, Thierry (Ed.): Achievements in European Research on Grid Systems, pp. 83–96, Springer, Kraków, Poland, 2007.

BibTeX | Tags:

Carlo Bertolli, Massimo Coppola, Corrado Zoccolo

The co-replication methodology and its application to structured parallel programs Proceedings Article

In: CompFrame '07: Proc. of the 2007 symposium on Component and framework technology in high-performance and scientific computing, pp. 39–48, ACM, Montreal, Quebec, Canada, 2007, ISBN: 978-1-59593-867-1.

Links | BibTeX | Tags:

Marco Aldinucci, Marco Danelutto

Skeleton based parallel programming: functional and parallel semantic in a single shot Journal Article

In: Computer Languages, Systems and Structures, vol. 33, no. 3-4, pp. 179–192, 2007.

Abstract | Links | BibTeX | Tags: parallel

Thierry Priol, Marco Vanneschi (Ed.)

Toward Next Generation Grids, Proc. of the CorGRID Symposium 2007 Proceedings

Springer, Rennes, France, 2007.

BibTeX | Tags:

Massimo Coppola, Domenico Laforenza, Nicola Tonellotto, Marco Danelutto, Marco Vanneschi, Corrado Zoccolo

Managing User Expectation with Component Performance Contracts Proceedings Article

In: Proc. of the Workshop on Usage of Service Level Agreements in Grids, Springer, Austin, TX, USA, 2007.

BibTeX | Tags:

Marco Aldinucci, Marco Danelutto, Peter Kilpatrick

Adding metadata to Orc to support reasoning about grid programming Proceedings Article

In: Priol, Thierry, Vanneschi, Marco (Ed.): Towards Next Generation Grids (Proc. of the CoreGRID Symposium 2007), pp. 205–214, Springer, Rennes, France, 2007, ISBN: 978-0-387-72497-3.

Abstract | Links | BibTeX | Tags:

Marco Danelutto, Marco Aldinucci, Peter Kilpatrick

Prototyping and reasoning about distributed systems: an Orc based framework Technical Report

Institute on Programming Model, CoreGRID - Network of Excellence no. TR-0102, 2007.

Links | BibTeX | Tags:

Marco Aldinucci, Marco Danelutto, Peter Kilpatrick

Management in distributed systems: a semi-formal approach Proceedings Article

In: Kermarrec, A. -M., Bougé, L., Priol, T. (Ed.): Proc. of 13th Intl. Euro-Par 2007 Parallel Processing, pp. 651–661, Springer, Rennes, France, 2007, ISBN: 978-3-540-74465-8.

Abstract | Links | BibTeX | Tags: parallel, semantics

Marco Aldinucci, Sonia Campa, Marco Danelutto, Patrizio Dazzi, Peter Kilpatrick, Domenico Laforenza, Nicola Tonellotto

Behavioural skeletons for component autonomic management on grids Proceedings Article

In: CoreGRID Workshop on Grid Programming Model, Grid and P2P Systems Architecture, Grid Systems, Tools and Environments, Heraklion, Crete, Greece, 2007.

Abstract | Links | BibTeX | Tags: parallel

Marco Aldinucci, Pierfrancesco Zuccato

Virtual clusters with no single point of failure Proceedings Article

In: Intl. Supercomputing Conference (ISC2007), Poster session, Dresden, Germany, 2007.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Massimo Torquati, Marco Vanneschi, Manuel Cacitti, Alessandro Gervaso, Pierfrancesco Zuccato

VirtuaLinux Design Principles Technical Report

Università di Pisa, Dipartimento di Informatica, Italy no. TR-07-13, 2007.

Links | BibTeX | Tags:

Cristian Dittamo, Antonio Cisternino, Marco Danelutto

Parallelization of C# Programs Through Annotations Proceedings Article

In: Proc. of Practical Aspects of High-Level Parallel Programming Workshop (PAPP, co-located with ICCS 2007), pp. 585–592, Springer, Beijing, China, 2007.

Links | BibTeX | Tags:

Marco Aldinucci, Sonia Campa, Marco Danelutto, Peter Kilpatrick, Patrizio Dazzi, Domenico Laforenza, Nicola Tonellotto

Behavioural skeletons for component autonomic management on grids Technical Report

Università di Pisa, Dipartimento di Informatica no. TR-07-12, 2007.

Links | BibTeX | Tags:

Marco Aldinucci, Marco Danelutto, Peter Kilpatrick

Orc + metadata supporting grid programming Technical Report

Università di Pisa, Dipartimento di Informatica no. TR-07-10, 2007.

Links | BibTeX | Tags:

Peter Kilpatrick, Marco Danelutto, Marco Aldinucci

Deriving Grid Applications from Abstract Models Technical Report

Institute on Programming Model, CoreGRID - Network of Excellence no. TR-0085, 2007.

Links | BibTeX | Tags:

Marco Aldinucci, Marco Danelutto, Peter Kilpatrick

Management in distributed systems: a semi-formal approach Technical Report

Università di Pisa, Dipartimento di Informatica no. TR-07-05, 2007.

Links | BibTeX | Tags:

Marco Aldinucci, Marco Danelutto

The cost of security in skeletal systems Proceedings Article

In: D'Ambra, Pasqua, Guarracino, Mario Rosario (Ed.): Proc. of Intl. Euromicro PDP 2007: Parallel Distributed and network-based Processing, pp. 213–220, IEEE, Napoli, Italia, 2007.

Abstract | Links | BibTeX | Tags:

Nicola Tonellotto, Domenico Laforenza, Marco Danelutto, Marco Vanneschi, Corrado Zoccolo

A Performance Model for Stream-Based Computations Proceedings Article

In: D'Ambra, Pasqua, Guarracino, Mario Rosario (Ed.): Proc. of Intl. Euromicro PDP 2007: Parallel Distributed and network-based Processing, pp. 91–96, IEEE, Napoli, Italia, 2007.

BibTeX | Tags:

Marco Aldinucci, Sonia Campa, Massimo Coppola, Marco Danelutto, Corrado Zoccolo, Francoise André, Jérémy Buisson

An abstract schema modeling adaptivity management Book Section

In: Gorlatch, Sergei, Danelutto, Marco (Ed.): Integrated Research in Grid Computing, pp. 89–102, Springer, 2007, ISBN: 978-0-387-47656-8.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Anne Benoit

Towards the Automatic Mapping of ASSIST Applications for the Grid Book Section

In: Gorlatch, Sergei, Danelutto, Marco (Ed.): Integrated Research in Grid Computing, pp. 73–87, Springer, 2007, ISBN: 978-0-387-47656-8.

Abstract | Links | BibTeX | Tags: parallel

Jan Dünnweber, Sergei Gorlatch, Sonia Campa, Marco Aldinucci, Marco Danelutto

Adaptable Parallel Components for Grid Programming Book Section

In: Gorlatch, Sergei, Danelutto, Marco (Ed.): Integrated Research in Grid Computing, pp. 43–57, Springer, 2007, ISBN: 978-0-387-47656-8.

Abstract | Links | BibTeX | Tags:

Massimo Coppola, Marco Danelutto, Nicola Tonellotto, Marco Vanneschi, Corrado Zoccolo

Execution Support of High Performance Heterogeneous Component-Based Applications on the Grid Proceedings Article

In: Lehner, Wolfgang, Meyer, Norbert, Streit, Achim, Stewart, Craig (Ed.): in Proc. of Euro-Par 2006 Workshops: Parallel Processing, CoreGRID 2006, UNICORE Summit 2006, Petascale Computational Biology and Bioinformatics, Dresden, Germany, August 29-September 1, 2006, Revised Selected Papers, pp. 171–185, Springer, 2007.

BibTeX | Tags:

Marco Vanneschi, Luca Veraldi

Dynamicity in distributed applications: issues, problems and the ASSIST approach Journal Article

In: Parallel Computing, vol. 33, no. 12, pp. 822–845, 2007, ISSN: 0167-8191.

Links | BibTeX | Tags:

Marcelo Pasin, Pierre Kuonen, Marco Danelutto, Marco Aldinucci

Skeleton Parallel Programming and Parallel Objects Book Section

In: Gorlatch, Sergei, Danelutto, Marco (Ed.): Integrated Research in Grid Computing, pp. 59–71, Springer, 2007, ISBN: 978-0-387-47656-8.

Abstract | Links | BibTeX | Tags: parallel

Marco Aldinucci, Massimo Torquati, Pierfrancesco Zuccato

Virtualinux website Technical Manual

2007, (urlhttp://virtualinux.sourceforge.net/).

BibTeX | Tags:

2006

Marco Aldinucci, Francoise André, Jérémy Buisson, Sonia Campa, Massimo Coppola, Marco Danelutto, Corrado Zoccolo

Parallel program/component adaptivity management Proceedings Article

In: Joubert, G. R., Nagel, W. E., Peters, F. J., Plata, O., Tirado, P., Zapata, E. (Ed.): Parallel Computing: Current & Future Issues of High-End Computing (Proc. of PARCO 2005, Malaga, Spain), pp. 89–96, John von Neumann Institute for Computing, Germany, 2006.

Abstract | Links | BibTeX | Tags: parallel

Marco Aldinucci, Marco Danelutto, Gianni Giaccherini, Massimo Torquati, Marco Vanneschi

Towards a distributed scalable data service for the grid Proceedings Article

In: Joubert, G. R., Nagel, W. E., Peters, F. J., Plata, O., Tirado, P., Zapata, E. (Ed.): Parallel Computing: Current & Future Issues of High-End Computing (Proc. of PARCO 2005, Malaga, Spain), pp. 73–80, John von Neumann Institute for Computing, Germany, 2006.

Abstract | Links | BibTeX | Tags: parallel

Marco Aldinucci, Marco Danelutto, Andrea Paternesi, Roberto Ravazzolo, Marco Vanneschi

Building interoperable grid-aware ASSIST applications via WebServices Proceedings Article

In: Joubert, G. R., Nagel, W. E., Peters, F. J., Plata, O., Tirado, P., Zapata, E. (Ed.): Parallel Computing: Current & Future Issues of High-End Computing (Proc. of PARCO 2005, Malaga, Spain), pp. 145–152, John von Neumann Institute for Computing, Germany, 2006, ISBN: 3000173528.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Gabriel Antoniu, Marco Danelutto, Mathieu Jan

Fault-Tolerant Data Sharing for High-level Grid Programming: A Hierarchical Storage Architecture Proceedings Article

In: Bubak, Marian, Gorlatch, Sergei, Priol, Thierry (Ed.): Proc. of the Integrated Research in Grid Computing Workshop, pp. 177–188, Academic Computing Centre CYFRONET AGH, Kraków, Poland, 2006.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Gabriel Antoniu, Marco Danelutto, Mathieu Jan

Fault-Tolerant Data Sharing for High-level Grid Programming: A Hierarchical Storage Architecture Technical Report

Institute on Programming Model, CoreGRID - Network of Excellence no. TR-0058, 2006.

Links | BibTeX | Tags:

Marco Aldinucci, Carlo Bertolli, Sonia Campa, Massimo Coppola, Marco Vanneschi, Luca Veraldi, Corrado Zoccolo

Self-Configuring and Self-Optimising Grid Components in the GCM model and their ASSIST Implementation Technical Report

Università di Pisa, Dipartimento di Informatica, Italy no. TR-06-13, 2006.

Links | BibTeX | Tags:

Marco Aldinucci, Carlo Bertolli, Sonia Campa, Massimo Coppola, Marco Vanneschi, Luca Veraldi, Corrado Zoccolo

Self-configuring and self-optimizing grid components in the GCM model and their ASSIST implementation Proceedings Article

In: Proc. of. HPC-GECO/Compframe (held in conjunction with HPDC-15), pp. 45–52, IEEE, Paris, France, 2006.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Marco Danelutto, Marco Vanneschi

Autonomic QoS in ASSIST Grid-aware components Proceedings Article

In: Proc. of Intl. Euromicro PDP 2006: Parallel Distributed and network-based Processing, pp. 221–230, IEEE, Montbéliard, France, 2006.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Marco Danelutto

The cost of security in skeletal systems Technical Report

Università di Pisa, Dipartimento di Informatica, Italy no. TR-06-03, 2006.

Links | BibTeX | Tags:

Marco Aldinucci, Marco Danelutto

Algorithmic skeletons meeting grids Journal Article

In: Parallel Computing, vol. 32, no. 7, pp. 449–462, 2006.

Abstract | Links | BibTeX | Tags: parallel

Marco Aldinucci, Massimo Coppola, Marco Danelutto, Nicola Tonellotto, Marco Vanneschi, Corrado Zoccolo

High level grid programming with ASSIST Journal Article

In: Computational Methods in Science and Technology, vol. 12, no. 1, pp. 21–32, 2006.

Abstract | Links | BibTeX | Tags: parallel

Marco Aldinucci, Massimo Coppola, Marco Danelutto, Marco Vanneschi, Corrado Zoccolo

ASSIST as a research framework for high-performance Grid programming environments Book Section

In: Cunha, J. C., Rana, O. F. (Ed.): Grid Computing: Software environments and Tools, pp. 230–256, Springer, 2006, ISBN: 978-1-85233-998-2.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Massimo Coppola, Sonia Campa, Marco Danelutto, Marco Vanneschi, Corrado Zoccolo

Structured implementation of component based grid programming environments Book Section

In: Getov, Vladimir, Laforenza, Domenico, Reinefeld, Alexander (Ed.): Future Generation Grids, pp. 217–239, Springer, 2006, ISBN: 978-0-387-27935-0.

Abstract | Links | BibTeX | Tags:

2005

Marco Aldinucci, Marco Danelutto, Andrea Paternesi, Roberto Ravazzolo, Marco Vanneschi

Building Interoperable Grid-aware ASSIST Applications via Web Services Technical Report

Università di Pisa, Dipartimento di Informatica, Italy no. TR-05-24, 2005.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Francoise André, Jérémy Buisson, Sonia Campa, Massimo Coppola, Marco Danelutto, Corrado Zoccolo

Parallel program/component adaptivity management Proceedings Article

In: Gorlatch, Sergei, Danelutto, Marco (Ed.): Proc. of the Integrated Research in Grid Computing Workshop, pp. 95–104, Università di Pisa, Dipartimento di Informatica, Pisa, Italy, 2005.

Abstract | Links | BibTeX | Tags: parallel

Marco Aldinucci, Anne Benoit

Towards the Automatic Mapping of ASSIST Applications for the Grid Proceedings Article

In: Gorlatch, Sergei, Danelutto, Marco (Ed.): Proc. of the Integrated Research in Grid Computing Workshop, pp. 59–68, Università di Pisa, Dipartimento di Informatica, Pisa, Italy, 2005.

Abstract | Links | BibTeX | Tags:

Jan Dünnweber, Sergei Gorlatch, Sonia Campa, Marco Aldinucci, Marco Danelutto

Using Code Parameters for Component Adaptations Proceedings Article

In: Gorlatch, Sergei, Danelutto, Marco (Ed.): Proc. of the Integrated Research in Grid Computing Workshop, pp. 49–57, Università di Pisa, Dipartimento di Informatica, Pisa, Italy, 2005.

Abstract | Links | BibTeX | Tags:

Marcelo Pasin, Pierre Kuonen, Marco Danelutto, Marco Aldinucci

Skeleton Parallel Programming and Parallel Objects Proceedings Article

In: Gorlatch, Sergei, Danelutto, Marco (Ed.): Proc. of the Integrated Research in Grid Computing Workshop, pp. 115–124, Università di Pisa, Dipartimento di Informatica, Pisa, Italy, 2005.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Marco Danelutto, Gianni Giaccherini, Massimo Torquati, Marco Vanneschi

Towards a distributed scalable data service for the Grid Technical Report

Università di Pisa, Dipartimento di Informatica, Italy no. TR-05-21, 2005.

Links | BibTeX | Tags:

Marco Aldinucci, Anne Benoit

Automatic mapping of ASSIST applications using process algebra Technical Report

Institute on Programming Model, CoreGRID - Network of Excellence no. TR-0016, 2005.

Links | BibTeX | Tags:

Marco Aldinucci, Marco Vanneschi, Matteo Villa

Grid technologies and c-business for SMEs Proceedings Article

In: Cunningham, P., Cunningham, M. (Ed.): Innovation and the Knowledge Economy: Issues, Applications, Case Studies (Proc. of Intl. conference eChallenges 2005), IOS press, Amsterdam, The Netherland, 2005.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Marco Danelutto, Jan Dünnweber, Sergei Gorlatch

Optimization techniques for skeletons on grids Book Section

In: Grandinetti, L. (Ed.): Grid Computing and New Frontiers of High Performance Processing, vol. 14, pp. 255–273, Elsevier, 2005, ISSN: 09275452.

Abstract | Links | BibTeX | Tags: parallel

Marco Aldinucci, Francoise André, Jérémy Buisson, Sonia Campa, Massimo Coppola, Marco Danelutto, Corrado Zoccolo

Parallel program/component adaptivity management Technical Report

Institute on Programming Model, CoreGRID - Network of Excellence no. TR-0014, 2005.

Links | BibTeX | Tags:

Marco Aldinucci, Alessandro Petrocelli, Edoardo Pistoletti, Massimo Torquati, Marco Vanneschi, Luca Veraldi, Corrado Zoccolo

Dynamic reconfiguration of grid-aware applications in ASSIST Proceedings Article

In: Cunha, J. C., Medeiros, P. D. (Ed.): Proc. of 11th Intl. Euro-Par 2005 Parallel Processing, pp. 771–781, Springer, 2005.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Anne Benoit

Automatic mapping of ASSIST applications using process algebra Proceedings Article

In: Proc. of HLPP2005: Intl. Workshop on High-Level Parallel Programming, Warwick University, Coventry, UK 2005.

Abstract | Links | BibTeX | Tags:

Jan Dünnweber, Sergei Gorlatch, Sonia Campa, Marco Aldinucci, Marco Danelutto

Behavior Customization of Parallel Components Application Programming Technical Report

Institute on Programming Model, CoreGRID - Network of Excellence no. TR-0002, 2005.

Links | BibTeX | Tags:

Marco Aldinucci, Alessandro Petrocelli, Edoardo Pistoletti, Massimo Torquati, Marco Vanneschi, Luca Veraldi, Corrado Zoccolo

Dynamic reconfiguration of Grid-aware applications in ASSIST Technical Report

Università di Pisa, Dipartimento di Informatica, Italy no. TR-05-05, 2005.

Links | BibTeX | Tags:

Marco Aldinucci, Sergio Gusmeroli, Marco Vanneschi, Matteo Villa

SFIDA: interoperability in innovative c-business models for SMEs through an enabling Grid platform Proceedings Article

In: Pre-proc. of INTEROP-ESA: Intl. Conference on Interoperability on Enterprise Software and Applications, pp. 547–557, Geneva, Switzerland, 2005.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Sonia Campa, Massimo Coppola, Marco Danelutto, Domenico Laforenza, Diego Puppin, Luca Scarponi, Marco Vanneschi, Corrado Zoccolo

Components for high performance Grid programming in Grid.it Proceedings Article

In: Getov, V., Kielmann, T. (Ed.): Proc. of the Intl. Workshop on Component Models and Systems for Grid Applications, pp. 19–38, Springer, Saint-Malo, France, 2005, ISBN: 978-0-387-23351-2.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Marco Danelutto, Jan Dünnweber, Sergei Gorlatch

Optimization Techniques for Implementing Parallel Skeletons in Distributed Environments Technical Report

Institute on Programming Model, CoreGRID - Network of Excellence no. TR-0001, 2005.

Links | BibTeX | Tags:

2004

Marco Aldinucci, Massimo Torquati

Accelerating apache farms through ad-HOC distributed scalable object repository Proceedings Article

In: Danelutto, Marco, Vanneschi, Marco, Laforenza, Domenico (Ed.): Proc. of 10th Intl. Euro-Par 2004 Parallel Processing, pp. 596–605, Springer, 2004.

Abstract | Links | BibTeX | Tags: parallel

Marco Aldinucci, Sonia Campa, Massimo Coppola, Silvia Magini, Paolo Pesciullesi, Laura Potiti, Roberto Ravazzolo, Massimo Torquati, Corrado Zoccolo

Targeting heterogeneous architectures in ASSIST: Experimental results Proceedings Article

In: Danelutto, Marco, Vanneschi, Marco, Laforenza, Domenico (Ed.): Proc. of 10th Intl. Euro-Par 2004 Parallel Processing, pp. 638–643, Springer, 2004, ISBN: 978-3-540-22924-7.

Abstract | Links | BibTeX | Tags: parallel

Marco Aldinucci, Marco Danelutto, Jan Dünnweber

Optimization Techniques for Implementing Parallel Skeletons in Grid Environments Proceedings Article

In: Gorlatch, S. (Ed.): Proc. of CMPP: Intl. Workshop on Constructive Methods for Parallel Programming, pp. 35–47, Universität Münster, Germany, Stirling, Scotland, UK, 2004.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Massimo Torquati

Accelerating Apache farms through ad-HOC distributed scalable objects repository Technical Report

Università di Pisa, Dipartimento di Informatica, Italy no. TR-04-08, 2004.

Links | BibTeX | Tags:

Marco Aldinucci, Massimo Coppola, Marco Danelutto, Marco Vanneschi, Corrado Zoccolo

ASSIST as a Research Framework for High-performance Grid Programming Environments Technical Report

Università di Pisa, Dipartimento di Informatica, Italy no. TR-04-09, 2004.

Links | BibTeX | Tags:

Marco Aldinucci, Sonia Campa, Pierpaolo Ciullo, Massimo Coppola, Marco Danelutto, Paolo Pesciullesi, Roberto Ravazzolo, Massimo Torquati, Marco Vanneschi, Corrado Zoccolo

A framework for experimenting with structure parallel programming environment design Proceedings Article

In: Joubert, G. R., Nagel, W. E., Peters, F. J., Walter, W. V. (Ed.): Parallel Computing: Software Technology, Algorithms, Architectures and Applications (Proc. of PARCO 2003, Dresden, Germany), pp. 617–624, Elsevier, 2004, ISSN: 09275452.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Marco Danelutto

An operational semantics for skeletons Proceedings Article

Abstract | Links | BibTeX | Tags:

2003

Marco Aldinucci

Dynamic shared data in structured parallel programming frameworks PhD Thesis

Computer Science Dept., University of Pisa, 2003.

BibTeX | Tags:

Marco Aldinucci

eskimo: experimenting with Skeletons in the Shared Address Model Journal Article

In: Parallel Processing Letters, vol. 13, no. 3, pp. 449–460, 2003, ISSN: 0129-6264.

Abstract | Links | BibTeX | Tags: parallel

Marco Aldinucci, Sonia Campa, Pierpaolo Ciullo, Massimo Coppola, Marco Danelutto, Paolo Pesciullesi, Roberto Ravazzolo, Massimo Torquati, Marco Vanneschi, Corrado Zoccolo

ASSIST demo: a high level, high performance, portable, structured parallel programming environment at work Proceedings Article

In: Kosch, H., Böszörményi, L., Hellwagner, H. (Ed.): Proc. of 9th Intl. Euro-Par 2003 Parallel Processing, pp. 1295–1300, Springer, Klagenfurt, Austria, 2003.

Abstract | Links | BibTeX | Tags: parallel

Marco Aldinucci, Sonia Campa, Pierpaolo Ciullo, Massimo Coppola, Silvia Magini, Paolo Pesciullesi, Laura Potiti, Roberto Ravazzolo, Massimo Torquati, Marco Vanneschi, Corrado Zoccolo

The Implementation of ASSIST, an Environment for Parallel and Distributed Programming Proceedings Article

In: Kosch, H., Böszörményi, L., Hellwagner, H. (Ed.): Proc. of 9th Intl Euro-Par 2003 Parallel Processing, pp. 712–721, Springer, Klagenfurt, Austria, 2003, ISBN: 978-3-540-40788-1.

Abstract | Links | BibTeX | Tags:

Marco Aldinucci, Marco Danelutto, Paolo Teti

An advanced environment supporting structured parallel programming in Java Journal Article

In: Future Generation Computer Systems, vol. 19, no. 5, pp. 611–626, 2003.

Abstract | Links | BibTeX | Tags: parallel

Marco Aldinucci

eskimo: experimenting skeletons on the shared address model Proceedings Article

In: Proc. of HLPP2003: Intl. Workshop on High-Level Parallel Programming, pp. 89–100, Paris, France, 2003.

Abstract | Links | BibTeX | Tags:

2002

Marco Aldinucci, Marco Danelutto

An operational semantic for skeletons Technical Report

Università di Pisa, Dipartimento di Informatica, Italy no. TR-02-13, 2002.

Links | BibTeX | Tags:

Marco Aldinucci

Automatic Program Transformation: The Meta Tool for Skeleton-based Languages Book Section

In: Gorlatch, Sergei, Lengauer, Christian (Ed.): Constructive Methods for Parallel Programming, pp. 59–78, Nova Science Publishers, NY, USA, 2002, ISBN: 1-59033-374-8.

Abstract | Links | BibTeX | Tags: parallel

2001

Marco Aldinucci, Sergei Gorlatch, Christian Lengauer, Susanna Pelagatti

Towards Parallel Programming by Transformation: The FAN Skeleton Framework Journal Article

In: Parallel Algorithms and Applications, vol. 16, no. 2-3, pp. 87–121, 2001.

Abstract | Links | BibTeX | Tags: parallel

2000

Marco Aldinucci

The Meta Transformation Tool for Skeleton-Based Languages Proceedings Article

In: Gorlatch, S., Lengauer, C. (Ed.): Proc. of CMPP: Intl. Workshop on Constructive Methods for Parallel Programming, pp. 53–68, Fakultät für mathematik und informatik Uni. Passau, Germany, 2000.

Abstract | Links | BibTeX | Tags: parallel

Marco Danelutto

Task Farm Computations in Java Proceedings Article

In: Bubak, Marian, Afsarmanesh, Hamideh, Williams, Roy, Hertzberger, Louis O. (Ed.): Proc. of the 8th Intl. Conference on High-Performance Computing and Networking (HPCN Europe 2000), pp. 385–394, Springer, Amsterdam, The Netherlands, 2000, ISBN: 3-540-67553-1.

Links | BibTeX | Tags:

Marco Danelutto, Geppino Pucci

A Compact, Thread-Safe Communication Library for Efficient Cluster Computing Proceedings Article

In: Bubak, Marian, Afsarmanesh, Hamideh, Williams, Roy, Hertzberger, Louis O. (Ed.): Proc. of the 8th Intl. Conference on High-Performance Computing and Networking (HPCN Europe 2000), pp. 407–416, Springer, Amsterdam, The Netherlands, 2000, ISBN: 3-540-67553-1.

Links | BibTeX | Tags:

1999

Marco Aldinucci, Marco Danelutto

Stream parallel skeleton optimization Proceedings Article

In: Proc. of PDCS: Intl. Conference on Parallel and Distributed Computing and Systems, pp. 955–962, IASTED ACTA press, Cambridge, Massachusetts, USA, 1999.

Abstract | Links | BibTeX | Tags: parallel

Primo Becuzzi, Massimo Coppola, Marco Vanneschi

Mining of Association Rules in Very Large Databases: A Structured Parallel Approach Proceedings Article

In: Amestoy, Patrick, Berger, Philippe, Daydé, Michel J., Duff, Iain S., Frayssé, Valérie, Giraud, Luc, Ruiz, Daniel (Ed.): Proc. of 5th Intl. Euro-Par '99 Parallel Processing, pp. 1441–1450, Springer, Toulouse, France, 1999, ISBN: 3-540-66443-2.

Links | BibTeX | Tags:

1998

Marco Aldinucci, Massimo Coppola, Marco Danelutto

Rewriting skeleton programs: How to evaluate the data-parallel stream-parallel tradeoff Proceedings Article

In: Gorlatch, S. (Ed.): Proc. of CMPP: Intl. Workshop on Constructive Methods for Parallel Programming, pp. 44–58, Fakultät für mathematik und informatik Uni. Passau, Germany, 1998.

Abstract | Links | BibTeX | Tags: parallel

WE ARE HIRING! If you are Research Engineers, Ph.D. Candidates and Post-Doctoral Researchers send your CV to alpha@di.unito.it

WE ARE HIRING! If you are Research Engineers, Ph.D. Candidates and Post-Doctoral Researchers send your CV to alpha@di.unito.it

Papers | Parallel Computing

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

2002

2001

2000

1999

1998