Papers | Parallel Computing

2023

Iacopo Colonnelli

Workflow Models for Heterogeneous Distributed Systems Proceedings Article

In: Bena, Nicola, Martino, Beniamino Di, Maratea, Antonio, Sperduti, Alessandro, Nardo, Emanuel Di, Ciaramella, Angelo, Montella, Raffaele, Ardagna, Claudio A. (Ed.): Proceedings of the 2nd Italian Conference on Big Data and Data Science (ITADATA 2023), Naples, Italy, September 11-13, 2023, CEUR-WS.org, 2023.

Abstract | Links | BibTeX | Tags: across, eupex, icsc, jupyter-workflow, streamflow

2022

Iacopo Colonnelli, Marco Aldinucci

Hybrid Workflows For Large - Scale Scientific Applications Proceedings Article

In: Sixth EAGE High Performance Computing Workshop, pp. 1–5, European Association of Geoscientists & Engineers , Milano, Italy, 2022, ISSN: 2214-4609.

Abstract | Links | BibTeX | Tags: across, eupex

@inproceedings{22:eage-hpc-workshop,

title = {Hybrid Workflows For Large - Scale Scientific Applications},

author = {Iacopo Colonnelli and Marco Aldinucci},

url = {https://iris.unito.it/retrieve/d79ddabb-f9d7-4a55-9f84-1528b1533ba3/Extended_Abstract.pdf},

doi = {10.3997/2214-4609.2022615029},

issn = {2214-4609},

year  = {2022},

date = {2022-09-01},

booktitle = {Sixth EAGE High Performance Computing Workshop},

pages = {1–5},

publisher = {European Association of Geoscientists & Engineers },

address = {Milano, Italy},

abstract = {Large-scale scientific applications are facing an irrevrsible transition from monolithic, high-performance oriented codes to modular and polyglot deployments of specialised (micro-)services. The reasons behind this transition are many: coupling of standard solvers with Deep Learning techniques, offloading of data analysis and visualisation to Cloud, and the advent of specialised hardware accelerators. Topology-aware Workflow Management Systems (WMSs) play a crucial role. In particular, topology-awareness allows an explicit mapping of workflow steps onto heterogeneous locations, allowing automated executions on top of hybrid architectures (e.g., cloud+HPC or classical+quantum). Plus, topology-aware WMSs can offer nonfunctional requirements OOTB, e.g. components' life-cycle orchestration, secure and efficient data transfers, fault tolerance, and cross-cluster execution of urgent workloads. Augmenting interactive Jupyter Notebooks with distributed workflow capabilities allows domain experts to prototype and scale applications using the same technological stack, while relying on a feature-rich and user-friendly web interface. This abstract will showcase how these general methodologies can be applied to a typical geoscience simulation pipeline based on the Full Wavefront Inversion (FWI) technique. In particular, a prototypical Jupyter Notebook will be executed interactively on Cloud. Preliminary data analyses and post-processing will be executed locally, while the computationally demanding optimisation loop will be scheduled on a remote HPC cluster.},

keywords = {across, eupex},

pubstate = {published},

tppubtype = {inproceedings}

}

Iacopo Colonnelli, Marco Aldinucci, Barbara Cantalupo, Luca Padovani, Sergio Rabellino, Concetto Spampinato, Roberto Morelli, Rosario Di Carlo, Nicolò Magini, Carlo Cavazzoni

Distributed workflows with Jupyter Journal Article

In: Future Generation Computer Systems, vol. 128, pp. 282–298, 2022, ISSN: 0167-739X.

Abstract | Links | BibTeX | Tags: across, deephealth, jupyter-workflow, streamflow

@article{21:FGCS:jupyflow,

title = {Distributed workflows with Jupyter},

author = {Iacopo Colonnelli and Marco Aldinucci and Barbara Cantalupo and Luca Padovani and Sergio Rabellino and Concetto Spampinato and Roberto Morelli and Rosario Di Carlo and Nicolò Magini and Carlo Cavazzoni},

url = {https://www.sciencedirect.com/science/article/pii/S0167739X21003976},

doi = {10.1016/j.future.2021.10.007},

issn = {0167-739X},

year  = {2022},

date = {2022-01-01},

journal = {Future Generation Computer Systems},

volume = {128},

pages = {282–298},

abstract = {The designers of a new coordination interface enacting complex workflows have to tackle a dichotomy: choosing a language-independent or language-dependent approach. Language-independent approaches decouple workflow models from the host code's business logic and advocate portability. Language-dependent approaches foster flexibility and performance by adopting the same host language for business and coordination code. Jupyter Notebooks, with their capability to describe both imperative and declarative code in a unique format, allow taking the best of the two approaches, maintaining a clear separation between application and coordination layers but still providing a unified interface to both aspects. We advocate the Jupyter Notebooks' potential to express complex distributed workflows, identifying the general requirements for a Jupyter-based Workflow Management System (WMS) and introducing a proof-of-concept portable implementation working on hybrid Cloud-HPC infrastructures. As a byproduct, we extended the vanilla IPython kernel with workflow-based parallel and distributed execution capabilities. The proposed Jupyter-workflow (Jw) system is evaluated on common scenarios for High Performance Computing (HPC) and Cloud, showing its potential in lowering the barriers between prototypical Notebooks and production-ready implementations.},

keywords = {across, deephealth, jupyter-workflow, streamflow},

pubstate = {published},

tppubtype = {article}

}

WE ARE HIRING! If you are Research Engineers, Ph.D. Candidates and Post-Doctoral Researchers send your CV to alpha@di.unito.it

WE ARE HIRING! If you are Research Engineers, Ph.D. Candidates and Post-Doctoral Researchers send your CV to alpha@di.unito.it

Papers | Parallel Computing

2023

2022