Papers | Parallel Computing

2020

Jose Daniel Garcia, Jose Daniel Rio, Marco Aldinucci, Fabio Tordini, Marco Danelutto, Gabriele Mencagli, Massimo Torquati

Challenging the abstraction penalty in parallel patterns libraries: Adding FastFlow support to GrPPI Journal Article

In: The Journal of Supercomputing, vol. 76, no. 7, pp. 5139–5159, 2020.

Tags: fastflow, rephrase

@article{19:jsupe:grppi,

title = {Challenging the abstraction penalty in parallel patterns libraries: Adding FastFlow support to GrPPI},

author = {Jose Daniel Garcia and Jose Daniel Rio and Marco Aldinucci and Fabio Tordini and Marco Danelutto and Gabriele Mencagli and Massimo Torquati},

url = {https://iris.unito.it/retrieve/handle/2318/1762686/744894/2020-js-grppi-postprint.pdf},

doi = {10.1007/s11227-019-02826-5},

year  = {2020},

date = {2020-01-01},

journal = {The Journal of Supercomputing},

volume = {76},

number = {7},

pages = {5139–5159},

abstract = {In recent years, pattern-based programming has been recognized as a good practice for efficiently exploiting parallel hardware resources. Following this approach, multiple libraries have been designed to provide such high-level abstractions and ease parallel programming. However, those libraries do not share a common interface. To pave the way, GrPPI has been designed to provide an intermediate abstraction layer between application developers and existing parallel programming frameworks like OpenMP, Intel TBB or ISO C++ threads. On the other hand, FastFlow has been adopted as an efficient object-based programming framework that may benefit from being supported as an additional GrPPI backend. However, the object-based approach presents some major challenges to being incorporated under the GrPPI type-safe functional programming style. In this paper, we present the integration of FastFlow as a new GrPPI backend to demonstrate that structured parallel programming frameworks perfectly fit the GrPPI design. Additionally, we demonstrate that GrPPI does not incur additional overhead for providing its abstraction layer, and we study the programmability in terms of lines of code and cyclomatic complexity. In general, the presented work acts as reciprocal validation of both FastFlow (as an efficient, native structured parallel programming framework) and GrPPI (as an efficient abstraction layer on top of existing parallel programming frameworks).},

keywords = {fastflow, rephrase},

pubstate = {published},

tppubtype = {article}

}

2019

Marco Danelutto, Tiziano De Matteis, Daniele De Sensi, Gabriele Mencagli, Massimo Torquati, Marco Aldinucci, Peter Kilpatrick

The RePhrase Extended Pattern Set for Data Intensive Parallel Computing Journal Article

In: International Journal of Parallel Programming, vol. 47, no. 1, pp. 74–93, 2019.

Tags: fastflow, rephrase

Massimo Torquati, Gabriele Mencagli, Maurizio Drocco, Marco Aldinucci, Tiziano De Matteis, Marco Danelutto

On Dynamic Memory Allocation in Sliding-Window Parallel Patterns for Streaming Analytics Journal Article

In: The Journal of Supercomputing, vol. 75, no. 8, pp. 4114–4131, 2019.

Tags: fastflow, rephrase

2018

Gabriele Mencagli, Massimo Torquati, Fabio Lucattini, Salvatore Cuomo, Marco Aldinucci

Harnessing sliding-window execution semantics for parallel stream processing Journal Article

In: Journal of Parallel and Distributed Computing, vol. 116, pp. 74–88, 2018, ISSN: 0743-7315.

Tags: fastflow, rephrase

Claudia Misale, Maurizio Drocco, Guy Tremblay, Alberto R. Martinelli, Marco Aldinucci

PiCo: High-performance data analytics pipelines in modern C++ Journal Article

In: Future Generation Computer Systems, vol. 87, pp. 392–403, 2018.

Tags: fastflow, HPC, toreador

2017

Maurizio Drocco

Parallel Programming with Global Asynchronous Memory: Models, C++ APIs and Implementations PhD Thesis

Computer Science Department, University of Torino, 2017.

Tags: fastflow, paraphrase, repara, rephrase, toreador

@phdthesis{17:gam:drocco:thesis,

title = {Parallel Programming with Global Asynchronous Memory: Models, C++ APIs and Implementations},

author = {Maurizio Drocco},

url = {https://zenodo.org/record/1037585/files/Drocco_phd_thesis.pdf},

doi = {10.5281/zenodo.1037585},

year  = {2017},

date = {2017-10-01},

school = {Computer Science Department, University of Torino},

abstract = {In the realm of High Performance Computing (HPC), message passing has been the programming paradigm of choice for over twenty years. The durable MPI (Message Passing Interface) standard, with send/receive communication, broadcast, gather/scatter, and reduction collectives, is still used to construct parallel programs where each communication is orchestrated by the developer-based precise knowledge of data distribution and overheads; collective communications simplify the orchestration but might induce excessive synchronization. Early attempts to bring the shared-memory programming model, with its programming advantages, to distributed computing, referred to as the Distributed Shared Memory (DSM) model, faded away; one of the main issues was to combine performance and programmability with the memory consistency model. The recently proposed Partitioned Global Address Space (PGAS) model is a modern revamp of DSM that exposes data placement to enable optimizations based on locality, but it still addresses (simple) data-parallelism only and it relies on expensive sharing protocols. We advocate an alternative programming model for distributed computing based on a Global Asynchronous Memory (GAM), aiming to avoid coherency and consistency problems rather than solving them. We materialize GAM by designing and implementing a distributed smart pointers library, inspired by C++ smart pointers. In this model, public and private pointers (resembling C++ shared and unique pointers, respectively) are moved around instead of messages (i.e., data), thus relieving the user of the burden of minimizing transfers. On top of smart pointers, we propose a high-level C++ template library for writing applications in terms of dataflow-like networks, namely GAM nets, consisting of stateful processors exchanging pointers in fully asynchronous fashion. 
We demonstrate the validity of the proposed approach, from the expressiveness perspective, by showing how GAM nets can be exploited to implement higher-level parallel programming models, such as data and task parallelism. As for the performance perspective, the execution of two non-toy benchmarks on a number of different small-scale HPC clusters exhibits both close-to-ideal scalability and negligible overhead with respect to state-of-the-art benchmark implementations. For instance, the GAM implementation of a high-quality video restoration filter sustains a 100 fps throughput over 70%-noisy high-quality video streams on a 4-node cluster of Graphics Processing Units (GPUs), with minimal programming effort.},

keywords = {fastflow, paraphrase, repara, rephrase, toreador},

pubstate = {published},

tppubtype = {phdthesis}

}

Claudia Misale

PiCo: A Domain-Specific Language for Data Analytics Pipelines PhD Thesis

Computer Science Department, University of Torino, 2017.

Tags: fastflow, paraphrase, repara, rephrase, toreador

@phdthesis{17:pico:misale:thesis,

title = {PiCo: A Domain-Specific Language for Data Analytics Pipelines},

author = {Claudia Misale},

url = {https://iris.unito.it/retrieve/handle/2318/1633743/320170/Misale_thesis.pdf},

doi = {10.5281/zenodo.579753},

year  = {2017},

date = {2017-05-01},

school = {Computer Science Department, University of Torino},

abstract = {In the world of Big Data analytics, there is a series of tools aiming at simplifying programming applications to be executed on clusters. Although each tool claims to provide better programming, data and execution models, for which only informal (and often confusing) semantics is generally provided, all share a common underlying model, namely, the Dataflow model. Using this model as a starting point, it is possible to categorize and analyze almost all aspects of Big Data analytics tools from a high-level perspective. This analysis can be considered as a first step toward a formal model to be exploited in the design of a (new) framework for Big Data analytics. By putting clear separations between all levels of abstraction (i.e., from the runtime to the user API), it is easier for a programmer or software designer to avoid mixing low-level with high-level aspects, as we often see in state-of-the-art Big Data analytics frameworks.

From the user-level perspective, we think that a clearer and simpler semantics is preferable, together with a strong separation of concerns. For this reason, we use the Dataflow model as a starting point to build a programming environment with a simplified programming model implemented as a Domain-Specific Language, which sits on top of a stack of layers that build a prototypical framework for Big Data analytics.

The contribution of this thesis is twofold: first, we show that the proposed model is (at least) as general as existing batch and streaming frameworks (e.g., Spark, Flink, Storm, Google Dataflow), thus making it easier to understand high-level data-processing applications written in such frameworks. As a result of this analysis, we provide a layered model that can represent tools and applications following the Dataflow paradigm and we show how the analyzed tools fit in each level.

Second, we propose a programming environment based on such a layered model in the form of a Domain-Specific Language (DSL) for processing data collections, called PiCo (Pipeline Composition). The main entity of this programming model is the Pipeline, basically a DAG-composition of processing elements. This model is intended to give the user a unique interface for both stream and batch processing, completely hiding data management and focusing only on operations, which are represented by Pipeline stages. Our DSL will be built on top of the FastFlow library, exploiting both shared and distributed parallelism, and implemented in C++11/14 with the aim of porting C++ into the Big Data world.},

keywords = {fastflow, paraphrase, repara, rephrase, toreador},

pubstate = {published},

tppubtype = {phdthesis}

}

Marco Aldinucci, Marco Danelutto, Daniele De Sensi, Gabriele Mencagli, Massimo Torquati

Towards Power-Aware Data Pipelining on Multicores Proceedings Article

In: Proceedings of the 10th International Symposium on High-Level Parallel Programming and Applications, Valladolid, Spain, 2017.

Tags: fastflow, rephrase

Marco Aldinucci, Marco Danelutto, Peter Kilpatrick, Massimo Torquati

FastFlow: high-level and efficient streaming on multi-core Book Chapter

In: Pllana, Sabri, Xhafa, Fatos (Ed.): Programming Multi-core and Many-core Computing Systems, Chapter 13, pp. 261–280, John Wiley & Sons, Ltd, 2017, ISBN: 9781119332015.

Tags: fastflow

Fabio Tordini, Maurizio Drocco, Claudia Misale, Luciano Milanesi, Pietro Liò, Ivan Merelli, Massimo Torquati, Marco Aldinucci

NuChart-II: the road to a fast and scalable tool for Hi-C data analysis Journal Article

In: International Journal of High Performance Computing Applications, vol. 31, no. 3, pp. 196–211, 2017.

Tags: bioinformatics, fastflow, repara, rephrase

@article{16:ijhpca:nuchart,

title = {NuChart-II: the road to a fast and scalable tool for Hi-C data analysis},

author = {Fabio Tordini and Maurizio Drocco and Claudia Misale and Luciano Milanesi and Pietro Liò and Ivan Merelli and Massimo Torquati and Marco Aldinucci},

url = {https://iris.unito.it/retrieve/handle/2318/1607126/238747/main.pdf},

doi = {10.1177/1094342016668567},

year  = {2017},

date = {2017-01-01},

journal = {International Journal of High Performance Computing Applications},

volume = {31},

number = {3},

pages = {196–211},

abstract = {Recent advances in molecular biology and bioinformatics techniques have led to an explosion of information about the spatial organisation of the DNA in the nucleus of a cell. High-throughput molecular biology techniques provide a genome-wide capture of the spatial organisation of chromosomes at unprecedented scales, which makes it possible to identify physical interactions between genetic elements located throughout a genome. Recent results have shown that there is a large correlation between co-localisation and co-regulation of genes, but this important information is hampered by the lack of biologist-friendly analysis and visualisation software. In this work we present NuChart-II, an efficient and highly optimized tool for genomic data analysis that provides a gene-centric, graph-based representation of genomic information. While designing NuChart-II we addressed several common issues in the parallelisation of memory-bound algorithms for shared-memory systems. With performance and usability in mind, NuChart-II is an R package that embeds a C++ engine: computing capabilities and memory hierarchy of multi-core architectures are fully exploited, while the versatile R environment for statistical analysis and data visualisation raises the level of abstraction and makes it possible to orchestrate the analysis and visualisation of genomic data.},

keywords = {bioinformatics, fastflow, repara, rephrase},

pubstate = {published},

tppubtype = {article}

}

2016

Fabio Tordini

The road towards a Cloud-based High-Performance solution for genomic data analysis PhD Thesis

Computer Science Department, University of Torino, Italy, 2016.

Tags: bioinformatics, fastflow

@phdthesis{tordiniThesis16,

title = {The road towards a Cloud-based High-Performance solution for genomic data analysis},

author = {Fabio Tordini},

url = {http://calvados.di.unipi.it/storage/paper_files/2016_tordini_phdthesis.pdf},

year  = {2016},

date = {2016-04-01},

school = {Computer Science Department, University of Torino, Italy},

abstract = {Nowadays, molecular biology laboratories are delivering more and more data about DNA organisation, at increasing resolution and in a large number of samples. So much so that genomic research is now facing many of the scale-out issues that high-performance computing has been addressing for years: they require powerful infrastructures with fast computing and storage capabilities, with substantial challenges in terms of data processing, statistical analysis and data representation. With this thesis we propose a high-performance pipeline for the analysis and interpretation of heterogeneous genomic information: besides performance, usability and availability are two essential requirements that novel Bioinformatics tools should satisfy. In this perspective, we propose and discuss our efforts towards a solid infrastructure for data processing and storage, where software that operates over data is exposed as a service, and is accessible by users through the Internet. We begin by presenting NuChart-II, a tool for the analysis and interpretation of spatial genomic information. With NuChart-II we propose a graph-based representation of genomic data, which can provide insights on the disposition of genomic elements in the DNA. We also discuss our approach for the normalisation of biases that affect raw sequenced data. We believe that many currently available tools for genomic data analysis are perceived as tricky and troublesome applications that require highly specialised skills to obtain the desired outcomes. Concerning usability, we want to raise the level of abstraction perceived by the user, but maintain high performance and correctness while providing an exhaustive solution for data visualisation. We also intend to foster the availability of novel tools: in this work we also discuss a cloud solution that delivers computation and storage as dynamically allocated virtual resources via the Internet, while needed software is provided as a service. 
In this way, the computational demand of genomic research can be satisfied more economically by using lab-scale and enterprise-oriented technologies. Here we discuss our idea of a task farm for the integration of heterogeneous data resulting from different sequencing experiments: we believe that the integration of multi-omic features on a nuclear map can be a valuable means for studying the interactions among genetic elements. This can reveal insights on biological mechanisms, such as gene regulation, translocations and epigenetic patterns.},

keywords = {bioinformatics, fastflow},

pubstate = {published},

tppubtype = {phdthesis}

}

Maurizio Drocco, Claudia Misale, Marco Aldinucci

A Cluster-As-Accelerator approach for SPMD-free Data Parallelism Proceedings Article

In: Proc. of 24th Euromicro Intl. Conference on Parallel Distributed and network-based Processing (PDP), pp. 350–353, IEEE, Crete, Greece, 2016.

Tags: fastflow, rephrase

Vladimir Janjic, Christopher Brown, Kenneth MacKenzie, Kevin Hammond, Marco Danelutto, Marco Aldinucci, Jose Daniel Garcia

RPL: A Domain-Specific Language for Designing and Implementing Parallel C++ Applications Proceedings Article

In: Proc. of Intl. Euromicro PDP 2016: Parallel Distributed and network-based Processing, IEEE, Crete, Greece, 2016.

Tags: fastflow, rephrase

@inproceedings{rpl:pdp:16,

title = {RPL: A Domain-Specific Language for Designing and Implementing Parallel C++ Applications},

author = {Vladimir Janjic and Christopher Brown and Kenneth MacKenzie and Kevin Hammond and Marco Danelutto and Marco Aldinucci and Jose Daniel Garcia},

url = {https://iris.unito.it/retrieve/handle/2318/1597172/299237/2016_jsupe_stencil_pp_4aperto.pdf},

doi = {10.1109/PDP.2016.122},

year  = {2016},

date = {2016-01-01},

booktitle = {Proc. of Intl. Euromicro PDP 2016: Parallel Distributed and network-based Processing},

publisher = {IEEE},

address = {Crete, Greece},

abstract = {Parallelising sequential applications is usually a very hard job, due to many different ways in which an application can be parallelised and a large number of programming models (each with its own advantages and disadvantages) that can be used. In this paper, we describe a method to semi-automatically generate and evaluate different parallelisations of the same application, allowing programmers to find the best parallelisation without significant manual reengineering of the code. We describe a novel, high-level domain-specific language, Refactoring Pattern Language (RPL), that is used to represent the parallel structure of an application and to capture its extra-functional properties (such as service time). We then describe a set of RPL rewrite rules that can be used to generate alternative, but semantically equivalent, parallel structures (parallelisations) of the same application. We also describe the RPL Shell that can be used to evaluate these parallelisations, in terms of the desired extra-functional properties. Finally, we describe a set of C++ refactorings, targeting OpenMP, Intel TBB and FastFlow parallel programming models, that semi-automatically apply the desired parallelisation to the application's source code, therefore giving a parallel version of the code. We demonstrate how the RPL and the refactoring rules can be used to derive efficient parallelisations of two realistic C++ use cases (Image Convolution and Ant Colony Optimisation).},

keywords = {fastflow, rephrase},

pubstate = {published},

tppubtype = {inproceedings}

}

Marco Aldinucci, Sonia Campa, Marco Danelutto, Peter Kilpatrick, Massimo Torquati

Pool Evolution: A Parallel Pattern for Evolutionary and Symbolic Computing Journal Article

In: International Journal of Parallel Programming, vol. 44, no. 3, pp. 531–551, 2016, ISSN: 0885-7458.

Tags: fastflow, paraphrase, repara

Fabio Tordini, Ivan Merelli, Pietro Liò, Luciano Milanesi, Marco Aldinucci

NuchaRt: embedding high-level parallel computing in R for augmented Hi-C data analysis Book Section

In: Publishing, Springer International (Ed.): Computational Intelligence Methods for Bioinformatics and Biostatistics, vol. 9874, pp. 259–272, Springer International Publishing, Cham (ZG), 2016, ISBN: 978-3-319-44331-7.

Tags: bioinformatics, fastflow, repara

Fabio Tordini

A cloud solution for multi-omics data integration Proceedings Article

In: Proceedings of the 16th IEEE International Conference on Scalable Computing and Communication, pp. 559–566, IEEE Computer Society, 2016, (Best paper award).

Tags: bioinformatics, fastflow, rephrase

Manuel F. Dolz, David Rio Astorga, Javier Fernández, J. Daniel García, Félix García-Carballeira, Marco Danelutto, Massimo Torquati

Embedding Semantics of the Single-Producer/Single-Consumer Lock-Free Queue into a Race Detection Tool Proceedings Article

In: Proceedings of the 7th International Workshop on Programming Models and Applications for Multicores and Manycores, pp. 20–29, ACM, Barcelona, Spain, 2016, ISBN: 978-1-4503-4196-7.

Tags: fastflow, repara

Andrea Bracciali, Marco Aldinucci, Murray Patterson, Tobias Marschall, Nadia Pisanti, Ivan Merelli, Massimo Torquati

pWhatsHap: efficient haplotyping for future generation sequencing Journal Article

In: BMC Bioinformatics, vol. 17, no. Suppl 11, pp. 342, 2016.

Tags: fastflow, paraphrase, rephrase

@article{16:pwhatshap:bmc,

title = {pWhatsHap: efficient haplotyping for future generation sequencing},

author = {Andrea Bracciali and Marco Aldinucci and Murray Patterson and Tobias Marschall and Nadia Pisanti and Ivan Merelli and Massimo Torquati},

url = {http://bmcbioinformatics.biomedcentral.com/track/pdf/10.1186/s12859-016-1170-y?site=bmcbioinformatics.biomedcentral.com},

doi = {10.1186/s12859-016-1170-y},

year  = {2016},

date = {2016-01-01},

journal = {BMC Bioinformatics},

volume = {17},

number = {Suppl 11},

pages = {342},

abstract = {Background: Haplotype phasing is an important problem in the analysis of genomics information. Given a set of DNA fragments of an individual, it consists of determining which one of the possible alleles (alternative forms of a gene) each fragment comes from. Haplotype information is relevant to gene regulation, epigenetics, genome-wide association studies, evolutionary and population studies, and the study of mutations. Haplotyping is currently addressed as an optimisation problem aiming at solutions that minimise, for instance, error correction costs, where costs are a measure of the confidence in the accuracy of the information acquired from DNA sequencing. Solutions have typically an exponential computational complexity. WhatsHap is a recent optimal approach which moves computational complexity from DNA fragment length to fragment overlap, i.e., coverage, and is hence of particular interest when considering sequencing technology's current trends that are producing longer fragments. Results: Given the potential relevance of efficient haplotyping in several analysis pipelines, we have designed and engineered pWhatsHap, a parallel, high-performance version of WhatsHap. pWhatsHap is embedded in a toolkit developed in Python and supports genomics datasets in standard file formats. Building on WhatsHap, pWhatsHap exhibits the same complexity exploring a number of possible solutions which is exponential in the coverage of the dataset. The parallel implementation on multi-core architectures allows for a relevant reduction of the execution time for haplotyping, while the provided results enjoy the same high accuracy as that provided by WhatsHap, which increases with coverage. Conclusions: Due to its structure and management of the large datasets, the parallelisation of WhatsHap posed demanding technical challenges, which have been addressed exploiting a high-level parallel programming framework. 
The result, pWhatsHap, is a freely available toolkit that improves the efficiency of the analysis of genomics information.},

keywords = {fastflow, paraphrase, rephrase},

pubstate = {published},

tppubtype = {article}

}

2015

Paolo Inaudi

Progettazione e sviluppo di un provider libfabric per la rete ad alte prestazioni Ronniee/A3Cube Masters Thesis

Computer Science Department, University of Torino, 2015.

BibTeX | Tags: fastflow

Marco Aldinucci, Marco Danelutto, Maurizio Drocco, Peter Kilpatrick, Guilherme Peretti Pezzi, Massimo Torquati

The Loop-of-Stencil-Reduce paradigm Proceedings Article

In: Proc. of Intl. Workshop on Reengineering for Parallelism in Heterogeneous Parallel Platforms (RePara), pp. 172–177, IEEE, Helsinki, Finland, 2015.

Abstract | Links | BibTeX | Tags: fastflow, HPC, repara

Fabio Tordini, Maurizio Drocco, Ivan Merelli, Luciano Milanesi, Pietro Liò, Marco Aldinucci

NuChart-II: a graph-based approach for the analysis and interpretation of Hi-C data Proceedings Article

In: Serio, Clelia Di, Liò, Pietro, Nonis, Alessandro, Tagliaferri, Roberto (Ed.): Proc. of 11th Intl. Meeting on Computational Intelligence Methods for Bioinformatics and Biostatistics (CIBB), pp. 298–311, Springer, Cambridge, UK, 2015, ISBN: 978-3-319-24461-7.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow, paraphrase, repara

@inproceedings{14:ff:nuchart:cibb,

title = {NuChart-II: a graph-based approach for the analysis and interpretation of Hi-C data},

author = {Fabio Tordini and Maurizio Drocco and Ivan Merelli and Luciano Milanesi and Pietro Liò and Marco Aldinucci},

editor = {Clelia Di Serio and Pietro Liò and Alessandro Nonis and Roberto Tagliaferri},

url = {http://calvados.di.unipi.it/storage/paper_files/2014_nuchart_cibb.pdf},

doi = {10.1007/978-3-319-24462-4_25},

isbn = {978-3-319-24461-7},

year  = {2015},

date = {2015-06-01},

booktitle = {Proc. of 11th Intl. Meeting on Computational Intelligence Methods for Bioinformatics and Biostatistics (CIBB)},

volume = {8623},

pages = {298–311},

publisher = {Springer},

address = {Cambridge, UK},

series = {LNCS},

abstract = {Long-range chromosomal associations between genomic regions, and their repositioning in the 3D space of the nucleus, are now considered to be key contributors to the regulation of gene expressions, and important links have been highlighted with other genomic features involved in DNA rearrangements. Recent Chromosome Conformation Capture (3C) measurements performed with high throughput sequencing (Hi-C) and molecular dynamics studies show that there is a large correlation between co-localization and co-regulation of genes, but these important researches are hampered by the lack of biologists-friendly analysis and visualisation software. In this work we present NuChart-II, a software that allows the user to annotate and visualize a list of input genes with information relying on Hi-C data, integrating knowledge data about genomic features that are involved in the chromosome spatial organization. This software works directly with sequenced reads to identify related Hi-C fragments, with the aim of creating gene-centric neighbourhood graphs on which multi-omics features can be mapped. NuChart-II is a highly optimized implementation of a previous prototype package developed in R, in which the graph-based representation of Hi-C data was tested. The prototype showed inevitable problems of scalability while working genome-wide on large datasets: particular attention has been paid in optimizing the data structures employed while constructing the neighbourhood graph, so as to foster an efficient parallel implementation of the software. The normalization of Hi-C data has been modified and improved, in order to provide a reliable estimation of proximity likelihood for the genes.},

keywords = {bioinformatics, fastflow, paraphrase, repara},

pubstate = {published},

tppubtype = {inproceedings}

}

Maurizio Drocco, Claudia Misale, Guilherme Peretti Pezzi, Fabio Tordini, Marco Aldinucci

Memory-Optimised Parallel Processing of Hi-C Data Proceedings Article

In: Proc. of 23rd Euromicro Intl. Conference on Parallel Distributed and network-based Processing (PDP), pp. 1–8, IEEE, 2015.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow, impact, paraphrase, repara

Fabio Tordini, Maurizio Drocco, Claudia Misale, Luciano Milanesi, Pietro Liò, Ivan Merelli, Marco Aldinucci

Parallel Exploration of the Nuclear Chromosome Conformation with NuChart-II Proceedings Article

In: Proc. of 23rd Euromicro Intl. Conference on Parallel Distributed and network-based Processing (PDP), IEEE, 2015.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow, impact, paraphrase, repara

Paolo Viviani

Parallel Computing Techniques for High Energy Physics Masters Thesis

Physics Department, University of Torino, 2015.

Abstract | BibTeX | Tags: fastflow, impact

Ivan Merelli, Fabio Tordini, Maurizio Drocco, Marco Aldinucci, Pietro Liò, Luciano Milanesi

Integrating Multi-omic features exploiting Chromosome Conformation Capture data Journal Article

In: Frontiers in Genetics, vol. 6, no. 40, 2015, ISSN: 1664-8021.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow

@article{nuchart:frontiers:15,

title = {Integrating Multi-omic features exploiting Chromosome Conformation Capture data},

author = {Ivan Merelli and Fabio Tordini and Maurizio Drocco and Marco Aldinucci and Pietro Liò and Luciano Milanesi},

url = {http://journal.frontiersin.org/Journal/10.3389/fgene.2015.00040/pdf},

doi = {10.3389/fgene.2015.00040},

issn = {1664-8021},

year  = {2015},

date = {2015-01-01},

journal = {Frontiers in Genetics},

volume = {6},

number = {40},

abstract = {The representation, integration and interpretation of omic data is a complex task, in particular considering the huge amount of information that is daily produced in molecular biology laboratories all around the world. The reason is that sequencing data regarding expression profiles, methylation patterns, and chromatin domains is difficult to harmonize in a systems biology view, since genome browsers only allow coordinate-based representations, discarding functional clusters created by the spatial conformation of the DNA in the nucleus. In this context, recent progresses in high throughput molecular biology techniques and bioinformatics have provided insights into chromatin interactions on a larger scale and offer a formidable support for the interpretation of multi-omic data. In particular, a novel sequencing technique called Chromosome Conformation Capture (3C) allows the analysis of the chromosome organization in the cell's natural state. While performed genome wide, this technique is usually called Hi-C. Inspired by service applications such as Google Maps, we developed NuChart, an R package that integrates Hi-C data to describe the chromosomal neighbourhood starting from the information about gene positions, with the possibility of mapping on the achieved graphs genomic features such as methylation patterns and histone modifications, along with expression profiles. In this paper we show the importance of the NuChart application for the integration of multi-omic data in a systems biology fashion, with particular interest in cytogenetic applications of these techniques. Moreover, we demonstrate how the integration of multi-omic data can provide useful information in understanding why genes are in certain specific positions inside the nucleus and how epigenetic patterns correlate with their expression.},

keywords = {bioinformatics, fastflow},

pubstate = {published},

tppubtype = {article}

}

Marco Aldinucci, Andrea Bracciali, Tobias Marschall, Murray Patterson, Nadia Pisanti, Massimo Torquati

High-Performance Haplotype Assembly Proceedings Article

In: Serio, Clelia Di, Liò, Pietro, Nonis, Alessandro, Tagliaferri, Roberto (Ed.): Computational Intelligence Methods for Bioinformatics and Biostatistics - 11th International Meeting, CIBB 2014, Cambridge, UK, June 26-28, 2014, Revised Selected Papers, pp. 245–258, Springer, Cambridge, UK, 2015.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow

Marco Aldinucci, Guilherme Peretti Pezzi, Maurizio Drocco, Concetto Spampinato, Massimo Torquati

Parallel Visual Data Restoration on Multi-GPGPUs using Stencil-Reduce Pattern Journal Article

In: International Journal of High Performance Computing Applications, vol. 29, no. 4, pp. 461–472, 2015.

Abstract | Links | BibTeX | Tags: fastflow, HPC, impact, paraphrase

2014

Marco Aldinucci, Sonia Campa, Marco Danelutto, Peter Kilpatrick, Massimo Torquati

Pool evolution: a domain specific parallel pattern Proceedings Article

In: Proc. of the 7th Intl. Symposium on High-level Parallel Programming and Applications (HLPP), Amsterdam, The Netherlands, 2014.

Abstract | Links | BibTeX | Tags: fastflow, paraphrase, repara

Marco Aldinucci, Massimo Torquati, Maurizio Drocco, Guilherme Peretti Pezzi, Concetto Spampinato

FastFlow: Combining Pattern-Level Abstraction and Efficiency in GPGPUs Proceedings Article

In: GPU Technology Conference (GTC), San Jose, CA, USA, 2014.

Abstract | Links | BibTeX | Tags: fastflow, HPC, impact, paraphrase

Marco Aldinucci, Massimo Torquati, Maurizio Drocco, Guilherme Peretti Pezzi, Concetto Spampinato

An Overview of FastFlow: Combining Pattern-Level Abstraction and Efficiency in GPGPUs Proceedings Article

In: GPU Technology Conference (GTC), San Jose, CA, USA, 2014.

Abstract | Links | BibTeX | Tags: fastflow, HPC, impact, paraphrase

Daniele Buono, Marco Danelutto, Tiziano De Matteis, Gabriele Mencagli, Massimo Torquati

A Lightweight Run-Time Support For Fast Dense Linear Algebra on Multi-Core Proceedings Article

In: Proc. of the 12th International Conference on Parallel and Distributed Computing and Networks (PDCN 2014), IASTED, ACTA press, 2014.

BibTeX | Tags: fastflow

Marco Aldinucci, Massimo Torquati, Concetto Spampinato, Maurizio Drocco, Claudia Misale, Cristina Calcagno, Mario Coppo

Parallel stochastic systems biology in the cloud Journal Article

In: Briefings in Bioinformatics, vol. 15, no. 5, pp. 798–813, 2014, ISSN: 1467-5463.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow, impact, paraphrase

Marco Aldinucci, Sonia Campa, Marco Danelutto, Peter Kilpatrick, Massimo Torquati

Design patterns percolating to parallel programming framework implementation Journal Article

In: International Journal of Parallel Programming, vol. 42, no. 6, pp. 1012–1031, 2014, ISSN: 0885-7458.

Abstract | Links | BibTeX | Tags: fastflow, paraphrase

Marco Aldinucci, Salvatore Ruggieri, Massimo Torquati

Decision Tree Building on Multi-Core using FastFlow Journal Article

In: Concurrency and Computation: Practice and Experience, vol. 26, no. 3, pp. 800–820, 2014.

Abstract | Links | BibTeX | Tags: fastflow, paraphrase

Marco Aldinucci, Cristina Calcagno, Mario Coppo, Ferruccio Damiani, Maurizio Drocco, Eva Sciacca, Salvatore Spinella, Massimo Torquati, Angelo Troina

On designing multicore-aware simulators for systems biology endowed with on-line statistics Journal Article

In: BioMed Research International, 2014.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow, paraphrase

Marco Aldinucci, Maurizio Drocco, Guilherme Peretti Pezzi, Claudia Misale, Fabio Tordini, Massimo Torquati

Exercising high-level parallel programming on streams: a systems biology use case Proceedings Article

In: Proc. of 34th IEEE Intl. Conference on Distributed Computing Systems Workshops (ICDCSW), IEEE, Madrid, Spain, 2014.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow, impact, paraphrase

@inproceedings{cwc:gpu:dcperf:14,

title = {Exercising high-level parallel programming on streams: a systems biology use case},

author = {Marco Aldinucci and Maurizio Drocco and Guilherme Peretti Pezzi and Claudia Misale and Fabio Tordini and Massimo Torquati},

url = {https://iris.unito.it/retrieve/handle/2318/154516/26657/2014_dcperf_cwc_gpu.pdf},

doi = {10.1109/ICDCSW.2014.38},

year  = {2014},

date = {2014-01-01},

booktitle = {Proc. of 34th IEEE Intl. Conference on Distributed Computing Systems Workshops (ICDCSW)},

publisher = {IEEE},

address = {Madrid, Spain},

abstract = {The stochastic modelling of biological systems, coupled with Monte Carlo simulation of models, is an increasingly popular technique in Bioinformatics. The simulation-analysis workflow may result in a computationally expensive task reducing the interactivity required in the model tuning. In this work, we advocate high-level software design as a vehicle for building efficient and portable parallel simulators for a variety of platforms, ranging from multi-core platforms to GPGPUs to cloud. In particular, the Calculus of Wrapped Compartments (CWC) parallel simulator for systems biology equipped with on-line mining of results, which is designed according to the FastFlow pattern-based approach, is discussed as a running example. In this work, the CWC simulator is used as a paradigmatic example of a complex C++ application where the quality of results is correlated with both computation and I/O bounds, and where high-quality results might turn into big data. The FastFlow parallel programming framework, which advocates C++ pattern-based parallel programming, makes it possible to develop portable parallel code without relinquishing either run-time efficiency or performance tuning opportunities. Performance and effectiveness of the approach are validated on a variety of platforms, inter-alia cache-coherent multi-cores, cluster of multi-core (Ethernet and Infiniband) and the Amazon Elastic Compute Cloud.},

keywords = {bioinformatics, fastflow, impact, paraphrase},

pubstate = {published},

tppubtype = {inproceedings}

}

Marco Aldinucci, Guilherme Peretti Pezzi, Maurizio Drocco, Fabio Tordini, Peter Kilpatrick, Massimo Torquati

Parallel video denoising on heterogeneous platforms Proceedings Article

In: Proc. of Intl. Workshop on High-level Programming for Heterogeneous and Hierarchical Parallel Systems (HLPGPU), 2014.

Abstract | Links | BibTeX | Tags: fastflow, impact, paraphrase

Claudia Misale

Accelerating Bowtie2 with a lock-less concurrency approach and memory affinity Proceedings Article

In: Aldinucci, Marco, D'Agostino, Daniele, Kilpatrick, Peter (Ed.): Proc. of Intl. Euromicro PDP 2014: Parallel Distributed and network-based Processing, IEEE, Torino, Italy, 2014, (Best paper award).

Abstract | Links | BibTeX | Tags: fastflow, paraphrase

@inproceedings{ff:bowtie2:pdp:14,

title = {Accelerating Bowtie2 with a lock-less concurrency approach and memory affinity},

author = {Claudia Misale},

editor = {Marco Aldinucci and Daniele D'Agostino and Peter Kilpatrick},

url = {http://calvados.di.unipi.it/storage/paper_files/2014_pdp_bowtieff.pdf},

doi = {10.1109/PDP.2014.50},

year  = {2014},

date = {2014-01-01},

booktitle = {Proc. of Intl. Euromicro PDP 2014: Parallel Distributed and network-based Processing},

publisher = {IEEE},

address = {Torino, Italy},

abstract = {The implementation of DNA alignment tools for Bioinformatics lead to face different problems that dip into performances. A single alignment takes an amount of time that is not predictable and there are different factors that can affect performances, for instance the length of sequences can determine the computational grain of the task and mismatches or insertion/deletion (indels) increase time needed to complete an alignment. Moreover, an alignment is a strong memory-bound problem because of the irregular memory access patterns and limitations in memory-bandwidth. Over the years, many alignment tools were implemented. A concrete example is Bowtie2, one of the fastest (concurrent, Pthread-based) and state of the art not GPU-based alignment tool. Bowtie2 exploits concurrency by instantiating a pool of threads, which have access to a global input dataset, share the reference genome and have access to different objects for collecting alignment results. In this paper a modified implementation of Bowtie2 is presented, in which the concurrency structure has been changed. The proposed implementation exploits the task-farm skeleton pattern implemented as a Master-Worker. The Master-Worker pattern permits to delegate only to the Master thread dataset reading and to make private to each Worker data structures that are shared in the original version. Only the reference genome is left shared. As a further optimisation, the Master and each Worker were pinned on cores and the reference genome was allocated interleaved among memory nodes. The proposed implementation is able to gain up to 10 speedup points over the original implementation.},

note = {(Best paper award)},

keywords = {fastflow, paraphrase},

pubstate = {published},

tppubtype = {inproceedings}

}

Alessandro Secco, Irfan Uddin, Guilherme Peretti Pezzi, Massimo Torquati

Message passing on InfiniBand RDMA for parallel run-time supports Proceedings Article

In: Aldinucci, Marco, D'Agostino, Daniele, Kilpatrick, Peter (Ed.): Proc. of Intl. Euromicro PDP 2014: Parallel Distributed and network-based Processing, IEEE, Torino, Italy, 2014.

Abstract | Links | BibTeX | Tags: fastflow, impact, paraphrase

Maurizio Drocco, Marco Aldinucci, Massimo Torquati

A Dynamic Memory Allocator for heterogeneous platforms Proceedings Article

In: Advanced Computer Architecture and Compilation for High-Performance and Embedded Systems (ACACES) – Poster Abstracts, HiPEAC, Fiuggi, Italy, 2014.

Abstract | Links | BibTeX | Tags: fastflow, HPC

Claudia Misale, Giulio Ferrero, Massimo Torquati, Marco Aldinucci

Sequence alignment tools: one parallel pattern to rule them all? Journal Article

In: BioMed Research International, 2014.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow, paraphrase, repara

2013

Maurizio Drocco

Parallel stochastic simulators in systems biology: the evolution of the species Masters Thesis

Computer Science Department, University of Torino, Italy, 2013.

Abstract | Links | BibTeX | Tags: fastflow

Marco Aldinucci, Fabio Tordini, Maurizio Drocco, Massimo Torquati, Mario Coppo

Parallel stochastic simulators in system biology: the evolution of the species Proceedings Article

In: Proc. of 21st Euromicro Intl. Conference on Parallel Distributed and network-based Processing (PDP), IEEE, Belfast, Northern Ireland, U.K., 2013.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow

Claudia Misale, Marco Aldinucci, Massimo Torquati

Memory affinity in multi-threading: the Bowtie2 case study Proceedings Article

In: Advanced Computer Architecture and Compilation for High-Performance and Embedded Systems (ACACES) – Poster Abstracts, HiPEAC, Fiuggi, Italy, 2013, ISBN: 9789038221908.

Abstract | Links | BibTeX | Tags: fastflow

@inproceedings{ff:acaces:13,

title = {Memory affinity in multi-threading: the Bowtie2 case study},

author = {Claudia Misale and Marco Aldinucci and Massimo Torquati},

url = {https://iris.unito.it/retrieve/handle/2318/143005/23874/2013_ACACES_ex-abstract.pdf},

isbn = {9789038221908},

year  = {2013},

date = {2013-01-01},

booktitle = {Advanced Computer Architecture and Compilation for High-Performance and Embedded Systems (ACACES) – Poster Abstracts},

publisher = {HiPEAC},

address = {Fiuggi, Italy},

abstract = {The diffusion of the Next Generation Sequencing (NGS) has increased the amount of data obtainable by genomic experiments. From a DNA sample a NGS run is able to produce millions of short sequences (called reads), which should be mapped into a reference genome. In this paper, we analyse the performance of Bowtie2, a fast and popular DNA mapping tool. Bowtie2 exhibits a multithreading implementation on top of pthreads, spin-locks and SSE2 SIMD extension. From a parallel computing viewpoint, it is a paradigmatic example of a software requiring to address three fundamental problems in shared-memory programming for cache-coherent multi-core platforms: synchronisation efficiency at very fine grain (due to short reads), load-balancing (due to long reads), and efficient usage of memory subsystem (due to SSE2 memory pressure). We compare the original implementation against an alternative implementation on top of the FastFlow pattern-based programming framework. The proposed design exploits the high-level farm pattern of FastFlow, which is implemented on top of nonblocking multi-threading and lock-less (CAS-free) queues, and provides the programmer with high-level mechanism to tune task scheduling to achieve both load-balancing and memory affinity. The proposed design, despite the high-level design, is always faster and more scalable with respect to the original one. The design of both original and alternative version will be presented along with their experimental evaluation on real-world data sets.},

keywords = {fastflow},

pubstate = {published},

tppubtype = {inproceedings}

}

Marco Aldinucci, Sonia Campa, Peter Kilpatrick, Massimo Torquati

Structured Data Access Annotations for Massively Parallel Computations Proceedings Article

In: Euro-Par 2012 Workshops, Proc. of the ParaPhrase Workshop on Parallel Processing, pp. 381–390, Springer, 2013.

Abstract | Links | BibTeX | Tags: fastflow, paraphrase

Marco Aldinucci, Sonia Campa, Marco Danelutto, Peter Kilpatrick, Massimo Torquati

Targeting Distributed Systems in FastFlow Proceedings Article

In: Euro-Par 2012 Workshops, Proc. of the CoreGrid Workshop on Grids, Clouds and P2P Computing, pp. 47–56, Springer, 2013.

Abstract | Links | BibTeX | Tags: fastflow, paraphrase

Marco Aldinucci, Sonia Campa, Fabio Tordini, Massimo Torquati, Peter Kilpatrick

An abstract annotation model for skeletons Book Section

In: Beckert, Bernhard, Damiani, Ferruccio, Boer, Frank S., Bonsangue, Marcello M. (Ed.): Formal Methods for Components and Objects: Intl. Symposium, FMCO 2011, Torino, Italy, October 3-5, 2011, Revised Invited Lectures, vol. 7542, pp. 257–276, Springer, 2013, ISBN: 978-3-642-35886-9.

Abstract | Links | BibTeX | Tags: fastflow, paraphrase

2012

Marco Aldinucci, Concetto Spampinato, Maurizio Drocco, Massimo Torquati, Simone Palazzo

A Parallel Edge Preserving Algorithm for Salt and Pepper Image Denoising Proceedings Article

In: Djemal, K., Deriche, M., Puech, W., Ucan, Osman N. (Ed.): Proc. of 2nd Intl. Conference on Image Processing Theory Tools and Applications (IPTA), pp. 97–102, IEEE, Istanbul, Turkey, 2012, ISBN: 978-1-4673-2582-0.

Abstract | Links | BibTeX | Tags: fastflow, impact

Marco Aldinucci, Marco Danelutto, Peter Kilpatrick, Massimiliano Meneghin, Massimo Torquati

An Efficient Unbounded Lock-Free Queue for Multi-core Systems Proceedings Article

In: Proc. of 18th Intl. Euro-Par 2012 Parallel Processing, pp. 662–673, Springer, Rhodes Island, Greece, 2012.

Abstract | Links | BibTeX | Tags: fastflow, paraphrase

Marco Aldinucci, Marco Danelutto, Peter Kilpatrick, Massimo Torquati

Targeting heterogeneous architectures via macro data flow Journal Article

In: Parallel Processing Letters, vol. 22, no. 2, 2012, ISSN: 0129-6264.

Abstract | Links | BibTeX | Tags: fastflow, paraphrase

Marco Aldinucci, Marco Danelutto, Massimo Torquati

FastFlow tutorial Technical Report

Università di Pisa, Dipartimento di Informatica, Italy no. TR-12-04, 2012.

Links | BibTeX | Tags: fastflow

Marco Aldinucci, Marco Danelutto, Lorenzo Anardu, Massimo Torquati, Peter Kilpatrick

Parallel patterns + Macro Data Flow for multi-core programming Proceedings Article

In: Proc. of Intl. Euromicro PDP 2012: Parallel Distributed and network-based Processing, pp. 27–36, IEEE, Garching, Germany, 2012.

Abstract | Links | BibTeX | Tags: fastflow

Marco Aldinucci, Mario Coppo, Ferruccio Damiani, Maurizio Drocco, Eva Sciacca, Salvatore Spinella, Massimo Torquati, Angelo Troina

On Parallelizing On-Line Statistics for Stochastic Biological Simulations Proceedings Article

In: Alexander, Michael, D'Ambra, Pasqua, Belloum, Adam, Bosilca, George, Cannataro, Mario, Danelutto, Marco, Martino, Beniamino Di, Gerndt, Michael, Jeannot, Emmanuel, Namyst, Raymond, Roman, Jean, Scott, Stephen L., Träff, Jesper Larsson, Vallée, Geoffroy, Weidendorfer, Josef (Ed.): Proc. of Euro-Par Workshops: 2nd Workshop on High Performance Bioinformatics and Biomedicine (HiBB), pp. 3–12, Springer, Bordeaux, France, 2012.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow

Fabio Tordini, Marco Aldinucci, Massimo Torquati

High-level lock-less programming for multicore Proceedings Article

In: Advanced Computer Architecture and Compilation for High-Performance and Embedded Systems (ACACES) – Poster Abstracts, HiPEAC, Fiuggi, Italy, 2012, ISBN: 9789038219875.

Abstract | Links | BibTeX | Tags: fastflow

2011

Marco Aldinucci, Marco Danelutto, Peter Kilpatrick, Massimiliano Meneghin, Massimo Torquati

Accelerating code on multi-cores with FastFlow Proceedings Article

In: Jeannot, E., Namyst, R., Roman, J. (Ed.): Proc. of 17th Intl. Euro-Par 2011 Parallel Processing, pp. 170–181, Springer, Bordeaux, France, 2011.

Abstract | Links | BibTeX | Tags: fastflow

Marco Aldinucci, Maurizio Drocco, Daniela Giordano, Concetto Spampinato, Massimo Torquati

A Parallel Edge Preserving Algorithm for Salt and Pepper Image Denoising Technical Report

Università degli Studi di Torino, Dip. di Informatica, Italy no. 138/2011, 2011.

Links | BibTeX | Tags: fastflow

Marco Aldinucci, Salvatore Ruggieri, Massimo Torquati

Porting Decision Tree Building and Pruning Algorithms to Multicore using FastFlow Technical Report

Università di Pisa, Dipartimento di Informatica, Italy no. TR-11-06, 2011.

Links | BibTeX | Tags: fastflow

Marco Aldinucci, Mario Coppo, Ferruccio Damiani, Maurizio Drocco, Massimo Torquati, Angelo Troina

On Designing Multicore-Aware Simulators for Biological Systems Proceedings Article

In: Cotronis, Yiannis, Danelutto, Marco, Papadopoulos, George Angelos (Ed.): Proc. of 19th Euromicro Intl. Conference on Parallel Distributed and network-based Processing (PDP), pp. 318–325, IEEE, Ayia Napa, Cyprus, 2011.

Abstract | Links | BibTeX | Tags: fastflow

2010

Marco Aldinucci, Mario Coppo, Ferruccio Damiani, Maurizio Drocco, Massimo Torquati, Angelo Troina

On Designing Multicore-Aware Simulators for Biological Systems Technical Report

Università degli Studi di Torino, Dipartimento di Informatica, Italy no. 131/2010, 2010.

BibTeX | Tags: fastflow

Marco Aldinucci, Salvatore Ruggieri, Massimo Torquati

Porting Decision Tree Algorithms to Multicore using FastFlow Proceedings Article

In: Balcázar, José L., Bonchi, Francesco, Gionis, Aristides, Sebag, Michèle (Ed.): Proc. of European Conference in Machine Learning and Knowledge Discovery in Databases (ECML PKDD), pp. 7–23, Springer, Barcelona, Spain, 2010.

Abstract | Links | BibTeX | Tags: fastflow

Marco Aldinucci, Andrea Bracciali, Pietro Liò, Anil Sorathiya, Massimo Torquati

StochKit-FF: Efficient Systems Biology on Multicore Architectures Technical Report

Università di Pisa, Dipartimento di Informatica, Italy no. TR-10-12, 2010.

Abstract | Links | BibTeX | Tags: fastflow

Marco Aldinucci, Andrea Bracciali, Pietro Liò

Formal Synthetic Immunology Journal Article

In: ERCIM News, vol. 82, pp. 40–41, 2010, ISSN: 0926-4981.

Abstract | Links | BibTeX | Tags: bioinformatics, fastflow

Marco Aldinucci, Salvatore Ruggieri, Massimo Torquati

Porting Decision Tree Algorithms to Multicore using FastFlow Technical Report

Università di Pisa, Dipartimento di Informatica, Italy no. TR-10-11, 2010.

Abstract | Links | BibTeX | Tags: fastflow

Marco Aldinucci, Massimiliano Meneghin, Massimo Torquati

Efficient Smith-Waterman on multi-core with FastFlow Proceedings Article

In: Danelutto, Marco, Gross, Tom, Bourgeois, Julien (Ed.): Proc. of Intl. Euromicro PDP 2010: Parallel Distributed and network-based Processing, pp. 195–199, IEEE, Pisa, Italy, 2010.

Abstract | Links | BibTeX | Tags: fastflow