Skip to main content
SLU publication database (SLUpub)

Review article2016Peer reviewedOpen access

Recommendations on e-Infrastructures for next-generation sequencing

Spjuth, Ola; Bongcam-Rudloff, Erik; Dahlberg, Johan; Dahlö, Martin; Kallio, Aleksi; Pireddu, Luca; Vezzi, Francesco; Korpelainen, Eija

Abstract

With ever-increasing amounts of data being produced by next-generation sequencing (NGS) experiments, the requirements placed on supporting e-infrastructures have grown. In this work, we provide recommendations based on the collective experiences from participants in the EU COST Action SeqAhead for the tasks of data preprocessing, upstream processing, data delivery, and downstream analysis, as well as long-term storage and archiving. We cover demands on computational and storage resources, networks, software stacks, automation of analysis, education, and also discuss emerging trends in the field. E-infrastructures for NGS require substantial effort to set up and maintain over time, and with sequencing technologies and best practices for data analysis evolving rapidly it is important to prioritize both processing capacity and e-infrastructure flexibility when making strategic decisions to support the data analysis demands of tomorrow. Due to increasingly demanding technical requirements we recommend that e-infrastructure development and maintenance be handled by a professional service unit, be it internal or external to the organization, and emphasis should be placed on collaboration between researchers and IT professionals.

Keywords

E-infrastructure; Next-generation sequencing; High-performance computing; Cloud computing

Published in

GigaScience
2016, Volume: 5, article number: 26

    UKÄ Subject classification

    Bioinformatics (Computational Biology)
    Bioinformatics and Systems Biology

    Publication identifier

    DOI: https://doi.org/10.1186/s13742-016-0132-7

    Permanent link to this page (URI)

    https://res.slu.se/id/publ/76617