Scaling bioinformatics training: an ELIXIR, GOBLET & Galaxy Training Network collaboration

Bérénice Batut, Björn Grüning, Frederik Coppens, Hans-Rudolf Hotz, Gabriella Rustici, Celia W. G. van Gelder, Dave Clements

Abstract

Scalability is a key challenge in bioinformatics. Data management, analysis pipelines, compute infrastructure, visualisations, and training must all scale to address ever-larger experimental designs, and the increasing prevalence of data-intensive life science research. ELIXIR, GOBLET, and the Galaxy Training Network (GTN) have teamed up to create reusable and community extensible bioinformatics training materials, available online for individual life science researchers and trainers.Thanks to this community led effort, a series of Galaxy-based tutorials were developed to teach numerous data analysis topics, ranging from genomics to proteomics to metabolomics. Training on Galaxy administration and development is also included in this effort. The content is developed in Markdown, stored in GitHub and separated from the presentation on the website, facilitating its development and updating by the community. The technological infrastructure needed to teach is provided with Docker images for each topic and the datasets are stored in Zenodo with assigned DOIs. All materials are annotated by a rich set of metadata and automatically propagated to ELIXIR’s TeSS portal. This approach creates tutorials that are accessible, easy to find and use (FAIR) by individuals and by trainers for workshops.This poster highlights the approach taken, how it addresses the scalability of presenting, creating, and maintaining training materials, as well as the resulting products. We will also report on the the recently held ELIXIR/GOBLET/GTN hackathon for Galaxy training material re-use and the community’s future plans.