Wrangling Galaxy's reference data

Daniel Blankenberg, James E. Johnson, James Taylor, Anton Nekrutenko

Research output: Contribution to journalArticlepeer-review

25 Scopus citations


Summary: The Galaxy platform has developed into a fully featured collaborative workbench, with goals of inherently capturing provenance to enable reproducible data analysis, and of making it straightforward to run one's own server. However, many Galaxy platform tools rely on the presence of reference data, such as alignment indexes, to function efficiently. Until now, the building of this cache of data for Galaxy has been an error-prone manual process lacking reproducibility and provenance. The Galaxy Data Manager framework is an enhancement that changes the management of Galaxy's built-in data cache from a manual procedure to an automated graphical user interface (GUI) driven process, which contains the same openness, reproducibility and provenance that is afforded to Galaxy's analysis tools. Data Manager tools allow the Galaxy administrator to download, create and install additional datasets for any type of reference data in real time.

Original languageEnglish (US)
Pages (from-to)1917-1919
Number of pages3
Issue number13
StatePublished - Jul 1 2014

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Computational Theory and Mathematics
  • Computational Mathematics


Dive into the research topics of 'Wrangling Galaxy's reference data'. Together they form a unique fingerprint.

Cite this