Full Program »
A NoSQL Solution For Bioinformatics Data Provenance Storage
The provenance data can support experiments reproducibility providing the history of the data in a scientific workflow. Bioinformatics generates an increasing amount of data, which are often analyzed employing workflows. This paper proposes a way to manage automatic executions of Bioinformatics workflows, storing its provenance and raw data in the MongoDB NoSQL database system. It uses a program that manages three different data models, a referenced, an embedded, and hybrid data model for purposes of comparison. The results showed general advantages and disadvantages for each data model and some particularities of Bioinformatics.