Maneage: Managing data lineage for long-term and archivable reproducibility (Invited talk at ESO'...
The increasing volume, diversity, and role of data in modern research have been very fruitful. However, these same factors have also made it harder to describe (in sufficient detail) the processing behind a scientific result within the confines of a traditional paper. It is thus becoming harder and harder to reproduce (i.e., to have coauthors, referees or the larger community critically review) the results that define scientific progress. In this talk, Maneage (MANaging data linEAGE) is introduced as a working solution to this problem. Maneage is a template that should be customized for every project. It enables exact reproduction of a scientific analysis (from the input data and software, to the processing and creation of the final report, paper or dataset). The necessary software is built (from the low-level C compiler and shell, to the higher-level science programs and all their dependencies) with a predefined configuration. The data are imported from pre-determined URLs (and validated with recorded
https://peertube.stream/w/rQqt2RpyYPmGzAD4uBbxQz