Data Management and Preservation Planning for Big Science

<p>‘Big Science’ - that is, science which involves large collaborations with dedicated facilities, and involving large data volumes and multinational investments – is often seen as different when it comes to data management and preservation planning. Big Science handles its data differently fr...

Full description

Bibliographic Details
Main Authors: Juan Bicarregui, Norman Gray, Rob Henderson, Roger Jones, Simon Lambert, Brian Matthews
Format: Article
Language:English
Published: University of Edinburgh 2013-06-01
Series:International Journal of Digital Curation
Online Access:http://www.ijdc.net/index.php/ijdc/article/view/247
id doaj-738a914f30f04cfdb3f94a7716d719d3
record_format Article
spelling doaj-738a914f30f04cfdb3f94a7716d719d32020-11-24T22:55:06ZengUniversity of EdinburghInternational Journal of Digital Curation1746-82562013-06-0181294110.2218/ijdc.v8i1.247223Data Management and Preservation Planning for Big ScienceJuan BicarreguiNorman GrayRob HendersonRoger JonesSimon LambertBrian Matthews<p>‘Big Science’ - that is, science which involves large collaborations with dedicated facilities, and involving large data volumes and multinational investments – is often seen as different when it comes to data management and preservation planning. Big Science handles its data differently from other disciplines and has data management problems that are qualitatively different from other disciplines. In part, these differences arise from the quantities of data involved, but possibly more importantly from the cultural, organisational and technical distinctiveness of these academic cultures. Consequently, the data management systems are typically and rationally bespoke, but this means that the planning for data management and preservation (DMP) must also be bespoke.</p><p>These differences are such that ‘just read and implement the OAIS specification’ is reasonable Data Management and Preservation (DMP) advice, but this bald prescription can and should be usefully supported by a methodological ‘toolkit’, including overviews, case-studies and costing models to provide guidance on developing best practice in DMP policy and infrastructure for these projects, as well as considering OAIS validation, audit and cost modelling.</p><p>In this paper, we build on previous work with the LIGO collaboration to consider the role of DMP planning within these big science scenarios, and discuss how to apply current best practice. We discuss the result of the MaRDI-Gross project (Managing Research Data Infrastructures – Big Science), which has been developing a toolkit to provide guidelines on the application of best practice in DMP planning within big science projects. This is targeted primarily at projects’ engineering managers, but intending also to help funders collaborate on DMP plans which satisfy the requirements imposed on them.</p>http://www.ijdc.net/index.php/ijdc/article/view/247
collection DOAJ
language English
format Article
sources DOAJ
author Juan Bicarregui
Norman Gray
Rob Henderson
Roger Jones
Simon Lambert
Brian Matthews
spellingShingle Juan Bicarregui
Norman Gray
Rob Henderson
Roger Jones
Simon Lambert
Brian Matthews
Data Management and Preservation Planning for Big Science
International Journal of Digital Curation
author_facet Juan Bicarregui
Norman Gray
Rob Henderson
Roger Jones
Simon Lambert
Brian Matthews
author_sort Juan Bicarregui
title Data Management and Preservation Planning for Big Science
title_short Data Management and Preservation Planning for Big Science
title_full Data Management and Preservation Planning for Big Science
title_fullStr Data Management and Preservation Planning for Big Science
title_full_unstemmed Data Management and Preservation Planning for Big Science
title_sort data management and preservation planning for big science
publisher University of Edinburgh
series International Journal of Digital Curation
issn 1746-8256
publishDate 2013-06-01
description <p>‘Big Science’ - that is, science which involves large collaborations with dedicated facilities, and involving large data volumes and multinational investments – is often seen as different when it comes to data management and preservation planning. Big Science handles its data differently from other disciplines and has data management problems that are qualitatively different from other disciplines. In part, these differences arise from the quantities of data involved, but possibly more importantly from the cultural, organisational and technical distinctiveness of these academic cultures. Consequently, the data management systems are typically and rationally bespoke, but this means that the planning for data management and preservation (DMP) must also be bespoke.</p><p>These differences are such that ‘just read and implement the OAIS specification’ is reasonable Data Management and Preservation (DMP) advice, but this bald prescription can and should be usefully supported by a methodological ‘toolkit’, including overviews, case-studies and costing models to provide guidance on developing best practice in DMP policy and infrastructure for these projects, as well as considering OAIS validation, audit and cost modelling.</p><p>In this paper, we build on previous work with the LIGO collaboration to consider the role of DMP planning within these big science scenarios, and discuss how to apply current best practice. We discuss the result of the MaRDI-Gross project (Managing Research Data Infrastructures – Big Science), which has been developing a toolkit to provide guidelines on the application of best practice in DMP planning within big science projects. This is targeted primarily at projects’ engineering managers, but intending also to help funders collaborate on DMP plans which satisfy the requirements imposed on them.</p>
url http://www.ijdc.net/index.php/ijdc/article/view/247
work_keys_str_mv AT juanbicarregui datamanagementandpreservationplanningforbigscience
AT normangray datamanagementandpreservationplanningforbigscience
AT robhenderson datamanagementandpreservationplanningforbigscience
AT rogerjones datamanagementandpreservationplanningforbigscience
AT simonlambert datamanagementandpreservationplanningforbigscience
AT brianmatthews datamanagementandpreservationplanningforbigscience
_version_ 1725657917758111744