D1.3 Data Management Plan

Summary
The Data Management Plan (DMP) will be in line with the template in the Model Grant Agreement and the EC guidelines. The main sections will be as follows:
• Policy information: the project will identify the basic information to be collected during the project lifetime, i.e. policy information including KPIs and costs.
• Information classification and description: a classification into main categories will be provided, based on the work of the e-IRG Working Group on the evaluation of e-Infrastructures and the development of related KPIs, along with a description of the information to be generated or collected.
• Standards and metadata: the data model for collecting and storing information will be described along with related standard(s), if available. Metadata will accompany the primary policy data to help third-party users understand and reuse it. This will include fields that help people find the data, such as the owner or contributors of the data, title, date of creation, etc.
• Data sharing: the sharing policy will be specified. The project will specify the data repository for sharing the information generated, collected or aggregated, and will use open formats to enable third parties to access, reuse and disseminate the information. Where information is sensitive (such as costs), the sharing and reuse policy will depend on the license or rights of the owner. Where information cannot be shared, the reasons will be explained (personal information, intellectual property protection, etc.) and aggregation or anonymisation will be attempted in agreement with the owner.
• Data curation and preservation: the DMP will also specify how the information will be curated and preserved in the future.

Some first directions on the DMP are given here:
• The data that e-IRGSP5 will collect is mainly produced by others, such as e-Infrastructures presenting their KPIs.
• The e-IRG Knowledge Base also contains a lot of data, again mainly produced by others as Open Data.
• e-IRGSP5 will combine all this information and present it in an easily accessible way.
• The data format used in the Knowledge Base to store and combine information is based on open standards. At the core, everything is stored and described in XML (a W3C standard). At a higher level, the information is stored in a graph database that uses Topic Maps (an ISO standard). For combining and extracting information, standard transformation and query languages, including XSLT and XQuery (both W3C standards), are used.

During the course of the project the data management plan will be monitored regularly and updated if necessary.
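
As an illustration of the discovery metadata (owner, contributors, title, date of creation) and the XML storage mentioned above, the following is a minimal sketch in Python. The element names (catalogue, record, title, owner, contributor, created), the helper functions and all sample values are hypothetical and chosen only for this example; the actual Knowledge Base schema and its Topic Maps layer are not specified in this summary.

```python
import xml.etree.ElementTree as ET


def build_record(title, owner, contributors, created):
    """Build one metadata record as an XML element (hypothetical schema)."""
    record = ET.Element("record")
    ET.SubElement(record, "title").text = title
    ET.SubElement(record, "owner").text = owner
    for name in contributors:
        ET.SubElement(record, "contributor").text = name
    ET.SubElement(record, "created").text = created
    return record


def find_by_owner(catalogue, owner):
    """Return the titles of all records whose owner matches exactly."""
    return [
        rec.findtext("title")
        for rec in catalogue.findall("record")
        if rec.findtext("owner") == owner
    ]


# Placeholder entries, invented purely for illustration.
catalogue = ET.Element("catalogue")
catalogue.append(
    build_record("KPI report", "Example e-Infrastructure", ["A. Author"], "2017-03-01")
)
catalogue.append(build_record("Cost overview", "Another Provider", [], "2017-05-15"))

# Serialise to an open, text-based format and read it back before querying.
xml_text = ET.tostring(catalogue, encoding="unicode")
titles = find_by_owner(ET.fromstring(xml_text), "Example e-Infrastructure")
print(titles)
```

Keeping the metadata in the same XML document as the record it describes means that a third party can locate a dataset by owner, title or date using standard tooling alone, without access to the system that produced it.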