Talk:WS1UC2
As part of the preparation tasks, each project partner selected and characterized two databases thematically related to power transmission and distribution networks. As a result, eight databases were selected during the preliminary work and analyzed further during the workshop (databases verified: OPSD, EIA, OpenEI, PSE, NREL, ENTSO-E, SMARD, The World Bank data). During the workshop the databases were analyzed using the methodology published by the Swiss National Science Foundation (Explanation of the FAIR data principles, based on Wilkinson et al. (2016), The FAIR Guiding Principles for scientific data management and stewardship, Scientific Data 3, doi:10.1038/sdata.2016.18).
http://www.snf.ch/SiteCollectionDocuments/FAIR_principles_translation_SNSF_logo.pdf
As part of the assessment, each database was characterized, its percentage of compliance with the FAIR/O criteria was indicated, and the elements requiring improvement were identified.
A summary of the analysis performed for each database is provided below.
- database 1: ENTSO-E
- Short description: Central collection and publication of electricity generation, transportation and consumption data and information for the pan-European market. The main data categories are load, generation, transmission, balancing, outages, congestion management and system operations. Data are available from 2014 onward in hourly resolution.
- Current State of FAIR/O principles: F1 50% - F2 50% - F3 20% - F4 100%; A1.1 100% - A1.2 100% - A2 50%; I1 100% - I2 80% - I3 100%; R1.1 50% - R1.2 50% - R1.3 90%.
- Indication of the elements requiring improvement:
- Findable: F1: assign the data set a globally unique and persistent identifier (e.g. DOI). F2: the data should be better described with rich metadata. F3: downloaded data should explicitly include the identifier of the data (e.g. source).
- Accessible: A2: unclear; there is no information on whether the metadata remain accessible when the data are no longer available.
- Interoperable: I3: the data should include qualified references to other data (which TSO or national authority is the main reference?). I2: there is a long list of databases under the sitemap; these could also be linked to explanations.
- Reusable: R1.1 and R1.2: the data usage license and the associated provenance should be clearer and more accessible. License information is available only in a PDF; under “terms and conditions” it states that the license is not sub-licensable or transferable. The licensing of data from different origins needs to be clarified.
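The per-criterion percentages listed for ENTSO-E can be condensed into one figure per FAIR principle. The following is only a minimal sketch: the source does not state how (or whether) the workshop aggregated the scores, so the unweighted mean used here, the function names `summarize` and `weakest`, and the 50% threshold are all assumptions made for illustration.

```python
# Per-criterion compliance (percent) as listed for ENTSO-E in this summary.
entsoe_scores = {
    "F1": 50, "F2": 50, "F3": 20, "F4": 100,
    "A1.1": 100, "A1.2": 100, "A2": 50,
    "I1": 100, "I2": 80, "I3": 100,
    "R1.1": 50, "R1.2": 50, "R1.3": 90,
}


def summarize(scores):
    """Average the criteria per principle (F, A, I, R) and overall.

    Assumption: every criterion carries the same weight.
    """
    groups = {}
    for criterion, value in scores.items():
        # The first letter of the criterion name identifies the principle.
        groups.setdefault(criterion[0], []).append(value)
    summary = {
        letter: round(sum(values) / len(values), 1)
        for letter, values in groups.items()
    }
    summary["overall"] = round(sum(scores.values()) / len(scores), 1)
    return summary


def weakest(scores, threshold=50):
    """Return the criteria scoring at or below the (assumed) threshold."""
    return sorted(c for c, v in scores.items() if v <= threshold)


if __name__ == "__main__":
    print(summarize(entsoe_scores))
    print(weakest(entsoe_scores))
```

With equal weights, the Findable and Reusable groups come out lowest, which matches the improvement list above. A different weighting would change the group averages but not the list of individual criteria at or below the threshold.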
database 2: SMARD (Strommarktdaten)
short description: Electricity market information platform of the German Federal Network Agency (Bundesnetzagentur, BNetzA). It presents the most important electricity market data for Germany, such as electricity generation, consumption, import and export, market balancing and power plants, over different periods and in different resolutions (e.g. power plant data for 2015-2025 in hourly resolution, generation in 15-minute resolution).
Current State of FAIR/O principles: F1 50% - F2 50% - F3 20% - F4 100%; A1.1 100% - A1.2 100% - A2 50%; I1 100% - I2 80% - I3 100%; R1.1 50% - R1.2 50% - R1.3 90%
Indication of the elements requiring improvement:.................
database 3: ....
short description: .......
Current State of FAIR/O principles:........
Indication of the elements requiring improvement:.................
database 4: ....
short description: .......
Current State of FAIR/O principles:........
Indication of the elements requiring improvement:.................
database 5: ....
short description: .......
Current State of FAIR/O principles:........
Indication of the elements requiring improvement:.................
database 6: ....
short description: .......
Current State of FAIR/O principles:........
Indication of the elements requiring improvement:.................
database 7: ....
short description: .......
Current State of FAIR/O principles:........
Indication of the elements requiring improvement:.................
database 8: ....
short description: .......
Current State of FAIR/O principles:........
Indication of the elements requiring improvement:.................
2nd Table
PROBLEMATIC ISSUES: During the analysis, problems related to incorrect interpretation of the FAIR/O assessment methodology were identified. Issues that need clarification:
- deeper analysis is required for criterion R1.1 "(Meta)data are released with a clear and accessible data usage license": should open access to the data without the possibility of logging in be understood as an advantage or a disadvantage in terms of this criterion? This issue is correlated with the legal aspects of using data. Comment: for example, the lack of a license creates a problem and often prevents the use of data (https://open-power-system-data.org/legal#Licenses_for_open_data).
- deeper analysis is required for criterion A1.2 "The protocol allows for an authentication and authorisation where necessary": how should the possibility or the inability of the database user to create an account be interpreted?
- (meta)data frameworks should be described and clarified
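One way to make the discussion of (meta)data frameworks more concrete is to check a metadata record for the fields that the criteria discussed above would require. The sketch below is illustrative only. The field names (`identifier`, `description`, `license`, `provenance`), the mapping to criteria and the example record are hypothetical and are not taken from any of the assessed databases or from a specific metadata standard.

```python
# Hypothetical mapping from a FAIR criterion to the metadata field that
# would be expected to carry the corresponding information.
REQUIRED_FIELDS = {
    "F1": "identifier",    # globally unique, persistent identifier (e.g. DOI)
    "F2": "description",   # rich descriptive metadata
    "R1.1": "license",     # clear and accessible data usage license
    "R1.2": "provenance",  # origin of the data
}


def missing_criteria(record):
    """List the criteria whose field is absent or empty in the record."""
    missing = []
    for criterion, field in REQUIRED_FIELDS.items():
        value = record.get(field)
        if value is None or not str(value).strip():
            missing.append(criterion)
    return missing


# Invented example record: no identifier and no license are given.
example_record = {
    "identifier": "",
    "description": "Hourly load data",
    "license": None,
    "provenance": "Published by a transmission system operator",
}

if __name__ == "__main__":
    print(missing_criteria(example_record))
```

A check of this kind only establishes that a field is present. Whether a license text is actually clear, or whether an account requirement helps or hinders access, still has to be judged by the assessor.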