Persistent Identifier [PID] was bringt uns die eindeutige Benennung von Daten? Chris Schubert, Head of CCCA Data Centre GEO Coordinator for Austria Member of EuroGEOSS Coordination Group Control Body of INSPIRE Vocabulary & Registers ZAMG, Hohe Warte 38, Wien
Connect Persistent Online-Resourcen sind in der Forschungs- & Wissenslandschaft stark dezentralisiert [Datenrepositorien, Registries, Datenbanken, etc.] Identifikatoren sind Kernkomponenten für die Integration von Infrastrukturen, re-usability + reproducability & … PURL
proper attribution and credit reuse of data reproducibility proper attribution and credit hdl.handle.net/20.500.11756/9df12611 use PID persistent identifier, adequate for doi data citation cite your data
(research) data is dynamic Korrektur hdl.20.500. Version 2 hdl.20.500. Version 1 data citation cite your data
Cite your Data SUBSETTING + dynamic data citation (research) data is dynamic keep all relations between updates, original sources & subsets hdl.20.600. Version 1 hdl.20.500. Version 1 Korrektur SUBSETTING + dynamic data citation hdl.20.500. Version 2 Cite your Data
Cite your Data … nichts ist so ärgerlich wie broken links zu wichtigen Dingen. extrapolated fraction of immune, healthy, and infected articles One in Five Articles Suffers from Reference Rot. M. Klein et al. PLOS ONE 2014. The digital entropy of death: link rot
Ich brauche mehr Informationen PID Provenance Metadaten Ich brauche mehr Informationen Wer / Was ist das? Ein persistenter Identifier ist eine langfristige Referenz auf eine digitale Ressource. [1] stehen bleiben [2] bei etwas verharren Wie ist es garantiert ? Was heißt langfristig ? Wo kann ich es finden? Wie komme ich dorthin? ©J.Clark International DOI Foundation Policies & Guarantees Machine-readability
MEtadaten Policies & Guarantees Machine-readability Content Beschreibung … Policies & Guarantees … Direktiven, z.B. aus der Forschungslandschaft, Fördergeber, etc. … kein technisches Problem, sondern eher ein “social contract” … Kriterien für vertrauensvollen, persistenten Repositorien, und Verpflichtungen der Datenanbieter, z.B. Zugang, Übertragungsfehler, -schwierigkeiten, oder andere technische Probleme. Machine-readability … Data in a data format that can be automatically read and processed by a computer
Provenance The term “data provenance” refers to a record trail that accounts for the origin of a piece of data (in a database, document or repository) together with an explanation of how and why it got to the present place. DOI: https://doi.org/10.1007/978-0-387-39940-9_1305 … beschreibt die “Datenabstammung” und Historie Gil, Y., et al. (2016), Toward the Geoscience Paper of the Future: Best practices for documenting and sharing research from data to software to provenance, Earth and Space Science, 3, 388–415, doi:10.1002/2015EA000136.
Create SUBSETTING + dynamic data citation (research) data is dynamic identify precisely the data at a specific point in time identify precisely the subset of (dynamic) in a process Parameter Area of interest Time range @keep versioning @keep timestamps @keep & adapt Metadata SUBSETTING + dynamic data citation Choose a: Create
Create SUBSETTING + dynamic data citation (research) data is dynamic Re-published avoid redundant storage consumption keep all relations between updates, original sources & subsets SUBSETTING + dynamic data citation Create
Create SUBSETTING + dynamic data citation (research) data is dynamic © 2019 Service Oriented Mapping Changing Paradigm in Map Production and Geoinformation Management (research) data is dynamic identify precisely the data at a specific point in time identify precisely the subsetof (dynamic) in a process Handling Continuous Streams for Meteorological Mapping Chris Schubert1, Harald Bamberger2 1 CCCA Data Centre, Vienna, Austria, hosted by ZAMG, 2 ZAMG, Dep. Software Application development and Data Management SUBSETTING + dynamic data citation ISO 690:2010 Information and documentation Guidelines for bibliographic references and citations to information resources CCCA Dynamic Citation Tool as reference implementation Create
Was bringt uns das ? … Blick auf Methoden, Kategorien und Praktiken unterschiedlicher Wissenschaftskulturen … „Denkkollektiv“, „Denkstil“ Denkstile und der wissenschaftlichen Gestaltwahrnehmung aus der wissenschaftlichen Praxis active publication ethics cases at PLOS ONE https://www.nature.com/news/1.19970
Was bringt uns das ? Fünf “eigennützige” Gründe für reproduzierbare Daten: No1: reproducibility helps to avoid disaster No 2: reproducibility makes it easier to write Papers No 3: reproducibility helps reviewers see it your way No 4: reproducibility enables continuity of your work No 5: reproducibility helps to build your reputation https://doi.org/10.1186/s13059-015-0850-7 Markowetz, Genome Biology, (2015) 16:274
INSPIRE register & registry Was bringt uns das ? INSPIRE register & registry CCCA /ZAMG is hosting the Austrian INSPIRE Registry registry.inspire.gv.at A vocabulary server for long term and persistent maintenance of natural language concepts (terms), their definitions, status in time and process (version, validation) a controlled vocabulary
Thank you for attention ! Chris Schubert Head of CCCA – Data Centre GEO Coordinator for Austria data.ccca.ac.at 1190 Wien, Hohe Warte 38 Tel: +43136026 2519 chris.schubert[at]ccca.ac.at ©J.Clark International DOI Foundation (2016)