Reproducible-Research.de

Concepts and tools for the responsible scientist

concepts:dataset:index

Differences

This shows you the differences between two versions of the page.

--- concepts:dataset:index [2019/01/28 21:46] till
+++ concepts:dataset:index [2020/09/27 12:18] (current) – [Dataset] till
Line 1:
-====== Dataset ======
+{{fa>cogs?48&align=right}}
+====== Dataset ======
  
 //Unit of (numeric) data and accompanying [[..:metadata:|metadata]]//
  
-Every measurement (or calculation) produces (raw) data that are useless without additional information, such as experimental parameters. This additional information is termed [[..:metadata:|metadata]]. A dataset is the unit of (numerical) data and metadata. Another integral aspect is the history containing all relevant information regarding each single processing step performed on the data of the dataset.
+Every measurement (or calculation) produces (raw) data that are useless without additional information, such as experimental parameters. This additional information is termed [[..:metadata:|metadata]]. A dataset is the unit of (numerical) data and metadata. Another integral aspect is the history containing all relevant information regarding each single processing step performed on the data of the dataset. This is the idea behind [[..:selfdocumenting:|self-documenting]] routines.
  
  
-===== History =====
-
-Reproducibility is an essential aspect of good scientific practice. In the context of data processing and analysis, this means that each processing step performed on data (of a dataset) should be stored in a reproducible way and preferably in a consistent format.
-
-To be of actual use, an entry of the history needs to contain all information necessary to reproduce the processing step in its original form. This includes as a minimum the name of the processing routine used, the complete list of necessary parameters for that routine, and a unique version information of the routine. Additional useful aspects contain information about the operating system used, the name of the operator, and the date the processing step has been performed.
+\\
+<WRAP half column leftalign><WRAP button>[[..:metadata:|← Metadata]]</WRAP></WRAP>
+<WRAP half column rightalign><WRAP button>[[..:openformats:|Open formats →]]</WRAP></WRAP>
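
The dataset concept described in this comparison (data, metadata, and a history of processing steps) could be sketched in Python roughly as follows. This is only an illustration under stated assumptions: the names `HistoryRecord`, `Dataset`, `process`, and `scale` are hypothetical and are not taken from this page or from any particular library.

```python
import platform
from dataclasses import dataclass, field
from datetime import datetime, timezone


@dataclass
class HistoryRecord:
    """One processing step with the details needed to repeat it (hypothetical)."""
    routine: str      # name of the processing routine used
    parameters: dict  # complete list of parameters passed to the routine
    version: str      # version information of the routine
    operator: str     # name of the person who ran the step
    # Operating system and date are recorded automatically.
    system: str = field(default_factory=platform.platform)
    date: datetime = field(default_factory=lambda: datetime.now(timezone.utc))


@dataclass
class Dataset:
    """Unit of (numeric) data, metadata, and processing history (hypothetical)."""
    data: list
    metadata: dict = field(default_factory=dict)
    history: list = field(default_factory=list)

    def process(self, routine, version, operator, **parameters):
        # Apply the routine, then record how it was applied.
        self.data = routine(self.data, **parameters)
        self.history.append(
            HistoryRecord(routine.__name__, parameters, version, operator)
        )


def scale(values, factor=1.0):
    """Example processing routine: multiply every value by a factor."""
    return [value * factor for value in values]


dataset = Dataset(data=[1.0, 2.0], metadata={"temperature / K": 293})
dataset.process(scale, version="0.1", operator="till", factor=2.0)
```

Routing every processing step through a single method means the history entry is written by the same call that changes the data, so a step cannot be applied without being recorded.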
  
concepts/dataset/index.1548708388.txt.gz · Last modified: 2019/01/28 21:46 by till