terms4FAIRskills describes the competencies, skills and knowledge associated with making and keeping data FAIR.
This terminology applies to a variety of use cases, including: assisting with the creation and assessment of stewardship curricula; facilitating the annotation, discovery and evaluation of FAIR-enabling materials (e.g. training) and resources; enabling the formalisation of job descriptions and CVs with recognised, structured competencies.
It is intended to be of use to trainers who teach FAIR data skills, researchers who wish to identify skill gaps in their teams and managers who need to recruit individuals to relevant roles. terms4FAIRskills (T4FS) terms4FAIRskills by the terms4FAIRskills developers is licensed under CC BY 4.0. You are free to share (copy and redistribute the material in any medium or format) and adapt (remix, transform, and build upon the material) for any purpose, even commercially. The licensor cannot revoke these freedoms as long as you follow the license terms. You must give appropriate credit (by using the original ontology IRI for the whole ontology and original term IRIs for individual terms), provide a link to the license, and indicate if any changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use. 2023-03-02 definition The official definition, explaining the meaning of a class or property. Shall be Aristotelian, formalized and normalized. Can be augmented with colloquial definitions. 2012-04-05: Barry Smith The official OBI definition, explaining the meaning of a class or property: 'Shall be Aristotelian, formalized and normalized. Can be augmented with colloquial definitions' is terrible. Can you fix to something like: A statement of necessary and sufficient conditions explaining the meaning of an expression referring to a class or property. 
Alan Ruttenberg Your proposed definition is a reasonable candidate, except that it is very common that necessary and sufficient conditions are not given. Mostly they are necessary, occasionally they are necessary and sufficient or just sufficient. Often they use terms that are not themselves defined and so they effectively can't be evaluated by those criteria. On the specifics of the proposed definition: We don't have definitions of 'meaning' or 'expression' or 'property'. For 'reference' in the intended sense I think we use the term 'denotation'. For 'expression', I think you mean symbol, or identifier. For 'meaning' it differs for class and property. For class we want documentation that lets the intended reader determine whether an entity is an instance of the class, or not. For property we want documentation that lets the intended reader determine, given a pair of potential relata, whether the assertion that the relation holds is true. The 'intended reader' part suggests that we also specify who, we expect, would be able to understand the definition, and also generalizes over human and computer reader to include textual and logical definition. Personally, I am more comfortable weakening definition to documentation, with instructions as to what is desirable. We also have the outstanding issue of how to aim different definitions to different audiences. A clinical audience reading chebi wants a different sort of definition documentation/definition from a chemistry trained audience, and similarly there is a need for a definition that is adequate for an ontologist to work with. PERSON:Daniel Schober GROUP:OBI:<http://purl.obolibrary.org/obo/obi> https://orcid.org/0000-0002-7702-4495 Allyson Lister 8.10.2019: Although the IAO definition has been included here, we have made a curation decision to allow non-Aristotelian definitions. definition The official definition, explaining the meaning of a class or property. Shall be Aristotelian, formalized and normalized. 
Can be augmented with colloquial definitions. IAO term editor Name of editor entering the term in the file. The term editor is a point of contact for information regarding the term. The term editor may be, but is not always, the author of the definition, which may have been worked upon by several people 20110707, MC: label update to term editor and definition modified accordingly. See https://github.com/information-artifact-ontology/IAO/issues/115. PERSON:Daniel Schober GROUP:OBI:<http://purl.obolibrary.org/obo/obi> Peter McQuilton https://orcid.org/0000-0002-7702-4495 term editor Name of editor entering the term in the file. The term editor is a point of contact for information regarding the term. The term editor may be, but is not always, the author of the definition, which may have been worked upon by several people IAO alternative label A label for a class or property that can be used to refer to the class or property instead of the preferred rdfs:label. Alternative labels should be used to indicate community- or context-specific labels, abbreviations, shorthand forms and the like. An alternative name for a class or property which means the same thing as the preferred name (semantically equivalent) OBO Operations committee PERSON:Daniel Schober GROUP:OBI:<http://purl.obolibrary.org/obo/obi> Consider re-defining to: An alternative name for a class or property which can mean the same thing as the preferred name (semantically equivalent, narrow, broad or related). alternative label alternative term An alternative name for a class or property which means the same thing as the preferred name (semantically equivalent) IAO definition source Formal citation, e.g. identifier in external database to indicate / attribute source(s) for the definition. Free text indicate / attribute source(s) for the definition. 
EXAMPLE: Author Name, URI, MeSH Term C04, PUBMED ID, Wiki uri on 31.01.2007 PERSON:Daniel Schober Discussion on obo-discuss mailing-list, see http://bit.ly/hgm99w GROUP:OBI:<http://purl.obolibrary.org/obo/obi> Peter McQuilton https://orcid.org/0000-0002-7702-4495 definition source Formal citation, e.g. identifier in external database to indicate / attribute source(s) for the definition. Free text indicate / attribute source(s) for the definition. EXAMPLE: Author Name, URI, MeSH Term C04, PUBMED ID, Wiki uri on 31.01.2007 IAO term tracker item the URI for an OBI Terms ticket at sourceforge, such as https://sourceforge.net/p/obi/obi-terms/772/ An IRI or similar locator for a request or discussion of an ontology term. Person: Jie Zheng, Chris Stoeckert, Alan Ruttenberg Person: Jie Zheng, Chris Stoeckert, Alan Ruttenberg The 'tracker item' can associate a tracker with a specific ontology term. term tracker item Peter McQuilton 2019-10-17T13:38:16.742124Z alternative definition Peter McQuilton 2019-10-17T13:39:13.568975Z alternative definition source created by creation date This document is about information artifacts and their representations A (currently) primitive relation that relates an information artifact to an entity. 7/6/2009 Alan Ruttenberg. Following discussion with Jonathan Rees, and introduction of "mentions" relation. Weaken the is_about relationship to be primitive. We will try to build it back up by elaborating the various subproperties that are more precisely defined. Some currently missing phenomena that should be considered "about" are predications - "The only person who knows the answer is sitting beside me" , Allegory, Satire, and other literary forms that can be topical without explicitly mentioning the topic. 
person:Alan Ruttenberg Smith, Ceusters, Ruttenberg, 2000 years of philosophy is about inheres in this fragility is a characteristic of this vase this red color is a characteristic of this apple a relation between a specifically dependent continuant (the characteristic) and any other entity (the bearer), in which the characteristic depends on the bearer for its existence. inheres_in Note that this relation was previously called "inheres in", but was changed to be called "characteristic of" because BFO2 uses "inheres in" in a more restricted fashion. This relation differs from BFO2:inheres_in in two respects: (1) it does not impose a range constraint, and thus it allows qualities of processes, as well as of information entities, whereas BFO2 restricts inheres_in to only apply to independent continuants (2) it is declared functional, i.e. something can only be a characteristic of one thing. characteristic of bearer of this apple is bearer of this red color this vase is bearer of this fragility Inverse of characteristic_of A bearer can have many dependents, and its dependents can exist for different periods of time, but none of its dependents can exist when the bearer does not exist. bearer_of is bearer of has characteristic this catalysis function is a function of this enzyme a relation between a function and an independent continuant (the bearer), in which the function specifically depends on the bearer for its existence A function inheres in its bearer at all times for which the function exists, however the function need not be realized at all the times that the function exists. function_of is function of This relation is modeled after the BFO relation of the same name which was in BFO2, but is used in a more restricted sense - specifically, we model this relation as functional (inherited from characteristic-of). Note that this relation is now removed from BFO2020. 
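As a rough illustration of the "functional" constraint noted above (something can only be a characteristic of one thing, while a bearer can have many dependents), the following Python sketch stores "characteristic of" assertions and derives the inverse "bearer of" view. The class name `CharacteristicIndex`, its methods, and the "this roundness" example are hypothetical and are not part of the ontology or of any BFO/RO tooling. The sketch also assumes that distinct names denote distinct entities, which an OWL reasoner would not assume: a reasoner would typically infer that the two bearers are the same individual rather than report an error.

```python
class CharacteristicIndex:
    """Toy store for 'characteristic of' assertions (hypothetical helper)."""

    def __init__(self):
        # Maps each characteristic to its single bearer (functional relation).
        self._bearer = {}

    def assert_characteristic_of(self, characteristic, bearer):
        """Record that `characteristic` is a characteristic of `bearer`."""
        existing = self._bearer.get(characteristic)
        if existing is not None and existing != bearer:
            # Functional: one characteristic cannot have two distinct bearers.
            raise ValueError(
                f"{characteristic!r} is already a characteristic of {existing!r}"
            )
        self._bearer[characteristic] = bearer

    def bearer_of(self, bearer):
        """Inverse view: every characteristic borne by `bearer`, sorted by name."""
        return sorted(c for c, b in self._bearer.items() if b == bearer)


index = CharacteristicIndex()
index.assert_characteristic_of("this red color", "this apple")
index.assert_characteristic_of("this fragility", "this vase")
index.assert_characteristic_of("this roundness", "this apple")  # illustrative only
```

Here `index.bearer_of("this apple")` lists both characteristics of the apple, whereas asserting that "this red color" is also a characteristic of "this vase" raises a `ValueError`.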
function of this red color is a quality of this apple a relation between a quality and an independent continuant (the bearer), in which the quality specifically depends on the bearer for its existence A quality inheres in its bearer at all times for which the quality exists. is quality of quality_of This relation is modeled after the BFO relation of the same name which was in BFO2, but is used in a more restricted sense - specifically, we model this relation as functional (inherited from characteristic-of). Note that this relation is now removed from BFO2020. quality of this investigator role is a role of this person a relation between a role and an independent continuant (the bearer), in which the role specifically depends on the bearer for its existence A role inheres in its bearer at all times for which the role exists, however the role need not be realized at all the times that the role exists. is role of role_of This relation is modeled after the BFO relation of the same name which was in BFO2, but is used in a more restricted sense - specifically, we model this relation as functional (inherited from characteristic-of). Note that this relation is now removed from BFO2020. role of this enzyme has function this catalysis function (more colloquially: this enzyme has this catalysis function) a relation between an independent continuant (the bearer) and a function, in which the function specifically depends on the bearer for its existence A bearer can have many functions, and its functions can exist for different periods of time, but none of its functions can exist when the bearer does not exist. A function need not be realized at all the times that the function exists. 
has_function has function this apple has quality this red color a relation between an independent continuant (the bearer) and a quality, in which the quality specifically depends on the bearer for its existence A bearer can have many qualities, and its qualities can exist for different periods of time, but none of its qualities can exist when the bearer does not exist. has_quality has quality this person has role this investigator role (more colloquially: this person has this role of investigator) a relation between an independent continuant (the bearer) and a role, in which the role specifically depends on the bearer for its existence A bearer can have many roles, and its roles can exist for different periods of time, but none of its roles can exist when the bearer does not exist. A role need not be realized at all the times that the role exists. has_role has role a relation between an independent continuant (the bearer) and a disposition, in which the disposition specifically depends on the bearer for its existence has disposition This relation is modeled after the BFO relation of the same name which was in BFO2, but is used in a more restricted sense - specifically, we model this relation as functional (inherited from characteristic-of). Note that this relation is now removed from BFO2020. disposition of Describes how a learning medium is intended to confer a competence or capability regarding a particular data stewardship activity, e.g. a presentation conferring competency in metadata creation. Peter McQuilton https://orcid.org/0000-0002-7702-4495 Peter McQuilton 2020-10-01T21:36:11.080721Z confers competency about Describes how a learning medium is intended to confer a competence or capability regarding a particular data stewardship activity, e.g. a presentation conferring competency in metadata creation. 
https://orcid.org/0000-0002-7702-4495 Describes how a learning medium is intended to confer knowledge of a particular data stewardship technical concept in order for that learning medium to perform its function, e.g. a presentation conferring competency in metadata. Peter McQuilton https://orcid.org/0000-0002-7702-4495 Peter McQuilton 2020-12-06 00:00:00 confers knowledge about Describes how a learning medium is intended to confer knowledge of a particular data stewardship technical concept in order for that learning medium to perform its function, e.g. a presentation conferring competency in metadata. https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0002-7702-4495 Peter McQuilton requires/improves personal attribute Describes how a learning medium confers practical skills regarding a particular data stewardship technical concept in order for that learning medium to perform its function e.g. a workshop conferring a practical skill in repository access. Peter McQuilton https://orcid.org/0000-0002-7702-4495 Peter McQuilton 2020-12-06 00:00:00 confers practical skill about Describes how a learning medium confers practical skills regarding a particular data stewardship technical concept in order for that learning medium to perform its function e.g. a workshop conferring a practical skill in repository access. https://orcid.org/0000-0002-7702-4495 Describes how an expertise level or role (e.g. "data steward") is associated with a data stewardship activity (e.g. ‘workflow set-up and management’) to indicate whether someone has an awareness of the area, or an ability to do it, or expert knowledge of it. Peter McQuilton https://orcid.org/0000-0002-7702-4495 Definition added 8.2.22 has/wants competency in Describes how an expertise level or role (e.g. "data steward") is associated with a data stewardship activity (e.g. ‘workflow set-up and management’) to indicate whether someone has an awareness of the area, or an ability to do it, or expert knowledge of it. 
https://orcid.org/0000-0002-7702-4495 Peter McQuilton https://orcid.org/0000-0002-7702-4495 has/wants knowledge about https://orcid.org/0000-0002-7702-4495 Peter McQuilton supports implementation of Desires/has a competence or capability acquired or applied in a specific context, e.g. producing a research output or deploying a service. A skill may be specified in a ‘skills user story’. A badge or certificate may provide evidence that a skill has been acquired, and a publication, personal profile, portfolio or CV may provide evidence that a skill has been applied. Peter McQuilton https://orcid.org/0000-0002-7702-4495 Peter McQuilton has/wants practical skill about Desires/has a competence or capability acquired or applied in a specific context, e.g. producing a research output or deploying a service. A skill may be specified in a ‘skills user story’. A badge or certificate may provide evidence that a skill has been acquired, and a publication, personal profile, portfolio or CV may provide evidence that a skill has been applied. FAIR4S Peter McQuilton https://orcid.org/0000-0002-7702-4495 Peter McQuilton 2021-03-24T15:33:43.476917Z has aptitude for Peter McQuilton https://orcid.org/0000-0002-7702-4495 Peter McQuilton 2021-03-22T15:25:13.443153Z contributes to the implementation of The learning medium that creates a competence or capability. https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0002-7702-4495 Allyson Lister 13.9.22: This class is created as the inverse of confers/requires competency about to allow consistent reasoning and access to the Data stewardship guidelines for Data stewardship activities. It needs to be revisited by the entire team to check the model. competency gained through The learning medium that creates a competence or capability. https://orcid.org/0000-0002-7702-4495 Describes how a learning medium requires competency in a particular data stewardship activity in order for that learning medium to perform its function, e.g. 
a presentation requiring competency in metadata creation prior to engaging with that learning medium. requires competency about Describes how a learning medium requires competency in a particular data stewardship activity in order for that learning medium to perform its function, e.g. a presentation requiring competency in metadata creation prior to engaging with that learning medium. https://orcid.org/0000-0002-7702-4495 Describes how a learning medium requires knowledge of a particular data stewardship technical concept in order for that learning medium to perform its function, e.g. a presentation requiring knowledge of metadata. requires knowledge about Describes how a learning medium requires knowledge of a particular data stewardship technical concept in order for that learning medium to perform its function, e.g. a presentation requiring knowledge of metadata. https://orcid.org/0000-0002-7702-4495 Describes how a learning medium requires practical skills of a particular data stewardship technical concept in order for that learning medium to perform its function e.g. a workshop requiring a practical skill in repository access. requires practical skill about Describes how a learning medium requires practical skills of a particular data stewardship technical concept in order for that learning medium to perform its function e.g. a workshop requiring a practical skill in repository access. https://orcid.org/0000-0002-7702-4495 Describes how an evaluation indicator may confer compliance with a particular guideline or set of guidelines https://orcid.org/0000-0002-7702-4495 https://github.com/terms4fairskills/FAIRterminology/issues/22 confers compliance with Describes how an evaluation indicator may confer compliance with a particular guideline or set of guidelines https://orcid.org/0000-0002-7702-4495 A learning medium may demonstrate how to meet the requirements for a particular evaluation indicator(s). 
https://orcid.org/0000-0002-7702-4495 https://github.com/terms4fairskills/FAIRterminology/issues/22 demonstrates how to fulfil the requirements for A learning medium may demonstrate how to meet the requirements for a particular evaluation indicator(s). https://orcid.org/0000-0002-7702-4495 reference URL entity Entity Julius Caesar Verdi’s Requiem the Second World War your body mass index BFO 2 Reference: In all areas of empirical inquiry we encounter general terms of two sorts. First are general terms which refer to universals or types: animal; tuberculosis; surgical procedure; disease. Second, are general terms used to refer to groups of entities which instantiate a given universal but do not correspond to the extension of any subuniversal of that universal because there is nothing intrinsic to the entities in question by virtue of which they – and only they – are counted as belonging to the given group. Examples are: animal purchased by the Emperor; tuberculosis diagnosed on a Wednesday; surgical procedure performed on a patient from Stockholm; person identified as candidate for clinical trial #2056-555; person who is signatory of Form 656-PPV; painting by Leonardo da Vinci. Such terms, which represent what are called ‘specializations’ in [81 Entity doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilities. For example Werner Ceusters 'portions of reality' include 4 sorts, entities (as BFO construes them), universals, configurations, and relations. It is an open question as to whether entities as construed in BFO will at some point also include these other portions of reality. See, for example, 'How to track absolutely everything' at http://www.referent-tracking.com/_RTU/papers/CeustersICbookRevised.pdf An entity is anything that exists or has existed or will exist. (axiom label in BFO2 Reference: [001-001]) entity Entity doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilities. 
For example Werner Ceusters 'portions of reality' include 4 sorts, entities (as BFO construes them), universals, configurations, and relations. It is an open question as to whether entities as construed in BFO will at some point also include these other portions of reality. See, for example, 'How to track absolutely everything' at http://www.referent-tracking.com/_RTU/papers/CeustersICbookRevised.pdf per discussion with Barry Smith An entity is anything that exists or has existed or will exist. (axiom label in BFO2 Reference: [001-001]) continuant Continuant An entity that exists in full at any time in which it exists at all, persists through time while maintaining its identity and has no temporal parts. BFO 2 Reference: Continuant entities are entities which can be sliced to yield parts only along the spatial dimension, yielding for example the parts of your table which we call its legs, its top, its nails. ‘My desk stretches from the window to the door. It has spatial parts, and can be sliced (in space) in two. With respect to time, however, a thing is a continuant.’ [60, p. 240 Continuant doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilities. For example, in an expansion involving bringing in some of Ceusters' other portions of reality, questions are raised as to whether universals are continuants A continuant is an entity that persists, endures, or continues to exist through time while maintaining its identity. (axiom label in BFO2 Reference: [008-002]) if b is a continuant and if, for some t, c has_continuant_part b at t, then c is a continuant. (axiom label in BFO2 Reference: [126-001]) if b is a continuant and if, for some t, c is continuant_part of b at t, then c is a continuant. (axiom label in BFO2 Reference: [009-002]) if b is a material entity, then there is some temporal interval (referred to below as a one-dimensional temporal region) during which b exists. 
(axiom label in BFO2 Reference: [011-002]) (forall (x y) (if (and (Continuant x) (exists (t) (continuantPartOfAt y x t))) (Continuant y))) // axiom label in BFO2 CLIF: [009-002] (forall (x y) (if (and (Continuant x) (exists (t) (hasContinuantPartOfAt y x t))) (Continuant y))) // axiom label in BFO2 CLIF: [126-001] (forall (x) (if (Continuant x) (Entity x))) // axiom label in BFO2 CLIF: [008-002] (forall (x) (if (MaterialEntity x) (exists (t) (and (TemporalRegion t) (existsAt x t))))) // axiom label in BFO2 CLIF: [011-002] continuant Continuant doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilities. For example, in an expansion involving bringing in some of Ceusters' other portions of reality, questions are raised as to whether universals are continuants A continuant is an entity that persists, endures, or continues to exist through time while maintaining its identity. (axiom label in BFO2 Reference: [008-002]) if b is a continuant and if, for some t, c has_continuant_part b at t, then c is a continuant. (axiom label in BFO2 Reference: [126-001]) if b is a continuant and if, for some t, c is continuant_part of b at t, then c is a continuant. (axiom label in BFO2 Reference: [009-002]) if b is a material entity, then there is some temporal interval (referred to below as a one-dimensional temporal region) during which b exists. 
(axiom label in BFO2 Reference: [011-002]) (forall (x y) (if (and (Continuant x) (exists (t) (continuantPartOfAt y x t))) (Continuant y))) // axiom label in BFO2 CLIF: [009-002] (forall (x y) (if (and (Continuant x) (exists (t) (hasContinuantPartOfAt y x t))) (Continuant y))) // axiom label in BFO2 CLIF: [126-001] (forall (x) (if (Continuant x) (Entity x))) // axiom label in BFO2 CLIF: [008-002] (forall (x) (if (MaterialEntity x) (exists (t) (and (TemporalRegion t) (existsAt x t))))) // axiom label in BFO2 CLIF: [011-002] occurrent Occurrent An entity that has temporal parts and that happens, unfolds or develops through time. BFO 2 Reference: every occurrent that is not a temporal or spatiotemporal region is s-dependent on some independent continuant that is not a spatial region BFO 2 Reference: s-dependence obtains between every process and its participants in the sense that, as a matter of necessity, this process could not have existed unless these or those participants existed also. A process may have a succession of participants at different phases of its unfolding. Thus there may be different players on the field at different times during the course of a football game; but the process which is the entire game s-depends_on all of these players nonetheless. Some temporal parts of this process will s-depend_on only some of the players. Occurrent doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilities. An example would be the sum of a process and the process boundary of another process. Simons uses different terminology for relations of occurrents to regions: Denote the spatio-temporal location of a given occurrent e by 'spn[e]' and call this region its span. We may say an occurrent is at its span, in any larger region, and covers any smaller region. Now suppose we have fixed a frame of reference so that we can speak not merely of spatio-temporal but also of spatial regions (places) and temporal regions (times). 
The spread of an occurrent, (relative to a frame of reference) is the space it exactly occupies, and its spell is likewise the time it exactly occupies. We write 'spr[e]' and 'spl[e]' respectively for the spread and spell of e, omitting mention of the frame. An occurrent is an entity that unfolds itself in time or it is the instantaneous boundary of such an entity (for example a beginning or an ending) or it is a temporal or spatiotemporal region which such an entity occupies_temporal_region or occupies_spatiotemporal_region. (axiom label in BFO2 Reference: [077-002]) Every occurrent occupies_spatiotemporal_region some spatiotemporal region. (axiom label in BFO2 Reference: [108-001]) b is an occurrent entity iff b is an entity that has temporal parts. (axiom label in BFO2 Reference: [079-001]) (forall (x) (if (Occurrent x) (exists (r) (and (SpatioTemporalRegion r) (occupiesSpatioTemporalRegion x r))))) // axiom label in BFO2 CLIF: [108-001] (forall (x) (iff (Occurrent x) (and (Entity x) (exists (y) (temporalPartOf y x))))) // axiom label in BFO2 CLIF: [079-001] occurent occurrent Occurrent doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilities. An example would be the sum of a process and the process boundary of another process. per discussion with Barry Smith Simons uses different terminology for relations of occurrents to regions: Denote the spatio-temporal location of a given occurrent e by 'spn[e]' and call this region its span. We may say an occurrent is at its span, in any larger region, and covers any smaller region. Now suppose we have fixed a frame of reference so that we can speak not merely of spatio-temporal but also of spatial regions (places) and temporal regions (times). The spread of an occurrent, (relative to a frame of reference) is the space it exactly occupies, and its spell is likewise the time it exactly occupies. 
We write 'spr[e]' and 'spl[e]' respectively for the spread and spell of e, omitting mention of the frame. An occurrent is an entity that unfolds itself in time or it is the instantaneous boundary of such an entity (for example a beginning or an ending) or it is a temporal or spatiotemporal region which such an entity occupies_temporal_region or occupies_spatiotemporal_region. (axiom label in BFO2 Reference: [077-002]) Every occurrent occupies_spatiotemporal_region some spatiotemporal region. (axiom label in BFO2 Reference: [108-001]) b is an occurrent entity iff b is an entity that has temporal parts. (axiom label in BFO2 Reference: [079-001]) (forall (x) (if (Occurrent x) (exists (r) (and (SpatioTemporalRegion r) (occupiesSpatioTemporalRegion x r))))) // axiom label in BFO2 CLIF: [108-001] (forall (x) (iff (Occurrent x) (and (Entity x) (exists (y) (temporalPartOf y x))))) // axiom label in BFO2 CLIF: [079-001] ic IndependentContinuant a chair a heart a leg a molecule a spatial region an atom an orchestra. an organism the bottom right portion of a human torso the interior of your mouth b is an independent continuant = Def. b is a continuant which is such that there is no c and no t such that b s-depends_on c at t. (axiom label in BFO2 Reference: [017-002]) For any independent continuant b and any time t there is some spatial region r such that b is located_in r at t. (axiom label in BFO2 Reference: [134-001]) For every independent continuant b and time t during the region of time spanned by its life, there are entities which s-depends_on b during t. 
(axiom label in BFO2 Reference: [018-002]) (forall (x t) (if (IndependentContinuant x) (exists (r) (and (SpatialRegion r) (locatedInAt x r t))))) // axiom label in BFO2 CLIF: [134-001] (forall (x t) (if (and (IndependentContinuant x) (existsAt x t)) (exists (y) (and (Entity y) (specificallyDependsOnAt y x t))))) // axiom label in BFO2 CLIF: [018-002] (iff (IndependentContinuant a) (and (Continuant a) (not (exists (b t) (specificallyDependsOnAt a b t))))) // axiom label in BFO2 CLIF: [017-002] independent continuant b is an independent continuant = Def. b is a continuant which is such that there is no c and no t such that b s-depends_on c at t. (axiom label in BFO2 Reference: [017-002]) For any independent continuant b and any time t there is some spatial region r such that b is located_in r at t. (axiom label in BFO2 Reference: [134-001]) For every independent continuant b and time t during the region of time spanned by its life, there are entities which s-depends_on b during t. (axiom label in BFO2 Reference: [018-002]) (forall (x t) (if (IndependentContinuant x) (exists (r) (and (SpatialRegion r) (locatedInAt x r t))))) // axiom label in BFO2 CLIF: [134-001] (forall (x t) (if (and (IndependentContinuant x) (existsAt x t)) (exists (y) (and (Entity y) (specificallyDependsOnAt y x t))))) // axiom label in BFO2 CLIF: [018-002] (iff (IndependentContinuant a) (and (Continuant a) (not (exists (b t) (specificallyDependsOnAt a b t))))) // axiom label in BFO2 CLIF: [017-002] process Process a process of cell-division, a beating of the heart a process of meiosis a process of sleeping the course of a disease the flight of a bird the life of an organism your process of aging. p is a process = Def. p is an occurrent that has temporal proper parts and for some time t, p s-depends_on some material entity at t. 
(axiom label in BFO2 Reference: [083-003]) BFO 2 Reference: The realm of occurrents is less pervasively marked by the presence of natural units than is the case in the realm of independent continuants. Thus there is here no counterpart of ‘object’. In BFO 1.0 ‘process’ served as such a counterpart. In BFO 2.0 ‘process’ is, rather, the occurrent counterpart of ‘material entity’. Those natural – as contrasted with engineered, which here means: deliberately executed – units which do exist in the realm of occurrents are typically either parasitic on the existence of natural units on the continuant side, or they are fiat in nature. Thus we can count lives; we can count football games; we can count chemical reactions performed in experiments or in chemical manufacturing. We cannot count the processes taking place, for instance, in an episode of insect mating behavior. Even where natural units are identifiable, for example cycles in a cyclical process such as the beating of a heart or an organism’s sleep/wake cycle, the processes in question form a sequence with no discontinuities (temporal gaps) of the sort that we find for instance where billiard balls or zebrafish or planets are separated by clear spatial gaps. Lives of organisms are process units, but they too unfold in a continuous series from other, prior processes such as fertilization, and they unfold in turn in continuous series of post-life processes such as post-mortem decay. Clear examples of boundaries of processes are almost always of the fiat sort (midnight, a time of death as declared in an operating theater or on a death certificate, the initiation of a state of war) (iff (Process a) (and (Occurrent a) (exists (b) (properTemporalPartOf b a)) (exists (c t) (and (MaterialEntity c) (specificallyDependsOnAt a c t))))) // axiom label in BFO2 CLIF: [083-003] process p is a process = Def. p is an occurrent that has temporal proper parts and for some time t, p s-depends_on some material entity at t. 
(axiom label in BFO2 Reference: [083-003]) (iff (Process a) (and (Occurrent a) (exists (b) (properTemporalPartOf b a)) (exists (c t) (and (MaterialEntity c) (specificallyDependsOnAt a c t))))) // axiom label in BFO2 CLIF: [083-003] disposition Disposition an atom of element X has the disposition to decay to an atom of element Y certain people have a predisposition to colon cancer children are innately disposed to categorize objects in certain ways. the cell wall is disposed to filter chemicals in endocytosis and exocytosis BFO 2 Reference: Dispositions exist along a strength continuum. Weaker forms of disposition are realized in only a fraction of triggering cases. These forms occur in a significant number of cases of a similar type. b is a disposition means: b is a realizable entity & b’s bearer is some material entity & b is such that if it ceases to exist, then its bearer is physically changed, & b’s realization occurs when and because this bearer is in some special physical circumstances, & this realization occurs in virtue of the bearer’s physical make-up. (axiom label in BFO2 Reference: [062-002]) If b is a realizable entity then for all t at which b exists, b s-depends_on some material entity at t. (axiom label in BFO2 Reference: [063-002]) (forall (x t) (if (and (RealizableEntity x) (existsAt x t)) (exists (y) (and (MaterialEntity y) (specificallyDepends x y t))))) // axiom label in BFO2 CLIF: [063-002] (forall (x) (if (Disposition x) (and (RealizableEntity x) (exists (y) (and (MaterialEntity y) (bearerOfAt x y t)))))) // axiom label in BFO2 CLIF: [062-002] disposition b is a disposition means: b is a realizable entity & b’s bearer is some material entity & b is such that if it ceases to exist, then its bearer is physically changed, & b’s realization occurs when and because this bearer is in some special physical circumstances, & this realization occurs in virtue of the bearer’s physical make-up. 
(axiom label in BFO2 Reference: [062-002]) If b is a realizable entity then for all t at which b exists, b s-depends_on some material entity at t. (axiom label in BFO2 Reference: [063-002]) (forall (x t) (if (and (RealizableEntity x) (existsAt x t)) (exists (y) (and (MaterialEntity y) (specificallyDepends x y t))))) // axiom label in BFO2 CLIF: [063-002] (forall (x) (if (Disposition x) (and (RealizableEntity x) (exists (y) (and (MaterialEntity y) (bearerOfAt x y t)))))) // axiom label in BFO2 CLIF: [062-002] realizable RealizableEntity the disposition of this piece of metal to conduct electricity. the disposition of your blood to coagulate the function of your reproductive organs the role of being a doctor the role of this boundary to delineate where Utah and Colorado meet A specifically dependent continuant that inheres in continuant entities and are not exhibited in full at every time in which it inheres in an entity or group of entities. The exhibition or actualization of a realizable entity is a particular manifestation, functioning or process that occurs under certain circumstances. To say that b is a realizable entity is to say that b is a specifically dependent continuant that inheres in some independent continuant which is not a spatial region and is of a type instances of which are realized in processes of a correlated type. (axiom label in BFO2 Reference: [058-002]) All realizable dependent continuants have independent continuants that are not spatial regions as their bearers. 
(axiom label in BFO2 Reference: [060-002]) (forall (x t) (if (RealizableEntity x) (exists (y) (and (IndependentContinuant y) (not (SpatialRegion y)) (bearerOfAt y x t))))) // axiom label in BFO2 CLIF: [060-002] (forall (x) (if (RealizableEntity x) (and (SpecificallyDependentContinuant x) (exists (y) (and (IndependentContinuant y) (not (SpatialRegion y)) (inheresIn x y)))))) // axiom label in BFO2 CLIF: [058-002] realizable entity To say that b is a realizable entity is to say that b is a specifically dependent continuant that inheres in some independent continuant which is not a spatial region and is of a type instances of which are realized in processes of a correlated type. (axiom label in BFO2 Reference: [058-002]) All realizable dependent continuants have independent continuants that are not spatial regions as their bearers. (axiom label in BFO2 Reference: [060-002]) (forall (x t) (if (RealizableEntity x) (exists (y) (and (IndependentContinuant y) (not (SpatialRegion y)) (bearerOfAt y x t))))) // axiom label in BFO2 CLIF: [060-002] (forall (x) (if (RealizableEntity x) (and (SpecificallyDependentContinuant x) (exists (y) (and (IndependentContinuant y) (not (SpatialRegion y)) (inheresIn x y)))))) // axiom label in BFO2 CLIF: [058-002] quality Quality the ambient temperature of this portion of air the color of a tomato the length of the circumference of your waist the mass of this piece of gold. the shape of your nose the shape of your nostril a quality is a specifically dependent continuant that, in contrast to roles and dispositions, does not require any further process in order to be realized. (axiom label in BFO2 Reference: [055-001]) If an entity is a quality at any time that it exists, then it is a quality at every time that it exists. 
(axiom label in BFO2 Reference: [105-001]) (forall (x) (if (Quality x) (SpecificallyDependentContinuant x))) // axiom label in BFO2 CLIF: [055-001] (forall (x) (if (exists (t) (and (existsAt x t) (Quality x))) (forall (t_1) (if (existsAt x t_1) (Quality x))))) // axiom label in BFO2 CLIF: [105-001] quality a quality is a specifically dependent continuant that, in contrast to roles and dispositions, does not require any further process in order to be realized. (axiom label in BFO2 Reference: [055-001]) If an entity is a quality at any time that it exists, then it is a quality at every time that it exists. (axiom label in BFO2 Reference: [105-001]) (forall (x) (if (Quality x) (SpecificallyDependentContinuant x))) // axiom label in BFO2 CLIF: [055-001] (forall (x) (if (exists (t) (and (existsAt x t) (Quality x))) (forall (t_1) (if (existsAt x t_1) (Quality x))))) // axiom label in BFO2 CLIF: [105-001] sdc SpecificallyDependentContinuant Reciprocal specifically dependent continuants: the function of this key to open this lock and the mutually dependent disposition of this lock: to be opened by this key of one-sided specifically dependent continuants: the mass of this tomato of relational dependent continuants (multiple bearers): John’s love for Mary, the ownership relation between John and this statue, the relation of authority between John and his subordinates. the disposition of this fish to decay the function of this heart: to pump blood the mutual dependence of proton donors and acceptors in chemical reactions [79 the mutual dependence of the role predator and the role prey as played by two organisms in a given interaction the pink color of a medium rare piece of grilled filet mignon at its center the role of being a doctor the shape of this hole. the smell of this portion of mozzarella b is a specifically dependent continuant = Def. 
b is a continuant & there is some independent continuant c which is not a spatial region and which is such that b s-depends_on c at every time t during the course of b’s existence. (axiom label in BFO2 Reference: [050-003]) Specifically dependent continuant doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilities. We're not sure what else will develop here, but for example there are questions such as what are promises, obligation, etc. (iff (SpecificallyDependentContinuant a) (and (Continuant a) (forall (t) (if (existsAt a t) (exists (b) (and (IndependentContinuant b) (not (SpatialRegion b)) (specificallyDependsOnAt a b t))))))) // axiom label in BFO2 CLIF: [050-003] specifically dependent continuant b is a specifically dependent continuant = Def. b is a continuant & there is some independent continuant c which is not a spatial region and which is such that b s-depends_on c at every time t during the course of b’s existence. (axiom label in BFO2 Reference: [050-003]) Specifically dependent continuant doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilities. We're not sure what else will develop here, but for example there are questions such as what are promises, obligation, etc. per discussion with Barry Smith (iff (SpecificallyDependentContinuant a) (and (Continuant a) (forall (t) (if (existsAt a t) (exists (b) (and (IndependentContinuant b) (not (SpatialRegion b)) (specificallyDependsOnAt a b t))))))) // axiom label in BFO2 CLIF: [050-003] role Role John’s role of husband to Mary is dependent on Mary’s role of wife to John, and both are dependent on the object aggregate comprising John and Mary as member parts joined together through the relational quality of being married. 
the priest role the role of a boundary to demarcate two neighboring administrative territories the role of a building in serving as a military target the role of a stone in marking a property boundary the role of subject in a clinical trial the student role A realizable entity the manifestation of which brings about some result or end that is not essential to a continuant in virtue of the kind of thing that it is but that can be served or participated in by that kind of continuant in some kinds of natural, social or institutional contexts. BFO 2 Reference: One major family of examples of non-rigid universals involves roles, and ontologies developed for corresponding administrative purposes may consist entirely of representatives of entities of this sort. Thus ‘professor’, defined as follows: b instance_of professor at t =Def. there is some c, c instance_of professor role & c inheres_in b at t, denotes a non-rigid universal and so also do ‘nurse’, ‘student’, ‘colonel’, ‘taxpayer’, and so forth. (These terms are all, in the jargon of philosophy, phase sortals.) By using role terms in definitions, we can create a BFO conformant treatment of such entities drawing on the fact that, while an instance of professor may be simultaneously an instance of trade union member, no instance of the type professor role is also (at any time) an instance of the type trade union member role (any more than any instance of the type color is at any time an instance of the type length). If an ontology of employment positions should be defined in terms of roles following the above pattern, this enables the ontology to do justice to the fact that individuals instantiate the corresponding universals – professor, sergeant, nurse – only during certain phases in their lives. 
Peter McQuilton https://orcid.org/0000-0002-7702-4495 b is a role means: b is a realizable entity & b exists because there is some single bearer that is in some special physical, social, or institutional set of circumstances in which this bearer does not have to be & b is not such that, if it ceases to exist, then the physical make-up of the bearer is thereby changed. (axiom label in BFO2 Reference: [061-001]) (forall (x) (if (Role x) (RealizableEntity x))) // axiom label in BFO2 CLIF: [061-001] role b is a role means: b is a realizable entity & b exists because there is some single bearer that is in some special physical, social, or institutional set of circumstances in which this bearer does not have to be & b is not such that, if it ceases to exist, then the physical make-up of the bearer is thereby changed. (axiom label in BFO2 Reference: [061-001]) (forall (x) (if (Role x) (RealizableEntity x))) // axiom label in BFO2 CLIF: [061-001] gdc GenericallyDependentContinuant The entries in your database are patterns instantiated as quality instances in your hard drive. The database itself is an aggregate of such patterns. When you create the database you create a particular instance of the generically dependent continuant type database. Each entry in the database is an instance of the generically dependent continuant type IAO: information content entity. the pdf file on your laptop, the pdf file that is a copy thereof on my laptop the sequence of this protein molecule; the sequence that is a copy thereof in that protein molecule. b is a generically dependent continuant = Def. b is a continuant that g-depends_on one or more other entities. (axiom label in BFO2 Reference: [074-001]) (iff (GenericallyDependentContinuant a) (and (Continuant a) (exists (b t) (genericallyDependsOnAt a b t)))) // axiom label in BFO2 CLIF: [074-001] generically dependent continuant b is a generically dependent continuant = Def. b is a continuant that g-depends_on one or more other entities. 
(axiom label in BFO2 Reference: [074-001]) (iff (GenericallyDependentContinuant a) (and (Continuant a) (exists (b t) (genericallyDependsOnAt a b t)))) // axiom label in BFO2 CLIF: [074-001] function Function the function of a hammer to drive in nails the function of a heart pacemaker to regulate the beating of a heart through electricity the function of amylase in saliva to break down starch into sugar BFO 2 Reference: In the past, we have distinguished two varieties of function, artifactual function and biological function. These are not asserted subtypes of BFO:function however, since the same function – for example: to pump, to transport – can exist both in artifacts and in biological entities. The asserted subtypes of function that would be needed in order to yield a separate monohierarchy are not artifactual function, biological function, etc., but rather transporting function, pumping function, etc. A function is a disposition that exists in virtue of the bearer’s physical make-up and this physical make-up is something the bearer possesses because it came into being, either through evolution (in the case of natural biological entities) or through intentional design (in the case of artifacts), in order to realize processes of a certain sort. (axiom label in BFO2 Reference: [064-001]) (forall (x) (if (Function x) (Disposition x))) // axiom label in BFO2 CLIF: [064-001] function A function is a disposition that exists in virtue of the bearer’s physical make-up and this physical make-up is something the bearer possesses because it came into being, either through evolution (in the case of natural biological entities) or through intentional design (in the case of artifacts), in order to realize processes of a certain sort. (axiom label in BFO2 Reference: [064-001]) (forall (x) (if (Function x) (Disposition x))) // axiom label in BFO2 CLIF: [064-001] data item Data items include counts of things, analyte concentrations, and statistical summaries. 
An information content entity that is intended to be a truthful statement about something (modulo, e.g., measurement precision or other systematic errors) and is constructed/acquired by a method which reliably tends to produce (approximately) truthful statements. 2/2/2009 Alan and Bjoern discussing FACS run output data. This is a data item because it is about the cell population. Each element records an event and is typically further composed of a set of measurement data items that record the fluorescent intensity stimulated by one of the lasers. 2009-03-16: data item deliberately ambiguous: we merged data set and datum to be one entity, not knowing how to define singular versus plural. So data item is more general than datum. 2009-03-16: removed datum as alternative term as datum specifically refers to singular form, and is thus not an exact synonym. 2014-03-31: See discussion at http://odontomachus.wordpress.com/2014/03/30/aboutness-objects-propositions/ JAR: datum -- well, this will be very tricky to define, but maybe some information-like stuff that might be put into a computer and that is meant, by someone, to denote and/or to be interpreted by some process... I would include lists, tables, sentences... I think I might defer to Barry, or to Brian Cantwell Smith JAR: A data item is an approximately justified approximately true approximate belief PERSON: Alan Ruttenberg PERSON: Chris Stoeckert PERSON: Jonathan Rees data data item information content entity Examples of information content entities include journal articles, data, graphical layouts, and graphs. A generically dependent continuant that is about some thing. 2014-03-10: The use of "thing" is intended to be general enough to include universals and configurations (see https://groups.google.com/d/msg/information-ontology/GBxvYZCk1oc/-L6B5fSBBTQJ). information_content_entity 'is_encoded_in' some digital_entity in obi before split (040907). 
information_content_entity 'is_encoded_in' some physical_document in obi before split (040907). Previous. An information content entity is a non-realizable information entity that 'is encoded in' some digital or physical entity. PERSON: Chris Stoeckert OBI_0000142 information content entity curation status specification The curation status of the term. The allowed values come from an enumerated list of predefined terms. See the specification of these instances for more detailed definitions of each enumerated value. Better to represent curation as a process with parts and then relate labels to that process (in IAO meeting) PERSON:Bill Bug GROUP:OBI:<http://purl.obolibrary.org/obo/obi> OBI_0000266 curation status specification data about an ontology part Data about an ontology part is a data item about a part of an ontology, for example a term Person:Alan Ruttenberg data about an ontology part obsolescence reason specification The reason for which a term has been deprecated. The allowed values come from an enumerated list of predefined terms. See the specification of these instances for more detailed definitions of each enumerated value. The creation of this class has been inspired in part by Werner Ceusters' paper, Applying evolutionary terminology auditing to the Gene Ontology. PERSON: Alan Ruttenberg PERSON: Melanie Courtot obsolescence reason specification denotator type The Basic Formal Ontology ontology makes a distinction between Universals and defined classes, where the formal are "natural kinds" and the latter arbitrary collections of entities. A denotator type indicates how a term should be interpreted from an ontological perspective. Alan Ruttenberg Barry Smith, Werner Ceusters denotator type A file that contains the values in a table as a series of ASCII text lines organized so that each column value is separated by a pipe. 
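As an illustration of the pipe separated values definition above, the following is a minimal sketch of reading such a file's text with Python's standard csv module. The function name read_psv and the sample content are hypothetical and are not part of terms4FAIRskills or CASRAI; the sketch assumes plain text with one table row per line and no pipes inside values.

```python
import csv
import io


def read_psv(text):
    """Parse pipe-separated text into a list of rows (each a list of strings).

    Hypothetical helper for illustration only; assumes one table row per
    line and a pipe character between column values.
    """
    reader = csv.reader(io.StringIO(text), delimiter="|")
    return [row for row in reader]


# Hypothetical sample table: a header line followed by two rows.
sample = "id|label\n1|data item\n2|version control\n"
rows = read_psv(sample)
for row in rows:
    print(row)
```

Using the csv module rather than splitting each line on the pipe character keeps quoted values intact if a producer happens to quote them, which is an assumption about the input rather than something the definition requires.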
Peter McQuilton https://orcid.org/0000-0002-7702-4495 pipe separated values A file that contains the values in a table as a series of ASCII text lines organized so that each column value is separated by a pipe. CASRAI. https://casrai.org/term/pipe-separated-values/ Software preservation involves the collection and long-term storage of software for archiving as well as maintaining availability and accessibility. Kristina Hettne Victoria Dominguez Del Angel Yann Le Franc https://orcid.org/0000-0002-7702-4495 AL 8.2.22: Refactored "software review and preservation" to "software preservation", as the review process is not within the remit of terms4FAIRskills. Added definition and source. software preservation Software preservation involves the collection and long-term storage of software for archiving as well as maintaining availability and accessibility. https://orcid.org/0000-0002-7702-4495 8.2.22 In accessing a repository one uses a client (application) to discover relevant digital objects within a repository, and then retrieve a copy of a desired digital object. Peter McQuilton https://orcid.org/0000-0002-7702-4495 repository access In accessing a repository one uses a client (application) to discover relevant digital objects within a repository, and then retrieve a copy of a desired digital object. CASRAI. https://casrai.org/term/repository-access Include FAIR and open research in the strategic framework for the organization and set objectives and timeframe. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang develop fair and open research vision The activity of developing an open research strategy and vision. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang develop open research strategy and vision The activity of developing an open research strategy and vision. 
PMQ 3.2022 A written document backed by management describing policy and providing guidance to ensure that appropriate standards, consistent guidelines, and common strategies are used, providing linkages to and consistency with other similar systems, and fostering a true network across an organization producing data. Peter McQuilton data management policy A written document backed by management describing policy and providing guidance to ensure that appropriate standards, consistent guidelines, and common strategies are used, providing linkages to and consistency with other similar systems, and fostering a true network across an organization producing data. CASRAI. https://casrai.org/term/data-management-policy/ Understand how the governing principles of research integrity and FAIR overlap Angus Whyte Celia van Gelder understand research code of conduct To be able to choose the appropriate reporting guideline/checklist for your data, based on community-adopted standards. Peter McQuilton choosing the appropriate reporting guideline for your data To be able to choose the appropriate reporting guideline/checklist for your data, based on community-adopted standards. [PMQ] A specific deed, action, function or sphere of action in relation with the role of data stewardship Peter McQuilton Yann Le Franc https://orcid.org/0000-0002-7702-4495 Set of actions carried out during data stewardship processes data stewardship activity A series of potentially destructive or irrevocable changes to a piece of data or a file. Common munging operations include removing punctuation or html tags, data parsing, filtering, and transformation. Peter McQuilton data munging A series of potentially destructive or irrevocable changes to a piece of data or a file. Common munging operations include removing punctuation or html tags, data parsing, filtering, and transformation. CASRAI. 
https://casrai.org/term/data-munging/ Peter McQuilton interoperability of digital assets Bin for Skills related to Resource management leightonlc skills for resource management An activity within archiving in which specific items of data are maintained over time so that they can still be accessed and understood through changes in technology. Peter McQuilton https://orcid.org/0000-0002-7702-4495 Conservation preservation An activity within archiving in which specific items of data are maintained over time so that they can still be accessed and understood through changes in technology. CASRAI. https://casrai.org/term/preservation/ Data Categorization Data Classification Peter McQuilton 2021-02-17T22:30:31.531624Z data categorisation A curation process on a data object by which it receives a persistent object identifier (PID) from a trusted registration authority. Registration must be accompanied by the step(s) to upload the data object to a persistent repository. Peter McQuilton https://orcid.org/0000-0002-7702-4495 data registration A curation process on a data object by which it receives a persistent object identifier (PID) from a trusted registration authority. Registration must be accompanied by the step(s) to upload the data object to a persistent repository. CASRAI. https://casrai.org/term/data-registration/ RELATED TERM. Repository; Persistent identifierREFERENCE. Research Data Alliance http://smw-rda.esc.rzg.mpg.de/index.php/Main_Page ; NISO (2004) Understanding Metadata. Bethesda, MD: NISO Press. The capability to communicate, execute programs, or transfer data among various functional units in a useful and meaningful manner that requires the user to have little or no knowledge of the unique characteristics of those units. Foundational, syntactic, and semantic interoperability are the three necessary aspects of interoperability. 
Peter McQuilton interoperability The capability to communicate, execute programs, or transfer data among various functional units in a useful and meaningful manner that requires the user to have little or no knowledge of the unique characteristics of those units. Foundational, syntactic, and semantic interoperability are the three necessary aspects of interoperability. CASRAI. https://casrai.org/term/interoperability/ Exposing data is the activity of exposing your data to collaborators, the public, or other interested parties. A data producer makes the data accessible to external users in a machine- and/or human-readable way. Peter McQuilton https://orcid.org/0000-0002-7702-4495 AL 15.3.22: Was "expose your data", which was not in the style of other term labels. Also updated definition, but might need revisiting later for a more formal definition. exposing data Exposing data is the activity of exposing your data to collaborators, the public, or other interested parties. A data producer makes the data accessible to external users in a machine- and/or human-readable way. [PMQ 3.2022, AL 15.3.22] Bin for Aptitudes related to Workflow technologies management. leightonlc aptitudes for workflow technologies management A standard that is widely accepted and used, but lacks formal approval by a recognized standards developing organization (e.g., the QWERTY keyboard). Peter McQuilton de facto standard A standard that is widely accepted and used, but lacks formal approval by a recognized standards developing organization (e.g., the QWERTY keyboard). CASRAI. https://casrai.org/term/de-facto-standard/ An object, event, or phenomenon about which data are stored in a database and which has intermediate representation in a Data Model. Peter McQuilton data entity An object, event, or phenomenon about which data are stored in a database and which has intermediate representation in a Data Model. CASRAI. 
https://casrai.org/term/data-entity/ Data that are being received, processed and stored at the time of their occurrence with only small delays. Examples include: stock quotes, manufacturing statistics, Web server loads, data warehouse activity and sensor feeds to data collectors. Real-time data are data streams that are typically generated by sensors and received via direct networking connections. Peter McQuilton https://orcid.org/0000-0002-7702-4495 real-time data Data that are being received, processed and stored at the time of their occurrence with only small delays. Examples include: stock quotes, manufacturing statistics, Web server loads, data warehouse activity and sensor feeds to data collectors. Real-time data are data streams that are typically generated by sensors and received via direct networking connections. CASRAI. https://casrai.org/term/real-time-data Periodically detect and analyse storage security risks, and minimize the impact of the risks detected. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang storage security risk assessment and mitigation The ability to get access to a computer or a network from a remote distance. Access may be through an Internet service provider (ISP) or through a dedicated line between a computer or a remote local area network and the central or main corporate local area network. A dedicated line is more expensive and less flexible but offers faster data rates. Peter McQuilton https://orcid.org/0000-0002-7702-4495 remote access The ability to get access to a computer or a network from a remote distance. Access may be through an Internet service provider (ISP) or through a dedicated line between a computer or a remote local area network and the central or main corporate local area network. A dedicated line is more expensive and less flexible but offers faster data rates. CASRAI. https://casrai.org/term/remote-access The activity of supervision of other people to ensure FAIR data practices. 
Peter McQuilton Philippe Rocca-Serra Susanna Sansone https://orcid.org/0000-0002-7702-4495 fair stewardship supervising The activity of supervision of other people to ensure FAIR data practices. PMQ Bin for Skills needed for High performance computing management. leightonlc skills related to high performance computing management Ability to select the appropriate FAIR metrics among the existing ones in relation to the type of digital object concerned. Kristina Hettne Victoria Dominguez Del Angel Yann Le Franc fair metrics selection skills A collection of data items organized as a set of formally-described tables from which data can be accessed or reassembled in many different ways without having to reorganize the database tables. The standard user and application program interface to a relational database is the structured query language (SQL). Peter McQuilton https://orcid.org/0000-0002-7702-4495 relational database A collection of data items organized as a set of formally-described tables from which data can be accessed or reassembled in many different ways without having to reorganize the database tables. The standard user and application program interface to a relational database is the structured query language (SQL). CASRAI. https://casrai.org/term/relational-database Data that could not lead to the identification of a specific object of interest. These may be data that have been de-identified, or that could not lead to identifiable information in the first place. Peter McQuilton https://orcid.org/0000-0002-7702-4495 AL 6.5.22: Modified CASRAI definition to distinguish this term from 'non personally identifiable information.' Further work may determine whether or not both terms are required. non identifiable data Data that could not lead to the identification of a specific object of interest. These may be data that have been de-identified, or that could not lead to identifiable information in the first place. Modified by AL on 6.5.22 from CASRAI. 
https://casrai.org/term/non-identifiable-data/ database developer The activity of managing and promoting the use of data from their point of creation to ensure that they are fit for contemporary purpose and available for discovery and reuse. Peter McQuilton curation The activity of managing and promoting the use of data from their point of creation to ensure that they are fit for contemporary purpose and available for discovery and reuse. CASRAI. https://casrai.org/term/curation/ Peter McQuilton A1. (meta)data are retrievable by their identifier using a standardised communications protocol Control over time of data, computer code, software, and documents that allows for the ability to revert to a previous revision, which is critical for data traceability, tracking edits, and correcting mistakes. Version control generates a (changed) copy of a data object that is uniquely labeled with a version number. The intent is to track changes to a data object, by making versioned copies. Note that a version is different from a backup copy, which is typically a copy made at a specific point in time, or a replica. Peter McQuilton https://orcid.org/0000-0002-7702-4495 version control Control over time of data, computer code, software, and documents that allows for the ability to revert to a previous revision, which is critical for data traceability, tracking edits, and correcting mistakes. Version control generates a (changed) copy of a data object that is uniquely labeled with a version number. The intent is to track changes to a data object, by making versioned copies. Note that a version is different from a backup copy, which is typically a copy made at a specific point in time, or a replica. CASRAI. https://casrai.org/term/version-control Understand the FAIR and open research practices, and the research landscape / current data management practices in the organization. 
Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang understanding fair and open research challenges in your organization ethical application of patents, licenses A system that allows outsiders to be granted access to databases without overloading the system. Peter McQuilton data access protocol A system that allows outsiders to be granted access to databases without overloading the system. CASRAI. https://casrai.org/term/data-access-protocol/ A generic concept referring to all kinds of procedures being executed on data at any point in the data life cycle. Peter McQuilton data processing A generic concept referring to all kinds of procedures being executed on data at any point in the data life cycle. CASRAI. https://casrai.org/term/data-processing/ Denotes the complexity of measures that are used by a repository to form aggregations of data objects (including collections and metadata) to describe the properties of data objects, to register PIDs, to build the PID records, to link between all components, and to set up the containers (software stack) that are used to store all... Peter McQuilton data organization Denotes the complexity of measures that are used by a repository to form aggregations of data objects (including collections and metadata) to describe the properties of data objects, to register PIDs, to build the PID records, to link between all components, and to set up the containers (software stack) that are used to store all... CASRAI. https://casrai.org/term/data-organization/ Recovery and/or transformation and digitization of dark data and at-risk data so that they can be preserved, accessed, shared, and used. Data rescue also involves the addition of rich metadata to make the content understandable and more easily re-usable. Peter McQuilton https://orcid.org/0000-0002-7702-4495 data rescue Recovery and/or transformation and digitization of dark data and at-risk data so that they can be preserved, accessed, shared, and used. 
Data rescue also involves the addition of rich metadata to make the content understandable and more easily re-usable. CASRAI. https://casrai.org/term/data-rescue/ REFERENCE. http://www.wmo.int/pages/prog/hwrp/datarescue.php ; http://iedro.org/data-rescue-process/ ; Anticipate possible implications of the research and making its outputs FAIR, reflecting on motivations and areas of uncertainty. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang understand societal impact of research To be able to search using software with a GUI or terminal access. For example, using BLAST or genomic search tools in the life sciences. Peter McQuilton Algorithm searching Programmatic search batch search searching via algorithms and software To be able to search using software with a GUI or terminal access. For example, using BLAST or genomic search tools in the life sciences. [PMQ] An independent evaluation of an organization, system, process, project or product. Peter McQuilton audit An independent evaluation of an organization, system, process, project or product. CASRAI. https://casrai.org/term/audit/ Peter McQuilton 2020-10-01T21:03:37.147424Z wiki site Bin for Skills needed for Authorization management. leightonlc skills related to authorisation management The term storage management encompasses the technologies and processes organizations use to maximize or improve the performance of their data storage resources. It is a broad category that includes virtualization, replication, mirroring, security, compression, traffic analysis, process automation, storage provisioning and related techniques. Kristina Hettne Simon Hodson Victoria Dominguez Del Angel storage management The term storage management encompasses the technologies and processes organizations use to maximize or improve the performance of their data storage resources.
It is a broad category that includes virtualization, replication, mirroring, security, compression, traffic analysis, process automation, storage provisioning and related techniques. (webopedia.com/TERM/S/storage_management.html) [VDA] A central repository for all or significant parts of the data that an organization's various business systems collect. A data warehouse tends to be a strategic but somewhat unfinished concept. Data warehousing emphasizes the capture of data from diverse sources for useful analysis and access, but does not generally start from the point-of-view of the … Peter McQuilton data warehouse A central repository for all or significant parts of the data that an organization's various business systems collect. A data warehouse tends to be a strategic but somewhat unfinished concept. Data warehousing emphasizes the capture of data from diverse sources for useful analysis and access, but does not generally start from the point-of-view of the … CASRAI. https://casrai.org/term/data-warehouse/ A serious problem caused by one or more ineffective data analysis processes. In addition to the financial burden, problems with data quality and analysis can have a serious impact on security, compliance, project management and human resource management, among others. Peter McQuilton data driven disaster A serious problem caused by one or more ineffective data analysis processes. In addition to the financial burden, problems with data quality and analysis can have a serious impact on security, compliance, project management and human resource management, among others. CASRAI. https://casrai.org/term/data-driven-disaster/ An identifier that uniquely distinguishes one set of data from all others. Peter McQuilton data identifier An identifier that uniquely distinguishes one set of data from all others.
CASRAI. https://casrai.org/term/data-identifier/ An approach to protecting sensitive data from unauthorized access by encrypting the data and storing different portions of a file on different servers. An unauthorized person would need to know the locations of the servers containing the parts, be able to get access to each server, know what data to combine, and how to decrypt it. Data splitting can be made even more effective by periodically retrieving and recombining the parts, and then splitting the data in a different way among different servers, and using a different encryption key. Peter McQuilton https://orcid.org/0000-0002-7702-4495 data splitting An approach to protecting sensitive data from unauthorized access by encrypting the data and storing different portions of a file on different servers. An unauthorized person would need to know the locations of the servers containing the parts, be able to get access to each server, know what data to combine, and how to decrypt it. Data splitting can be made even more effective by periodically retrieving and recombining the parts, and then splitting the data in a different way among different servers, and using a different encryption key. CASRAI. https://casrai.org/term/data-splitting/ Monitor the status of information access of different stakeholder groups. Evaluate new access request and authorize or decline it. Update the organizational information access overview. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang manage access Digital entity properties that are generated by the data management system (e.g., creation time; owner; storage location; data retention period; the length of time a digital entity will be retained). Peter McQuilton https://orcid.org/0000-0002-7702-4495 system metadata Digital entity properties that are generated by the data management system (e.g., creation time; owner; storage location; data retention period; the length of time a digital entity will be retained). CASRAI. 
https://casrai.org/term/system-metadata/ Peter McQuilton Philippe Rocca-Serra Susanna Sansone https://orcid.org/0000-0002-7702-4495 tactical/short-term planning An electronic version of the paper record that doctors have traditionally maintained for their patients and which is typically only accessible within the facility or office that controls it. Peter McQuilton electronic medical record An electronic version of the paper record that doctors have traditionally maintained for their patients and which is typically only accessible within the facility or office that controls it. CASRAI. https://casrai.org/term/electronic-medical-record/ Bin for Aptitudes related to Provenance information management. leightonlc aptitudes for provenance information management Data modeling formalizes and documents existing processes and events. It captures and translates complex system designs into easily understood representations of the data flows and processes, creating a blueprint for construction and/or re-engineering. Peter McQuilton data modeling Data modeling formalizes and documents existing processes and events. It captures and translates complex system designs into easily understood representations of the data flows and processes, creating a blueprint for construction and/or re-engineering. CASRAI. https://casrai.org/term/data-modeling/ To be able to search and understand the results from dataset aggregator sites. To understand the implications of the provenance of the data and how to integrate and analyse data with differing metadata. Peter McQuilton Searching aggregator sites use of aggregator sites To be able to search and understand the results from dataset aggregator sites. To understand the implications of the provenance of the data and how to integrate and analyse data with differing metadata.
[PMQ] Implement the policies that govern the choice of metadata schema, reserved vocabularies, metadata organization in tables, and metadata properties (creation date, access control, ownership, etc.). Peter McQuilton https://orcid.org/0000-0002-7702-4495 manage metadata catalog Implement the policies that govern the choice of metadata schema, reserved vocabularies, metadata organization in tables, and metadata properties (creation date, access control, ownership, etc.). CASRAI. https://casrai.org/term/manage-metadata-catalog Manipulation of raw data to produce a single output. Peter McQuilton https://orcid.org/0000-0002-7702-4495 data transformation Manipulation of raw data to produce a single output. CASRAI. https://casrai.org/term/data-transformation/ An intellectual process of describing objects in accordance with accepted library principles, particularly those of subject and classification order. Peter McQuilton cataloguing An intellectual process of describing objects in accordance with accepted library principles, particularly those of subject and classification order. CASRAI. https://casrai.org/term/cataloguing/ Initiate and develop processes for crediting the contributions of researchers and professional groups towards making FAIR outputs. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang design and apply processes for attribution A collection of datasets sharing the same product specification. A dataset series is a type of aggregation or collection with some logical grouping such as by a topic (specification) with the (product) unit being a dataset series. Example: A series of earth observations. Each year, month or week (depending on the volume) might be a … Peter McQuilton dataset series A collection of datasets sharing the same product specification.
A dataset series is a type of aggregation or collection with some logical grouping such as by a topic (specification) with the (product) unit being a dataset series. Example: A series of earth observations. Each year, month or week (depending on the volume) might be a … CASRAI. https://casrai.org/term/dataset-series/ A collection of descriptions of the data objects or items in a data model. Peter McQuilton data dictionary A collection of descriptions of the data objects or items in a data model. CASRAI. https://casrai.org/term/data-dictionary/ A process that creates a new dataset from an original source. Examples include: creating a subset of the data, querying a database. Peter McQuilton https://orcid.org/0000-0002-7702-4495 data selection A process that creates a new dataset from an original source. Examples include: creating a subset of the data, querying a database. CASRAI. https://casrai.org/term/data-selection/ Define data access policy based on project requirements. Provide access to authorised parties. Celia van Gelder Mateusz Kuzak Yan Wang apply data access policy The ability to find and comprehend data produced by people other than yourself. Peter McQuilton https://orcid.org/0000-0002-7702-4495 Data search AL 15.3.22: tidied label from "discover other peoples data" to "Data discovery". data discovery The ability to find and comprehend data produced by people other than yourself. [PMQ] Data that are delivered with all associated metadata, data dictionary, description of methods and instruments used to collect and process the data, and other supporting data (e.g., duplicate sample results, replicate analyses, percent recovery, etc.).
Peter McQuilton documented data Data that are delivered with all associated metadata, data dictionary, description of methods and instruments used to collect and process the data, and other supporting data (e.g., duplicate sample results, replicate analyses, percent recovery, etc.). CASRAI. https://casrai.org/term/documented-data/ Peter McQuilton Philippe Rocca-Serra Susanna Sansone https://orcid.org/0000-0002-7702-4495 conducting operations Data the content of which is changing frequently and at asynchronous moments. Examples include: Data streams that are generated by sensors when it is unpredictable when data segments will appear in time (i.e. data streams have gaps); Data streams that are generated by humans in crowdsourcing scenarios where it is not clear when which cell in a database will be filled. Peter McQuilton https://orcid.org/0000-0002-7702-4495 dynamic data Data the content of which is changing frequently and at asynchronous moments. Examples include: Data streams that are generated by sensors when it is unpredictable when data segments will appear in time (i.e. data streams have gaps); Data streams that are generated by humans in crowdsourcing scenarios where it is not clear when which cell in a database will be filled. CASRAI. https://casrai.org/term/dynamic-data/ A design covering a class of frameworks with the following characteristics: (1) it can be used to generate more specific models that still belong to the class and (2) it can be used to compare a concrete framework design to identify whether it belongs to the same class. Peter McQuilton https://orcid.org/0000-0002-7702-4495 reference model A design covering a class of frameworks with the following characteristics: (1) it can be used to generate more specific models that still belong to the class and (2) it can be used to compare a concrete framework design to identify whether it belongs to the same class. CASRAI. 
https://casrai.org/term/reference-model Peter McQuilton 2020-10-01T20:52:57.551905Z online workbook engaging in open innovation beyond academia Data that have gone through a registration process and have been assigned an identifier metadata to aid in their search and retrieval. Peter McQuilton https://orcid.org/0000-0002-7702-4495 registered data Data that have gone through a registration process and have been assigned an identifier metadata to aid in their search and retrieval. CASRAI. https://casrai.org/term/registered-data In the context of data and network security: The assurance that information can only be accessed or modified by those authorized to do so. Measures taken to ensure integrity include controlling the physical environment of networked terminals and servers, restricting access to data, and maintaining rigorous authentication practices. Data integrity can also be threatened by environmental hazards, such as heat, dust, and electrical surges. Peter McQuilton https://orcid.org/0000-0002-7702-4495 integrity In the context of data and network security: The assurance that information can only be accessed or modified by those authorized to do so. Measures taken to ensure integrity include controlling the physical environment of networked terminals and servers, restricting access to data, and maintaining rigorous authentication practices. Data integrity can also be threatened by environmental hazards, such as heat, dust, and electrical surges. CASRAI. https://casrai.org/term/integrity/ Incorporates: building a digital collection of information for further study and analysis; creating appropriate tools for collection- building; creating appropriate tools for the analysis and study of collections; using digital collections and analytical tools to generate new intellectual products; and, Creating authoring tools for these new intellectual products, either in traditional forms or in digital form. 
Peter McQuilton https://orcid.org/0000-0002-7702-4495 digital scholarship Incorporates: building a digital collection of information for further study and analysis; creating appropriate tools for collection- building; creating appropriate tools for the analysis and study of collections; using digital collections and analytical tools to generate new intellectual products; and, Creating authoring tools for these new intellectual products, either in traditional forms or in digital form. CASRAI. https://casrai.org/term/digital-scholarship/ Bin for types of Knowledge related to Resource management leightonlc a knowledge for resource management Bin for Aptitudes related to Resource management. leightonlc aptitudes for resource management Bin for types of Knowledge related to Storage management. leightonlc knowledge of storage management Know about the open access publishing procedure, journals' policies on OA publishing, project funder and institutions policy as well as the financial support on OA publishing Celia van Gelder Mateusz Kuzak Yan Wang understand open access publishing OBSOLETE. The use of persistent identifiers or PIDs to uniquely and persistently identify an entity. Nancy Hoebelheinrich Peter McQuilton https://orcid.org/0000-0002-7702-4495 https://github.com/terms4fairskills/FAIRterminology/issues/21 This term is redundant with http://purl.obolibrary.org/obo/T4FS_0000388 and has been deprecated. use of persistent, resolvable identifiers (pids) true OBSOLETE. The use of persistent identifiers or PIDs to uniquely and persistently identify an entity. [PMQ 3.2022] A process in which files are first parsed (assigned to appropriate fields in a record) and then translated to a common format. For example, if an original record had the client’s name and address as “Bob Jones, VP Acme. Co., 15 S. Main St, Brooklyn” the standardized record might read “Bob Jones, Vice President, Acme Corporation, 15 South Main Street, Brooklyn, New York”. 
Data often lack consistency simply because there are many ways of saying the same thing. Standardizing the record ensures that when a query is run for a particular field, accurate results will be returned. Peter McQuilton https://orcid.org/0000-0002-7702-4495 record standardization A process in which files are first parsed (assigned to appropriate fields in a record) and then translated to a common format. For example, if an original record had the client’s name and address as “Bob Jones, VP Acme. Co., 15 S. Main St, Brooklyn” the standardized record might read “Bob Jones, Vice President, Acme Corporation, 15 South Main Street, Brooklyn, New York”. Data often lack consistency simply because there are many ways of saying the same thing. Standardizing the record ensures that when a query is run for a particular field, accurate results will be returned. CASRAI. https://casrai.org/term/record-standardization Peter McQuilton Peter McQuilton 2020-10-01T20:53:59.107153Z online documentation A catalogue containing metadata records in XML-encoded (machine-readable and human-readable) format that enables services to find data and services. Peter McQuilton https://orcid.org/0000-0002-7702-4495 Peter McQuilton metadata catalogue A catalogue containing metadata records in XML-encoded (machine-readable and human-readable) format that enables services to find data and services. https://casrai.org/term/metadata-catalogue/ Learn about the types of patents and their legal implications. Understand the application procedure of patents. Celia van Gelder Mateusz Kuzak Yan Wang understand legal background of patents The layout of a file in terms of how the data within the file are organized. A program that uses the data in a file must be able to recognize and possibly access data within the file. Peter McQuilton data file format The layout of a file in terms of how the data within the file are organized.
A program that uses the data in a file must be able to recognize and possibly access data within the file. CASRAI. https://casrai.org/term/data-file-format/ A curated collection of metadata about datasets and their data elements. Peter McQuilton data catalogue A curated collection of metadata about datasets and their data elements. CASRAI. https://casrai.org/term/data-catalogue/ A type of workflow that includes active steps to curate data as an aid to on-going management of data through its lifecycle. Peter McQuilton curation workflow A type of workflow that includes active steps to curate data as an aid to on-going management of data through its lifecycle. CASRAI. https://casrai.org/term/curation-workflow/ Peter McQuilton A1.1 the protocol is open, free, and universally implementable Planning for and controlling the present and future costs of the conservation of data, including technical storage, fixed costs, and staff resources. Kristina Hettne Leighton Christiansen Simon Hodson Victoria Dominguez Del Angel preservation costs management Planning for and controlling the present and future costs of the conservation of data, including technical storage, fixed costs, and staff resources. Based on https://www.cessda.eu/About/Projects/Past-projects/CESSDA-SaW/WP4/Cost-Benefit-Advocacy-Toolkit/Costs-Factsheet [LLC] The person who is tasked with delivering a project within the boundaries and framework established by the program manager. The project manager is and should be delivery and execution focused and is judged on the elements of time, cost, and scope of the project. The person responsible for ensuring that the Project Team completes the project. The Project Manager develops the Project Plan with the team and manages the team's performance of project tasks. It is also the responsibility of the Project Manager to secure acceptance and approval of deliverables from the Project Sponsor and Stakeholders.
The Project Manager is responsible for communication, including status reporting, risk management, escalation of issues that cannot be resolved in the team, and, in general, making sure the project is delivered in budget, on schedule, and within scope. Peter McQuilton https://orcid.org/0000-0002-7702-4495 project manager The person who is tasked with delivering a project within the boundaries and framework established by the program manager. The project manager is and should be delivery and execution focused and is judged on the elements of time, cost, and scope of the project. The person responsible for ensuring that the Project Team completes the project. The Project Manager develops the Project Plan with the team and manages the team's performance of project tasks. It is also the responsibility of the Project Manager to secure acceptance and approval of deliverables from the Project Sponsor and Stakeholders. The Project Manager is responsible for communication, including status reporting, risk management, escalation of issues that cannot be resolved in the team, and, in general, making sure the project is delivered in budget, on schedule, and within scope. CASRAI. https://casrai.org/term/project-manager Celia van Gelder Mateusz Kuzak Peter McQuilton Yan Wang https://orcid.org/0000-0002-7702-4495 Release data AL 9.3.22. Removed from 'FAIR4S defined activity' class to better integrate with the T4FS hierarchy. AL 5.5.22: removed "Publishing and archiving data" and "Publish and archive data" alternative terms, as the two actions are not the same. publish data https://orcid.org/0000-0002-7702-4495 Access management AL 16.3.22: Changed 'information security and access management' such that access management went in as an alternative term. If required in future, we may wish to separate it out completely. information security Includes all activities involved in the planning, collecting, processing, analysis and maintenance of data in the original research project.
Among these activities are selecting a study design, constructing instruments for data collection, conducting data collection/creation, performing data editing/verification/validation, analyzing data, backing up data versions and preparing and tagging metadata. Peter McQuilton data production Includes all activities involved in the planning, collecting, processing, analysis and maintenance of data in the original research project. Among these activities are selecting a study design, constructing instruments for data collection, conducting data collection/creation, performing data editing/verification/validation, analyzing data, backing up data versions and preparing and tagging metadata. CASRAI. https://casrai.org/term/data-production/ Data that have not been processed for meaningful use. Although raw data have the potential to become information, they require selective extraction, organization, and sometimes analysis and formatting for presentation. As a result of processing, raw data sometimes end up in a database, which enables the data to become accessible for further processing and analysis. Peter McQuilton https://orcid.org/0000-0002-7702-4495 raw data Data that have not been processed for meaningful use. Although raw data have the potential to become information, they require selective extraction, organization, and sometimes analysis and formatting for presentation. As a result of processing, raw data sometimes end up in a database, which enables the data to become accessible for further processing and analysis. CASRAI. https://casrai.org/term/raw-data Testing conducted to evaluate whether systems or components pass data and control correctly to each other. Peter McQuilton https://orcid.org/0000-0002-7702-4495 interface testing Testing conducted to evaluate whether systems or components pass data and control correctly to each other. CASRAI. 
https://casrai.org/term/interface-testing/ A digital object is editable, interactive, accessible and modifiable by means of digital objects other than the one governing its behaviour, and is distributed over information infrastructures. It is a machine-independent data structure consisting of one or more elements in digital form that can be parsed by different information systems; the structure helps to enable interoperability among diverse information systems in the Internet. Peter McQuilton https://orcid.org/0000-0002-7702-4495 digital object A digital object is editable, interactive, accessible and modifiable by means of digital objects other than the one governing its behaviour, and is distributed over information infrastructures. It is a machine-independent data structure consisting of one or more elements in digital form that can be parsed by different information systems; the structure helps to enable interoperability among diverse information systems in the Internet. CASRAI. https://casrai.org/term/digital-object/ A broad term encompassing: (a) digital surrogates created as a result of converting analogue materials to digital form (digitisation); (b) born digital for which there has never been and is never intended to be an analogue equivalent; and, (c) digital records. Peter McQuilton https://orcid.org/0000-0002-7702-4495 digital materials A broad term encompassing: (a) digital surrogates created as a result of converting analogue materials to digital form (digitisation); (b) born digital for which there has never been and is never intended to be an analogue equivalent; and, (c) digital records. CASRAI. https://casrai.org/term/digital-materials/ Understand the needs and conditions of information access for different stakeholder groups. Identify the responsibility and rights for each stakeholder group on information access. Build up a catalogue of risk profiles. For each risk profile, develop mitigation protocols.
Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang define access policy Bin for Skills needed for Provenance information management. leightonlc 2019-10-17T15:17:02.313138Z skills related to provenance information management The ability to access and download data from a repository. Peter McQuilton https://orcid.org/0000-0002-7702-4495 remote data access The ability to access and download data from a repository. CASRAI. https://casrai.org/term/remote-data-access The act of interpreting an author's intended use of a word that has multiple meanings or spellings. Peter McQuilton https://orcid.org/0000-0002-7702-4495 disambiguation The act of interpreting an author's intended use of a word that has multiple meanings or spellings. CASRAI. https://casrai.org/term/dissambuation/ Data compliance consists of the ongoing processes to ensure adherence of data to both enterprise business rules (government department, university, industry, or agency), and to legal, regulatory and accreditation requirements. Peter McQuilton data compliance Data compliance consists of the ongoing processes to ensure adherence of data to both enterprise business rules (government department, university, industry, or agency), and to legal, regulatory and accreditation requirements. CASRAI. https://casrai.org/term/data-compliance/ OBSOLETE. Use of content outside of its original intention. Peter McQuilton https://orcid.org/0000-0002-7702-4495 https://github.com/terms4fairskills/FAIRterminology/issues/15 AL 2.3.23: Removed because of its high level of similarity with the 'R' FAIR principle. re-use true OBSOLETE. Use of content outside of its original intention. CASRAI. https://casrai.org/term/re-use trainer/teacher Bin for Aptitudes related to Identity management. leightonlc aptitudes for identity management Know the security requirements on the organization regarding different types of information Know the current organizational policy, infrastructure and capacity on information security. 
Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang understand information security challenges Bin for Aptitudes related to Authorization management. leightonlc aptitudes for authorisation management Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang https://orcid.org/0000-0002-7702-4495 AL 17.3.22: Updated to "Assessment" to match the style of its sibling classes. AL 3.22: Was "govern and assess", which is a compound concept. Split into two classes, Assess and Governance. Original source: FAIR4S defined activity. assessment The World Wide Web Consortium’s Platform for Personal Privacy Project (P3P) offers specific recommendations for practices that will let users define and share personal information with Web sites that they agree to share it with. The P3P incorporates a number of industry proposals, including the Open Profiling Standard (OPS). Using software that adheres to the P3P recommendations, users will be able to create a personal profile, all or parts of which can be made accessible to a Web site as the user directs. A tool that will help a user decide whether to trust a given Website with personal information is a Statement of Privacy Policy that a Web site can post. Peter McQuilton https://orcid.org/0000-0002-7702-4495 personal information privacy The World Wide Web Consortium’s Platform for Personal Privacy Project (P3P) offers specific recommendations for practices that will let users define and share personal information with Web sites that they agree to share it with. The P3P incorporates a number of industry proposals, including the Open Profiling Standard (OPS). Using software that adheres to the P3P recommendations, users will be able to create a personal profile, all or parts of which can be made accessible to a Web site as the user directs. A tool that will help a user decide whether to trust a given Website with personal information is a Statement of Privacy Policy that a Web site can post. CASRAI. 
https://casrai.org/term/personal-information-privacy Demonstrate ability to involve others and share responsibility for applying FAIR principles. Angus Whyte ability to solve fair implementation problems collaboratively Demonstrate ability to involve others and share responsibility for applying FAIR principles. [PMQ, 3.2022] Bin for Skills needed for Cloud computing environment management. leightonlc 2019-10-17T15:14:37.180683Z skills related to cloud computing environment management A managed process, throughout the data lifecycle, by which data & data collections are cleansed, documented, standardized, formatted and inter-related. This includes versioning data, or forming a new collection from several data sources, annotating with metadata, adding codes to raw data (e.g., classifying a galaxy image with a galaxy type such as spiral). Peter McQuilton data curation A managed process, throughout the data lifecycle, by which data & data collections are cleansed, documented, standardized, formatted and inter-related. This includes versioning data, or forming a new collection from several data sources, annotating with metadata, adding codes to raw data (e.g., classifying a galaxy image with a galaxy type such as spiral). CASRAI. https://casrai.org/term/data-curation/ Cite contributions to data work in published literature which draws upon that data. Laura Molloy Peter McQuilton cite contributions Cite contributions to data work in published literature which draws upon that data. [LM] Ongoing organizational activities associated with supporting functional elements, as opposed to project elements. Operational management also includes support of products that the organization has created through project activity. Peter McQuilton https://orcid.org/0000-0002-7702-4495 operational management Ongoing organizational activities associated with supporting functional elements, as opposed to project elements.
Operational management also includes support of products that the organization has created through project activity. CASRAI. https://casrai.org/term/operational-management/ High-quality data are complete, timely, accurate, consistent, relevant, reliable, traceable, cleaned, validated, and well documented. Peter McQuilton high quality data High-quality data are complete, timely, accurate, consistent, relevant, reliable, traceable, cleaned, validated, and well documented. CASRAI. https://casrai.org/term/high-quality-data/ Understand what reuse is permitted according to the licenses or terms and conditions applicable to services, tools and their content. Angus Whyte how to make your data reuseable Peter McQuilton I1. (meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation 1. The organization or structure for a database. The activity of data modeling leads to a schema. (The plural form is schemata.) The term is used in discussing both relational databases and object-oriented databases. The term sometimes seems to refer to a visualization of a structure and sometimes to a formal text-oriented description. Two common types of database schemata are the star schema and the snowflake schema. 2. A formal expression of an inference rule for artificial intelligence (AI) computing. The expression is a generalized axiom in which specific values or cases are substituted for each symbol in the axiom to derive a specific inference. Peter McQuilton https://orcid.org/0000-0002-7702-4495 schema 1. The organization or structure for a database. The activity of data modeling leads to a schema. (The plural form is schemata.) The term is used in discussing both relational databases and object-oriented databases. The term sometimes seems to refer to a visualization of a structure and sometimes to a formal text-oriented description. Two common types of database schemata are the star schema and the snowflake schema. 2. 
A formal expression of an inference rule for artificial intelligence (AI) computing. The expression is a generalized axiom in which specific values or cases are substituted for each symbol in the axiom to derive a specific inference. CASRAI. https://casrai.org/term/schema A service that provides a connection between a PID and its target object. Peter McQuilton https://orcid.org/0000-0002-7702-4495 pid service A service that provides a connection between a PID and its target object. CASRAI. https://casrai.org/term/pid-service/ Foundational interoperability allows data exchange from one information technology system to be received by another and does not require the ability for the receiving information technology system to interpret the data. REFERENCE. Healthcare information management and systems society Peter McQuilton foundational interoperability Foundational interoperability allows data exchange from one information technology system to be received by another and does not require the ability for the receiving information technology system to interpret the data. REFERENCE. Healthcare information management and systems society CASRAI. https://casrai.org/term/foundational-interoperability/ The activities of data policies, data planning, data element standardization, information management control, data synchronization, data sharing, and database development, including practices and projects that acquire, control, protect, deliver and enhance the value of data and information. Peter McQuilton data management The activities of data policies, data planning, data element standardization, information management control, data synchronization, data sharing, and database development, including practices and projects that acquire, control, protect, deliver and enhance the value of data and information. CASRAI. 
https://casrai.org/term/data-management/ Authorization management is concerned with people's access to different objects, most often to data or physical objects, such as land, buildings, rooms or infrastructure. Kristina Hettne Leighton Christiansen Simon Hodson Victoria Dominguez Del Angel authorisation management Authorization management is concerned with people's access to different objects, most often to data or physical objects, such as land, buildings, rooms or infrastructure. https://managementmania.com/en/authorization-management Bin for types of Knowledge related to Authorization management. leightonlc knowledge of authorisation management The ability to search repositories and knowledge-bases. Peter McQuilton Repository search database search searching databases searching repositories search repositories and knowledge-bases The ability to search repositories and knowledge-bases. [PMQ] An ecosystem that includes not only traditional elements of cloud computing such as software and infrastructure, but also consultants, integrators, partners, third parties and anything in their environments that has a bearing on the other components. Peter McQuilton cloud ecosystem An ecosystem that includes not only traditional elements of cloud computing such as software and infrastructure, but also consultants, integrators, partners, third parties and anything in their environments that has a bearing on the other components. CASRAI. https://casrai.org/term/cloud-ecosystem/ Learn about data licensing, the types of data licences and their legal implications. Understand the application procedure of data licences. Celia van Gelder Mateusz Kuzak Yan Wang understand legal background of licensing A place or collection containing static records, documents, or other materials for long-term preservation. Peter McQuilton archive A place or collection containing static records, documents, or other materials for long-term preservation. CASRAI. 
https://casrai.org/term/archive/ An infrastructure component that provides reliable, long-term access to managed digital resources. It stores, manages, and curates digital objects and returns their bit streams when a request is issued. Trusted repositories undergo regular assessments according to a set of rules such as defined by Data Seal of Approval (DSA) or TRAC (ISO 16363). It is well understood that such an assessment has the potential of increasing trust from its depositors and users, but it will not be the only criterion for users. Repositories can be at different stages of assessments. However, it is evident that certain quality criteria need to be met to distinguish trusted repositories from all types of other entities that store data such as notebooks or lab servers. Peter McQuilton https://orcid.org/0000-0002-7702-4495 trusted digital repository An infrastructure component that provides reliable, long-term access to managed digital resources. It stores, manages, and curates digital objects and returns their bit streams when a request is issued. Trusted repositories undergo regular assessments according to a set of rules such as defined by Data Seal of Approval (DSA) or TRAC (ISO 16363). It is well understood that such an assessment has the potential of increasing trust from its depositors and users, but it will not be the only criterion for users. Repositories can be at different stages of assessments. However, it is evident that certain quality criteria need to be met to distinguish trusted repositories from all types of other entities that store data such as notebooks or lab servers. CASRAI. https://casrai.org/term/trusted-digital-repository/ The function of managing the physical aspects of data resources, including database design and integrity, backup and recovery, performance and tuning. 
Peter McQuilton https://orcid.org/0000-0002-7702-4495 database administration The function of managing the physical aspects of data resources, including database design and integrity, backup and recovery, performance and tuning. CASRAI. https://casrai.org/term/database-administration/ REFERENCE. DAMA Dictionary of Data Management Bin for types of Knowledge related to Identity management leightonlc knowledge of identity management Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang quality assessment A specialized format for organizing and storing data. General data structure types include the array, the file, the record, the table, the tree, and so on. Any data structure is designed to organize data to suit a specific purpose so that it can be accessed and worked with in appropriate ways.... Peter McQuilton Data format data structure A specialized format for organizing and storing data. General data structure types include the array, the file, the record, the table, the tree, and so on. Any data structure is designed to organize data to suit a specific purpose so that it can be accessed and worked with in appropriate ways.... CASRAI. https://casrai.org/term/data-structure/ Information systems and technology infrastructure manager, expert, or technician. Peter McQuilton information technology specialist Information systems and technology infrastructure manager, expert, or technician. CASRAI. https://casrai.org/term/information-technology-specialist/ A curation activity that ensures that data are properly selected, stored, and can be accessed, and for which logical and physical integrity are maintained over time, including security and authenticity. Peter McQuilton archiving A curation activity that ensures that data are properly selected, stored, and can be accessed, and for which logical and physical integrity are maintained over time, including security and authenticity. CASRAI. 
https://casrai.org/term/archiving/ Bin for types of Knowledge related to Preservation costs management. leightonlc 2019-10-17T15:17:32.816776Z knowledge of preservation costs management A phase of development where the product is tested in the real world by the intended audience. The experiences of the early users are forwarded back to the developers who make final changes before releasing the product. Peter McQuilton https://orcid.org/0000-0002-7702-4495 user acceptance testing A phase of development where the product is tested in the real world by the intended audience. The experiences of the early users are forwarded back to the developers who make final changes before releasing the product. CASRAI. https://casrai.org/term/user-acceptance-testing/ A description with very little curation that would include at least a name and PID of a data object. Minimal metadata is only marginally targeted at discovery since there is much better infrastructure to accomplish this. https://orcid.org/0000-0002-7702-4495 minimal metadata A description with very little curation that would include at least a name and PID of a data object. Minimal metadata is only marginally targeted at discovery since there is much better infrastructure to accomplish this. CASRAI, https://casrai.org/term/minimal-metadata/ Metadata exposure is the process of finding, identifying, selecting and acquiring/obtaining access to a metadata entity. Nancy Hoebelheinrich https://orcid.org/0000-0002-7702-4495 AL 22.3.22: Refactored from 'metadata creation and exposure'. Metadata creation has moved to the Curation hierarchy. metadata exposure Metadata exposure is the process of finding, identifying, selecting and acquiring/obtaining access to a metadata entity. AL 22.3.22, and see also FRBR User Tasks at: https://sites.google.com/site/metadatastandards/chapter-6/6-3-frbr-user-tasks. 
The process of destroying data stored on tapes, hard disks and other forms of electronic media so that it is completely unreadable and cannot be accessed or used. data destruction The process of destroying data stored on tapes, hard disks and other forms of electronic media so that it is completely unreadable and cannot be accessed or used. CASRAI. https://casrai.org/term/data-destruction/ The provision of training materials and events in and around good data stewardship. Peter McQuilton Philippe Rocca-Serra Susanna Sansone https://orcid.org/0000-0002-7702-4495 fair stewardship training The provision of training materials and events in and around good data stewardship. PMQ The ability to have an innovative approach to research by creating new or modified current concepts, theories, approaches and/or solutions. Peter McQuilton creativity The ability to have an innovative approach to research by creating new or modified current concepts, theories, approaches and/or solutions. CASRAI. https://casrai.org/term/creativity/ 1. A field or column in a database table. It is an abbreviation for 'physical data attribute' which is a single data element related to a data object, such as a table in a database. The database schema associates one or more attributes with each database entity (i.e. table). 2. A term for a logical or conceptual attribute such as in an entity-attribute-relationship (EAR) data model. Peter McQuilton data table attribute 1. A field or column in a database table. It is an abbreviation for 'physical data attribute' which is a single data element related to a data object, such as a table in a database. The database schema associates one or more attributes with each database entity (i.e. table). 2. A term for a logical or conceptual attribute such as in an entity-attribute-relationship (EAR) data model. CASRAI. 
https://casrai.org/term/data-table-attribute/ A standard developed through the cooperation of all parties who have an interest in participating in its development and/or use. Peter McQuilton Community standard consensus standard A standard developed through the cooperation of all parties who have an interest in participating in its development and/or use. CASRAI. https://casrai.org/term/consensus-standard/ Identity Management (IdM), also called Identity and Access Management (IAM), refers to a framework of policies and technologies for ensuring that the proper user in an organisation has the appropriate access to technology resources. IdM systems fall under the overarching umbrella of IT security. IdM systems identify, authenticate and authorize not only the individuals who will be using IT resources, but also the hardware and applications users need to access. Kristina Hettne Leighton Christiansen Simon Hodson Victoria Dominguez Del Angel IAM IdM Identity and Access Management identity management Identity Management (IdM), also called Identity and Access Management (IAM), refers to a framework of policies and technologies for ensuring that the proper user in an organisation has the appropriate access to technology resources. IdM systems fall under the overarching umbrella of IT security. IdM systems identify, authenticate and authorize not only the individuals who will be using IT resources, but also the hardware and applications users need to access. https://en.wikipedia.org/wiki/Identity_management Peter McQuilton reuse of digital assets Explore the potential application domains and societal implications of the project work and the communities of users/partners. Celia van Gelder Mateusz Kuzak Yan Wang investigate open innovation opportunities The process of restoring data that have been lost, accidentally deleted, corrupted or made inaccessible for any reason. 
The data recovery process may vary, depending on the circumstances of the data loss, the data recovery software used to create backups, and backup target media. In some cases, end users may be able to restore lost... Peter McQuilton data recovery The process of restoring data that have been lost, accidentally deleted, corrupted or made inaccessible for any reason. The data recovery process may vary, depending on the circumstances of the data loss, the data recovery software used to create backups, and backup target media. In some cases, end users may be able to restore lost... CASRAI. https://casrai.org/term/data-quality-review/ Peter McQuilton R1. (meta)data are richly described with a plurality of accurate and relevant attributes To be able to define and deploy appropriate criteria, based on the FAIR principles, to identify the appropriate repository for a dataset. This may, for example, involve discovering the mark-up of the repository, the exchange formats and data models used, and the licensing information for each repository. Tools such as the FAIR evaluator or FAIRshake may be used to assess the FAIRness of a repository based on human-entered questionnaires or FAIRsharing resource metadata. Peter McQuilton FAIR data submission Repository FAIRness fair evaluation of repositories for data deposition To be able to define and deploy appropriate criteria, based on the FAIR principles, to identify the appropriate repository for a dataset. This may, for example, involve discovering the mark-up of the repository, the exchange formats and data models used, and the licensing information for each repository. Tools such as the FAIR evaluator or FAIRshake may be used to assess the FAIRness of a repository based on human-entered questionnaires or FAIRsharing resource metadata. [PMQ] A Regional standard is one that applies across a multi-nation region. 
Most countries have their own national standards-making bodies, which in turn may also group together to make regional standards. For example, national standards bodies in Europe are also members of the European Committee for Standardization (CEN) as well as members of ISO. The use of such standards may be voluntary, or they may be referenced in regulation (therefore mandatory). Peter McQuilton https://orcid.org/0000-0002-7702-4495 AL 9.3.22: Removed reference to CASRAI (https://casrai.org/term/regional-standard/), as the definition was unsuitable. regional standard A Regional standard is one that applies across a multi-nation region. Most countries have their own national standards-making bodies, which in turn may also group together to make regional standards. For example, national standards bodies in Europe are also members of the European Committee for Standardization (CEN) as well as members of ISO. The use of such standards may be voluntary, or they may be referenced in regulation (therefore mandatory). Adapted by AL from ISO. https://www.iso.org/sites/ConsumersStandards/1_standards.html#section1_2 The process of setting up environments for workflow technologies. Kristina Hettne Simon Hodson Victoria Dominguez Del Angel workflow technologies management The process of setting up environments for workflow technologies. KH Data stewardship is a shared responsibility between Principal Investigators and data stewards. Principal Investigators are responsible for, and data stewards provide support for: (a) Data collection, data integration, or reuse of existing data; (b) Review of data quality; (c) Description of scientific workflow/process; (d) Provision of standards-compliant metadata; and, (e) Submission of data and data. Peter McQuilton Peter McQuilton data steward Data stewardship is a shared responsibility between Principal Investigators and data stewards. 
Principal Investigators are responsible for, and data stewards provide support for: (a) Data collection, data integration, or reuse of existing data; (b) Review of data quality; (c) Description of scientific workflow/process; (d) Provision of standards-compliant metadata; and, (e) Submission of data and data. CASRAI. https://casrai.org/term/data-steward/ Bin for Skills needed for Preservation costs management. leightonlc 2019-10-17T15:17:32.816013Z skills related to preservation costs management Peter McQuilton 2021-02-17T22:10:11.837519Z repository certification schemes A database containing information about trusted repositories that are provided by the repository managers and are useful for human and machine users. It is a registry information system on which a register is maintained. These registries do not contain information about all metadata descriptions of digital objects, nor do they offer a list of PIDs of all stored digital objects. They do offer information based on standardized types on how to retrieve such information (e.g., the port under which OAI-PMH can be accessed to offer metadata). It is a set of files containing identifiers assigned to items with descriptions of the associated items. It is assignment of a permanent, unique and unambiguous identifier to an item. Peter McQuilton https://orcid.org/0000-0002-7702-4495 registry A database containing information about trusted repositories that are provided by the repository managers and are useful for human and machine users. It is a registry information system on which a register is maintained. These registries do not contain information about all metadata descriptions of digital objects, nor do they offer a list of PIDs of all stored digital objects. They do offer information based on standardized types on how to retrieve such information (e.g., the port under which OAI-PMH can be accessed to offer metadata). 
It is a set of files containing identifiers assigned to items with descriptions of the associated items. It is assignment of a permanent, unique and unambiguous identifier to an item. CASRAI. https://casrai.org/term/registry To clearly communicate the existence of contributions from different individuals / projects to the compilation of a given dataset, in such a way that can be consistently cited. Laura Molloy attribution credit recognise and acknowledge contributions To clearly communicate the existence of contributions from different individuals / projects to the compilation of a given dataset, in such a way that can be consistently cited. [LM] Data where relationships/connections between them are available to allow easy data access. A typical case of a large Linked dataset is DBPedia (http://dbpedia.org/), which essentially makes the content of Wikipedia available in RDF. This related collection of interrelated datasets is stored on the Web and available via a common format -RDF. Peter McQuilton https://orcid.org/0000-0002-7702-4495 linked open data Data where relationships/connections between them are available to allow easy data access. A typical case of a large Linked dataset is DBPedia (http://dbpedia.org/), which essentially makes the content of Wikipedia available in RDF. This related collection of interrelated datasets is stored on the Web and available via a common format -RDF. CASRAI. https://casrai.org/term/linked-open-data/ Research data format is a generic term encompassing the concept of a standardised format for research data. Peter McQuilton https://orcid.org/0000-0002-7702-4495 AL 9.3.2022: The original CASRAI definition is unsuitable (https://casrai.org/term/research-data-format), therefore I have changed the definition to something more generic. research data format Research data format is a generic term encompassing the concept of a standardised format for research data. 
AL, 9.3.22 Ensures that the benefits to society of research outweigh any risks, from both an ethical and legal perspective. Peter McQuilton https://orcid.org/0000-0002-7702-4495 research governance Ensures that the benefits to society of research outweigh any risks, from both an ethical and legal perspective. CASRAI. https://casrai.org/term/research-governance/ The ability of computer systems to transmit data with unambiguous, shared meaning. Semantic interoperability is a requirement to enable machine computable logic, inferencing, knowledge discovery, and data federation between information systems. Semantic interoperability is achieved when the information transferred has, in its communicated form, all of the meaning required for the receiving system to interpret it correctly, even when the algorithms used by the receiving system are unknown to the sending system. Syntactic interoperability is a prerequisite to semantic interoperability. Peter McQuilton https://orcid.org/0000-0002-7702-4495 semantic interoperability The ability of computer systems to transmit data with unambiguous, shared meaning. Semantic interoperability is a requirement to enable machine computable logic, inferencing, knowledge discovery, and data federation between information systems. Semantic interoperability is achieved when the information transferred has, in its communicated form, all of the meaning required for the receiving system to interpret it correctly, even when the algorithms used by the receiving system are unknown to the sending system. Syntactic interoperability is a prerequisite to semantic interoperability. CASRAI. https://casrai.org/term/semantic-interoperability In the context of chemistry: The International Chemical Identifier is a non-proprietary identifier for chemical substances that can be used in printed and electronic data sources thus enabling easier linking of diverse data compilations. International Union of Pure and Applied Chemistry (IUPAC)REFERENCE. 
MIT data management and publishing Peter McQuilton international chemical identifier In the context of chemistry: The International Chemical Identifier is a non-proprietary identifier for chemical substances that can be used in printed and electronic data sources thus enabling easier linking of diverse data compilations. International Union of Pure and Applied Chemistry (IUPAC)REFERENCE. MIT data management and publishing CASRAI. https://casrai.org/term/international-chemical-identifier/ Activities and processes in a digital environment that lead to the publication of research data, associated metadata and accompanying documentation and software code on the Web. In contrast to interim or final published products, workflows are the means to curate, document, and review, and thus ensure and enhance the value of the published product. Workflows can involve both humans and machines and often humans are supported by technology as they perform steps in the workflow. Similar workflows may vary in the details depending on the research discipline, data publishing product and/or the host institution of the workflow (e.g., individual publisher/journal, institutional repository, discipline-specific repository). Peter McQuilton https://orcid.org/0000-0002-7702-4495 research data publication workflow Activities and processes in a digital environment that lead to the publication of research data, associated metadata and accompanying documentation and software code on the Web. In contrast to interim or final published products, workflows are the means to curate, document, and review, and thus ensure and enhance the value of the published product. Workflows can involve both humans and machines and often humans are supported by technology as they perform steps in the workflow. 
Similar workflows may vary in the details depending on the research discipline, data publishing product and/or the host institution of the workflow (e.g., individual publisher/journal, institutional repository, discipline-specific repository). CASRAI. https://casrai.org/term/research-data-publication-workflow/ The degree to which all required measures are known. Values may be designated as missing in order not to have empty cells, or missing values may be replaced with default or interpolated values. In the case of default or interpolated values, these must be flagged as such to distinguish them from actual measurements or observations. Peter McQuilton data completeness The degree to which all required measures are known. Values may be designated as missing in order not to have empty cells, or missing values may be replaced with default or interpolated values. In the case of default or interpolated values, these must be flagged as such to distinguish them from actual measurements or observations. CASRAI. https://casrai.org/term/data-completeness/ Data linkage where the resulting product has been de-identified. Peter McQuilton https://orcid.org/0000-0002-7702-4495 privacy-preserving data linkage Data linkage where the resulting product has been de-identified. CASRAI. https://casrai.org/term/privacy-preserving-data-linkage Celia van Gelder Mateusz Kuzak Yan Wang https://orcid.org/0000-0002-7702-4495 AL 22.3.22: refactored 'open access publishing and self-archiving' to be two separate concepts ('open access publishing', and 'self-archiving') according to T4FS best practices. Self-archiving did not have to be created because we already had self-archive, which was then moved to the Archiving hierarchy. 'publish open access' was also removed in favour of this term as they were too similar to have without the potential for confusion by users. open access publishing Prepare the data in preferred types and the data documentation. 
Choose the data repository and data license, deposit the data in the repository. Celia van Gelder Mateusz Kuzak Yan Wang https://orcid.org/0000-0002-7702-4495 self-archiving A person having a broad knowledge of information management disciplines and who provides guidance and support to program and staff functions on all aspects of managing the information resource. Peter McQuilton information management advisor A person having a broad knowledge of information management disciplines and who provides guidance and support to program and staff functions on all aspects of managing the information resource. CASRAI. https://casrai.org/term/information-management-advisor/ A formal statement describing how research data will be managed and documented throughout a research project and the terms regarding the subsequent deposit of the data with a data repository for long-term management and preservation. Peter McQuilton data management plan A formal statement describing how research data will be managed and documented throughout a research project and the terms regarding the subsequent deposit of the data with a data repository for long-term management and preservation. CASRAI. https://casrai.org/term/data-management-plan/ Peter McQuilton F2. data are described with rich metadata Peter McQuilton Data license R1.1 (meta)data are released with a clear and accessible data usage license A sequence of digitally encoded, coherent signals used to send or receive a representation of information content as transmitted. Peter McQuilton data stream A sequence of digitally encoded, coherent signals used to send or receive a representation of information content as transmitted. CASRAI. https://casrai.org/term/data-stream/ Data experts who have a librarian background. Data librarians often carry out curation and metadata related work. There is much overlap between data librarians, data managers, and data stewards. Peter McQuilton data librarian Data experts who have a librarian background. 
Data librarians often carry out curation and metadata related work. There is much overlap between data librarians, data managers, and data stewards. CASRAI. https://casrai.org/term/data-librarian/ The process of developing, communicating, implementing, monitoring, and assuring the policies, procedures, organizational structures, and practices associated with a given program. Peter McQuilton https://orcid.org/0000-0002-7702-4495 program governance The process of developing, communicating, implementing, monitoring, and assuring the policies, procedures, organizational structures, and practices associated with a given program. CASRAI. https://casrai.org/term/program-governance/ Bin for types of Knowledge related to Workflow technologies management. leightonlc 2019-10-17T15:16:10.561925Z knowledge of workflow technologies management Obtain an overview of information access status for different stakeholder groups. Check this overview against the organizational information access policy and risk catalogue, and identify the type of information security risks based on the mismatch between the current status and policy. Choose and implement the right protocol to mitigate the risk. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang access risk assessment and mitigation Data that can be understood and used without additional information. Usable data are delivered in a form that meets the needs of different end-user audiences, is ready for the tasks that the end-user needs to accomplish, and that has been adapted to the end-user’s needs (not the other way around). Usable data have been cleaned, structured, are in machine readable format, fully documented, and ready for analysis and interpretation. Peter McQuilton https://orcid.org/0000-0002-7702-4495 usable data Data that can be understood and used without additional information. 
Usable data are delivered in a form that meets the needs of different end-user audiences, is ready for the tasks that the end-user needs to accomplish, and that has been adapted to the end-user’s needs (not the other way around). Usable data have been cleaned, structured, are in machine readable format, fully documented, and ready for analysis and interpretation. CASRAI. https://casrai.org/term/usable-data Peter McQuilton Peter McQuilton 2020-10-01T20:52:43.543314Z presentation slides Configure secure storage and monitor its usage Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang implement secure storage Refers to all the stages in the existence of digital information from creation to destruction. A lifecycle view is used to enable active management of the data objects and resource over time, thus maintaining accessibility and usability. Peter McQuilton data lifecycle Refers to all the stages in the existence of digital information from creation to destruction. A lifecycle view is used to enable active management of the data objects and resource over time, thus maintaining accessibility and usability. CASRAI. https://casrai.org/term/data-lifecycle/ Kristina Hettne Victoria Dominguez Del Angel Yann Le Franc format and media migration A repository for persistently storing collections of data, such as a database, a file system or a directory. The data stored can be of any type that can be rendered in digital format and placed in electronic media. Examples include text, image, video files and audio files. Peter McQuilton data store A repository for persistently storing collections of data, such as a database, a file system or a directory. The data stored can be of any type that can be rendered in digital format and placed in electronic media. Examples include text, image, video files and audio files. CASRAI. 
https://casrai.org/term/data-store/ Initiate and develop processes to ensure outputs are made FAIR consistently with research integrity principles, and with ethical oversight. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang design processes for research integrity CoreTrustSeal offers to any interested data repository a core level certification based on the DSA–WDS Core Trustworthy Data Repositories Requirements catalogue and procedures. This universal catalogue of requirements reflects the core characteristics of trustworthy data repositories and is the culmination of a cooperative effort between DSA and WDS under the umbrella of the Research Data Alliance to merge their data repositories certifications. Peter McQuilton CTS Peter McQuilton 2021-02-17T22:12:12.996581Z core trust seal CoreTrustSeal offers to any interested data repository a core level certification based on the DSA–WDS Core Trustworthy Data Repositories Requirements catalogue and procedures. This universal catalogue of requirements reflects the core characteristics of trustworthy data repositories and is the culmination of a cooperative effort between DSA and WDS under the umbrella of the Research Data Alliance to merge their data repositories certifications. https://www.coretrustseal.org/about/ Defines how to manage a project. It will always be the same, regardless of the project lifecycle being employed. Peter McQuilton https://orcid.org/0000-0002-7702-4495 project management lifecycle Defines how to manage a project. It will always be the same, regardless of the project lifecycle being employed. CASRAI. https://casrai.org/term/project-management-lifecycle A type of metadata that conveys information needed to link a data object to its original source. Peter McQuilton authenticity metadata A type of metadata that conveys information needed to link a data object to its original source. CASRAI. 
https://casrai.org/term/authenticity-metadata/ researcher Bin for Skills needed for Service level management. leightonlc 2019-10-17T15:18:53.731967Z skills related to service level management 1. Data which relate to a living individual who can be identified (a) from those data, or (b) from those data and other information which is in the possession of, or is likely to come into the possession of, the data controller, and includes any expression of opinion about the individual and any indication of the intentions of the data controller or any other person in respect of the individual. 2. Any data that could potentially identify a specific individual. Any information that can be used to distinguish one person from another and can be used for de-anonymizing anonymous data can be considered personally identifiable data. 3. Data are identifiable if the information contains the name of an individual, or other identifying items such as birth date, address or geocoding. Data will be identifiable if the information contains a unique personal identifier and the holder of the information also has the master list linking the identifiers to individuals. Data may also be identifiable because of the number of different pieces of information known about a particular individual. It may also be possible to ascertain the identity of individuals from aggregated data where there are very few individuals in a particular category. Identifiability is dependent on the amount of information held and also on the skills and technology of the holder. Peter McQuilton https://orcid.org/0000-0002-7702-4495 personally identifiable information 1. 
Data which relate to a living individual who can be identified (a) from those data, or (b) from those data and other information which is in the possession of, or is likely to come into the possession of, the data controller, and includes any expression of opinion about the individual and any indication of the intentions of the data controller or any other person in respect of the individual. 2. Any data that could potentially identify a specific individual. Any information that can be used to distinguish one person from another and can be used for de-anonymizing anonymous data can be considered personally identifiable data. 3. Data are identifiable if the information contains the name of an individual, or other identifying items such as birth date, address or geocoding. Data will be identifiable if the information contains a unique personal identifier and the holder of the information also has the master list linking the identifiers to individuals. Data may also be identifiable because of the number of different pieces of information known about a particular individual. It may also be possible to ascertain the identity of individuals from aggregated data where there are very few individuals in a particular category. Identifiability is dependent on the amount of information held and also on the skills and technology of the holder. CASRAI. https://casrai.org/term/personally-identifiable-information Understand the current storage security status and the preferred status, as well as the barriers between them. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang understand storage security challenges The process of organizing data into tables in such a way that the results of using the database are always unambiguous and as intended. Normalization is typically a refinement process after the initial exercise of identifying the data objects that should be in the database, identifying their relationships, and defining the tables required and the columns within each table.
Peter McQuilton https://orcid.org/0000-0002-7702-4495 normalization The process of organizing data into tables in such a way that the results of using the database are always unambiguous and as intended. Normalization is typically a refinement process after the initial exercise of identifying the data objects that should be in the database, identifying their relationships, and defining the tables required and the columns within each table. CASRAI. https://casrai.org/term/normalization/ Evaluation is a decision about significance, value, or quality of something, based on careful study of its good and bad features. Peter McQuilton https://orcid.org/0000-0002-7702-4495 AL 17.3.22: We may wish to make this an alternative term of Assessment. evaluation Evaluation is a decision about significance, value, or quality of something, based on careful study of its good and bad features. CASRAI. https://casrai.org/term/evaluation/ A type of data element that expresses a proposition that binds one or more property values to some data entity. Peter McQuilton data item A type of data element that expresses a proposition that binds one or more property values to some data entity. CASRAI. https://casrai.org/term/data-item/ The duties and practices of people and organizations to ensure that individual's personal information only flows from one entity to another according to legislated or otherwise broadly accepted norms and policies. Peter McQuilton confidentiality The duties and practices of people and organizations to ensure that individual's personal information only flows from one entity to another according to legislated or otherwise broadly accepted norms and policies. CASRAI. https://casrai.org/term/confidentiality/ Bin for types of Knowledge needed for Data management costs management. 
leightonlc knowledge of data management costs management Peter McQuilton Philippe Rocca-Serra Susanna Sansone https://orcid.org/0000-0002-7702-4495 logistical support Bin for Aptitudes related to Data management costs management. leightonlc aptitudes for data management costs management The process of acquiring data from some source. For example, data may be acquired by download from a repository, transfer from a data logger, data capture, etc. Peter McQuilton https://orcid.org/0000-0002-7702-4495 Data capture AL 15.3.22: Removed "data and metadata capture" as was too similar to this term. Added alternative term "data capture". data acquisition The process of acquiring data from some source. For example, data may be acquired by download from a repository, transfer from a data logger, data capture, etc. CASRAI. https://casrai.org/term/data-acquisition/ Planning for and controlling expenditures related to the data management lifecycle, including, but not limited to, labor and infrastructure expenses for data collection; data documentation; data storage; data access and security; data preservation; data sharing; and data disposition. Kristina Hettne Simon Hodson Victoria Dominguez Del Angel data management costs management Planning for and controlling expenditures related to the data management lifecycle, including, but not limited to, labor and infrastructure expenses for data collection; data documentation; data storage; data access and security; data preservation; data sharing; and data disposition. Written by LLC, based on list at https://www.uu.nl/en/research/research-data-management/guides/costs-of-data-management A Text file is a kind of computer file that is structured as a sequence of lines of electronic text. A text file exists stored as data within a computer file system. Peter McQuilton https://orcid.org/0000-0002-7702-4495 AL 9.3.22: The CASRAI term (https://casrai.org/term/text-file) had an unsuitable definition, and therefore was removed. 
text file A Text file is a kind of computer file that is structured as a sequence of lines of electronic text. A text file exists stored as data within a computer file system. AL, Adapted from Wikipedia. https://en.wikipedia.org/wiki/Text_file Data and code that are commented so that humans can understand what it represents, its design, and purpose. REFERENCE. Wilson G, Aruliah DA, Brown CT, Hong NPC, Davis M, Guy RT, Haddock SHD, Huff K, Mitchell IM, Plumbley MD, Waugh B, White EP, Wilson P (2012). Best practices for scientific computing, arXiv, 29 November, 1-6. Peter McQuilton human-readable format Data and code that are commented so that humans can understand what it represents, its design, and purpose. REFERENCE. Wilson G, Aruliah DA, Brown CT, Hong NPC, Davis M, Guy RT, Haddock SHD, Huff K, Mitchell IM, Plumbley MD, Waugh B, White EP, Wilson P (2012). Best practices for scientific computing, arXiv, 29 November, 1-6. CASRAI. https://casrai.org/term/human-readable-format/ Provides well-defined guarantees for fitness, accuracy, and consistency for any of various kinds of user input into an application or automated system. Data validation checks that data are valid, sensible, reasonable, clean, usable, and secure before they are processed. Failures or omissions in data validation can lead to data corruption or security vulnerability. Improperly validated data ... Peter McQuilton data validation Provides well-defined guarantees for fitness, accuracy, and consistency for any of various kinds of user input into an application or automated system. Data validation checks that data are valid, sensible, reasonable, clean, usable, and secure before they are processed. Failures or omissions in data validation can lead to data corruption or security vulnerability. Improperly validated data ... CASRAI.
https://casrai.org/term/data-validation/ A type of record (and organization) that stores an instance of an executable/understandable PID. The content of a PID record distinguishes a registered digital or data object from other DOs. A PID record is a type of record that includes property information that characterizes the digital object it is identifying. Important parts of a PID record are location and checksum. However there is a large variation in usage. In some data models the PID is simply used as a unique label with an empty record. A PID record has a lifecycle including creation, publication, Curation and the destruction. Peter McQuilton https://orcid.org/0000-0002-7702-4495 pid record A type of record (and organization) that stores an instance of an executable/understandable PID. The content of a PID record distinguishes a registered digital or data object from other DOs. A PID record is a type of record that includes property information that characterizes the digital object it is identifying. Important parts of a PID record are location and checksum. However there is a large variation in usage. In some data models the PID is simply used as a unique label with an empty record. A PID record has a lifecycle including creation, publication, Curation and the destruction. CASRAI, https://casrai.org/term/pid-record/ Take proactive approach to ensure outputs are made FAIR consistently with research integrity principles. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang appreciate importance of research integrity A type of metadata that indicates how compound objects are put together (e.g., how pages are ordered to form chapters; how data are organized in a table; how datasets are organized in a collection) 2. The underlying structural metadata of digital objects that tells computers how to assemble them. 
Peter McQuilton https://orcid.org/0000-0002-7702-4495 structural metadata A type of metadata that indicates how compound objects are put together (e.g., how pages are ordered to form chapters; how data are organized in a table; how datasets are organized in a collection) 2. The underlying structural metadata of digital objects that tells computers how to assemble them. CASRAI. https://casrai.org/term/structural-metadata/ Data that could not lead to the identification of a specific individual, to distinguishing one person from another, or to personally identifiable information. These may be data that have been de-identified, or that could not lead to personally identifiable information in the first place. Peter McQuilton https://orcid.org/0000-0002-7702-4495 non personally identifiable information Data that could not lead to the identification of a specific individual, to distinguishing one person from another, or to personally identifiable information. These may be data that have been de-identified, or that could not lead to personally identifiable information in the first place. CASRAI. https://casrai.org/term/non-personally-identifiable-information/ The process of creating digital files by scanning or otherwise converting analogue materials. The resulting digital copy, or digital surrogate, would then be classed as digital material and then subject to the same broad challenges involved in preserving access to it, as born digital materials. Peter McQuilton https://orcid.org/0000-0002-7702-4495 digitisation The process of creating digital files by scanning or otherwise converting analogue materials. The resulting digital copy, or digital surrogate, would then be classed as digital material and then subject to the same broad challenges involved in preserving access to it, as born digital materials. CASRAI. https://casrai.org/term/digitisation/ REFERENCE. 
Digital Preservation Coalition http://www.dpconline.org/advice/preservationhandbook/introduction/definitions-and-concepts The release of research data, associated metadata, accompanying documentation, and software code (in cases where the raw data have been processed or manipulated) for re-use and analysis in such a manner that they can be discovered on the Web and referred to in a unique and persistent way. Data publishing occurs via dedicated data repositories... Peter McQuilton data publication The release of research data, associated metadata, accompanying documentation, and software code (in cases where the raw data have been processed or manipulated) for re-use and analysis in such a manner that they can be discovered on the Web and referred to in a unique and persistent way. Data publishing occurs via dedicated data repositories... CASRAI. https://casrai.org/term/data-publication/ Peter McQuilton findability of digital assets Peter McQuilton Philippe Rocca-Serra Susanna Sansone https://orcid.org/0000-0002-7702-4495 AL 9.3.22: renamed to prevent compound term. Was 'Influencing/community building' community building A type of referable data that has undergone quality assessment and can be referred to as citations in publications and as part of research objects. Peter McQuilton citable data A type of referable data that has undergone quality assessment and can be referred to as citations in publications and as part of research objects. CASRAI. https://casrai.org/term/citable-data/ An evolving term that describes any voluminous amount of structured, semi-structured and unstructured data that have the potential to be mined for information. Peter McQuilton big data An evolving term that describes any voluminous amount of structured, semi-structured and unstructured data that have the potential to be mined for information. CASRAI. https://casrai.org/term/big-data/ Ability to apply knowledge about FAIR metrics and assessing FAIRness using FAIR assessment tools.
Kristina Hettne Victoria Dominguez Del Angel Yann Le Franc fairness assessment An archival service providing the long-term permanent care and accessibility for digital objects with research value. Peter McQuilton data archive An archival service providing the long-term permanent care and accessibility for digital objects with research value. CASRAI. https://casrai.org/term/data-archive/ Describes the technical processes used to produce, or required to use a digital object. Peter McQuilton https://orcid.org/0000-0002-7702-4495 technical metadata Describes the technical processes used to produce, or required to use a digital object. CASRAI. https://casrai.org/term/technical-metadata/ OBSOLETE. In a form that can be used and understood by a computer. Peter McQuilton https://orcid.org/0000-0002-7702-4495 AL 5.12.22: Deprecated because of its similarity with http://purl.obolibrary.org/obo/T4FS_0000551. We don't need to be going to the level of granularity that the concept of machine readability needs to be modelled as well as machine readable format. machine readable true OBSOLETE. In a form that can be used and understood by a computer. CASRAI. https://casrai.org/term/machine-readable/ A combination of business processes, policies and technologies that allows organizations to provide secure access to confidential data. Integrated access management software is used by enterprises to control the flow of sensitive data in and out of the network. Peter McQuilton https://orcid.org/0000-0002-7702-4495 integrated access management A combination of business processes, policies and technologies that allows organizations to provide secure access to confidential data. Integrated access management software is used by enterprises to control the flow of sensitive data in and out of the network. CASRAI. 
https://casrai.org/term/integrated-access-management/ Meaningless data, including: Any data that cannot be understood and interpreted correctly by machines, such as unstructured text; any data that has been received, stored, or changed in such a manner that it cannot be read or used by the program that originally created it. Peter McQuilton https://orcid.org/0000-0002-7702-4495 noisy data Meaningless data, including: Any data that cannot be understood and interpreted correctly by machines, such as unstructured text; any data that has been received, stored, or changed in such a manner that it cannot be read or used by the program that originally created it. CASRAI. https://casrai.org/term/noisy-data/ Activities in and around the provision of training, mentorship and teaching opportunities for good data management and the implementation of FAIR practices. Peter McQuilton Philippe Rocca-Serra Susanna Sansone https://orcid.org/0000-0002-7702-4495 fair training Activities in and around the provision of training, mentorship and teaching opportunities for good data management and the implementation of FAIR practices. PMQ A set of instructions that direct a computer to do a specific task. Peter McQuilton https://orcid.org/0000-0002-7702-4495 software A set of instructions that direct a computer to do a specific task. CASRAI. https://casrai.org/term/software In the context of data analysis and data mining: Where V represents the value of the variable in the original datasets: Transformation of data to have zero mean and unit variance. Techniques used include: (a) Data normalization; (b) z-score scaling; (c) Dividing each value by the range: recalculates each variable as V /(max V - ... Peter McQuilton data standardization In the context of data analysis and data mining: Where V represents the value of the variable in the original datasets: Transformation of data to have zero mean and unit variance. 
Techniques used include: (a) Data normalization; (b) z-score scaling; (c) Dividing each value by the range: recalculates each variable as V /(max V - ... CASRAI. https://casrai.org/term/data-standardization/ Structured data that are accessible, machine-readable, usable, intelligible, and freely shared. Open data can be freely used, re-used, built on, and redistributed by anyone - subject only, at most, to the requirement to attribute and sharealike. Peter McQuilton https://orcid.org/0000-0002-7702-4495 open data Structured data that are accessible, machine-readable, usable, intelligible, and freely shared. Open data can be freely used, re-used, built on, and redistributed by anyone - subject only, at most, to the requirement to attribute and sharealike. CASRAI. https://casrai.org/term/open-data/ The output of a data curation activity. Such data has generally already been cleaned, standardised, documented. Additional metadata relevant to the data object has also been added, via either manual or automatic methods. Peter McQuilton https://orcid.org/0000-0002-7702-4495 Peter McQuilton AL 9.5.22: The original label ("data curation") matched a term within the Data stewardship activity hierarchy. Changing the class label to "Curated data" better fit the model. curated data The output of a data curation activity. Such data has generally already been cleaned, standardised, documented. Additional metadata relevant to the data object has also been added, via either manual or automatic methods. AL 9.5.22 Bin for Skills needed for Funding acquisition management. leightonlc skills related to funding acquisition management A 128-bit number used to guarantee unique identity for different objects on the internet over time. File system partitions. Peter McQuilton https://orcid.org/0000-0002-7702-4495 UUID universally unique identifier A 128-bit number used to guarantee unique identity for different objects on the internet over time. File system partitions. CASRAI.
https://casrai.org/term/universally-unique-identifier/ The process of confirming the identity of a principal entity. Peter McQuilton authentication The process of confirming the identity of a principal entity. CASRAI. https://casrai.org/term/authentication/ A large-scale distributed computing paradigm that is driven by economies of scale, in which a pool of abstracted, virtualized, dynamically-scalable, managed computing power, storage, platforms and services are delivered on demand to external customers over the Internet. Peter McQuilton cloud computing A large-scale distributed computing paradigm that is driven by economies of scale, in which a pool of abstracted, virtualized, dynamically-scalable, managed computing power, storage, platforms and services are delivered on demand to external customers over the Internet. CASRAI. https://casrai.org/term/cloud-computing/ Short-term preservation. Access to digital materials either for a defined period of time while use is predicted but which does not extend beyond the foreseeable future and/or until it becomes inaccessible because of changes in technology. Peter McQuilton https://orcid.org/0000-0002-7702-4495 short-term preservation Short-term preservation. Access to digital materials either for a defined period of time while use is predicted but which does not extend beyond the foreseeable future and/or until it becomes inaccessible because of changes in technology. CASRAI. https://casrai.org/term/short-term-preservation A standard that is used in multiple nations and whose development process is open to representatives from all countries. Peter McQuilton international standard A standard that is used in multiple nations and whose development process is open to representatives from all countries. CASRAI. https://casrai.org/term/international-standard/ The state of having satisfied the requirements of some specific standard(s) and/or specification(s).
Peter McQuilton conformance The state of having satisfied the requirements of some specific standard(s) and/or specification(s). CASRAI. https://casrai.org/term/conformance/ A data custodian is an IT individual or organization responsible for the IT infrastructure providing and protecting data in conformance with the policies and practices prescribed by data governance. Peter McQuilton data custodian A data custodian is an IT individual or organization responsible for the IT infrastructure providing and protecting data in conformance with the policies and practices prescribed by data governance. CASRAI. https://casrai.org/term/data-custodian/ Bin for Aptitudes related to High performance computing management. leightonlc aptitudes for high performance computing management Implement the policies that govern the arrangement, naming, descriptive metadata, provenance metadata, representation metadata, administrative metadata, access controls, retention, disposition, integrity, and replication of digital objects. Peter McQuilton https://orcid.org/0000-0002-7702-4495 manage datasets in a repository Implement the policies that govern the arrangement, naming, descriptive metadata, provenance metadata, representation metadata, administrative metadata, access controls, retention, disposition, integrity, and replication of digital objects. CASRAI. https://casrai.org/term/manage-datasets-in-a-repository Removing noise from data. Peter McQuilton data de-noising Removing noise from data. CASRAI. https://casrai.org/term/data-de-noising/ Provides the relationship and process context for working together to ensure outcomes are achieved. Peter McQuilton https://orcid.org/0000-0002-7702-4495 governance and accountability model Provides the relationship and process context for working together to ensure outcomes are achieved. CASRAI. 
https://casrai.org/term/governance-and-accountability-model The continued, available for use, ongoing usability of a digital resource, retaining all qualities of authenticity, accuracy and functionality deemed to be essential for the purposes the digital material was created and/or acquired for. Users who have access can retrieve, manipulate, copy, and store copies on a wide range of hard drives and external devices. Peter McQuilton access The continued, available for use, ongoing usability of a digital resource, retaining all qualities of authenticity, accuracy and functionality deemed to be essential for the purposes the digital material was created and/or acquired for. Users who have access can retrieve, manipulate, copy, and store copies on a wide range of hard drives and external devices. CASRAI. https://casrai.org/term/access Find the people with a good understanding of FAIR and open research challenges in your organisation, and create a work agenda. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang develop fair and open research strategy Choose the proper type of patent for the project data and apply for the patent chosen. Celia van Gelder Mateusz Kuzak Yan Wang patent application A type of data provenance that adds metadata to identify data collections. Peter McQuilton collection management identification A type of data provenance that adds metadata to identify data collections. CASRAI. https://casrai.org/term/collection-management-identification/ Activity/Process whereby digital objects are evaluated with the relevant FAIR metrics and assessment tools. Kristina Hettne Victoria Dominguez Del Angel Yann Le Franc assess fairness Activity/Process whereby digital objects are evaluated with the relevant FAIR metrics and assessment tools. [YLF, VDA, KH] Bin for types of Knowledge related to Change management. leightonlc knowledge of change management The process of obtaining, importing, and processing data for later use or storage in a database.
This process often involves altering individual files by editing their content and/or formatting them to fit into a larger document. Peter McQuilton data ingestion The process of obtaining, importing, and processing data for later use or storage in a database. This process often involves altering individual files by editing their content and/or formatting them to fit into a larger document. CASRAI. https://casrai.org/term/data-ingestion/ The process of manually or semi-automatically converting or mapping data from one form into another format that allows for more convenient consumption of the data with the help of semi-automated tools. Gathering and organizing disparate data from different sources, often collected by many different investigators. Activities include developing and supporting search tools that utilize standardized ... Peter McQuilton data wrangling The process of manually or semi-automatically converting or mapping data from one form into another format that allows for more convenient consumption of the data with the help of semi-automated tools. Gathering and organizing disparate data from different sources, often collected by many different investigators. Activities include developing and supporting search tools that utilize standardized ... CASRAI. https://casrai.org/term/data-wrangling/ The configuration of staff, services and tools assembled to support data management across the research lifecycle and more specifically to provide comprehensive coverage of the stages making up the data lifecycle. It can be organized locally and/or globally to support research data activities across the research lifecycle.
Peter McQuilton https://orcid.org/0000-0002-7702-4495 research data management infrastructure The configuration of staff, services and tools assembled to support data management across the research lifecycle and more specifically to provide comprehensive coverage of the stages making up the data lifecycle. It can be organized locally and/or globally to support research data activities across the research lifecycle. CASRAI. https://casrai.org/term/research-data-management-infrastructure Appreciate the need for flexibility in applying FAIR criteria to make data as open as possible, as closed as necessary. Angus Whyte flexibility in relating fair criteria to openness To be able to research and identify the appropriate taxonomy for your project. This may involve using resources such as FAIRsharing, which captures metadata on taxonomies and provides an assessment of their FAIRness. Peter McQuilton Controlled Vocabulary identification Ontology identification Taxonomy identification Thesaurus identification choosing the appropriate terminology for your data To be able to research and identify the appropriate taxonomy for your project. This may involve using resources such as FAIRsharing, which captures metadata on taxonomies and provides an assessment of their FAIRness. [PMQ] Data whose elements have been organized into a consistent format and data structure within a defined data model such that the elements can be easily addressed, organized and accessed in various combinations to make better use of the information, such as in a relational database. Peter McQuilton https://orcid.org/0000-0002-7702-4495 structured data Data whose elements have been organized into a consistent format and data structure within a defined data model such that the elements can be easily addressed, organized and accessed in various combinations to make better use of the information, such as in a relational database. CASRAI. 
https://casrai.org/term/structured-data/ Know how data handling practices in the domain make FAIR criteria more or less easy to implement. Angus Whyte knowledge to contextualise fair principles to domain Data Management refers to the storage, access and preservation of data produced from a given investigation. Data management practices cover the entire lifecycle of the data, from planning the investigation to conducting it, and from backing up data as it is created and used to long term preservation of data deliverables after the research investigation has concluded. Specific activities and issues that fall within the category of data management include: File naming (the proper way to name computer files); data quality control and quality assurance; data access; data documentation (including levels of uncertainty); metadata creation and controlled vocabularies; data storage; data archiving and preservation; data sharing and reuse; data integrity; data security; data privacy; data rights; notebook protocols (lab or field). Peter McQuilton https://orcid.org/0000-0002-7702-4495 research data management Data Management refers to the storage, access and preservation of data produced from a given investigation. Data management practices cover the entire lifecycle of the data, from planning the investigation to conducting it, and from backing up data as it is created and used to long term preservation of data deliverables after the research investigation has concluded. Specific activities and issues that fall within the category of data management include: File naming (the proper way to name computer files); data quality control and quality assurance; data access; data documentation (including levels of uncertainty); metadata creation and controlled vocabularies; data storage; data archiving and preservation; data sharing and reuse; data integrity; data security; data privacy; data rights; notebook protocols (lab or field). CASRAI. 
https://casrai.org/term/research-data-management/ Deposit research output (software, data and publication together with documentation) in previously selected repositories. Celia van Gelder Mateusz Kuzak Yan Wang publish output in a repository To be able to identify and process metadata over the internet, sourced from a variety of metadata providers and schemata. Peter McQuilton Metadata processing Metadata search metadata search via metadata providers To be able to identify and process metadata over the internet, sourced from a variety of metadata providers and schemata. [PMQ] Information for a data object that includes: * the person who deposited the data object in the repository, * the source of the data object, * the date when the object was deposited, and * authenticity information needed to link the data object to its original source. Peter McQuilton https://orcid.org/0000-0002-7702-4495 record provenance information Information for a data object that includes: * the person who deposited the data object in the repository, * the source of the data object, * the date when the object was deposited, and * authenticity information needed to link the data object to its original source. CASRAI. https://casrai.org/term/record-provenance-information/ Offers proper recognition to authors as well as permanent identification through the use of global persistent identifiers in place of URLs which can change frequently. Peter McQuilton data citation Offers proper recognition to authors as well as permanent identification through the use of global persistent identifiers in place of URLs which can change frequently. CASRAI. https://casrai.org/term/data-citation/ Peter McQuilton Peter McQuilton 2020-10-01T20:54:08.535183Z book The capacity to influence stakeholders and the direction of research activities; the ability to shape others' understanding in ways that capture interest, inform and gain support; and, the capacity to influence the actions and opinions of others.
Peter McQuilton https://orcid.org/0000-0002-7702-4495 intellectual leadership The capacity to influence stakeholders and the direction of research activities; the ability to shape others' understanding in ways that capture interest, inform and gain support; and, the capacity to influence the actions and opinions of others. CASRAI. https://casrai.org/term/intellectual-leadership/ In the context of a researcher's activities, innovation is the development of modified or novel approaches, theories, concepts, ideas or solutions. Innovation is one of four valued outcomes. Peter McQuilton https://orcid.org/0000-0002-7702-4495 innovation In the context of a researcher's activities, innovation is the development of modified or novel approaches, theories, concepts, ideas or solutions. Innovation is one of four valued outcomes. CASRAI. https://casrai.org/term/innovation/ High-performance computing (HPC) is the use of supercomputers and parallel processing techniques for solving complex computational problems. HPC technology focuses on developing parallel processing algorithms and systems by incorporating both administration and parallel computational techniques. Kristina Hettne Simon Hodson Victoria Dominguez Del Angel HPC management high performance computing management High-performance computing (HPC) is the use of supercomputers and parallel processing techniques for solving complex computational problems. HPC technology focuses on developing parallel processing algorithms and systems by incorporating both administration and parallel computational techniques. (modified from wikipedia)[VDA] A type of historical information or metadata about the origin, location or the source of something, or the history of the ownership or location of an object or resource including digital objects. For example, information about the Principal Investigator who recorded the data, and the information concerning its storage, handling, and migration.
Peter McQuilton https://orcid.org/0000-0002-7702-4495 provenance A type of historical information or metadata about the origin, location or the source of something, or the history of the ownership or location of an object or resource including digital objects. For example, information about the Principal Investigator who recorded the data, and the information concerning its storage, handling, and migration. CASRAI. https://casrai.org/term/provenance A record created digitally in the day-to-day business of the organisation and assigned formal status by the organisation. Examples include: word processing documents, emails, databases, or intranet web pages. Peter McQuilton https://orcid.org/0000-0002-7702-4495 Electronic record AL 8.2.22: Updated label from the CASRAI "Digital" to "Digital record" to better match the definition and intent of the term, and to provide clarity. digital record A record created digitally in the day-to-day business of the organisation and assigned formal status by the organisation. Examples include: word processing documents, emails, databases, or intranet web pages. CASRAI. https://casrai.org/term/digital/ Processes and procedures designed to ensure that the results of laboratory analysis are consistent, comparable, accurate and within specified limits of precision. Peter McQuilton analytical quality control Processes and procedures designed to ensure that the results of laboratory analysis are consistent, comparable, accurate and within specified limits of precision. CASRAI. https://casrai.org/term/analytical-quality-control/ Research data which is in digital form. It may have been originally created in digital form, or it may have been converted from paper, or other form to a digital representation. Peter McQuilton https://orcid.org/0000-0002-7702-4495 digital research data Research data which is in digital form. 
It may have been originally created in digital form, or it may have been converted from paper, or other form to a digital representation. CASRAI. https://casrai.org/term/digital-research-data/ In the context of a researcher’s activities, Managing research is the processes related to the planning, organizing, setting objectives, controlling and evaluating of RDA activities and their associated human and financial resources. It includes the provision of leadership to, and assessment of, other scientists, engineers, technologists, and/or other staff. Managing research is one of the three research contexts in which a researcher is expected to conduct his/her activities. Managing research is distinct from the position-based role of a research manager. Peter McQuilton https://orcid.org/0000-0002-7702-4495 managing research In the context of a researcher’s activities, Managing research is the processes related to the planning, organizing, setting objectives, controlling and evaluating of RDA activities and their associated human and financial resources. It includes the provision of leadership to, and assessment of, other scientists, engineers, technologists, and/or other staff. Managing research is one of the three research contexts in which a researcher is expected to conduct his/her activities. Managing research is distinct from the position-based role of a research manager. CASRAI. https://casrai.org/term/managing-research/ To be able to define and deploy appropriate criteria, based on the FAIR principles, to identify the appropriate publishing platform for a dataset. This may, for example, involve discovering the mark-up of the platform, the exchange formats and data models used, and the licensing information for each repository. Tools such as the FAIR evaluator or FAIRshake may be used to assess the FAIRness based on human-entered questionnaires or FAIRsharing resource metadata.
Peter McQuilton evaluation of how to publish your data To be able to define and deploy appropriate criteria, based on the FAIR principles, to identify the appropriate publishing platform for a dataset. This may, for example, involve discovering the mark-up of the platform, the exchange formats and data models used, and the licensing information for each repository. Tools such as the FAIR evaluator or FAIRshake may be used to assess the FAIRness based on human-entered questionnaires or FAIRsharing resource metadata. [PMQ] Numbers used by the National Center for Biotechnology Information (NCBI) that are unique and citable. Peter McQuilton accession number Numbers used by the National Center for Biotechnology Information (NCBI) that are unique and citable. CASRAI. https://casrai.org/term/accession-number/ The series of managed activities necessary to ensure continued access to digital materials for as long as necessary. Digital preservation is defined very broadly and refers to all of the actions required to maintain access to digital materials beyond the limits of media failure or technological change. Those materials may be records created during the day-to-day business of an organisation; “born-digital” materials created for a specific purpose (e.g. teaching resources); or the products of digitisation projects. This definition specifically excludes the potential use of digital technology to preserve the original artefacts through digitisation. Peter McQuilton https://orcid.org/0000-0002-7702-4495 digital preservation The series of managed activities necessary to ensure continued access to digital materials for as long as necessary. Digital preservation is defined very broadly and refers to all of the actions required to maintain access to digital materials beyond the limits of media failure or technological change.
Those materials may be records created during the day-to-day business of an organisation; “born-digital” materials created for a specific purpose (e.g. teaching resources); or the products of digitisation projects. This definition specifically excludes the potential use of digital technology to preserve the original artefacts through digitisation. CASRAI. https://casrai.org/term/digital-preservation/ The practice of initiating, planning, executing, controlling, and closing the work of a team in relation to FAIR data stewardship. Peter McQuilton Philippe Rocca-Serra Susanna Sansone https://orcid.org/0000-0002-7702-4495 project management activities The practice of initiating, planning, executing, controlling, and closing the work of a team in relation to FAIR data stewardship. PMQ Bin for types of Knowledge related to Funding acquisition management. leightonlc knowledge of funding acquisition management Monitor the authorised parties' access to data and keep it up to date. Celia van Gelder Mateusz Kuzak Yan Wang manage access control The practice of initiating, planning, executing, controlling, and closing the work of a team to achieve specific goals and meet specific success criteria at the specified time. Kristina Hettne Simon Hodson Victoria Dominguez Del Angel PM project management The practice of initiating, planning, executing, controlling, and closing the work of a team to achieve specific goals and meet specific success criteria at the specified time. KH An organization's established protocol for retaining information for operational or regulatory compliance needs. The objectives of a data retention policy are to keep important information for future use or reference, to organize information so it can be searched and accessed at a later date, and to dispose of information that is no longer needed. Peter McQuilton data retention policy An organization's established protocol for retaining information for operational or regulatory compliance needs.
The objectives of a data retention policy are to keep important information for future use or reference, to organize information so it can be searched and accessed at a later date, and to dispose of information that is no longer needed. CASRAI. https://casrai.org/term/data-retention-policy/ An object describing the context of the data, including provenance, description, structural, and administrative information. Peter McQuilton https://orcid.org/0000-0002-7702-4495 data representation An object describing the context of the data, including provenance, description, structural, and administrative information. CASRAI. https://casrai.org/term/data-representation/ The set of metadata describing a specific dataset. Peter McQuilton https://orcid.org/0000-0002-7702-4495 metadata dataset The set of metadata describing a specific dataset. CASRAI. https://casrai.org/term/metadata-dataset/ Long-term preservation - Continued access to digital materials, or at least to the information contained in them, indefinitely. Peter McQuilton https://orcid.org/0000-0002-7702-4495 long-term preservation Long-term preservation - Continued access to digital materials, or at least to the information contained in them, indefinitely. CASRAI. https://casrai.org/term/long-term-preservation Research metadata format is a generic term encompassing the concept of a standardised format for research metadata. Peter McQuilton https://orcid.org/0000-0002-7702-4495 AL 9.3.2022: The original CASRAI definition is unsuitable (https://casrai.org/term/research-metadata-format), therefore I have changed the definition to something more generic research metadata format Research metadata format is a generic term encompassing the concept of a standardised format for research metadata. AL, 9.3.2022 The physical or geographic location of an organization's data or information. Data residency also refers to the legal or regulatory requirements imposed on data based on the country or region in which it resides. 
Cloud computing, which allows organizations to deliver hosted services over the Internet, can create data residency concerns. Peter McQuilton data residency The physical or geographic location of an organization's data or information. Data residency also refers to the legal or regulatory requirements imposed on data based on the country or region in which it resides. Cloud computing, which allows organizations to deliver hosted services over the Internet, can create data residency concerns. CASRAI. https://casrai.org/term/data-residency/ Information governance is the overall strategy for information at an organization. Information governance balances the risk that information presents with the value that information provides. Information governance helps with legal compliance, operational transparency, and reducing expenditures associated with legal discovery. Peter McQuilton https://orcid.org/0000-0002-7702-4495 CASRAI is the source for the term, with the definition provided separately. information governance Information governance is the overall strategy for information at an organization. Information governance balances the risk that information presents with the value that information provides. Information governance helps with legal compliance, operational transparency, and reducing expenditures associated with legal discovery. https://en.wikipedia.org/wiki/Information_governance, accessed 8.2.22 A unit of data for which the definition, identification, representation (term used to represent it), and permissible values are specified by means of a set of attributes. Peter McQuilton data element A unit of data for which the definition, identification, representation (term used to represent it), and permissible values are specified by means of a set of attributes. CASRAI. https://casrai.org/term/data-element/ An organization's stated data/information management processes designed to assist and protect the organization's data research assets.
It is a set of high-level principles that establish a guiding framework for data management. A data policy can be used to address strategic aspects such as data access, relevant legal matters, data stewardship issues and custodial duties, data... Peter McQuilton data policy An organization's stated data/information management processes designed to assist and protect the organization's data research assets. It is a set of high-level principles that establish a guiding framework for data management. A data policy can be used to address strategic aspects such as data access, relevant legal matters, data stewardship issues and custodial duties, data... CASRAI. https://casrai.org/term/data-policy/ Data that fall into the category of dark data or at-risk data. Peter McQuilton https://orcid.org/0000-0002-7702-4495 legacy data Data that fall into the category of dark data or at-risk data. CASRAI. https://casrai.org/term/legacy-data A list of standardized terminology, words, or phrases, used for indexing or content analysis and information retrieval, usually in a defined information domain. Peter McQuilton controlled vocabulary A list of standardized terminology, words, or phrases, used for indexing or content analysis and information retrieval, usually in a defined information domain. CASRAI. https://casrai.org/term/controlled-vocabulary/ The assurance that information can only be accessed or modified by those authorized to do so. Peter McQuilton data integrity The assurance that information can only be accessed or modified by those authorized to do so. CASRAI. https://casrai.org/term/data-integrity/ A single data element related to a PID and part of its record content. Peter McQuilton https://orcid.org/0000-0002-7702-4495 pid attribute A single data element related to a PID and part of its record content. CASRAI. https://casrai.org/term/pid-attribute/ Bin for Aptitudes related to Change management. 
leightonlc aptitudes for change management Medium-term preservation - Continued access to digital materials beyond changes in technology for a defined period of time but not indefinitely. Peter McQuilton https://orcid.org/0000-0002-7702-4495 medium-term preservation Medium-term preservation - Continued access to digital materials beyond changes in technology for a defined period of time but not indefinitely. CASRAI. https://casrai.org/term/medium-term-preservation/ Describes the processes and tasks that must be completed to produce a product or service. Different project lifecycles exist for specific products and services. (For example, the lifecycle followed to build a house is very different from the lifecycle followed to develop a software package.) Peter McQuilton https://orcid.org/0000-0002-7702-4495 project lifecycle Describes the processes and tasks that must be completed to produce a product or service. Different project lifecycles exist for specific products and services. (For example, the lifecycle followed to build a house is very different from the lifecycle followed to develop a software package.) CASRAI. https://casrai.org/term/project-lifecycle Know how the acceptability of research data FAIRness depends on the research community norms e.g. concepts of data and methods for deriving valid knowledge. Angus Whyte knowledge to relate fair data assessment to research community norms Data traceability follows the lifecycle of data to track all access and changes to the data. It helps demonstrate transparency, compliance and adherence to regulations. Data traceability, along with data compliance, can be considered part of a data audit process. Data traceability is fundamental to reproducible research. Peter McQuilton data traceability Data traceability follows the lifecycle of data to track all access and changes to the data. It helps demonstrate transparency, compliance and adherence to regulations.
Data traceability, along with data compliance, can be considered part of a data audit process. Data traceability is fundamental to reproducible research. CASRAI. https://casrai.org/term/data-traceability/ Peter McQuilton 2020-10-02T14:23:22.115275Z maintaining persistent identifiers Choose the proper license considering types of project data and clearly indicate (apply) the license chosen in the project work. Celia van Gelder Mateusz Kuzak Yan Wang select and apply license Indicates how the different components within a system are linked to fulfill the tasks. Relations are thus defined by the services they are making use of and by the interface specifications. Peter McQuilton https://orcid.org/0000-0002-7702-4495 relations Indicates how the different components within a system are linked to fulfill the tasks. Relations are thus defined by the services they are making use of and by the interface specifications. CASRAI. https://casrai.org/term/relations Any organized collection of data in a computational format, defined by a theme or category that reflects what is being measured/observed/monitored. The presentation of the data in the application is enabled through metadata. REFERENCE. Research Data Alliance http://smw-rda.esc.rzg.mpg.de/index.php/Main_Page ; Mapping the Data Landscape 2011 Summit; TBS Standard on Geospatial Data (ISO 19115:2003); Environment Canada data stewardship … Peter McQuilton dataset Any organized collection of data in a computational format, defined by a theme or category that reflects what is being measured/observed/monitored. The presentation of the data in the application is enabled through metadata. REFERENCE. Research Data Alliance http://smw-rda.esc.rzg.mpg.de/index.php/Main_Page ; Mapping the Data Landscape 2011 Summit; TBS Standard on Geospatial Data (ISO 19115:2003); Environment Canada data stewardship …
CASRAI. https://casrai.org/term/dataset/ Bin for Aptitudes related to Service level management. leightonlc aptitudes for service level management Extensible Markup Language (XML) is a simple, very flexible text format derived from SGML (ISO 8879). Originally designed to meet the challenges of large-scale electronic publishing, XML is also playing an increasingly important role in the exchange of a wide variety of data on the Web and elsewhere. Peter McQuilton https://orcid.org/0000-0002-7702-4495 XML extensible markup language Extensible Markup Language (XML) is a simple, very flexible text format derived from SGML (ISO 8879). Originally designed to meet the challenges of large-scale electronic publishing, XML is also playing an increasingly important role in the exchange of a wide variety of data on the Web and elsewhere. CASRAI. https://casrai.org/term/extensible-markup-language/ Bin for Skills needed for Change management. leightonlc skills related to change management A data lifecycle stage that involves the techniques that produce synthesized knowledge from organized information. Peter McQuilton data analysis A data lifecycle stage that involves the techniques that produce synthesized knowledge from organized information. CASRAI. https://casrai.org/term/data-analysis/ Resource management is the efficient and effective development of an organization's resources when they are needed. Such resources may include the financial resources, inventory, human skills, production resources, or information technology (IT) and natural resources. Kristina Hettne Leighton Christiansen Simon Hodson Victoria Dominguez Del Angel resource management Resource management is the efficient and effective development of an organization's resources when they are needed. Such resources may include the financial resources, inventory, human skills, production resources, or information technology (IT) and natural resources.
[LLC, https://en.wikipedia.org/wiki/Resource_management] Data exploration involves summarizing the main characteristics of a dataset using visualization and should be the first step in data analysis. Peter McQuilton data exploration Data exploration involves summarizing the main characteristics of a dataset using visualization and should be the first step in data analysis. CASRAI. https://casrai.org/term/data-exploration/ Bin for Aptitudes related to Storage management. leightonlc aptitudes for storage management Management and provision of good data stewardship practice. Peter McQuilton Philippe Rocca-Serra Susanna Sansone https://orcid.org/0000-0002-7702-4495 Advise and enable Data Stewardship and FAIR mentorship activity data management and open research AL 9.3.22. Removed from 'FAIR4S defined activity' class to better integrate with the T4FS hierarchy. advise and mentor Management and provision of good data stewardship practice. PMQ Bin for Skills needed for Workflow technologies management. leightonlc 2019-10-17T15:15:55.86138Z skills related to workflow technologies management Those layers that sit between base technology (a computer science concern) and discipline-specific science. The focus is on value-added systems and services that can be widely shared across scientific domains, both supporting and enabling large increases in multi- and interdisciplinary science while reducing duplication of effort and resources (e.g., including hardware, software, personnel, services and organizations). Peter McQuilton https://orcid.org/0000-0002-7702-4495 Cyber infrastructure digital infrastructure Those layers that sit between base technology (a computer science concern) and discipline-specific science. 
The focus is on value-added systems and services that can be widely shared across scientific domains, both supporting and enabling large increases in multi- and interdisciplinary science while reducing duplication of effort and resources (e.g., including hardware, software, personnel, services and organizations). CASRAI. https://casrai.org/term/digital-infrastructure/ Bin for Skills needed for Data management cost management. leightonlc skills related to data management cost management Consists of at least one PID resolver, a name schema and a defined mechanism for issuing PIDs that conform to the name schema. Examples include: DOI, Handle System, URN, ARK, PURL, etc. Peter McQuilton https://orcid.org/0000-0002-7702-4495 pid system Consists of at least one PID resolver, a name schema and a defined mechanism for issuing PIDs that conform to the name schema. Examples include: DOI, Handle System, URN, ARK, PURL, etc. CASRAI. https://casrai.org/term/pid-system/ Peter McQuilton I2. (meta)data use vocabularies that follow fair principles Explains aspects of one discipline in terms of another (e.g., the physics of music; the politics of literature). Peter McQuilton cross-disciplinary Explains aspects of one discipline in terms of another (e.g., the physics of music; the politics of literature). CASRAI. https://casrai.org/term/cross-disciplinary/ Data that are tagged with particular metadata that can be used to derive relationships between data. Peter McQuilton https://orcid.org/0000-0002-7702-4495 semantic data Data that are tagged with particular metadata that can be used to derive relationships between data. CASRAI. https://casrai.org/term/semantic-data For a single identifier, the class of entity it refers to. For a PID system, the typical class of entities it is intended to be used for. Examples include: digital objects, physical objects, bodies, actors. 
Peter McQuilton https://orcid.org/0000-0002-7702-4495 pid domain For a single identifier, the class of entity it refers to. For a PID system, the typical class of entities it is intended to be used for. Examples include: digital objects, physical objects, bodies, actors. CASRAI. https://casrai.org/term/pid-domain/ Select data handling approaches likely to make data as FAIR as possible, considering what the data is about and its purpose in the research. Angus Whyte knowledge to choose fair data handling approaches appropriate to the research phenomena The activity of recording provenance for data and software. Knowledge about provenance vocabularies, for example PROV-O, and provenance models such as nanopublications. Kristina Hettne Simon Hodson Victoria Dominguez Del Angel provenance information management The activity of recording provenance for data and software. Knowledge about provenance vocabularies, for example PROV-O, and provenance models such as nanopublications. KH Published results can be replicated using the documented data, code, and methods employed by the author or provider without the need for any additional information or needing to communicate with the author or provider. Peter McQuilton https://orcid.org/0000-0002-7702-4495 reproducible research Published results can be replicated using the documented data, code, and methods employed by the author or provider without the need for any additional information or needing to communicate with the author or provider. CASRAI. https://casrai.org/term/reproducible-research Peter McQuilton A1.2 the protocol allows for an authentication and authorisation procedure, where necessary Peter McQuilton https://orcid.org/0000-0002-7702-4495 data stewardship technical concept A data collection that has been normalized by some established criteria to allow for effective data management.
Examples include: data files that belong to a certain experiment, all files that are created by one specific simulation, all files that belong to a specific observation (same day, same place, etc.). Peter McQuilton canonical data collection A data collection that has been normalized by some established criteria to allow for effective data management. Examples include: data files that belong to a certain experiment, all files that are created by one specific simulation, all files that belong to a specific observation (same day, same place, etc.). CASRAI. https://casrai.org/term/canonical-data-collection/ Ability to clean up existing source code and version control systems, get Unique ID for the code, choose the appropriate code repository/registry [YLF, VDA, KH] Kristina Hettne Victoria Dominguez Del Angel Yann Le Franc https://orcid.org/0000-0002-7702-4495 archival documentation of software Data harmonization is the process of making data from different sources comparable. Peter McQuilton https://orcid.org/0000-0002-7702-4495 https://github.com/terms4fairskills/FAIRterminology/issues/17 data harmonization Data harmonization is the process of making data from different sources comparable. Modified from CASRAI. https://casrai.org/term/data-harmonization/ A repository of data designed to serve a particular community of knowledge workers. The goal of a data mart is to meet the particular demands of a specific group of users. Peter McQuilton data mart A repository of data designed to serve a particular community of knowledge workers. The goal of a data mart is to meet the particular demands of a specific group of users. CASRAI. https://casrai.org/term/data-mart/ A type of access entity that contains the services and functions which make the data object holdings and their information content and related services visible to data consumers. 
Peter McQuilton access workflow A type of access entity that contains the services and functions which make the data object holdings and their information content and related services visible to data consumers. CASRAI. https://casrai.org/term/access-workflow/ Bin for types of Knowledge related to Service level management. leightonlc 2019-10-17T15:18:53.73274Z knowledge of service level management A model that specifies the structure or schema of a dataset. The model provides a documented description of the data and thus is an instance of metadata. It is a logical, relational data model showing an organized dataset as a collection of tables with entity, attributes and relations. Peter McQuilton data model A model that specifies the structure or schema of a dataset. The model provides a documented description of the data and thus is an instance of metadata. It is a logical, relational data model showing an organized dataset as a collection of tables with entity, attributes and relations. CASRAI. https://casrai.org/term/data-model/ Be aware of the potential conflicts between security and usability. Understand the importance of having an information security policy and of keeping a balance between information security and usability. Be willing to establish both organizational and technical information security measures. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang appreciate security and usability trade-offs A document that applies collectively to codes, specifications, recommended practices, classifications, test methods, and guides, which have been prepared by a standards developing organization or group, and published in accordance with established procedures.
Peter McQuilton https://orcid.org/0000-0002-7702-4495 standard A document that applies collectively to codes, specifications, recommended practices, classifications, test methods, and guides, which have been prepared by a standards developing organization or group, and published in accordance with established procedures. CASRAI. https://casrai.org/term/standard The application of a comprehensive knowledge of a discipline or disciplines to the development of expertise and the generation of new knowledge through research, and the planning and presentation of courses of study for undergraduates and graduates in universities. Peter McQuilton https://orcid.org/0000-0002-7702-4495 university teaching The application of a comprehensive knowledge of a discipline or disciplines to the development of expertise and the generation of new knowledge through research, and the planning and presentation of courses of study for undergraduates and graduates in universities. CASRAI. https://casrai.org/term/university-teaching/ The process of transferring data between storage types, formats, information technologies, or computer systems. A data migration project is usually undertaken to replace or upgrade servers or storage equipment, for a website consolidation, to conduct server maintenance or to relocate a data center. Peter McQuilton data migration The process of transferring data between storage types, formats, information technologies, or computer systems. A data migration project is usually undertaken to replace or upgrade servers or storage equipment, for a website consolidation, to conduct server maintenance or to relocate a data center. CASRAI. https://casrai.org/term/data-migration/ A continuous process that requires corrective actions throughout the data lifecycle. Peter McQuilton data cleaning A continuous process that requires corrective actions throughout the data lifecycle. CASRAI. 
https://casrai.org/term/data-cleaning/ Peter McQuilton R1.2 (meta)data are associated with detailed provenance Bin for Skills needed for Identity management. leightonlc skills related to identity management The use of persistent identifiers or PIDs to uniquely and persistently identify an entity. Nancy Hoebelheinrich Peter McQuilton https://orcid.org/0000-0002-7702-4495 Peter McQuilton 2020-10-02T14:22:51.569762Z using persistent identifiers The use of persistent identifiers or PIDs to uniquely and persistently identify an entity. [PMQ 3.2022] https://orcid.org/0000-0002-7702-4495 database curator AL 15.3.22: Updated label to 'data curator' to encompass curation beyond just in databases. data curator Learn about innovation partners' interests and concerns, analyze them and position the project work/outcome properly. Reach out to innovation partners with proactive and pragmatic innovation plan which is in compliance with FAIR principles. Celia van Gelder Mateusz Kuzak Yan Wang engage innovation partners Peter McQuilton R1.3 (meta)data meet domain-relevant community standards The process of reducing the amount or size of stored data. This may be achieved by eliminating redundant copies of data files, deduplicating data files by removing redundant records, or by compressing the data files. Peter McQuilton data reduction The process of reducing the amount or size of stored data. This may be achieved by eliminating redundant copies of data files, deduplicating data files by removing redundant records, or by compressing the data files. CASRAI. https://casrai.org/term/data-reduction/ Peter McQuilton F4. (meta)data are registered or indexed in a searchable resource 1. A registry that links data types of all sorts with the executable data processing functions that can be useful for working with a specific data type. Examples include: complex file types in biology (diagnosis), registering categories that appear in PID records to describe data properties. 
Data types range from complex digital objects to simple... Peter McQuilton data type registry 1. A registry that links data types of all sorts with the executable data processing functions that can be useful for working with a specific data type. Examples include: complex file types in biology (diagnosis), registering categories that appear in PID records to describe data properties. Data types range from complex digital objects to simple... CASRAI. https://casrai.org/term/data-type-registry/ A collection of data that is organised according to a conceptual structure/model describing the characteristics of these data and the relationships among their corresponding entities, supporting one or more application areas. A database allows its contents to be easily accessed, managed and updated. The type of database used depends on the requirements of ... Peter McQuilton database A collection of data that is organised according to a conceptual structure/model describing the characteristics of these data and the relationships among their corresponding entities, supporting one or more application areas. A database allows its contents to be easily accessed, managed and updated. The type of database used depends on the requirements of ... CASRAI. https://casrai.org/term/database/ Recognize the added value of FAIR and open research, encourage researchers to practise FAIR and open research. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang appreciation of fair and open research Demonstrate willingness to engage with new ways of applying FAIR principles. Angus Whyte ability to learn new techniques for fair implementation Bin for Aptitudes related to Funding acquisition management.
leightonlc aptitudes for funding acquisition management To be able to choose the correct data model and/or exchange format for your data, based on the repository where you plan to deposit your data. Peter McQuilton Which data model to use Which format to use choosing the appropriate model or format for your data To be able to choose the correct data model and/or exchange format for your data, based on the repository where you plan to deposit your data. [PMQ] Information concerning the creation, attribution, or version history of managed data. Provenance metadata that indicates the relationship between two versions of data objects and is generated whenever a new version of a dataset is created. Examples include: (i) the name of the program that generated the new version, (ii) the commit id of the program. https://orcid.org/0000-0002-7702-4495 provenance metadata Information concerning the creation, attribution, or version history of managed data. Provenance metadata that indicates the relationship between two versions of data objects and is generated whenever a new version of a dataset is created. Examples include: (i) the name of the program that generated the new version, (ii) the commit id of the program. CASRAI, https://casrai.org/term/provenance-metadata/ The state when data are in the place needed by the user, at the time the user needs them, and in the form needed by the user. Peter McQuilton data availability The state when data are in the place needed by the user, at the time the user needs them, and in the form needed by the user. CASRAI. https://casrai.org/term/data-availability/ An activity through which the correctness conditions of the data are verified. It also includes the specification of the type of the error or condition not met, and the qualification of the data and its division into the error-free and erroneous data. Data review consists of both error detection and data analysis, and can be carried out in manual or automated mode. 
Peter McQuilton https://orcid.org/0000-0002-7702-4495 data review An activity through which the correctness conditions of the data are verified. It also includes the specification of the type of the error or condition not met, and the qualification of the data and its division into the error-free and erroneous data. Data review consists of both error detection and data analysis, and can be carried out in manual or automated mode. CASRAI. https://casrai.org/term/data-review/ HughShanahan 2019-10-18T08:56:32.673701Z aptitudes associated with using fair tools and services Documents actions that have been undertaken to preserve a digital resource such as migrations and checksum calculations. Peter McQuilton https://orcid.org/0000-0002-7702-4495 preservation metadata Documents actions that have been undertaken to preserve a digital resource such as migrations and checksum calculations. CASRAI. https://casrai.org/term/preservation-metadata/ Change management is a broad subject and can be applied to all different types of organizational change. The most common change drivers include: technological evolution, process reviews, crisis, and consumer habit changes; pressure from new business entrants, acquisitions, mergers, and organizational restructuring. Kristina Hettne Leighton Christiansen Simon Hodson Victoria Dominguez Del Angel CM change management Change management is a broad subject and can be applied to all different types of organizational change. The most common change drivers include: technological evolution, process reviews, crisis, and consumer habit changes; pressure from new business entrants, acquisitions, mergers, and organizational restructuring. https://www.hucmi.com/en/hcmbok/ A type of repository where the original copy of data was stored and probably a data identifier registered.
Peter McQuilton https://orcid.org/0000-0002-7702-4495 original repository A type of repository where the original copy of data was stored and probably a data identifier registered. CASRAI. https://casrai.org/term/original repository 1. The act of minimally perturbing individual-level data to decrease the probability of discovering an individual's identity. It involves masking direct identifiers (e.g., name, phone number, address) as well as transforming indirect identifiers that could be used alone or in combination to identify an individual (e.g., birth dates, geographic details, dates of key events). If done correctly, de-identification is a defensible, repeatable, and auditable process that consistently provides assurance, based on generally accepted and repeatable statistical methodologies, that there is a very small risk of re-identification of any data that are released. 2. The use of one or more techniques designed to make it impossible, or at least more difficult, to identify a particular individual from stored data related to them. The purpose of data anonymization is to protect the privacy of the individual and to make it legal for governments and businesses to share their data without obtaining permission. Such data have proven to be very valuable for researchers, particularly in health care. Data anonymization methods include removing personally identifiable information (e.g., names, addresses, social insurance numbers, Medicare numbers, etc.), or using obfuscation methods such as encryption, hashing, generalization, pseudonymization, and perturbation. Peter McQuilton https://orcid.org/0000-0002-7702-4495 Anonymization de-identification 1. The act of minimally perturbing individual-level data to decrease the probability of discovering an individual's identity.
It involves masking direct identifiers (e.g., name, phone number, address) as well as transforming indirect identifiers that could be used alone or in combination to identify an individual (e.g., birth dates, geographic details, dates of key events). If done correctly, de-identification is a defensible, repeatable, and auditable process that consistently provides assurance, based on generally accepted and repeatable statistical methodologies, that there is a very small risk of re-identification of any data that are released. 2. The use of one or more techniques designed to make it impossible, or at least more difficult, to identify a particular individual from stored data related to them. The purpose of data anonymization is to protect the privacy of the individual and to make it legal for governments and businesses to share their data without obtaining permission. Such data have proven to be very valuable for researchers, particularly in health care. Data anonymization methods include removing personally identifiable information (e.g., names, addresses, social insurance numbers, Medicare numbers, etc.), or using obfuscation methods such as encryption, hashing, generalization, pseudonymization, and perturbation. CASRAI. https://casrai.org/term/de-identification/ Program delivery managers and support function managers, at all levels in an institution who are accountable for the direct delivery and support of programs and services within their domain of business responsibility. Peter McQuilton https://orcid.org/0000-0002-7702-4495 manager Program delivery managers and support function managers, at all levels in an institution who are accountable for the direct delivery and support of programs and services within their domain of business responsibility. CASRAI.
https://casrai.org/term/manager Organizational leadership is: (a) The ability to attract, assess, mobilize and focus energies and talent to work towards a shared purpose aligned with the mandate of the organization; (b) The ability to change culture, processes and priorities within the organization; and, (c) The ability to mentor. Peter McQuilton https://orcid.org/0000-0002-7702-4495 organizational leadership Organizational leadership is: (a) The ability to attract, assess, mobilize and focus energies and talent to work towards a shared purpose aligned with the mandate of the organization; (b) The ability to change culture, processes and priorities within the organization; and, (c) The ability to mentor. CASRAI. https://casrai.org/term/organizational-leadership/ A data mining practice in which large volumes of data are analyzed seeking any possible relationships between data. The traditional scientific method, in contrast, begins with a hypothesis and follows with an examination of the data. Peter McQuilton data dredging A data mining practice in which large volumes of data are analyzed seeking any possible relationships between data. The traditional scientific method, in contrast, begins with a hypothesis and follows with an examination of the data. CASRAI. https://casrai.org/term/data-dredging/ Data that have not been organized into a format and identifiable data structure that makes them easy to access and process. These data can often be searched as long as they are digital, but they are difficult to use for computer analyses. Peter McQuilton https://orcid.org/0000-0002-7702-4495 unstructured data Data that have not been organized into a format and identifiable data structure that makes them easy to access and process. These data can often be searched as long as they are digital, but they are difficult to use for computer analyses. CASRAI. 
https://casrai.org/term/unstructured-data Peter McQuilton 2020-11-16T22:18:14.764875Z presentation slides with interactive exercises Peter McQuilton 2020-10-02T14:23:05.600474Z creating persistent identifiers The process of bringing together, from two or more different sources, data that relate to the same individual, family, place or event. Peter McQuilton data linkage The process of bringing together, from two or more different sources, data that relate to the same individual, family, place or event. CASRAI. https://casrai.org/term/data-linkage/ A type of collection that describes, and points to features of another collection. Peter McQuilton catalogue A type of collection that describes, and points to features of another collection. CASRAI. https://casrai.org/term/catalogue/ The consideration of available monetary resources for a specific time period to serve a specific purpose. Kristina Hettne Simon Hodson Victoria Dominguez Del Angel funding acquisition management The consideration of available monetary resources for a specific time period to serve a specific purpose. KH To be able to use programmatic methods to access a resource's API to query and extract an appropriate subset of data. Peter McQuilton API access Machine actionable search machine access via api To be able to use programmatic methods to access a resource's API to query and extract an appropriate subset of data. PMQ The continuum of data structure that includes unstructured data, semi-structured data, and structured data. Peter McQuilton data structure continuum The continuum of data structure that includes unstructured data, semi-structured data, and structured data. CASRAI. https://casrai.org/term/data-structure-continuum/ Facts, measurements, recordings, records, or observations about the world collected by scientists and others, with a minimum of contextual interpretation.
Peter McQuilton data Facts, measurements, recordings, records, or observations about the world collected by scientists and others, with a minimum of contextual interpretation. CASRAI. https://casrai.org/term/data/ Data that have not been organized into a specialized repository, such as a database, but that nevertheless have associated information, such as metadata, that makes them more amenable to processing than raw data. Semi-structured data lie somewhere between structured and unstructured data. They are not organized in a complex manner that makes sophisticated access and analysis possible. However, they may have information associated with them, such as metadata tagging that allows elements contained to be addressed. Example: A Word document is generally considered to be unstructured data. However, metadata tags could be added in the form of keywords and other metadata that represent the document content and make it easier for that document to be found when people search for those terms — the data are now semi-structured. Nevertheless, the document still lacks the complex organization of a database, so falls short of being fully structured data. Peter McQuilton https://orcid.org/0000-0002-7702-4495 semi-structured data Data that have not been organized into a specialized repository, such as a database, but that nevertheless have associated information, such as metadata, that makes them more amenable to processing than raw data. Semi-structured data lie somewhere between structured and unstructured data. They are not organized in a complex manner that makes sophisticated access and analysis possible. However, they may have information associated with them, such as metadata tagging that allows elements contained to be addressed. Example: A Word document is generally considered to be unstructured data. 
However, metadata tags could be added in the form of keywords and other metadata that represent the document content and make it easier for that document to be found when people search for those terms — the data are now semi-structured. Nevertheless, the document still lacks the complex organization of a database, so falls short of being fully structured data. CASRAI. https://casrai.org/term/semi-structured-data Peter McQuilton 2020-10-01T20:53:37.24517Z book chapter A scientist who conducts activities in: (1) Research, development and analysis (RDA); (2) Managing research; and, (3) Representation and client services. Peter McQuilton https://orcid.org/0000-0002-7702-4495 research scientist A scientist who conducts activities in: (1) Research, development and analysis (RDA); (2) Managing research; and, (3) Representation and client services. CASRAI. https://casrai.org/term/research-scientist A name (not a location) for an entity on digital networks. It provides a system for persistent and actionable identification and interoperable exchange of managed information on digital networks. A DOI is a type of Persistent Identifier (PID) issued by the International DOI Foundation. This permanent identifier is associated with a digital object that permits it to be referenced reliably even if its location and metadata undergo change over time. Peter McQuilton https://orcid.org/0000-0002-7702-4495 digital object identifier A name (not a location) for an entity on digital networks. It provides a system for persistent and actionable identification and interoperable exchange of managed information on digital networks. A DOI is a type of Persistent Identifier (PID) issued by the International DOI Foundation. This permanent identifier is associated with a digital object that permits it to be referenced reliably even if its location and metadata undergo change over time. CASRAI. https://casrai.org/term/digital-object-identifier/ Peter McQuilton F3. 
metadata clearly and explicitly include the identifier of the data they describe A person who is expert in one or more of the information management disciplines that support the effective and efficient management of information. Peter McQuilton information management specialist A person who is expert in one or more of the information management disciplines that support the effective and efficient management of information. CASRAI. https://casrai.org/term/information-management-specialist/ Peter McQuilton A2. metadata are accessible, even when the data are no longer available Identify and engage in dialogue with stakeholders affected by the research, or by making its outputs FAIR. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang stakeholder engagement on societal impact The Principal Investigator (P.I.) is a researcher who has a research leadership role and is the point of contact for a project or partnership that applies the scientific method, historical method, or other research methodology for the advancement of knowledge resulting in independent, objective, high quality, traceable, and reproducible results. The P.I. has primary responsibility for the intellectual direction and integrity of the research or research-related activity, including data production, findings and results, and ensures ethical conduct in all aspects of the research process including but not limited to the treatment of human and animal subjects, conflicts of interest, data acquisition, sharing and ownership, publication practices, responsible authorship, and collaborative research and reporting. While various tasks may be delegated to team members, some of whom may have greater expertise in specific areas, the P.I. is familiar with the various technical and scientific aspects of a project and how they fit together, is able to identify and remediate gaps, and ensure communication within the team and with users of the research data and results. 
Peter McQuilton https://orcid.org/0000-0002-7702-4495 principal investigator The Principal Investigator (P.I.) is a researcher who has a research leadership role and is the point of contact for a project or partnership that applies the scientific method, historical method, or other research methodology for the advancement of knowledge resulting in independent, objective, high quality, traceable, and reproducible results. The P.I. has primary responsibility for the intellectual direction and integrity of the research or research-related activity, including data production, findings and results, and ensures ethical conduct in all aspects of the research process including but not limited to the treatment of human and animal subjects, conflicts of interest, data acquisition, sharing and ownership, publication practices, responsible authorship, and collaborative research and reporting. While various tasks may be delegated to team members, some of whom may have greater expertise in specific areas, the P.I. is familiar with the various technical and scientific aspects of a project and how they fit together, is able to identify and remediate gaps, and ensure communication within the team and with users of the research data and results. CASRAI. https://casrai.org/term/principal-investigator The person responsible for creating the organizational environment culture by providing clear direction and circumstances that allow people to be successful. The program manager is judged on the elements time, cost, and scope, cumulatively for all the projects and operations within the program. Program management decisions are both tactical and strategic in nature. The strategy aspects of these decisions must consider multidimensional impacts beyond the near-term delivery dates of the project. In addition to delivery and execution, the program manager has to also be concerned with the overall health and effectiveness of the program over the long term. 
Peter McQuilton https://orcid.org/0000-0002-7702-4495 program manager The person responsible for creating the organizational environment culture by providing clear direction and circumstances that allow people to be successful. The program manager is judged on the elements time, cost, and scope, cumulatively for all the projects and operations within the program. Program management decisions are both tactical and strategic in nature. The strategy aspects of these decisions must consider multidimensional impacts beyond the near-term delivery dates of the project. In addition to delivery and execution, the program manager has to also be concerned with the overall health and effectiveness of the program over the long term. CASRAI. https://casrai.org/term/program-manager This is a URL. However, instead of pointing directly to the location of an Internet resource, a PURL points to an intermediate resolution service. The PURL resolution service associates the PURL with the actual URL and returns that URL to the client. Peter McQuilton https://orcid.org/0000-0002-7702-4495 persistent uniform resource locator This is a URL. However, instead of pointing directly to the location of an Internet resource, a PURL points to an intermediate resolution service. The PURL resolution service associates the PURL with the actual URL and returns that URL to the client. CASRAI. https://casrai.org/term/persistent-uniform-resource-locator An initiative to create a digital library card catalog for the Web. Dublin Core is made up of 15 metadata elements that offer expanded cataloging information and improved document indexing for search engine programs. 
The 15 metadata elements used by Dublin Core are: title (the name given the resource), creator (the person or organization responsible for the content), subject (the topic covered), description (a textual outline of the content), publisher (those responsible for making the resource available), contributor (those who added to the content), date (when the resource was made available), type (a category for the content), format (how the resource is presented), identifier (numerical identifier for the content such as a URL), source (where the content originally derived from), language (in what language the content is written), relation (how the content relates to other resources, for instance, if it is a chapter in a book), coverage (where the resource is physically located), and rights (a link to a copyright notice). Peter McQuilton https://orcid.org/0000-0002-7702-4495 dublin core An initiative to create a digital library card catalog for the Web. Dublin Core is made up of 15 metadata elements that offer expanded cataloging information and improved document indexing for search engine programs. The 15 metadata elements used by Dublin Core are: title (the name given the resource), creator (the person or organization responsible for the content), subject (the topic covered), description (a textual outline of the content), publisher (those responsible for making the resource available), contributor (those who added to the content), date (when the resource was made available), type (a category for the content), format (how the resource is presented), identifier (numerical identifier for the content such as a URL), source (where the content originally derived from), language (in what language the content is written), relation (how the content relates to other resources, for instance, if it is a chapter in a book), coverage (where the resource is physically located), and rights (a link to a copyright notice). CASRAI. 
https://casrai.org/term/dublin-core/ Yann Le Franc ylefranc 2021-03-17T16:34:14.943152Z data stewardship guideline The FAIR principles are guidelines to improve the Findability, Accessibility, Interoperability, and Reuse of digital assets. The principles emphasise machine-actionability (i.e., the capacity of computational systems to find, access, interoperate, and reuse data with none or minimal human intervention) because humans increasingly rely on computational support to deal with data as a result of the increase in volume, complexity, and creation speed of data. The principles refer to three types of entities: data (or any digital object), metadata (information about that digital object), and infrastructure. Peter McQuilton Peter McQuilton 2020-10-02T14:56:15.031154Z fair principle The FAIR principles are guidelines to improve the Findability, Accessibility, Interoperability, and Reuse of digital assets. The principles emphasise machine-actionability (i.e., the capacity of computational systems to find, access, interoperate, and reuse data with none or minimal human intervention) because humans increasingly rely on computational support to deal with data as a result of the increase in volume, complexity, and creation speed of data. The principles refer to three types of entities: data (or any digital object), metadata (information about that digital object), and infrastructure. https://www.go-fair.org/fair-principles/ Bin for Aptitudes related to Cloud computing environment management. leightonlc aptitudes for cloud computing environment management New datasets obtained by combining data appropriately from a variety of existing files, generating new data products that did not previously exist. Repurposed data result from data wrangling. Peter McQuilton https://orcid.org/0000-0002-7702-4495 repurposed data New datasets obtained by combining data appropriately from a variety of existing files, generating new data products that did not previously exist. 
Repurposed data result from data wrangling. CASRAI. https://casrai.org/term/repurposed-data/ The building blocks of an XML document. Peter McQuilton https://orcid.org/0000-0002-7702-4495 document type definition The building blocks of an XML document. CASRAI. https://casrai.org/term/document-type-definition/ Peter McQuilton I3. (meta)data include qualified references to other (meta)data A persistent identifier is a long-lasting reference to a digital object that gives information about that object regardless of what happens to it. Developed to address link rot, a persistent identifier can be resolved to provide an appropriate representation of an object whether that object changes its online location or goes offline. Peter McQuilton https://orcid.org/0000-0002-7702-4495 persistent identifier A persistent identifier is a long-lasting reference to a digital object that gives information about that object regardless of what happens to it. Developed to address link rot, a persistent identifier can be resolved to provide an appropriate representation of an object whether that object changes its online location or goes offline. CASRAI. https://casrai.org/term/persistent-identifier A person who is studying or has expert knowledge of one or more of the natural or physical sciences. Peter McQuilton https://orcid.org/0000-0002-7702-4495 scientist A person who is studying or has expert knowledge of one or more of the natural or physical sciences. CASRAI. https://casrai.org/term/scientist 1. The act of bringing together smaller components into a single system that functions as one. 2. In the context of information technology: The end result of a process that aims to stitch together different, often disparate, subsystems so that the data contained in each becomes part of a larger, more comprehensive system that, ideally, quickly and easily shares data when needed.
This often requires that organizations build a customized architecture or structure of applications to combine new or existing hardware, software and other communications. Peter McQuilton https://orcid.org/0000-0002-7702-4495 integration 1. The act of bringing together smaller components into a single system that functions as one. 2. In the context of information technology: The end result of a process that aims to stitch together different, often disparate, subsystems so that the data contained in each becomes part of a larger, more comprehensive system that, ideally, quickly and easily shares data when needed. This often requires that organizations build a customized architecture or structure of applications to combine new or existing hardware, software and other communications. CASRAI. https://casrai.org/term/integration/ Bin for types of Knowledge related to High performance computing management. leightonlc knowledge of high performance computing management The act of mentoring around FAIR data stewardship. Peter McQuilton Philippe Rocca-Serra Susanna Sansone https://orcid.org/0000-0002-7702-4495 fair mentoring The act of mentoring around FAIR data stewardship. PMQ The exercise of authority, control and shared decision making (planning, monitoring and enforcement) over the management of data assets. Peter McQuilton data governance The exercise of authority, control and shared decision making (planning, monitoring and enforcement) over the management of data assets. CASRAI. https://casrai.org/term/data-governance/ A defining scheme used for identification of resources (including people and organizations) and the sharing of data across domains, enterprises, and applications. XRI TC will define a Uniform Resource Identifier (URI) scheme and a corresponding Uniform Resource (URN) namespace.n/aCASRAI. 
https://www.oasis-open.org/committees/tc_home.php?wg_abbrev=xri Peter McQuilton extensible resource identifier A defining scheme used for identification of resources (including people and organizations) and the sharing of data across domains, enterprises, and applications. XRI TC will define a Uniform Resource Identifier (URI) scheme and a corresponding Uniform Resource (URN) namespace.n/aCASRAI. https://www.oasis-open.org/committees/tc_home.php?wg_abbrev=xri CASRAI. https://casrai.org/term/extensible-resource-identifier/ In the context of a researcher's activities, impact is the consequence of the research and new knowledge on the advancement of the specialty. Science-based policies, regulations, services and technology transfers are some examples of ways target results can be achieved and impact demonstrated. Peter McQuilton impact In the context of a researcher's activities, impact is the consequence of the research and new knowledge on the advancement of the specialty. Science-based policies, regulations, services and technology transfers are some examples of ways target results can be achieved and impact demonstrated. CASRAI. https://casrai.org/term/impact/ A list used to grant permission matched against credentials. Peter McQuilton access control list A list used to grant permission matched against credentials. CASRAI. https://casrai.org/term/access-control-list/ Learn about data copyright laws, licensing and other legal aspects of data access. Understand the application of those laws at the project level. Celia van Gelder Mateusz Kuzak Yan Wang understand data ownership and access policies Meeting/conference organisation is a project management activity that encompasses all of the steps required to run a meeting or conference.
Peter McQuilton Philippe Rocca-Serra Susanna Sansone https://orcid.org/0000-0002-7702-4495 meeting/conference organisation It can be considered as a negotiated agreement between the customer and vendor which are acceptable to both parties with respect to costs and expectations in order to support the business process. Kristina Hettne Leighton Christiansen Simon Hodson Victoria Dominguez Del Angel SLM service level management It can be considered as a negotiated agreement between the customer and vendor which are acceptable to both parties with respect to costs and expectations in order to support the business process. https://www.techopedia.com/definition/13848/service-level-management-slm [LLC] From an official perspective, a national standard is adopted by a national standards body (e.g., Standards Council of Canada, American National Standards Institute, British Standards Institution) and made available to the public. Practically speaking, however, a national standard is any standard that is widely used and recognized within a country. In this context, even government standards, such as those issued by the Occupational Safety and Health Administration (OSHA), can be considered national standards. Peter McQuilton https://orcid.org/0000-0002-7702-4495 national standard From an official perspective, a national standard is adopted by a national standards body (e.g., Standards Council of Canada, American National Standards Institute, British Standards Institution) and made available to the public. Practically speaking, however, a national standard is any standard that is widely used and recognized within a country. In this context, even government standards, such as those issued by the Occupational Safety and Health Administration (OSHA), can be considered national standards. CASRAI. 
https://casrai.org/term/national-standard/ A governing culture that holds that the public has the right to access the documents and proceedings of government to allow for greater openness, accountability, and engagement. Peter McQuilton https://orcid.org/0000-0002-7702-4495 open government A governing culture that holds that the public has the right to access the documents and proceedings of government to allow for greater openness, accountability, and engagement. CASRAI. https://casrai.org/term/open-government/ access control and management Manage the assessment, implementation and monitoring of secure storage protocols. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang storage security management 1. Exercising authority to provide direction and to undertake, coordinate, and regulate activities in support of achieving this direction and desired outcomes. Governance can be thought of as the role of an organization's board of directors or its equivalent that is focused on defining that organization's purpose and the development of the strategies, objectives, values, and policies that frame how that purpose will be pursued. It includes the development of such things as mission statements, statements of organizational objectives and values, logic models, organizational performance metrics, risk management frameworks, policies and guidelines for financial and operational matters, stakeholder relations, etc. Peter McQuilton governance 1. Exercising authority to provide direction and to undertake, coordinate, and regulate activities in support of achieving this direction and desired outcomes. Governance can be thought of as the role of an organization's board of directors or its equivalent that is focused on defining that organization's purpose and the development of the strategies, objectives, values, and policies that frame how that purpose will be pursued. 
It includes the development of such things as mission statements, statements of organizational objectives and values, logic models, organizational performance metrics, risk management frameworks, policies and guidelines for financial and operational matters, stakeholder relations, etc. CASRAI. https://casrai.org/term/governance/ A low-barrier mechanism for repository interoperability. Data Providers are repositories that expose structured metadata via OAI-PMH. Service Providers then make OAI-PMH service requests to harvest that metadata. OAI-PMH is a set of six verbs or services that are invoked within HTTP. Peter McQuilton open archives initiative protocol for metadata harvesting A low-barrier mechanism for repository interoperability. Data Providers are repositories that expose structured metadata via OAI-PMH. Service Providers then make OAI-PMH service requests to harvest that metadata. OAI-PMH is a set of six verbs or services that are invoked within HTTP. CASRAI. https://casrai.org/term/open-archives-initiative-protocol-for-metadata-harvesting/ A series of computer instructions written in some human readable computer language, usually stored in a text file. Peter McQuilton computer code A series of computer instructions written in some human readable computer language, usually stored in a text file. CASRAI. https://casrai.org/term/computer-code/ Syntactic interoperability defines the structure or format of data exchange and is achieved through tools such as XML or SQL Standards. Peter McQuilton https://orcid.org/0000-0002-7702-4495 syntactic interoperability Syntactic interoperability defines the structure or format of data exchange and is achieved through tools such as XML or SQL Standards. CASRAI. https://casrai.org/term/syntactic-interoperability Bin for Aptitudes related to Preservation costs management. 
leightonlc aptitudes for preservation costs management Peter McQuilton accessibility of digital assets The activity of bringing computer system resources, especially data storage and computing power on demand via Internet. Kristina Hettne Simon Hodson Victoria Dominguez Del Angel cloud computing environment management The activity of bringing computer system resources, especially data storage and computing power on demand via Internet. (modified from wikipedia) [VDA] A set of documents that has a scientific meaning. A corpus can be produced by an individual researcher's activity (including its archival materials), or from laboratory research, field campaign or science and culture heritage project, a survey, etc. Peter McQuilton corpus A set of documents that has a scientific meaning. A corpus can be produced by an individual researcher's activity (including its archival materials), or from laboratory research, field campaign or science and culture heritage project, a survey, etc. CASRAI. https://casrai.org/term/corpus/ Understand how to use FAIR and open research tools and services. Hugh Shanahan the use of fair and open research tools or services The practice of making data available for reuse. This may be done, for example, by depositing the data in a repository, through data publication. Peter McQuilton https://orcid.org/0000-0002-7702-4495 Data dissemination Data posting data sharing The practice of making data available for reuse. This may be done, for example, by depositing the data in a repository, through data publication. CASRAI. https://casrai.org/term/data-sharing/ Understand how the rewards for effort depend on crediting the contributions of researchers and professional groups towards making FAIR outputs. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang appreciate importance of crediting research contributions The process of resolving a reference to useful information by using a globally available system.
Peter McQuilton https://orcid.org/0000-0002-7702-4495 reference resolution The process of resolving a reference to useful information by using a globally available system. CASRAI. https://casrai.org/term/reference-resolution The person who manages or coordinates resources, personnel, facilities, and operating funds-allocations in an organization conducting research, development and analysis (RDA) in the natural and physical sciences. A research manager determines the nature, priority objectives and the resources committed to their achievement within and across the organizations, and evaluates program outputs in relation to organizational objectives and policies. A research manager provides scientific advice on the direction, conduct and management of these programs. A research manager does not personally conduct research development and analysis (RDA), control and coordinate projects, or control and coordinate contracted RDA. Peter McQuilton https://orcid.org/0000-0002-7702-4495 research manager The person who manages or coordinates resources, personnel, facilities, and operating funds-allocations in an organization conducting research, development and analysis (RDA) in the natural and physical sciences. A research manager determines the nature, priority objectives and the resources committed to their achievement within and across the organizations, and evaluates program outputs in relation to organizational objectives and policies. A research manager provides scientific advice on the direction, conduct and management of these programs. A research manager does not personally conduct research development and analysis (RDA), control and coordinate projects, or control and coordinate contracted RDA. CASRAI. https://casrai.org/term/research-manager Combining diverse datasets from disparate sources into one unified dataset or database. Data are accessed and extracted, moved, validated, cleaned, transformed and loaded. 
Peter McQuilton https://orcid.org/0000-0002-7702-4495 data integration Combining diverse datasets from disparate sources into one unified dataset or database. Data are accessed and extracted, moved, validated, cleaned, transformed and loaded. CASRAI. https://casrai.org/term/data-integration Detailed, written instructions to achieve uniformity of the performance of a specific function. Peter McQuilton https://orcid.org/0000-0002-7702-4495 standard operating procedure Detailed, written instructions to achieve uniformity of the performance of a specific function. CASRAI. https://casrai.org/term/standard-operating-procedure Repositories preserve, manage, and provide access to many types of digital materials in a variety of formats. Peter McQuilton https://orcid.org/0000-0002-7702-4495 repository Repositories preserve, manage, and provide access to many types of digital materials in a variety of formats. CASRAI. https://casrai.org/term/repository Understand the concepts underpinning FAIR criteria. Angus Whyte knowledge of theories underlying fair implementation Know the purpose of the public use, choose the proper subjects for the audience and describe the project work for the purpose/audience chosen. Celia van Gelder Mateusz Kuzak Yan Wang document in lay terms Peter McQuilton 2020-10-02T14:54:27.168274Z understanding persistent identifiers A collection of interrelated data often with controlled redundancy, organized according to a scheme to serve one or more applications; the data are stored so that they can be used by several programs without concern for data structures or organization. Peter McQuilton data upload database A collection of interrelated data often with controlled redundancy, organized according to a scheme to serve one or more applications; the data are stored so that they can be used by several programs without concern for data structures or organization. CASRAI. 
https://casrai.org/term/data-upload-database/ A string of characters used to identify or name a resource on the Internet. Such identification enables interaction with representations of the resource over a network, typically the World Wide Web, using specific protocols. MIT data management and publishing Peter McQuilton https://orcid.org/0000-0002-7702-4495 uniform resource identifier A string of characters used to identify or name a resource on the Internet. Such identification enables interaction with representations of the resource over a network, typically the World Wide Web, using specific protocols. MIT data management and publishing CASRAI. https://casrai.org/term/uniform-resource-identifier/ Know how to search for and identify FAIR services or tools that fit project needs. Angus Whyte how to find fair research data tools/services (catalogues) A framework whose primary purpose is to enable information sharing and reuse across the federal government via the standard description and discovery of common data and the promotion of uniform data management practices.n/aCASRAI. https://www.whitehouse.gov/sites/default/files/omb/assets/egov_docs/DRM_2_0_Final.pdf Peter McQuilton data reference model A framework whose primary purpose is to enable information sharing and reuse across the federal government via the standard description and discovery of common data and the promotion of uniform data management practices.n/aCASRAI. https://www.whitehouse.gov/sites/default/files/omb/assets/egov_docs/DRM_2_0_Final.pdf CASRAI. https://casrai.org/term/data-reference-model/ A process by which a scholarly work (such as a paper or a research proposal) is checked by a group of experts in the same field to make sure it meets the necessary standards before it is published or accepted. 
Peter McQuilton https://orcid.org/0000-0002-7702-4495 peer review A process by which a scholarly work (such as a paper or a research proposal) is checked by a group of experts in the same field to make sure it meets the necessary standards before it is published or accepted. CASRAI. https://casrai.org/term/peer-review/ Bin for types of Knowledge related to Provenance information management. leightonlc 2019-10-17T15:17:02.313897Z knowledge of provenance information management 1. A collection of data items arranged for processing by a program. Multiple records are contained in a file or dataset. Typically, records can be of fixed-length or be of variable length with the length information contained within the record. 2. A record (sometimes called a row) is a group of fields (sometimes called columns) within a table that are relevant to a specific entity. Peter McQuilton https://orcid.org/0000-0002-7702-4495 record 1. A collection of data items arranged for processing by a program. Multiple records are contained in a file or dataset. Typically, records can be of fixed-length or be of variable length with the length information contained within the record. 2. A record (sometimes called a row) is a group of fields (sometimes called columns) within a table that are relevant to a specific entity. CASRAI. https://casrai.org/term/record De-anonymization is a reverse engineering process in which de-identified data are cross-referenced with other data sources to re-identify the personally identifiable information. This could occur if a de-identification process had not been successfully performed, or had not been undertaken in the first place. Peter McQuilton https://orcid.org/0000-0002-7702-4495 de-anonymization De-anonymization is a reverse engineering process in which de-identified data are cross-referenced with other data sources to re-identify the personally identifiable information.
This could occur if a de-identification process had not been successfully performed, or had not been undertaken in the first place. CASRAI. https://casrai.org/term/de-anonymization/ HughShanahan 2019-10-18T08:45:02.980527Z application of fair tools and services Requires compliance because of a government statute or regulation, an organization's internal policy, or contractual requirement. Failure to comply with a mandatory standard usually carries a sanction, such as civil or criminal penalties, or loss of employment. Peter McQuilton https://orcid.org/0000-0002-7702-4495 mandatory standard Requires compliance because of a government statute or regulation, an organization's internal policy, or contractual requirement. Failure to comply with a mandatory standard usually carries a sanction, such as civil or criminal penalties, or loss of employment. CASRAI. https://casrai.org/term/mandatory-standard/ An approach to governance that values decisions that can be backed up with data that can be verified. The success of the data-driven approach is reliant upon the quality of the data gathered and the effectiveness of its analysis and interpretation. Peter McQuilton data driven decision management An approach to governance that values decisions that can be backed up with data that can be verified. The success of the data-driven approach is reliant upon the quality of the data gathered and the effectiveness of its analysis and interpretation. CASRAI. https://casrai.org/term/data-driven-decision-management/ 1. A single, well-defined version of all the data entities in an organizational ecosystem. In this context, a golden record is sometimes called the single version of the truth, where truth is understood to mean the reference to which data users can turn when they want to ensure that they have the correct version of a piece of information. The golden record encompasses all the data in every system of record within a particular organization.
A system of record is an information storage and retrieval system that serves as the authoritative source for a particular data element in a system containing multiple sources of the same element. To ensure data integrity, a single system of record must always exist for each and every data element. A well-maintained, current golden record should be a fundamental element of the Master Data Management policy for every enterprise. 2. The word “golden” is sometimes used in information technology to express the importance of some type of source. In the context of virtualization, for example, a golden image is a template for a virtual machine, virtual desktop, servers, or hard disk drive. Peter McQuilton https://orcid.org/0000-0002-7702-4495 golden record 1. A single, well-defined version of all the data entities in an organizational ecosystem. In this context, a golden record is sometimes called the single version of the truth, where truth is understood to mean the reference to which data users can turn when they want to ensure that they have the correct version of a piece of information. The golden record encompasses all the data in every system of record within a particular organization. A system of record is an information storage and retrieval system that serves as the authoritative source for a particular data element in a system containing multiple sources of the same element. To ensure data integrity, a single system of record must always exist for each and every data element. A well-maintained, current golden record should be a fundamental element of the Master Data Management policy for every enterprise. 2. The word “golden” is sometimes used in information technology to express the importance of some type of source. In the context of virtualization, for example, a golden image is a template for a virtual machine, virtual desktop, servers, or hard disk drive. CASRAI. https://casrai.org/term/golden-record/ The reliability and application efficiency of data. 
It is a perception or an assessment of a dataset's fitness to serve its purpose in a given context. Aspects of data quality include: Accuracy, Completeness, Update status, Relevance, Consistency across data sources, Reliability, Appropriate presentation, Accessibility. Within an organization, acceptable data quality is crucial to operational and transactional... Peter McQuilton data quality The reliability and application efficiency of data. It is a perception or an assessment of a dataset's fitness to serve its purpose in a given context. Aspects of data quality include: Accuracy, Completeness, Update status, Relevance, Consistency across data sources, Reliability, Appropriate presentation, Accessibility. Within an organization, acceptable data quality is crucial to operational and transactional... CASRAI. https://casrai.org/term/data-quality/ HughShanahan 2019-10-18T08:38:59.038306Z using fair and open research tools or services Peter McQuilton Peter McQuilton 2020-10-01T20:53:15.691824Z webinar Data in the form of digital materials. Peter McQuilton https://orcid.org/0000-0002-7702-4495 digital data Data in the form of digital materials. CASRAI. https://casrai.org/term/digital-data/ Peter McQuilton F1. (meta)data are assigned a globally unique and persistent identifier An organization's process of defining its strategy or direction in the context of FAIR project management activities, both in the context of current knowledge and unknown factors in the future. Peter McQuilton Philippe Rocca-Serra Susanna Sansone https://orcid.org/0000-0002-7702-4495 strategic/long-term planning An organization's process of defining its strategy or direction in the context of FAIR project management activities, both in the context of current knowledge and unknown factors in the future. AL, and https://en.wikipedia.org/wiki/Strategic_planning [17.10.19] The collective processes conducted to ensure the cleanliness of data. Data are considered clean when they are relatively error-free.
Peter McQuilton data hygiene The collective processes conducted to ensure the cleanliness of data. Data are considered clean when they are relatively error-free. CASRAI. https://casrai.org/term/data-hygiene/ The statistical analysis and assessment of the quality of data values within a dataset for consistency, uniqueness and logic. The data profiling process cannot identify inaccurate data; it can only identify business rules violations and anomalies. The insight gained by data profiling can be used to determine how difficult it will be to use existing data for other purposes. It can also be used to provide metrics to assess data quality and help determine whether or not metadata accurately describes the source data. Profiling tools evaluate the actual content, structure and quality of the data by exploring relationships that exist between value collections both within and across datasets. For example, by examining the frequency distribution of different values for each column in a table, an analyst can gain insight into the type and use of each column. Cross-column analysis can be used to expose embedded value dependencies and inter-table analysis allows the analyst to discover overlapping value sets that represent foreign key relationships between entities. Peter McQuilton data profiling The statistical analysis and assessment of the quality of data values within a dataset for consistency, uniqueness and logic. The data profiling process cannot identify inaccurate data; it can only identify business rules violations and anomalies. The insight gained by data profiling can be used to determine how difficult it will be to use existing data for other purposes. It can also be used to provide metrics to assess data quality and help determine whether or not metadata accurately describes the source data. 
Profiling tools evaluate the actual content, structure and quality of the data by exploring relationships that exist between value collections both within and across datasets. For example, by examining the frequency distribution of different values for each column in a table, an analyst can gain insight into the type and use of each column. Cross-column analysis can be used to expose embedded value dependencies and inter-table analysis allows the analyst to discover overlapping value sets that represent foreign key relationships between entities. CASRAI. https://casrai.org/term/data-profiling/ A collection of data defined by a theme, category, which reflects what is being measured, observed, monitored at the various sites. The Metadata Record is an information resource of business value. Peter McQuilton https://orcid.org/0000-0002-7702-4495 metadata record A collection of data defined by a theme, category, which reflects what is being measured, observed, monitored at the various sites. The Metadata Record is an information resource of business value. CASRAI. https://casrai.org/term/metadata-record/ Data that are used as primary sources to support technical or scientific enquiry, research, scholarship, or artistic activity, and that are used as evidence in the research process and/or are commonly accepted in the research community as necessary to validate research findings and results. All other digital and non-digital content have the potential of becoming research data. Research data may be experimental data, observational data, operational data, third party data, public sector data, monitoring data, processed data, or repurposed data. 
Peter McQuilton https://orcid.org/0000-0002-7702-4495 research data Data that are used as primary sources to support technical or scientific enquiry, research, scholarship, or artistic activity, and that are used as evidence in the research process and/or are commonly accepted in the research community as necessary to validate research findings and results. All other digital and non-digital content have the potential of becoming research data. Research data may be experimental data, observational data, operational data, third party data, public sector data, monitoring data, processed data, or repurposed data. CASRAI. https://casrai.org/term/research-data The medium through which learning has been provided. For example, through a course, slides, video presentation, online documentation, wiki pages or others. Peter McQuilton https://orcid.org/0000-0002-7702-4495 Peter McQuilton 2020-09-22T13:19:57.012225Z learning medium data manager 1. A compilation of core electronic health data submitted by various healthcare providers and organizations, accessible by numerous authorized parties from a number of points of care, possibly even from different jurisdictions. 2. An official health record for an individual that is shared among multiple facilities and agencies. 3. Electronic health records typically include: Contact information, Information about visits to health care professionals, Allergies, Insurance information, Family history, Immunization status, Information about any conditions or diseases, A list of medications, Records of hospitalization, Information about any surgeries or procedures performed. Digitized health information systems are expected to improve efficiency and quality of care and, ultimately, reduce costs. 
The benefits of electronic health records include: The ability to automatically share and update information among different offices and organizations, More efficient storage and retrieval, The ability to share multimedia information, such as medical imaging results, among locations, The ability to link records to sources of relevant and current research, Easier standardization of services and patient care, Provision of decision support systems (DSS) for healthcare professionals, Less redundancy of effort, Lower cost to the medical system once implementation is complete, The governments of many countries are working to ensure that all citizens have standardized electronic health records and that all records include the same types of information. The major barrier for the adoption of electronic health records is cost. Peter McQuilton https://orcid.org/0000-0002-7702-4495 electronic health record 1. A compilation of core electronic health data submitted by various healthcare providers and organizations, accessible by numerous authorized parties from a number of points of care, possibly even from different jurisdictions. 2. An official health record for an individual that is shared among multiple facilities and agencies. 3. Electronic health records typically include: Contact information, Information about visits to health care professionals, Allergies, Insurance information, Family history, Immunization status, Information about any conditions or diseases, A list of medications, Records of hospitalization, Information about any surgeries or procedures performed. Digitized health information systems are expected to improve efficiency and quality of care and, ultimately, reduce costs. 
The benefits of electronic health records include: The ability to automatically share and update information among different offices and organizations, More efficient storage and retrieval, The ability to share multimedia information, such as medical imaging results, among locations, The ability to link records to sources of relevant and current research, Easier standardization of services and patient care, Provision of decision support systems (DSS) for healthcare professionals, Less redundancy of effort, Lower cost to the medical system once implementation is complete, The governments of many countries are working to ensure that all citizens have standardized electronic health records and that all records include the same types of information. The major barrier for the adoption of electronic health records is cost. CASRAI. https://casrai.org/term/electronic-health-record/ Techniques used to deal with parameters having different units and scales. Peter McQuilton https://orcid.org/0000-0002-7702-4495 Data rescaling data scaling Techniques used to deal with parameters having different units and scales. CASRAI. https://casrai.org/term/data-scaling/ Given a data object name, access controls define access relationships between the following metadata: data object name, a user name (or user group, or user role), and access permission. The information can be stored as metadata information associated with each data object. The information can be generated dynamically by applying the access controls of the collection that organizes the data objects. Peter McQuilton access controls Given a data object name, access controls define access relationships between the following metadata: data object name, a user name (or user group, or user role), and access permission. The information can be stored as metadata information associated with each data object. 
The information can be generated dynamically by applying the access controls of the collection that organizes the data objects. CASRAI. https://casrai.org/term/access-controls/ Monitoring the risk to privacy posed by data requests from researchers, and the practices of data custodians in providing data (information governance) to ensure that confidentiality is protected. Such governance requires specialized knowledge of technology, law, and statistical methods. Peter McQuilton https://orcid.org/0000-0002-7702-4495 privacy governance Monitoring the risk to privacy posed by data requests from researchers, and the practices of data custodians in providing data (information governance) to ensure that confidentiality is protected. Such governance requires specialized knowledge of technology, law, and statistical methods. CASRAI. https://casrai.org/term/privacy-governance/ Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang research integrity, attribution, impact awareness Bin for types of Knowledge related to Cloud computing environment management. leightonlc knowledge of cloud computing environment management Literally, “data about data”; data that defines and describes the characteristics of other data, used to improve both business and technical understanding of data and data-related processes. Business metadata includes the names and business definitions of subject areas, entities and attributes, attribute data types and other attribute properties, range descriptions, valid domain values and their definitions. Technical metadata includes physical database table and column names, column properties, and the properties of other database objects, including how data is stored. Process metadata is data that defines and describes the characteristics of other system elements (processes, business rules, programs, jobs, tools, etc.). Data stewardship metadata is data about data stewards, stewardship processes and responsibility assignments. 
Peter McQuilton https://orcid.org/0000-0002-7702-4495 Data documentation metadata Literally, “data about data”; data that defines and describes the characteristics of other data, used to improve both business and technical understanding of data and data-related processes. Business metadata includes the names and business definitions of subject areas, entities and attributes, attribute data types and other attribute properties, range descriptions, valid domain values and their definitions. Technical metadata includes physical database table and column names, column properties, and the properties of other database objects, including how data is stored. Process metadata is data that defines and describes the characteristics of other system elements (processes, business rules, programs, jobs, tools, etc.). Data stewardship metadata is data about data stewards, stewardship processes and responsibility assignments. CASRAI. https://casrai.org/term/metadata/ A file that contains the values in a table as a series of ASCII text lines organized so that each column value is separated by a comma from the next column's value and each row starts a new line. Peter McQuilton Maybe remove as shouldn't have formats here. comma separated values A file that contains the values in a table as a series of ASCII text lines organized so that each column value is separated by a comma from the next column's value and each row starts a new line. CASRAI. https://casrai.org/term/comma-separated-values/ Peter McQuilton identifier Responsible for executing tasks and producing deliverables as outlined in the Project Plan and directed by the Project Manager, at whatever level of effort or participation has been defined for them. Peter McQuilton https://orcid.org/0000-0002-7702-4495 project team member Responsible for executing tasks and producing deliverables as outlined in the Project Plan and directed by the Project Manager, at whatever level of effort or participation has been defined for them. 
CASRAI. https://casrai.org/term/project-team-member Enables identification, location, and retrieval of information resources by users, often including the use of controlled vocabularies for classification and indexing and links to related resources. Reference: DCC/TC3+ Peter McQuilton https://orcid.org/0000-0002-7702-4495 https://github.com/terms4fairskills/FAIRterminology/issues/13 descriptive metadata Enables identification, location, and retrieval of information resources by users, often including the use of controlled vocabularies for classification and indexing and links to related resources. Reference: DCC/TC3+ CASRAI. https://casrai.org/term/descriptive-metadata/ Peter McQuilton Peter McQuilton 2020-10-01T21:03:55.425579Z github repository The process of analyzing multivariate datasets using pattern recognition or other knowledge discovery techniques to identify potentially unknown and potentially meaningful data content, relationships, classification, or trends. Peter McQuilton data mining The process of analyzing multivariate datasets using pattern recognition or other knowledge discovery techniques to identify potentially unknown and potentially meaningful data content, relationships, classification, or trends. CASRAI. https://casrai.org/term/data-mining/ Peter McQuilton https://orcid.org/0000-0002-7702-4495 Data preservation Peter McQuilton 2021-02-17T22:18:22.641817Z data archiving Machine processable specifications which define the structure and syntax of metadata specifications in a formal schema language. Peter McQuilton https://orcid.org/0000-0002-7702-4495 AL 5.12.22 Moved from 'findability of digital assets' as it is definitely not a particular data stewardship guideline. encoding schema Machine processable specifications which define the structure and syntax of metadata specifications in a formal schema language. CASRAI. https://casrai.org/term/encoding-schema/ 1.
In the context of library and archiving communities: Digital archiving is often used interchangeably with digital preservation. 2. In the context of computing: Digital archiving is the process of backup and ongoing maintenance as opposed to strategies for long-term digital preservation. Peter McQuilton https://orcid.org/0000-0002-7702-4495 digital archiving 1. In the context of library and archiving communities: Digital archiving is often used interchangeably with digital preservation. 2. In the context of computing: Digital archiving is the process of backup and ongoing maintenance as opposed to strategies for long-term digital preservation. CASRAI. https://casrai.org/term/digital-archiving/ The process of resolving a PID to a useful state of information about a digital object by using a globally available system. Peter McQuilton https://orcid.org/0000-0002-7702-4495 pid resolution The process of resolving a PID to a useful state of information about a digital object by using a globally available system. CASRAI. https://casrai.org/term/pid-resolution/ Bin for Skills needed for Storage management. leightonlc skills related to storage management A type of data management using repositories. It is the set of policies that govern the organization, control, and properties of the repository such as: required file formats, access control restrictions, integrity, replication, retention, disposition, etc. Peter McQuilton https://orcid.org/0000-0002-7702-4495 data repository management A type of data management using repositories. It is the set of policies that govern the organization, control, and properties of the repository such as: required file formats, access control restrictions, integrity, replication, retention, disposition, etc. CASRAI. https://casrai.org/term/data-repository-management/ A document creation and management specification that builds content reuse into the authoring process.
Peter McQuilton darwin information typing architecture A document creation and management specification that builds content reuse into the authoring process. CASRAI. https://casrai.org/term/darwin-information-typing-architecture/ The discovery of meaningful multidimensional patterns in data. Peter McQuilton analytics The discovery of meaningful multidimensional patterns in data. CASRAI. https://casrai.org/term/analytics/ The personal attributes necessary to perform a task. Peter McQuilton Aptitude is the innate trait or talent that a person brings to a task or situation. It is the quality of being able to do something. An aptitude is not something that can be learned or developed unless it is there to begin with. Laura Molloy, Celia van Gelder:https://docs.google.com/presentation/d/12oNBFix39ZtsLAR4tkqESBXxMU5rVg3cCKoEDG2oK-M/edit#slide=id.p5 data stewardship soft skill Ability to apply the FAIR principles i.e. describe the data with community metadata standard for machine and human, align with existing semantic models (ontologies, controlled vocabularies, ...), get a persistent ID, attribute licence and credit for data creators, legal aspect related to data (ownership, confidentiality, ethics, ...) add data provenance. Kristina Hettne Victoria Dominguez Del Angel Yann Le Franc https://orcid.org/0000-0002-7702-4495 archival documentation of data Ability to apply the FAIR principles i.e. describe the data with community metadata standard for machine and human, align with existing semantic models (ontologies, controlled vocabularies, ...), get a persistent ID, attribute licence and credit for data creators, legal aspect related to data (ownership, confidentiality, ethics, ...) add data provenance. [YLF, VDA, KH] A machine-readable format is a structured format that can be processed by a computer. Such formats can either be intended solely for machine processing (e.g. XML or RDF), or may be both human and machine accessible via appropriate markup (e.g. HTML). 
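As a minimal sketch of the 'machine-readable format' definition above, the following Python example serialises one small metadata record in two structured formats and reads each back without loss. The record fields and values are hypothetical and are not terms from terms4FAIRskills. XML is one of the formats named in the definition; JSON is an additional format assumed here purely for illustration.

```python
import json
import xml.etree.ElementTree as ET

# Hypothetical metadata record; the field names are illustrative only.
record = {
    "identifier": "example-001",
    "title": "Example dataset",
    "license": "CC BY 4.0",
}

# JSON: a structured format that a program can parse back into the same record.
json_text = json.dumps(record, sort_keys=True)
parsed_json = json.loads(json_text)

# XML: one of the formats named in the definition. Each field becomes a child element.
root = ET.Element("dataset")
for key, value in record.items():
    ET.SubElement(root, key).text = value
xml_text = ET.tostring(root, encoding="unicode")

# Parsing the XML string recovers the same field/value pairs.
parsed_xml = {child.tag: child.text for child in ET.fromstring(xml_text)}

print(json_text)
print(xml_text)
```

Both round trips recover the original record, which is the practical sense of "can be processed by a computer" used in the definition.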
Peter McQuilton https://orcid.org/0000-0002-7702-4495 AL 5.12.22: Merged with now-obsolete 'machine readable' (http://purl.obolibrary.org/obo/T4FS_0000256), as we do not need that level of granularity. AL 6.5.22: The original CASRAI term (https://casrai.org/term/machine-readable-format/) has an identical source definition with 'Digital materials', and therefore has been removed. machine-readable format A machine-readable format is a structured format that can be processed by a computer. Such formats can either be intended solely for machine processing (e.g. XML or RDF), or may be both human and machine accessible via appropriate markup (e.g. HTML). Modified by AL from Wikipedia, https://en.wikipedia.org/wiki/Machine-readable_data, accessed 6.5.22 Any information obtained by a person on the understanding that they will not disclose it to others, or obtained in circumstances where it is expected that they will not disclose it. Peter McQuilton confidential information Any information obtained by a person on the understanding that they will not disclose it to others, or obtained in circumstances where it is expected that they will not disclose it. CASRAI. https://casrai.org/term/confidential-information/ The FAIR Cookbook’s recipes are a combination of guidance, technical, hands-on, background and review types to cover the operation steps of FAIR data management, and are classified according to the audience types, to serve all those involved in the data management life cycle. The FAIR Cookbook is for anyone working in the Life Sciences who needs guidance on applying the FAIR Principles in practice. https://orcid.org/0000-0002-7702-4495 fair cookbook recipe The FAIR Cookbook’s recipes are a combination of guidance, technical, hands-on, background and review types to cover the operation steps of FAIR data management, and are classified according to the audience types, to serve all those involved in the data management life cycle. 
The FAIR Cookbook is for anyone working in the Life Sciences who needs guidance on applying the FAIR Principles in practice. https://faircookbook.elixir-europe.org/content/recipes/introduction/FAIR-cookbook-audience.html Evaluation indicators are designed to enable the measurement of the degree of compliance against a guideline using a set of criteria. The goals of such indicators are varied, but a good example would be to improve the re-usability of the digital asset being measured for increased discoverability, interoperability and overall machine actionability. https://github.com/terms4fairskills/FAIRterminology/issues/22 evaluation indicator Evaluation indicators are designed to enable the measurement of the degree of compliance against a guideline using a set of criteria. The goals of such indicators are varied, but a good example would be to improve the re-usability of the digital asset being measured for increased discoverability, interoperability and overall machine actionability. Definition by https://orcid.org/0000-0002-7702-4495 and drawn in part from https://fairplus.github.io/Data-Maturity/docs/Indicators The fairplus dataset maturity model indicators are used within the context of the model, which is intended as a comprehensive reference model for state-of-FAIRness improvement in research datasets. Based on the FAIR guiding principles, the DSM model defines and classifies requirements that constitute an incremental path towards improving FAIRness level for a given research dataset. https://orcid.org/0000-0002-7702-4495 fairplus DSM model indicator fairplus dataset maturity model indicator The fairplus dataset maturity model indicators are used within the context of the model, which is intended as a comprehensive reference model for state-of-FAIRness improvement in research datasets. 
Based on the FAIR guiding principles, the DSM model defines and classifies requirements that constitute an incremental path towards improving FAIRness level for a given research dataset. https://fairplus.github.io/Data-Maturity/ The fairplus DSM content-related indicators relate to what is reported in the Dataset (data) & the Dataset Descriptor (metadata). https://orcid.org/0000-0002-7702-4495 fairplus DSM content-related indicator The fairplus DSM content-related indicators relate to what is reported in the Dataset (data) & the Dataset Descriptor (metadata). https://fairplus.github.io/Data-Maturity/ The fairplus DSM representation and format indicators relate to how the data object & metadata object are represented and formatted. https://orcid.org/0000-0002-7702-4495 AL 5.12.22: Please note that the 'and' in this label is a direct representation of the fairplus DSM model, and not an ontological design choice. fairplus DSM representation and format indicator The fairplus DSM representation and format indicators relate to how the data object & metadata object are represented and formatted.
https://fairplus.github.io/Data-Maturity/ The fairplus DSM hosting-environment capabilities indicators relate to the capabilities of the hosting environment that enables and supports the use of FAIR data https://orcid.org/0000-0002-7702-4495 fairplus DSM hosting-environment capabilities indicator The fairplus DSM hosting-environment capabilities indicators relate to the capabilities of the hosting environment that enables and supports the use of FAIR data https://fairplus.github.io/Data-Maturity/ No representation of Data purposed for FAIR sharing is available https://orcid.org/0000-0002-7702-4495 DSM-0-R2 No representation of Data purposed for FAIR sharing is available https://fairplus.github.io/Data-Maturity/docs/Indicators/ Structured and/or Unstructured Data are organised into Dataset(s) created for the purpose of FAIR sharing https://orcid.org/0000-0002-7702-4495 DSM-1-R2 Structured and/or Unstructured Data are organised into Dataset(s) created for the purpose of FAIR sharing https://fairplus.github.io/Data-Maturity/docs/Indicators/ Project collected Data are organized into structured Dataset(s) and conform to a locally defined Dataset Model https://orcid.org/0000-0002-7702-4495 DSM-2-R2 Project collected Data are organized into structured Dataset(s) and conform to a locally defined Dataset Model https://fairplus.github.io/Data-Maturity/docs/Indicators/ Structured Data are represented as Datasets and conform to relevant Standard Dataset Model(s) for FAIR sharing https://orcid.org/0000-0002-7702-4495 DSM-3-R2 Structured Data are represented as Datasets and conform to relevant Standard Dataset Model(s) for FAIR sharing https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset(s) content is semantically represented using Linked Data Representations conforming to a Semantic Data Model https://orcid.org/0000-0002-7702-4495 DSM-4-R2 Dataset(s) content is semantically represented using Linked Data Representations conforming to a Semantic Data Model
https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset(s) are granularly represented and managed at the Data Element Level (e.g. ISO 11179 MDR standard) https://orcid.org/0000-0002-7702-4495 DSM-5-R2 Dataset(s) are granularly represented and managed at the Data Element Level (e.g. ISO 11179 MDR standard) https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Metadata is NOT formally represented in a structured Dataset Descriptor https://orcid.org/0000-0002-7702-4495 DSM-0-R3 Dataset Metadata is NOT formally represented in a structured Dataset Descriptor https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Metadata is formally represented in the form of an Identifiable Dataset Descriptor https://orcid.org/0000-0002-7702-4495 DSM-1-R0 Dataset Metadata is formally represented in the form of an Identifiable Dataset Descriptor https://fairplus.github.io/Data-Maturity/docs/Indicators/ A representation of the Dataset Descriptor conforming to a relevant General Purpose Metadata Schema is available https://orcid.org/0000-0002-7702-4495 DSM-1-R3 A representation of the Dataset Descriptor conforming to a relevant General Purpose Metadata Schema is available https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Descriptor(s) conforms to or extends a Standard Generic Dataset Descriptor Model to describe and represent structural metadata of Dataset(s) https://orcid.org/0000-0002-7702-4495 DSM-2-R3 Dataset Descriptor(s) conforms to or extends a Standard Generic Dataset Descriptor Model to describe and represent structural metadata of Dataset(s) https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Descriptor(s) use community-defined or domain-specific metadata standard https://orcid.org/0000-0002-7702-4495 DSM-3-R3 Dataset Descriptor(s) use community-defined or domain-specific metadata standard https://fairplus.github.io/Data-Maturity/docs/Indicators/ A Semantic Data Model (Metadata) used for data harmonisation across 
Datasets is formally defined and represented using Linked Data Representations https://orcid.org/0000-0002-7702-4495 DSM-4-R3 A Semantic Data Model (Metadata) used for data harmonisation across Datasets is formally defined and represented using Linked Data Representations https://fairplus.github.io/Data-Maturity/docs/Indicators/ Common Data Elements and their value sets are defined and registered in a managed Metadata Registry https://orcid.org/0000-0002-7702-4495 DSM-5-R3 Common Data Elements and their value sets are defined and registered in a managed Metadata Registry https://fairplus.github.io/Data-Maturity/docs/Indicators/ Contextual Metadata is NOT formally represented in any form https://orcid.org/0000-0002-7702-4495 DSM-0-R1 Contextual Metadata is NOT formally represented in any form https://fairplus.github.io/Data-Maturity/docs/Indicators/ Contextual Metadata is reported at summary level and represented in the Dataset Descriptor https://orcid.org/0000-0002-7702-4495 DSM-1-R1 Contextual Metadata is reported at summary level and represented in the Dataset Descriptor https://fairplus.github.io/Data-Maturity/docs/Indicators/ Contextual Metadata is formally represented in the form of a locally defined Domain Model https://orcid.org/0000-0002-7702-4495 DSM-2-R1 Contextual Metadata is formally represented in the form of a locally defined Domain Model https://fairplus.github.io/Data-Maturity/docs/Indicators/ Contextual Metadata is formally represented and conforms to a standard defined Domain Model if available https://orcid.org/0000-0002-7702-4495 DSM-3-R1 Contextual Metadata is formally represented and conforms to a standard defined Domain Model if available https://fairplus.github.io/Data-Maturity/docs/Indicators/ Contextual Metadata is formally represented by a defined set of Common Data Elements https://orcid.org/0000-0002-7702-4495 DSM-4-R1 Contextual Metadata is formally represented by a defined set of Common Data Elements
https://fairplus.github.io/Data-Maturity/docs/Indicators/ Domain entities are represented by Managed Master Data Objects conforming to a Master Data Model used for data consolidation https://orcid.org/0000-0002-7702-4495 DSM-5-R1 Domain entities are represented by Managed Master Data Objects conforming to a Master Data Model used for data consolidation https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Descriptor (metadata) is NOT available in a Machine Readable Format https://orcid.org/0000-0002-7702-4495 DSM-0-R4 Dataset Descriptor (metadata) is NOT available in a Machine Readable Format https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Descriptor is available in Machine Readable Format https://orcid.org/0000-0002-7702-4495 DSM-1-R4 Dataset Descriptor is available in Machine Readable Format https://fairplus.github.io/Data-Maturity/docs/Indicators/ Contextual Metadata represented in the form of a Domain Model is available in a Human Readable Format https://orcid.org/0000-0002-7702-4495 DSM-2-R4 Contextual Metadata represented in the form of a Domain Model is available in a Human Readable Format https://fairplus.github.io/Data-Maturity/docs/Indicators/ A formal documentation of the adopted Standard Dataset Model is available in a Machine Readable Format https://orcid.org/0000-0002-7702-4495 DSM-3-R4 A formal documentation of the adopted Standard Dataset Model is available in a Machine Readable Format https://fairplus.github.io/Data-Maturity/docs/Indicators/ A Semantic Data Model (Metadata) describing the data is represented in a Machine Readable and Machine Interpretable format https://orcid.org/0000-0002-7702-4495 DSM-4-R4 A Semantic Data Model (Metadata) describing the data is represented in a Machine Readable and Machine Interpretable format https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset(s) are NOT available in a Machine Readable Format https://orcid.org/0000-0002-7702-4495 DSM-0-R5 Dataset(s) are NOT available
in a Machine Readable Format https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset(s) available in Machine Readable Format https://orcid.org/0000-0002-7702-4495 DSM-1-R5 Dataset(s) available in Machine Readable Format https://fairplus.github.io/Data-Maturity/docs/Indicators/ If applicable, Dataset(s) available in non-proprietary Machine Readable Format relevant to the adopted standard Dataset Model https://orcid.org/0000-0002-7702-4495 DSM-3-R5 If applicable, Dataset(s) available in non-proprietary Machine Readable Format relevant to the adopted standard Dataset Model https://fairplus.github.io/Data-Maturity/docs/Indicators/ Datasets are available in a Machine Readable and Machine Interpretable format https://orcid.org/0000-0002-7702-4495 DSM-4-R5 Datasets are available in a Machine Readable and Machine Interpretable format https://fairplus.github.io/Data-Maturity/docs/Indicators/ If applicable, license information and/or permitted use and accessibility to parts of the dataset is formally represented and encoded in a Machine Readable Format https://orcid.org/0000-0002-7702-4495 DSM-4-R6 If applicable, license information and/or permitted use and accessibility to parts of the dataset is formally represented and encoded in a Machine Readable Format https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset(s) are NOT Identifiable via Unique Identifiers https://orcid.org/0000-0002-7702-4495 DSM-0-C0 Dataset(s) are NOT Identifiable via Unique Identifiers https://fairplus.github.io/Data-Maturity/docs/Indicators/ Each Dataset purposed for FAIR sharing is assigned a unique identifier https://orcid.org/0000-0002-7702-4495 DSM-1-C0 Each Dataset purposed for FAIR sharing is assigned a unique identifier https://fairplus.github.io/Data-Maturity/docs/Indicators/ Where applicable, data is structured in the Dataset according to the Tidy Data Principles https://orcid.org/0000-0002-7702-4495 DSM-2-C2 Where applicable, data is structured in the Dataset according
to the Tidy Data Principles https://fairplus.github.io/Data-Maturity/docs/Indicators/ Where applicable, Dataset(s) scope and content are reported in compliance with relevant community-defined Data Reporting Guidelines https://orcid.org/0000-0002-7702-4495 DSM-3-C2 Where applicable, Dataset(s) scope and content are reported in compliance with relevant community-defined Data Reporting Guidelines https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset(s) content is harmonised against a designed-for-purpose Semantic Data Model https://orcid.org/0000-0002-7702-4495 DSM-4-C2 Dataset(s) content is harmonised against a designed-for-purpose Semantic Data Model https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset(s) include Reference Fields that enable joining related datasets https://orcid.org/0000-0002-7702-4495 DSM-2-C3 Dataset(s) include Reference Fields that enable joining related datasets https://fairplus.github.io/Data-Maturity/docs/Indicators/ Where applicable, Dataset Field Names use standard controlled terms as recommended by the adopted Standard https://orcid.org/0000-0002-7702-4495 DSM-3-C3 Where applicable, Dataset Field Names use standard controlled terms as recommended by the adopted Standard https://fairplus.github.io/Data-Maturity/docs/Indicators/ Key Dataset Fields are mapped to Common Data Elements as defined by the Semantic Data Model https://orcid.org/0000-0002-7702-4495 DSM-4-C3 Key Dataset Fields are mapped to Common Data Elements as defined by the Semantic Data Model https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Fields are linked and harmonized against enterprise managed Metadata Elements (e.g. MDR registered Data Elements) https://orcid.org/0000-0002-7702-4495 DSM-5-C3 Dataset Fields are linked and harmonized against enterprise managed Metadata Elements (e.g. 
MDR registered Data Elements) https://fairplus.github.io/Data-Maturity/docs/Indicators/ Where applicable, Dataset Field Values are standardized against a locally defined Data Dictionary within and across related Datasets https://orcid.org/0000-0002-7702-4495 DSM-2-C4 Where applicable, Dataset Field Values are standardized against a locally defined Data Dictionary within and across related Datasets https://fairplus.github.io/Data-Maturity/docs/Indicators/ Where applicable, Dataset Field Values are standardised against domain-specific Controlled Terminologies and/or Ontology Terms https://orcid.org/0000-0002-7702-4495 DSM-3-C4 Where applicable, Dataset Field Values are standardised against domain-specific Controlled Terminologies and/or Ontology Terms https://fairplus.github.io/Data-Maturity/docs/Indicators/ Values for key Domain Entities reported in the Dataset(s) are standardised and assigned unique Standard Identifiers https://orcid.org/0000-0002-7702-4495 DSM-4-C4 Values for key Domain Entities reported in the Dataset(s) are standardised and assigned unique Standard Identifiers https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Field values are controlled and managed via enterprise managed Reference and Master Data https://orcid.org/0000-0002-7702-4495 DSM-5-C4 Dataset Field values are controlled and managed via enterprise managed Reference and Master Data https://fairplus.github.io/Data-Maturity/docs/Indicators/ Study/Project-Level metadata is NOT reported https://orcid.org/0000-0002-7702-4495 DSM-0-C1 Study/Project-Level metadata is NOT reported https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Descriptor includes Descriptive Study/Project-Level summary information https://orcid.org/0000-0002-7702-4495 DSM-1-C1 Dataset Descriptor includes Descriptive Study/Project-Level summary information https://fairplus.github.io/Data-Maturity/docs/Indicators/ A locally defined Domain Model contains concepts that describe the overall
project/study design, the relationships between the Datasets, the key entities reported within the Datasets and the relationships between them. https://orcid.org/0000-0002-7702-4495 DSM-2-C1 A locally defined Domain Model contains concepts that describe the overall project/study design, the relationships between the Datasets, the key entities reported within the Datasets and the relationships between them. https://fairplus.github.io/Data-Maturity/docs/Indicators/ Where applicable, study-level / experimental metadata is reported in compliance with relevant Minimum Information Reporting Guidelines https://orcid.org/0000-0002-7702-4495 DSM-3-C1 Where applicable, study-level / experimental metadata is reported in compliance with relevant Minimum Information Reporting Guidelines https://fairplus.github.io/Data-Maturity/docs/Indicators/ A Semantic Data Model includes study design Data Elements and the relationships between them https://orcid.org/0000-0002-7702-4495 DSM-4-C1 A Semantic Data Model includes study design Data Elements and the relationships between them https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Descriptor does NOT include a reference to the Dataset it describes https://orcid.org/0000-0002-7702-4495 DSM-0-C2 Dataset Descriptor does NOT include a reference to the Dataset it describes https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Descriptor includes Identifying & Descriptive Dataset-Level metadata https://orcid.org/0000-0002-7702-4495 DSM-1-C2 Dataset Descriptor includes Identifying & Descriptive Dataset-Level metadata https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Descriptor contains access information for the Dataset https://orcid.org/0000-0002-7702-4495 DSM-1-C3 Dataset Descriptor contains access information for the Dataset https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Descriptor includes reference to related Datasets and if applicable the relevant joining Dataset Fields
https://orcid.org/0000-0002-7702-4495 DSM-2-C5 Dataset Descriptor includes reference to related Datasets and if applicable the relevant joining Dataset Fields https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Descriptor references a standard license under which the dataset can be re-used. https://orcid.org/0000-0002-7702-4495 DSM-3-C7 Dataset Descriptor references a standard license under which the dataset can be re-used. https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Descriptor includes Field-level Metadata as prescribed by a locally defined Dataset Model https://orcid.org/0000-0002-7702-4495 DSM-2-C6 Dataset Descriptor includes Field-level Metadata as prescribed by a locally defined Dataset Model https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Descriptor includes standard-compliant Field-level Metadata as prescribed by the adopted standard Dataset Model. https://orcid.org/0000-0002-7702-4495 DSM-3-C6 Dataset Descriptor includes standard-compliant Field-level Metadata as prescribed by the adopted standard Dataset Model. 
https://fairplus.github.io/Data-Maturity/docs/Indicators/ The Semantic Data Model includes a pre-defined set of Common Data Elements reported within the Datasets and the relationships between them https://orcid.org/0000-0002-7702-4495 DSM-4-C5 The Semantic Data Model includes a pre-defined set of Common Data Elements reported within the Datasets and the relationships between them https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Descriptor includes Value-level Metadata or if applicable includes a reference to a locally defined Data Dictionary https://orcid.org/0000-0002-7702-4495 DSM-2-C7 Dataset Descriptor includes Value-level Metadata or if applicable includes a reference to a locally defined Data Dictionary https://fairplus.github.io/Data-Maturity/docs/Indicators/ Value Level Metadata includes Resolvable Identifiers for Controlled and/or Standard Terms reported in the Dataset https://orcid.org/0000-0002-7702-4495 DSM-3-C5 Value Level Metadata includes Resolvable Identifiers for Controlled and/or Standard Terms reported in the Dataset https://fairplus.github.io/Data-Maturity/docs/Indicators/ Data or metadata is hosted in non-accessible storage (e.g., personal desktop, local file system or archive) https://orcid.org/0000-0002-7702-4495 DSM-0-H1 Data or metadata is hosted in non-accessible storage (e.g., personal desktop, local file system or archive) https://fairplus.github.io/Data-Maturity/docs/Indicators/ Metadata hosting environment stores and maintains an identifiable Dataset Descriptor for each identifiable Dataset https://orcid.org/0000-0002-7702-4495 DSM-1-H1 Metadata hosting environment stores and maintains an identifiable Dataset Descriptor for each identifiable Dataset https://fairplus.github.io/Data-Maturity/docs/Indicators/ The Data hosting environment's Persistence Model is aligned with a locally defined Domain Model to enable interpretation of Datasets https://orcid.org/0000-0002-7702-4495 DSM-2-H1 The Data hosting environment's 
Persistence Model is aligned with a locally defined Domain Model to enable interpretation of Datasets https://fairplus.github.io/Data-Maturity/docs/Indicators/ The Data hosting environment's Persistence Model is aligned with a standard Dataset model or compliant with relevant Minimum Information Reporting Guidelines https://orcid.org/0000-0002-7702-4495 DSM-3-H1 The Data hosting environment's Persistence Model is aligned with a standard Dataset model or compliant with relevant Minimum Information Reporting Guidelines https://fairplus.github.io/Data-Maturity/docs/Indicators/ Data Hosting environment stores data in a relevant linked data store (e.g., Triple Store or Graph Database) https://orcid.org/0000-0002-7702-4495 DSM-4-H1 Data Hosting environment stores data in a relevant linked data store (e.g., Triple Store or Graph Database) https://fairplus.github.io/Data-Maturity/docs/Indicators/ Data or metadata hosted in an accessible resource but with no retrieval capability https://orcid.org/0000-0002-7702-4495 DSM-0-H2 Data or metadata hosted in an accessible resource but with no retrieval capability https://fairplus.github.io/Data-Maturity/docs/Indicators/ The Dataset and its Descriptor are indexed and retrievable (in the same or separate hosting environments) via unique and persistent identifiers https://orcid.org/0000-0002-7702-4495 DSM-1-H2 The Dataset and its Descriptor are indexed and retrievable (in the same or separate hosting environments) via unique and persistent identifiers https://fairplus.github.io/Data-Maturity/docs/Indicators/ Retrieval of the Dataset and the Dataset Descriptor utilises a standardized communication protocol that is open, free and universally implementable https://orcid.org/0000-0002-7702-4495 DSM-1-H3 Retrieval of the Dataset and the Dataset Descriptor utilises a standardized communication protocol that is open, free and universally implementable https://fairplus.github.io/Data-Maturity/docs/Indicators/ Metadata hosting environment
provides programmatic access and retrieval (API) for the Dataset Descriptor https://orcid.org/0000-0002-7702-4495 DSM-2-H2 Metadata hosting environment provides programmatic access and retrieval (API) for the Dataset Descriptor https://fairplus.github.io/Data-Maturity/docs/Indicators/ For each dataset, the hosting environment maintains a globally unique, persistent and resolvable identifier for access and retrieval https://orcid.org/0000-0002-7702-4495 DSM-3-H2 For each dataset, the hosting environment maintains a globally unique, persistent and resolvable identifier for access and retrieval https://fairplus.github.io/Data-Maturity/docs/Indicators/ If applicable, Dataset hosting environment offers dataset-level authentication and authorisation capabilities https://orcid.org/0000-0002-7702-4495 DSM-3-H4 If applicable, Dataset hosting environment offers dataset-level authentication and authorisation capabilities https://fairplus.github.io/Data-Maturity/docs/Indicators/ Data Hosting Environment provides semantic querying capability https://orcid.org/0000-0002-7702-4495 DSM-4-H2 Data Hosting Environment provides semantic querying capability https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset's Metadata is NOT searchable via keywords or elements within the Descriptor https://orcid.org/0000-0002-7702-4495 DSM-0-H3 Dataset's Metadata is NOT searchable via keywords or elements within the Descriptor https://fairplus.github.io/Data-Maturity/docs/Indicators/ Metadata hosting environment offers the capability to browse and search contents of the Dataset Descriptor https://orcid.org/0000-0002-7702-4495 DSM-1-H4 Metadata hosting environment offers the capability to browse and search contents of the Dataset Descriptor https://fairplus.github.io/Data-Maturity/docs/Indicators/ Data hosting environment offers the capability to browse and search related Datasets https://orcid.org/0000-0002-7702-4495 DSM-2-H3 Data hosting environment offers the capability to browse and 
search related Datasets https://fairplus.github.io/Data-Maturity/docs/Indicators/ Data Hosting environment utilises controlled terms and/or ontology terms to search within Dataset content. https://orcid.org/0000-0002-7702-4495 DSM-3-H3 Data Hosting environment utilises controlled terms and/or ontology terms to search within Dataset content. https://fairplus.github.io/Data-Maturity/docs/Indicators/ Data Hosting Environment provides semantic querying capability https://orcid.org/0000-0002-7702-4495 DSM-4-H2 Data Hosting Environment provides semantic querying capability https://fairplus.github.io/Data-Maturity/docs/Indicators/ https://orcid.org/0000-0002-7702-4495 AL 15.3.22: Added for FAIR Cookbook integration. data scientist https://orcid.org/0000-0002-7702-4495 AL 15.3.22: Added for FAIR Cookbook integration. ontologist https://orcid.org/0000-0002-7702-4495 AL 15.3.22: Added for FAIR Cookbook integration. terminology manager https://orcid.org/0000-0002-7702-4495 AL 16.3.22: Added as part of FAIR Cookbook integration. software engineer https://orcid.org/0000-0002-7702-4495 AL 16.3.22: Added as part of FAIR Cookbook integration. system administrator https://orcid.org/0000-0002-7702-4495 AL 16.3.22: Added as part of FAIR Cookbook integration. Useful for such roles to be able to understand the cost of e.g. making things FAIR (and how to deal with it), but also the benefits. procurement officer Metadata creation concerns the creation of data that provides information about characteristics, aspects or context of other data entities such as a dataset or other digital or analog object. https://orcid.org/0000-0002-7702-4495 metadata creation Metadata creation concerns the creation of data that provides information about characteristics, aspects or context of other data entities such as a dataset or other digital or analog object. AL 22.3.22, and see also FRBR User Tasks at: https://sites.google.com/site/metadatastandards/chapter-6/6-3-frbr-user-tasks.
Obsolete Class example to be eventually removed example to be eventually removed failed exploratory term The term was used in an attempt to structure part of the ontology but in retrospect failed to do a good job Person:Alan Ruttenberg failed exploratory term metadata complete Class has all its metadata, but is either not guaranteed to be in its final location in the asserted IS_A hierarchy or refers to another class that is not complete. metadata complete organizational term Term created to ease viewing/sort terms for development purpose, and will not be included in a release organizational term ready for release Class has undergone final review, is ready for use, and will be included in the next release. Any class lacking "ready_for_release" should be considered likely to change place in hierarchy, have its definition refined, or be obsoleted in the next release. Those classes deemed "ready_for_release" will also be derived from a chain of ancestor classes that are also "ready_for_release." ready for release metadata incomplete Class is being worked on; however, the metadata (including definition) are not complete or sufficiently clear to the branch editors. metadata incomplete uncurated Nothing done yet beyond assigning a unique class ID and proposing a preferred term. uncurated pending final vetting All definitions, placement in the asserted IS_A hierarchy and required minimal metadata are complete. The class is awaiting a final review by someone other than the term editor. pending final vetting placeholder removed placeholder removed terms merged An editor note should explain what were the merged terms and the reason for the merge. terms merged term imported This is to be used when the original term has been replaced by a term imported from another ontology. An editor note should indicate what is the URI of the new term to use. term imported term split This is to be used when a term has been split in two or more new terms.
An editor note should indicate the reason for the split and indicate the URIs of the new terms created. term split universal Hard to give a definition for. Intuitively a "natural kind" rather than a collection of any old things, which a class is able to be, formally. At the meta level, universals are defined as positives, are disjoint with their siblings, have single asserted parents. Alan Ruttenberg A Formal Theory of Substances, Qualities, and Universals, http://ontology.buffalo.edu/bfo/SQU.pdf universal defined class A defined class is a class that is defined by a set of logically necessary and sufficient conditions but is not a universal. "definitions", in some readings, always are given by necessary and sufficient conditions. So one must be careful (and this is difficult sometimes) to distinguish between defined classes and universals. Alan Ruttenberg defined class named class expression A named class expression is a logical expression that is given a name. The name can be used in place of the expression. named class expressions are used in order to have more concise logical definition but their extensions may not be interesting classes on their own. In languages such as OWL, with no provisions for macros, these show up as actual classes. Tools may wish not to show them as such, and to replace uses of the macros with their expansions. Alan Ruttenberg named class expression to be replaced with external ontology term Terms with this status should eventually be replaced with a term from another ontology. Alan Ruttenberg group:OBI to be replaced with external ontology term requires discussion A term that is metadata complete, has been reviewed, and problems have been identified that require discussion before release. Such a term requires editor note(s) to identify the outstanding issues. Alan Ruttenberg group:OBI requires discussion The term was added to the ontology on the assumption it was in scope, but it turned out later that it was not.
This obsolescence reason should be used conservatively. Typical valid examples are: unnecessary grouping classes in disease ontologies, a phenotype term added on the assumption it was a disease. https://github.com/information-artifact-ontology/ontology-metadata/issues/77 https://orcid.org/0000-0001-5208-3432 out of scope