Junhyong Kim Stephen A Fisher Ontology for RNA sequencing (ORNASEQ) The following ontology is provided by Stephen A Fisher and Junhyong Kim to annotate next-generation sequencing experiments performed on RNA. English language definitions of what NCI means by the concept. These are limited to 1024 characters. They may also include information about the definition's source and attribution in a form that can easily be interpreted by software. The official OBI definition, explaining the meaning of a class or property. Shall be Aristotelian, formalized and normalized. Can be augmented with colloquial definitions. The official definition, explaining the meaning of a class or property. Shall be Aristotelian, formalized and normalized. Can be augmented with colloquial definitions. 2012-04-05: Barry Smith The official OBI definition, explaining the meaning of a class or property: 'Shall be Aristotelian, formalized and normalized. Can be augmented with colloquial definitions' is terrible. Can you fix to something like: A statement of necessary and sufficient conditions explaining the meaning of an expression referring to a class or property. Alan Ruttenberg Your proposed definition is a reasonable candidate, except that it is very common that necessary and sufficient conditions are not given. Mostly they are necessary, occasionally they are necessary and sufficient or just sufficient. Often they use terms that are not themselves defined and so they effectively can't be evaluated by those criteria. On the specifics of the proposed definition: We don't have definitions of 'meaning' or 'expression' or 'property'. For 'reference' in the intended sense I think we use the term 'denotation'. For 'expression', I think we you mean symbol, or identifier. For 'meaning' it differs for class and property. For class we want documentation that let's the intended reader determine whether an entity is instance of the class, or not. For property we want documentation that let's the intended reader determine, given a pair of potential relata, whether the assertion that the relation holds is true. The 'intended reader' part suggests that we also specify who, we expect, would be able to understand the definition, and also generalizes over human and computer reader to include textual and logical definition. Personally, I am more comfortable weakening definition to documentation, with instructions as to what is desirable. We also have the outstanding issue of how to aim different definitions to different audiences. A clinical audience reading chebi wants a different sort of definition documentation/definition from a chemistry trained audience, and similarly there is a need for a definition that is adequate for an ontologist to work with. PERSON:Daniel Schober GROUP:OBI:<http://purl.obolibrary.org/obo/obi> definition definition textual definition An administrative note intended for its editor. It may not be included in the publication version of the ontology, so it should contain nothing necessary for end users to understand the ontology. PERSON:Daniel Schober GROUP:OBI:<http://purl.obfoundry.org/obo/obi> editor note editor note Name of editor entering the term in the file. The term editor is a point of contact for information regarding the term. The term editor may be, but is not always, the author of the definition, which may have been worked upon by several people 20110707, MC: label update to term editor and definition modified accordingly. See http://code.google.com/p/information-artifact-ontology/issues/detail?id=115. 20110707, MC: label update to term editor and definition modified accordingly. See https://github.com/information-artifact-ontology/IAO/issues/115. PERSON:Daniel Schober GROUP:OBI:<http://purl.obolibrary.org/obo/obi> definition editor definition editor term editor term editor An alternative name for a class or property which means the same thing as the preferred name (semantically equivalent) PERSON:Daniel Schober GROUP:OBI:<http://purl.obolibrary.org/obo/obi> alternative term formal citation, e.g. identifier in external database to indicate / attribute source(s) for the definition. Free text indicate / attribute source(s) for the definition. EXAMPLE: Author Name, URI, MeSH Term C04, PUBMED ID, Wiki uri on 31.01.2007 PERSON:Daniel Schober Discussion on obo-discuss mailing-list, see http://bit.ly/hgm99w GROUP:OBI:<http://purl.obolibrary.org/obo/obi> definition source definition source imported from label a core relation that holds between a part and its whole part of A relation between a planned process and a continuant participating in that process. The presence of the continuant at the end of the process is explicitly specified in the objective specification which the process realizes the concretization of. is_specified_output_of a relation between an independent continuant (the bearer) and a specifically dependent continuant (the dependent), in which the dependent specifically depends on the bearer for its existence bearer of a relation between a continuant and a process, in which the continuant is somehow involved in the process participates in a relation between an independent continuant (the bearer) and a role, in which the role specifically depends on the bearer for its existence has role x overlaps y if and only if there exists some z such that x has part z and z part of y overlaps inverse of starts with starts A mereological relationship or a topological relationship mereotopologically related to BFO 2 Reference: In all areas of empirical inquiry we encounter general terms of two sorts. First are general terms which refer to universals or types:animaltuberculosissurgical procedurediseaseSecond, are general terms used to refer to groups of entities which instantiate a given universal but do not correspond to the extension of any subuniversal of that universal because there is nothing intrinsic to the entities in question by virtue of which they – and only they – are counted as belonging to the given group. Examples are: animal purchased by the Emperortuberculosis diagnosed on a Wednesdaysurgical procedure performed on a patient from Stockholmperson identified as candidate for clinical trial #2056-555person who is signatory of Form 656-PPVpainting by Leonardo da VinciSuch terms, which represent what are called ‘specializations’ in [81 Entity doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilites. For example Werner Ceusters 'portions of reality' include 4 sorts, entities (as BFO construes them), universals, configurations, and relations. It is an open question as to whether entities as construed in BFO will at some point also include these other portions of reality. See, for example, 'How to track absolutely everything' at http://www.referent-tracking.com/_RTU/papers/CeustersICbookRevised.pdf entity BFO 2 Reference: Continuant entities are entities which can be sliced to yield parts only along the spatial dimension, yielding for example the parts of your table which we call its legs, its top, its nails. ‘My desk stretches from the window to the door. It has spatial parts, and can be sliced (in space) in two. With respect to time, however, a thing is a continuant.’ [60, p. 240 Continuant doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilites. For example, in an expansion involving bringing in some of Ceuster's other portions of reality, questions are raised as to whether universals are continuants continuant BFO 2 Reference: every occurrent that is not a temporal or spatiotemporal region is s-dependent on some independent continuant that is not a spatial region BFO 2 Reference: s-dependence obtains between every process and its participants in the sense that, as a matter of necessity, this process could not have existed unless these or those participants existed also. A process may have a succession of participants at different phases of its unfolding. Thus there may be different players on the field at different times during the course of a football game; but the process which is the entire game s-depends_on all of these players nonetheless. Some temporal parts of this process will s-depend_on on only some of the players. Occurrent doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilites. An example would be the sum of a process and the process boundary of another process. Simons uses different terminology for relations of occurrents to regions: Denote the spatio-temporal location of a given occurrent e by 'spn[e]' and call this region its span. We may say an occurrent is at its span, in any larger region, and covers any smaller region. Now suppose we have fixed a frame of reference so that we can speak not merely of spatio-temporal but also of spatial regions (places) and temporal regions (times). The spread of an occurrent, (relative to a frame of reference) is the space it exactly occupies, and its spell is likewise the time it exactly occupies. We write 'spr[e]' and `spl[e]' respectively for the spread and spell of e, omitting mention of the frame. occurrent b is an independent continuant = Def. b is a continuant which is such that there is no c and no t such that b s-depends_on c at t. (axiom label in BFO2 Reference: [017-002]) independent continuant p is a process = Def. p is an occurrent that has temporal proper parts and for some time t, p s-depends_on some material entity at t. (axiom label in BFO2 Reference: [083-003]) BFO 2 Reference: The realm of occurrents is less pervasively marked by the presence of natural units than is the case in the realm of independent continuants. Thus there is here no counterpart of ‘object’. In BFO 1.0 ‘process’ served as such a counterpart. In BFO 2.0 ‘process’ is, rather, the occurrent counterpart of ‘material entity’. Those natural – as contrasted with engineered, which here means: deliberately executed – units which do exist in the realm of occurrents are typically either parasitic on the existence of natural units on the continuant side, or they are fiat in nature. Thus we can count lives; we can count football games; we can count chemical reactions performed in experiments or in chemical manufacturing. We cannot count the processes taking place, for instance, in an episode of insect mating behavior.Even where natural units are identifiable, for example cycles in a cyclical process such as the beating of a heart or an organism’s sleep/wake cycle, the processes in question form a sequence with no discontinuities (temporal gaps) of the sort that we find for instance where billiard balls or zebrafish or planets are separated by clear spatial gaps. Lives of organisms are process units, but they too unfold in a continuous series from other, prior processes such as fertilization, and they unfold in turn in continuous series of post-life processes such as post-mortem decay. Clear examples of boundaries of processes are almost always of the fiat sort (midnight, a time of death as declared in an operating theater or on a death certificate, the initiation of a state of war) process realizable entity b is a specifically dependent continuant = Def. b is a continuant & there is some independent continuant c which is not a spatial region and which is such that b s-depends_on c at every time t during the course of b’s existence. (axiom label in BFO2 Reference: [050-003]) Specifically dependent continuant doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilites. We're not sure what else will develop here, but for example there are questions such as what are promises, obligation, etc. specifically dependent continuant BFO 2 Reference: One major family of examples of non-rigid universals involves roles, and ontologies developed for corresponding administrative purposes may consist entirely of representatives of entities of this sort. Thus ‘professor’, defined as follows,b instance_of professor at t =Def. there is some c, c instance_of professor role & c inheres_in b at t.denotes a non-rigid universal and so also do ‘nurse’, ‘student’, ‘colonel’, ‘taxpayer’, and so forth. (These terms are all, in the jargon of philosophy, phase sortals.) By using role terms in definitions, we can create a BFO conformant treatment of such entities drawing on the fact that, while an instance of professor may be simultaneously an instance of trade union member, no instance of the type professor role is also (at any time) an instance of the type trade union member role (any more than any instance of the type color is at any time an instance of the type length).If an ontology of employment positions should be defined in terms of roles following the above pattern, this enables the ontology to do justice to the fact that individuals instantiate the corresponding universals – professor, sergeant, nurse – only during certain phases in their lives. role b is a generically dependent continuant = Def. b is a continuant that g-depends_on one or more other entities. (axiom label in BFO2 Reference: [074-001]) generically dependent continuant BFO 2 Reference: Material entities (continuants) can preserve their identity even while gaining and losing material parts. Continuants are contrasted with occurrents, which unfold themselves in successive temporal parts or phases [60 BFO 2 Reference: Object, Fiat Object Part and Object Aggregate are not intended to be exhaustive of Material Entity. Users are invited to propose new subcategories of Material Entity. BFO 2 Reference: ‘Matter’ is intended to encompass both mass and energy (we will address the ontological treatment of portions of energy in a later version of BFO). A portion of matter is anything that includes elementary particles among its proper or improper parts: quarks and leptons, including electrons, as the smallest particles thus far discovered; baryons (including protons and neutrons) at a higher level of granularity; atoms and molecules at still higher levels, forming the cells, organs, organisms and other material entities studied by biologists, the portions of rock studied by geologists, the fossils studied by paleontologists, and so on.Material entities are three-dimensional entities (entities extended in three spatial dimensions), as contrasted with the processes in which they participate, which are four-dimensional entities (entities extended also along the dimension of time).According to the FMA, material entities may have immaterial entities as parts – including the entities identified below as sites; for example the interior (or ‘lumen’) of your small intestine is a part of your body. BFO 2.0 embodies a decision to follow the FMA here. material entity High molecular weight, linear polymers, composed of nucleotides containing deoxyribose and linked by phosphodiester bonds; DNA contain the genetic information of organisms. deoxyribonucleic acid Any constitutionally or isotopically distinct atom, molecule, ion, ion pair, radical, radical ion, complex, conformer etc., identifiable as a separately distinguishable entity. molecular entity A material entity of anatomical origin (part of or deriving from an organism) that has as its parts a maximally connected cell compartment surrounded by a plasma membrane. cell A class of large neuroglial (macroglial) cells in the central nervous system - the largest and most numerous neuroglial cells in the brain and spinal cord. Astrocytes (from 'star' cells) are irregularly shaped with many long processes, including those with 'end feet' which form the glial (limiting) membrane and directly and indirectly contribute to the blood-brain barrier. They regulate the extracellular ionic and chemical environment, and 'reactive astrocytes' (along with microglia) respond to injury. astrocyte A cell from the thermogenic form of adipose tissue found in many species, particularly in newborns and hibernating mammals, but also in lesser amounts in adults of other mammals including humans. Brown fat is capable of rapid liberation of energy and seems to be important in the maintenance of body temperature immediately after birth and upon waking from hibernation. brown fat cell The basic cellular unit of nervous tissue. Each neuron consists of a body, an axon, and dendrites. Their purpose is to receive, conduct, and transmit impulses in the nervous system. neuron Cardiac muscle cells are striated muscle cells that are responsible for heart contraction. In mammals, the contractile fiber resembles those of skeletal muscle but are only one third as large in diameter, are richer in sarcoplasm, and contain centrally located instead of peripheral nuclei. cardiac muscle cell Any neuron that is part of a central nervous system. central nervous system neuron date the sequencing run was performed GROUP: IRIDA Ontology (Emma) sequencing run date Pre-filled, ready-to-use reagent cartridges. Used to produce improved chemistry, cluster density and read length as well as improve quality (Q) scores. Reagent components are encoded to interact with the sequencing system to validate compatibility with user-defined applications. GROUP: IRIDA Ontology (Emma) sequencing kit Packaged kits (containing adapters, indexes, enzymes, buffers etc), tailored for specific sequencing workflows, which allow the simplified preparation of sequencing-ready libraries for small genomes, amplicons, and plasmids. GROUP: IRIDA Ontology (Emma) and http://applications.illumina.com/applications/sequencing/ngs-library-prep.html library preparation kit organism datum A location, relative to cellular compartments and structures, occupied by a macromolecular machine when it carries out a molecular function. There are two ways in which the gene ontology describes locations of gene products: (1) relative to cellular structures (e.g., cytoplasmic side of plasma membrane) or compartments (e.g., mitochondrion), and (2) the stable macromolecular complexes of which they are parts (e.g., the ribosome). cellular_component A semiautonomous, self replicating organelle that occurs in varying numbers, shapes, and sizes in the cytoplasm of virtually all eukaryotic cells. It is notably the site of tissue respiration. mitochondrion A neuron projection that has a short, tapering, morphology. Dendrites receive and integrate signals from other neurons or from sensory stimuli, and conduct nerve impulses towards the axon or the cell body. In most neurons, the impulse is conveyed from dendrites to axon via the cell body, but in some types of unipolar neuron, the impulse does not travel via the cell body. dendrite The portion of a cell bearing surface projections such as axons, dendrites, cilia, or flagella that includes the nucleus, but excludes all cell projections. cell body Any constituent part of a cell, the basic structural and functional unit of all organisms. cell part Software is a plan specification composed of a series of instructions that can be interpreted by or directly executed by a processing unit. see sourceforge tracker discussion at http://sourceforge.net/tracker/index.php?func=detail&aid=1958818&group_id=177891&atid=886178 PERSON: Alan Ruttenberg PERSON: Bjoern Peters PERSON: Chris Stoeckert PERSON: Melanie Courtot GROUP: OBI software A model number is an information content entity specifically borne by catalogs, design specifications, advertising materials, inventory systems and similar that is about manufactured objects of the same class. The model number is an alternative term for the class. The manufactered objects may or may not also bear the model number. Model numbers can be encoded in a variety of other information objects, such as bar codes, numerals, or patterns of dots. manufactered items may have more than one model number, sometimes by rebranding, or because companies are sold and the products issued new model numbers Person: Alan Ruttenberg model number a data item is an information content entity that is intended to be a truthful statement about something (modulo, e.g., measurement precision or other systematic errors) and is constructed/acquired by a method which reliably tends to produce (approximately) truthful statements. 2/2/2009 Alan and Bjoern discussing FACS run output data. This is a data item because it is about the cell population. Each element records an event and is typically further composed a set of measurment data items that record the fluorescent intensity stimulated by one of the lasers. 2009-03-16: data item deliberatly ambiguous: we merged data set and datum to be one entity, not knowing how to define singular versus plural. So data item is more general than datum. 2009-03-16: removed datum as alternative term as datum specifically refers to singular form, and is thus not an exact synonym. 2014-03-31: See discussion at http://odontomachus.wordpress.com/2014/03/30/aboutness-objects-propositions/ JAR: datum -- well, this will be very tricky to define, but maybe some information-like stuff that might be put into a computer and that is meant, by someone, to denote and/or to be interpreted by some process... I would include lists, tables, sentences... I think I might defer to Barry, or to Brian Cantwell Smith JAR: A data item is an approximately justified approximately true approximate belief PERSON: Alan Ruttenberg PERSON: Chris Stoeckert PERSON: Jonathan Rees data data item An information content entity that is a mark(s) or character(s) used as a conventional representation of another entity. 20091104, MC: this needs work and will most probably change 2014-03-31: We would like to have a deeper analysis of 'mark' and 'sign' in the future (see https://github.com/information-artifact-ontology/IAO/issues/154). PERSON: James A. Overton PERSON: Jonathan Rees based on Oxford English Dictionary symbol A generically dependent continuant that is about some thing. 2014-03-10: The use of "thing" is intended to be general enough to include universals and configurations (see https://groups.google.com/d/msg/information-ontology/GBxvYZCk1oc/-L6B5fSBBTQJ). information_content_entity 'is_encoded_in' some digital_entity in obi before split (040907). information_content_entity 'is_encoded_in' some physical_document in obi before split (040907). Previous. An information content entity is a non-realizable information entity that 'is encoded in' some digital or physical entity. PERSON: Chris Stoeckert OBI_0000142 information content entity a scalar measurement datum is a measurement datum that is composed of two parts, numerals and a unit label. 2009-03-16: we decided to keep datum singular in scalar measurement datum, as in this case we explicitly refer to the singular form Would write this as: has_part some 'measurement unit label' and has_part some numeral and has_part exactly 2, except for the fact that this won't let us take advantage of OWL reasoning over the numbers. Instead use has measurment value property to represent the same. Use has measurement unit label (subproperty of has_part) so we can easily say that there is only one of them. PERSON: Alan Ruttenberg PERSON: Melanie Courtot scalar measurement datum An information content entity whose concretizations indicate to their bearer how to realize them in a process. 2009-03-16: provenance: a term realizable information entity was proposed for OBI (OBI_0000337) , edited by the PlanAndPlannedProcess branch. Original definition was "is the specification of a process that can be concretized and realized by an actor" with alternative term "instruction".It has been subsequently moved to IAO where the objective for which the original term was defined was satisfied with the definitionof this, different, term. 2013-05-30 Alan Ruttenberg: What differentiates a directive information entity from an information concretization is that it can have concretizations that are either qualities or realizable entities. The concretizations that are realizable entities are created when an individual chooses to take up the direction, i.e. has the intention to (try to) realize it. 8/6/2009 Alan Ruttenberg: Changed label from "information entity about a realizable" after discussions at ICBO Werner pushed back on calling it realizable information entity as it isn't realizable. However this name isn't right either. An example would be a recipe. The realizable entity would be a plan, but the information entity isn't about the plan, it, once concretized, *is* the plan. -Alan PERSON: Alan Ruttenberg PERSON: Bjoern Peters directive information entity A plan specification which describes the inputs and output of mathematical functions as well as workflow of execution for achieving an predefined objective. Algorithms are realized usually by means of implementation as computer programs for execution by automata. Philippe Rocca-Serra PlanAndPlannedProcess Branch OBI_0000270 adapted from discussion on OBI list (Matthew Pocock, Christian Cocos, Alan Ruttenberg) algorithm A data format specification is the information content borne by the document published defining the specification. Example: The ISO document specifying what encompasses an XML document; The instructions in a XSD file 2009-03-16: provenance: term imported from OBI_0000187, which had original definition "A data format specification is a plan which organizes information. Example: The ISO document specifying what encompasses an XML document; The instructions in a XSD file" PERSON: Alan Ruttenberg PlanAndPlannedProcess Branch OBI branch derived OBI_0000187 data format specification A directive information entity with action specifications and objective specifications as parts that, when concretized, is realized in a process in which the bearer tries to achieve the objectives by taking the actions specified. 2009-03-16: provenance: a term a plan was proposed for OBI (OBI_0000344) , edited by the PlanAndPlannedProcess branch. Original definition was " a plan is a specification of a process that is realized by an actor to achieve the objective specified as part of the plan". It has been subsequently moved to IAO where the objective for which the original term was defined was satisfied with the definitionof this, different, term. 2014-03-31: A plan specification can have other parts, such as conditional specifications. Alternative previous definition: a plan is a set of instructions that specify how an objective should be achieved Alan Ruttenberg OBI Plan and Planned Process branch OBI_0000344 plan specification A measurement datum is an information content entity that is a recording of the output of a measurement such as produced by a device. 2/2/2009 is_specified_output of some assay? person:Chris Stoeckert OBI_0000305 group:OBI measurement datum A scalar measurement datum that is the result of measuring a temporal interval 2009/09/28 Alan Ruttenberg. Fucoidan-use-case Person:Alan Ruttenberg time measurement datum A symbol that is part of a CRID and that is sufficient to look up a record from the CRID's registry. PERSON: Alan Ruttenberg PERSON: Bill Hogan PERSON: Bjoern Peters PERSON: Melanie Courtot CRID symbol Original proposal from Bjoern, discussions at IAO calls centrally registered identifier symbol A software application is software that can be directly executed by some processing unit. PERSON: Melanie Courtot PERSON: Michel Dumontier https://github.com/information-artifact-ontology/IAO/issues/80 software application Homo sapiens A physical object, or electronic counterpart, that is characterized by containing writing which is meant to be human-readable. Document An organizational header for concepts representing mostly abstract entities. Conceptual Entity Anything (e.g., a document) providing permanent evidence of or information about past events. Record One or more characters used to identify, name, or characterize the nature, properties, or contents of a thing. Identifier A control number unique to an object. It is used to identify the object among other objects in a collection. Accession Number A set of characters used as a code that is unique in the context or the system for which it is created. It serves as a means of identification and reference (often instead of a name) for an entity, person, thing, function, procedure, activity, variable, or body of data. Unique Identifier A processual entity that realizes a plan which is the concretization of a plan specification. 'Plan' includes a future direction sense. That can be problematic if plans are changed during their execution. There are however implicit contingencies for protocols that an agent has in his mind that can be considered part of the plan, even if the agent didn't have them in mind before. Therefore, a planned process can diverge from what the agent would have said the plan was before executing it, by adjusting to problems encountered during execution (e.g. choosing another reagent with equivalent properties, if the originally planned one has run out.) We are only considering successfully completed planned processes. A plan may be modified, and details added during execution. For a given planned process, the associated realized plan specification is the one encompassing all changes made during execution. This means that all processes in which an agent acts towards achieving some objectives is a planned process. Bjoern Peters branch derived planned process a role realized through the process of supplying materials such as animal subjects, reagents or other materials used in an investigation. Supplier role is a special kind of service, e.g. biobank PERSON:Jennifer Fostel material provider role supplier material supplier role a role inhering in a material entity that is realized when characteristics or responses elicited by the substance are used for comparison or reference. Person:Jennifer Fostel reference substance OBI reference substance role Is a material entity that is created or changed during material processing. PERSON: Alan Ruttenberg processed material A planned process with the objective to produce information about the material entity that is the evaluant, by physically examining it or its proxies. 12/3/12: BP: the reference to the 'physical examination' is included to point out that a prediction is not an assay, as that does not require physical examiniation. PlanAndPlannedProcess Branch measuring scientific observation OBI branch derived assay a role which inheres in a molecular entity and is realized by the process of recording or registering a stimulus. 19feb2009. not clear we need this term. originally if came from microarrays -- the probes on the array are termed detectors in some instances One that detects, especially a mechanical, electrical, or chemical device that automatically identifies and records or registers a stimulus, such as an environmental change in pressure or temperature, an electric signal, or radiation from a radioactive material. http://www.answers.com/topic/detector 19feb2009 detector reagent role A planned process which results in physical changes in a specified input material PERSON: Bjoern Peters PERSON: Frank Gibson PERSON: Jennifer Fostel PERSON: Melanie Courtot PERSON: Philippe Rocca Serra material transformation OBI branch derived material processing A role that is realized through the execution of a study design in which the bearer of the role participates and in which data about that bearer is collected. A participant can realize both "specimen role" and "participant under investigation role" at the same time. However "participant under investigation role" is distinct from "specimen role", since a specimen could somehow be involved in an investigation without being the thing that is under investigation. GROUP: Role Branch OBI participant under investigation role a study personnel role played by a party who is accountable for the execution of a study component and can make decisions about the conduct of the study Person: Jennifer Fostel responsible party OBI responsible party role a responsible party role played by a person responsible for the overall conduct of a study Person: Jennifer Fostel principal investigator principal investigator role a personnel role played by a party who executes a component of the study plan; this can occur before, during, after or outside the study timeline "executes the study plan" includes the suppliers and manufacturers of reagents and other materials used in the study Person:Jennifer Fostel worker OBI worker role a population is a collection of individuals from the same taxonomic class living, counted or sampled at a particular site or in a particular area 1/28/2013, BP, on the call it was raised that we may want to switch to an external ontology for all populatin terms: http://code.google.com/p/popcomm-ontology/ PERSON: Philippe Rocca-Serra adapted from Oxford English Dictionnary population a reference participant role realized by equivalent treatment of participants Person:Jennifer Fostel biological replicate OBI biological replicate role A role borne by an entity and that is realized in a process that is part of an investigation in which an objective is achieved. These processes include, among others: planning, overseeing, funding, reviewing. Implementing a study means carrying out or performing the study and providing reagents or other materials used in the study and other tasks without which the study would not happen. Philly2013: Historically, this role would have been borne only by humans or organizations. However, we now also want to enable representing investigations run by robot scientists such as ADAM (King et al, Science, 2009) GROUP: Role Branch investigator OBI investigation agent role a reference subject role which inheres in an organism or entity of organismal origin so that the characteristics or responses of the participant playing the reference participant role are used for comparison or reference Jennifer Fostel reference participant OBI reference subject role An entity that can bear roles, has members, and has a set of organization rules. Members of organizations are either organizations themselves or individual people. Members can bear specific organization member roles that are determined in the organization rules. The organization rules also determine how decisions are made on behalf of the organization by the organization members. BP: The definition summarizes long email discussions on the OBI developer, roles, biomaterial and denrie branches. It leaves open if an organization is a material entity or a dependent continuant, as no consensus was reached on that. The current placement as material is therefore temporary, in order to move forward with development. Here is the entire email summary, on which the definition is based: 1) there are organization_member_roles (president, treasurer, branch editor), with individual persons as bearers 2) there are organization_roles (employer, owner, vendor, patent holder) 3) an organization has a charter / rules / bylaws, which specify what roles there are, how they should be realized, and how to modify the charter/rules/bylaws themselves. It is debatable what the organization itself is (some kind of dependent continuant or an aggregate of people). This also determines who/what the bearer of organization_roles' are. My personal favorite is still to define organization as a kind of 'legal entity', but thinking it through leads to all kinds of questions that are clearly outside the scope of OBI. Interestingly enough, it does not seem to matter much where we place organization itself, as long as we can subclass it (University, Corporation, Government Agency, Hospital), instantiate it (Affymetrix, NCBI, NIH, ISO, W3C, University of Oklahoma), and have it play roles. This leads to my proposal: We define organization through the statements 1 - 3 above, but without an 'is a' statement for now. We can leave it in its current place in the is_a hierarchy (material entity) or move it up to 'continuant'. We leave further clarifications to BFO, and close this issue for now. PERSON: Alan Ruttenberg PERSON: Bjoern Peters PERSON: Philippe Rocca-Serra PERSON: Susanna Sansone GROUP: OBI organization technical replicate role is realized when two portions from one evaluant are used in replicate runs of an assay Person: Jennifer Fostel technical replicate technical replicate role A plan specification which has sufficient level of detail and quantitative information to communicate it between investigation agents, so that different investigation agents will reliably be able to independently reproduce the process. PlanAndPlannedProcess Branch OBI branch derived + wikipedia (http://en.wikipedia.org/wiki/Protocol_%28natural_sciences%29) protocol PCR is the process in which a DNA polymerase is used to amplify a piece of DNA by in vitro enzymatic replication. As PCR progresses, the DNA thus generated is itself used as a template for replication. This sets in motion a chain reaction in which the DNA template is exponentially amplified. OBI Plan PCR adapted from wikipedai polymerase chain reaction DNA sequencing is a sequencing process which uses deoxyribonucleic acid as input and results in a the creation of DNA sequence information artifact using a DNA sequencer instrument. Philippe Rocca-Serra OBI Branch derived DNA sequencing A planned process with the objective of collecting a specimen. Note: definition is in specimen creation objective which is defined as an objective to obtain and store a material entity for potential use as an input during an investigation. Philly2013: A specimen collection can have as part a material entity acquisition, such as ordering from a bank. The distinction is that specimen collection necessarily involves the creation of a specimen role. However ordering cell lines cells from ATCC for use in an investigation is NOT a specimen collection, because the cell lines already have a specimen role. Philly2013: The specimen_role for the specimen is created during the specimen collection process. label changed to 'specimen collection process' on 10/27/2014, details see tracker: http://sourceforge.net/p/obi/obi-terms/716/ Bjoern Peters specimen collection specimen collection process is a process which results in the creation of a library from fragments of DNA using cloning vectors or oligonucleotides with the role of adaptors. Philippe Rocca-Serra library construction library preparation is a collection of short paired tags from the two ends of DNA fragments are extracted and covalently linked as ditag constructs Philippe Rocca-Serra mate-paired library paired-end tag (PET) library adapted from information provided by Solid web site paired-end library A device in which a measure function inheres. GROUP:OBI Philly workshop OBI measurement device A person or organization that has a manufacturer role manufacturer A device with a separation function realized in a planed process material separation device is a role which inheres in a person or organization and is realized in in a planned process which provides access to training, materials or execution of protocols for an organization or person PERSON:Helen Parkinson service provider role A material entity that is designed to perform a function in a scientific investigation, but is not a reagent. 2012-12-17 JAO: In common lab usage, there is a distinction made between devices and reagents that is difficult to model. Therefore we have chosen to specifically exclude reagents from the definition of "device", and are enumerating the types of roles that a reagent can perform. 2013-6-5 MHB: The following clarifications are outcomes of the May 2013 Philly Workshop. Reagents are distinguished from devices that also participate in scientific techniques by the fact that reagents are chemical or biological in nature and necessarily participate in some chemical interaction or reaction during the realization of their experimental role. By contrast, devices do not participate in such chemical reactions/interactions. Note that there are cases where devices use reagent components during their operation, where the reagent-device distinction is less clear. For example: (1) An HPLC machine is considered a device, but has a column that holds a stationary phase resin as an operational component. This resin qualifies as a device if it participates purely in size exclusion, but bears a reagent role that is realized in the running of a column if it interacts electrostatically or chemically with the evaluant. The container the resin is in (“the column”) considered alone is a device. So the entire column as well as the entire HPLC machine are devices that have a reagent as an operating part. (2) A pH meter is a device, but its electrode component bears a reagent role in virtue of its interacting directly with the evaluant in execution of an assay. (3) A gel running box is a device that has a metallic lead as a component that participates in a chemical reaction with the running buffer when a charge is passed through it. This metallic lead is considered to have a reagent role as a component of this device realized in the running of a gel. In the examples above, a reagent is an operational component of a device, but the device itself does not realize a reagent role (as bearing a reagent role is not transitive across the part_of relation). In this way, the asserted disjointness between a reagent and device holds, as both roles are never realized in the same bearer during execution of an assay. PERSON: Helen Parkinson instrument OBI development call 2012-12-17. device A measurement datum that representing the primary structure of a macromolecule(it's sequence) sometimes associated with an indicator of confidence of that measurement. Person:Chris Stoeckert GROUP: OBI sequence data a reference substance role that is borne by a material entity with a known amount which is mixed into the evaluant of assays for quality control or data normalization purposes PERSON: Chris Stoeckert, Jie Zheng, Bjoern Peter MO_937 spike_quality_control spike-in quality control role A time measurement datum that is the result of measurement of age of an organism note that we are currently defining subtypes of age measurement datum that specify when the age is relative to, e.g. planting, as we don't have adequate temporal predicates yet. life of bearer doesn't imply organism this assay measures time not developmental stage. we recognize that development can take different time periods under different conditions such as media / temperature age as a quality is dubious; we plan to revisit stages in development are currently handled with controlled vocabulary, such as 2-somite stage PERSON: Alan Ruttenberg, Chris Stoeckert, Jie Zheng MO_178 Age age measurement datum An age measurement datum that is the result of the measurement of the age of an organism since birth, the process of emergence and separation of offspring from the mother. PERSON:Chris Stoeckert, Jie Zheng MO_710 birth age since birth measurement datum An organism that is bred to have some uniform behavioral, morphological, physiological, or genetic characteristics with similarly bred organisms Bjoern Peters, Helen Parkinson, Philippe Rocca-Serra, Jie Zheng, Chris Stoeckert cultivar ecotype strain MO_9 StrainOrLine, MO_71 Ecotype, MO_124 Cultivar selectively maintained organism An assay in which sequencing technology (e.g. Solexa/454) is used to generate RNA sequence, analyse the transcibed regions of the genome, and or to quantitate transcript abundance PERSON: James Malone transcription profiling by high throughput sequencing EFO_0002770 transcription profiling by high throughput sequencing RNA-seq assay A specimen primarily composed of a cell or cells collected from a multicellular organism or a cell culture. Discussed on obi call Jan 23, 2017. To improve cell specimen that include single cell specimen. Details see tracker: https://sourceforge.net/p/obi/obi-terms/828/ PERSON: Chris Stoeckert, Jie Zheng, Alexander Diehl MO_612 cell cell specimen An enzymatic amplification which amplifies nucleic acid sequence by making many copies off the same template. PERSON: Chris Stoeckert, Jie Zheng MO_997 linear_amplification linear amplification A CRID symbol denotes a specimen and used to distinguish one specimen from another in an investigation. Person: Chris Stoeckert, Jie Zheng specimen ID NIAID GSCID-BRC metadata working group specimen identifier An assay that incorporates Paired-End Tags and sequencing technology to determine transcripts, gene structures, and gene expressions. Person: Venkat Malladi, Chris Stoeckert, Jie Zheng RNA-PET RNA-PET assay Ruan, et al. Genome wide full-length transcript analysis using 5' and 3' paired-end-tag next generation sequencing (RNA-PET). Methods Mol Biol. 2012;809:535-62. [PMID:22113299] transcript analysis by paired-end tag sequencing A library preparation that results in the creation of a library of the 5' and 3' ends of DNA or cDNA fragments using adaptors and endonucleases. The preparation may or may not include cloning process. Person: Venkat Malladi, Jie Zheng Venkat Malladi, Jie Zheng paired-end library preparation An organization that provides sequence determination service PERSON: Chris Stoeckert, Jie Zheng NIAID GSCID-BRC metadata working group sequencing facility organization A person who collects the specimen PERSON: Chris Stoeckert, Jie Zheng NIAID GSCID-BRC metadata working group specimen collector An information content entity that specifies a value within a classification scheme or on a quantitative scale. This term is currently a descendant of 'information content entity', which requires that it 'is about' something. A value specification of '20g' for a measurement data item of the mass of a particular mouse 'is about' the mass of that mouse. However there are cases where a value specification is not clearly about any particular. In the future we may change 'value specification' to remove the 'is about' requirement. PERSON:Bjoern Peters value specification a planned process which consists in running a set of samples as a pool in one single instrument run of data acquisition process while retaining the ability to associate individual results to each of the individual input samples thanks to the use of a multiplex identifier, introduced during the ligation step of the individual library preparation and specific to a given sample. PERSON:Philippe Rocca-Serra OBI sequencing library multiplexing A multiplexing sequence identifier is a nucleic acid sequence which is used in a ligation step of library preparation process to allow pooling of samples while maintaining ability to identify individual source material and creation of a multiplexed library PERSON:Philippe Rocca-Serra OBI multiplexing sequence identifier A cell specimen that contains only one cell. Requested by Sirarat Sarntivijai (EBI). Details see tracker: https://sourceforge.net/p/obi/obi-terms/828/ PERSON: Jie Zheng, Alexander Diehl PERSON: Jie Zheng, Alexander Diehl single cell specimen A data format specification that is a binary file containing base calling data. These files are created by sequencing machines during the act of sequencing, and contain data about each of the nucleotide clusters on a sequencing flow cell. Stephen A. Fisher, Junhyong Kim, Dan Berrios BCL format A processed material developed by the External RNA Controls Consortium (ERCC) that consists of 92 transcripts, derived and traceable from NIST-certified DNA plasmids. Stephen A. Fisher, Junhyong Kim, Dan Berrios https://www.thermofisher.com/order/catalog/product/4456740 ERCC RNA spike-in A processed material that consists of transcript isoforms of 7 spike-in RNA variant (SIRV) genes and possibly the 92 External RNA Controls Consortium (ERCC) RNA spike-in genes. Stephen A. Fisher, Junhyong Kim, Dan Berrios https://www.lexogen.com/sirvs/design/ SIRV RNA spike-in An accession number that applies to a database record for an individual sample object in an NCBI Sequence Read Archive (SRA) submission. These records are imported from the NCBI BioSample database. Stephen Fisher, Dan Berrios SRA sample accession number SRS identifier An age measurement datum that is the result of the measurement of the age of a cell since cultured (the process of seeding cells onto a culture dish). Stephen A. Fisher, Junhyong Kim, Dan Berrios age of culture age since culture seeding measurement datum An algorithm used to assign aligned sequence reads, resulting from a reference genome transcriptome alignment algorithm, to sequence features (e.g. genes or transcripts). Stephen A. Fisher, Junhyong Kim, Dan Berrios alignment counting algorithm A software application that implements an alignment counting algorithm, used to count the overlap of aligned sequencing reads with genes. Stephen A. Fisher, Junhyong Kim, Dan Berrios doi:10.1093/bioinformatics/btu638 alignment counting application An algorithm that assigns nucleotides to temporal or spatial peaks generated by a detector, such as peaks of light intensities generated by a DNA sequencer Stephen A. Fisher, Junhyong Kim, Dan Berrios base calling algorithm A base-calling application that implments a base-calling algorithm to convert BCL (binary base call) files, (e.g., those generated by Illumina sequencing systems) to standard FASTQ file formats for downstream analysis. Stephen A. Fisher, Junhyong Kim, Dan Berrios base calling application A base calling application that implments a base calling algorithm to convert BCL (binary base call) files generated by Illumina sequencing systems to standard FASTQ file formats for downstream analysis. Stephen A. Fisher, Junhyong Kim, Dan Berrios https://support.illumina.com/sequencing/sequencing_software/bcl2fastq-conversion-software.html bcl2fastq software application A data item that is the date when a process was initiated. Chris Stoeckert, Daniel Berrios, Stephen A. Fisher date process started A central registered identifier symbol that denotes a specific study in dbGaP. Stephen A. Fisher, Junhyong Kim, Dan Berrios dbGaP accession number doi:10.1038/ng1007-1181 dbGaP study identifier A worker role played by the person who runs the enzymatic amplifying process. Stephen A. Fisher, Junhyong Kim, Dan Berrios material amplification role A worker role played by the person who performs the library preparation process to generate a sequencing library for a sample to be sequenced. Stephen A. Fisher, Junhyong Kim, Dan Berrios material sequencing library preparation role A data item that is the number of PCR cycles used during the construction of a sequencing library. Stephen A. Fisher, Junhyong Kim, Dan Berrios library construction PCR cycles https://en.wikipedia.org/wiki/Polymerase_chain_reaction number of PCR cycles during library construction A data item that is the number of times an amplification reaction happened. Stephen A. Fisher, Junhyong Kim, Dan Berrios number of rounds of amplification A reference substance role that is played by a reference genome when used during sequence alignment. Stephen A. Fisher, Junhyong Kim, Dan Berrios reference genome role An algorithm that attempts to align a nucleic acid sequence to a reference genome and transcriptome. Stephen A. Fisher, Junhyong Kim, Dan Berrios reference genome-transcriptome alignment algorithm A measurement datum that is the result of the measurement of the number of bases in a DNA or RNA sequence. Stephen A. Fisher, Junhyong Kim, Dan Berrios read length, read length measurement datum sequence read length measurement datum A scalar measurement datum that indicates the amount of sequencing library used as input for a sequencer. Stephen A. Fisher, Junhyong Kim library input amount sequencing library input quantity measurement datum A cDNA library that is a collection of short tags from only one end of DNA fragments. Stephen A. Fisher, Junhyong Kim, Dan Berrios single-end library A scalar measurement datum that indicates the amount of specimen collected. Stephen A. Fisher, Junhyong Kim, Dan Berrios harvest quantity specimen harvest quantity A data item that indicates the dilution of spike-in added to a specimen. Stephen A. Fisher, Junhyong Kim, Dan Berrios spike-in dilution factor A reference genome transcriptome alignment algorithm that is a standalone RNA-seq alignment algorithm that uses uncompressed suffix arrays and a mapping algorithm similar to those used in large-scale genome alignment tools to align RNA-seq reads to a genomic reference. Stephen A. Fisher, Junhyong Kim, Dan Berrios https://www.ncbi.nlm.nih.gov/pubmed/23104886 star algorithm A sequencing assay that incorporates single-end reads and sequencing technology to determine transcripts, gene structures, and gene expressions. Stephen A. Fisher, Junhyong Kim, Dan Berrios https://www.illumina.com/science/technology/next-generation-sequencing/paired-end-vs-single-read-sequencing.html transcript analysis by single-end sequencing An alignment counting algorithm that assigns gene features to genomic alignments using a hierarchical assignment scheme, which allows simultaneous quantification of multiple feature types or annotation levels without repeatedly assigning reads. Stephen A. Fisher, Junhyong Kim http://kim.bio.upenn.edu/software/verse.shtml or http://biorxiv.org/content/early/2016/05/14/053306 verse algorithm A material separation device also commerically known as Isoraft, that is used to isolate single cells. Stephen A. Fisher, Junhyong Kim, Dan Berrios https://en.wikipedia.org/wiki/Microrafts microraft A device that is a laboratory tool commonly used in chemistry, biology and medicine to transport a measured volume of liquid, often as a media dispenser. Stephen A. Fisher, Junhyong Kim, Dan Berrios https://en.wikipedia.org/wiki/Pipette pipette A material entity that is an individual living system, such as animal, plant, bacteria or virus, that is capable of replicating or reproducing, growth and maintenance in the right environment. An organism may be unicellular or made up, like humans, of many billions of cells divided into specialized tissues and organs. 10/21/09: This is a placeholder term, that should ideally be imported from the NCBI taxonomy, but the high level hierarchy there does not suit our needs (includes plasmids and 'other organisms') 13-02-2009: OBI doesn't take position as to when an organism starts or ends being an organism - e.g. sperm, foetus. This issue is outside the scope of OBI. GROUP: OBI Biomaterial Branch WEB: http://en.wikipedia.org/wiki/Organism organism A material entity that has the specimen role. Note: definition is in specimen creation objective which is defined as an objective to obtain and store a material entity for potential use as an input during an investigation. PERSON: James Malone PERSON: Philippe Rocca-Serra GROUP: OBI Biomaterial Branch specimen A flow_cytometer_sorter is a flow_cytometer that analyzes and separates or sorts particles passing through (based on properties measured during analysis) to collect cells of interest. John Quinn Melanie Courtot http://www.flocyte.com/FRTP/Resources/flow_cytometry_glossary.htm flow cytometer sorter A DNA sequencer is an instrument that determines the order of deoxynucleotides in deoxyribonucleic acid sequences. Trish Whetzel MO DNA sequencer the use of a chemical or biochemical means to infer the sequence of a biomaterial has_output should be sequence of input; we don't have sequence well defined yet PlanAndPlannedProcess Branch OBI branch derived sequencing assay the use of enzymes to increase the number of molecules of a biomaterial PERSON:Kevin Clancy OBI branch derived enzymatic amplification An algorithm that attempts to align two molecular sequences (nucleic acid or protein sequences). Ready for OBI-WS : v1.0 WS Annotations Group, UGA WS Annotations Group, UGA pairwise sequence alignment algorithm BLAST (Basic Local Alignment Search Tool), a pairwise sequence alignment algorithm is a heuristic search algorithm for comparing primary biological sequence information, such as the amino-acid or nucleotides sequences with a database of sequences, and identify sequences that resemble the query sequence above a certain threshold. Ready for OBI-WS : v1.0 ITPPR Wikipedia: BLAST BLAST algorithm A data format specification for molecular sequence feature information. Ready for OBI-WS : v1.0 ITPPR sequence data format EDAM sequence data format specification A sequence data that is about the primary structure of RNA. ITPPR ITPPR RNA sequence data A processual entity whose completion is hypothesized (by a healthcare provider) to alleviate the signs and symptoms associated with a disorder Albert Goldfain http://code.google.com/p/ogms/issues/detail?id=35 treatment The principle investigtor or responsible person for a study or a sample collection. study principle investigator or responsible A data item that indicates the time when the collection of a specimen occurred. Alice Nzinga Mathias Brochhausen date of specimen preparation date of specimen collection PERSON: Junhyong Kim PERSON: Stephen A. Fisher amplified date A date process started item that indicates when a specimen was run through the amplification protocol prior to library preparation and sequencing. date of specimen amplification PERSON: Junhyong Kim PERSON: Stephen A. Fisher library preparation date A date process started item that indicates when a specimen was run through the library preparation protocol prior to sequencing. date of specimen library preparation PERSON: Junhyong Kim PERSON: Stephen A. Fisher amplification protocol A protocol document that contains detailed instructions on how to perform an enzymatic amplification planned process. enzymatic amplification protocol Exists in MS (Mass spectrometry ontology) with parent "quantification object attribute" PERSON: Junhyong Kim PERSON: Stephen A. Fisher The name for identifying an experiment. experiment name PERSON: Junhyong Kim PERSON: Stephen A. Fisher https://www.illumina.com/products/by-type/sequencing-kits/cluster-gen-sequencing-reagents/hiseq-rapid-sbs-kit.html A processed material that is a sequencing kit used with Illumina HiSeq 2500 and 1500 sequencers allowing for single-end or paired-end sequencing. Illumina TruSeq Rapid SBS Kit v2 PERSON: Junhyong Kim PERSON: Stephen A. Fisher https://www.illumina.com/products/by-type/sequencing-kits/library-prep-kits/truseq-stranded-mrna.html A processed material that is used to create a sequencing library that retains mRNA strand orientation. Illumina TruSeq Stranded PERSON: Junhyong Kim PERSON: Stephen A. Fisher A protocol document that contains detailed instructions on how to perform a library preparation planned process. library preparation protocol PERSON: Junhyong Kim PERSON: Stephen A. Fisher http://scap-t.org/?q=content/sample-attributes-data-dictionary A data item that indicates the number of lanes in a flow cell that were used to sequence a specimen number of flow cell lanes PERSON: Junhyong Kim PERSON: Stephen A. Fisher http://scap-t.org/?q=content/sample-attributes-data-dictionary A data item that indicates the number of flow cells used to sequence a specimen. number of flow cells PERSON: Junhyong Kim PERSON: Stephen A. Fisher A software application that implements a pairwise sequence alignment algorithm to attempt to align two molecular sequences (nucleic acid or protein sequences). pairwise sequence alignment application PERSON: Junhyong Kim PERSON: Stephen A. Fisher An area within a material anatomical entity that is distinct from the other areas of that entity such as the parietal cortex, hippocampus CA1 and amygdala. region of material anatomical entity PERSON: Junhyong Kim PERSON: Stephen A. Fisher A molecular entity that is an engineered molecule which enables the collection of RNA from cells. RNA harvest compound PERSON: Junhyong Kim PERSON: Stephen A. Fisher A material entity that contained the specimen during harvesting such as a primary culture, tissue, or fixed section. specimen harvest source type PERSON: Junhyong Kim PERSON: Stephen A. Fisher A material entity that represents the number of number of material entities used during harvesting; for example, a single cell, pooled single cells, tissue, a culture dish, or a coverslip. specimen harvest source unit PERSON: Junhyong Kim PERSON: Stephen A. Fisher https://github.com/alexdobin/STAR A software application that implements a RNA-seq alignment algorithm that uses uncompressed suffix arrays and a mapping algorithm similar to those used in large-scale genome alignment tools to align RNA-seq reads to a genomic reference. star PERSON: Junhyong Kim PERSON: Stephen A. Fisher doi:10.1038/nmeth.2804 A molecular entity that is an engineered molecule which upon photoactivation enables mRNA capture from single cells in live tissue. TIVA tag PERSON: Junhyong Kim PERSON: Stephen A. Fisher http://kim.bio.upenn.edu/software/verse.shtml An alignment counting application that assigns gene features to genomic alignments using a hierarchical assignment scheme, which allows simultaneous quantification of multiple feature types or annotation levels without repeatedly assigning reads. verse A collection of sequences (often chromosomes) taken as the standard for a given organism and genome assembly. reference_genome A level of depth of a taxon in a taxonomic hierarchy. taxonomic_rank species Anatomical entity that has mass. material anatomical entity taxonomic rank A role played by a a biological sample in the context of an experiment where the intent is that biological or technical variation is measured. James Malone replicate FASTQ format is a text-based format for storing both a biological sequence (usually nucleotide sequence) and its corresponding quality scores. FASTQ format BAM is the compressed binary version of the Sequence Alignment/Map (SAM) format BAM format A genome is the full genetic content of an organism, contained in either DNA or RNA (such as for viruses). James Malone genome The transcriptome is the set of all RNA molecules, including mRNA, rRNA, tRNA, and other non-coding RNA produced in one or a population of cells. transcriptome CEL data file format describes the format used in a CEL file for storing the results of the intensity calculations on the pixel values of a DAT file. This includes an intensity value, standard deviation of the intensity, the number of pixels used to calculate the intensity value, a flag to indicate an outlier as calculated by the algorithm and a user defined flag indicating the feature should be excluded from future analysis. The file stores the previously stated data for each feature on the probe array. James Malone CEL data file format A BLAST algorithm that searches/aligns a nucleotide query sequence against a nucleotide database at a nucleotide level. BLASTn