ContextualAllele

A representation of one of the multiple variant sequences at a contiguous region in a particular ReferenceSequence.

Scope and Usage

Genetic variation is often represented with respect to an explicit reference sequence. At a locus where variation exists in the population, there is the allele that occurs in the reference and one or more alternate alleles. Each of these alleles (including the reference) is represented by a distinct instance of ContextualAllele.

ContextualAllele as defined here is similar to the SO term sequence_variant, but where that definition describes a difference with respect to a sequence, ContextualAllele explicitly allows the reference allele to be represented. Note that the reference allele is not guaranteed to be the major allele in any population.

A ContextualAllele should represent either a contiguous section of the ReferenceSequence, or the replacement of such a section with a new contiguous section (including an empty section for a deletion). While it would be possible to represent two widely spaced SNPs as a ContextualAllele that reproduces the ReferenceSequence between them, this is not the intended use; such cases should be handled through the use of a ComplexAllele.
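The idea of replacing one contiguous section with another can be sketched in a few lines of Python. This is an illustration only, not part of the data model: the function name `apply_allele`, the made-up reference string, and the choice of a 0-based, end-exclusive interval are all assumptions made for the example.

```python
def apply_allele(reference: str, start: int, end: int, allele: str) -> str:
    """Replace reference[start:end] with the allele string.

    Assumes 0-based, end-exclusive coordinates (an assumption of this
    sketch). An empty allele string models a deletion; start == end
    models a pure insertion.
    """
    if not 0 <= start <= end <= len(reference):
        raise ValueError("coordinates fall outside the reference sequence")
    return reference[:start] + allele + reference[end:]


reference = "ACGTACGT"  # made-up sequence, for illustration only

snv = apply_allele(reference, 3, 4, "G")          # substitution of one base
deletion = apply_allele(reference, 2, 5, "")      # empty section: deletion
insertion = apply_allele(reference, 4, 4, "TT")   # zero-length section: insertion
# The reference allele simply reproduces the section it covers.
reference_allele = apply_allele(reference, 3, 4, reference[3:4])
```

In this sketch every case, including the reference allele, is a single contiguous replacement, which is why two widely spaced SNPs are better modelled as a ComplexAllele than as one long replacement.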

This definition of ContextualAllele allows for large deletions or insertions; it is left as an implementation decision whether there is a size limit above which the implementing system will no longer use this representation.

Each ContextualAllele is the representation of an allele within a single sequence. For example, if a variant occurs within the coding region of a transcript, three different ContextualAlleles will be used to represent the same variant: one representing the allele in the context of the ReferenceSequence for the genome, a second representing the allele in the context of the ReferenceSequence for the transcript, and a third in the context of the ReferenceSequence for the amino acid chain.
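As a rough illustration of one variant described in three contexts, the sketch below uses the BRCA1 names listed in the Examples section. The class `ContextualAlleleSketch`, its field names, and the `sequence_kind` labels are hypothetical and are not defined by the model.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ContextualAlleleSketch:
    reference_sequence: str  # accession of the ReferenceSequence
    sequence_kind: str       # illustrative label, not a model code
    hgvs_name: str           # name of the allele in that context


# One variant, three ContextualAlleles, one per reference sequence.
# The genomic change reads A>C while the transcript change reads T>G
# because BRCA1 is transcribed from the reverse strand.
brca1_variant = [
    ContextualAlleleSketch("NC_000017.10", "genomic", "g.41203115A>C"),
    ContextualAlleleSketch("NM_007294.3", "transcript", "c.5297T>G"),
    ContextualAlleleSketch("NP_009225.1", "protein", "p.Ile1766Ser"),
]

# Each ContextualAllele lives on exactly one reference sequence.
by_sequence = {a.reference_sequence: a for a in brca1_variant}
kinds = {a.sequence_kind for a in brca1_variant}
```

In the model itself, such alleles would typically be tied together through a shared CanonicalAllele or through the related element.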

Resource Model

ContextualAllele Resource Diagram

Related Resources: ReferenceSequence, CanonicalAllele

Definitions & Bindings

ContextualAllele

Definition
Resource representing one of the multiple variant sequences at a contiguous region in a particular ReferenceSequence.
Control
1..1

ContextualAllele.identifier

Definition
The set of Identifiers used to name this ContextualAllele. May be as simple as a single Identifier distinguishing the ContextualAllele in the implementing system, or may expansively include the names used for this ContextualAllele in all implementing systems.
Type
Identifier
Control
0..*

ContextualAllele.canonicalAllele

Definition
The CanonicalAllele resource to which this ContextualAllele maps.
Type
CanonicalAllele
Control
0..1

ContextualAllele.contextualAlleleType

Definition
Type of contextual allele, as defined by the type of transcript the allele is mapped on.
Binding
ContextualAlleleType contextual-allele-types
Type
Code
Control
0..1

ContextualAllele.allele

Definition
The nucleotide or amino acid sequence that is defined for this allele at the reference coordinates. This is typically associated with the 'ALT' value in a VCF representation of a variant.
Control
0..1
Type
string

ContextualAllele.primaryNucleotideChangeType

Definition
Describes the change that occurs within a nucleotide contextual allele.
Control
0..1
Binding
PrimaryNucleotideChangeType primary-nucleotide-change-types
Type
Code
Requirements
Required (and allowed) only if the ContextualAllele maps to a nucleotide sequence.

ContextualAllele.ancillaryNucleotideChangeType

Definition
Additional descriptors that can be applied to describe the change effected by the allele.
Control
0..*
Binding
AncillaryNucleotideChangeType ancillary-nucleotide-change-types
Type
Code
Requirements
Optional, permitted only if the ContextualAllele maps to a nucleotide sequence.

ContextualAllele.primaryAminoAcidChangeType

Definition
Describes the change that occurs within an amino acid contextual allele.
Binding
PrimaryAminoAcidChangeType primary-amino-acid-change-types
Type
Code
Control
0..1
Requirements
Required (and allowed) only if the ContextualAllele describes an allele on an amino acid reference sequence.

ContextualAllele.ancillaryAminoAcidChangeType

Definition
One of the set of allowable ancillary-amino-acid-change-types. Allowed only if the ContextualAllele describes an allele on an amino acid reference sequence.
Type
Code
Control
0..1
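The conditional requirements on the change-type elements above could be checked along the following lines. This is a sketch under assumptions: the function name, the `sequence_kind` labels, and the example code values `"substitution"` and `"missense"` are hypothetical, and only the two primary change types are covered.

```python
from typing import List, Optional


def check_change_types(sequence_kind: str,
                       primary_nucleotide: Optional[str],
                       primary_amino_acid: Optional[str]) -> List[str]:
    """Return a list of problems; an empty list means the rules hold.

    sequence_kind is "nucleotide" or "amino_acid" (labels assumed here).
    """
    if sequence_kind not in ("nucleotide", "amino_acid"):
        raise ValueError(f"unknown sequence kind: {sequence_kind!r}")

    problems = []
    if sequence_kind == "nucleotide":
        # Required (and allowed) only on a nucleotide sequence.
        if primary_nucleotide is None:
            problems.append("primaryNucleotideChangeType is required")
        if primary_amino_acid is not None:
            problems.append("primaryAminoAcidChangeType is not allowed")
    else:
        # Required (and allowed) only on an amino acid sequence.
        if primary_amino_acid is None:
            problems.append("primaryAminoAcidChangeType is required")
        if primary_nucleotide is not None:
            problems.append("primaryNucleotideChangeType is not allowed")
    return problems


ok = check_change_types("nucleotide", "substitution", None)
bad = check_change_types("nucleotide", None, "missense")
```

The ancillary change types would follow the same pattern, except that they are optional where the primary types are required.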

ContextualAllele.referenceCoordinate

Definition
The genetic locus of the allele.
Control
1

ContextualAllele.referenceCoordinate.identifier

Definition
The set of Identifiers used to name this ReferenceCoordinate. May be as simple as a single Identifier distinguishing the ReferenceCoordinate in the implementing system, or may expansively include the names used for this ReferenceCoordinate in all implementing systems.
Type
Identifier
Control
1..*

ContextualAllele.referenceCoordinate.referenceSequence

Definition
The ReferenceSequence resource on which the ReferenceCoordinate (and, by extension, the ContextualAllele) is mapped.
Type
ReferenceSequence
Control
1

ContextualAllele.referenceCoordinate.refAllele

Definition
The string of nucleotides or amino acids at this reference coordinate on the related reference genome. This is typically associated with the 'REF' value in a VCF representation of a variant.
Type
string
Control
0..1

ContextualAllele.referenceCoordinate.primaryTranscriptRegionType

Definition
One of the set of allowable primary-transcript-region-types.
Type
Code
Control
0..1

ContextualAllele.referenceCoordinate.ancillaryTranscriptRegionType

Definition
One of the set of allowable ancillary-transcript-region-types.
Type
Code
Control
0..1

ContextualAllele.referenceCoordinate.start

Definition
The starting position of the reference coordinate. NOTE: The data model defines a 0-based coordinate numbering system as described above.
Type
Position
Control
1

ContextualAllele.referenceCoordinate.end

Definition
The end position of the reference coordinate. Note that the data model defines a 0-based coordinate system as described above.
Type
Position
Control
1
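Because VCF reports a 1-based POS while this model uses 0-based coordinates, a conversion step is usually needed when loading VCF records. The sketch below shows one way to do it. It assumes that the end coordinate is exclusive, so that end minus start equals the length of the reference allele; that convention and the function name are assumptions of this example, not statements of the model.

```python
from typing import Tuple


def vcf_to_reference_coordinate(pos: int, ref: str) -> Tuple[int, int]:
    """Convert a VCF POS/REF pair to 0-based (start, end) values.

    VCF POS is 1-based and points at the first base of REF.
    The end value is treated as exclusive (an assumption of this sketch).
    """
    if pos < 1:
        raise ValueError("VCF POS must be 1 or greater")
    start = pos - 1          # shift from 1-based to 0-based
    end = start + len(ref)   # one past the last reference base
    return start, end


# Single-base reference allele, using the position from the
# NC_000017.10 g.41203115A>C entry in the Examples section.
snv_start, snv_end = vcf_to_reference_coordinate(41203115, "A")

# A three-base reference allele at a made-up position.
del_start, del_end = vcf_to_reference_coordinate(100, "ACG")
```

If an implementation instead treats end as inclusive, the end values above would each be one smaller.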

Position

Definition
Information needed to locate one end of a ReferenceCoordinate.
Control
1..1

Position.index

Definition
A zero-based position within the ReferenceCoordinate.
Type
integer
Control
1..1

Position.externalOffsetLength

Definition
The zero-based position of an intronic location as counted from the index attribute.
Type
integer
Control
0..1

Position.externalOffsetDirection

Definition
The direction of the ReferenceCoordinate relative to the transcript coordinate.
Type
Code
Control
0..1

Position.externalGenomicReferenceSequence

Definition
For an intronic position in a ReferenceCoordinate defined on a transcript, the externalGenomicReferenceSequence is a genomic sequence to which the transcript is mapped.
Type
GenomicReferenceSequence
Control
0..1

Position.externalGenomicPositionIndex

Definition
For an intronic position in a ReferenceCoordinate defined on a transcript, the corresponding position on the defined externalGenomicReferenceSequence.
Type
integer
Control
0..1
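The Position elements above can be pictured as a small record with one required index and four optional fields that apply only to intronic positions. The sketch below is illustrative: the class name, the `"+"` direction code, and the rule that offset length and direction are supplied together are assumptions, not requirements stated by the model.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class PositionSketch:
    index: int                                        # zero-based, required
    external_offset_length: Optional[int] = None      # intronic offset
    external_offset_direction: Optional[str] = None   # e.g. "+" (assumed code)
    external_genomic_reference_sequence: Optional[str] = None
    external_genomic_position_index: Optional[int] = None

    def __post_init__(self) -> None:
        if self.index < 0:
            raise ValueError("index is zero-based and cannot be negative")
        has_length = self.external_offset_length is not None
        has_direction = self.external_offset_direction is not None
        # Assumed rule: an offset needs both a length and a direction.
        if has_length != has_direction:
            raise ValueError("offset length and direction go together")

    @property
    def is_intronic(self) -> bool:
        return self.external_offset_length is not None


exonic = PositionSketch(index=100)
intronic = PositionSketch(index=100,
                          external_offset_length=14,
                          external_offset_direction="+")
```

The index and offset values here are arbitrary; mapping an HGVS expression such as c.448+14 onto them depends on transcript details not covered in this section.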

ContextualAllele.alleleName

Definition
The name given to the allele by a naming organization (e.g., an HGVS expression).
Control
0..*

ContextualAllele.alleleName.nameType

Definition
One of the set of allowable contextual-allele-name-types.
Type
Code
Control
1

ContextualAllele.alleleName.legacy

Definition
Flag set if the name is deprecated or no longer used in the relevant allele naming system. Default is false.
Type
boolean
Control
0..1

ContextualAllele.alleleName.preferred

Definition
Flag set if the name is the preferred name for this allele in the implementing system.
Type
boolean
Control
0..1
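One way a system might use the legacy and preferred flags is to pick a single name for display. The sketch below is an assumption-laden example: the class and function names, the `"hgvs"` name type code, the default of false for preferred, and the flag values given to the sample names are all made up for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class AlleleNameSketch:
    name: str
    name_type: str           # hypothetical code value
    legacy: bool = False     # default is false per the definition
    preferred: bool = False  # default assumed for this sketch


def display_name(names: List[AlleleNameSketch]) -> Optional[str]:
    """Prefer a non-legacy name flagged as preferred, else any non-legacy name."""
    current = [n for n in names if not n.legacy]
    for candidate in current:
        if candidate.preferred:
            return candidate.name
    return current[0].name if current else None


# Flag values below are invented for the example.
names = [
    AlleleNameSketch("U14680.1:n.5416T>G", "hgvs", legacy=True),
    AlleleNameSketch("NR_027676.1:n.5433T>G", "hgvs"),
    AlleleNameSketch("NM_007294.3:c.5297T>G", "hgvs", preferred=True),
]

chosen = display_name(names)
```

Legacy names are skipped here even if they are flagged as preferred; an implementation could reasonably choose a different policy.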

ContextualAllele.related

Definition
Other, related contextual alleles, for example, the ContextualAllele that represents the amino acid change caused by a nucleotide contextual allele, and vice-versa.
Control
0..*

ContextualAllele.related.relatedType

Definition
One of the allowable set of relationship types in contextual-allele-relationship-type.
Type
Code
Control
1

ContextualAllele.related.target

Definition
The ContextualAllele described in the relationship.
Type
ContextualAllele
Control
1

Schema

schema: ContextualAllele.xsd

Examples

id | name | formats
AI301 | AI301-NM_001014794.2-ILK-c.448+14A>G-b37 | json-ld, xml, json
AI302 | AI302-NM_001014794.2-ILK-c.448+13A>G-b38 | json-ld, xml, json
AI303 | AI303-NC_000011.9-g.6630034A>G-b37 | xml, json
AI304 | AI304-NC_000011.10-g.6608803A>G-b38 | xml, json
AI310 | AI310-NM_007294.3-BRCA1-c.5297T>G | xml, json
AI311 | AI311-U14680.1-BRCA1-n.5416T>G | xml, json
AI312 | AI312-NP_009225.1-BRCA1-p.Ile1766Ser | xml, json
AI313 | AI313-NR_027676.1-BRCA1-n.5433T>G | xml, json
AI314 | AI314-NC_000017.10-g.41203115A>C-b37 | xml, json
AI315 | AI315-NC_000017.11-g.43051098A>C-b38 | xml, json
AI316 | AI316-NG_005905.2-BRCA1-g.166886T>G | xml, json
AI317 | AI317-LRG_292-BRCA1-g.166886T>G | xml, json
AI318 | AI318-LRG_292t1-BRCA1-c.5297T>G | xml, json
AI319 | AI319-LRG_292p1-BRCA1-p.Ile1766Ser | xml, json