Schema for Ensembl Genes - Ensembl Genes
Database: anoCar2 Primary Table: ensGene Row Count: 38,557   Data last updated: 2021-05-26
Format description: Ensembl gene predictions. On download server: MariaDB table dump directory
field | example | SQL type | info | description |
---|---|---|---|---|
bin | 73 | smallint(5) unsigned | range | Indexing field to speed chromosome range queries. |
name | ENSACAT00000009589.4 | varchar(255) | values | Ensembl transcript ID |
chrom | chr1 | varchar(255) | values | Reference sequence chromosome or scaffold |
strand | - | char(1) | values | + or - for strand |
txStart | 79276 | int(10) unsigned | range | Transcription start position (or end position for minus strand item) |
txEnd | 182777 | int(10) unsigned | range | Transcription end position (or start position for minus strand item) |
cdsStart | 79437 | int(10) unsigned | range | Coding region start (or end position for minus strand item) |
cdsEnd | 182777 | int(10) unsigned | range | Coding region end (or start position for minus strand item) |
exonCount | 26 | int(10) unsigned | range | Number of exons |
exonStarts | 79276,80767,82885,83755,850... | longblob | | Exon start positions (or end positions for minus strand item) |
exonEnds | 79910,80924,83017,84004,851... | longblob | | Exon end positions (or start positions for minus strand item) |
score | 0 | int(11) | range | always 0 for Ensembl genes |
name2 | ENSACAG00000009394.4 | varchar(255) | values | Ensembl gene ID |
cdsStartStat | cmpl | enum('none', 'unk', 'incmpl', 'cmpl') | values | Status of CDS start annotation (none, unknown, incomplete, or complete) |
cdsEndStat | cmpl | enum('none', 'unk', 'incmpl', 'cmpl') | values | Status of CDS end annotation (none, unknown, incomplete, or complete) |
exonFrames | 1,0,0,0,1,1,2,1,1,1,1,1,1,0... | longblob | | Exon frame {0,1,2}, or -1 if no frame for exon |
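As an illustration of the schema above, the following is a minimal sketch of parsing one row into typed Python values. It assumes the row comes from a tab-separated table dump (such as ensGene.txt.gz) with the columns in the order listed above; the names `FIELDS`, `parse_int_list`, and `parse_ens_gene_row` are hypothetical helpers, not part of any UCSC tool.

```python
from typing import Dict, List

# Column order as listed in the schema table above.
FIELDS = [
    "bin", "name", "chrom", "strand", "txStart", "txEnd", "cdsStart",
    "cdsEnd", "exonCount", "exonStarts", "exonEnds", "score", "name2",
    "cdsStartStat", "cdsEndStat", "exonFrames",
]
INT_FIELDS = {"bin", "txStart", "txEnd", "cdsStart", "cdsEnd", "exonCount", "score"}
LIST_FIELDS = {"exonStarts", "exonEnds", "exonFrames"}


def parse_int_list(text: str) -> List[int]:
    """Parse a comma-separated list that may carry a trailing comma."""
    return [int(item) for item in text.split(",") if item]


def parse_ens_gene_row(line: str) -> Dict[str, object]:
    """Split one tab-separated row and convert numeric and list columns."""
    values = line.rstrip("\n").split("\t")
    if len(values) != len(FIELDS):
        raise ValueError(f"expected {len(FIELDS)} columns, got {len(values)}")
    row: Dict[str, object] = {}
    for field, value in zip(FIELDS, values):
        if field in INT_FIELDS:
            row[field] = int(value)
        elif field in LIST_FIELDS:
            row[field] = parse_int_list(value)
        else:
            row[field] = value
    # Every per-exon list should have exactly exonCount entries.
    for field in LIST_FIELDS:
        if len(row[field]) != row["exonCount"]:
            raise ValueError(f"{field} length does not match exonCount")
    return row


# One of the sample rows from this page, rebuilt as a tab-separated line.
sample_line = "\t".join([
    "587", "ENSACAT00000009367.4", "chr1", "-", "264452", "286789",
    "264452", "286789", "4", "264452,270582,274191,286772,",
    "264693,270820,274256,286789,", "0", "ENSACAG00000009373.4",
    "cmpl", "cmpl", "2,1,2,0,",
])
sample_row = parse_ens_gene_row(sample_line)
exon_lengths = [
    end - start
    for start, end in zip(sample_row["exonStarts"], sample_row["exonEnds"])
]
print(exon_lengths)  # [241, 238, 65, 17]
```

The exon lengths are plain differences because starts are 0-based and ends are exclusive.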
Connected Tables and Joining Fields
Sample Rows
bin | name | chrom | strand | txStart | txEnd | cdsStart | cdsEnd | exonCount | exonStarts | exonEnds | score | name2 | cdsStartStat | cdsEndStat | exonFrames |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
73 | ENSACAT00000009589.4 | chr1 | - | 79276 | 182777 | 79437 | 182777 | 26 | 79276,80767,82885,83755,85003,85290,86736,89468,92811,94630,95135,98845,100314,100980,104570,106809,107316,108645,110155,111567, ... | 79910,80924,83017,84004,85119,85404,86822,89496,92928,94744,95249,98959,100479,101131,104744,106856,107430,108759,110269,111687, ... | 0 | ENSACAG00000009394.4 | cmpl | cmpl | 1,0,0,0,1,1,2,1,1,1,1,1,1,0,0,1,1,1,1,1,2,1,1,0,0,0, |
73 | ENSACAT00000043269.1 | chr1 | - | 79276 | 182777 | 79437 | 182777 | 25 | 79276,80767,82885,83755,85003,85290,86736,89468,92811,94630,95135,98845,100314,100980,104570,106809,108645,110155,111567,112070, ... | 79910,80924,83017,84004,85119,85404,86822,89496,92928,94744,95249,98959,100479,101131,104744,106856,108759,110269,111687,112201, ... | 0 | ENSACAG00000009394.4 | cmpl | cmpl | 1,0,0,0,1,1,2,1,1,1,1,1,1,0,0,1,1,1,1,2,1,1,0,0,0, |
586 | ENSACAT00000055868.1 | chr1 | - | 192024 | 193476 | 192024 | 193476 | 1 | 192024, | 193476, | 0 | ENSACAG00000036384.1 | cmpl | cmpl | 0, |
586 | ENSACAT00000023724.2 | chr1 | + | 257461 | 258814 | 257461 | 258814 | 1 | 257461, | 258814, | 0 | ENSACAG00000025793.2 | cmpl | cmpl | 0, |
587 | ENSACAT00000009367.4 | chr1 | - | 264452 | 286789 | 264452 | 286789 | 4 | 264452,270582,274191,286772, | 264693,270820,274256,286789, | 0 | ENSACAG00000009373.4 | cmpl | cmpl | 2,1,2,0, |
589 | ENSACAT00000041856.1 | chr1 | - | 548160 | 549495 | 548160 | 549495 | 1 | 548160, | 549495, | 0 | ENSACAG00000035586.1 | cmpl | cmpl | 0, |
589 | ENSACAT00000046245.1 | chr1 | - | 548160 | 549495 | 549495 | 549495 | 1 | 548160, | 549495, | 0 | ENSACAG00000044607.1 | none | none | -1, |
589 | ENSACAT00000057638.1 | chr1 | - | 550714 | 552442 | 550714 | 552442 | 1 | 550714, | 552442, | 0 | ENSACAG00000041053.1 | cmpl | cmpl | 0, |
590 | ENSACAT00000055875.1 | chr1 | - | 692251 | 693729 | 692251 | 693729 | 4 | 692251,692610,693619,693710, | 692526,692918,693665,693729, | 0 | ENSACAG00000041825.1 | cmpl | cmpl | 1,2,1,0, |
591 | ENSACAT00000037286.1 | chr1 | + | 826170 | 829320 | 826170 | 829320 | 1 | 826170, | 829320, | 0 | ENSACAG00000039322.1 | cmpl | cmpl | 0, |
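The bin values in the sample rows can be reproduced from txStart and txEnd. The sketch below assumes the standard UCSC binning scheme (five levels, with the finest bins covering 128 kb); the constants and the function name `bin_from_range` are assumptions made for illustration, and the extended scheme used for coordinates beyond 512 Mb is not covered.

```python
# Assumed constants of the standard binning scheme: offsets of each level,
# from the finest (128 kb bins) to the coarsest (one 512 Mb bin).
BIN_OFFSETS = (512 + 64 + 8 + 1, 64 + 8 + 1, 8 + 1, 1, 0)
BIN_FIRST_SHIFT = 17  # 2**17 = 128 kb, the size of the finest bins
BIN_NEXT_SHIFT = 3    # each coarser level is 8 times larger


def bin_from_range(start: int, end: int) -> int:
    """Return the smallest bin that fully contains [start, end).

    start is 0-based and end is exclusive, as in the table columns.
    """
    start_bin = start >> BIN_FIRST_SHIFT
    end_bin = (end - 1) >> BIN_FIRST_SHIFT
    for offset in BIN_OFFSETS:
        if start_bin == end_bin:
            return offset + start_bin
        start_bin >>= BIN_NEXT_SHIFT
        end_bin >>= BIN_NEXT_SHIFT
    raise ValueError("range does not fit the standard binning scheme")


# txStart and txEnd taken from the sample rows above.
print(bin_from_range(79276, 182777))   # 73
print(bin_from_range(192024, 193476))  # 586
print(bin_from_range(826170, 829320))  # 591
```

A range query can then restrict the search to the bins that may overlap the region of interest before comparing txStart and txEnd, which is what makes the bin column useful as an index.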
Note: all start coordinates in our database are 0-based, not 1-based. See explanation here.
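A short sketch of what the 0-based convention means in practice, assuming the usual pairing of a 0-based start with an exclusive end (half-open intervals). The function names `to_one_based_position` and `interval_length` are hypothetical.

```python
def to_one_based_position(chrom: str, start: int, end: int) -> str:
    """Format a 0-based, half-open interval as a 1-based, closed position.

    Only the start shifts by one; the exclusive end already equals the
    1-based coordinate of the last base.
    """
    return f"{chrom}:{start + 1}-{end}"


def interval_length(start: int, end: int) -> int:
    """Length in bases of a 0-based, half-open interval."""
    return end - start


# txStart and txEnd of the first sample row.
print(to_one_based_position("chr1", 79276, 182777))  # chr1:79277-182777
print(interval_length(79276, 182777))                # 103501
```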
Ensembl Genes (ensGene) Track Description
Description
These gene predictions were generated by Ensembl.
For more information on the different gene tracks, see our Genes FAQ.
Methods
For a description of the methods used in Ensembl gene predictions, please refer to
Hubbard et al. (2002), also listed in the References section below.
Data access
Ensembl Gene data can be explored interactively using the
Table Browser or the
Data Integrator.
For local downloads, the genePred format files for anoCar2 are available in our
downloads directory as ensGene.txt.gz or in our
genes download directory in GTF format.
For programmatic access, the data can be queried from the
REST API or
directly from our public MySQL
servers. Instructions on this method are available on our
MySQL help page and on
our blog.
Previous versions of this track can be found on our archive download server.
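For the REST API route mentioned above, a request might be assembled as in the sketch below. It assumes the `getData/track` endpoint at api.genome.ucsc.edu with `genome`, `track`, `chrom`, `start`, and `end` parameters; check the REST API help page for the current parameter list. The function names `build_track_url` and `fetch_track` are hypothetical, and no particular layout of the returned JSON is assumed.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumed base URL of the track data endpoint.
API_BASE = "https://api.genome.ucsc.edu/getData/track"


def build_track_url(genome: str, track: str, chrom: str, start: int, end: int) -> str:
    """Build a request URL for one region; start is 0-based, end is exclusive."""
    params = {
        "genome": genome,
        "track": track,
        "chrom": chrom,
        "start": start,
        "end": end,
    }
    return f"{API_BASE}?{urlencode(params)}"


def fetch_track(url: str, timeout: float = 30.0) -> dict:
    """Fetch the URL and decode the JSON response (requires network access)."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


url = build_track_url("anoCar2", "ensGene", "chr1", 79276, 182777)
print(url)
# To run the query: data = fetch_track(url)
```

Building the URL separately from fetching it keeps the request easy to inspect and to paste into a browser before any data is downloaded.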
Credits
We would like to thank Ensembl for providing these gene annotations. For more information, please see
Ensembl's genome annotation page.
References
Hubbard T, Barker D, Birney E, Cameron G, Chen Y, Clark L, Cox T, Cuff J,
Curwen V, Down T et al.
The Ensembl genome database project.
Nucleic Acids Res. 2002 Jan 1;30(1):38-41.
PMID: 11752248; PMC: PMC99161