Schema for Ensembl Genes

| field | example | SQL type | info | description |
|---|---|---|---|---|
| bin | 585 | smallint(5) unsigned | range | Indexing field to speed chromosome range queries. |
| name | ENSTRUT00000048539.1 | varchar(255) | values | Name of gene (usually transcript_id from GTF) |
| chrom | chrM | varchar(255) | values | Reference sequence chromosome or scaffold |
| strand | | char(1) | values | + or - for strand |
| txStart | | int(10) unsigned | range | Transcription start position (or end position for minus strand item) |
| txEnd | | int(10) unsigned | range | Transcription end position (or start position for minus strand item) |
| cdsStart | | int(10) unsigned | range | Coding region start (or end position for minus strand item) |
| cdsEnd | | int(10) unsigned | range | Coding region end (or start position for minus strand item) |
| exonCount | | int(10) unsigned | range | Number of exons |
| exonStarts | | longblob | | Exon start positions (or end positions for minus strand item) |
| exonEnds | 68, | longblob | | Exon end positions (or start positions for minus strand item) |
| score | | int(11) | range | score |
| name2 | ENSTRUG00000019221.1 | varchar(255) | values | Alternate name (e.g. gene_id from GTF) |
| cdsStartStat | none | enum('none', 'unk', 'incmpl', 'cmpl') | values | Status of CDS start annotation (none, unknown, incomplete, or complete) |
| cdsEndStat | none | enum('none', 'unk', 'incmpl', 'cmpl') | values | Status of CDS end annotation (none, unknown, incomplete, or complete) |
| exonFrames | -1, | longblob | | Reading frame of the start of the CDS region of the exon, in the direction of transcription (0,1,2), or -1 if there is no CDS region. |
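
The bin field can be recomputed from a feature's start and end coordinates. The following is a minimal sketch, assuming the standard UCSC Genome Browser binning scheme (five levels, a smallest bin of 128 kb, each level eight times larger than the one below it) and 0-based, half-open coordinates. The function name `bin_from_range` is illustrative and is not part of the schema.

```python
# Offsets of the first bin at each level, from the smallest (128 kb) bins
# up to the single bin that covers 512 Mb. Assumes the standard scheme.
BIN_OFFSETS = [512 + 64 + 8 + 1, 64 + 8 + 1, 8 + 1, 1, 0]
BIN_FIRST_SHIFT = 17  # 2**17 = 128 kb, the size of the smallest bin
BIN_NEXT_SHIFT = 3    # each level up is 2**3 = 8 times larger


def bin_from_range(start, end):
    """Return the smallest bin that fully contains [start, end)."""
    start_bin = start >> BIN_FIRST_SHIFT
    end_bin = (end - 1) >> BIN_FIRST_SHIFT
    for offset in BIN_OFFSETS:
        if start_bin == end_bin:
            return offset + start_bin
        # The range straddles a bin boundary at this level; go one level up.
        start_bin >>= BIN_NEXT_SHIFT
        end_bin >>= BIN_NEXT_SHIFT
    raise ValueError(f"range {start}-{end} is outside the 512 Mb scheme")


print(bin_from_range(2828, 3803))  # 585
```

Under these assumptions, any feature that lies entirely within the first 128 kb of a sequence falls in bin 585, which matches the example value in the table.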

Connected Tables and Joining Fields

fr2.ensGtp.transcript (via ensGene.name)
fr2.ensPep.name (via ensGene.name)
fr2.ensemblSource.name (via ensGene.name)
fr2.ensemblToGeneName.name (via ensGene.name)
knownGeneV39.knownToEnsembl.value (via ensGene.name)
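
Each of the tables listed above joins to ensGene through the ensGene.name field. As a rough illustration of such a join, the sketch below uses an in-memory SQLite database as a stand-in for the real server. The `value` column on ensemblToGeneName and the symbol `PLACEHOLDER_SYMBOL` are assumptions made for the example and are not taken from this page. Only a subset of the ensGene columns is created.

```python
import sqlite3

# In-memory stand-in for the fr2 database; not the real server.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE ensGene (
        name TEXT, chrom TEXT, txStart INTEGER, txEnd INTEGER, name2 TEXT
    );
    -- Assumed layout: 'name' is the joining field, 'value' is hypothetical.
    CREATE TABLE ensemblToGeneName (name TEXT, value TEXT);
    """
)
conn.execute(
    "INSERT INTO ensGene VALUES (?, ?, ?, ?, ?)",
    ("ENSTRUT00000047991.1", "chrM", 2828, 3803, "ENSTRUG00000018673.1"),
)
# Placeholder symbol, invented for illustration only.
conn.execute(
    "INSERT INTO ensemblToGeneName VALUES (?, ?)",
    ("ENSTRUT00000047991.1", "PLACEHOLDER_SYMBOL"),
)

# Join the two tables on the transcript identifier held in 'name'.
rows = conn.execute(
    """
    SELECT g.name, g.chrom, g.txStart, g.txEnd, n.value
    FROM ensGene AS g
    JOIN ensemblToGeneName AS n ON n.name = g.name
    WHERE g.chrom = ?
    """,
    ("chrM",),
).fetchall()

for row in rows:
    print(row)
```

The same SELECT statement should carry over to other SQL engines with little change, provided the real table has the columns assumed here.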

Sample Rows

| bin | name | chrom | strand | txStart | txEnd | cdsStart | cdsEnd | exonCount | exonStarts | exonEnds | score | name2 | cdsStartStat | cdsEndStat | exonFrames |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 585 | ENSTRUT00000048539.1 | chrM | | | | | | | | 68, | | ENSTRUG00000019221.1 | none | | -1, |
| 585 | ENSTRUT00000048540.1 | chrM | | | 1015 | | | | 68, | 1015, | | ENSTRUG00000019222.1 | none | | -1, |
| 585 | ENSTRUT00000048541.1 | chrM | | 1015 | 1089 | | | | 1015, | 1089, | | ENSTRUG00000019223.1 | none | | -1, |
| 585 | ENSTRUT00000048542.1 | chrM | | 1089 | 2755 | | | | 1089, | 2755, | | ENSTRUG00000019224.1 | none | | -1, |
| 585 | ENSTRUT00000048543.1 | chrM | | 2755 | 2828 | | | | 2755, | 2828, | | ENSTRUG00000019225.1 | none | | -1, |
| 585 | ENSTRUT00000047991.1 | chrM | | 2828 | 3803 | 2828 | 3803 | | 2828, | 3803, | | ENSTRUG00000018673.1 | cmpl | | |
| 585 | ENSTRUT00000048544.1 | chrM | | 3805 | 3875 | | | | 3805, | 3875, | | ENSTRUG00000019226.1 | none | | -1, |
| 585 | ENSTRUT00000048545.1 | chrM | | 3874 | 3945 | | | | 3874, | 3945, | | ENSTRUG00000019227.1 | none | | -1, |
| 585 | ENSTRUT00000048546.1 | chrM | | 3944 | 4013 | | | | 3944, | 4013, | | ENSTRUG00000019228.1 | none | | -1, |
| 585 | ENSTRUT00000047992.1 | chrM | | 4013 | 5060 | 4013 | 5060 | | 4013, | 5060, | | ENSTRUG00000018674.1 | cmpl | | |
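
The exonStarts and exonEnds fields in the sample rows above are stored as comma-terminated lists (for example `2828,`). The sketch below shows one way to turn them into exon intervals and to count the coding bases, assuming 0-based, half-open coordinates so that a length is simply end minus start. The helper names `parse_blob`, `exon_intervals`, and `cds_length` are illustrative and are not part of any UCSC library.

```python
def parse_blob(blob):
    """Split a comma-terminated list such as '2828,' into integers."""
    if isinstance(blob, bytes):  # longblob columns may arrive as bytes
        blob = blob.decode("ascii")
    return [int(item) for item in blob.split(",") if item]


def exon_intervals(exon_starts, exon_ends):
    """Pair up exon starts and ends into (start, end) tuples."""
    starts = parse_blob(exon_starts)
    ends = parse_blob(exon_ends)
    if len(starts) != len(ends):
        raise ValueError("exonStarts and exonEnds differ in length")
    return list(zip(starts, ends))


def cds_length(exons, cds_start, cds_end):
    """Count the bases of each exon that fall inside [cds_start, cds_end)."""
    total = 0
    for start, end in exons:
        lo = max(start, cds_start)
        hi = min(end, cds_end)
        if hi > lo:
            total += hi - lo
    return total


# Values taken from the ENSTRUT00000047991.1 sample row.
exons = exon_intervals("2828,", "3803,")
coding_bases = cds_length(exons, 2828, 3803)
print(exons, coding_bases)  # [(2828, 3803)] 975
```

When cdsStart equals cdsEnd, `cds_length` returns 0, which corresponds to the exonFrames value of -1 for an exon with no CDS region.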

Description

These gene predictions were generated using the Ensembl annotation system.

Methods

For a description of the methods used in Ensembl gene prediction, refer to Hubbard et al. (2002) in the References section below.

Credits

The Fugu genome was annotated using the Ensembl system by the Fugu informatics group at the Institute of Molecular and Cell Biology (IMCB) in Singapore. Thanks to IMCB's Shawn Hoon for providing these annotations.

References

Hubbard T, Barker D, Birney E, Cameron G, Chen Y, Clark L, Cox T, Cuff J, Curwen V, Down T et al. The Ensembl genome database project. Nucleic Acids Res. 2002 Jan 1;30(1):38-41. PMID: 11752248; PMC: PMC99161