Schema for Old UCSC Genes - Previous Version of UCSC Genes
  Database: hg38    Primary Table: knownGeneOld12    Row Count: 247,541   Data last updated: 2021-02-03
Format description: Transcript from default gene set in UCSC browser
On download server: MariaDB table dump directory
fieldexampleSQL type info description
name ENST00000619216.1varchar(255) values Name of gene
chrom chr1varchar(255) values Reference sequence chromosome or scaffold
strand -char(1) values + or - for strand
txStart 17368int(10) unsigned range Transcription start position (or end position for minus strand item)
txEnd 17436int(10) unsigned range Transcription end position (or start position for minus strand item)
cdsStart 17368int(10) unsigned range Coding region start (or end position if for minus strand item)
cdsEnd 17368int(10) unsigned range Coding region end (or start position if for minus strand item)
exonCount 1int(10) unsigned range Number of exons
exonStarts 17368,longblob   Exon start positions (or end positions for minus strand item)
exonEnds 17436,longblob   Exon end positions (or start positions for minus strand item)
proteinID  varchar(40) values UniProt display ID, UniProt accession, or RefSeq protein ID
alignID uc031tla.1varchar(255) values Unique identifier (GENCODE transcript ID for GENCODE Basic)

Sample Rows

Note: all start coordinates in our database are 0-based, not 1-based. See explanation here.

Old UCSC Genes (knownGeneOld12) Track Description


The Old UCSC Genes track shows genes from the previous version of the UCSC Genes build, which was built with GENCODE v32 models. See the description page for more information on how the new GENCODE v36 track was built.

The new release has 232,184 total transcripts, compared with 247,541 in the previous version. The total number of canonical genes has decreased from 66,622 to 60,675 . Comparing the new gene set with the previous version:

  • 221,698 transcripts did not change.
  • 20,017 transcripts were not carried forward to the new version.
  • 4,860 transcripts are "compatible" with those in the previous set, meaning that the two transcripts show consistent splicing. In most cases, the old and new transcripts differ in the lengths of their UTRs.
  • 966 transcripts overlap with those in the previous set, but do not show consistent splicing (i.e., they contain overlapping introns with differing splice sites)