Schema for MGC Genes - Mammalian Gene Collection Full ORF mRNAs
  Database: hg19    Primary Table: mgcFullMrna    Row Count: 32,576   Data last updated: 2020-01-27
Format description: Summary info about a patSpace alignment
On download server: MariaDB table dump directory
fieldexampleSQL type info description
bin 587smallint(5) unsigned range Indexing field to speed chromosome range queries.
matches 995int(10) unsigned range Number of bases that match that aren't repeats
misMatches 0int(10) unsigned range Number of bases that don't match
repMatches 0int(10) unsigned range Number of bases that match but are part of repeats
nCount 0int(10) unsigned range Number of 'N' bases
qNumInsert 0int(10) unsigned range Number of inserts in query
qBaseInsert 0int(10) unsigned range Number of bases inserted in query
tNumInsert 0int(10) unsigned range Number of inserts in target
tBaseInsert 0int(10) unsigned range Number of bases inserted in target
strand +char(2) values + or - for strand. First character query, second target (optional)
qName BC137547varchar(255) values Query sequence name
qSize 995int(10) unsigned range Query sequence size
qStart 0int(10) unsigned range Alignment start position in query
qEnd 995int(10) unsigned range Alignment end position in query
tName chr1varchar(255) values Target sequence name
tSize 249250621int(10) unsigned range Target sequence size
tStart 367639int(10) unsigned range Alignment start position in target
tEnd 368634int(10) unsigned range Alignment end position in target
blockCount 1int(10) unsigned range Number of blocks in alignment
blockSizes 995,longblob   Size of each block
qStarts 0,longblob   Start of each block in query.
tStarts 367639,longblob   Start of each block in target.

Connected Tables and Joining Fields
        hg19.all_mrna.qName (via mgcFullMrna.qName)
      hg19.mgcGenes.name (via mgcFullMrna.qName)
      hg19.mrnaOrientInfo.name (via mgcFullMrna.qName)
      hgFixed.gbCdnaInfo.acc (via mgcFullMrna.qName)
      hgFixed.gbSeq.acc (via mgcFullMrna.qName)
      hgFixed.imageClone.acc (via mgcFullMrna.qName)

Sample Rows
 
binmatchesmisMatchesrepMatchesnCountqNumInsertqBaseInserttNumInserttBaseInsertstrandqNameqSizeqStartqEndtNametSizetStarttEndblockCountblockSizesqStartstStarts
5879950000000+BC1375479950995chr12492506213676393686341995,0,367639,
5879941000000+BC1375689950995chr12492506213676393686341995,0,367639,
5899950000000-BC1375479950995chr12492506216210586220531995,0,621058,
5899941000000-BC1375689950995chr12492506216210586220531995,0,621058,
591211630000106690+BC024295214332122chr12492506218711458799541186,44,90,138,163,116,79,500,125,111,667,3,89,133,223,361,524,640,719,1219,1344,1455,871145,871232,874419,874654,876523,877515,877789,877938,878632,879077,879287,
59119163000073353+BC033213194001919chr12492506218746898799618151,163,116,79,500,125,111,674,0,151,314,430,509,1009,1134,1245,874689,876523,877515,877789,877938,878632,879077,879287,
5912754600001812297-BC003555280102760chr124925062187959589465219585,90,136,114,144,102,114,112,140,189,114,111,79,91,121,132,175,153,58,41,626,716,852,966,1110,1212,1326,1438,1578,1767,1881,1992,2071,2162,2283,2415,2590,2743,879595,880436,880897,881552,881781,883510,883869,886506,887379,887791,888554,889161,889383,891302,891474,892273,892478,894308,89 ...
591192700000112569+BC1666181990341961chr124925062189607390056912107,260,122,222,117,214,145,168,89,74,182,227,34,141,401,523,745,862,1076,1221,1389,1478,1552,1734,896073,896672,897008,897205,897734,898083,898488,898716,899299,899486,899728,900342,
591219220000146313+BC101386219402194chr124925062190188191038815113,100,147,81,73,128,132,81,76,137,150,141,219,49,567,0,113,213,360,441,514,642,774,855,931,1068,1218,1359,1578,1627,901881,902083,905656,905900,906065,906258,906456,906703,907454,907667,908240,908879,909212,909695,909821,
591245230000146052+BC101387245502455chr124925062190188191038815113,100,147,81,321,132,81,76,137,150,141,141,219,49,567,0,113,213,360,441,762,894,975,1051,1188,1338,1479,1620,1839,1888,901881,902083,905656,905900,906065,906456,906703,907454,907667,908240,908565,908879,909212,909695,909821,

Note: all start coordinates in our database are 0-based, not 1-based. See explanation here.

MGC Genes (mgcFullMrna) Track Description
 

Description

This track shows alignments of human mRNAs from the Mammalian Gene Collection (MGC) having full-length open reading frames (ORFs) to the genome. The goal of the Mammalian Gene Collection is to provide researchers with unrestricted access to sequence-validated full-length protein-coding cDNA clones for human, mouse, and rat genes.

Display Conventions and Configuration

The track follows the display conventions for gene prediction tracks.

An optional codon coloring feature is available for quick validation and comparison of gene predictions. To display codon colors, select the genomic codons option from the Color track by codons pull-down menu. For more information about this feature, go to the Coloring Gene Predictions and Annotations by Codon page.

Methods

GenBank human MGC mRNAs identified as having full-length ORFs were aligned against the genome using blat. When a single mRNA aligned in multiple places, the alignment having the highest base identity was found. Only alignments having a base identity level within 1% of the best and at least 95% base identity with the genomic sequence were kept.

Credits

The human MGC full-length mRNA track was produced at UCSC from mRNA sequence data submitted to GenBank by the Mammalian Gene Collection project.

References

Mammalian Gene Collection project references.

Kent WJ. BLAT--the BLAST-like alignment tool. Genome Res. 2002 Apr;12(4):656-64. PMID: 11932250; PMC: PMC187518