Schema for TCGA Pan-Cancer - TCGA Pan-Cancer mutations: 33 TCGA Cancer Projects Summary (Pan-Can 33)

Home
Genomes
Genome Browser
Tools
Mirrors
- Euro/Asia Mirrors
- Mirroring Instructions
- US Server
- European Server
- Asian Server
Downloads
My Data
Projects
Help
About Us
- News
- Publications
- Blog
- Cite Us
- Credits
- Release Log
- Staff
- Conditions of Use
- Our History
- Jobs
- Licenses
- Contact Us

field

example

description

chrom

chr1

Chromosome (or contig, scaffold, etc.)

chromStart

166069937

Start position in chromosome

chromEnd

166069938

End position in chromosome

name

A>C

Name of item

score

Score from 0-1000

strand

+ or -

thickStart

166069937

Start of where display should be thick (start codon)

thickEnd

166069938

End of where display should be thick (stop codon)

reserved

0,0,0

Used as itemRgb as of 2004-11-22

blockCount

Number of blocks

blockSizes

Comma separated list of block sizes

chromStarts

Start positions relative to chromStart

sampleCount

Number of samples with this variant

freq

0.00188679245283

Variant frequency

Hugo_Symbol

FAM78B

Hugo symbol

Entrez_Gene_Id

149297

Entrez Gene Id

Variant_Classification

3'Flank

Class of variant

Variant_Type

SNP

Type of variant

Reference_Allele

Reference allele

Tumor_Seq_Allele1

Tumor allele 1

Tumor_Seq_Allele2

Tumor allele 2

dbSNP_RS

novel

dbSNP RS number

dbSNP_Val_Status

dbSNP validation status

days_to_death

Number of days till death

cigarettes_per_day

Number of cigarettes per day

weight

127.0

Weight

alcohol_history

Any alcohol consumption?

alcohol_intensity

Frequency of alcohol consumption

bmi

47.8000677481

Body mass index

years_smoked

Number of years smoked

height

163.0

Height

gender

female

Gender

project_id

TCGA-UCEC

TCGA Project id

ethnicity

not hispanic or latino

Ethnicity

Tumor_Sample_Barcode

TCGA-A5-A2K5-01A-11D-A17W-09

Tumor sample barcode

Matched_Norm_Sample_Barcode

TCGA-A5-A2K5-10A-01D-A17W-09

Matcheds normal sample barcode

case_id

cf77afe9-3785-45d4-ba2a-61c9cb706225

Case ID number

chrom

chromStart

chromEnd

name

score

strand

thickStart

thickEnd

reserved

blockCount

blockSizes

chromStarts

sampleCount

freq

Hugo_Symbol

Entrez_Gene_Id

Variant_Classification

Variant_Type

Reference_Allele

Tumor_Seq_Allele1

Tumor_Seq_Allele2

dbSNP_RS

dbSNP_Val_Status

days_to_death

cigarettes_per_day

weight

alcohol_history

alcohol_intensity

bmi

years_smoked

height

gender

project_id

ethnicity

Tumor_Sample_Barcode

Matched_Norm_Sample_Barcode

case_id

chr1

166069937

166069938

A>C

166069937

166069938

0,0,0

0.00188679245283

FAM78B

149297

3'Flank

SNP

novel

127.0

47.8000677481

163.0

female

TCGA-UCEC

not hispanic or latino

TCGA-A5-A2K5-01A-11D-A17W-09

TCGA-A5-A2K5-10A-01D-A17W-09

cf77afe9-3785-45d4-ba2a-61c9cb706225

chr1

166070009

166070010

G>A

166070009

166070010

0,0,0

0.00188679245283

FAM78B

149297

3'Flank

SNP

novel

60.0

25.9695290859

152.0

female

TCGA-UCEC

not hispanic or latino

TCGA-A5-A0G2-01A-11W-A062-09

TCGA-A5-A0G2-10A-01W-A062-09

4abbd258-0f0c-4428-901d-625d47ad363a

chr1

166070107

166070108

A>C

166070107

166070108

0,0,0

0.00188679245283

FAM78B

149297

3'UTR

SNP

novel

66.0

29.3333333333

150.0

female

TCGA-UCEC

hispanic or latino

TCGA-AX-A05Z-01A-11W-A027-09

TCGA-AX-A05Z-10A-01W-A027-09

bf632368-8ce7-4b4b-8842-d1b1801e62ef

chr1

166070310

166070311

G>A

166070310

166070311

0,0,0

0.00188679245283

FAM78B

149297

Missense_Mutation

SNP

109.0

38.1639298344

169.0

female

TCGA-UCEC

not hispanic or latino

TCGA-D1-A163-01A-11D-A12J-09

TCGA-D1-A163-10A-01D-A12J-09

a438dce7-6592-4ea6-a401-57f5a8fe8ba6

chr1

166070354

166070355

G>A

166070354

166070355

0,0,0

0.00188679245283

FAM78B

149297

Silent

SNP

novel

127.0

47.8000677481

163.0

female

TCGA-UCEC

not hispanic or latino

TCGA-A5-A2K5-01A-11D-A17W-09

TCGA-A5-A2K5-10A-01D-A17W-09

cf77afe9-3785-45d4-ba2a-61c9cb706225

chr1

166070355

166070356

C>T

166070355

166070356

0,0,0

0.00188679245283

FAM78B

149297

Missense_Mutation

SNP

novel

127.0

47.8000677481

163.0

female

TCGA-UCEC

not hispanic or latino

TCGA-A5-A2K5-01A-11D-A17W-09

TCGA-A5-A2K5-10A-01D-A17W-09

cf77afe9-3785-45d4-ba2a-61c9cb706225

chr1

166070357

166070358

C>G

166070357

166070358

0,0,0

0.00188679245283

FAM78B

149297

Silent

SNP

65.0

female

TCGA-UCEC

not hispanic or latino

TCGA-E6-A1LZ-01A-11D-A142-09

TCGA-E6-A1LZ-10A-01D-A142-09

9205c164-7975-421a-90c3-edfe8def595c

chr1

166070404

166070405

G>A

166070404

166070405

0,0,0

0.00188679245283

FAM78B

149297

Missense_Mutation

SNP

rs747327778

56.0

24.2382271468

152.0

female

TCGA-UCEC

not reported

TCGA-EO-A22X-01A-11D-A17W-09

TCGA-EO-A22X-10A-01D-A17W-09

10eb7dfa-d43d-479f-8c46-c0424207f958

chr1

166070424

166070425

G>A

166070424

166070425

0,0,0

0.00188679245283

FAM78B

149297

Missense_Mutation

SNP

novel

58.0

22.65625

160.0

female

TCGA-UCEC

not hispanic or latino

TCGA-AX-A1CE-01A-11D-A135-09

TCGA-AX-A1CE-10A-01D-A135-09

4db38349-28d2-4af5-a12f-3d861937b0e0

chr1

166070538

166070540

insTTCTTGTGAGCACCTCTCCAGCTTTTGAAAAATTCATTCAGCCTCTTCTT

166070538

166070540

0,0,0

0.00188679245283

FAM78B

149297

Nonsense_Mutation

INS

TTCTTGTGAGCACCTCTCCAGCTTTTGAAAAATTCATTCAGCCTCTTCTT

novel

122.0

38.5052392375

178.0

female

TCGA-UCEC

not hispanic or latino

TCGA-D1-A102-01A-11D-A10M-09

TCGA-D1-A102-10A-01D-A10M-09

f9cf4125-0db3-4b66-b51b-bc2f0240c9ef

Description

This track shows the genomic positions of somatic variants found through whole genome sequencing of tumors as part of The Cancer Genome Atlas (TCGA) by the National Cancer Institute, made available through the Genomic Data Commons Portal. The data shown here is sometimes called the "Pan-Cancer dataset", a collection of thirty-three TCGA projects processed in a uniform way.

Display Conventions and Configuration

Variants can be filtered by project ID and gender from the track details page. Pressing the "All" button allows the user to specify whether the checked values all have to be true of a particular variant, or if only one of them need be present to satisfy the filter.

The vertical viewing range in full mode can also be used to filter what variants are shown. Variants that have a sampleCount more or less than the min and max values specificed in the viewing range are not displayed.

Data access

The raw data can be explored interactively with the Table Browser or the Data Integrator.

For automated download and analysis, the genome annotation for all the thirty-three projects is stored in a bigBed file that can be downloaded from our download server. There are also bigBed files for each of the thirty-three projects in that directory. Individual regions or the whole genome annotation can be obtained using our tool bigBedToBed which can be compiled from the source code or downloaded as a precompiled binary for your system. Instructions for downloading source code and binaries can be found here. The tool can also be used to obtain only features within a given range, e.g.,

bigBedToBed http://hgdownload.soe.ucsc.edu/gbdb/hg38/gdcCancer/gdcCancer.bb -chrom=chr21 -start=0 -end=100000000 stdout

Methods

All MuTect Variant calls were downloaded from the GDC portal in January 2019 and reformatted at UCSC to the bigBed format with a short script, cancerMafToBigBed.

Credits

Thanks to GDC for making the TCGA data available on their web site.