Schema for COVID GWAS v3 - GWAS meta-analyses from the COVID-19 Host Genetics Initiative
  Database: hg19    Primary Table: covidHgiGwasB2 Data last updated: 2020-09-16
Big Bed File Download: /gbdb/hg19/covidHgiGwas/covidHgiGwasB2.hg19.bb
Item Count: 14,972,711
The data is stored in the binary BigBed format.

Format description: Meta-analysis from COVID 19 Host Genetics Initiative (covid19hg.org). BED 9+12 for lollipop display
fieldexampledescription
chromchr1Reference sequence chromosome or scaffold
chromStart166167338Start position in chrom
chromEnd166167339End position in chrom
namers762598128dbSNP Reference SNP (rs) identifier or :
score0Score from 0-1000, derived from p-value
strand.Unused. Always '.'
thickStart166167338Start position in chrom
thickEnd166167339End position in chrom
color160,160,255Red (positive effect) or blue (negative). Brightness reflects pvalue
effectSize-1.331Effect size (beta coefficient)
effectSizeSE1.762Effect size standard error
pValue4.50e-01p-value
pValueLog0.347-log10 p-value
pValueHet7.40e-01p-value from Cochran's Q heterogeneity test
refANon-effect allele
altGEffect allele
alleleFreq0.000Allele frequency among the samples
sampleN644588Total sample size (sum of study sample sizes)
sourceCount2Number of studies
_radius4Lollipop radius; scaled ratio of sourceCount to total studies, for display
_effectSizeAbs1.331Effect size, abs value for display

Sample Rows
 
chromchromStartchromEndnamescorestrandthickStartthickEndcoloreffectSizeeffectSizeSEpValuepValueLogpValueHetrefaltalleleFreqsampleNsourceCount_radius_effectSizeAbs
chr1166167338166167339rs7625981280.166167338166167339160,160,255-1.3311.7624.50e-010.3477.40e-01AG0.000644588241.331
chr1166167492166167493rs618351440.166167492166167493160,160,255-0.0120.0397.59e-010.1204.16e-01TC0.197900653880.012
chr1166167562166167563rs1163301390.166167562166167563255,160,1600.1730.1452.34e-010.6321.57e-01GA0.033867195450.173
chr1166167579166167580rs1831982780.166167579166167580255,160,1600.7090.6782.95e-010.5297.21e-01GA0.008442849240.709
chr1166167708166167709rs1481627480.166167708166167709160,160,255-0.0720.1716.72e-010.1739.31e-01AG0.017867715560.072
chr1166167722166167723rs1156883410.166167722166167723255,160,1600.4060.2861.55e-010.8083.90e-01CA0.007890795450.406
chr1166167995166167996rs1904012730.166167995166167996160,160,255-1.0190.6241.03e-010.9889.84e-01AG0.007639323241.019
chr1166168577166168578rs5620419720.166168577166168578255,160,1600.9660.5921.03e-010.9883.29e-01AG0.002644588240.966
chr1166168653166168654rs1491068030.166168653166168654255,160,1600.1830.4426.79e-010.1684.86e-01GA0.006863380340.183
chr1166168710166168711rs1833936190.166168710166168711160,160,255-0.0590.4949.05e-010.0431.04e-01CT0.002644588240.059

COVID GWAS v3 (covidHgiGwas) Track Description
 

Description

This track set shows GWAS meta-analyses from the COVID-19 Host Genetics Initiative (HGI): a collaborative effort to facilitate the generation, analysis and sharing of COVID-19 host genetics research. The COVID-19 HGI organizes meta-analyses across multiple studies contributed by partners world-wide to identify the genetic determinants of SARS-CoV-2 infection susceptibility and disease severity and outcomes. Moreover, the COVID-19 HGI also aims to provide a platform for study partners to share analytical results in the form of summary statistics and/or individual level data where possible.

The specific phenotypes studied by the COVID-19 HGI are those that benefit from maximal sample size: primary analysis on disease severity. Two meta-analyses are represented in this track:

  • ANA_C2_V2: covid vs. population (6696 cases from 18 studies)
  • ANA_B2_V2: hospitalized covid vs. population (3199 cases from 8 studies)

Display Conventions

Displayed items are colored by GWAS effect: red for positive, blue for negative. The height of the item reflects the effect size. The effect size, defined as the contribution of a SNP to the genetic variance of the trait, was measured as beta coefficient (beta). The higher the absolute value of the beta coefficient, the stronger the effect. The color saturation indicates statistical significance: p-values smaller than 1e-5 are brightly colored (bright red    , bright blue    ), those with less significance (p >= 1e-5) are paler (light red    , light blue    ). For better visualization of the data, only SNPs with p-values smaller than 1e-3 are displayed by default.

Each track has separate display controls and data can be filtered according to the number of studies, minimum -log10 p-value, and the effect size (beta coefficient), using the track Configure options.

Mouseover on items shows the rs ID (or chrom:pos if none assigned), both the non-effect and effect alleles, the effect size (beta coefficient), the p-value, and the number of studies. Additional information on each variant can be found on the details page by clicking on the item.

Methods

COVID-19 Host Genetics Initiative (HGI) GWAS meta-analysis round 3 (July 2020) results were used in this study. Each participating study partner submitted GWAS summary statistics for up to four of the COVID-19 phenotype definitions.

Data were generated from genome-wide SNP array and whole exome and genome sequencing, leveraging the impact of both common and rare variants. The statistical analysis performed takes into account differences between sex, ancestry, and date of sample collection. Alleles were harmonized across studies and reported allele frequencies are based on gnomAD version 3.0 reference data. Most study partners used the SAIGE GWAS pipeline in order to generate summary statistics used for the COVID-19 HGI meta-analysis. The summary statistics of individual studies were manually examined for inflation, deflation, and excessive number of false positives. Qualifying summary statistics were filtered for INFO > 0.6 and MAF > 0.0001 prior to meta-analyzing the entirety of the data. The meta-analysis was done using inverse variance weighting of effects method, accounting for strand differences and allele flips in the individual studies.

The meta-analysis results of variants appearing in at least three studies (analysis C2) or two studies (all other analyses) were made publicly available. The meta-analysis software and workflow are available here. More information about the prospective studies, processing pipeline, results and data sharing can be found here.

Data Access

The data underlying these tracks and summary statistics results are publicly available in COVID19-hg Release 3 (June 2020). The raw data can be explored interactively with the Table Browser, or the Data Integrator. Please refer to our mailing list archives for questions, or our Data Access FAQ for more information.

Credits

Thanks to the COVID-19 Host Genetics Initiative contributors and project leads for making these data available, and in particular to Rachel Liao, Juha Karjalainen, and Kumar Veerapen at the Broad Institute for their review and input during browser track development.

References

COVID-19 Host Genetics Initiative. The COVID-19 Host Genetics Initiative, a global initiative to elucidate the role of host genetic factors in susceptibility and severity of the SARS-CoV-2 virus pandemic. Eur J Hum Genet. 2020 Jun;28(6):715-718. PMID: 32404885; PMC: PMC7220587