Schema for COVID GWAS v4 - COVID risk variants from GWAS meta-analyses by the COVID-19 Host Genetics Initiative (Rel 4, Oct 2020)
  Database: hg38    Primary Table: covidHgiGwasR4PvalA2 Data last updated: 2020-12-21
Big Bed File Download: /gbdb/hg38/covidHgiGwas/covidHgiGwasR4.A2.hg38.bb
Item Count: 11,587,871
The data is stored in the binary BigBed format.

Format description: Meta-analysis from COVID 19 Host Genetics Initiative (covid19hg.org). BED 9+12 for lollipop display
fieldexampledescription
chromchr1Reference sequence chromosome or scaffold
chromStart165974064Start position in chrom
chromEnd165974065End position in chrom
namers1260472294dbSNP Reference SNP (rs) identifier or :
score0Score from 0-1000, derived from p-value
strand.Unused. Always '.'
thickStart165974064Start position in chrom
thickEnd165974065End position in chrom
color160,160,255Red (positive effect) or blue (negative)
effectSize-0.099Effect size (beta coefficient)
effectSizeSE0.246Effect size standard error
pValue6.87e-01p-value
pValueLog0.163-log10 p-value
pValueHet7.73e-02p-value from Cochran's Q heterogeneity test
refGNon-effect allele
altAEffect allele
alleleFreq0.008Allele frequency among the samples
sampleN12486Total sample size (sum of study sample sizes)
sourceCount6Number of studies
_radius5Lollipop radius; scaled ratio of sourceCount to total studies, for display
_effectSizeAbs0.099Effect size, abs value for display

Sample Rows
 
chromchromStartchromEndnamescorestrandthickStartthickEndcoloreffectSizeeffectSizeSEpValuepValueLogpValueHetrefaltalleleFreqsampleNsourceCount_radius_effectSizeAbs
chr1165974064165974065rs12604722940.165974064165974065160,160,255-0.0990.2466.87e-010.1637.73e-02GA0.00812486650.099
chr1165974344165974345rs618367180.165974344165974345160,160,255-0.0510.0442.41e-010.6187.74e-01GC0.1146281261280.051
chr11659749981659749991:1659749990.165974998165974999160,160,255-0.0320.0786.77e-010.1696.16e-01CG0.0653793591070.032
chr1165975386165975387rs121181370.165975386165975387255,160,1600.0410.0271.38e-010.8612.56e-01CT0.4466281261280.041
chr1165975456165975457rs121367800.165975456165975457255,160,1600.0410.0271.32e-010.8783.18e-01AG0.4526281261280.041
chr1165975594165975595rs121376570.165975594165975595255,160,1600.0410.0271.39e-010.8573.89e-01TG0.4596281261280.041
chr11659756671659756681:1659756680.165975667165975668255,160,1600.0430.0271.19e-010.9233.22e-01TC0.4536281261280.043
chr11659758841659758851:1659758850.165975884165975885255,160,1600.0410.0271.32e-010.8783.92e-01AT0.4596281261280.041
chr11659762271659762281:1659762280.165976227165976228160,160,255-0.0510.0432.45e-010.6108.40e-01TC0.1146281261280.051
chr11659765211659765221:1659765220.165976521165976522255,160,1600.0170.0285.28e-010.2773.49e-01AG0.5736281261280.017

COVID GWAS v4 (covidHgiGwasR4Pval) Track Description
 

Description

This track set shows the results of the GWAS Data Release 4 (October 2020) from the COVID-19 Host Genetics Initiative (HGI): a collaborative effort to facilitate the generation of meta-analysis across multiple studies contributed by partners world-wide to identify the genetic determinants of SARS-CoV-2 infection susceptibility, disease severity and outcomes. The COVID-19 HGI also aims to provide a platform for study partners to share analytical results in the form of summary statistics and/or individual level data of COVID-19 host genetics research. At the time of this release, a total of 137 studies were registered with this effort.

The specific phenotypes studied by the COVID-19 HGI are those that benefit from maximal sample size: primary analysis on disease severity. For the Data Release 4 the number of cases have increased by nearly ten-fold (more than 30,000 COVID-19 cases and 1.47 million controls) by combining data from 34 studies across 16 countries.

The four tracks here are based on data from HGI meta-analyses A2, B2, C1, and C2, described here:

Due to privacy concerns, these browser tracks exclude data provided by 23andMe contributed studies in the full analysis results. The actual study and case and control counts for the individual browser tracks are listed in the track labels. Details on all studies can be found here.

Display Conventions

Displayed items are colored by GWAS effect: red for positive (harmful) effect, blue for negative (protective) effect. The height ('lollipop stem') of the item is based on statistical significance (p-value). For better visualization of the data, only SNPs with p-values smaller than 1e-3 are displayed by default.

The color saturation indicates effect size (beta coefficient): values over the median of effect size are brightly colored (bright red    , bright blue    ), those below the median are paler (light red    , light blue    ).

Each track has separate display controls and data can be filtered according to the number of studies, minimum -log10 p-value, and the effect size (beta coefficient), using the track Configure options.

Mouseover on items shows the rs ID (or chrom:pos if none assigned), both the non-effect and effect alleles, the effect size (beta coefficient), the p-value, and the number of studies. Additional information on each variant can be found on the details page by clicking on the item.

Methods

COVID-19 Host Genetics Initiative (HGI) GWAS meta-analysis round 4 (October 2020) results were used in this study. Each participating study partner submitted GWAS summary statistics for up to four of the COVID-19 phenotype definitions.

Data were generated from genome-wide SNP array and whole exome and genome sequencing, leveraging the impact of both common and rare variants. The statistical analysis performed takes into account differences between sex, ancestry, and date of sample collection. Alleles were harmonized across studies and reported allele frequencies are based on gnomAD version 3.0 reference data. Most study partners used the SAIGE GWAS pipeline in order to generate summary statistics used for the COVID-19 HGI meta-analysis. The summary statistics of individual studies were manually examined for inflation, deflation, and excessive number of false positives. Qualifying summary statistics were filtered for INFO > 0.6 and MAF > 0.0001 prior to meta-analyzing the entirety of the data.

The meta-analysis was performed using fixed effects inverse variance weighting. The meta-analysis software and workflow are available here. More information about the prospective studies, processing pipeline, results and data sharing can be found here.

Data Access

The data underlying these tracks and summary statistics results are publicly available in COVID19-hg Release 4 (October 2020). The raw data can be explored interactively with the Table Browser, or the Data Integrator. Please refer to our mailing list archives for questions, or our Data Access FAQ for more information.

Credits

Thanks to the COVID-19 Host Genetics Initiative contributors and project leads for making these data available, and in particular to Rachel Liao, Juha Karjalainen, and Kumar Veerapen at the Broad Institute for their review and input during browser track development.

References

COVID-19 Host Genetics Initiative. The COVID-19 Host Genetics Initiative, a global initiative to elucidate the role of host genetic factors in susceptibility and severity of the SARS-CoV-2 virus pandemic. Eur J Hum Genet. 2020 Jun;28(6):715-718. PMID: 32404885; PMC: PMC7220587

Pairo-Castineira E, Clohisey S, Klaric L, Bretherick AD, Rawlik K, Pasko D, Walker S, Parkinson N, Fourman MH, Russell CD et al. Genetic mechanisms of critical illness in Covid-19. Nature. 2020 Dec 11;. PMID: 33307546