Schema for CD8 RosettaMHC - CD8 Epitopes predicted by NetMHC and Rosetta

Home
Genomes
Genome Browser
Tools
Mirrors
- Euro/Asia Mirrors
- Mirroring Instructions
- US Server
- European Server
- Asian Server
Downloads
My Data
Projects
Help
About Us
- News
- Publications
- Blog
- Cite Us
- Credits
- Release Log
- Staff
- Conditions of Use
- Our History
- Jobs
- Licenses
- Contact Us

field

example

description

chrom

NC_045512v2

Reference sequence chromosome or scaffold

chromStart

20116

Start position in chromosome

chromEnd

20141

End position in chromosome

name

TLIGEAVKT

Short Name of item

score

640

Score from 0-1000

strand

+ or -

thickStart

20116

Start of where display should be thick (start codon)

thickEnd

20141

End of where display should be thick (stop codon)

reserved

246,169,138

Used as itemRgb as of 2004-11-22

id

335

ID of peptide

description

YP_009724389_TLIGEAVKT

Description

chrom

chromStart

chromEnd

name

score

strand

thickStart

thickEnd

reserved

description

NC_045512v2

20116

20141

TLIGEAVKT

640

20116

20141

246,169,138

335

YP_009724389_TLIGEAVKT

NC_045512v2

20161

20186

KVDGVVQQL

548

20161

20186

219,220,222

293

YP_009724389_KVDGVVQQL

NC_045512v2

20239

20267

SQMEIDFLEL

423

20239

20267

123,158,248

468

YP_009724389_SQMEIDFLEL

NC_045512v2

20242

20270

QMEIDFLELA

481

20242

20270

170,198,253

600

YP_009724389_QMEIDFLELA

NC_045512v2

20257

20285

FLELAMDEFI

423

20257

20285

123,158,248

469

YP_009724389_FLELAMDEFI

NC_045512v2

20347

20375

SQLGGLHLLI

488

20347

20375

176,203,251

613

YP_009724389_SQLGGLHLLI

NC_045512v2

20347

20372

SQLGGLHLL

505

20347

20372

189,210,246

315

YP_009724389_SQLGGLHLL

NC_045512v2

20350

20375

QLGGLHLLI

522

20350

20375

201,215,238

339

YP_009724389_QLGGLHLLI

NC_045512v2

20506

20534

DLLLDDFVEI

413

20506

20534

115,149,244

450

YP_009724389_DLLLDDFVEI

NC_045512v2

20509

20537

LLLDDFVEII

466

20509

20537

157,189,254

567

YP_009724389_LLLDDFVEII

Description

As a first step toward the development of diagnostic and therapeutic tools to fight the Coronavirus disease (COVID-19), it is important to characterize CD8+ T cell epitopes in the SARS-CoV-2 peptidome that can trigger adaptive immune responses. Here, we use RosettaMHC, a comparative modeling approach which leverages existing high-resolution X-ray structures from peptide/MHC complexes available in the Protein Data Bank, to derive physically realistic 3D models for high-affinity SARS-CoV-2 epitopes. We outline an application of our method to model 439 9mer and 279 10mer predicted epitopes displayed by the common allele HLA-A*02:01, and we make our models publicly available through an online database (https://rosettamhc.chemistry.ucsc.edu). As more detailed studies on antigen-specific T cell recognition become available, RosettaMHC models of antigens from different strains and HLA alleles can be used as a basis to understand the link between peptide/HLA complex structure and surface chemistry with immunogenicity, in the context of SARS-CoV-2 infection.

This track includes 718 CD8 epitopes restricted to HLA-A*02:01 as predicted by NetMHCpan4.0 and RosettaMHC. The structural models of all 718 epitopes are available in the database (see Description). All the epitopes are scored using a combined NetMHCPan4.0 (eluted ligand) predicted binding affinity and binding energy calculated in Rosetta force field (score = (0.5 * ( ((NetMHCPan affinity - Average NetMHCPan affinity) / range of NetMHCPan affinities) + ( (Rosetta binding energy - Average Rosetta binding energy ) / range of Rosetta binding energies) ) + 1 ) * 500).

Methods

Epitopes of lengths 9 and 10 from all reading frames of SARS-CoV-2 proteome are generated and filtered using NetMHCPan4.0 (eluted ligand prediction). All the epitopes predicted as strong or weak binders (a total of 718) to HLA-A*02:01 by NetMHCPan4.0 (using default %Rank cut-off) are modeled using RosettaMHC. Further, binding energies of all 718 epitopes to HLA-A*02:01 is calculated in Rosetta. Alongside all the models, their NetMHCpan predictions and binding energies are made available through a database and Supplementary Table 1 from the reference, Nerli and Sgourakis. (2020) in the References section below.

Notes

For a full description of the methods used, refer to Nerli and Sgourakis. (2020) in the References section below.

Credits

Nikolaos Sgourakis (nsgourak@ucsc.edu)

Santrupti Nerli (snerli@ucsc.edu)

Data were generated and processed at UCSC. For inquiries, please contact Nikolaos Sgourakis from the Sgourakis Research Group at UCSC.

References

Nerli and Sgourakis. 2020 (Manuscript submitted) (BioRxiv).