emm1
General Information
Protegen ID
854
Sequence Strain (Species/Organism)
Streptococcus pyogenes
VO ID
VO_0012397
Taxonomy ID
1314
Other Database IDs
CDD:273479 CDD:291694 CDD:295219 CDD:279132 GOA:Q10372 HSSP: 1IC2 InterPro: IPR001899 InterPro: IPR005877 InterPro: IPR019931 InterPro: IPR019948 InterPro: IPR019950 UniProtKB/TrEMBL: Q10372
Molecule Role
Protective antigen
Molecule Role Annotation
Mice were intranasally immunized with diphtheria toxoid (DT) conjugated polypeptides encompassing a conformational-constrained B cell epitope of the M1 protein (J8). Vaccinated mice were challenged 10 days after the last boost by the intranasal route. Animals had 60% protection against challenge (Schulze et al. , 2006 ).
Related Vaccines(s)
S. pyogenes M Protein Vaccine
References
Schulze et al. , 2006: Schulze K, Olive C, Ebensen T, Guzmán CA. Intranasal vaccination with SfbI or M protein-derived peptides conjugated to diphtheria toxoid confers protective immunity against a lethal challenge with Streptococcus pyogenes . Vaccine . 2006; 24(35-36); 6088-6095. [PubMed: 16828529 ].
Gene Information
Gene Name
emm1
NCBI Nucleotide GI
311757
DNA Sequence
>gi|311757|emb|X62131.1| S.pyogenes emm1 (29/58) gene for M protein type 1
GGATCCAATGATAACATAAGGAGCATAAAAATGGCTAAAAATAACACGAATAGACACTATTCGCTTAGAA
AATTAAAAACAGGAACGGCTTCAGTAGCGGTAGCTTTGACTGTTTTAGGGGCAGGTTTTGCGAATCAAAC
AGAGGTTAAGGCTAACGGTGATGGTAATCCTAGGGAAGTTATAGAAGATCTTGCAGCAAACAATCCCGCA
ATACAAAATATACGTTTACGTCACGAAAACAAGGACTTAAAAGCGAGATTAGAGAATGCAATGGAAGTTG
CAGGAAGAGATTTTAAGAGAGCTGAAGAACTTGAAAAAGCAAAACAAGCCTTAGAAGACCAGCGTAAAGA
TTTAGAAACTAAATTAAAAGAACTACAACAAGACTATGACTTAGCAAAGGAATCAACAAGTTGGGATAGA
CAAAGACTTGAAAAAGAGTTAGAAGAGAAAAAGGAAGCTCTTGAATTAGCGATAGACCAGGCAAGTCGGG
ACTACCATAGAGCTACCGCTTTAGAAAAAGAGTTAGAAGAGAAAAAGAAAGCTCTTGAATTAGCGATAGA
CCAAGCGAGTCAGGACTATAATAGAGCTAACGTCTTAGAAAAAGAGTTAGATACGATTACTAGAGAACAA
GAGATTAATCGTAATCTTTTAGGCAATCGAAAACTTGAACTTGATCAACTTTCATCTGAAAAAGAGCAGC
TAACGATCGAAAAAGCAAAACTTGAGGAAGAAAAACAAATCTCAGACGCAAGTCGTCAAAGCCTTCGTCG
TGACTTGGACGCATCACGTGAAGCTAAGAAACAGGTTGAAAAAGATTTAGCAAACTTGACTGCTGAACTT
GATAAGGTTAAAGAAGACAAACAAATCTCAGACGCAAGCCGTCAAGGCCTTCGCCGTGACTTGGACGCAT
CACGTGAAGCTAAGAAACAGGTTGAAAAAGATTTAGCAAACTTGACTGCTGAACTTGATAAGGTTAAAGA
AGAAAAACAAATCTCAGACGCAAGCCGTCAAGGCCTTCGCCGTGACTTGGACGCATCACGTGAAGCTAAG
AAACAAGTTGAAAAAGCTTTAGAAGAAGCAAACAGCAAATTAGCTGCTCTTGAAAAACTTAACAAAGAGC
TTGAAGAAAGCAAGAAATTAACAGAAAAAGAAAAAGCTGAACTACAAGCAAAACTTGAAGCAGAAGCAAA
TGTACTTAAAGATCCATTAGCGAAACAAGCTGAAGAACTCGCAAAACTAAGAGCTGGAAAAGCATCAGAC
TCACAAACCCCTGATACAAAACCAGGAAACAAAGCTGTTCCAGGTAAAGGTCAAGCACCACAAGCAGGTA
CAAAACCTAACCAAAACAAAGCACCAATGAAGGAAACTAAGAGACAGTTACCATCAACAGGTGAAACAGC
TAACCCATTCTTCACAGCGGCAGCCCTTACTGTTATGGCAACAGCTGGAGTAGCAGCAGTTGTAAAACGC
AAAGAAGAAAACTAAGCTGAATTC
Protein Information
Protein Name
M protein type 1
NCBI Protein GI
311758
Protein Accession
CAA44062.1
3D structure: PDB ID
2OTO
Protein pI
6.86
Protein Weight
52910.57
Protein Length
546
Protein Note
Gram-positive signal peptide, YSIRK family; TIGR01168
Protein Sequence
>CAA44062.1 M protein type 1 [Streptococcus pyogenes]
MAKNNTNRHYSLRKLKTGTASVAVALTVLGAGFANQTEVKANGDGNPREVIEDLAANNPAIQNIRLRHEN
KDLKARLENAMEVAGRDFKRAEELEKAKQALEDQRKDLETKLKELQQDYDLAKESTSWDRQRLEKELEEK
KEALELAIDQASRDYHRATALEKELEEKKKALELAIDQASQDYNRANVLEKELDTITREQEINRNLLGNR
KLELDQLSSEKEQLTIEKAKLEEEKQISDASRQSLRRDLDASREAKKQVEKDLANLTAELDKVKEDKQIS
DASRQGLRRDLDASREAKKQVEKDLANLTAELDKVKEEKQISDASRQGLRRDLDASREAKKQVEKALEEA
NSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEANVLKDPLAKQAEELAKLRAGKASDSQTPDTKPGN
KAVPGKGQAPQAGTKPNQNKAPMKETKRQLPSTGETANPFFTAAALTVMATAGVAAVVKRKEEN
Epitope Information
IEDB Linear Epitope
-- Assay Type --
T Cell Epitope
B Cell Epitope
IEDB ID
Epitope
Starting position
Ending position
60642
SREAKKQVEKAL
336
348
4740
ASREAKKQVEKALE
335
349
35120
LDASREAKKQVEKDLANLTAEL
249
271
12852
EKQISDASRQ
234
244
4737
ASREAK
251
257
39183
LRRDLDASREAKKQVEKALE
329
349
57290
SDSQTPDTKPGNKAVPGKGQ
409
429
2220
AKKQVEKALEEANSKLAALE
339
359
39182
LRRDLDASREAKKQVEKAL
329
348
32048
KLNKELEESKKLTEKEKAEL
359
379
12944
ELAKLRAGKASDSQTPDT
399
417
41143
MATAGVAAVVKRKEEN
469
485
28331
IRLRHENKDLKARLENAMEV
64
84
47492
PFFTAAALTVMATAGVAAVV
459
479
33086
KQVEKDLANLTAELDKVKEEKQ
299
321
102309
ANQTEV
34
40
11150
EANSKLAALEKLNKELEESK
349
369
55542
RRDLDASREAKK
246
258
94501
EKQISDASRQGLRRD
318
333
11092
EAKKQVEKALEE
338
350
7649
DASREAKKQVEKALEEANSK
334
354
133244
NGDGNPREVIEDLAANNPAIQNIRLR
42
68
133243
NGDGNPREVIEDLAANNPAIQNI
42
65
133242
NGDGNPREVIEDLAANNPAI
42
62
132984
AIQNIRLRHENKDL
60
74
133157
IRLRHENKDLKARL
64
78
133186
LAANNPAIQNIRLR
54
68
142167
GLRRDLDASREAKKQVEKAL
328
348
96674
NGDGNPREVIED
42
54
96526
LAANNPA
54
61
4739
ASREAKKQVEKA
335
347
520608
KVKEEKQISDASRQG
314
329
528937
VKEEKQISDASRQGL
315
330
519681
KEEKQISDASRQGLR
316
331
520874
LDKVKEDKQISDASR
270
285
513099
DKVKEDKQISDASRQ
271
286
520607
KVKEDKQISDASRQG
272
287
520875
LDKVKEEKQISDASR
312
327
513100
DKVKEEKQISDASRQ
313
328
526274
SLRRDLDASREAKKQ
244
259
521993
LRRDLDASREAKKQV
245
260
512083
ASRQSLRRDLDASRE
240
255
526838
SRQSLRRDLDASREA
241
256
525305
RQSLRRDLDASREAK
242
257
524579
QSLRRDLDASREAKK
243
258
520257
KQISDASRQSLRRDL
235
250
516882
GLRRDLDASREAKKQ
286
301
512448
DASRQGLRRDLDASR
281
296
512082
ASRQGLRRDLDASRE
282
297
526831
SRQGLRRDLDASREA
283
298
525280
RQGLRRDLDASREAK
284
299
524144
QGLRRDLDASREAKK
285
300
520256
KQISDASRQGLRRDL
277
292
528936
VKEDKQISDASRQGL
273
288
519673
KEDKQISDASRQGLR
274
289
514262
EDKQISDASRQGLRR
275
290
513078
DKQISDASRQGLRRD
276
291
514450
EEKQISDASRQGLRR
317
332
511197
AKLEEEKQISDASRQ
229
244
520013
KLEEEKQISDASRQS
230
245
519514
KAKLEEEKQISDASR
228
243
512224
AVALTVLGAGFANQT
23
38
528442
VAVALTVLGAGFANQ
22
37
514451
EEKQISDASRQSLRR
233
248
514821
EKQISDASRQSLRRD
234
249
514398
EEEKQISDASRQSLR
232
247
527600
TKLKELQQDYDLAKE
110
125
520025
KLKELQQDYDLAKES
111
126
515393
ETKLKELQQDYDLAK
109
124
528287
TVLGAGFANQTEVKA
27
42
529000
VLGAGFANQTEVKAN
28
43
511341
ALTVLGAGFANQTEV
25
40
522227
LTVLGAGFANQTEVK
26
41
528408
VALTVLGAGFANQTE
24
39
521288
LKELQQDYDLAKEST
112
127
519717
KELQQDYDLAKESTS
113
128
514921
ELQQDYDLAKESTSW
114
129
69204
VKEEKQISDASRQGLRRDLDA
315
336
MAKNNTNRHYSLRKLKTGTASVAVALTVLGAGFANQTEVKANGDGNPREVIEDLAANNPAIQNIRLRHENKDLKARLENAMEVAGRDFKRAEELEKAKQALEDQRKDLETKLKELQQDYDLAKESTSWDRQRLEKELEEKKEALELAIDQASRDYHRATALEKELEEKKKALELAIDQASQDYNRANVLEKELDTITREQEINRNLLGNRKLELDQLSSEKEQLTIEKAKLEEEKQISDASRQSLRRDLDASREAKKQVEKDLANLTAELDKVKEDKQISDASRQGLRRDLDASREAKKQVEKDLANLTAELDKVKEEKQISDASRQGLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEANVLKDPLAKQAEELAKLRAGKASDSQTPDTKPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQLPSTGETANPFFTAAALTVMATAGVAAVVKRKEEN