L1 HPV 16
General Information
Protegen ID
512
Sequence Strain (Species/Organism)
Human papillomavirus type 16
VO ID
VO_0011108
Taxonomy ID
333760
Molecule Role
Protective antigen
Molecule Role Annotation
rAAV5, -8 and -9 vectors expressing an HPV16 L1/E7 fusion gene were generated and applied intranasally for combined prophylactic and therapeutic vaccination of mice. Vaccination with the rAAV vectors led to a significant protection of animals against a challenge with different HPV tumour cell lines (Nieto et al. , 2009 ).
Related Vaccines(s)
Human papillomavirus DNA vaccine pC16-L1 encoding L1
,
Human papillomavirus DNA vaccine VlJns-Ll encoding L1
References
Nieto et al. , 2009: Nieto K, Kern A, Leuchs B, Gissmann L, Müller M, Kleinschmidt JA. Combined prophylactic and therapeutic intranasal vaccination against human papillomavirus type-16 using different adeno-associated virus serotype vectors. Antiviral therapy . 2009; 14(8); 1125-1137. [PubMed: 20032542 ].
Gene Information
Gene Name
L1 HPV 16
NCBI Gene ID
1489082
Genbank Accession
K02718
Locus Tag
HpV16gp8
Gene Starting Position
4774
Gene Ending Position
6291
DNA Sequence
>NC_001526.4:4774-6291 Human papillomavirus type 16, complete genome
GATGTCTCTTTGGCTGCCTAGTGAGGCCACTGTCTACTTGCCTCCTGTCCCAGTATCTAAGGTTGTAAGC
ACGGATGAATATGTTGCACGCACAAACATATATTATCATGCAGGAACATCCAGACTACTTGCAGTTGGAC
ATCCCTATTTTCCTATTAAAAAACCTAACAATAACAAAATATTAGTTCCTAAAGTATCAGGATTACAATA
CAGGGTATTTAGAATACATTTACCTGACCCCAATAAGTTTGGTTTTCCTGACACCTCATTTTATAATCCA
GATACACAGCGGCTGGTTTGGGCCTGTGTAGGTGTTGAGGTAGGTCGTGGTCAGCCATTAGGTGTGGGCA
TTAGTGGCCATCCTTTATTAAATAAATTGGATGACACAGAAAATGCTAGTGCTTATGCAGCAAATGCAGG
TGTGGATAATAGAGAATGTATATCTATGGATTACAAACAAACACAATTGTGTTTAATTGGTTGCAAACCA
CCTATAGGGGAACACTGGGGCAAAGGATCCCCATGTACCAATGTTGCAGTAAATCCAGGTGATTGTCCAC
CATTAGAGTTAATAAACACAGTTATTCAGGATGGTGATATGGTTGATACTGGCTTTGGTGCTATGGACTT
TACTACATTACAGGCTAACAAAAGTGAAGTTCCACTGGATATTTGTACATCTATTTGCAAATATCCAGAT
TATATTAAAATGGTGTCAGAACCATATGGCGACAGCTTATTTTTTTATTTACGAAGGGAACAAATGTTTG
TTAGACATTTATTTAATAGGGCTGGTACTGTTGGTGAAAATGTACCAGACGATTTATACATTAAAGGCTC
TGGGTCTACTGCAAATTTAGCCAGTTCAAATTATTTTCCTACACCTAGTGGTTCTATGGTTACCTCTGAT
GCCCAAATATTCAATAAACCTTATTGGTTACAACGAGCACAGGGCCACAATAATGGCATTTGTTGGGGTA
ACCAACTATTTGTTACTGTTGTTGATACTACACGCAGTACAAATATGTCATTATGTGCTGCCATATCTAC
TTCAGAAACTACATATAAAAATACTAACTTTAAGGAGTACCTACGACATGGGGAGGAATATGATTTACAG
TTTATTTTTCAACTGTGCAAAATAACCTTAACTGCAGACGTTATGACATACATACATTCTATGAATTCCA
CTATTTTGGAGGACTGGAATTTTGGTCTACAACCTCCCCCAGGAGGCACACTAGAAGATACTTATAGGTT
TGTAACATCCCAGGCAATTGCTTGTCAAAAACATACACCTCCAGCACCTAAAGAAGATCCCCTTAAAAAA
TACACTTTTTGGGAAGTAAATTTAAAGGAAAAGTTTTCTGCAGACCTAGATCAGTTTCCTTTAGGACGCA
AATTTTTACTACAAGCAGGATTGAAGGCCAAACCAAAATTTACATTAGGAAAACGAAAAGCTACACCCAC
CACCTCATCTACCTCTACAACTGCTAAACGCAAAAAACGTAAGCTGTA
Protein Information
Protein Name
major capsid protein L1
NCBI Protein GI
1046490004
Protein Accession
NP_041332
3D structure: PDB ID
2R5H
Protein pI
8.38
Protein Weight
53132.12
Protein Length
505
Protein Note
major capsid L1 protein; Two structural proteins are involved in papillomavirus capsid formation, a major (L1) and a minor (L2) protein; L1 forms the pentameric assembly unit of the viral shell while L2 mediates several facets of viral entry including endosomal escape after uncoating
Protein Sequence
>NP_041332.2 major capsid protein L1 [Human papillomavirus type 16]
MSLWLPSEATVYLPPVPVSKVVSTDEYVARTNIYYHAGTSRLLAVGHPYFPIKKPNNNKILVPKVSGLQY
RVFRIHLPDPNKFGFPDTSFYNPDTQRLVWACVGVEVGRGQPLGVGISGHPLLNKLDDTENASAYAANAG
VDNRECISMDYKQTQLCLIGCKPPIGEHWGKGSPCTNVAVNPGDCPPLELINTVIQDGDMVDTGFGAMDF
TTLQANKSEVPLDICTSICKYPDYIKMVSEPYGDSLFFYLRREQMFVRHLFNRAGTVGENVPDDLYIKGS
GSTANLASSNYFPTPSGSMVTSDAQIFNKPYWLQRAQGHNNGICWGNQLFVTVVDTTRSTNMSLCAAIST
SETTYKNTNFKEYLRHGEEYDLQFIFQLCKITLTADVMTYIHSMNSTILEDWNFGLQPPPGGTLEDTYRF
VTSQAIACQKHTPPAPKEDPLKKYTFWEVNLKEKFSADLDQFPLGRKFLLQAGLKAKPKFTLGKRKATPT
TSSTSTTAKRKKRKL
Epitope Information
IEDB Linear Epitope
-- Assay Type --
T Cell Epitope
B Cell Epitope
IEDB ID
Epitope
Starting position
Ending position
134343
DLYIK
274
279
112443
ACQKHTPPAP
427
437
109085
GFGAMDF
204
211
450403
ACQKHTPPAPKEDPLKKYT
427
446
110652
LCLIGCKPPIGEHWGKGSPC
156
176
110975
QIFNKPYWLQRAQGH
305
320
110856
FSADLDQFPLGRKFL
455
470
110916
KLDDTENASAYAANA
125
140
110877
GTVGENVPDDLYIKG
265
280
110910
KAKPKFTLGKRKATP
475
490
110733
VGENVPDDLYIKGSG
267
282
110602
GLKAKPKFTLGKRKATPTT
473
492
110948
MVTSDAQIFNKPYWL
299
314
110855
FNRAGTVGENVPDDLYIKGS
261
281
110980
QPLGVGISGHPLLNKLDDTE
111
131
111785
SLFFYLRREQMFVRH
245
260
111533
LYIKGSGSTANLASS
275
290
110343
NLASSNYFPTPSGSM
285
300
110375
PSGSMVTSDAQIFNK
295
310
111708
RAQGHNNGICWGNQL
315
330
111946
WGNQLFVTVVDTTRS
325
340
111139
CAAISTSETTYKNTN
345
360
111230
ECISMDYKQTQLCLI
145
160
111418
IGEHWGKGSPCTNVA
165
180
111165
CPPLELINTVIQDGD
185
200
111272
FGAMDFTTLQANKSE
205
220
111110
ANKSEVPLDICTSIC
215
230
111176
CTSICKYPDYIKMVS
225
240
111424
IKMVSEPYGDSLFFY
235
250
111909
VGHPYFPIKKPNNNK
45
60
111634
PNNNKILVPKVSGLQ
55
70
111924
VSGLQYRVFRIHLPD
65
80
111952
YAANAGVDNRECISM
135
150
111113
APKEDPLKKYTFWEV
435
450
111826
TFWEVNLKEKFSADL
445
460
111733
RKATPTTSSTSTTAK
485
500
111907
VEVGRGQPLGVGISG
105
120
111089
ADVMTYIHSMNSTIL
385
400
111351
GLQPPPGGTLEDTYR
405
420
111232
EDTYRFVTSQAIACQ
415
430
111416
IFQLCKITLTADVMT
375
390
111422
IHLPDPNKFGFPDTS
75
90
111673
QLCLIGCKPPIGEHW
155
170
111218
DTTRSTNMSLCAAIS
335
350
111286
FPDTSFYNPDTQRLV
85
100
111846
TQRLVWACVGVEVGR
95
110
111725
RHGEEYDLQFIFQLC
365
380
111510
LPSEATVYLPPVPVS
5
20
111611
PCTNVAVNPGDC
174
186
111811
STILEDWNFGLQPPPGGTLE
396
416
111890
VDNRECISMDYKQTQLCLIG
141
161
111450
KGSPCTNVAVNPGDCPPLEL
171
191
111585
NKSEVPLDICTSICKYPDYI
216
236
111581
NGICWGNQLFVTVVDTTRST
321
341
111583
NKFGFPDTSFYNPDTQRLVW
81
101
112486
DTYRF
416
421
112524
GGTLEDTYRF
411
421
112581
LKKYTFWEVNLKEKFSADLD
441
461
112655
SADLDQFPLGRKFLL
456
471
112575
LFFYLRREQMFVRHLFNRAG
246
266
112442
ACQKH
427
432
112625
PSEATVYLPPVPVSKVVSTD
6
26
112716
VVSTDEYVARTNIYYHAGTS
21
41
112726
YFPTPSGSMVTSDAQIFNKP
291
311
112553
ITLTADVMTYIHSMNSTILE
381
401
112728
YIKGSGSTANLASSNYFPTP
276
296
112640
QRLVWACVGVEVGRGQPLGV
96
116
113838
SADLDQFPLGRKFLLQAGLK
456
476
110965
PNNNKILVPKVSGLQYRVFR
55
75
110825
EDTYRFVTSQAIACQKHTPPA
415
436
110938
LYIKGSGSTANLASSNYFPT
275
295
110898
IACQKHTPPAPKEDPLKKYTFWEVNLK
426
453
109478
LGKRKATPTTSSTSTTAKRKKRKL
482
506
146066
VPKVSGLQY
62
71
111098
AGTVGENVPDDLYIKGSGST
264
284
111493
LLQAGLKAKPKFTLGKRKATPTTSS
469
494
110863
GLKAKPKFTLGKRKATPTTS
473
493
110872
GSGSTANLASSNYFP
279
294
175665
VTSDAQIFNKPYWLQRAQGH
300
320
175636
PTPSGSMVTSDAQIFNKPYW
293
313
175639
QIFNKPYWL
305
314
175568
AQIFNKPYWL
304
314
175631
PLGVGISGHPLLNKLDDTEN
112
132
110651
LCLIGCKPPIGEHWGKGSP
156
175
558393
CPPLELIN
185
193
558523
YLPPVP
12
18
558480
PNKFGF
80
86
109332
IHSMNSTIL
391
400
604717
TSDAQIFNKP
301
311
833322
INTVIQDGDMVDTGFGAMDFTTLQANKSEVPLDICTSICKYPD
191
234
833581
KHTPPAPKEDPLKKYTFWEVNLKEKFSADLDQ
430
462
836325
VVSTDEYVARTNIYYHAGTSRLLAVGHPYFPIKK
21
55
834744
SFYNPDTQRLVWACVGVEVGRGQPLGVG
89
117
832586
CISMDYKQTQLCLIGCKPPIGE
146
168
833193
ICKYPDYIKMVSEPYGDSLFFYLRREQMFVRHLFN
228
263
834521
SADLDQFPLGRKFLLQAGLKAKPK
456
480
834005
MSLWLPSEATVYLPPVPVSKVVSTDE
1
27
836428
YFPIKKPNNNKILVPKVSGLQYRVFRIHLPDPNKFGFPDTSFYNPD
49
95
834191
QPLGVGISGHPLLNKLDDTENASAYAANAGVDNRECISMDY
111
152
835375
STSETTYKNTNFKEYLRHGEEYDLQFIFQLCKITL
349
384
MSLWLPSEATVYLPPVPVSKVVSTDEYVARTNIYYHAGTSRLLAVGHPYFPIKKPNNNKILVPKVSGLQYRVFRIHLPDPNKFGFPDTSFYNPDTQRLVWACVGVEVGRGQPLGVGISGHPLLNKLDDTENASAYAANAGVDNRECISMDYKQTQLCLIGCKPPIGEHWGKGSPCTNVAVNPGDCPPLELINTVIQDGDMVDTGFGAMDFTTLQANKSEVPLDICTSICKYPDYIKMVSEPYGDSLFFYLRREQMFVRHLFNRAGTVGENVPDDLYIKGSGSTANLASSNYFPTPSGSMVTSDAQIFNKPYWLQRAQGHNNGICWGNQLFVTVVDTTRSTNMSLCAAISTSETTYKNTNFKEYLRHGEEYDLQFIFQLCKITLTADVMTYIHSMNSTILEDWNFGLQPPPGGTLEDTYRFVTSQAIACQKHTPPAPKEDPLKKYTFWEVNLKEKFSADLDQFPLGRKFLLQAGLKAKPKFTLGKRKATPTTSSTSTTAKRKKRKL