POLY

Gene Name: POLY
Sequence Strain (Species/Organism): Dengue virus 4
NCBI Gene ID: 5075729
NCBI Protein GI: 12084823
Locus Tag: DV4_gp1
Genbank Accession: AF326825
Protein Accession: NP_073286
Taxonomy ID: 11070
Gene Starting Position: 101
Gene Ending Position: 10264
Gene Strand (Orientation): +
Protein Name: polyprotein
Protein pI: 8.63
Protein Weight: 354952.36
Protein Length: 3387
Protein Note: Also known as polyprotein gene
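The gene coordinates and the protein length listed above can be cross-checked with a little arithmetic. The sketch below assumes the positions are 1-based and inclusive and that the coding region ends with a single stop codon; neither is stated in the record. The function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(start: int, end: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by the range, optionally excluding one stop codon."""
    n = gene_length(start, end)
    if n % 3 != 0:
        raise ValueError(f"range length {n} is not a multiple of 3")
    codons = n // 3
    return codons - 1 if has_stop_codon else codons


# Values copied from the record above.
START, END, PROTEIN_LENGTH = 101, 10264, 3387

nt = gene_length(START, END)                # 10164 nucleotides
aa = expected_protein_length(START, END)    # 3388 codons minus the stop codon
print(nt, aa, aa == PROTEIN_LENGTH)
```

Under these assumptions the range covers 10164 nucleotides, that is 3388 codons, and subtracting the stop codon gives the 3387 residues listed as the protein length.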
DNA Sequence
>NC_002640.1:101-10264 Dengue virus 4, complete genome
AATGAACCAACGAAAAAAGGTGGTTAGACCACCTTTCAATATGCTGAAACGCGAGAGAAACCGCGTATCA
ACCCCTCAAGGGTTGGTGAAGAGATTCTCAACCGGACTTTTTTCTGGGAAAGGACCCTTACGGATGGTGC
TAGCATTCATCACGTTTTTGCGAGTCCTTTCCATCCCACCAACAGCAGGGATTCTGAAGAGATGGGGACA
GTTGAAGAAAAATAAGGCCATCAAGATACTGATTGGATTCAGGAAGGAGATAGGCCGCATGCTGAACATC
TTGAACGGGAGAAAAAGGTCAACGATAACATTGCTGTGCTTGATTCCCACCGTAATGGCGTTTTCCCTCA
GCACAAGAGATGGCGAACCCCTCATGATAGTGGCAAAACATGAAAGGGGGAGACCTCTCTTGTTTAAGAC
AACAGAGGGGATCAACAAATGCACTCTCATTGCCATGGACTTGGGTGAAATGTGTGAGGACACTGTCACG
TATAAATGCCCCCTACTGGTCAATACCGAACCTGAAGACATTGATTGCTGGTGCAACCTCACGTCTACCT
GGGTCATGTATGGGACATGCACCCAGAGCGGAGAACGGAGACGAGAGAAGCGCTCAGTAGCTTTAACACC
ACATTCAGGAATGGGATTGGAAACAAGAGCTGAGACATGGATGTCATCGGAAGGGGCTTGGAAGCATGCT
CAGAGAGTAGAGAGCTGGATACTCAGAAACCCAGGATTCGCGCTCTTGGCAGGATTTATGGCTTATATGA
TTGGGCAAACAGGAATCCAGCGAACTGTCTTCTTTGTCCTAATGATGCTGGTCGCCCCATCCTACGGAAT
GCGATGCGTAGGAGTAGGAAACAGAGACTTTGTGGAAGGAGTCTCAGGTGGAGCATGGGTCGACCTGGTG
CTAGAACATGGAGGATGCGTCACAACCATGGCCCAGGGAAAACCAACCTTGGATTTTGAACTGACTAAGA
CAACAGCCAAGGAAGTGGCTCTGTTAAGAACCTATTGCATTGAAGCCTCAATATCAAACATAACTACGGC
AACAAGATGTCCAACGCAAGGAGAGCCTTATCTGAAAGAGGAACAGGACCAACAGTACATTTGCCGGAGA
GATGTGGTAGACAGAGGGTGGGGCAATGGCTGTGGCTTGTTTGGAAAAGGAGGAGTTGTGACATGTGCGA
AGTTTTCATGTTCGGGGAAGATAACAGGCAATTTGGTCCAAATTGAGAACCTTGAATACACAGTGGTTGT
AACAGTCCACAATGGAGACACCCATGCAGTAGGAAATGACACATCCAATCATGGAGTTACAGCCATGATA
ACTCCCAGGTCACCATCGGTGGAAGTCAAATTGCCGGACTATGGAGAACTAACACTCGATTGTGAACCCA
GGTCTGGAATTGACTTTAATGAGATGATTCTGATGAAAATGAAAAAGAAAACATGGCTCGTGCATAAGCA
ATGGTTTTTGGATCTGCCTCTTCCATGGACAGCAGGAGCAGACACATCAGAGGTTCACTGGAATTACAAA
GAGAGAATGGTGACATTTAAGGTTCCTCATGCCAAGAGACAGGATGTGACAGTGCTGGGATCTCAGGAAG
GAGCCATGCATTCTGCCCTCGCTGGAGCCACAGAAGTGGACTCCGGTGATGGAAATCACATGTTTGCAGG
ACATCTTAAGTGCAAAGTCCGTATGGAGAAATTGAGAATCAAGGGAATGTCATACACGATGTGTTCAGGA
AAGTTTTCAATTGACAAAGAGATGGCAGAAACACAGCATGGGACAACAGTGGTGAAAGTCAAGTATGAAG
GTGCTGGAGCTCCGTGTAAAGTCCCCATAGAGATAAGAGATGTAAACAAGGAAAAAGTGGTTGGGCGTAT
CATCTCATCCACCCCTTTGGCTGAGAATACCAACAGTGTAACCAACATAGAATTAGAACCCCCCTTTGGG
GACAGCTACATAGTGATAGGTGTTGGAAACAGCGCATTAACACTCCATTGGTTCAGGAAAGGGAGTTCCA
TTGGCAAGATGTTTGAGTCCACATACAGAGGTGCAAAACGAATGGCCATTCTAGGTGAAACAGCTTGGGA
TTTTGGTTCCGTTGGTGGACTGTTCACATCATTGGGAAAGGCTGTGCACCAGGTTTTTGGAAGTGTGTAT
ACAACCATGTTTGGAGGAGTCTCATGGATGATTAGAATCCTAATTGGGTTCTTAGTGTTGTGGATTGGCA
CGAACTCGAGGAACACTTCAATGGCTATGACGTGCATAGCTGTTGGAGGAATCACTCTGTTTCTGGGCTT
CACAGTTCAAGCAGACATGGGTTGTGTGGCGTCATGGAGTGGGAAAGAATTGAAGTGTGGAAGCGGAATT
TTTGTGGTTGACAACGTGCACACTTGGACAGAACAGTACAAATTTCAACCAGAGTCCCCAGCGAGACTAG
CGTCTGCAATATTAAATGCCCACAAAGATGGGGTCTGTGGAATTAGATCAACCACGAGGCTGGAAAATGT
CATGTGGAAGCAAATAACCAACGAGCTAAACTATGTTCTCTGGGAAGGAGGACATGACCTCACTGTAGTG
GCTGGGGATGTGAAGGGGGTGTTGACCAAAGGCAAGAGAGCACTCACACCCCCAGTGAGTGATCTGAAAT
ATTCATGGAAGACATGGGGAAAAGCAAAAATCTTCACCCCAGAAGCAAGAAATAGCACATTTTTAATAGA
CGGACCAGACACCTCTGAATGCCCCAATGAACGAAGAGCATGGAACTCTCTTGAGGTGGAAGACTATGGA
TTTGGCATGTTCACGACCAACATATGGATGAAATTCCGAGAAGGAAGTTCAGAAGTGTGTGACCACAGGT
TAATGTCAGCTGCAATTAAAGATCAGAAAGCTGTGCATGCTGACATGGGTTATTGGATAGAGAGCTCAAA
AAACCAGACCTGGCAGATAGAGAAAGCATCTCTTATTGAAGTGAAAACATGTCTGTGGCCCAAGACCCAC
ACACTGTGGAGCAATGGAGTGCTGGAAAGCCAGATGCTCATTCCAAAATCATATGCGGGCCCTTTTTCAC
AGCACAATTACCGCCAGGGCTATGCCACGCAAACCGTGGGCCCATGGCACTTAGGCAAATTAGAGATAGA
CTTTGGAGAATGCCCCGGAACAACAGTCACAATTCAGGAGGATTGTGACCATAGAGGCCCATCTTTGAGG
ACCACCACTGCATCTGGAAAACTAGTCACGCAATGGTGCTGCCGCTCCTGCACGATGCCTCCCTTAAGGT
TCTTGGGAGAAGATGGGTGCTGGTATGGGATGGAGATTAGGCCCTTGAGTGAAAAAGAAGAGAACATGGT
CAAATCACAGGTGACGGCCGGACAGGGCACATCAGAAACTTTTTCTATGGGTCTGTTGTGCCTGACCTTG
TTTGTGGAAGAATGCTTGAGGAGAAGAGTCACTAGGAAACACATGATATTAGTTGTGGTGATCACTCTTT
GTGCTATCATCCTGGGAGGCCTCACATGGATGGACTTACTACGAGCCCTCATCATGTTGGGGGACACTAT
GTCTGGTAGAATAGGAGGACAGATCCACCTAGCCATCATGGCAGTGTTCAAGATGTCACCAGGATACGTG
CTGGGTGTGTTTTTAAGGAAACTCACTTCAAGAGAGACAGCACTAATGGTAATAGGAATGGCCATGACAA
CGGTGCTTTCAATTCCACATGACCTTATGGAACTCATTGATGGAATATCACTGGGACTAATTTTGCTAAA
AATAGTAACACAGTTTGACAACACCCAAGTGGGAACCTTAGCTCTTTCCTTGACTTTCATAAGATCAACA
ATGCCATTGGTCATGGCTTGGAGGACCATTATGGCTGTGTTGTTTGTGGTCACACTCATTCCTTTGTGCA
GGACAAGCTGTCTTCAAAAACAGTCTCATTGGGTAGAAATAACAGCACTCATCCTAGGAGCCCAAGCTCT
GCCAGTGTACCTAATGACTCTTATGAAAGGAGCCTCAAGAAGATCTTGGCCTCTTAACGAGGGCATAATG
GCTGTGGGTTTGGTTAGTCTCTTAGGAAGCGCTCTTTTAAAGAATGATGTCCCTTTAGCTGGCCCAATGG
TGGCAGGAGGCTTACTTCTGGCGGCTTACGTGATGAGTGGTAGCTCAGCAGATCTGTCACTAGAGAAGGC
CGCCAACGTGCAGTGGGATGAAATGGCAGACATAACAGGCTCAAGCCCAATCGTAGAAGTGAAGCAGGAT
GAAGATGGCTCTTTCTCCATACGGGACGTCGAGGAAACCAATATGATAACCCTTTTGGTGAAACTGGCAC
TGATAACAGTGTCAGGTCTCTACCCCTTGGCAATTCCAGTCACAATGACCTTATGGTACATGTGGCAAGT
GAAAACACAAAGATCAGGAGCCCTGTGGGACGTCCCCTCACCCGCTGCCACTAAAAAAGCCGCACTGTCT
GAAGGAGTGTACAGGATCATGCAAAGAGGGTTATTCGGGAAAACTCAGGTTGGAGTAGGGATACACATGG
AAGGTGTATTTCACACAATGTGGCATGTAACAAGAGGATCAGTGATCTGCCACGAGACTGGGAGATTGGA
GCCATCTTGGGCTGACGTCAGGAATGACATGATATCATACGGTGGGGGATGGAGGCTTGGAGACAAATGG
GACAAAGAAGAAGACGTTCAGGTCCTCGCCATAGAACCAGGAAAAAATCCTAAACATGTCCAAACGAAAC
CTGGCCTTTTCAAGACCCTAACTGGAGAAATTGGAGCAGTAACATTAGATTTCAAACCCGGAACGTCTGG
TTCTCCCATCATCAACAGGAAAGGAAAAGTCATCGGACTCTATGGAAATGGAGTAGTTACCAAATCAGGT
GATTACGTCAGTGCCATAACGCAAGCCGAAAGAATTGGAGAGCCAGATTATGAAGTGGATGAGGACATTT
TTCGAAAGAAAAGATTAACTATAATGGACTTACACCCCGGAGCTGGAAAGACAAAAAGAATTCTTCCATC
AATAGTGAGAGAAGCCTTAAAAAGGAGGCTACGAACTTTGATTTTAGCTCCCACGAGAGTGGTGGCGGCC
GAGATGGAAGAGGCCCTACGTGGACTGCCAATCCGTTATCAGACCCCAGCTGTGAAATCAGAACACACAG
GAAGAGAGATTGTAGACCTCATGTGTCATGCAACCTTCACAACAAGACTTTTGTCATCAACCAGGGTTCC
AAATTACAACCTTATAGTGATGGATGAAGCACATTTCACCGATCCTTCTAGTGTCGCGGCTAGAGGATAC
ATCTCGACCAGGGTGGAAATGGGAGAGGCAGCAGCCATCTTCATGACCGCAACCCCTCCCGGAGCGACAG
ATCCCTTTCCCCAGAGCAACAGCCCAATAGAAGACATCGAGAGGGAAATTCCGGAAAGGTCATGGAACAC
AGGGTTCGACTGGATAACAGACTACCAAGGGAAAACTGTGTGGTTTGTTCCCAGCATAAAAGCTGGAAAT
GACATTGCAAATTGTTTGAGAAAGTCGGGAAAGAAAGTTATCCAGTTGAGTAGGAAAACCTTTGATACAG
AGTATCCAAAAACGAAACTCACGGACTGGGACTTTGTGGTCACTACAGACATATCTGAAATGGGGGCCAA
TTTTAGAGCCGGGAGAGTGATAGACCCTAGAAGATGCCTCAAGCCAGTTATCCTACCAGATGGGCCAGAG
AGAGTCATTTTAGCAGGTCCTATTCCAGTGACTCCAGCAAGCGCTGCTCAGAGAAGAGGGCGAATAGGAA
GGAACCCAGCACAAGAAGACGACCAATACGTTTTCTCCGGAGACCCACTAAAAAATGATGAAGATCATGC
CCACTGGACAGAAGCAAAGATGCTGCTTGACAATATCTACACCCCAGAAGGGATCATTCCAACATTGTTT
GGTCCGGAAAGGGAAAAAACCCAAGCCATTGATGGAGAGTTTCGCCTCAGAGGGGAACAAAGGAAGACTT
TTGTGGAATTAATGAGGAGAGGAGACCTTCCGGTGTGGCTGAGCTATAAGGTAGCTTCTGCTGGCATTTC
TTACGAAGATCGGGAATGGTGCTTCACAGGGGAAAGAAATAACCAAATTTTAGAAGAAAACATGGAGGTT
GAAATTTGGACTAGAGAGGGAGAAAAGAAAAAGCTAAGGCCAAGATGGTTAGATGCACGTGTATACGCTG
ACCCCATGGCTTTGAAGGATTTCAAGGAGTTTGCCAGTGGAAGGAAGAGTATAACTCTCGACATCCTAAC
AGAGATTGCCAGTTTGCCAACTTACCTTTCCTCTAGGGCCAAGCTCGCCCTTGATAACATAGTCATGCTC
CACACAACAGAAAGAGGAGGGAGGGCCTATCAACACGCCCTGAACGAACTTCCGGAGTCACTGGAAACAC
TCATGCTTGTAGCTTTACTAGGTGCTATGACAGCAGGCATCTTCCTGTTTTTCATGCAAGGGAAAGGAAT
AGGGAAATTGTCAATGGGTTTGATAACCATTGCGGTGGCTAGTGGCTTGCTCTGGGTAGCAGAAATTCAA
CCCCAGTGGATAGCGGCCTCAATCATACTAGAGTTTTTTCTCATGGTACTGTTGATACCGGAACCAGAAA
AACAAAGGACCCCACAAGACAATCAATTGATCTACGTCATATTGACCATTCTCACCATCATTGGTCTAAT
AGCAGCCAACGAGATGGGGCTGATTGAAAAAACAAAAACGGATTTTGGGTTTTACCAGGTAAAAACAGAA
ACCACCATCCTCGATGTGGACTTGAGACCAGCTTCAGCATGGACGCTCTATGCAGTAGCCACCACAATTC
TGACTCCCATGCTGAGACACACCATAGAAAACACGTCGGCCAACCTATCTCTAGCAGCCATTGCCAACCA
GGCAGCCGTCCTAATGGGGCTTGGAAAAGGATGGCCGCTCCACAGAATGGACCTCGGTGTGCCGCTGTTA
GCAATGGGATGCTATTCTCAAGTGAACCCAACAACCTTGACAGCATCCTTAGTCATGCTTTTAGTCCATT
ATGCAATAATAGGCCCAGGATTGCAGGCAAAAGCCACAAGAGAGGCCCAGAAAAGGACAGCTGCTGGGAT
CATGAAAAATCCCACAGTGGACGGGATAACAGTAATAGATCTAGAACCAATATCCTATGACCCAAAATTT
GAAAAGCAATTAGGGCAGGTCATGCTACTAGTCTTGTGTGCTGGACAACTACTCTTGATGAGAACAACAT
GGGCTTTCTGTGAAGTCTTGACTTTGGCCACAGGACCAATCTTGACCTTGTGGGAGGGCAACCCGGGAAG
GTTTTGGAACACGACCATAGCCGTATCCACCGCCAACATTTTCAGGGGAAGTTACTTGGCGGGAGCTGGA
CTGGCTTTTTCACTCATAAAGAATGCACAAACCCCTAGGAGGGGAACTGGGACCACAGGAGAGACACTGG
GAGAGAAGTGGAAGAGACAGCTAAACTCATTAGACAGAAAAGAGTTTGAAGAGTATAAAAGAAGTGGAAT
ACTAGAAGTGGACAGGACTGAAGCCAAGTCTGCCCTGAAAGATGGGTCTAAAATCAAGCATGCAGTATCA
AGAGGGTCCAGTAAGATCAGATGGATTGTTGAGAGAGGGATGGTAAAGCCAAAAGGGAAAGTTGTAGATC
TTGGCTGTGGGAGAGGAGGATGGTCTTATTACATGGCGACACTCAAGAACGTGACTGAAGTGAAAGGGTA
TACAAAAGGAGGTCCAGGACATGAAGAACCGATTCCCATGGCTACTTATGGTTGGAATTTGGTCAAACTC
CATTCAGGGGTTGACGTGTTCTACAAACCCACAGAGCAAGTGGACACCCTGCTCTGTGATATTGGGGAGT
CATCTTCTAATCCAACAATAGAGGAAGGAAGAACATTAAGAGTTTTGAAGATGGTGGAGCCATGGCTCTC
TTCAAAACCTGAATTCTGCATCAAAGTCCTTAACCCCTACATGCCAACAGTCATAGAAGAGCTGGAGAAA
CTGCAGAGAAAACATGGTGGGAACCTTGTCAGATGCCCGCTGTCCAGGAACTCCACCCATGAGATGTATT
GGGTGTCAGGAGCGTCGGGAAACATTGTGAGCTCTGTGAACACAACATCAAAGATGTTGTTGAACAGGTT
CACAACAAGGCATAGGAAACCCACTTATGAGAAGGACGTAGATCTTGGGGCAGGAACGAGAAGTGTCTCC
ACTGAAACAGAAAAACCAGACATGACAATCATTGGGAGAAGGCTTCAGCGATTGCAAGAAGAGCACAAAG
AAACCTGGCATTATGATCAGGAAAACCCATACAGAACCTGGGCGTATCATGGAAGCTATGAAGCTCCTTC
GACAGGCTCTGCATCCTCCATGGTGAACGGGGTGGTAAAACTGCTAACAAAACCCTGGGATGTGATTCCA
ATGGTGACTCAGTTAGCCATGACAGATACAACCCCTTTTGGGCAACAAAGAGTGTTCAAAGAGAAGGTGG
ATACCAGAACACCACAACCAAAACCCGGTACACGAATGGTTATGACCACGACAGCCAATTGGCTGTGGGC
CCTCCTTGGAAAGAAGAAAAATCCCAGACTGTGCACAAGGGAAGAGTTCATCTCAAAAGTTAGATCAAAC
GCAGCCATAGGCGCAGTCTTTCAGGAAGAACAGGGATGGACATCAGCCAGTGAAGCTGTGAATGACAGCC
GGTTTTGGGAACTGGTTGACAAAGAAAGGGCCCTACACCAGGAAGGGAAATGTGAATCGTGTGTCTATAA
CATGATGGGAAAACGTGAGAAAAAGTTAGGAGAGTTTGGCAGAGCCAAGGGAAGCCGAGCAATCTGGTAC
ATGTGGCTGGGAGCGCGGTTTCTGGAATTTGAAGCCCTGGGTTTTTTGAATGAAGATCACTGGTTTGGCA
GAGAAAATTCATGGAGTGGAGTGGAAGGGGAAGGTCTGCACAGATTGGGATATATCCTGGAGGAGATAGA
CAAGAAGGATGGAGACCTAATGTATGCTGATGACACAGCAGGCTGGGACACAAGAATCACTGAGGATGAC
CTTCAAAATGAGGAACTGATCACGGAACAGATGGCTCCCCACCACAAGATCCTAGCCAAAGCCATTTTCA
AACTAACCTATCAAAACAAAGTGGTGAAAGTCCTCAGACCCACACCGCGGGGAGCGGTGATGGATATCAT
ATCCAGGAAAGACCAAAGAGGTAGTGGACAAGTTGGAACATATGGTTTGAACACATTCACCAACATGGAA
GTTCAACTCATCCGCCAAATGGAAGCTGAAGGAGTCATCACACAAGATGACATGCAGAACCCAAAAGGGT
TGAAAGAAAGAGTTGAGAAATGGCTGAAAGAGTGTGGTGTCGACAGGTTAAAGAGGATGGCAATCAGTGG
AGACGATTGCGTGGTGAAGCCCCTAGATGAGAGGTTTGGCACTTCCCTCCTCTTCTTGAACGACATGGGA
AAGGTGAGGAAAGACATTCCGCAGTGGGAACCATCTAAGGGATGGAAAAACTGGCAAGAGGTTCCTTTTT
GCTCCCACCACTTTCACAAGATCTTTATGAAGGATGGCCGCTCACTAGTTGTTCCATGTAGAAACCAGGA
TGAACTGATAGGGAGAGCCAGAATCTCGCAGGGAGCTGGATGGAGCTTAAGAGAAACAGCCTGCCTGGGC
AAAGCTTACGCCCAGATGTGGTCGCTTATGTACTTCCACAGAAGGGATCTGCGTTTAGCCTCCATGGCCA
TATGCTCAGCAGTTCCAACGGAATGGTTTCCAACAAGCAGAACAACATGGTCAATCCACGCTCATCACCA
GTGGATGACCACTGAAGATATGCTCAAAGTGTGGAACAGAGTGTGGATAGAAGACAACCCTAATATGACT
GACAAGACTCCAGTCCATTCGTGGGAAGATATACCTTACCTAGGGAAAAGAGAGGATTTGTGGTGTGGAT
CCCTGATTGGACTTTCTTCCAGAGCCACCTGGGCGAAGAACATTCATACGGCCATAACCCAGGTCAGGAA
CCTGATCGGAAAAGAGGAATACGTGGATTACATGCCAGTAATGAAAAGATACAGTGCTCCTTCAGAGAGT
GAAGGAGTTCTGTA
Protein Sequence
>NP_073286.1 polyprotein [Dengue virus type 4]
MNQRKKVVRPPFNMLKRERNRVSTPQGLVKRFSTGLFSGKGPLRMVLAFITFLRVLSIPPTAGILKRWGQ
LKKNKAIKILIGFRKEIGRMLNILNGRKRSTITLLCLIPTVMAFSLSTRDGEPLMIVAKHERGRPLLFKT
TEGINKCTLIAMDLGEMCEDTVTYKCPLLVNTEPEDIDCWCNLTSTWVMYGTCTQSGERRREKRSVALTP
HSGMGLETRAETWMSSEGAWKHAQRVESWILRNPGFALLAGFMAYMIGQTGIQRTVFFVLMMLVAPSYGM
RCVGVGNRDFVEGVSGGAWVDLVLEHGGCVTTMAQGKPTLDFELTKTTAKEVALLRTYCIEASISNITTA
TRCPTQGEPYLKEEQDQQYICRRDVVDRGWGNGCGLFGKGGVVTCAKFSCSGKITGNLVQIENLEYTVVV
TVHNGDTHAVGNDTSNHGVTAMITPRSPSVEVKLPDYGELTLDCEPRSGIDFNEMILMKMKKKTWLVHKQ
WFLDLPLPWTAGADTSEVHWNYKERMVTFKVPHAKRQDVTVLGSQEGAMHSALAGATEVDSGDGNHMFAG
HLKCKVRMEKLRIKGMSYTMCSGKFSIDKEMAETQHGTTVVKVKYEGAGAPCKVPIEIRDVNKEKVVGRI
ISSTPLAENTNSVTNIELEPPFGDSYIVIGVGNSALTLHWFRKGSSIGKMFESTYRGAKRMAILGETAWD
FGSVGGLFTSLGKAVHQVFGSVYTTMFGGVSWMIRILIGFLVLWIGTNSRNTSMAMTCIAVGGITLFLGF
TVQADMGCVASWSGKELKCGSGIFVVDNVHTWTEQYKFQPESPARLASAILNAHKDGVCGIRSTTRLENV
MWKQITNELNYVLWEGGHDLTVVAGDVKGVLTKGKRALTPPVSDLKYSWKTWGKAKIFTPEARNSTFLID
GPDTSECPNERRAWNSLEVEDYGFGMFTTNIWMKFREGSSEVCDHRLMSAAIKDQKAVHADMGYWIESSK
NQTWQIEKASLIEVKTCLWPKTHTLWSNGVLESQMLIPKSYAGPFSQHNYRQGYATQTVGPWHLGKLEID
FGECPGTTVTIQEDCDHRGPSLRTTTASGKLVTQWCCRSCTMPPLRFLGEDGCWYGMEIRPLSEKEENMV
KSQVTAGQGTSETFSMGLLCLTLFVEECLRRRVTRKHMILVVVITLCAIILGGLTWMDLLRALIMLGDTM
SGRIGGQIHLAIMAVFKMSPGYVLGVFLRKLTSRETALMVIGMAMTTVLSIPHDLMELIDGISLGLILLK
IVTQFDNTQVGTLALSLTFIRSTMPLVMAWRTIMAVLFVVTLIPLCRTSCLQKQSHWVEITALILGAQAL
PVYLMTLMKGASRRSWPLNEGIMAVGLVSLLGSALLKNDVPLAGPMVAGGLLLAAYVMSGSSADLSLEKA
ANVQWDEMADITGSSPIVEVKQDEDGSFSIRDVEETNMITLLVKLALITVSGLYPLAIPVTMTLWYMWQV
KTQRSGALWDVPSPAATKKAALSEGVYRIMQRGLFGKTQVGVGIHMEGVFHTMWHVTRGSVICHETGRLE
PSWADVRNDMISYGGGWRLGDKWDKEEDVQVLAIEPGKNPKHVQTKPGLFKTLTGEIGAVTLDFKPGTSG
SPIINRKGKVIGLYGNGVVTKSGDYVSAITQAERIGEPDYEVDEDIFRKKRLTIMDLHPGAGKTKRILPS
IVREALKRRLRTLILAPTRVVAAEMEEALRGLPIRYQTPAVKSEHTGREIVDLMCHATFTTRLLSSTRVP
NYNLIVMDEAHFTDPSSVAARGYISTRVEMGEAAAIFMTATPPGATDPFPQSNSPIEDIEREIPERSWNT
GFDWITDYQGKTVWFVPSIKAGNDIANCLRKSGKKVIQLSRKTFDTEYPKTKLTDWDFVVTTDISEMGAN
FRAGRVIDPRRCLKPVILPDGPERVILAGPIPVTPASAAQRRGRIGRNPAQEDDQYVFSGDPLKNDEDHA
HWTEAKMLLDNIYTPEGIIPTLFGPEREKTQAIDGEFRLRGEQRKTFVELMRRGDLPVWLSYKVASAGIS
YEDREWCFTGERNNQILEENMEVEIWTREGEKKKLRPRWLDARVYADPMALKDFKEFASGRKSITLDILT
EIASLPTYLSSRAKLALDNIVMLHTTERGGRAYQHALNELPESLETLMLVALLGAMTAGIFLFFMQGKGI
GKLSMGLITIAVASGLLWVAEIQPQWIAASIILEFFLMVLLIPEPEKQRTPQDNQLIYVILTILTIIGLI
AANEMGLIEKTKTDFGFYQVKTETTILDVDLRPASAWTLYAVATTILTPMLRHTIENTSANLSLAAIANQ
AAVLMGLGKGWPLHRMDLGVPLLAMGCYSQVNPTTLTASLVMLLVHYAIIGPGLQAKATREAQKRTAAGI
MKNPTVDGITVIDLEPISYDPKFEKQLGQVMLLVLCAGQLLLMRTTWAFCEVLTLATGPILTLWEGNPGR
FWNTTIAVSTANIFRGSYLAGAGLAFSLIKNAQTPRRGTGTTGETLGEKWKRQLNSLDRKEFEEYKRSGI
LEVDRTEAKSALKDGSKIKHAVSRGSSKIRWIVERGMVKPKGKVVDLGCGRGGWSYYMATLKNVTEVKGY
TKGGPGHEEPIPMATYGWNLVKLHSGVDVFYKPTEQVDTLLCDIGESSSNPTIEEGRTLRVLKMVEPWLS
SKPEFCIKVLNPYMPTVIEELEKLQRKHGGNLVRCPLSRNSTHEMYWVSGASGNIVSSVNTTSKMLLNRF
TTRHRKPTYEKDVDLGAGTRSVSTETEKPDMTIIGRRLQRLQEEHKETWHYDQENPYRTWAYHGSYEAPS
TGSASSMVNGVVKLLTKPWDVIPMVTQLAMTDTTPFGQQRVFKEKVDTRTPQPKPGTRMVMTTTANWLWA
LLGKKKNPRLCTREEFISKVRSNAAIGAVFQEEQGWTSASEAVNDSRFWELVDKERALHQEGKCESCVYN
MMGKREKKLGEFGRAKGSRAIWYMWLGARFLEFEALGFLNEDHWFGRENSWSGVEGEGLHRLGYILEEID
KKDGDLMYADDTAGWDTRITEDDLQNEELITEQMAPHHKILAKAIFKLTYQNKVVKVLRPTPRGAVMDII
SRKDQRGSGQVGTYGLNTFTNMEVQLIRQMEAEGVITQDDMQNPKGLKERVEKWLKECGVDRLKRMAISG
DDCVVKPLDERFGTSLLFLNDMGKVRKDIPQWEPSKGWKNWQEVPFCSHHFHKIFMKDGRSLVVPCRNQD
ELIGRARISQGAGWSLRETACLGKAYAQMWSLMYFHRRDLRLASMAICSAVPTEWFPTSRTTWSIHAHHQ
WMTTEDMLKVWNRVWIEDNPNMTDKTPVHSWEDIPYLGKREDLWCGSLIGLSSRATWAKNIHTAITQVRN
LIGKEEYVDYMPVMKRYSAPSESEGVL
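As a quick consistency check between the DNA and protein sequences above, the sketch below translates the first line of the DNA sequence in each of the three forward reading frames and reports which frame reproduces the start of the protein sequence. It uses the standard genetic code; the helper names are illustrative, and only the first 70 nucleotides are checked, not the whole record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str, frame: int = 0) -> str:
    """Translate complete codons of `dna`, starting at offset `frame` (0, 1 or 2)."""
    codons = (dna[i:i + 3] for i in range(frame, len(dna) - 2, 3))
    return "".join(CODON_TABLE[c] for c in codons)


def matching_frame(dna: str, protein_start: str):
    """Return the first forward frame whose translation begins with `protein_start`, or None."""
    for frame in range(3):
        if translate(dna, frame).startswith(protein_start):
            return frame
    return None


# First line of the DNA sequence and the first 23 residues of the protein, copied from the record.
dna_first_line = "AATGAACCAACGAAAAAAGGTGGTTAGACCACCTTTCAATATGCTGAAACGCGAGAGAAACCGCGTATCA"
protein_start = "MNQRKKVVRPPFNMLKRERNRVS"

frame = matching_frame(dna_first_line, protein_start)
print("matching frame offset:", frame)
```

A non-zero offset means the displayed nucleotide sequence starts before the ATG start codon, which is worth keeping in mind before translating the full sequence directly.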
Molecule Role: Protective antigen