SGP |
General Information |
Protegen ID |
1651 |
Sequence Strain (Species/Organism) |
Sudan ebolavirus strain Boniface |
Taxonomy ID |
186540
|
Other Database IDs |
CDD:279888 CDD:197367 |
Molecule Role |
Protective antigen |
References |
|
Gene Information |
Gene Name |
SGP |
NCBI Nucleotide GI |
1041223
|
DNA Sequence |
>gi|1041223|gb|U28134.1|EVU28134 Sudan Ebola virus strain Boniface virion spike glycoprotein (SP) gene, complete cds, and small/secreted glycoprotein precursor (SGP) gene, complete cds
ATTTGATGAAGATTAAGCCTGATTAAGGCCCAACCTTCATCTTTTTACCATAATCTTGTTCTCAATACCA
TTTAATAGGGGTATACTTGCCAAAGCGCCCCCATCCTCAGGATCTCGCAATGGAGGGTCTTAGCCTACTC
CAATTGCCCAGAGATAAATTTCGAAAAAGCTCTTTCTTTGTTTGGGTCATCATCTTATTTCAAAAGGCCT
TTTCCATGCCTTTGGGTGTTGTGACCAACAGCACTTTAGAAGTAACAGAGATTGACCAGCTAGTCTGCAA
GGATCATCTTGCATCAACTGACCAGCTGAAATCAGTTGGTCTCAACCTCGAGGGGAGCGGAGTATCTACT
GATATCCCATCTGCGACAAAGCGTTGGGGCTTCAGATCTGGTGTGCCTCCCCAAGTGGTCAGCTATGAAG
CAGGAGAATGGGCTGAAAATTGCTACAATCTTGAAATAAAGAAACCGGACGGGAGCGAATGCTTACCCCC
ACCGCCGGATGGTGTCAGAGGCTTTCCAAGGTGCCGCTATGTTCACAAAGCCCAAGGAACCGGGCCCTGC
CCGGGTGACTATGCCTTTCACAAGGATGGAGCTTTCTTCCTCTATGACAGGCTGGCTTCAACTGTAATTT
ACAGAGGAGTCAATTTTGCTGAGGGGGTAATCGCATTCTTGATATTGGCTAAACCAAAGGAAACGTTCCT
TCAATCACCCCCCATTCGAGAGGCAGCAAACTACACTGAAAATACATCAAGTTACTATGCCACATCCTAC
TTGGAGTACGAAATCGAAAATTTTGGTGCTCAACACTCCACGACCCTTTTCAAAATTAACAATAATACTT
TTGTTCTTCTGGACAGGCCCCACACGCCTCAGTTCCTTTTCCAGCTGAATGATACCATTCAACTTCACCA
ACAGTTGAGCAACACAACTGGGAAACTAATTTGGACACTAGATGCTAATATCAATGCTGATATTGGTGAA
TGGGCTTTTTGGGAAAATAAAAAAATCTCTCCGAACAACTACGTGGAGAAGAGCTGTCTTTCGAAACTTT
ATCGCTCAACGAGACAGAAGACGATGATGCGACATCGTCGAGAACTACAAAGGGAAGAATCTCCGACCGG
GCCACCAGGAAGTATTCGGACCTGGTTCCAAAGGATTCCCCTGGGATGGTTTCATTGCACGTACCAGAAG
GGGAAACAACATTGCCGTCTCAGAATTCGACAGAAGGTCGAAGAGTAGATGTGAATACTCAGGAAACTAT
CACAGAGACAACTGCAACAATCATAGGCACTAACGGTAACAACATGCAGATCTCCACCATCGGGACAGGA
CTGAGCTCCAGCCAAATCCTGAGTTCCTCACCGACCATGGCACCAAGCCCTGAGACTCAGACCTCCACAA
CCTACACACCAAAACTACCAGTGATGACCACCGAGGAATCAACAACACCACCGAGAAACTCTCCTGGCTC
AACAACAGAAGCACCCACTCTCACCACCCCAGAGAATATAACAACAGCGGTTAAAACTGTTTGGCCACAA
GAGTCCACAAGCAACGGTCTAATAACTTCAACAGTAACAGGGATTCTTGGGAGCCTTGGACTTCGAAAAC
GCAGCAGAAGACAAGTTAACACCAGGGCCACGGGTAAATGCAATCCCAACTTACACTACTGGACTGCACA
AGAACAACATAATGCTGCTGGGATTGCCTGGATCCCGTACTTTGGACCGGGTGCAGAAGGCATATACACT
GAAGGCCTTATGCACAACCAAAATGCCTTAGTCTGTGGACTCAGACAACTTGCAAATGAAACAACTCAAG
CTCTGCAGCTTTTCTTAAGGGCCACGACGGAGCTGCGGACATATACCATACTCAATAGGAAGGCCATAGA
TTTCCTTCTGCGACGATGGGGCGGGACATGTAGGATCCTGGGACCAGATTGTTGCATTGAGCCACATGAT
TGGACCAAAAACATCACTGATAAAATCAACCAAATCATCCATGATTTCATCGACAACCCTTTACCCAATC
AGGATAATGATGATAATTGGTGGACGGGCTGGAGACAGTGGATCCCTGCAGGAATAGGCATTACTGGAAT
TATTATTGCAATCATTGCTCTTCTTTGCGTCTGCAAGCTGCTTTGTTGAATATCAACTTGAATCATTAAT
TTAAAGTTGATACATTTCTAACATTATAAATTATAATCTGATATTAATACTTGAAAATAAGGCTAATGCC
AAATTCTGTGCCAAACTTGAAAGTAGGTTTACCAAAATCCTTTGAACTGGAATGCTTTAATGCTCTTTCT
CAATACTATATAAGTTCCTTCCCAAAATAATATTGATGAAGATTAAGAAAAA
|
Protein Information |
Protein Name |
virion spike glycoprotein precursor |
NCBI Protein GI |
1041225
|
Protein Accession |
AAB37096.1 |
Protein pI |
5.67 |
Protein Weight |
72335.57 |
Protein Length |
754 |
Protein Note |
subtype: Sudan |
Protein Sequence |
>AAB37096.1 virion spike glycoprotein precursor [Sudan ebolavirus]
MEGLSLLQLPRDKFRKSSFFVWVIILFQKAFSMPLGVVTNSTLEVTEIDQLVCKDHLASTDQLKSVGLNL
EGSGVSTDIPSATKRWGFRSGVPPQVVSYEAGEWAENCYNLEIKKPDGSECLPPPPDGVRGFPRCRYVHK
AQGTGPCPGDYAFHKDGAFFLYDRLASTVIYRGVNFAEGVIAFLILAKPKETFLQSPPIREAANYTENTS
SYYATSYLEYEIENFGAQHSTTLFKINNNTFVLLDRPHTPQFLFQLNDTIQLHQQLSNTTGKLIWTLDAN
INADIGEWAFWENKKNLSEQLRGEELSFETLSLNETEDDDATSSRTTKGRISDRATRKYSDLVPKDSPGM
VSLHVPEGETTLPSQNSTEGRRVDVNTQETITETTATIIGTNGNNMQISTIGTGLSSSQILSSSPTMAPS
PETQTSTTYTPKLPVMTTEESTTPPRNSPGSTTEAPTLTTPENITTAVKTVWPQESTSNGLITSTVTGIL
GSLGLRKRSRRQVNTRATGKCNPNLHYWTAQEQHNAAGIAWIPYFGPGAEGIYTEGLMHNQNALVCGLRQ
LANETTQALQLFLRATTELRTYTILNRKAIDFLLRRWGGTCRILGPDCCIEPHDWTKNITDKINQIIHDF
IDNPLPNQDNDDNWWTGWRQWIPAGIGITGIIIAIIALLCVCKLLC
|
Vaxign Prediction |
Localization(Probability) |
(Prob.=0) |
Adhesin Probability |
0.616 |
Trans-membrane Helices |
1 |
Detailed Vaxign Results |
Vaxign Results |
Epitope Information |
IEDB Linear Epitope |
|
IEDB ID |
Epitope |
Starting position |
Ending position |
47064 |
PDCCIEPHDWTKNIT |
606 |
621 |
147815 |
IHDFIDNPLPNQDNDD |
627 |
643 |
478550 |
GEWAF |
286 |
291 |
739466 |
LFLRATTELRT |
571 |
582 |
769763 |
TDKINQIIHDFIDNPL |
620 |
636 |
832628 |
ENCYNLEIKKPDGSEC |
106 |
122 |
833854 |
LHVPEGETT |
353 |
362 |
852598 |
GAFFLYDRLAST |
157 |
169 |
858649 |
LEIKKPDGSE |
111 |
121 |
|
|
MEGLSLLQLPRDKFRKSSFFVWVIILFQKAFSMPLGVVTNSTLEVTEIDQLVCKDHLASTDQLKSVGLNLEGSGVSTDIPSATKRWGFRSGVPPQVVSYEAGEWAENCYNLEIKKPDGSECLPPPPDGVRGFPRCRYVHKAQGTGPCPGDYAFHKDGAFFLYDRLASTVIYRGVNFAEGVIAFLILAKPKETFLQSPPIREAANYTENTSSYYATSYLEYEIENFGAQHSTTLFKINNNTFVLLDRPHTPQFLFQLNDTIQLHQQLSNTTGKLIWTLDANINADIGEWAFWENKKNLSEQLRGEELSFETLSLNETEDDDATSSRTTKGRISDRATRKYSDLVPKDSPGMVSLHVPEGETTLPSQNSTEGRRVDVNTQETITETTATIIGTNGNNMQISTIGTGLSSSQILSSSPTMAPSPETQTSTTYTPKLPVMTTEESTTPPRNSPGSTTEAPTLTTPENITTAVKTVWPQESTSNGLITSTVTGILGSLGLRKRSRRQVNTRATGKCNPNLHYWTAQEQHNAAGIAWIPYFGPGAEGIYTEGLMHNQNALVCGLRQLANETTQALQLFLRATTELRTYTILNRKAIDFLLRRWGGTCRILGPDCCIEPHDWTKNITDKINQIIHDFIDNPLPNQDNDDNWWTGWRQWIPAGIGITGIIIAIIALLCVCKLLC
|