VIOLIN Logo
VO Banner
Search: for Help
About
Introduction
Statistics
VIOLIN News
Your VIOLIN
Register or Login
Submission
Tutorial
Vaccine & Components
Vaxquery
Vaxgen
VBLAST
Protegen
VirmugenDB
DNAVaxDB
CanVaxKB
Vaxjo
Vaxvec
Vevax
Huvax
Cov19VaxKB
Host Responses
VaximmutorDB
VIGET
Vaxafe
Vaxar
Vaxism
Vaccine Literature
VO-SciMiner
Litesearch
Vaxmesh
Vaxlert
Vaccine Design
Vaxign2
Vaxign
Community Efforts
Vaccine Ontology
ICoVax 2012
ICoVax 2013
Advisory Committee
Vaccine Society
Vaxperts
VaxPub
VaxCom
VaxLaw
VaxMedia
VaxMeet
VaxFund
VaxCareer
Data Exchange
V-Utilities
VIOLINML
Help & Documents
Publications
Documents
FAQs
Links
Acknowledgements
Disclaimer
Contact Us
UM Logo

C4424

Gene Name C4424
Sequence Strain (Species/Organism) Escherichia coli CFT073
VO ID VO_0010990
NCBI Gene ID 1038067
NCBI Protein GI 26250246
Locus Tag c4424
Genbank Accession AE014075
Protein Accession NP_756286
Taxonomy ID 199310
Gene Starting Position 4205983
Gene Ending Position 4211319
Gene Strand (Orientation) +
Protein Name adhesin
Protein pI 4.25
Protein Weight 166959.01
Protein Length 1778
Protein Note Escherichia coli O157:H7 ortholog: z5029
DNA Sequence
>NC_004431.1:4205983-4211319 Escherichia coli CFT073, complete genome
AATGAACAAAATATTTAAAGTTATCTGGAATCCGGCAACAGGCAGTTACACCGTTGCCAGCGAAACGGCG
AAGAGCCGTGGTAAAAAAAGCGGGCGCAGTAAGCTGTTAATTTCTGCACTGGTTGCGGGTGGGTTGTTGT
CGTCGTTTGGGGCAAGTGCAGATAATTACACTGGGCAGCCAACTGATTATGGCGATGGCTCAGCAGGTGA
CGGCTGGGTTGCTATCGGTAAAGGGGCAAAAGCAAATACCTTTATGAACACTAGTGGCGCGAGTACAGCT
TTAGGATATGACGCGATAGCCGAAGGTGAGTACAGTTCTGCCATCGGGTCAAAAACCCTTGCAACTGGTG
GAGCATCCATGGCGTTCGGGGTTAGTGCAAAAGCAATGGGTGACAGAAGTGTCGCGCTAGGTGCATCGTC
AGTAGCAAATGGCGATCGTTCGATGGCTTTTGGTCGTTACGCAAAGACGAATGGTTTTACATCTCTTGCT
ATTGGGGACTCCTCCCTTGCCGATGGTGAAAAAACTATTGCGTTAGGAAATACGGCTAAAGCTTACGAAA
TTATGAGCATCGCCCTCGGTGATAATGCCAATGCGTCAAAAGAGTATGCAATGGCGCTGGGAGCAAGTAG
CAAAGCTGGCGGTGCTGATAGCCTCGCATTCGGCAGAAAATCTACAGCTAATAGCACTGGCTCACTGGCA
ATAGGTGCTGACAGTAGCAGTTCGAACGATAACGCCATCGCGATAGGGAACAAAACGCAAGCCCTGGGAG
TGAATTCGATGGCCCTGGGTAATGCAAGTCAGGCATCTGGCGAATCCAGTATTGCATTAGGTAACACCAG
TGAAGCCAGCGAACAAAATGCGATTGCGCTGGGGCAAGGTAGCATTGCAAGCAAAGTGAACTCAATCGCG
TTGGGAAGTAACAGTTTGTCCTCGGGAGAGAATGCCATCGCATTGGGAGAGGGTAGTGCCGCTGGTGGCA
GCAACAGCCTTGCTTTCGGTAGCCAGTCCAGGGCAAACGGCAATGATTCTGTCGCCATCGGTGTAGGGGC
TGCAGCAGCGACCGACAATTCTGTCGCTATCGGCGCAGGATCGACCACAGATGCAAGCAATACGGTTTCA
GTTGGCAACAGCGCAACAAAACGCAAAATTGTTAATATGGCTGCTGGTGCCATAAGCAACACCAGTACCG
ATGCCATCAACGGCTCACAGCTTTATACGATCAGTGATTCAGTCGCCAAGCGACTCGGAGGAGGCGCTAC
TGTAGGCAGCGATGGCACCGTAACCGCAGTAAGCTACGCGTTGAGAAGCGGAACCTATAATAACGTGGGT
GATGCTCTGTCAGGAATCGACAATAATACCCTACAATGGAATAAAACCGCGGGGGCGTTCAGCGCCAATC
ACGGTGCAAATGCCACCAACAAAATCACTAATGTTGCTAAAGGTACGGTTTCTGCAACCAGCACCGATGT
AGTAAACGGCTCTCAATTGTACGACCTGCAGCAGGATGCTCTGTTGTGGAACGGCACAGCATTCAGTGCC
GCACACGGCACCGAAGCCACCAGCAAAATCACTAACGTCACCGCTGGCAACCTGACTGCCGGCAGCACTG
ACGCCGTTAACGGCTCTCAGCTCAAAACCACCAACGACAACGTGACGACCAACACCACCAACATCGCCAC
TAACACCACCAATATCACCAACCTGACTGACGCTGTTAACGGTCTCGGTGACGACTCCCTGCTGTGGAAC
AAAGCAGCTGGCGCATTCAGCGCCGCGCACGGCACCGAAGCCACCAGCAAAATCACCAACGTCACCGCTG
GCAACCTGACTGCCGGTAGCACTGACGCCGTTAACGGCTCCCAGCTCAAAACCACCAACGACAACGTGAC
GACCAACACCACCAACATCGCCACTAACACCACCAATATCACCAACCTGACTGACGCTGTTAACGGTCTC
GGTGACGACTCCCTGCTGTGGAACAAAACAGCTGGCGCATTCAGCGCCGCGCACGGCACTGACGCCACCA
GCAAGATCACCAACGTCACCGCTGGCAACCTGACTGCCGGCAGCACTGACGCCGTTAACGGCTCCCAGCT
CAAAACCACCAACGACAACGTGACGACCAACACCACCAACATCGCCACTAACACCACCAATATCACCAAC
CTGACTGACGCTGTTAACGGTCTCGGTGACGACTCCCTGCTGTGGAACAAAACAGCTGGCGCATTCAGCG
CCGCGCACGGCACTGACGCCACCAGCAAGATCACCAATGTCAAAGCCGGTGACCTGACAGCTGGCAGCAC
TGACGCCGTTAACGGCTCTCAGCTCAAAACCACCAACGATAACGTGTCGACCAACACCACCAACATCACC
AACCTGACTGACGCTGTTAACGGTCTCGGTGACGACTCCCTGCTGTGGAACAAAACAGCTGGCGCATTCA
GCGCCGCTCACGGCACTGACGCCACCAGCAAGATCACCAATGTCAAAGCCGGTGACCTGACAGCTGGCAG
CACTGACGCCGTTAACGGCTCCCAGCTCAAAACCACCAACGATAACGTGTCGACCAACACCACCAACATC
ACTAACCTGACGGATTCCGTTGGCGACCTTAAGGACGATTCTCTGCTGTGGAACAAAGCGGCTGGCGCAT
TCAGCGCCGCGCACGGTACCGAAGCTACCAGCAAGATCACCAACTTACTGGCTGGCAAGATATCTTCTAA
CAGCACTGATGCCATTAATGGCTCACAACTTTATGGCGTAGCGGATTCATTTACGTCATATCTTGGTGGT
GGTGCTGATATCAGCGATACGGGTGTATTAAGTGGGCCAACCTACACTATTGGTGGTACTGACTACACTA
ACGTCGGTGATGCTCTGGCAGCCATTAACACATCATTTAGCACATCACTCGGCGACGCCCTACTTTGGGA
TGCAACCGCAGGCAAATTCAGCGCCAAACACGGCATTAATAATGCTCCCAGTGTAATCACTGATGTTGCA
AACGGTGCAGTCTCGTCCACCAGCAGCGACGCCATTAACGGTTCACAACTTTATGGTGTTAGTGACTACA
TTGCCGATGCTCTGGGCGGGAATGCTGTGGTGAACACTGACGGCAGTATCACTACACCAACTTATGCCAT
CGCTGGCGGCAGTTACAACAACGTCGGTGACGCGCTGGAAGCGATCGATACCACGCTGGATGATGCTCTG
CTGTGGGATACAACAGCCAATGGCGGTAACGGTGCATTTAGCGCCGCTCACGGGAAAGATAAAACTGCCA
GTGTAATCACTAACGTCGCTAACGGTGCAGTCTCTGCCACCAGCAACGATGCCATTAATGGCTCACAGCT
CTATAGCACTAATAAGTACATCGCTGATGCGCTGGGTGGTGATGCAGAAGTCAACGCTGACGGTACTATC
ACTGCACCGACTTACACCATTGCAAATACCGATTACAACAACGTCGGTGAAGCCCTGGATGCGCTCGATA
ATAACGCGCTGCTGTGGGATGAAGACGCAGGTGCCTACAACGCCAGCCATGATGGCAATGCCAGCAAAAT
CACCAACGTTGCGGCTGGTGATCTCTCCACAACCAGTACCGATGCTGTTAACGGTTCCCAGTTAAACGCA
ACCAATATTCTGGTTACGCAAAATAGCCAAATGATTAACCAGCTTGCTGGTAACACTAGCGAAACCTACA
TCGAGGAAAACGGTGCGGGTATTAACTATGTACGTACCAACGACAGCGGCTTAGCGTTCAACGATGCCAG
CGCTTCAGGTATTGGCGCTACAGCTGTAGGTTATAACGCAGTTGCCTCTCATGCCAGCAGTGTAGCCATC
GGTCAGGACAGCATCAGCGAAGTTGATACGGGTATCGCTCTGGGTAGCAGTTCCGTTTCCAGCCGTGTAA
TAGTTAAAGGGACTCGTAACACCAGCGTATCGGAAGAAGGTGTTGTGATTGGTTATGACACCACGGATGG
CGAACTGCTTGGCGCGTTGTCGATTGGTGATGACGGTAAATATCGTCAAATCATCAACGTCGCGGATGGT
TCTGAAGCCCATGATGCGGTCACTGTTCGCCAGTTGCAAAACGCCATTGGTGCAGTCGCAACCACACCAA
CCAAATACTATCACGCCAACTCAACGGCTGAAGACTCACTGGCAGTCGGTGAAGACTCGCTGGCAATGGG
CGCGAAAACCATCGTTAATGGTAATGCGGGTATTGGTATCGGCCTGAACACGCTGGTTCTGGCTGATGCG
ATCAACGGTATTGCTATCGGTTCTAACGCACGCGCAAATCATGCCGACAGCATTGCAATGGGTAATGGTT
CTCAGACTACCCGTGGTGCGCAGACCAACTACACTGCCTACAACATGGATGCACCGCAGAACTCTGTGGG
TGAGTTCTCTGTCGGCAGTGAAGACGGTCAACGTCAGATCACCAACGTCGCAGCAGGTTCGGCGGATACC
GATGCGGTTAACGTGGGTCAGTTGAAAGTAACGGACGCGCAGGTTTCCCAGAATACCCAGAGCATTACTA
ACCTGAACACTCAGGTCACTAATCTGGATACTCGCGTGACCAATATCGAAAACGGCATTGGCGATATCGT
AACCACCGGTAGCACTAAGTACTTCAAGACCAACACCGATGGCGCAGATGCCAACGCGCAGGGTAAAGAC
AGTGTTGCGATTGGTTCTGGTTCCATTGCTGCCGCTGACAACAGCGTCGCACTGGGCACGGGTTCCGTAG
CAGACGAAGAAAACACCATCTCTGTGGGTTCTTCTACCAACCAGCGTCGTATCACCAACGTTGCTGCCGG
TGTTAATGCCACCGATGCGGTTAACGTTTCGCAACTGAAGTCTTCTGAAGCAGGCGGCGTTCGCTACGAC
ACCAAAGCTGATGGCTCTATCGACTACAGCAACATCACTCTCGGTGGCGGCAATAGCGGTACGACTCGCA
TCAGCAACGTTTCTGCTGGCGTGAACAACAACGACGCAGTGAACTATGCGCAGTTGAAGCAAAGTGTGCA
GGAAACGAAGCAATACACCGATCAGCGCATGGTTGAGATGGATAACAAACTGTCCAAAACTGAAAGCAAG
CTGAGTGGTGGTATCGCTTCTGCAATGGCAATGACCGGTCTGCCGCAGGCTTACACGCCGGGTGCCAGCA
TGGCCTCTATTGGTGGCGGTACTTACAACGGTGAATCGGCTGTTGCTTTAGGTGTGTCGATGGTGAGCGC
CAATGGTCGTTGGGTCTACAAATTACAAGGTAGTACCAATAGCCAGGGTGAATACTCCGCCGCACTCGGT
GCCGGTATTCAGTGGTA

Protein Sequence
>NP_756286.1 adhesin [Escherichia coli CFT073]
MNKIFKVIWNPATGSYTVASETAKSRGKKSGRSKLLISALVAGGLLSSFGASADNYTGQPTDYGDGSAGD
GWVAIGKGAKANTFMNTSGASTALGYDAIAEGEYSSAIGSKTLATGGASMAFGVSAKAMGDRSVALGASS
VANGDRSMAFGRYAKTNGFTSLAIGDSSLADGEKTIALGNTAKAYEIMSIALGDNANASKEYAMALGASS
KAGGADSLAFGRKSTANSTGSLAIGADSSSSNDNAIAIGNKTQALGVNSMALGNASQASGESSIALGNTS
EASEQNAIALGQGSIASKVNSIALGSNSLSSGENAIALGEGSAAGGSNSLAFGSQSRANGNDSVAIGVGA
AAATDNSVAIGAGSTTDASNTVSVGNSATKRKIVNMAAGAISNTSTDAINGSQLYTISDSVAKRLGGGAT
VGSDGTVTAVSYALRSGTYNNVGDALSGIDNNTLQWNKTAGAFSANHGANATNKITNVAKGTVSATSTDV
VNGSQLYDLQQDALLWNGTAFSAAHGTEATSKITNVTAGNLTAGSTDAVNGSQLKTTNDNVTTNTTNIAT
NTTNITNLTDAVNGLGDDSLLWNKAAGAFSAAHGTEATSKITNVTAGNLTAGSTDAVNGSQLKTTNDNVT
TNTTNIATNTTNITNLTDAVNGLGDDSLLWNKTAGAFSAAHGTDATSKITNVTAGNLTAGSTDAVNGSQL
KTTNDNVTTNTTNIATNTTNITNLTDAVNGLGDDSLLWNKTAGAFSAAHGTDATSKITNVKAGDLTAGST
DAVNGSQLKTTNDNVSTNTTNITNLTDAVNGLGDDSLLWNKTAGAFSAAHGTDATSKITNVKAGDLTAGS
TDAVNGSQLKTTNDNVSTNTTNITNLTDSVGDLKDDSLLWNKAAGAFSAAHGTEATSKITNLLAGKISSN
STDAINGSQLYGVADSFTSYLGGGADISDTGVLSGPTYTIGGTDYTNVGDALAAINTSFSTSLGDALLWD
ATAGKFSAKHGINNAPSVITDVANGAVSSTSSDAINGSQLYGVSDYIADALGGNAVVNTDGSITTPTYAI
AGGSYNNVGDALEAIDTTLDDALLWDTTANGGNGAFSAAHGKDKTASVITNVANGAVSATSNDAINGSQL
YSTNKYIADALGGDAEVNADGTITAPTYTIANTDYNNVGEALDALDNNALLWDEDAGAYNASHDGNASKI
TNVAAGDLSTTSTDAVNGSQLNATNILVTQNSQMINQLAGNTSETYIEENGAGINYVRTNDSGLAFNDAS
ASGIGATAVGYNAVASHASSVAIGQDSISEVDTGIALGSSSVSSRVIVKGTRNTSVSEEGVVIGYDTTDG
ELLGALSIGDDGKYRQIINVADGSEAHDAVTVRQLQNAIGAVATTPTKYYHANSTAEDSLAVGEDSLAMG
AKTIVNGNAGIGIGLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVG
EFSVGSEDGQRQITNVAAGSADTDAVNVGQLKVTDAQVSQNTQSITNLNTQVTNLDTRVTNIENGIGDIV
TTGSTKYFKTNTDGADANAQGKDSVAIGSGSIAAADNSVALGTGSVADEENTISVGSSTNQRRITNVAAG
VNATDAVNVSQLKSSEAGGVRYDTKADGSIDYSNITLGGGNSGTTRISNVSAGVNNNDAVNYAQLKQSVQ
ETKQYTDQRMVEMDNKLSKTESKLSGGIASAMAMTGLPQAYTPGASMASIGGGTYNGESAVALGVSMVSA
NGRWVYKLQGSTNSQGEYSAALGAGIQW

Molecule Role Protective antigen
Molecule Role Annotation Active immunization of BALB/c mice with C4424 antigen in Freund's adjuvant protects mice from lethal challenge with ExPEC strain S26 (Durant et al., 2007).
Related Vaccines(s) E. coli C4424 protein vaccine
References