|
|
| Record Information |
| Record ID |
144 |
| Pubmed ID |
32544459 |
| SNP |
|
| Gene ID |
6927 |
| Protein ID |
P20823 |
| Vaccine Adverse Events |
|
| Vaccine |
BCG Vaccination |
| Gene Name |
HNF1A |
| Official Symbol |
HNF1 homeobox A [Homo sapiens (human)] |
| Aliases |
HNF-1-alpha, HNF-1A, HNF1, HNF1alpha, HNF4A, IDDM20, LFB1, MODY3, TCF-1, TCF1 |
| Other Designations |
hepatocyte nuclear factor 1-alpha; albumin proximal factor; hepatic nuclear factor 1; interferon production regulator factor; liver-specific transcription factor LF-B1; transcription factor 1, hepatic |
| Chromosome |
12 |
| Location |
12q24.31 |
| Annotation |
Chromosome 12 NC_000012.12 (120978543..121002512) |
| MIM |
142410 |
| DNA Sequence |
>NC_000012.12:120978543-121002512 Homo sapiens chromosome 12, GRCh38.p14 Primary Assembly
AGCTCCAATGTAAACAGAACAGGCAGGGGCCCTGATTCACGGGCCGCTGGGGCCAGGGTTGGGGGTTGGG
GGTGCCCACAGGGCTTGGCTAGTGGGGTTTTGGGGGGGCAGTGGGTGCAAGGAGTTTGGTTTGTGTCTGC
CGGCCGGCAGGCAAACGCAACCCACGCGGTGGGGGAGGCGGCTAGCGTGGTGGACCCGGGCCGCGTGGCC
CTGTGGCAGCCGAGCCATGGTTTCTAAACTGAGCCAGCTGCAGACGGAGCTCCTGGCGGCCCTGCTCGAG
TCAGGGCTGAGCAAAGAGGCACTGATCCAGGCACTGGGTGAGCCGGGGCCCTACCTCCTGGCTGGAGAAG
GCCCCCTGGACAAGGGGGAGTCCTGCGGCGGCGGTCGAGGGGAGCTGGCTGAGCTGCCCAATGGGCTGGG
GGAGACTCGGGGCTCCGAGGACGAGACGGACGACGATGGGGAAGACTTCACGCCACCCATCCTCAAAGAG
CTGGAGAACCTCAGCCCTGAGGAGGCGGCCCACCAGAAAGCCGTGGTGGAGACCCTTCTGCAGTAAGGAG
CCCTGCCCCGTCCCCGCTCCCAGGAGAGCCTAGAGGGGCCCCCCTCAGCTCCTAACGAGCCCCCCTTCTG
AGTTGAGTCCCCATGACCTTCAGCCTTTAGCCTAGTTGCTGGGAAGGGGGACAGGGCCCATGAGAGCCCA
GGGGTCCTTGCTTGGAGGTTTGAGCCTCCAGCCCCTGAACTGCTCCTCTGCAGAGTCCCAAATCCCATGA
GCCCAGGCCTTTAGCCCAGTCCTTGGGCAAGGGGGACATTTCCCAGGGGGGTCCAAGATGGGAGAAAAAG
CAGGTGAGTTCACAACTCAAATGCCTGGAGAACTGGGAGGTGGTGGAAATTTTTAGCCACTCTGATCATT
TATACTCTCCAGACCCTAACTCCTGCACTGAGTCCTCAGTGGGTAGGCTCCGGGGCCCAGGAAGCACTGC
CAGGGAGGAGGGCCCGAAGCTGGGGTTTGGGGGCTTTCCTGGCCTCCTAGGGACTCATGCGTTTAGCCAG
GGAAGCCCAGGTCTTTCTTACCTGAGGCAGACAGAGCCTCGAGGTGGGAGCTGCTCCCCTTTCCTGTATG
GCCACAGGCACCCCACCTCACCAGCAGGCGCCATTAGAGGCTGCCCGTTCTACATCCCCATCCGCTGGCG
GACTCCCCGTCTCCTGGAGAACTGGGTGGGTCCTGAGCTCGATGCCTCCCCTCTCCTCAGCAGAGCTGAC
CTAGTTACTAATTACCCATGTGCTTTATTATTTCTCTTTTGTTTTATAAATTTATAAATGGCAAAAGGCT
CTGATAAGGTCTGTGATCATTAACTTGTAGGAATGGTGGCCTGGAAAGATCTGGGCCAACCCACCTCCCA
CCTAGTGGGGGTGGGGGATGGGGGTGTGGGGGAGAAGGAAGGATGGCCTGGGGGCTACAGGGGCCACCCC
TCCCCAGCGGCTCCCTGACCTCCACCCTACAGCGCCTGCTTGAGGAGGTTCACTACTTCCTGGGCGCTGT
GCTGAGGTTTCACAGGAATAATTTCCCTTCATGGTGAGGAGTCCTGGGCAGAAGCTATTGGTGACTCTGC
TTCACAAATGGGGAAACTGGGGCTTAGAGCCGGGAAGGAGCTAGACTAAGTTCACCAAACTGAGGCCCCA
AACCTACTCTGTTCGGTGCTCTCAATTGCTATGACCGGGGACCAGAGGCCTGCAGGGGGTGGCCAGGGAG
TCAGAGAGAGACCGAGCCCCAAAGATGCTCCAAAGGGGCCCATCTCTTTTCTGTCCAAGGTCCTGATCCC
AGCTGGGGTGGGGGAGGGCTGGTGGATTCCTGCTTTCCCAGAGGCCTCAGGGAGCCTCCCGAGGGCCTGG
ACAGGGGGCAGGGGAGCAGTCAGGGCCCCCTGACTTGGCAGGAAGAGGAGGGCAAAGGACTCCAGAGCTG
CAGGGGAGGGGCAAAGAGTCAGGCATCCCAGACCCAGGGCGAAGGCAACCTGACGGGGTGGGGCTGGGCC
TGACCTGCCCAGGGCCCCAGATGGGACTGGGGGCTTTGGGGTGAGGGTGGGGGCAGGGGATGCTTTCTTA
CCTGGCTGAGTGAGCTCTCTCGGTCAGCAGCCCCCTTTGGTGGGATGGTGGGGTTGGGGAGGCTTGATCC
ACAGCATTTGAAGGGGAAGCAGAGGTCAAAGTGCTTCCTAGGGACCAGCAGAGACCTGGAAGCTGAGGCA
GAGAGTGTAGAGAGAGGCCATTGGGGGAGATGGACAGAGAAGCTGGGAGAAGCAGAACGGAGGAGCCAGG
GGGGCGGGGGCTCAGACCCAGCTGGGAGAAGGGGCACGGGAGAGAATGAGGCGCCCCAAGCTTGCAAGGA
GGCTGGAGTGCTTCCCGCCTGCCCCGATTGGGTTTTCTTTAATGCTAACAGCATGCTATTTACTTTCCAT
TTAAATTTGAGATGTTGCTATAAATTATCAACCAGCTCCTTGTTCCTGCAGAGTTTATAACTAACTACCT
GGGTTACTTATTGTTCAGGTAACAAAAGGGATCGGAAAACGCCCTGAGTGAAAAAAGTGGGGCCTCCAGC
GTCAGGGTCAGGAAAGGAGCCAGGGAGAGAGGGGCGGGGGACCCCTATTGAAGGCCTGGGCCATTGGGGA
GGTTGAGGCTGGGAAGACGGTACAGAGGCAGAATGTCTAGTAGAAGCTGTTTCCCGGGAGAAGTCAGTGT
CTGGGAAGAAGCCGGGGTGGGTCCTGGGTCCCAGGCCTCCCTGGGGTGCCCCATGTTGGTGGCCACAGCA
GGAACCTGAGCTCTTTCTTCAGATCTCCCCCTTAGCTCCCTGAGTGACCTTGGGAAAGTCTGTTCCTCTC
CTTCTGCCTCAGTTTCCCTCCCTGGCCAAGGAATAAAGTCCAAACTCCCTGCCTTGGTTTCGTTCCCCTC
CCTCCACCCCACCAAATTCCTGGATCTCTTCCCAGGTCTTTCTCTTCCTCTCGGCCTTTGCTGCAGTCAT
TTTGGCTTTGCTGTTCCTCAGTACCCCAAGCCAATTCCCACCCCAGGGCCTTTGCACGTGCTCTTCCTGC
TGCTCAGAACCTCTTTCCAGATCTTCCCATGGCCGATTCCTTCTCCTTCAGCTCCCTGATTGCTCCATCC
AAAAGGGGGCCCCACCCCCGTCACTCTCTTCCACTACTGTTGTTCACTTTCTCTGGTTCCCTGTCAAGAC
TTGTGATCATCTGATTTGTTTGTTTTCTATCTCCTGCCATGAAAATAGAAGCTCTTCAAGGACAAGGCCC
GTGTTGCCTTATTTACCACTTGTACCCACATCCTGCCCAGTGGCTGATGCACAGCAGGAATGAGTGAGTG
ACTGGGATTGTCTGAGGCCCTGAAGATATCTGCTGCCCTGTTGGTCAGGGCCCAGCAGCCTGAGACGTGG
CCAAGGGAGAAACTGGGACCCAGAGTTCCTTCCTGGGTGTCCTCCCGGGTCCTTTCTGTCCCTGATCAGC
TCAAACCCCAGCGTGTTTCTTTTTTGTTGTTGTTGTTGTTATTTTTGAGACGGAGTCTCACTCTGTCGGC
CCGGCTGGAGTGCATGGCGCGATCTCGGCTCACTGCAAGCTCTGCCTCCTGGGTTCACGCCATTCTCCTG
CCTCAGCCTCCCGAGTAGCTGGGATTACAGGCATGCACCACCACGCCTGGCTGATTTTTTGTATTTTAGT
AGAGATGGGGTTTCACTATGTTGGCCAGGCTGGTCTCAAACTCCTGACCTCAGGTGATCCGCCCGCCTCG
GCCTCCCAAAGTGCTGGGATTACAGGCACTCAGAGCCACTGCCCTCAGCCTTCAAACCCCAATCTTGAAG
TATTCTGATCCTGCCCATCAGGCAGCATCAGGGCACTGAGACTCAGGAAAACAGTAGCCACGGCAGGAGG
GGGGGGGGACAGAGGATCTGACGTCCCCAACAAAAGCCAGACTGCCTGCATCCTAATCCTAGCTCTGCCC
CTTTCTAGCTTGAGCAGGTGATTTTATACTTCTGAGCCTCAGTTTCGTCTGCAGGGTAGAGATGACAATA
GCCCTTGTCCACAGGGTTAAATGGCATGGCACTTGTAGAGTAAACATGTAGTAACTGCAATATACAGCAG
TTATTATTAGTGATCCCATTTTACAGATGAGCAAATTAAGGCTGAGAGAGGCAGTGCCTTACCGAACGAA
ACTCTTCCAAATGACTGAAGGCAAGCCAGGACTTGGACCTGGGTCCTCCTACCTCACGCTCCATACAGCA
ACCTCGAGGTCACACTTTCCCCTCCCCAGTCATTCCTGCTGTTTAAGTTGACCATCAACTCACTTCCATA
CAGTGCTTTGGAGTTCACAAAGGGCTTACAAAATGCGCTAAGCTCATTTTTTCCTCCTCCCCATCTCTGA
AGTCTGAAATATCATTAGCCCCATTTCACAGATGCGGAAACTGAGGCACACGGAGGTTAAGCAAAGAATC
AAGGCCAGGTCTAAAGCCCCCACACTGAGAAGTCAGCTCCCCTCCAGATGTGGAGATGAATACCACAACC
TTGCATGCCCTGAAACCCCAGGGGCCAGAGGGCAAGAGGCTCAAAGCCTTGGACATCCGATTGACCCTGG
AAAGGGTAGGAGAGAGAGGAGGAAGATGTGGGCGGGTCTTCACTTCTCCTGGGCTCTGCTGTGTGGGGCC
AAGGGTCTCCAGGGCTGCATTGTTAAGATGATGATGATGACAATATTATTCCACAAATTCAGCACTTACT
GTGCACCAGGCATTGCCCTAAGTTCATCCTCCCTATTTGAAGCTCATTTAATTTTCCCAGTAGCAGTGTG
AGGTAGGGGTGGATTTACCCCACGTTACAGGGCAGATGTGGGTCATGCAAGAGCACGCGTGGCTAGGTCA
CAGTGGTGGGATTTGAACCCACATTGGTTTGACTCTCAAACCCATTAGAGTCAAAGCAGGAGATGCCTGG
AGACTTGTCCTCTTAGTTCTTTCCTTTTTCTTTCTTTCTTTTTTTTTTTTTGAGATGGAGTCTCACTCTG
TTGCTGGGGGTGGAGTGCAGTGGCGCAGTCTTGGCTCATTGCAACCTCCGCCTCCCGGGTTCAAGTGATC
CTCCTGCCTCAGCCTCCTGAGTAGCTGGGACCACAGGCATACGACACCACACCCAGCTAATTTTTGTATT
TTTAGTAGAGACAGGGTTTCACTGTGTTGGCCAGGGTGGTCTCTAACTCCTGACCTCATGTGATCCACCC
GCCTCGGCCTCCCAAAGTGTGGGGATTACAGGCATGAGCCACCACGCCCAGCCTTGGCCTCTTAGTTCTG
AGCTAGGACAGTTGGAGAATCTGCCCTCAAAAATAGTCAGGCCCTTGAAGACTCTGTGTGTGTGTGTGTG
TGTGTGTGTGTGTGTGTTTGTGTAAGAGACACACAGAGAGACAGAAGAGAGAGAGTGATAGCAAGGGAGT
GGGGGGTAGGGAGAGAGAGACATCGATTCAGTAGGTTCTAGAAAAAAGTGGCACAGGTGGCTTCTCCCAG
AAAAGGGAAAGACATGCCCTCTTGGTTCTCTGACTCCTAACTGGCATTTAAAATTCACTTTTTTGTTGTG
GATTTTTTGCCCAACTCTACTAGGAATTGTTTAAAAGTGGAACGTTCGTATCAGCTCACTTTAACAGTTT
GCTTTCCATTTTCACATTCAAAAGCCTGTCTTCGGTTTCTAAGGGTTGCACATGTATTTGTGCAGGGTGG
CTGGCCTTTGTTTTATTTTCAAGTTCAGCTTAAATTTGACCTGGCATGAGAGAATGAGAATGAGAGAGAG
AGAGAGAGAAAGAAACAGACAGACATGGAATCAGATGGACAGGGGCACATAGAAAATCTTTGGTTTGAGC
CCAGGAGAGGCTCTCAAGAGGTGAGAGGAAGACTGACCCTATTCTCTTTGTGGGAGAGAGAAGGGAAGGA
GGGAGGAAGGGAGGACACAGGAGGACATGGCACAGGAGGGTGATGGCTGCAGATGGGGGAGGGGAGGCAT
CAGGCAGCCTGGCTCTAGGATCAGATAAACATGTTTGCCACAGAGCACCAGGCTGGACTTTGTACTGCCC
TTCCTTCTGAGCTCTGACTGGCACTCAGCAAAGCAGGGAGATGAAGGAGAGAGGATCCTAGGAGCCCCGG
AAAGAAGGGAGAGAGAGAAGCTGGGGAGAGGCCTGCGGGGGTGGGGTGGTCCTGGCAACATGAAATATTT
TTGAGTCCCAACTAGATCACCTTGCACAAGTGACTGCACCTTTCTGAGCCTCAGTTTTCTCCTCTGCAAA
ATGGGCACACCGTCTACTTCCTAGGGTTATAAGATGTAAGTGGCCAGTCCTCAGCACGGTGCCTGGCTCA
CAGTAAATCCTCAAGAAACATGAGTTCTCAGACTTTTTTTTTTTTTTTTTTGAGACAGGATTTCACTCTG
TCGCCAAGGCTGGAGTGCAGTGGCGCGATCACAGCTCATTGCAGCCCCGACCTCTCGGGCTCAAGAGATC
CTCTTGTCTCAGCCTTCTAAGTAGCTGGGACTACAGGCTTGTGCCACCACACTTAGCTAATTTTTCTATT
TTTTGTAGAAACAGGGTCCCACTATGTTGTCCAGGCTGGTCTCAAACTCCTGGGCCCAAGGAATCCTCCC
ACCTTAGCCTCCTGAGGAGCTGGGACTACAGGTGTGCACCACCATGCCTGGCTAATTTATTTTTTTTTTT
ATAGAGACAGAGTCTCACTATGTTGTCCAGGCTGGTCTTGAACTGCAGGGCTCAAGTGATCCTCCCACCT
CGGCCTTTCAAAGAGCTGGGATTACCACCATGAGCCACCATACCCAGTTTACAATTCAGTTTTAAATATT
ATATATGCCAAGGTGGGAGGATCGCTTGAGCCCAGGAGTTTGAGACCAGCCTGGCCAACATAGCAAGACC
CTGTCTCTATGAAAAAAAAAAAATAGAAAAAAGAAGAAAAAGTACAAAAATTAGCTGGGCATGTTGGCAT
GTGCCTGTGGTCCCAAGTACTTGGGAGTCCGAGGTGGGAGAACCAATTGAGCCCAGGAGTTTGAGGCTAC
AGTGAGCTATGATCACGCCACTGTGCTCCAGCCTGGGAGACAGAGCAAGTAGCCTCTAAAAAGAAAAATA
AATAAATAAAAATAAATATTGGCTGGGCACGGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCAA
GGCAGATGGATCACCTGAAGTCGGGAGTTTGAGACCAGCCTGGCCAACATGGTGAAACCCTGTCTCTACT
AAAAATATAAAAATTAGCTGGGTGTGGTAGCGCACACCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAG
GAGAATCGCTTGAACCTGGGAATCAGAGGTTGCAGTGAGCTGAGCTCGCGCCACTGCACTCCAGCCTGGC
CACAGAGCAAGGCTTTGTCTCAAAAAAAAAAACTAAATAAAAATAAATATCATGTATGCATATATAGTAT
ATATTATGTAATTTACAAAGTACACAAGGACAACAGAGAAAACTGTCCTTCTCACTCCCATCCCCAGCAC
CCCGTTCTCCTCTGGAAGGAACCAGAGTCCCCAGTTCCTTGCGTGCCCTCCGGTATTTTACACGTGCACA
AATATGTAGAAATCCTCTTTTACCTCACTTCAAATTATTCATAAGTAGCAGCACAGCAGACACACCAGAT
AAACTCTGCTCCACCTGGGCACACAGAGCTTCCTCAGCGGGGTTTTTTCAGTGGTTGCATTGAACAGATG
CACCTCCATTTAGTTAATCAGTATCCTTTTGCTGGGCTCCTTGGTGGCTTCCAATCTTTTGCTATGGCAA
GCAACGTGGCAAGGAATTGCCTTCTACTTGTGCCATTTGTCACCTGTGCCAATATTAAAAGCCACAAATT
CTAGAAGCGAATTGCTCAGCGAGGCAGGGGAAAGCACAGGCTTTGGAATCACGCAGACCCAAGTTCAAAT
CCCAGTCTGCCTGTTTCAGTGCTGCTGTATTGAGCTGTATTTATCTGTAACTTTTTGTTTGTTTGTTTGT
TTTTTGAGACGGAGTCTCACTCTGTTGCCCAGGCTGGAATGCAGTGGCGCGATCTCGGCTCACTGCAACC
TCTGCCTCCCGGGTTCAAGCAATCCTCCTGCCTCAGCCTCCCGAGTAGCCGGGATTACAGGCACCTGCCA
CAATATCTGACTTATTTTTTATTTTTATTATTTTTTTCGTAGAGATGGTTTTCTCCATGTTGGCCAGGCT
GGTCTTGAACTCCTAACCTCAAAAGACCCGCCCACCTTGGCCTCCCAAAGTGCTGGGATTACAGGTGTGA
GTCACTGCACCCAGCCTATCTGTGACATATTGGTAATATAAGTTCAGGAGAGGGAGAGAGAAAGAGGCAG
GGATTGAGACAGAGCAGGAGAGGAGGAGAGAAAATTTATATGGCTCAGGCAGTCTGATCCCTTCTGTTCC
CCCACAGGGAGACCCACAGCAGAGACATGACTCACAGGTGGCATCAGGTCCCTTTGAGTCTCTCTGGTGG
GAGAATCTCAACCCACAGAGTAGGATTCCAGTGTTCACATGCATTTTTGGTACTATGAGGCCTCTGAATG
TCAACCCTGTCACCTGAGACTCTGTTGAAAAACCAGCCGCGGCCGGGCGCGGTGGCTCACGCCAGTAATC
CCAGCACTTTGGGAGGCCGAGGCAGGCAGATCACAAGATCAGGAGATCGAGACCATCCTGGCTAACACGG
TGAAACCCCGTCTCTACTAAAAATACAAAAAAATTAGCCGGGCGTGGTGGCGGGTGCCTGTAGGCCCAGC
TACTTGGGAGGCTGAGGCAGGAGAACGGCATGAACCCAGGAGGTGGAACTTGCAGTGAGCCGAGATCGCG
CCACTGCACTCGAGCCTGGGTGACAGAGTGAGACTCCATCTCAAAAAAAAAAAAAAAAAAGAAAAAGAAA
AGAAAAACCAGCCTCCTGTTCTGGTGCTGCAGGGAAACAGGCCTGGCCACAGCCAAGGGTGCAGATTTTC
AGGAAGCATTTTTAAAAATATATATATATATATATACACATATATATATATACAGAGAGAGAGAAGCCAC
TGAGGCCCACAGAATTTGCATCATTTTATTCCTTGTCCAAGGTCACAGGACAAGCAGAGTCCCACCCACC
TGAAAGGATTCAGTTCTAAGACAGCCTTTAGGGCAAAAAGTCACAAAGTTGATCTATCTATCTATCTATC
TATCTATCTATCTATCTATCTATCTATCTACGTGTTTGCACCCTAATCACTGGCAGTAAATGCACTTTTT
TTCTTTTCTTTTCTTTTTTTTGAGACAGGGTCTTGTTCTGTCACCCAGGCTGGAGTAGGTGTAAAATTGG
AGAATCCATATTTCTTTCCTTGTTTTCAGATTTAATGCTTTCCTTTTTCCAGCAGGTCTCCTTCGTCCAT
CCATCCATCATCCATCTACCCACCCACCCACCAATCCATCCATCCATTCATCCAACCATCCATCCATCCA
TCATCCATCCACCCACCCACCCACCAATCCATCCATCCATTCATCCAACCATCCATCCATCCATCATCCA
TCCACCCACCCACCCACCAATCCATCCATCCATTCATCCAACCATCCATCCATCCATCATCCATCTACCC
ACCCACCCACCAATCCATCCATCCATTCATCCAACCATCCATCCATCCATCATCCATCCACCCACCCACC
CACCAATCCATCCATCCATTCATCCAACCATCCATCCATCCATCATCCATCCACCCATCTATCTGTCCAT
CCACTCTACCCTACCATCCATCCACCAGTCCATCCATCTATCCACCAATTCATCCATCCATCCACCCATT
CGCCCATCCATCCATCCACCCACCCATCCATCCATCCGTCCATCCACCCATTCATCCATTCATCCATTCA
CCCATCCATCCATCCACATATCTTCATCTGTGTTGTGTGTCTGTGTATCCATGTTTCTAAACCTTTATCT
GTTCCAGTGTCTGTATCCATAGGCCTGTGTCCACGTTTGTCATGTGTGTGCGTCTACAAGTCTCTGTCCT
CATGACCATGTGTCTGTGTCCCTGTGTCCTGGCATAAATGACCATACCTCACCGTCCCTGAGTCTATGTG
TAGGCCCCTGGGCTCCATAACTGCTTTCATGCACAGTCCCCACCCTCAGGGTTGACAAGGTTCCAGCACC
CAGGACCGCAGCCCCACCTATGGGGAGAGACAGCCCTTGCTGAGCAGATCCCGTCCTTGCCCTCTCCCAG
GGAGGACCCGTGGCGTGTGGCGAAGATGGTCAAGTCCTACCTGCAGCAGCACAACATCCCACAGCGGGAG
GTGGTCGATACCACTGGCCTCAACCAGTCCCACCTGTCCCAACACCTCAACAAGGGCACTCCCATGAAGA
CGCAGAAGCGGGCCGCCCTGTACACCTGGTACGTCCGCAAGCAGCGAGAGGTGGCGCAGCGTAAGTAATG
ACCCTACCCCGCATCTTCCCTGGGAGGGCCCAGGACTCTCCCCTAACTCATAGGTGGGGGCTGGAAGCTT
CACCATCCCCATTACACAGACAGGTAGATGGAAAGGAAGTCAGTGGGATTCAACCTGCATTTATTACCTA
TTCTGCGCCAGGCACTCTGTGGGACGGGAGTAGACTTGGTCCTGAACATCCAAAGATGAATGAAATGGGT
CCCTGCTTTCTTTTTCTTTTTTTAGATAGAGTTTTGCTCTTATTGCCCAGGCTGGAGTTCAGTGGTGCGA
CCTCAGCTCACTGTGACCTCCTCCTCCCAGGTTCAAGCAATTCTCCTGCCTCAGCCTCCGGAATAGCTGG
GATCACAGGTGCCCACCACCATGCCTGGCTAATTTTTTGTAGTTTTAGTAGAGACGGGGGTTTCACCATG
TTGGCCAGGCTGGTCTCGAACTCCTGACCTCAGGTGATCCACCCACCTCAGCCTCCCAAAGTGCTGGGAT
TACAGGCGTGAGCCACCATACCCGGCCTGGGTCCCTGCTTTCTCAGAGCCCATGGTTAAGTGGGAAAGTG
GGAGAAAAGCTCATTATGATCCAGCGAGTCATGTATTAGAATGGGACAACAGCAGCCACAACAACAACAA
TAATGGCCACTATTTATCATGCATTTTTGGGGTGCCTGGTCCAGTGCAGAGTATTACACAAAGCATTTCA
GGGTGCTGAGCTGACTGTCGACTGGGGAGTAACCTGTTCTTCCTCAGGAAGGCAGAGGTATCAGGAAGCC
TCACATTGCAAATATGACCTTTAGAATATGACCTTTAGACTAGGTCTTAAAGTGGGAGTTGAATTTTGAC
CCATGGAGACGCAGGGAAGGGTGTTTCAGGCAGAAGGAACAGCATCAGTAAAGGCTCAGAGGCAAGGTCT
ACGGTCAGGAAATCCTCCACTGCAAACTCAGGAAGCTCTGGGAGCCCTATTACTCAGGGAATGGTGGGTT
TTCTTCTCCAGGCCTCATACCCCACCAAATCCCAAGGGCACTGTCACCTTCCAACCTCCATCTATGTGTG
ACATTTTTTGACATTTCTTTTATTAATTATCCAGGTTAATTTTTTTTTATTATTTTTTTATTTTTTTGAG
ACAGAGTCTCACTCTGTTGCCCAGGCTGGAGTGCAGTGGCGTGATCTCGGCTCACTGCAAGCTCCGCCTT
CCGGGTTCATGCCATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGACTACAGGTGCCCGCCACCACGCCC
AGCTAATTTTTTGTATTTTTAGTAGAGTCGGGGTTTCACCATGGTCTCGATCTCCTGACCTCGTGATCCG
CCCGCCTTGGCCTCCCAAAGCGCTGGGATTATAGGCGTGAGCCACCGCGCCTGGCCTTAATTATCCAGGT
TTTTAAAAAAGGAAATAGCCAGGCATGGTGGTGTGCCATAATCTCAGCTACTGAGGAGGCTGAGGTGGGA
GGACTGTTTGAGCCCCCAGGGGTTCGGCTCCAACCTGGACAACATGGCAAGGTTCCTCCTCTAAAAAAGA
AAGAAAGGAAAGATGATAGGAAAGGGAGGAAAGATAGGAAAGGGAGGAAAGATAGGAAAGGGAGGAAAGG
TAGGAAAGGGAGGAAAGGTAGGAAAGGGAGGAAGGAAGGAAGGAAGGAAAAGAAAAGAAAGAAAAAGAAC
GAGAGAAAGAAAGAAAGGCAATACACATTTGGTTAAAAAAAAAGAAGAAAAACAGAGTTTATAAGAAAAA
TTAGATGCCTTTACCCTAAAGCAGCCACTGGAATTCCCAGACTCTTGTGTATCCTTCCAGAGAGATTTTC
TGCATATAGTAGCAATAGATATGTTATTTTCTTGCTCTTTTTCTGATTAATGGGAACATACTAGACACAC
TATTTTACATGTTGTGTTTTGTCTCTTGGCAATAAATCCTGTATATCTTTTCATATTGGCACTGAGAGAT
TGATGGATCTCATTCTTTATAAGGGCTGCTGGCTGTTATTTGTTTAACTGGCCACCTACTGAAGGACTTT
TAGGGATTTTGCAATCTGCGAATACAACCAACTGCAGCAAAAAACATCCTTCTTTATATAAGTGCATGTT
CACATGTGACTGTGTCTGTAGGATAAATTCCTAGAAGTGGACTGCTGGAACACAGAGTAGCTGCTTTAAA
ATCTTGGGTACATATTGGCATAGTTCCTTCCAAAGAAGTTATATCAATGTACACTTCCACCAGCAGTAAA
AATCACTGTTGAAAAAAAAAATGTATGCAGGCCGGGCGCAGTGGCTCATGCCTGTAATCCCAGCACTTTG
GGAGGCCAAGGCAGGTGGCTCACTTGAGGGCAGGAGTCTGAGACCAGCCTGGTCAACATGGTGAAACCCC
ATCTTAACTAAAAATACAAAAATTAGCTGGACATAGTGGCGCATGCCTGTAATCCCAGCTACTTGGGAGG
CCAAGGCAAGGGAATCGCTTGAACCCAGGAGGCAGAGGTTGCAGTGAGCCAAGATCACGCCACTGCACTC
CAGCCTGGGTGACAGAGGGAGACTCTGTCAAATAAATGTATGTATGTATGTATGTATGTATGTATGATGT
ATGTATGTATGCATGCATGCATGCATGCAATAGACAACTCTAGTCCTTACTCTATAGCTACCCCTCATCC
CAATTATTGGGGTGTTCACACTCTACTGTGCTGTAATACACACTTGGAAAAACAGTACATTTTAATACTA
TTTTAATTTGTATGAGAAGAAACAAAGATTTATTCTGAAAATCTGTTAGAACTAGATTTAGTTTCTGAGA
GAAATAGTGTTCTCCTAAAACCATCACTAGACGAAATTTTACAATATTCGATAAAATGCATTAAAAAATT
TCTAACAGTTAGAAATGTCATCAAGGACTTCAGCCTAGAGCTGAGAATAGAGATGCAAACTGTGGCCCAA
ATAGCTTTCCCTGGGGGATTTTTAACTTTTAAGGAGAAATTGGAGAATTTGAGATGCAGAGTCAATATAT
GACTGTAGTATTAGGACTCTTTCAGTAACAGGGGACAGAACCCCAACTCAAATGGATTTAAACAGAAAAG
GGAATGAATTGACTGAAGCAATGTGAGGAGTCCAGGGCTGCTTCAGGCATGGCTAGATCAAGGGGCTCAA
ATGAGGTTCACACAGCATTTCTTGACTCTTTTCTTTTTTGTTGGCTTCATTCTCAGACAGGCTCTCCTCA
TATAGGCAAAGATGGGTTCTGGAAGCCCTGGGCTGCCATCCTACCAGCTTAGCAACCTTGGAAGGAAAAG
AGCTCTTCTTTTCAGCTACTTTCAGTTGAATGACTCTTTTTATACTTGATAATGTAAACTTTTATTTATT
TTATTTTTTGAGATGAGGTCTTGCCATGTTGCCCAGGCTAGTTTTGAACTTCCGGGCTCAAGTGATCCCT
GCCTCAGCCTCTTAAGTAGCTGGAATGGCAGGCACACATCACTGTGTCTGCTATAACAGAAACTTTTAAA
TGAAGTCTAACATAAAAAGTGCCCAAATCCAAAGCATACAGTTCAATGAATTTTCACAAAGTGAACACAC
CAATACAGATAAAAAATAGAATATTACCAGCTCCCAGCCTGGTGTGGTTGTACCCACCTGTAATCCCAGC
ACATTTGGGAGGCAAAAGTGGGAGGATCACTTGAGCCTAAGAGTTCAAGACCAGCCTGGGCAACAGAAGG
AGACCCCATCTCTCCAAAAAATACGAATAAATGAGCCAGGTGTGGTGCAGCAGGTGTGTAGTCCTAGCCA
CTTATGCCCGCTGCTGTTTCTTGGCACTGAGATGGTGAGGGCCCTGCTGCTGCTGCTGCGCCCCTGATTC
AGAATCTACCTTCCTCTCTTCTCATAAAGTGCTGTCCTAGCACTTGTGTGTCCCAGTCCTTTGGTGGCCT
GCTCAGGGATAAGCTGGTAGACCAGTTTCCAGTAAAGTGGTACCAGCTGATGAAGGCTGGCTGGGTCTCT
TCCTCTAGCTAACATTTGCATTTTAAGCCTGATTTTCAAGTCTGGAAAGCTGAATATAATTCTGAATACA
CTGAGGACATATGGCTCAAATTTTTGCTCTACTGCGTGACTCTGGAAAAATATGTAAGCTCTCTGAGCCT
CAGCTTCCTCATCTGTACAATGGGGATAGTAAATGTGCCAAATCAGAACAAATGCTAATGCTTACCTGCA
GTCTTGTACTGAGAAGGATGGTGAGATCATATCTTGGGTTGGTAGGAAAGCATTCAGGGATTGATTAGTG
ATGTTTGCCTTGAACACAGGTTAAGAAAGTGATGGCATGTGTGCTGTGTGTTTGTCATCAGTAGATTAGA
TGATTTCTAAGTTCTAGCTGTAAGCTCCTCTGGTTCAGCGCCATGGCAATGAGAAAGAATCAAGGGCAAG
GTCAGGGGAATGGACGTGGGAAGGTGAGAGTGGCCAGTACCCCACTCACGGCTTTCTGTGCCTGCAGAGT
TCACCCATGCAGGGCAGGGAGGGCTGATTGAAGAGCCCACAGGTGATGAGCTACCAACCAAGAAGGGGCG
GAGGAACCGTTTCAAGTGGGGCCCAGCATCCCAGCAGATCCTGTTCCAGGCCTATGAGAGGCAGAAGAAC
CCTAGCAAGGAGGAGCGAGAGACGCTAGTGGAGGAGTGCAATAGGTACAACGGCGGGCGGGAAACAGTGC
TGGTTTGGTCTGGGCTGCGGCAAGGCCAGGGAAGGGGAAGGTGACTCTAGGTCCTGTAAAAGGCTGTCCA
GTTGCCGAGAACTCCTGATATTGGCTTAGCCTGGCCCAGAAAATTGAGAATACTTGAACCTAAGCCCATT
CCTCGCAGCCCCCCTGCACCCTGGACACCAAGCAACCCCTTCCATGGATGCTCACCCAATTCGATTCTCT
CTACAATCCTATGGCTCTTTTGCTCACTTTATGAATGGAGAGACTGAGGTCAGACAGACTGTCAATTGCC
CAAGGTCACACAGCAGACCTGGCATTGGAACCCAGATCTGCCAGCCTCAAACCCTCCGGCAGAGCTCAGC
TTCTCAGAACCCTCCCCTTCATGCCCAGGACAGGGTTCCTCTGAGCCTGGCCTGGAGGCTCATGGGTGGC
TATTTCTGCAGGGCGGAATGCATCCAGAGAGGGGTGTCCCCATCACAGGCACAGGGGCTGGGCTCCAACC
TCGTCACGGAGGTGCGTGTCTACAACTGGTTTGCCAACCGGCGCAAAGAAGAAGCCTTCCGGCACAAGCT
GGCCATGGACACGTACAGCGGGCCCCCCCCAGGGCCAGGCCCGGGACCTGCGCTGCCCGCTCACAGCTCC
CCTGGCCTGCCTCCACCTGCCCTCTCCCCCAGTAAGGTCCACGGTAAGTGGTATGTGGGGACAAGGGACA
CGTGGGAAGGTGGGAGGGTTGGGGAGGACTGTCCCAGTGACAGCAGTCACCTAAACCTCTTTGCACTTCA
GTTTGGTTCCATTCCATTCATGCCACTCCTTATCACTCTACTTCACTCTGTTCATTCATCCATTCCACTC
TATCTCATTCCATTCACTCTACTCCTTTCCACTCTATTCACTCCATCCACCACAATTAACCCCATTCCAT
CCACTCCATCCACTACCTTCGACTCCACTCCATCCACTCTACTCCATTCACTCCACTCAACTCCACTCCA
TCCACTCCACTCCGTCCAACTTCATCCCATCCACTACATTCAACTCCACTCCATCCACTCTACTCCATCT
ACTACCTTCAACTCTACCCCATCCATCCACTCCACTCCATCCATTCCATCCAACTTCATCCCATCTACTA
CATTCAACTCTACTCCATCCACTCCACTCCATCCATTGCATCCAACTTCATCCCATCTACTACATTCAAC
TCCACTCCATCCACTCCACTCCATCCATTCCCTCCAACTTCATCCCATCCAGTACATTCAACTCCACTCC
ATCCACTGTACTCCATCTACTACATTCAACTCTACTCCATCCACTCCCCTCCATCCATTCCATCCAACTT
CATCCCATCTACTACATTCAACTCCACTCCATTCATTCCACTCCATCCATTCCATCCAACTTCATCCCAT
CCACTACATTCAACTCCACTCCATCCACTCCATACCTGGCTCCATCCACTCCACTCCATCTACTACATTC
AACTCTACTCCATCCACTCCATACTCTATTCCATCCACTTACTCCATCCACTCCATTCAGCTCCACTCCA
TCCACTCCACTCACTCAACTCCATCTACTCCACTCCCTCTACTTCATTCAACTCTGCTCCATCCACTCCA
CTCCACCCATTCCATCCACTGCACTCCAACCAGCTCCATTAGACTCCACTCCATCCACTCTACCCACTCT
TCTCTCCACTCCTCTCCACTACATACCATTTTATTCTATCTGTCCCATCCACTCAACTCCATTCACTCCA
CATGACTCCACATTTCATCCATTCCACTCTACTTCATCCACTCACTCCACTCTATACCATTCCACTCCAC
TCTATTCACATACTCCACCATTCCAGTCTACTCCATTCACTCCACTCCAACCCACTCACTCCACTCCATA
CCATTCCACTCCACTGTGTTCACACAACTCCATCCATTCCACTCTAGCCACTCCATTCATTCCACTCCAC
GCCACACTATTCCTCACCATTCCATCCACTCCACCCTATACCATTCCACTCCACTCTATTCCTCCCCACC
CGTCCTCTCCACCCTTTACCACTCCACTCGACTGTACCCATTCCACTTGATCCCACTCATTCCACTCAAT
TCCATCTACTCTACTCCACACCATCCACTCCACTTCATATCATTCCACTCAACTCAACCTAAGTTGATTT
GGGTTAATTCAATTCAATTCATTCATTTCAGATTGTATCAATTCAATTCATTTCAATTCAACTAAATTCA
GTTAAATTCAGTTCAGTTGTCTTCTACTGAGCACCTACTGCATGTCAGGTATAGCACTAGGCAGTGGGAG
GAATGGAGCTAATAAATGCAGTCCCAGCCTTCAAGGAACTGGGAGCAGCTGACCCAGGGCTTGGCAAAAG
GTAGAAACAAAGGCAGATTTGCTGGCTGCATAAAGGCAGACAGGCAGCTGGCCTAAGCAAACCAATGGAG
TTTGAAGTGCTGAGGGCTGTGGAGGCAGGGGAGGGCAGGGAAGTGGGGTGCTGAGGCAGGACACTGCTTC
CCTCTCCAGGTGTGCGCTATGGACAGCCTGCGACCAGTGAGACTGCAGAAGTACCCTCAAGCAGCGGCGG
TCCCTTAGTGACAGTGTCTACACCCCTCCACCAAGTGTCCCCCACGGGCCTGGAGCCCAGCCACAGCCTG
CTGAGTACAGAAGCCAAGCTGGTGAGTGTCCTTGCTTGTAAGGAAAACCCAACCTCATCTTTCCTTGGCA
GGGAGATTCTGGAGCAGTCCCTAGGGAGGCCCTGTGGGGACCCCGGCCCCCCGGACACAGCTTGGCTTCC
CCTCGTAGGTCTCAGCAGCTGGGGGCCCCCTCCCCCCTGTCAGCACCCTGACAGCACTGCACAGCTTGGA
GCAGACATCCCCAGGCCTCAACCAGCAGCCCCAGAACCTCATCATGGCCTCACTTCCTGGGGTCATGACC
ATCGGGCCTGGTGAGCCTGCCTCCCTGGGTCCTACGTTCACCAACACAGGTGCCTCCACCCTGGTCATCG
GTAAGCTGGTGGGGATGGGTGGGCACCTGGGTGGGAGGCTCATGGGGCAACCGCAGAATCCAGGAGCTGG
AAGAGCCACTGGGACTCATTCATTCATTCATACAACATGTATTTATCCAGTGCCTACTCTGGACCAGTCA
CTGTGCTACATCAGTGATACCTGGGTGAACCAAACAGACCAAAATCTCAGCAACTCAAGCAGGGAGGCAG
GCACTAAGCATAATACATCAATTCTGTGGTACCTCAGAAGGTGAAGGGTCTATGGGCAAATTACAGCAGG
GTAAGGGGGACTGATGTTTTCTAAGTTTTTGTTTTATGAAGAAAAATTAAGCCCAGAAAGCCCTTATTTG
CAGGTACAATTATGCAGAAGCCCAGTACATAAGATAGATAAGGAGCTGCTAGGAGAGGGGAGCAGAGAAC
TGACCCCATGGCCTTTGCACTGCTGTGGAACCCCAGGGCTCCAGGGAACCGCAGTTTGACAACTTTTGAA
CAAGTCACCGCCTGCCTCTCCCACTAGCCTAGACAAAGAGCTAAAGGCTCAGAGAGGGGGAATGACTTGC
CAGAGCCACTTAAATTAGTGGCAGGTCCCAGTGGAGGGCTGTTTCCTGACCACCCTGCCCCCTCCTCCAA
ACCACGGGCTCTGGGAAGGAGAGGTGGTGCCCTTGGGAGGTCTTGGGCAGGGGTGGGATATAACTGGGGG
GCCCAGCTGATTCCCTCCCCTTCCACTCCAGGCCTGGCCTCCACGCAGGCACAGAGTGTGCCGGTCATCA
ACAGCATGGGCAGCAGCCTGACCACCCTGCAGCCCGTCCAGTTCTCCCAGCCGCTGCACCCCTCCTACCA
GCAGCCGCTCATGCCACCTGTGCAGAGCCATGTGACCCAGAGCCCCTTCATGGCCACCATGGCTCAGCTG
CAGAGCCCCCACGGTGAGCGCCCTGTGCCCCACACAGCAGGAGATGATGATAGAGGTTGGCTGTCAATGG
ATGCAGGGGAAAGGGGTGCCTGGCAGGCATTGCAGTCTGCATGTGTCTCTGGGACAAGTGTGTTTCCGTG
ATTGAGGGTGTCTGCAGGCCAGTGTGTTCCCATGTGAATGCACGTATCTGTGTGTGTGCACGACTGCTTG
TGTGAGCAGATCCCTAGTGCGTGTCTGGGTGTGTATCGGTTGTGCATGCATTTGTGTGCATGCCTGTGTT
TCTCTGAAACTCTTAGGGCCATATGAATTTCTAAAATCTATTCAGATTTTAGAAAGGTAATCTGGGGCCA
GGCGTGGTGGCTCATGCCTGTAATCCCAGCACTTTGGAAGGCCGAGGTGGGCAGATCACTTGAGGTCAGG
AGTTCAAGACCAGCCTGGCCAACACGGTGAAACCCCGTCTCTACTAAAAGTACAAAAATTAGCCAGGCGT
GGAGGCACGTGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGAATCGCTTGAACCTGGGAGGCGGAG
GTTGCAGTGAGCTGAGATTGTGCCACTGCACTGCACTCCAGCCTGGGCAACAGAGTGAGTACTCTGCCAA
AAAAAAAAAAAAGAAAAAGAAAAGTAATTTGGTGGATATGCCAGCTATCACATAACACTCCCAGCAGGGT
TTGAGACAGCCCCCTGTGATCAAAGCACTGCTATTTCTGCAGCCAAAGATAGGATTTCCACAAGGCAGTT
TAAACACTCATAACCTCATACCAGGTTCAGGCTGGGTTTCGCCACCAAATGAGCTCTAGTGCCAATCTTA
AGGAAATCTTTTGGTTTCAGACCTTTTTGGAATTTGGGACTGGGTTAAGAGATTTGGGGCCTGTGTATGT
CTGTGTGTACATGGTTGAGAGTGTATGCGTGTGTGTGTGTTACATTTTTATCTGAGTCACTGTGAGAATA
TGTGGTTTATGTGCGTTTTCCTAGTGTCTGTGTGTCTCTTGTAGGGTCTATGTGTCTCCTGAGGGGTGGG
TGGAGGTGCTGTAGGCAAAACATGGGGTTATATTTGGAAAGGTGGCCTGGAGATGGAAGTACTGCTGTCC
TCTGCCTTGCTCTTTGACAGGGGCCACCTGAACTTACTACACAGCACCTGAGTTTAGTACAGTAGCACCT
GTGCTTAGTACATAGCACCTGTGTTTAGTATAGTAGCACCTGTGCTTAGTACATAGCGCCTGCATTTAGT
ACAGTAGCACCTGCGCTTAGTACGTAGCACCCGTGCTTAGTACAGTAGCCCTGCACCTGCACCTCCCGTT
CCCTTTCATACCTGCTGTCTCCAGGTGACAAGAGGAGCTGGAGTTGGTTTCTGGCCTCCTCCAGGCTCCC
CTGCATCAAGCGCAGCTGAGCAGTTCCCTGTAATGGGGAGAGGGTCTGTCCCTTTATCTGGAGCCTCCAG
TTTTGAAAATCAGCCCTGGATCTCCAACTGCTGCCCAGTCTGGCTGTTCAGCAGGCCCCATGCCCCCCTT
TCCCCAGTCTTGAGGCCTGGGACTAGGGCTGTCAGGCACGTCTGCCACGTCTGCCCCTCTCTCCCCTGCG
GCCAGCCCTCTACAGCCACAAGCCCGAGGTGGCCCAGTACACCCACACGGGCCTGCTCCCGCAGACTATG
CTCATCACCGACACCACCAACCTGAGCGCCCTGGCCAGCCTCACGCCCACCAAGCAGGTAAGGTCCAGGC
CTGCTGGCCCTCCCTTGGCCTGTGACAGAGCCCCTCACCCCCACATCCCCCGGGCTCAGGAGGCTGCTCT
GCTCCCCCAGGTCTTCACCTCAGACACTGAGGCCTCCAGTGAGTCCGGGCTTCACACGCCGGCATCTCAG
GCCACCACCCTCCACGTCCCCAGCCAGGACCCTGCCAGCATCCAGCACCTGCAGCCGGCCCACCGGCTCA
GCGCCAGCCCCACAGGTGAGAGGCCCTGGCTCCACCCCCTCCCTTACTGTCCCTGCCCCCTTCCATGTTG
GTCCCACCCCTTCTGTTGCTGTCCGTCACTGTGGGGCTGTGCATGCAGCAGGCCTAGGGCTGCTGTGAGG
AAGCACTGGCAGGCGTGGAGGGGTGGGGTGGGCTTCCATGAAGCCCAAGAGGCACAGGCGACCCCAGGAA
GATGGGGCCCACCTTACAAGAACATCTCAGGGACTGGACTGAGGAAAATGATGAAGTAATTGTTAGAGGC
AGTGGAGAGTAACGATAAGGACCAGGTCTGTGAGCCAGAAGCGACTCTTGCATGTGTTTGGTTTGGCTTG
GCTCAGGGTTGAACATTCTTTGAATTTGTTGCCAACATTTCAAAATCAGGAATTTTCACATAGAAATCCA
GATTTCTGGCATCTTTTGGAAAATTGGAAAAATTCATATTCCCATATAACAGTGCACCAGGCCTCACTGC
ATCCACTCACATCGGCTCCAATGACCCCTGGCGGTGGGGCGGTCACCTGAGTTTGTAATCTATGATGCCA
TTGTCAGGAGTGGGCACTTTGGAGTCAGGTGGGTGGGCTTTGCAGCCTGCCTTTTTGCAAGCTGCCCCCT
TTCTATAAGGTTCCTGGACCCGGCCCTGGCAGCTGTCATGACAGCCACTGAGGCCATTCTGTGTCCTGGC
ATTCACCCCATGAGGCGTGTCCATGACCACCCCCATTTTACAGATGAGAAAGTGGACACTCAAGGAGGGG
AGGGACTTGCCTCATCTCCCACAGCTACCAGATGGCAGGGCCAGGACTCGAACCCAGGTCCCACCTCAAA
GCCCATTCTCTTAACCATTGCCATCAGTCAGGGTTCTCCAGGAACACAGAACCAACAGATTTATTTGAAG
GACTTGGCCCGTGTCTGCGGGGGGCTGGCAGGCCTGAAATCTGTTGCTGGCAGCTCTGGGGAGTGAATTC
TGCAGCCTTGAGGCAGAATTCCTTCTCTGGGAAACCGGTTTTTGCTCTTACGGCCTTCCACTGATGTAGG
AGGCCCACCCATATTTTCAGGGTAATCTCCTTCACTCAAAGCCAACTGATTGTGGCATTAACCACATCTA
CAAAGCACCTGCATTCACAGCAGCGCCTAGATTAGTGTTTGACTCAGCCTAGCCAAGCCAACACGTACAA
CTACCTACCTCGGCATCTCACCGGGGCTTCTCCAGTGTTCACACTAAGATGTACTCAGGCCACTCCATGG
GCGGCCGTGGACCCTGGCTGGGAGGCTCCCTTTGAAGAACCGAGGGTAGAGGTGTGACTTTGGGGTTCCT
GTTATCTGCTGTGATCCAGGAGGTGTGGCCCTGCCTCCCCATCCTGAGTACCCCTAGGGACAGGCAGGTG
GGGTGGGTGTGGGTGCCTGGTGGGTGGCTAGCAGCCTTGTTTGCCTCTGCAGTGTCCTCCAGCAGCCTGG
TGCTGTACCAGAGCTCAGACTCCAGCAATGGCCAGAGCCACCTGCTGCCATCCAACCACAGCGTCATCGA
GACCTTCATCTCCACCCAGATGGCCTCTTCCTCCCAGTAACCACGGCACCTGGGCCCTGGGGCCTGTACT
GCCTGCTTGGGGGGTGATGAGGGCAGCAGCCAGCCCTGCCTGGAGGACCTGAGCCTGCCGAGCAACCGTG
GCCCTTCCTGGACAGCTGTGCCTCGCTCCCCACTCTGCTCTGATGCATCAGAAAGGGAGGGCTCTGAGGC
GCCCCAACCCGTGGAGGCTGCTCGGGGTGCACAGGAGGGGGTCGTGGAGAGCTAGGAGCAAAGCCTGTTC
ATGGCAGATGTAGGAGGGACTGTCGCTGCTTCGTGGGATACAGTCTTCTTACTTGGAACTGAAGGGGGCG
GCCTATGACTTGGGCACCCCCAGCCTGGGCCTATGGAGAGCCCTGGGACCGCTACACCACTCTGGCAGCC
ACACTTCTCAGGACACAGGCCTGTGTAGCTGTGACCTGCTGAGCTCTGAGAGGCCCTGGATCAGCGTGGC
CTTGTTCTGTCACCAATGTACCCACCGGGCCACTCCTTCCTGCCCCAACTCCTTCCAGCTAGTGACCCAC
ATGCCATTTGTACTGACCCCATCACCTACTCACACAGGCATTTCCTGGGTGGCTACTCTGTGCCAGAGCC
TGGGGCTCTAACGCCTGAGCCCAGGGAGGCCGAAGCTAACAGGGAAGGCAGGCAGGGCTCTCCTGGCTTC
CCATCCCCAGCGATTCCCTCTCCCAGGCCCCATGACCTCCAGCTTTCCTGTATTTGTTCCCAAGAGCATC
ATGCCTCTGAGGCCAGCCTGGCCTCCTGCCTCTACTGGGAAGGCTACTTCGGGGCTGGGAAGTCGTCCTT
ACTCCTGTGGGAGCCTCGCAACCCGTGCCAAGTCCAGGTCCTGGTGGGGCAGCTCCTCTGTCTCGAGCGC
CCTGCAGACCCTGCCCTTGTTTGGGGCAGGAGTAGCTGAGCTCACAAGGCAGCAAGGCCCGAGCAGCTGA
GCAGGGCCGGGGAACTGGCCAAGCTGAGGTGCCCAGGAGAAGAAAGAGGTGACCCCAGGGCACAGGAGCT
ACCTGTGTGGACAGGACTAACACTCAGAAGCCTGGGGGCCTGGCTGGCTGAGGGCAGTTCGCAGCCACCC
TGAGGAGTCTGAGGTCCTGAGCACTGCCAGGAGGGACAAAGGAGCCTGTGAACCCAGGACAAGCATGGTC
CCACATCCCTGGGCCTGCTGCTGAGAACCTGGCCTTCAGTGTACCGCGTCTACCCTGGGATTCAGGAAAA
GGCCTGGGGTGACCCGGCACCCCCTGCAGCTTGTAGCCAGCCGGGGCGAGTGGCACGTTTATTTAACTTT
TAGTAAAGTCAAGGAGAAATGCGGTGGAAA
|
| Protein Name |
HNF1A_HUMAN |
| Length |
631 |
| Moltype |
AA |
| Topology |
linear |
| Division |
PRI |
| Update Date |
12-OCT-2022 |
| Create Date |
01-FEB-1991 |
| Definition |
RecName: Full=Hepatocyte nuclear factor 1-alpha; Short=HNF-1-alpha; Short=HNF-1A; AltName: Full=Liver-specific transcription factor LF-B1; Short=LFB1; AltName: Full=Transcription factor 1; Short=TCF-1 |
| Primary Accession |
P20823 |
| Accession Version |
P20823.2 |
| Other SeqIDs |
sp|P20823.2|HNF1A_HUMAN,gi|51338763 |
| Organism |
Homo sapiens |
| Taxonomy |
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo |
| Comment |
On or before Jul 18, 2007 this sequence version replaced gi:106236, gi:121948941, gi:123436.; [FUNCTION] Transcriptional activator that regulates the tissue specific expression of multiple genes, especially in pancreatic islet cells and in liver (By similarity). Binds to the inverted palindrome 5'-GTTAATNATTAAC-3' (PubMed:12453420, PubMed:10966642). Activates the transcription of CYP1A2, CYP2E1 and CYP3A11 (By similarity). {ECO:0000250|UniProtKB:P22361, ECO:0000269|PubMed:10966642, ECO:0000269|PubMed:12453420}.; [SUBUNIT] Binds DNA as a dimer (PubMed:12453420). Heterotetramer with PCBD1; formed by a dimer of dimers (By similarity). Interacts with PCBD1 (PubMed:10966642). Interacts with BHLHE41 (By similarity). {ECO:0000250|UniProtKB:P22361, ECO:0000269|PubMed:10966642, ECO:0000269|PubMed:12453420}.; [INTERACTION] P20823; Q9Y463: DYRK1B; NbExp=4; IntAct=EBI-636034, EBI-634187; P20823; P61457: PCBD1; NbExp=3; IntAct=EBI-636034, EBI-740475; P20823; Q92786: PROX1; NbExp=3; IntAct=EBI-636034, EBI-3912635.; [SUBCELLULAR LOCATION] Nucleus {ECO:0000255|PROSITE-ProRule:PRU00108, ECO:0000269|PubMed:10966642}.; [ALTERNATIVE PRODUCTS] Event=Alternative splicing; Named isoforms=8; Name=A; IsoId=P20823-1; Sequence=Displayed; Name=B; IsoId=P20823-2; Sequence=VSP_002250, VSP_002251; Name=C; IsoId=P20823-3; Sequence=VSP_002252, VSP_002253; Name=4; IsoId=P20823-4; Sequence=VSP_047736, VSP_047739; Name=5; IsoId=P20823-5; Sequence=VSP_047737, VSP_047738; Name=6; IsoId=P20823-6; Sequence=VSP_053324, VSP_053325, VSP_053326; Name=7; Synonyms=insIVS8; IsoId=P20823-7; Sequence=VSP_054302; Name=8; Synonyms=delta 2; IsoId=P20823-8; Sequence=VSP_054300, VSP_054301.; [TISSUE SPECIFICITY] Liver.; [POLYMORPHISM] The Ala-98/Val-98 polymorphism is associated with a reduction in glucose-induced serum C-peptide and insulin responses. {ECO:0000269|PubMed:9133564}.; [DISEASE] Hepatic adenomas familial (HEPAF) [MIM:142330]: Rare benign liver tumors of presumable epithelial origin that develop in an otherwise normal liver. Hepatic adenomas may be single or multiple. They consist of sheets of well-differentiated hepatocytes that contain fat and glycogen and can produce bile. Bile ducts or portal areas are absent. Kupffer cells, if present, are reduced in number and are non-functional. Conditions associated with adenomas are insulin-dependent diabetes mellitus and glycogen storage diseases (types 1 and 3). Note=The disease is caused by variants affecting the gene represented in this entry. Bi-allelic inactivation of HNF1A, whether sporadic or associated with MODY3, may be an early step in the development of some hepatocellular carcinomas.; [DISEASE] Maturity-onset diabetes of the young 3 (MODY3) [MIM:600496]: A form of diabetes that is characterized by an autosomal dominant mode of inheritance, onset in childhood or early adulthood (usually before 25 years of age), a primary defect in insulin secretion and frequent insulin-independence at the beginning of the disease. {ECO:0000269|PubMed:10078571, ECO:0000269|PubMed:10102714, ECO:0000269|PubMed:10482964, ECO:0000269|PubMed:10588527, ECO:0000269|PubMed:10966642, ECO:0000269|PubMed:12453420, ECO:0000269|PubMed:17573900, ECO:0000269|PubMed:8945470, ECO:0000269|PubMed:9032114, ECO:0000269|PubMed:9075818, ECO:0000269|PubMed:9075819, ECO:0000269|PubMed:9097962, ECO:0000269|PubMed:9166684, ECO:0000269|PubMed:9287053, ECO:0000269|PubMed:9392505, ECO:0000269|PubMed:9626139, ECO:0000269|PubMed:9754819}. Note=The disease is caused by variants affecting the gene represented in this entry.; [DISEASE] Diabetes mellitus, insulin-dependent, 20 (IDDM20) [MIM:612520]: A multifactorial disorder of glucose homeostasis that is characterized by susceptibility to ketoacidosis in the absence of insulin therapy. Clinical features are polydipsia, polyphagia and polyuria which result from hyperglycemia-induced osmotic diuresis and secondary thirst. These derangements result in long-term complications that affect the eyes, kidneys, nerves, and blood vessels. {ECO:0000269|PubMed:10333057, ECO:0000269|PubMed:9313763, ECO:0000269|PubMed:9867222}. Note=Disease susceptibility is associated with variants affecting the gene represented in this entry.; [MISCELLANEOUS] [Isoform 7]: Due to intron retention. {ECO:0000305}.; [SIMILARITY] Belongs to the HNF1 homeobox family. {ECO:0000305}.; [WEB RESOURCE] Name=Wikipedia; Note=Hepatocyte nuclear factors entry; URL='https://en.wikipedia.org/wiki/Hepatocyte_nuclear_factors'.; [WEB RESOURCE] Name=SeattleSNPs; URL='http://pga.gs.washington.edu/data/tcf1/'. |
| Source Db |
UniProtKB: locus HNF1A_HUMAN, accession P20823;; class: standard.; extra accessions:A5Z2R8,E0YMJ5,E0YMK0,E0YMK1,E2I9R4,E2I9R5,F5H5U3,Q2M3H2,Q99861; created: Feb 1, 1991.; sequence updated: Aug 16, 2004.; annotation updated: Oct 12, 2022.; xrefs: M57732.1, AAA88077.1, X71346.1, CAB59201.1, U72618.1, AAC51137.1, U72612.1, U72613.1, U72614.1, U72615.1, U72616.1, U72617.1, HM116552.1, ADM43489.1, HM116557.1, ADM43494.1, HM116558.1, ADM43495.1, HM449088.1, ADK56177.1, HM449089.1, ADK56178.1, EF641294.1, ABR09270.1, AC079602.15, CH471054.1, EAW98226.1, BC104908.1, AAI04909.1, BC104910.1, AAI04911.1, A36749, NP_000536.5, NP_001293108.1, 1IC8_A, 1IC8_B, 2GYP_A, 2GYP_B; xrefs (non-sequence databases): CCDS:CCDS9209.1, PDBsum:1IC8, PDBsum:2GYP, AlphaFoldDB:P20823, BMRB:P20823, SMR:P20823, BioGRID:112789, DIP:DIP-33544N, IntAct:P20823, MINT:P20823, STRING:9606.ENSP00000257555, DrugBank:DB04419, GlyGen:P20823, iPTMnet:P20823, PhosphoSitePlus:P20823, BioMuta:HNF1A, DMDM:51338763, jPOST:P20823, MassIVE:P20823, MaxQB:P20823, PaxDb:P20823, PeptideAtlas:P20823, PRIDE:P20823, ProteomicsDB:15197, ProteomicsDB:15200, ProteomicsDB:15201, ProteomicsDB:15217, ProteomicsDB:26985, ProteomicsDB:53806, ProteomicsDB:53807, ProteomicsDB:53808, DNASU:6927, Ensembl:ENST00000538646.5, Ensembl:ENSP00000443964.1, Ensembl:ENSG00000135100.19, Ensembl:ENST00000540108.1, Ensembl:ENSP00000445445.1, Ensembl:ENST00000541924.5, Ensembl:ENSP00000440361.1, GeneID:6927, KEGG:hsa:6927, UCSC:uc021rfb.2, CTD:6927, DisGeNET:6927, GeneCards:HNF1A, GeneReviews:HNF1A, HGNC:11621, MalaCards:HNF1A, MIM:142330, MIM:142410, MIM:600496, MIM:606391, MIM:612520, neXtProt:NX_P20823, OpenTargets:ENSG00000135100, Orphanet:319303, Orphanet:404511, Orphanet:324575, Orphanet:552, PharmGKB:PA36380, VEuPathDB:HostDB:ENSG00000135100, eggNOG:ENOG502QRPW, GeneTree:ENSGT00940000153818, HOGENOM:CLU_068818_0_0_1, InParanoid:P20823, PhylomeDB:P20823, TreeFam:TF320327, PathwayCommons:P20823, Reactome:R-HSA-210745, SignaLink:P20823, SIGNOR:P20823, BioGRID-ORCS:6927, ChiTaRS:HNF1A, EvolutionaryTrace:P20823, GeneWiki:HNF1A, GenomeRNAi:6927, Pharos:P20823, PRO:PR:P20823, Proteomes:UP000005640, RNAct:P20823, Bgee:ENSG00000135100, ExpressionAtlas:P20823, Genevisible:P20823, GO:0000785, GO:0005737, GO:0005634, GO:0032991, GO:0003677, GO:0001228, GO:0003700, GO:0000981, GO:0046983, GO:0046982, GO:0042803, GO:0000978, GO:0000976, GO:0042593, GO:0046323, GO:0030073, GO:0001889, GO:0031016, GO:0045893, GO:0045944, GO:0060261, GO:0006357, GO:0035623, CDD:cd00086, DisProt:DP01620, Gene3D:1.10.260.40, InterPro:IPR039066, InterPro:IPR006899, InterPro:IPR044869, InterPro:IPR023219, InterPro:IPR006898, InterPro:IPR006897, InterPro:IPR044866, InterPro:IPR009057, InterPro:IPR001356, InterPro:IPR010982, PANTHER:PTHR11568, Pfam:PF04814, Pfam:PF04813, Pfam:PF04812, SMART:SM00389, SUPFAM:SSF100957, SUPFAM:SSF46689, SUPFAM:SSF47413, PROSITE:PS51937, PROSITE:PS00027, PROSITE:PS50071, PROSITE:PS51936 |
| Sequence |
mvsklsqlqtellaallesglskealiqalgepgpyllagegpldkgescgggrgelaelpnglgetrgsedetdddgedftppilkelenlspeeaahqkavvetllqedpwrvakmvksylqqhnipqrevvdttglnqshlsqhlnkgtpmktqkraalytwyvrkqrevaqqfthagqgglieeptgdelptkkgrrnrfkwgpasqqilfqayerqknpskeeretlveecnraeciqrgvspsqaqglgsnlvtevrvynwfanrrkeeafrhklamdtysgpppgpgpgpalpahsspglpppalspskvhgvrygqpatsetaevpsssggplvtvstplhqvsptglepshsllsteaklvsaaggplppvstltalhsleqtspglnqqpqnlimaslpgvmtigpgepaslgptftntgastlviglastqaqsvpvinsmgsslttlqpvqfsqplhpsyqqplmppvqshvtqspfmatmaqlqsphalyshkpevaqythtgllpqtmlitdttnlsalasltptkqvftsdteassesglhtpasqattlhvpsqdpagiqhlqpahrlsasptvsssslvlyqssdssngqshllpsnhsvietfistqmasssq |
|