Basic Information

Gene Symbol
ci
Assembly
GCA_012977825.2
Location
Scaffold:878586-909388[-]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 5 0.012 1.1 9.8 1.7 3 23 2979 3002 2977 3002 0.91
2 5 3.1 2.9e+02 2.2 0.1 8 23 3022 3037 3010 3037 0.79
3 5 0.00011 0.01 16.2 2.7 1 23 3043 3067 3043 3067 0.98
4 5 0.00041 0.039 14.4 1.3 1 23 3073 3098 3073 3098 0.97
5 5 1.6e-05 0.0015 18.9 0.3 1 23 3104 3129 3104 3129 0.97

Sequence Information

Coding Sequence
ATGCCAACTATGGAATCTTCGACTGCGATTACATCATTTGATTTTCTAGATGATGCACAAAGTAACGAGGAAGTTGAAGATGAAGGAAGAAGATTCAATCTCGCAACAAGAGTAATGTCTAATGGGGTTGAGGTAATCGTTGCCGGTACAACACCCGCGTATCGCTGGGAAAATTCAAATCCGCAACCTACGCTCACCCTGTCGGAAGCAGTTGTAATGCTTTTGCCGCAAGATAAGCCAAATGAATTTGTTACAAAGACTTGTACAACGACCTTCACCTACCTCAATACTATAACGCGCGATGGGACGACTATCGTATCAACCGACGAACAGGTGATTGCTAATACCGCAACTGAAGAAAGGCACAGAAAACCAGTTTCCGAAGCTGCTTCTGTTACACTCGAGGCGTCGCCGACTCTACAGACTGAAGTTTTCAAAACAACGTACACTTACTTGACGTTGAACACCGATCATCCTGATGAAAATAATGCCCTCGGAAGCAGCACGAGAGTCATCACTAATACGGTTACTGCCCCACAATACTATCTGGATATGGTACTCGAGCCATCGGAAACACAACAGCCCGAGACGAATACGTACATGAGTACCAGAGCCCTTGAAAAGACCTACATCGAAGATGGCAAGACTAGGGTTGAGATGACTCACGATATAGTTACGCAGCTGATCATAACGGAATCGGCACCGCCACCTAAGGCAGTAAGTGTGACCACAACATTGACAGCCTTGGACAATGTGTCAACGACGGATGTTGCGAAAACCTACTACATTACCTACACCTATTTAAATACTTATTTAGAAAAAGATAGTACCGTAGTAAAAACTAATATCGCCACGTCATCGGATATCGTTTACGAAAAAGTGCCTGTAAAGAAAACGGCTACTAAATCGGTGCCAGTCACCGCGACACCAGAACCGATTCAAATTTTCGCAACAAAAACTTATTTAACGACCTACACATATTTCACCACACTTCTGCAGGCTGGAGTGGACGGCGAAACTTCTACAACCGTTTCATCACGTACCCAAATAGTGGAGAATGTCGTCACGGAATCTATCGCTCCGAGTTTACTCGAGCCCGGCTACATGAACGCTCTACTTACGACTTCGCATCATTCGGATACTTTGAAAAACGTGGTCACTGGATCGACCATCATTTTCTTCGACGATGAAGAAGAGGTGACTCCCTCGACGACAAGCGCTTACGATCAATCAGCAACGGCCTCTATCGATAAGATAGATGACCTCGAAACTGCATCCACTTCTTCGATCGCGGATTCTGTTGGTTCGGAAACAAACGAGGCTACTTCTAATCCTATCGAACCGGATAATTCTCACAACGAGGAAGAAATACCAGAGGAATCGACAAATCCACCGCCGAGTAAAAAGCCGTCTCAAGTCAGCAATCTGCTGAGCTTGGGCTCTTTGGGCATAAACAGTCTGTCAGCTTTGGGTCCAGTAATCACAGCCATGGCTGGTTTGCTTCAGGGCAAAACTTCAGCCACGCGTAGAAATGACACTGAGCCAATAACCGAACCACCGGTGACTACGACTCAGAGATCTCCGATTTATATACCAGTTGCGGAATTCGTCGACGGCGACATCGAGACAGCCGAGAGTCAGAATATCGCGCTTCACTTGGCCAACAACAATCATCTTCCGGAGACAAGACACAAGGTGACCGCAAGTTTAGCCGATGGCATACCGATATCTCCCGGAGAGGTAATAACAGCGAATAGCGACGTTATCATCGGAAAGCCAGGAAAAATGGGCCCAAGACCGCCTCAGACATTTGGCCAAGATGAAAACATCGGTATGAAACCTCCGCCGGTATCCGTGCCGAACATTCCGGTGCATCCGGTTCTAGAAGTGGTCGATAACGAGCCTCCCAAGCTTCCGATAAAAATTCAAAAAGGTTCGCAACAGCTCGAGATTTATCCCGCGCATCAAATTTACGAAGAGCACATTAAGGGACCGATAGTTTTTCACGACCCTGCTATTTCCCAGCGACCGCAACAACATCAGCAGCAAATTTATTCCAACCCAGCACCCACGAGGATCAGCTCGAAACAGGATATTCTGGAAAACGACCCACTTCTCATTCCTCCTTCCAGACCGTTGAACTCGAACATTGACTCGAAGCATCGTAACTCGGATAAAAATAAGCGACCGCCATGGAGTCCACAAGATCCTCTTGTCACGTCGAATGTTTTAATCGATGACAAGCATCGATTACACTCGAAGGAATTCGTCAAAGCGGATAACGTCGGTAGCTATGATGGATCGAGAATCAAAGCACATGTGCCGGAGCCTATTGTTCATCAAGTTCCCCACGTCATCGATAGATCTACAGGTCAGCCGCTTTTAGTCAATATCCAGCCTAGTCAAGTAGCCAACGTTGTTATACCGCAGGGCGGTACTCAAGCTCTAATTTTTGGAGATACCAGTGAACCGCACATTTCCGGACAATACTTCGACGATCCATCGCCGTATCCCGAGCCGGAAGTTGGACCTGGCTTCATTGGAATAGACAAGGTTGAAAATCTCCCGCAGTATTCAAAAAATGATGATCCCGCAGACTACATGAGACCACCAGCACTTCCAGCGAACCAAATTCAAAATCTGGCCAATAGACACCATTCCATACCGATGTCTCAAGGCTCTGAAAAAGTACCGATTCGCTATCACGACAAATTGAACTTATCCACCGGTCAAGGACACGTTCACTCGGAAATTCTCGTGCATCACGGAGCCGAGACTGGCCACGTCAGACCTCAGACGAGACCAGCCACCAACCCTCCGCGTAACTACCAGAACGTGCAGTTTCCTCCAAGGAGAGAGCACACGAAAATTCACCTAGGAAACGAGCAGATTCCACCCCGAAGAACTGAAGTGCCTTTACCAACTGAAAATTCATTCTACGACTCGTCAAATGCAGGCTCGCTGCACACAAATTGGAAAAACGAAAAGAAACCTCTGCTGAATCAGCAAACCCGTTGGCCAAGGCCAACTAGTAGGCCTCGGCCTCCGACCAGAATAACGGCTCGTCCTTTCGGTCGACCACCTACGCGAATCGAAAGACCAAAAATTCCCATCAGGCAACAGTCTCCTGTCAGATTGCCGCAGAATATTTACTACCAGCAGCAGCAGCAGCAGCAGCAGCCAAGTGTAGCGCCTTTCACGGAGTCGAATAAGCAGATATACAATCCTGCTCACCAAGAACAGGCTGCCGGTAATTCCGCCAATAAAGTACACCACTATCACGAGGAAAATTACAAGATTCCATCGCAGGAGCCGATAAACAATCAGTCGCAGCAGGAGAACGTCGACTACATCACCGATGATCACGCAGGCGATTTCGGCTATAAAAATCCTTCGTCCAACGATCCCGAGCATGGCTATTCCGACAAACCGATCGATTCCGAGGTCGATGCCTCGGAAAACGTGAAAGTGACATCCGACACCGAAGAGAGCCATCACGGACATTCCAACTCGAACAAATACGAATCCGGCGTCAGAAACGAGGGGAACAATTATTACACTCGAAACGACAGCGATAACGGCCCAGTACCGTCGTCGATCGATAAGCATGTTTACAACAACAGCCACGATGAAGGCGATATAATTATCGGCAGCGAGAAGAAACAGGATCAATATTTAACGCAAGCTCATAACTCATACGACACCGGTATCGATAATGGACCCGATCGGAAGCAGCCTGGCTACTCACACGAGATCATAGATCTGAAACCGCCAGCGATAATACCAGTATTCGAGCCAAATGGTCACAGTAGGCCGTTCTCCAAACCTAATTTCGCCATATCGAATTCGGACAGCAACTCCGAAGTAATAACTGAATCTTCGGATATAAATCACGTCTCGCAAGTGAACGTGCGACCGGTTCTCGGGCAAGTTTTCCAGATTCAGAGCCACACAAAAGAACGAGAAAGCCAACAACAACAGCACCGAAATAAAACTTACACGTACAGAGGACGACCGCAATACGATCACGCGACGAGAACTCATCCTGACCACCCGCAAATAATGACGAAGCCGAAACCACAGGTACCAGGACCGGTTGTTATTTCCAGTGGAAAAGTTCAAAAACCAAGCAACCATCACAGAAAGTCGACCGTGCAGTTTTCACTGCCGATAGACTCGACCGGCGAGGAAAACGAAAGTCAGACACAAACGGAACGAGCACCGAGTGCTAGTACCGGATTGAAGAAACCCACGGAAGAGTCTCTCGCCACTGCCTTCCAGACTAATTTCGCCAGCACGGATTCCAAGGAAGACAGCGATAACGACAGGGAAGATGGCTCAAAGTCGGGCGATCTTTCTCAGGACATGGTACCACCACCCGTTGGTAACAACCAAGGCCAGATGGACGAAGGACTGAGACCACCACCGTCACCCACCGACGTTCTTGGTCTCTCGCCACCTCCGGTTGACATCACGTCAATAAGACCGACCACCACTTCTATGGCTACGGTAATGGCGAGTTCGACTACGACGCCGGTGATGACCACAACGGCGAGCACTCGGCCTATCGACAACAAGTCCAAGATAAAATCTGACGTCTCTGGTCTTAAGCCTCCACCGCTGTACATCCCCTTGAAGGAATCCAGCGCAGCGCCTCCTCTGCCTAGCGTCAATATGGTACCACCAAGCCCAAGGCCGTCAGTGGTACGACCGTATTTGGCCGATATTCTGTCCCAGGACATGGTACCACCTCCCCCGGCTGTGAAAACAACTCGACCGCTTGAAATAGCGACCGTTCGACCGGGAGTGGCAGTATCGGGCTCCATTCAAATTGGCACCGCTGTGGCGACATCTCACATACCGGTGATACAGGATATCGAATCAAAAATTCCGATTGTGCACGGAACTGTGGACTTACCGGTCGTCGTGAACGTGCCAGAGGAGTTCCTTAAACCGATCGACACAAAGAAACCGGATCGTGTCAGTGTCTATACTGCTCGGCCGTTCGAAACTAAACACAGACACGTTGTGAAACCCACGATCGCTTCGATAAGCCCGTCGAGTTCCACTGCAGTCAAGCATTCTCAGTACTACAACGATCAGCACAGTTTCCACAACAATCACCAGAGACTGTCGCCCACGAGAATAGCGCCATCTACGATTTCGAGAAAACCGAGTAAAGTACCCGAGTTTTCCATAGTATTGGAGCCAAGTGTTGACGTTCACTTACCATCGACGCGTATCGAGCCGACCGAGACGTTGACCGTTAATTACAACAGTAATAAGAGAACAAAGCAGAGGACTGAAATAGATGACGTGACGATGGTCGCCAACGTTGAAAATTCGGAAATCACAAGAAACATCGCCAAGACCAGCTTAGCTACGAAGCAGGAAACGAAGAAAACGGATCCAAAGATAAAGGAGTCGGCATCGAAAATGATGGAGATCATCGGTACAGTTGTGCACGAGTTTATTCAGAACTCGACCAACGAAAGAGTGGAAGTGAAGCCACAAGAGGTTACGCGCTTCGAGACGCTGACGGTCACGAGGACAGAGACTTCCGTAGTGGGTTCACCTCCCACGACGCGAACTCTGCTAATTACCCACACCTTAACATCGGTGAGGATCGAGACCGTTACAAAGACTCTACTGCGACCTACTAGCGTCATTTCTACAATCACGTCAACAATACTTCAATCGGTTACACGCTCGCCTGGCTATGAGAATGGAAACGATAACGAGTCGATCTTTGTCGTGATGAGCGATCAAAAGCCACCCGCTGCTGGAGCGGAAGAGGTGGAAGCCGAGTACGGAGAAGAAGAGATTTCTCGCGACGAGCAGGATGTGAGCGGCAACGAAATTCATCGCGTTCTGTCCGGAGGTATACTGGGAGCACCATCGGTTCCAGTACGACCACCGAAGGTTCAGTGCGTACCGGAATGCAAAGCTTCGAAGTCTGAGATTTGCGCAGAGTCCAATGGAGAGATGCGCTGTGTCTGCAGACGAGGCTTTGCACGAATGTTCCCGGATCGTCCTTGCAAGCCGACCTACACATTCACCGTCGGAGTTGGTCTCGAGCGACTTGGTCGCGACCGCATCGTCTTCGATTCAGCGATGAACGACACCAGCTCTATGGTATTCAGACGTTACGCTCACCCCATCAAAGAAGCTTTGGATCGCACGCTGATGCAAAGCGATTTAAGGGACGTGTATCGCGCTCTGAACATCGCCAAGTTTACCAGAGACCCACCCAAGGTTGTGTTCAACGTGCAGTTGACCGCTAACAGCGACGAAACAAGGTTAAAGGAGGTTATCCGTAAATATCTGGTTATGAGCAACTACAGCCTGGGAGGAACCGAAGTGTACGCGTCGAAAGATCTGGGAATGGTGGAAGCGATCGACTTTGACGAGTGTGGCGTCGAAGAGGGAGGACCTCATCACGACTGTTCGCCAAACGCGGCCTGTTTCAATCTCAAGGGCTCGTATCAGTGCTCCTGCAAGGAAGGCTACGCAGATTTGTCGGAAAATCCGGCCTACCCCGGCCGAGTCTGTTCTCAGGCTCCTCTGGGATGCGCCGCGTGCAACAACAAGGGACACTGCGCCGTCAACGTTCACGGACAGGAAGTCTGCGAGTGCTTCGCTTGGCACAGTGGCCAGAAGTGTCAAATTAACCTCAAAGTTCTGCTGATCGCTCTGGTGACGACGGGAACGATCTTGGTGGCTCTGCTGGCGGTCTGTCTGGGTATGGCTTGTTTCCGCAATCCAAGCCGCAGATCCAGATCAGCAGCATGCGACAGAAGAGCGATGATATCCGGTCAAGGAGCCGGTGACACGAGCAGCGAAGGAAGTCTCGCTGAACTGGCCATCCCTCATCACGTACCACACATCCTTCCACCCCCTCCTCAGATGGTGGCGCCAGCGCCCCCTAGCGCCAAGAGGCCCGCGCGCAAGATCAGCAACAACAAGGCACGACGAGCACCAAGGAAACCCGTTCCGGCCGCACCTATCATAGCTCCAGTGGTCGCGAGTAATAACAACAGCTGTCCGTCGAACGATCAGCGCGACCGATCACTGACGGTCATGATTCCTCGGGCAAAGTACCGCTCGGCGCCACAATCTGCCACACCGCAGAACTACAAGCCTATGAGCAGCTTCGCCGTCGAGGAGCACAAGCTGATCGACTATCTCGAGGCCGGCTGCGATTACAAGTCCGAACAGGACTCGAGGTTGGCCAAACAACAGCAGCATCAGCCGACGGGTGCGCTCGTCAGTGCTGGCTTCCAAGTGTCGGCGACGGTGACGAGGACGTTGGAAGCGTCGGCCGAGTCGACGATCGAACCGTCGAGCAAGAGCGAACACGAGCTGGAGACGACGATACAAGCCTCGACGAAACAGCTGATGCGACTCGATCTCGCGGACGCGGGTTCGACCTTGGCCAGGTCCTGCGGCGAGACGACGATTCAGGCCCCGACGAAGATGGCCGAGCACCGGAATAAAGACTGCAACAGAGACGCTAGAGACAGCGCGAGCGAGGGACACACTATGGCCGAGAGGGATCTCGGCAGCACGTTGAGACTGCCCGCACAGCACGCGCCGCTTTACCATCAGGATAGGTCGAGCAGCACGATGCCGGAGAAGGAAGTGGCTTATCACCAGGAGACGTTCTCTCTGCTGCCGCCACCGCCGGCGCCACACCTCAACCACCCGCACCACTCGGCTTTTCACGCGGCAACCAGTTTCCAACCTCATCATCCCTCGGCTTTTCAGCTGCAGACTGGACCGTCGGTAGCAGCAGCCGCCGCTGCAGCCGCGGCCGCGGCCTGGGAACATCACGCCGCCGCCCTCGCTTATAACGCCCTGCCTCCTCACCCTCCATTATCGGGTAGCACGAGTTTGGCAAGTCCATCGAGAGGAAAGGAGAACGGCTCATCTACAGCCGAGCAACCTGGGGAAACTGGCAACAGCAGCTCTGCCGCCGGAACGACCGAAGCAAACTCGGCAGCAGCGGCGGCAGCAGCGGCAGATTTCCTGCGCCGTAGTCATCCACTTTCGGAGCATACGGCTCAGCTTCATCCAACATATCGGCTCAACTACATGGACCACCTTTATCATCAGCTCCAGCACAGCCCCAGCGCTTCTTTACACGGACTGGGCGCTCTGGGGCCAGAGTACCTCCTGCACGCGGCAGCCCCTGGCAGCACGATAGCCTCCTCGGAATTTCCCTTCTCGATCGACGGTTCGAGACTGGGCAGTCCCAGGGCCTCGGCGATACGAGCCAGTCGTAAGAGGGCACTCAGTAGTTCGCCCTACTCGGATCGCTTCGACATCGACAGTATGATTCGCTTTAGTCCCAACAGTCTTGCCTCCATTGTCAATGGCTCCAGGAGTAGCAGTGCCAGCGGAAGCTACGGACATCTCTCTGCCGCGATGAGTCCGGCATTGGGTATGCATCCTGGCATGGCGCCGCATCTTCAGCAGATCCAGGCGCAGTTGCTGCGCAACGCTGCCGTCGCGGTTCTCCACGGCCATCCCAGCCCCGTTCACCCACACCTTCACCCCCACTCCCATCCACCGCATCCACACGCCCATCACCATCCGCACCCCCACCCTCACACTCACCCTCATCCGCACGCGCACCCACATTCTCAGCTCTACCCCGTGTCCAGCCACGTGATGCCCCCGCACACGGCAGCCCCCACTGCACCGGCCCCAGTGCCACCTAAGACTGAGATCCCGGCAGCTGCGGCCGAGTCCAGCAACAAGTCCGTCACCGCCGAAGCCGACACCTCTTCCAAGCGCGGCAGTAGTAGCTCCAGCAAGGTCAAGCGCGAGCCGGCAACCAGTACTATACCCGCCGTGTCTCACCCCCAGGGCCTAAGTCCCAGCGATGATCCCCGCGACGAGCCCGAGCCCGGCGACTTTGTCGAGACCAACTGCCACTGGAAAGACTGTGGAATGGAGTTCGCTCATCAGGAGCATCTCGTCGAGCACATTACCGAAGATCATATCAAAAAAGATAAGAAGGTCTTTATCTGTGGCTGGGAGAACTGCTCCCGCGAGGAAAAACCCTTCAAGGCTATGTACATGCTCGTCGTGCACATGCGACGGCACACCGGTCAGAAGCCCCACAAGTGCACGTTTGAGGGCTGTCAGAAGGCGTACTCGAGACTGGAGAACCTCAAGACCCACCTCAGGTCCCACACGGGAGAAAAGCCCTACACCTGCGAGTATCCTGGCTGCCACAAGGCCTTCAGCAACGCTAGCGATCGAGCGAAGCACCAGAACAGGACACATTCCAACGAGAAACCGTACGTCTGCAAAGCTCCGGGTTGTACCAAGCGCTACACGGACCCATCCTCGCTACGTAAGCACGTCAAGACAGTGCACGGGCCCGAGTTCTACGCGAACAAGAAGCACAAGGGAGGCGGTAACGGCGACGCCCCAGGCAGCGACGAAGCCGGCCATGGAGGTCACAGCAGTCCCAGCCGCAGCGAAGATCTGCACCCGAAGACGCCCAGTTTGTCGAGCCCCAGTGTCAAGTCCGAAAGCGAGGCCAACAGTCCTCCGAGTATGATGAACCAGCACGGCAGCCCGTTGTCCATGCACGCAGCTGGCTGCACCGAGGACGGCTCCAACATGGTCATCCCCGGAGACAACGTACTCCTACAGCCGGATGGCAACTGGAACGAGGAGCCCGAAGACCTGGACATCGCTGATTTGCCACTCGCCATTCGCGCCATCGTTGGCGGATTGGACTCACACCAGCAGATCCCAGCGGGCTCGCGGAACCGCTTGAAGAATCGATTGGGTGCCAAAGCCGGCTCGATGTCGCCTGCGTCGCTGATGTCCGGCTCGAATCTGCGCGTAAGTCGCGGTCTCGGCGATCTCAACAGGCGTATCACCGATCTCAAAATGGAAGGCGGCGCGATCACCAACCGCCAGACCAGCCTCTCCGATCTGCAGCTCAGGCTGCAACCCAATAGCAACGAGCCAAGGCGTGACAGCAACAGCACCGTCAGCACTTACTACGGCAGCATGAAGTCCGCGGACTTTGGCAGCAGCCGAAGAAGCAGTCAGGCCAGCGGTGTCAGTGCCATCAAAGGTCCAGCTTTCGGCCCGGGAAGCTTCTACGATCCCATCAGTCCAGGCACGTCCAGAAGGAGCAGCCAACTGAGCACCGCATCCACGAGATTCGGCAATCTTTCCTCGCAGCTGCAAGTACCATACTCGACCAGCAACCTCGTCGTGCAGACGCAGAACATGTCTCTCCAAGGTGCACACAGCAATCCAAACGAGTGGGCGATGCCAGGTCACTGTGCTCAAGGGGCAACAAGTGACCGTCGCATGTCCGAGCCAGCTCGTGGTCACCAGAGCCAACGCGTGTCACCGCCGATGCCACCGAGGCCGCGTTCGGCGCAACTTCCAGATCTTCATCCCAACCAGGAAGTCGTCCTCGATGAAGTCGGCGAGGGCGAGATGGTCGAAAACAAGCTCGTCATTCCTGACGAGATGATGCAGTACCTGAACCAAGTTCAAGCAGGTGGCAGCTCGCAATTCAGCTACCGCGGCAGTCCACTGCCGTACTGCAGATCTCCCATTTGTACGAATCCCCAGCACTGTATGCAGAGATCACAACCTCAGTGCAATTACAACCCACAGTGCTACAACCCGTCGAACCAACAGAACCAGATGCCCAACCACAACAGCAACGGTATCGGCTCACCGGCAGACTACAGTTCCGTCGGATCTCCTTACTCTCAGTGTCCAAATTCTCGCACTGGCCAGACTCAGAGTTACTGTCCAACACAGAACTATGGACCTCAGTTGTGCTCGTCGCAGCTGCCCCGTCCGGGACAAGTGATGTCACCCAGCGGCTCGCACTACGCACCGAGTCAGTTGAGCGAGCAGCCCATGACATCTCCAGCTGCCGGAGCGTTGGCTCCGCCTCAGGGAATGCAGAACCTTCATCAGAATAGTGCACAGATGAGCCGAGTGAACTGCCATGGCCAGAACGCCGAACACAATGGCTACTACTCCGGTTACGGCTGCCAGACTCAAATGGGCCAGAATTGCGGGGCTCAGCAGATGCAGGCTCAGCCTTGCTCTCCTATGCACCCCAGCATGGCTCAAACCTGCGGACACCCGCCGAGAGCTCCCAGCAACGGCAGCAGTCACTGTCAACCTATGTCTCCTGTCTGTCAGCAACAACAGATGCAGCAGCAGATGCAAGCCAACCCGCGACCCATGTCCGGCCATGGCACGATGCAAAACCAGTGCGGTATGCCGAAACCGATGAACGTCATGACACCAGTCTTGAGTCCCGCAATGTCCGACCAGTGCCCGAGGTCTGTCGGCTCGCACTGCTCGCAACCGAGCCTAGCTAGTCAAAACGCTGCAAATTCAGTGCACCCCGCAGACATGCAGGTCCAGAGTCCCTGCTCGCAGATGTCCACTGGAAGCTGCGTTCATCCGAACACAATGGCCAACCATCGCCCGATGAACCCCCAGAACATGTACCCCAAGGAAGCTCCCAAGACCTGCCAGTCCAACAACTGCCACGCGCAGTATAATTGCTGCCATATGCACCAACATCAACAACCTCAAGGTCAACAGCAGCAGCGAATGAACGGATGCGGCAACGAGTGCCAGTGGGTATACAACAACGAGCAATGCTGCATCGGTGGAATGGCTCATCACGGACACGGAGCAGCGGGACACGTGCCAGAGATTCAGTGCCGTGACATAAGCCAGTCCCAGGGATCGCCCGTCAAGCCTCCTCAAGGCATGCGCCAAGACTCTTATCGCAGAACACTCGAATACGTGCAGCAGTGCAGAAATTGGTCAGGCAACATGTCGGCTCATGCTCCCGAAGCGAGCGTCTCCAGTTCGACGCATCCAATGCAGTTGCCACAGCCGCTGCCAGCGAGCGCCAACATGATCGTTAACGACATGACCTCCTCTCTAAGCTCACTGCTCGAAGAAAACCGGTATCTGCAGATGATCCAGTGA
Protein Sequence
MPTMESSTAITSFDFLDDAQSNEEVEDEGRRFNLATRVMSNGVEVIVAGTTPAYRWENSNPQPTLTLSEAVVMLLPQDKPNEFVTKTCTTTFTYLNTITRDGTTIVSTDEQVIANTATEERHRKPVSEAASVTLEASPTLQTEVFKTTYTYLTLNTDHPDENNALGSSTRVITNTVTAPQYYLDMVLEPSETQQPETNTYMSTRALEKTYIEDGKTRVEMTHDIVTQLIITESAPPPKAVSVTTTLTALDNVSTTDVAKTYYITYTYLNTYLEKDSTVVKTNIATSSDIVYEKVPVKKTATKSVPVTATPEPIQIFATKTYLTTYTYFTTLLQAGVDGETSTTVSSRTQIVENVVTESIAPSLLEPGYMNALLTTSHHSDTLKNVVTGSTIIFFDDEEEVTPSTTSAYDQSATASIDKIDDLETASTSSIADSVGSETNEATSNPIEPDNSHNEEEIPEESTNPPPSKKPSQVSNLLSLGSLGINSLSALGPVITAMAGLLQGKTSATRRNDTEPITEPPVTTTQRSPIYIPVAEFVDGDIETAESQNIALHLANNNHLPETRHKVTASLADGIPISPGEVITANSDVIIGKPGKMGPRPPQTFGQDENIGMKPPPVSVPNIPVHPVLEVVDNEPPKLPIKIQKGSQQLEIYPAHQIYEEHIKGPIVFHDPAISQRPQQHQQQIYSNPAPTRISSKQDILENDPLLIPPSRPLNSNIDSKHRNSDKNKRPPWSPQDPLVTSNVLIDDKHRLHSKEFVKADNVGSYDGSRIKAHVPEPIVHQVPHVIDRSTGQPLLVNIQPSQVANVVIPQGGTQALIFGDTSEPHISGQYFDDPSPYPEPEVGPGFIGIDKVENLPQYSKNDDPADYMRPPALPANQIQNLANRHHSIPMSQGSEKVPIRYHDKLNLSTGQGHVHSEILVHHGAETGHVRPQTRPATNPPRNYQNVQFPPRREHTKIHLGNEQIPPRRTEVPLPTENSFYDSSNAGSLHTNWKNEKKPLLNQQTRWPRPTSRPRPPTRITARPFGRPPTRIERPKIPIRQQSPVRLPQNIYYQQQQQQQQPSVAPFTESNKQIYNPAHQEQAAGNSANKVHHYHEENYKIPSQEPINNQSQQENVDYITDDHAGDFGYKNPSSNDPEHGYSDKPIDSEVDASENVKVTSDTEESHHGHSNSNKYESGVRNEGNNYYTRNDSDNGPVPSSIDKHVYNNSHDEGDIIIGSEKKQDQYLTQAHNSYDTGIDNGPDRKQPGYSHEIIDLKPPAIIPVFEPNGHSRPFSKPNFAISNSDSNSEVITESSDINHVSQVNVRPVLGQVFQIQSHTKERESQQQQHRNKTYTYRGRPQYDHATRTHPDHPQIMTKPKPQVPGPVVISSGKVQKPSNHHRKSTVQFSLPIDSTGEENESQTQTERAPSASTGLKKPTEESLATAFQTNFASTDSKEDSDNDREDGSKSGDLSQDMVPPPVGNNQGQMDEGLRPPPSPTDVLGLSPPPVDITSIRPTTTSMATVMASSTTTPVMTTTASTRPIDNKSKIKSDVSGLKPPPLYIPLKESSAAPPLPSVNMVPPSPRPSVVRPYLADILSQDMVPPPPAVKTTRPLEIATVRPGVAVSGSIQIGTAVATSHIPVIQDIESKIPIVHGTVDLPVVVNVPEEFLKPIDTKKPDRVSVYTARPFETKHRHVVKPTIASISPSSSTAVKHSQYYNDQHSFHNNHQRLSPTRIAPSTISRKPSKVPEFSIVLEPSVDVHLPSTRIEPTETLTVNYNSNKRTKQRTEIDDVTMVANVENSEITRNIAKTSLATKQETKKTDPKIKESASKMMEIIGTVVHEFIQNSTNERVEVKPQEVTRFETLTVTRTETSVVGSPPTTRTLLITHTLTSVRIETVTKTLLRPTSVISTITSTILQSVTRSPGYENGNDNESIFVVMSDQKPPAAGAEEVEAEYGEEEISRDEQDVSGNEIHRVLSGGILGAPSVPVRPPKVQCVPECKASKSEICAESNGEMRCVCRRGFARMFPDRPCKPTYTFTVGVGLERLGRDRIVFDSAMNDTSSMVFRRYAHPIKEALDRTLMQSDLRDVYRALNIAKFTRDPPKVVFNVQLTANSDETRLKEVIRKYLVMSNYSLGGTEVYASKDLGMVEAIDFDECGVEEGGPHHDCSPNAACFNLKGSYQCSCKEGYADLSENPAYPGRVCSQAPLGCAACNNKGHCAVNVHGQEVCECFAWHSGQKCQINLKVLLIALVTTGTILVALLAVCLGMACFRNPSRRSRSAACDRRAMISGQGAGDTSSEGSLAELAIPHHVPHILPPPPQMVAPAPPSAKRPARKISNNKARRAPRKPVPAAPIIAPVVASNNNSCPSNDQRDRSLTVMIPRAKYRSAPQSATPQNYKPMSSFAVEEHKLIDYLEAGCDYKSEQDSRLAKQQQHQPTGALVSAGFQVSATVTRTLEASAESTIEPSSKSEHELETTIQASTKQLMRLDLADAGSTLARSCGETTIQAPTKMAEHRNKDCNRDARDSASEGHTMAERDLGSTLRLPAQHAPLYHQDRSSSTMPEKEVAYHQETFSLLPPPPAPHLNHPHHSAFHAATSFQPHHPSAFQLQTGPSVAAAAAAAAAAAWEHHAAALAYNALPPHPPLSGSTSLASPSRGKENGSSTAEQPGETGNSSSAAGTTEANSAAAAAAAADFLRRSHPLSEHTAQLHPTYRLNYMDHLYHQLQHSPSASLHGLGALGPEYLLHAAAPGSTIASSEFPFSIDGSRLGSPRASAIRASRKRALSSSPYSDRFDIDSMIRFSPNSLASIVNGSRSSSASGSYGHLSAAMSPALGMHPGMAPHLQQIQAQLLRNAAVAVLHGHPSPVHPHLHPHSHPPHPHAHHHPHPHPHTHPHPHAHPHSQLYPVSSHVMPPHTAAPTAPAPVPPKTEIPAAAAESSNKSVTAEADTSSKRGSSSSSKVKREPATSTIPAVSHPQGLSPSDDPRDEPEPGDFVETNCHWKDCGMEFAHQEHLVEHITEDHIKKDKKVFICGWENCSREEKPFKAMYMLVVHMRRHTGQKPHKCTFEGCQKAYSRLENLKTHLRSHTGEKPYTCEYPGCHKAFSNASDRAKHQNRTHSNEKPYVCKAPGCTKRYTDPSSLRKHVKTVHGPEFYANKKHKGGGNGDAPGSDEAGHGGHSSPSRSEDLHPKTPSLSSPSVKSESEANSPPSMMNQHGSPLSMHAAGCTEDGSNMVIPGDNVLLQPDGNWNEEPEDLDIADLPLAIRAIVGGLDSHQQIPAGSRNRLKNRLGAKAGSMSPASLMSGSNLRVSRGLGDLNRRITDLKMEGGAITNRQTSLSDLQLRLQPNSNEPRRDSNSTVSTYYGSMKSADFGSSRRSSQASGVSAIKGPAFGPGSFYDPISPGTSRRSSQLSTASTRFGNLSSQLQVPYSTSNLVVQTQNMSLQGAHSNPNEWAMPGHCAQGATSDRRMSEPARGHQSQRVSPPMPPRPRSAQLPDLHPNQEVVLDEVGEGEMVENKLVIPDEMMQYLNQVQAGGSSQFSYRGSPLPYCRSPICTNPQHCMQRSQPQCNYNPQCYNPSNQQNQMPNHNSNGIGSPADYSSVGSPYSQCPNSRTGQTQSYCPTQNYGPQLCSSQLPRPGQVMSPSGSHYAPSQLSEQPMTSPAAGALAPPQGMQNLHQNSAQMSRVNCHGQNAEHNGYYSGYGCQTQMGQNCGAQQMQAQPCSPMHPSMAQTCGHPPRAPSNGSSHCQPMSPVCQQQQMQQQMQANPRPMSGHGTMQNQCGMPKPMNVMTPVLSPAMSDQCPRSVGSHCSQPSLASQNAANSVHPADMQVQSPCSQMSTGSCVHPNTMANHRPMNPQNMYPKEAPKTCQSNNCHAQYNCCHMHQHQQPQGQQQQRMNGCGNECQWVYNNEQCCIGGMAHHGHGAAGHVPEIQCRDISQSQGSPVKPPQGMRQDSYRRTLEYVQQCRNWSGNMSAHAPEASVSSSTHPMQLPQPLPASANMIVNDMTSSLSSLLEENRYLQMIQ

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-