Basic Information

Gene Symbol
-
Assembly
GCA_905115235.1
Location
NC:27124197-27167208[-]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 40 0.039 2.1 8.6 3.7 1 23 166 188 166 188 0.97
2 40 4.9e-05 0.0027 17.7 1.6 1 23 250 272 250 272 0.98
3 40 0.34 19 5.6 2.0 1 20 278 297 278 300 0.83
4 40 0.00012 0.0067 16.4 5.6 1 23 306 328 306 328 0.98
5 40 1.5e-07 8.2e-06 25.6 0.4 1 23 334 356 334 356 0.95
6 40 1.8e-05 0.00098 19.1 0.3 1 23 362 384 362 384 0.97
7 40 4.9e-05 0.0027 17.7 1.7 1 23 390 412 390 412 0.98
8 40 1.2e-05 0.00068 19.6 3.2 1 23 418 440 418 440 0.99
9 40 9.8e-07 5.3e-05 23.0 1.6 1 23 446 468 446 468 0.98
10 40 7.9e-06 0.00043 20.2 0.1 1 23 474 496 474 496 0.97
11 40 0.00036 0.02 15.0 3.7 2 23 773 794 773 794 0.98
12 40 2.9e-05 0.0016 18.4 1.1 2 23 806 827 805 827 0.97
13 40 0.0003 0.016 15.2 0.6 1 23 834 856 834 856 0.97
14 40 4.4e-05 0.0024 17.8 0.3 1 23 873 895 873 895 0.97
15 40 4.9e-06 0.00027 20.8 0.7 1 23 904 926 904 926 0.99
16 40 2.6e-05 0.0014 18.6 0.4 1 23 932 954 932 954 0.99
17 40 0.042 2.3 8.5 7.8 1 23 960 982 960 982 0.98
18 40 2.7e-05 0.0014 18.5 0.4 1 23 988 1010 988 1010 0.95
19 40 0.022 1.2 9.3 0.5 1 23 1016 1038 1016 1038 0.97
20 40 1.1e-05 0.0006 19.7 1.7 1 23 1044 1066 1044 1066 0.98
21 40 2.5e-06 0.00013 21.8 0.5 1 23 1072 1094 1072 1094 0.98
22 40 3.1e-06 0.00017 21.5 1.8 1 23 1100 1122 1100 1122 0.99
23 40 0.00035 0.019 15.0 0.2 1 23 1128 1150 1128 1150 0.98
24 40 0.00033 0.018 15.1 1.0 1 23 1359 1381 1359 1381 0.97
25 40 8.7e-06 0.00047 20.1 2.2 1 23 1398 1420 1398 1420 0.99
26 40 3.3e-06 0.00018 21.4 0.8 1 23 1430 1452 1430 1452 0.98
27 40 2.3e-05 0.0012 18.7 0.1 1 23 1458 1480 1458 1480 0.98
28 40 2.5e-05 0.0014 18.6 0.4 1 23 1486 1508 1486 1508 0.97
29 40 1.2e-06 6.8e-05 22.7 0.2 1 23 1514 1536 1514 1536 0.98
30 40 0.1 5.5 7.3 0.7 2 23 1543 1564 1542 1564 0.94
31 40 0.0003 0.016 15.2 1.0 1 23 1570 1592 1570 1592 0.98
32 40 0.0005 0.027 14.5 0.7 1 23 1598 1620 1598 1620 0.98
33 40 1.4e-05 0.00075 19.4 3.3 1 23 1626 1648 1626 1648 0.98
34 40 8.7e-05 0.0048 16.9 0.2 1 23 1654 1676 1654 1676 0.97
35 40 0.00026 0.014 15.4 2.3 1 23 2043 2066 2043 2066 0.97
36 40 7.1e-07 3.8e-05 23.5 1.1 2 23 2077 2098 2076 2098 0.98
37 40 0.00011 0.0057 16.6 3.9 1 21 2104 2124 2104 2126 0.96
38 40 4.7e-06 0.00025 20.9 3.3 1 23 2132 2154 2132 2154 0.98
39 40 0.00085 0.046 13.8 3.0 1 20 2160 2179 2160 2182 0.95
40 40 5.6e-05 0.0031 17.5 0.6 1 23 2188 2210 2188 2210 0.98

Sequence Information

Coding Sequence
ATGAGCCACTCTTCATTCTCCAACATATGTCGAGCATGCAAATTCcaaaattcaacattttttaagATCTTTAATCAAAGCAGTGAGAATTCAGTTCAAATATCAACAATGTTTTCATCTATCTCTACTCTCAATATCGACCCCAATGATCAACTTCCCGACTTAATTTGTCCAAGCTGCTATAGTATACTCTTAAGGGCTTATGAATTCCAAGAAATGTGCATTAATTCTGATAAAGAAATGCGTGCCGAACTAGCAAGGATTTCTGAAATAAGCGAATGCCAACCGTTTAATGAACCATTCAAGGACGATCCACCAGAAGCGAATTCGATGTTCATGCATGATGGACAGGAGCAGGGAGAAACTTTTGATGCAACGAGCTCGAGTATTAAGGGTGGTGAAAATGTTACGTTCGAGAATGCGTCCGGAGACAAAAGTTCCATTGAAACTATCAAAGAGACCCTCGTAAGTGAAAGCTTACTGAAAAAGCCTCGACATATCTGTAAAATATGCAATCGACAATGGAGGACAGAATCAAGCTTTTATCGACATTTGAAAACTCATGGGGTCAAGACGGAAGACAACTCACAAGAACAACCAAAAATTACTAATACATCATATGAAGATTCAAATATGTGTGACATAGAAGTTAAATTAGAACGCAATGAGTATTCAGAAGAATTCAAGAACCAAGATTTTGCAGCACCTAAATTGGAGAAACGCGACCACCATTCGGAAAATAAACGATATTCATGCGATTCGTGTCCAAAGACATTTTCCCAGCTCAATGTATTAACGCGTCATCGGAAAACTCATGGCGAAACTAGAAATTTTCAATGCAATAAATGCGATAGAACATTTTTACTTCAAGTTCAATTGGATGAACATTTGAATTGTCACAACGGCCTTAGGCCCCATGTCTGCCCTATTTGCCATAAAGGTTTTCAGCAATCATGTACGCTCAAAGATCACATTAGGACACATAACGGCGAAACaccttttctgtgtccggaatGCGGGAAAGCTTTCAACAATAGCAGCAACTTGAGACAGCATTTACTACGCCATAGTGGCGTGAAACCATTTGCATGTCCTGAATGTCCTAGTCGTTTCACAACTAAAGGTGGCCTGAAATCTCACATGACAACGCATTCAAATGAAAAGCCGTTTCGCTGTGATCAGTGTGATGCGAAGTTTACTAAACCATATTCGCTGAGTAAACACAAACGCATTCACACTGGTGAGAGACCTTATTCCTGTGAGGTTTGTTCTATGAGATTTAATTCAAATGATCATGTTAAACGACACATGCGAACACACACCGGTGAAAAGCCATATAGTTGCAAGTTTTGTGACAGAACCTTTGCTCAAAGTAATGATTTGGTAAAGCATATGCGAAGTCATGTTGGCGAGAAAACTTATGCATGTCAAGAATGTCCGGAGGCTTTTCGGTTAAATAGTGAATTACGTGCTCATATGCGAGAACATTTTAAACAGCGTAAGGAGGATGTGGTTTATGCTGATATGAAATCAAATTTGCAAGATAGTGATACAACACTGCACAGTACAACTGAATTTCAGGATTTGAGGAAAGTCCGTGCGAGaatgaatctcAGCGAATTGTGCAGGATTTGCATACAAAATGACAAGAAACTGCATCAGATCGCCAAGTCCCTAGCGCCGGATATAAAAGTGACCTGTAGCAGTGTTGTTGAATTGTGGAATATTTTTCATAAGCTTCTAATAAGCACCGACGATGATTTGCCTGATAAAATTTGTGATGATTGCCTTGAAAAAGTCCACGAAGCATACAtttttctgggaaaatgtgAAGCATCAAGAGGAAATTTACTTCAGATTAAAGACGCTCAGCTACAAACTGAACAAAACAATGAACCATTTCTGGTTGAAGAGCATCAAGTTCAAACAGATATCAAGGAGGAAATAAAAGATGACTTCTTTCTGGATGATATTAAAGGAAGTATTCAACCAGTTGAACCAGTCAATGAAGTTAAAGAAACGGTAGCGAAtgaggttgtggatcaaatggAAGTTGTGGTTGAAGGAGAAGCAACGTTTTCTGTGAAAGAGATTAACACTTTTGAGACAAATGGcgctgaagatgatgatgacgataactTCGATGATGATAAttctgaagatgatgatgatgaagattggCAACCACCAGCGAAATCGGATGATCAATCCGAAGATATAAAACCGATGAAGGAAAATCCGGAAGATGATACACCAAAAAGGAGGGGTCGAAGGAAACGAACCGTAAAAGAAACTGATGACGATAACGAGAAGCCTCGATGTGAAATTTGCCATAAAGACTTCGACAAAAGGTCGCGGCTGGTACGCCATATGAAATCTCATATAAGCGGGCGAAGGCGCAAAAAAGAGCCAATGGAATGTGAAGTATGCAAtaagaaatttgagaaaaaatcaCGCCTTCTTCGTCATATGAATGTACACGACCCCGATGGACGGCCTTACGAATGTGAGGTGTGTAAAGTTCGATTTGTTCATGAAAGTAATATGCGGCGTCATGCAATTGTACATACGGAAGTTATAAGCGAATCTTCAACTGTATTGAAACAACAGACCGTGTTCAAATGTATTGAGTGTCCGAAAGAGTTTGCTAAGCAGGAATCATTGGCATCGCATATGAGAACTCATCGAAGTAAGGGAGAGAAGAAGTCTTATACTTGTGAAATATGCACGAAAGTGTATCCACAATTAAATATGCTTACAAGACATATGAAAAGTCACGAGGAATATCGAAATTATCAGTGCAATATATGCGGAAAGAGATTCGCTTTAACTTCGCGTCTTATCGATCATATTAATCGTCATAACGGAGTAAAGCCCCACGTTTGTCATTTTTGTAACAAAGCCTTTTTGCAAAGCTGTACACTTAAGGACCATCTACGTACGCACACGGGTGAGCAACCGTATCTTTGTGCCCAGTGCGGCAAAGCATTTAATAATGGCAGCAACTTGCGTCAACATCTTTTGCGCCACACTGGAATCAAACCATTCGCTTGTCCCGAATGTCCAAGTCGATTTGCATGCAAAGGAGAGTTGAAGGGACATATGACCACCCATCAAAATATCAAACCGTACAGTTGCGAACAGTGCGATTCGAAATTCACAAAGCCACAGTCCCTGAAAAAGCATATGCGAATTCATACAGGAGAGCGACCGTATGCCTGTGATGCGTGCGGAATGACTTTCACCTCATCATATCCTCTGAAGCGTCACATGCGCACCCATACGGGCGAGAAACCTTATCAATGTAAATTCTGCGATAGATCGTTTGCTCAAAGCAATGATCTTGTAAAACATACCCGCAGTCACGTTGGAGAGAACACTTATCAATGTGAAGAATGTCCTGCAGCTTTTCGATTACATAATGAATTACGTGCGCACATACAAGAACATTTCCGCGAGAAAAATGAACGAAAAGATTTCAATGAAGATCTGCAAGCGGCAATTATTCAACAGAAAGCCGCTGCAGAATCGGGTAAATTTATGAGTGCTCTGATGCCTGCAATAGCAGCTCCCACGGTGAATAATGAAACAGCCCCAACGACTGAAGAGTTCACTACGGCGATTGATGACGATCGATTCATAGAATCTGCGTTTGTATGTAGTGAATGCGTTGCAGAAATCCAGCGCAGCTATGAATTTATTGTGAAGTGCGAACAATCAAACAGTCTtttagagaaattttcaacatgtaAGCAAAAATCAAAAGGCACCCTCTTACCAAATAGCAGCAACAGTGATTCTCATTTATCGAAAGAAGAGCTTGATAATACAGAACAGGATGCTAGCAATGTGGAAAAATCAAAGCAGTTGAAGTACGAACCAAAAGTGGGTTTAACTCCTAATGAAACTGTCTTAGATTGCAGTCAAGAAATGCATGACGACAGTGACGATGAAGAATATTTGCTGCAAATCgttactgatgatgatgatgaacccgAAAAAGCCACAAAAAttgaaaccccgaaaagaagACGGAACACTTCAGTTCTTACATGTAGAACATGCGGAAGACCTTTTGAATGCGAAGTCTGTAAGAAGCTGTTCACAACTGAAGAGAATAAGAATCGTCATATGCTGTTACACAACGCCGTAATATCAGGAACAGCAACGATCCTAAGTCCAAGAACCAGCTTTAGATGTATGGACTGTTCTAGAGTGTTCACCAAGCAAGAATCCTTATCATCACATATGCGCAAACATCAAAATTTGCCTAATCGAGAGCGAAAATATAGTTGCAAGGTATGTCCTGAAACATTTCCACGATCATCATTACTGTCGCGACATAATAGAACTCACTATGAGAATCGGAAATTTCAATGCAATATCTGCGGAAAAAGATACGCTGGTAATGGACAACTAGTTGACCATTTGAATAGGCATAACGGGATCAAGACACATTCTTGTGAAGTTTGTGGAAAAGCATTCTATTATCCAAGTCCGTTGAGAGAACATCTTAGACTGCACTCTGGAGAGGCCCCATATCCTTGTCCCCAATGCGATAAAGTTTTTGCAAGTATAGGGAATTTACGGCAGCATTTGAAGCGCCATTCATCTGAAAAATCAATAGAATGCCCGCAATGCCCAAGCCGGTTCAAGTGTCAAAGCAATTTAGTTTCACATGCCTATGTTCATAAGGATGCAAACCTCTATGAGTGCGATGAATGTGGTATGAAATTTTCGAAAACATATGTATTGAAAAAGCATAAACTTGTGCATTCTGGGGAGAAGCCTTACGAGTGCCAGAGATGTCCTATGAAGTTCAATTCATTGTTTCCTTTGAAACGGCACATGCTTATACACACTGGAGAGAAACCATATCAATGCAAATACTGCGATCGTGCATTTACACAATGCAATGATTTAGTGAAACATACGAGGGGCCACGTTGGAgagaatatatatatgtgtgaAGAATGCCCAATGGCATTTCGATATCAGAGAGAACTTCGAGCTCATGTTAAAGATCATTTCAAAAAGAAAGCATTGAGAAGTTCACAAAATATGGATCTCGGCAAAACGTGCAGGACGTGTCTGCGTTCCGATGGGACATTGAGTCCTTTGGTTGTGTCGATTACTCCGAAAGTTGGTTCGGTCTCGAGCGGCGTCGTTGACCTGTGGAAAATTTACACTCGCTTATTGGTAACCTACGATTCTGCTTTGCCTGATCTCATATGCGATCATTGCGTGGACGAAATTCAACAAACGTACTCTTTTCTCGTGAAATGTGAAAGCTCTGAGTCATATTTCAGAGTTTTGAAGGAAAGTAAAGAAGAGACTGCGAATATCGATGACGAGTATATGGATATATCAGAAATTAAAAATGAGAGTTTACACGTCAATGGATTCATTCCACAATATACTGATAGTTCAGCTAAACATTCCGTTCAGAAAACGCCAGCAAATATAAACCATTCTAATAAAACTACCCTTCAGGCAGCAACAGACCATTCCAATAAAAGTGACCTTCAGACAGCCACTGACCACTCTAATAAAAGTACCCTTGAGGCAGCCACAGACAATTTTAATAAAAGCACCCTTGGGCCAGCTACAGACAATTCTAATAAAAGCACCCTTGAGGCAGACGCAGACAATTCTGATAAAAGCATCCTTCAGGCAGCTCCAGATGAAGTAACCCAAAATTCCTGCGAGTTCAATGAGACAAGTTATAAATTCGAAGTAACTGATATCACAGACAATGTTAAATATACTTTGGAAATTGTTGGTGATCGAGTTTCGCTAGCTGGAGGGAAAGTTAAACAAGAAGTAGAGGTAAATTTCGACGAAGACTCTACCTCCTTAGACGAGAATACGATTTGTAAAGAGATTAGTCACAAAAACACCAAAACTTGTCCTGCATGTTCCTATCAATGTGATACACCAGAAGTTCTCGGAAGGCATATAGTAGATGACCATATATTGGGCTCGATTTGTACTATATGCAAGCAAGAGTTTAAATGTACTCCGTTGGAACACAGTTCTACACATCTCAGTATAAAGCAGGCGGCAGCATCCGACAACGAATCCGATAGCAATCTAGATAAAGTTCCTAGTCAGATAGGTTTCCCATATAAATGTGAATATTGTAGGAAACCATTTCGTAGTAAAACTGGATACAGTTACCATATGTTGGTGAAgcacgactacgctgatgagaaGGAGAAAGTAATTCAATGCACTGTTTGCTCGAAGTTTTTCAAATCGAATAGTGCTCTTGAATCTCACATGCGCACCCATAACGGACAGAAACCATACACTTGTGGGTATTGTTCGAAAATGTTTGTGCAGAAGGGACATTTAACCGTTCACATTCGTTGCCATACTGGTGATCGACCGTACAAATGTACGTTCTGTCCCAAAGACTTTTCAGATAAGAGTAATTTCAATAAACACATACAATTTCACACTGGCGAACGACCATATAAATGCAAAGTCTGTCCTCGTACTTTTCAACGTAAAAAATTACTAGAAATTCATTTGTTATGTCATACAGCCGAAAAACAGTTTAAATgtgaaatttgtttgaaatcgTTTCGTCAGCGATCGGCGTGGAAAGAACATGCTAAAATTCATTATTAA
Protein Sequence
MSHSSFSNICRACKFQNSTFFKIFNQSSENSVQISTMFSSISTLNIDPNDQLPDLICPSCYSILLRAYEFQEMCINSDKEMRAELARISEISECQPFNEPFKDDPPEANSMFMHDGQEQGETFDATSSSIKGGENVTFENASGDKSSIETIKETLVSESLLKKPRHICKICNRQWRTESSFYRHLKTHGVKTEDNSQEQPKITNTSYEDSNMCDIEVKLERNEYSEEFKNQDFAAPKLEKRDHHSENKRYSCDSCPKTFSQLNVLTRHRKTHGETRNFQCNKCDRTFLLQVQLDEHLNCHNGLRPHVCPICHKGFQQSCTLKDHIRTHNGETPFLCPECGKAFNNSSNLRQHLLRHSGVKPFACPECPSRFTTKGGLKSHMTTHSNEKPFRCDQCDAKFTKPYSLSKHKRIHTGERPYSCEVCSMRFNSNDHVKRHMRTHTGEKPYSCKFCDRTFAQSNDLVKHMRSHVGEKTYACQECPEAFRLNSELRAHMREHFKQRKEDVVYADMKSNLQDSDTTLHSTTEFQDLRKVRARMNLSELCRICIQNDKKLHQIAKSLAPDIKVTCSSVVELWNIFHKLLISTDDDLPDKICDDCLEKVHEAYIFLGKCEASRGNLLQIKDAQLQTEQNNEPFLVEEHQVQTDIKEEIKDDFFLDDIKGSIQPVEPVNEVKETVANEVVDQMEVVVEGEATFSVKEINTFETNGAEDDDDDNFDDDNSEDDDDEDWQPPAKSDDQSEDIKPMKENPEDDTPKRRGRRKRTVKETDDDNEKPRCEICHKDFDKRSRLVRHMKSHISGRRRKKEPMECEVCNKKFEKKSRLLRHMNVHDPDGRPYECEVCKVRFVHESNMRRHAIVHTEVISESSTVLKQQTVFKCIECPKEFAKQESLASHMRTHRSKGEKKSYTCEICTKVYPQLNMLTRHMKSHEEYRNYQCNICGKRFALTSRLIDHINRHNGVKPHVCHFCNKAFLQSCTLKDHLRTHTGEQPYLCAQCGKAFNNGSNLRQHLLRHTGIKPFACPECPSRFACKGELKGHMTTHQNIKPYSCEQCDSKFTKPQSLKKHMRIHTGERPYACDACGMTFTSSYPLKRHMRTHTGEKPYQCKFCDRSFAQSNDLVKHTRSHVGENTYQCEECPAAFRLHNELRAHIQEHFREKNERKDFNEDLQAAIIQQKAAAESGKFMSALMPAIAAPTVNNETAPTTEEFTTAIDDDRFIESAFVCSECVAEIQRSYEFIVKCEQSNSLLEKFSTCKQKSKGTLLPNSSNSDSHLSKEELDNTEQDASNVEKSKQLKYEPKVGLTPNETVLDCSQEMHDDSDDEEYLLQIVTDDDDEPEKATKIETPKRRRNTSVLTCRTCGRPFECEVCKKLFTTEENKNRHMLLHNAVISGTATILSPRTSFRCMDCSRVFTKQESLSSHMRKHQNLPNRERKYSCKVCPETFPRSSLLSRHNRTHYENRKFQCNICGKRYAGNGQLVDHLNRHNGIKTHSCEVCGKAFYYPSPLREHLRLHSGEAPYPCPQCDKVFASIGNLRQHLKRHSSEKSIECPQCPSRFKCQSNLVSHAYVHKDANLYECDECGMKFSKTYVLKKHKLVHSGEKPYECQRCPMKFNSLFPLKRHMLIHTGEKPYQCKYCDRAFTQCNDLVKHTRGHVGENIYMCEECPMAFRYQRELRAHVKDHFKKKALRSSQNMDLGKTCRTCLRSDGTLSPLVVSITPKVGSVSSGVVDLWKIYTRLLVTYDSALPDLICDHCVDEIQQTYSFLVKCESSESYFRVLKESKEETANIDDEYMDISEIKNESLHVNGFIPQYTDSSAKHSVQKTPANINHSNKTTLQAATDHSNKSDLQTATDHSNKSTLEAATDNFNKSTLGPATDNSNKSTLEADADNSDKSILQAAPDEVTQNSCEFNETSYKFEVTDITDNVKYTLEIVGDRVSLAGGKVKQEVEVNFDEDSTSLDENTICKEISHKNTKTCPACSYQCDTPEVLGRHIVDDHILGSICTICKQEFKCTPLEHSSTHLSIKQAAASDNESDSNLDKVPSQIGFPYKCEYCRKPFRSKTGYSYHMLVKHDYADEKEKVIQCTVCSKFFKSNSALESHMRTHNGQKPYTCGYCSKMFVQKGHLTVHIRCHTGDRPYKCTFCPKDFSDKSNFNKHIQFHTGERPYKCKVCPRTFQRKKLLEIHLLCHTAEKQFKCEICLKSFRQRSAWKEHAKIHY

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-