Basic Information

Insect
Cenopis cana
Gene Symbol
ZFY
Assembly
GCA_951800055.1
Location
OX637496.1:738431-750581[-]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The following pattern describes the zinc finger:
#-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
Here X can be any amino acid, and the numbers indicate how many residues occupy each stretch (a range where given in brackets). The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
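The pattern in the description can be transcribed into a regular expression to scan a protein sequence for candidate fingers. The following is a minimal Python sketch, not the method used to produce this record (the domain hits listed here come from hmmscan). It assumes that every # position may be any residue, which is looser than the real fold constraint, and the function name `find_c2h2_candidates` is hypothetical.

```python
import re

# Regex transcription of #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C].
# Assumption: '#' (fold-stabilising position) is treated as any residue.
C2H2_PATTERN = re.compile(
    r"."        # '#'
    r"."        # X
    r"C"        # first zinc-coordinating cysteine
    r".{1,5}"   # X(1-5)
    r"C"        # second zinc-coordinating cysteine
    r".{3}"     # X3
    r"."        # '#'
    r".{5}"     # X5
    r"."        # '#'
    r".{2}"     # X2
    r"H"        # first zinc-coordinating histidine
    r".{3,6}"   # X(3-6)
    r"[HC]"     # final position: His or Cys
)


def find_c2h2_candidates(protein):
    """Return (start, end, motif) for non-overlapping matches.

    Coordinates are 1-based and inclusive, like the ali coordinates
    in the hmmscan table.
    """
    return [
        (m.start() + 1, m.end(), m.group())
        for m in C2H2_PATTERN.finditer(protein)
    ]


# Fragment copied from the protein sequence of this record.
fragment = "YACQHCGETFSASSRRSEHIRKTH"
print(find_c2h2_candidates(fragment))
```

Matches from such a pattern are candidates only; a profile HMM search weighs every position and will generally disagree with a plain regex on borderline fingers.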
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 33 0.0055 0.43 12.3 3.4 2 21 166 185 163 186 0.87
2 33 0.0055 0.43 12.3 0.6 1 23 219 242 219 242 0.93
3 33 0.00012 0.0091 17.5 0.7 2 23 248 269 247 269 0.95
4 33 0.028 2.2 10.0 1.7 2 21 273 292 272 293 0.92
5 33 0.086 6.6 8.5 0.4 2 23 336 358 335 358 0.94
6 33 1.9 1.5e+02 4.3 0.5 3 23 366 385 365 385 0.76
7 33 0.0028 0.21 13.2 0.2 1 23 391 413 391 413 0.98
8 33 0.0083 0.64 11.7 1.7 1 23 419 442 419 442 0.97
9 33 0.0094 0.73 11.5 0.2 1 23 451 474 451 474 0.97
10 33 0.39 30 6.4 5.6 1 23 484 506 484 506 0.98
11 33 5.6 4.3e+02 2.8 0.7 2 10 625 633 624 644 0.92
12 33 0.0027 0.21 13.2 0.4 2 23 676 698 675 698 0.95
13 33 0.0041 0.32 12.7 4.2 1 23 703 726 703 726 0.96
14 33 0.27 21 6.9 2.8 3 21 731 749 730 750 0.93
15 33 8.4e-05 0.0065 18.0 0.2 1 23 761 784 761 784 0.94
16 33 0.00018 0.014 17.0 3.4 1 23 791 813 791 813 0.97
17 33 0.0071 0.54 11.9 0.6 1 23 820 842 820 842 0.98
18 33 0.025 1.9 10.2 0.4 1 20 862 881 862 883 0.92
19 33 0.00032 0.025 16.2 1.0 2 21 981 1000 980 1001 0.94
20 33 0.012 0.94 11.2 3.8 3 23 1034 1054 1033 1055 0.95
21 33 0.0034 0.26 12.9 0.1 3 23 1062 1082 1060 1082 0.96
22 33 0.00055 0.042 15.4 1.4 1 23 1118 1141 1118 1141 0.95
23 33 0.00022 0.017 16.7 0.5 1 23 1147 1169 1147 1169 0.96
24 33 2.8 2.2e+02 3.7 0.5 1 23 1203 1226 1203 1226 0.88
25 33 0.00035 0.027 16.0 1.4 2 21 1353 1372 1352 1373 0.92
26 33 0.13 9.7 8.0 0.2 1 23 1406 1429 1406 1429 0.93
27 33 0.0081 0.62 11.7 0.4 2 23 1434 1455 1433 1455 0.96
28 33 2.7 2.1e+02 3.8 1.8 2 21 1459 1478 1459 1479 0.91
29 33 2.1e-05 0.0016 19.9 1.0 1 23 1505 1528 1505 1528 0.95
30 33 0.003 0.23 13.1 0.4 1 23 1534 1556 1534 1556 0.92
31 33 2.7 2.1e+02 3.8 0.1 3 23 1564 1584 1562 1584 0.97
32 33 0.36 27 6.6 1.8 1 23 1590 1613 1590 1613 0.94
33 33 0.0025 0.19 13.4 1.0 2 23 1620 1642 1619 1642 0.95
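
To work with the domain table programmatically, each row can be split on whitespace into its thirteen columns (domain number, total, c-Evalue, i-Evalue, score, bias, the hmm, ali and env coordinate pairs, and acc). The sketch below is illustrative only: the field names, the `parse_domain_rows` helper and the 0.01 i-Evalue cutoff are assumptions chosen for the example, not settings used by the database.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_domain_rows(text):
    """Parse whitespace-separated domain rows; skip headers and blank lines."""
    hits = []
    for line in text.strip().splitlines():
        f = line.split()
        # A data row has 13 fields and starts with the domain number.
        if len(f) != 13 or not f[0].isdigit():
            continue
        hits.append(
            DomainHit(
                int(f[0]),
                int(f[1]),
                *map(float, f[2:6]),   # c-Evalue, i-Evalue, score, bias
                *map(int, f[6:12]),    # hmm, ali and env coordinates
                float(f[12]),          # acc
            )
        )
    return hits


# Three rows copied from the table in this record.
rows = """
3 33 0.00012 0.0091 17.5 0.7 2 23 248 269 247 269 0.95
6 33 1.9 1.5e+02 4.3 0.5 3 23 366 385 365 385 0.76
29 33 2.1e-05 0.0016 19.9 1.0 1 23 1505 1528 1505 1528 0.95
"""

# Hypothetical cutoff: keep domains with an independent E-value below 0.01.
confident = [h for h in parse_domain_rows(rows) if h.i_evalue < 0.01]
print([(h.index, h.ali_from, h.ali_to) for h in confident])
```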

Sequence Information

Coding Sequence
ATGAATCGTCAAGTGGATGTAAAAGCTTTAGTTTCCCACATAGTGAGAGGCGATGGCTTGGACAAATGTCGGATTTGTATGGGAGACACGACCGAAGGGCAAGTCTACCTCGGAGACACGGTTATGATGGACGGAGACCGACCCGTAACGCTTGCTGAGCTTCTGGAAATAATCACAGGCATCGAGGTTGCCTTGGAGGAAGACCTACCGGCCGGGCTGTGCACTTCATGTGCCCAGAGAGCCATGGAGGCTGCCAGCTTCCGAACGCTGTGCAGGCAAGCCGCTAACCAATGGGACAACACTCTGCAGCTACTAACAGACCACCTTCCAATCAAGTACAAAAGTGCAAGCATGTATATCTTATTGGATAAAGATCAAATCACTATACTTGATGATTTGAAAGGTTCAAATCCATCTAGACTTCCTCGAGAAAAATACACTTCAAATATACAGAGAGGTAAAGAGGTTAAAGTGAGAGACGTAAAGCATAAATGTCAATGTCTGGATTGTGGGAAGAAATTTGCATATCCGCAACATTTGTATCTACATTTGAAGGAATCTATGGATTTGAAGCGTGCTTGCTATGTATGTGCTAAGATCATGACTAGACACAAATTGATTCTGCATCTGAAATCTGACCATCAGCTGAAAAGTTACGATTGTAAGAAGTGTCCGGCCTTGTTCCGGTCACATGAGATGTTAACTCGGCATATAGAGGATGCCCACAGTCCTGGCGCTTGTACTTGTGGGGACTGCGGACGTAGCTTTAAGTCCAGACCGGCTTACAATGCTCATCTGTCCGTCCACACTAGCAAAACCTGTCCTGGATGCGACAGGATCTTCAGAAATCAAACTTGTTACTTGTATCATGTGAAAAAATGCTGCGATTTGGATGTCAGCAAGAAAACTCACCAGACTAAGAATAGAGTGACCATAGAAGTCAAGAATAAAAAGAGTGAGAGGCGAACTAAAGTTGGTCTCCGTGGAGCAGCGGATAAACAATGTATATGTGATTATTGCGGCAAAAAGCTAGCAGGGAAAAAGTTTATAACGGCACACATACAGATAGTGCACCTTAAAAACACTCACAGACCCTGTCCGTACTGCGGGAAGTTCCTCGCTTCTGCCCATATGGGTGAACACCTTAAAAAGCACGAGTTCGAGACTCAATTCACGTGCGAATACTGTGGAATAGTCTTAAAAACCAAACTAGGATATGTCCAGCATATACGTCTACATACCGGTGAAAAGCCATACGCTTGCCAGCACTGTGGTGAGACTTTCTCAGCGTCTTCGAGACGATCAGAGCATATCCGAAAAACTCACAGAGCTGGCAGTGTTGTTCTGAAATATGCCTGCGAATACTGTCCGGCGAAGTTCCAATTACCATATAAACTTAGGAACCATGTGTCAACTGTGCATAATACGGAGGACACAGCTGCAGTACCGTTTTCTTGTaagatttgtatggaaaagttcAGCAGTTGTAGAGGGCTTGCACATCATAGTAGAAAGCATCAGAAAATGGAGACCTCAGAACTGCTGCCCAACAGAATCTGTGGAGGCTGTGCAAACAACATACGCCCTACCATCCACCTCATAGGACACCTCAGAGATGCCCACCGAAAATGGACGGACGCCACCGAATACCTGCACCAGTTGGGATTCAAGAATAACGTCAAAACAGCTTACATAGTCGTCGAGAATGACATCAAGGACATAACAACCTATACTGAGAATGTAAGGAGGTCGGGATGCGCTAAAAAAGAGCTGTACAAGAGCTTTAAAAAGTGCAAAAGTATGCGTACCGCTGTAAAAAGGCGTACCGAGCTCGAAAGGAAACTGCGCAGTTCAGAATTAAGCACGAGCTTAAAATGCCCAGAATGCGAGAGATCTTTTCTGTGTGTAtacaaatataatatgcatataaGATATAGTGATAAACGGGCTTGCGTGCATTGCAGACAAATGATCAAATTGGAAGAGTTAAAAGATCATTA
CGCTGAGCACGGAGTTGAAGCAATGCAGTGCGATATCTGCTTTGAGACTTTCAAGAAAGAAGATGCACTGATAAAACATAAAGTGACTTACCACACCAAGGGACCATATTTCTGCTACATATGTAAACTGTCTTACAGAGACCCTCATCATTTAGCGTCGCATATGACAAGCAAACATGAGCCAAGGATTTGCTTCGGGTGCGATCGGAAATTCTCAAACGTGTACTGCTTTAGAACACACTCTAGAAGATGCAAGAATTCCAACAGGAAAAGCAAAGTGTTCATCTGCGATATTTGCTCTAAAGTGTACAGCTCTAAATACGCTCTAAAAGTGCATTTAGAGTATATACATATGAATAAAGAGCATTCTCATCAGTGTGAGCATTGCGGGAAAGTGTTCAACAGTGCTATTCATCTTTTGGAACATGGTAATAAGCACAATAAAGTACCAGATCGATTTGTTTGTAACATCTGCGGTGTGCCTATGAGCACCCGTCGAGGGTACCAAAGACATTACAAGAGGCATCTGAAGAACCCAAATTACGTCCCAAGAGGGGCTAAACCGTCTAAAACAAAGATTAAGTATGATTGTGAATTTTGCGAAGAGACGTTTAACCTTAAAACTCATTTGAGGGAGCATGTTGAAGAGGACGCCGGGTTGCCTCAAGGTCTCTGCAGAGCGTGTTCTGAGGACGTCATCGCTGCTGCACACTTCAGACAACTAACTACCATCTCCCACCATCACTGGAACAGTGCCGTCGACTCCCTCTCCTGCGTACACGACCCCGAAGACAACATCAGGACATACTATATCTTCTACAGCAACGGAGAAATGCTCATAAGTGAGGAGCCTCGAAAATCCATAGACAAAAAGGACGTTGTTGTACACCTGAATGAAGAAAGGCCAAAAAGGTCTTACACGAAATCAGCAGGGAAGTGCAAATGTCCTGACTGCCACAAAGATTTCCCTGTAGCTTACAATCTGAACGTACATTTGAAGAATACTATGAAGCGGGCGTGTATAAGATGCGGGATCGTTTTGGAGAGGGAGAAATTGGCGGATCACCTAGCTAAAGTTCATGGGAAATTGTTCGCGGATTGCGATATTTGCTTCAAACTGTTTGAGCATGAAAGTCTACTCAAACAGCATTATCAGACGCATCACAGCAGATTTTCCTTCGGCTGTCAGATTTGCGGACGAGGCTACACTAACGATAGGGCGCTGAGAGCCCACATGTACGCTCACACGCTATTCCATTGCTTGTCTTGTAACTCGAGCTTCGAGAACCGTCGATGCTATAAGCATCATCAGAAGAAATGCAAGTCTGCTAAAGCGCCGCAAGCTACCAGCTTCACATGCGACTACTGTGGACACGTGTACAATAAAAAACCATCGCTACGCATACATATAATACAAAAACACTTAAACGTACTACCCTACGTTTGCGAGACATGCGGGAAAAGGACTTCAACGCTAGCTCACTTGAGGTCCCACGAAGCGATTCACACAACAGAAAGGAAGTTATATAAGTGCGCTTGTGGCGCGGAGATGCGAACAGAGCTAGGTTACCATCTACATCAAAGAATACACACCGGAGAAAAGCCTTATCAATGTGAAGAATGCGGTGACCGCTTCTTATCTGCATCTAGACGCTTGGATCACATTAAAAGACGACATCGGAGCACTAAAGACATGCCTCACGGCTGTGATTTGTGCCAAGCTCGGTTCATAATAGACGAATCGCTCTGCCCAAGCGGTATCTGCGCAACATGCACTGCAAACGCCATCGCCGCGCAAGAGTTCCGCTCTTTAGTTTCCGACTCCATCAAAATTTGGTCCTTCGCCATTAAACAACTCGAGATAATACCTCTCCAAGACGCTCCCTCCGTCAAATCGCTATGCGCCTTCTTCAGATCCGACAACCTCAACATCCAGATAGCTAAAGACTACTCCGTCAACGAGAAAACCACTCCACTAACCAGGTTAAAGTCGCGAATGA
ACAGAAAGAAAAGCGATGAGAGAAAACCTAGGGTGCACAGAACTGGTCCACCCTGCAAATGCACAGACTGTGGCAAAGAGTTCATCAGTCCTTACTACTTGAATGTCCACTTCAAGAATAGTGGTCAAAAAGACGCGTGCATCACTTGCGGAGCTATTCTCTTGCGAGGTGCACAGATGAGGGATCATCTAGAGAAAGTTCATAGAGAAACTGCCTTCTTGTGCAAAGAATGCCCTGCGATTTTCAACAATGAAACTGATGCGAAGAAACATGAGAAAAATGCGCATAAGACTGGATTAACTTGTGGAGATTGCGGGCGGACGTTCCTTAGGAGCGCTTCGTTTGAAACTCATTCCCAAATGCACGCTGTACGGACGTGTAGAGCCTGTGGATTACAGTTCACCAATAGAGCGTGTTACAGAGAGCATAGGTCCAAGTGTGAGCCAGATGCGAAACCTGATAGAGACAGTGTACCTCGCAATAGACGGTCTAACATTCGCGATCCGGCTACATTCACGTGTGATTACTGCAAGAAGACTTACACATCGCGCCCTCAATTGAAGAACCATATTCTTTGGATACATATGGACGTTCGCCCGCATCAATGTCAATGGTGCGGGAAGAGATTCTACACACCAGCGCGTTTAGCGGAACACAGCGTAGTACACACGCGGGAGCGGAATTTCGGTTGCGATATTTGCGGAGCGAAATTAGTTTCCAAAATGGCAGCCGTTTACCACAGACGCCGGCATACTGGTGAAAAGCCGTATCGTTGCGAAGACTGTGGGGACTGCTTCATATCTGCATCGAGAAGGTCCGAGCACGCGAAGAGAAAGCACGGAAAAGGGCCAAGATTGCAATGCACTAGATGCCCGTCGAGTTTCGTTCGGATTCACGAGCTGAAGAGGCATATTGAGAAGGCACATAGTCATGTCGTTCCTGTATATGCTTTTAAGAATCCAACTAGCATATAA
Protein Sequence
MNRQVDVKALVSHIVRGDGLDKCRICMGDTTEGQVYLGDTVMMDGDRPVTLAELLEIITGIEVALEEDLPAGLCTSCAQRAMEAASFRTLCRQAANQWDNTLQLLTDHLPIKYKSASMYILLDKDQITILDDLKGSNPSRLPREKYTSNIQRGKEVKVRDVKHKCQCLDCGKKFAYPQHLYLHLKESMDLKRACYVCAKIMTRHKLILHLKSDHQLKSYDCKKCPALFRSHEMLTRHIEDAHSPGACTCGDCGRSFKSRPAYNAHLSVHTSKTCPGCDRIFRNQTCYLYHVKKCCDLDVSKKTHQTKNRVTIEVKNKKSERRTKVGLRGAADKQCICDYCGKKLAGKKFITAHIQIVHLKNTHRPCPYCGKFLASAHMGEHLKKHEFETQFTCEYCGIVLKTKLGYVQHIRLHTGEKPYACQHCGETFSASSRRSEHIRKTHRAGSVVLKYACEYCPAKFQLPYKLRNHVSTVHNTEDTAAVPFSCKICMEKFSSCRGLAHHSRKHQKMETSELLPNRICGGCANNIRPTIHLIGHLRDAHRKWTDATEYLHQLGFKNNVKTAYIVVENDIKDITTYTENVRRSGCAKKELYKSFKKCKSMRTAVKRRTELERKLRSSELSTSLKCPECERSFLCVYKYNMHIRYSDKRACVHCRQMIKLEELKDHYAEHGVEAMQCDICFETFKKEDALIKHKVTYHTKGPYFCYICKLSYRDPHHLASHMTSKHEPRICFGCDRKFSNVYCFRTHSRRCKNSNRKSKVFICDICSKVYSSKYALKVHLEYIHMNKEHSHQCEHCGKVFNSAIHLLEHGNKHNKVPDRFVCNICGVPMSTRRGYQRHYKRHLKNPNYVPRGAKPSKTKIKYDCEFCEETFNLKTHLREHVEEDAGLPQGLCRACSEDVIAAAHFRQLTTISHHHWNSAVDSLSCVHDPEDNIRTYYIFYSNGEMLISEEPRKSIDKKDVVVHLNEERPKRSYTKSAGKCKCPDCHKDFPVAYNLNVHLKNTMKRACIRCGIVLEREKLADHLAKVHGKLFADCDICFKLFEHESLLKQHYQTHHSRFSFGCQICGRGYTNDRALRAHMYAHTLFHCLSCNSSFENRRCYKHHQKKCKSAKAPQATSFTCDYCGHVYNKKPSLRIHIIQKHLNVLPYVCETCGKRTSTLAHLRSHEAIHTTERKLYKCACGAEMRTELGYHLHQRIHTGEKPYQCEECGDRFLSASRRLDHIKRRHRSTKDMPHGCDLCQARFIIDESLCPSGICATCTANAIAAQEFRSLVSDSIKIWSFAIKQLEIIPLQDAPSVKSLCAFFRSDNLNIQIAKDYSVNEKTTPLTRLKSRMNRKKSDERKPRVHRTGPPCKCTDCGKEFISPYYLNVHFKNSGQKDACITCGAILLRGAQMRDHLEKVHRETAFLCKECPAIFNNETDAKKHEKNAHKTGLTCGDCGRTFLRSASFETHSQMHAVRTCRACGLQFTNRACYREHRSKCEPDAKPDRDSVPRNRRSNIRDPATFTCDYCKKTYTSRPQLKNHILWIHMDVRPHQCQWCGKRFYTPARLAEHSVVHTRERNFGCDICGAKLVSKMAAVYHRRRHTGEKPYRCEDCGDCFISASRRSEHAKRKHGKGPRLQCTRCPSSFVRIHELKRHIEKAHSHVVPVYAFKNPTSI

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00659633
90% Identity
-
80% Identity
-