Basic Information

Insect
Nineta flava
Gene Symbol
Zfy2
Assembly
GCA_963920215.1
Location
OY986042.1:23312265-23322884[+]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The following pattern describes the zinc finger:
#-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
Here X can be any amino acid, and the numbers give the number of residues, with a range in parentheses where the count varies. The positions marked # are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
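The pattern in the description can be approximated with a regular expression. The following is a minimal sketch, not the method used to produce this record (the hits below come from an HMM search). It assumes that both `X` and `#` may be modelled as "any residue", and the names `C2H2_PATTERN` and `find_c2h2` are hypothetical.

```python
import re

# Rough regex rendering of #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C].
# Assumption: '#' (fold-stabilising position) is treated like X, i.e. any residue.
C2H2_PATTERN = re.compile(
    r"."        # '#'
    r"."        # X
    r"C"        # first zinc-coordinating cysteine
    r".{1,5}"   # X(1-5)
    r"C"        # second zinc-coordinating cysteine
    r".{3}"     # X3
    r"."        # '#'
    r".{5}"     # X5
    r"."        # '#'
    r".{2}"     # X2
    r"H"        # first zinc-coordinating histidine
    r".{3,6}"   # X(3-6)
    r"[HC]"     # final position: His or Cys
)


def find_c2h2(protein):
    """Return 1-based inclusive (start, end) spans of non-overlapping matches."""
    return [(m.start() + 1, m.end()) for m in C2H2_PATTERN.finditer(protein.upper())]


# Residues 11-34 of the protein sequence in this record (first hmmscan hit).
print(find_c2h2("HKCHLCETAFHSKNGLEIHINRDH"))
```

A regex of this kind is only a coarse screen: it ignores residue preferences at each position, so it can both miss divergent fingers and report spurious ones that a profile HMM would reject.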
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 37 0.033 1.2 10.3 6.1 1 23 11 34 11 35 0.96
2 37 0.00064 0.024 15.6 2.2 1 23 39 62 39 62 0.97
3 37 0.0018 0.065 14.3 0.5 1 23 71 93 71 93 0.97
4 37 0.21 7.9 7.7 0.1 2 23 99 120 98 120 0.91
5 37 5.3e-06 0.0002 22.2 1.4 1 23 126 148 126 148 0.98
6 37 4.3e-05 0.0016 19.3 2.9 1 21 154 174 154 176 0.96
7 37 0.29 11 7.3 0.3 1 16 182 197 182 199 0.82
8 37 0.16 5.9 8.1 0.3 1 14 266 279 266 283 0.85
9 37 0.0012 0.045 14.7 1.5 1 23 294 317 294 317 0.96
10 37 1.8e-05 0.00067 20.5 2.1 1 23 321 344 321 344 0.97
11 37 0.00031 0.012 16.6 3.4 1 23 350 373 350 374 0.97
12 37 0.00061 0.023 15.7 2.6 1 23 378 401 378 401 0.98
13 37 5.4e-06 0.0002 22.1 2.0 1 23 410 432 410 432 0.99
14 37 0.11 4.1 8.6 0.0 2 23 438 459 437 459 0.93
15 37 1.3e-05 0.00047 21.0 1.7 1 23 465 487 465 487 0.98
16 37 0.00028 0.01 16.8 3.7 1 21 493 513 493 515 0.96
17 37 0.29 11 7.3 0.4 1 17 521 537 521 538 0.83
18 37 0.14 5.1 8.3 7.3 1 23 569 592 569 592 0.96
19 37 2.2e-05 0.00083 20.2 0.7 1 23 598 620 598 620 0.98
20 37 0.00095 0.035 15.1 1.1 1 23 627 649 627 649 0.98
21 37 3.7e-05 0.0014 19.5 0.4 1 23 655 678 655 678 0.91
22 37 9.2e-05 0.0034 18.3 5.5 1 23 683 705 683 705 0.99
23 37 0.0014 0.05 14.6 4.3 2 23 715 736 714 736 0.96
24 37 4.5e-05 0.0017 19.3 0.1 1 23 739 761 739 761 0.98
25 37 0.0012 0.044 14.8 0.8 2 23 768 789 767 789 0.97
26 37 3.8e-06 0.00014 22.6 0.3 1 23 795 817 795 817 0.98
27 37 2.5e-05 0.00095 20.0 3.4 1 23 823 846 823 846 0.98
28 37 0.21 7.7 7.7 4.7 1 23 880 902 880 902 0.93
29 37 0.0066 0.25 12.4 7.8 1 23 908 930 908 930 0.97
30 37 0.029 1.1 10.4 4.4 1 23 937 959 937 959 0.98
31 37 0.00011 0.0041 18.0 1.4 1 23 964 987 964 987 0.96
32 37 0.00063 0.024 15.6 2.3 1 23 992 1014 992 1014 0.98
33 37 2.5e-06 9.4e-05 23.2 1.2 2 23 1024 1045 1023 1045 0.97
34 37 0.00011 0.0043 18.0 2.6 1 23 1048 1070 1048 1070 0.98
35 37 4.3e-05 0.0016 19.3 3.6 1 23 1076 1098 1076 1098 0.98
36 37 6.5e-07 2.4e-05 25.1 0.4 1 23 1104 1126 1104 1126 0.98
37 37 0.089 3.3 8.9 0.7 1 23 1132 1155 1132 1155 0.96
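The domain rows above are whitespace-separated, so they can be loaded and filtered with a few lines of Python. This is a minimal sketch: the column names, the helper names `parse_domain_rows` and `significant_domains`, and the 0.01 independent E-value cutoff are all assumptions chosen for illustration, not values taken from this record.

```python
# Column names are assumed labels for the 13 fields of each domain row.
COLUMNS = [
    "index", "of", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
]
INT_COLUMNS = {
    "index", "of", "hmm_from", "hmm_to",
    "ali_from", "ali_to", "env_from", "env_to",
}


def parse_domain_rows(text):
    """Parse domain rows into dicts, skipping header or malformed lines."""
    rows = []
    for line in text.splitlines():
        fields = line.split()
        # A data row has exactly 13 fields and starts with the domain number.
        if len(fields) != len(COLUMNS) or not fields[0].isdigit():
            continue
        rows.append({
            name: int(value) if name in INT_COLUMNS else float(value)
            for name, value in zip(COLUMNS, fields)
        })
    return rows


def significant_domains(rows, max_i_evalue=0.01):
    """Keep rows whose independent E-value is at or below the (assumed) cutoff."""
    return [row for row in rows if row["i_evalue"] <= max_i_evalue]


sample = """\
5 37 5.3e-06 0.0002 22.2 1.4 1 23 126 148 126 148 0.98
7 37 0.29 11 7.3 0.3 1 16 182 197 182 199 0.82
"""
rows = parse_domain_rows(sample)
print([row["index"] for row in significant_domains(rows)])
```

Filtering on the independent E-value rather than the conditional one is a design choice of this sketch; a different cutoff or a bit-score threshold may be more appropriate depending on the analysis.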

Sequence Information

Coding Sequence
ATGGATCGTGTACATTTACAAaggaaaaaacataaatgtcacTTATGTGAAACTGCTTTTCATTCTAAAAATGGTCTAGAAATTCATATAAATCGCGACCATCATGGTTCAAAGTATGAATGTACTGTTTGTCATAAAGAATTTAGAACACCTAGTTATCTCGCATTCCATGAACGTACCCAACATGATCCCACTTATGTTGGTAAGAAATTTGCATGTGATATTTGCAACAAAGAGTATTCAACGAATGCAATCTTAAAAGTACATAAAAGGAAACATAATGGCTCGTATATAGTGTGTGATGTGTGTGGAAAACGTTTTTTAAGTTTAGCAAGGTTGAATTTACATGCTTCGATTCACACGAATTTGAAACAATACGCATGCAAAACTTGCGGTAAACAATTTCGTAATAAACCGTTGTTGAGGAATCATGAACGTACACATACCGGCGTGAAACCATTTCAATGCAATATATGCGAGAAAAGTTTCACACAACGTGGGACATTAACCATACACCTACGATGTCACAATGGCCAACGACCGTATTCGTGTAATTTATGTGATAAGACATTTCTAACAAACACATTGTTGAAagtaaaaataaaaagtgagCTTAATGCTGTTGTAAACGTTGAAGACAATGAAAAAAACGCAATCATACCTTCTGAACAAGATAATAATAGTAGTGATACCGATACTAGTGACGATGAACCCTATATCAAACcatacaaatgtaaaatttgtaaatgtgATTGTAAAAAAGACCAATCGTATACcataaacaacaaaacaatatttaaatgtaagaTCTGTGCAAAAGTATTTACATCAGAACGAGGTTGGCAAAAAGATTACCATCGACATATCTTACCCAAAAATTATACATGCGATGAATGCGATAACAGTTATACGTATAGAATCGaattaacaaaacataaacGATTCGAACATTCTGATGAATTCATGTGTGAATTATGCAATAAAAAGTATACAACTAGACGTAATTTGAATATGCATATGAATcgtatacatttaaaattgaaaaagtatAAGTGCGATACATGTGAAAAATCTTTTCATTCTAAAAATGGTCTTGAAATTCATATTAATCGTGACCATCACGGTTCAAAGTATGAATGTACTGTTTGTCATAAAGAATTTAGAACACCAAAGTATGTAGAATTACATAAACGTTCGCAACATGATCCTAATTATGTTGGTAAGAAATATACATGCGATATTTGCAACAAACAGTATACATCAAATGGAATGTTAAAACAACATCAGAGAAAACATAATGGCTCATATATAGTATGTGATGTTTGCGGAAAACGTTTCCTAAGTGCTCTCAGTCTAAATTTACATGCTTCGATTCATACGAATTTAAAACAATACGCATGCAAAACTTGCGATAAACAATTTCGTAATAAACCGTTGTTGAGAAATCATGAACGTACACATACCGGTGTGAAACCATTTCAATGCAATATGTGTGAAAAACGTTTTACACAACGTGGGACATTAACCATACACCTACGATGTCACAATGGCCAACGACCATATTCGTGTAATTTATGTGATAAGACATTTCTAACAAACACATTGTTGAAGAACGATGATGCTGTACAGGACATTGAACAGGATTTTGTTGGTGCTGCTGAAATGATAAGAGAAGAGGAAGTATTCAAGTTAAAACATAGTGGTTATCACTGCAACATATGCGATAAACGTTTCTTCCAAAAAGAGCTATGTCGTCGACATTATTTACGTATGCATATAGATTCAGCTGGTTATCCATGTGAAgtatgcaataaaatatttaaatataaatggaatTTAATAACACACACAAAGTTACATGACTTAAATCGACAAAAGTATACATGCGAGATATGTAATCGTGAATATGCAACACCACAATCATATAAAATGCATAAAAATCGTCATTTGAAAAACTATTCATTTAAATGTAATCAATGTGGTAAAGGCTTTTATCAGAA
AGctgaattaaaaattcatacagATGCCGAACATAATGCAATACGTTATCAATGTCAAATGTGTAAAAAAACCTTCTGTAGTAAAGGCTACCTATCGATACATTATCGATCACATGATCCAAATTATGTACCACCAGAATTAAAATGTCatatttgtggtaaaacgtaTCATTCAACAGATCTATTTCGACGACATGTCAAAGCACACGAAGGATTTCCATGTGAATTATGTGGTAAAACTTTGACATCACAGATTAGCTTAAagaatcatatattaatacatcgTGGTGAGAAGCCGTTAAGTTGTAATGTATGTAGTAAACGGTTTAATAAACGTGCGATATTAAAAGTACACATGCGCATACATACAAATGAAAAACCTTACGAATGTAAAGAATGTGGTAAACGATTTGCCCAACGTAGTCCTTTAGTTATACATTTAAGATATCATACTGGCGAACGACCCTATCAATGTCAACATTGTGGTGAAGCTTTCGTAAGCAATACATGTTTAAAAgctcatattaaaaaaaatcataattattatccaaatatAGTTTTAATCAAACAAGAGCCTCTGACTATTCTAGAAATAAAACAAGAAGAAAATGAAGAATATACACAATCATTTAATGGTTACCATTGTAATTattgtggaaaattttttttaactaaaaaatattggaaatcacaTTTTGTTGTACATTTAAAAGAAAGACCACATAGATGTGAAGTGTGCAACAAAACGTTTAAACGAAAGAATCAATTAACTTGTCACAAGCGTATCCATTGCACTGATCGAAAGAAATTCATATGTGATATATGCTGTAGAGAGTTTGCAAGTTGGCATTACTTCCAAGATCATCAAAGACGTCATAATGGATCAGCATACGAATGTAGTCAATGTGATAAATCATTCCTCACCAAATACcaattaaaaaatcatattgACGTCAAACATAACGATGTTCGATATAATTGTGATCAATGTACGAAATCATTTTTATCGAAAGAGTATCTGGTACGACATACACGTACACACGATCCAAGCTATGTTGCACCAAtattaaaatgtgaaatatgtGGTAAATCGTATAAATCGAGATCAGAATATAATCGACATGTGAAATCACATGAAGGTTTCCCATGTCATCAATGtggacaaatatttaaatacgcagagaaattaaaatatcatatgTATACGCATACGGGTGAGAAACCGTACAATTGTACGTTTTGTGAGAAAAAATTCCGAAGGAAAATCTATTTAACTGAACATTTACGAACACATACTAAAGAGCGACCTTTTGAGTGTTCGGTGTGTGGCAAAGGATTCACACAACGTAGTCCACTGAAAATACATATGCGTAATCATACTGGTGAACGACCATATAGTTGTATGTTGTGCGCCGAAGCGTTTATTAGTAAACATCGACTTACGTTACACTTACAAAATGTACACTGTTGTTCCGGATAA
Protein Sequence
MDRVHLQRKKHKCHLCETAFHSKNGLEIHINRDHHGSKYECTVCHKEFRTPSYLAFHERTQHDPTYVGKKFACDICNKEYSTNAILKVHKRKHNGSYIVCDVCGKRFLSLARLNLHASIHTNLKQYACKTCGKQFRNKPLLRNHERTHTGVKPFQCNICEKSFTQRGTLTIHLRCHNGQRPYSCNLCDKTFLTNTLLKVKIKSELNAVVNVEDNEKNAIIPSEQDNNSSDTDTSDDEPYIKPYKCKICKCDCKKDQSYTINNKTIFKCKICAKVFTSERGWQKDYHRHILPKNYTCDECDNSYTYRIELTKHKRFEHSDEFMCELCNKKYTTRRNLNMHMNRIHLKLKKYKCDTCEKSFHSKNGLEIHINRDHHGSKYECTVCHKEFRTPKYVELHKRSQHDPNYVGKKYTCDICNKQYTSNGMLKQHQRKHNGSYIVCDVCGKRFLSALSLNLHASIHTNLKQYACKTCDKQFRNKPLLRNHERTHTGVKPFQCNMCEKRFTQRGTLTIHLRCHNGQRPYSCNLCDKTFLTNTLLKNDDAVQDIEQDFVGAAEMIREEEVFKLKHSGYHCNICDKRFFQKELCRRHYLRMHIDSAGYPCEVCNKIFKYKWNLITHTKLHDLNRQKYTCEICNREYATPQSYKMHKNRHLKNYSFKCNQCGKGFYQKAELKIHTDAEHNAIRYQCQMCKKTFCSKGYLSIHYRSHDPNYVPPELKCHICGKTYHSTDLFRRHVKAHEGFPCELCGKTLTSQISLKNHILIHRGEKPLSCNVCSKRFNKRAILKVHMRIHTNEKPYECKECGKRFAQRSPLVIHLRYHTGERPYQCQHCGEAFVSNTCLKAHIKKNHNYYPNIVLIKQEPLTILEIKQEENEEYTQSFNGYHCNYCGKFFLTKKYWKSHFVVHLKERPHRCEVCNKTFKRKNQLTCHKRIHCTDRKKFICDICCREFASWHYFQDHQRRHNGSAYECSQCDKSFLTKYQLKNHIDVKHNDVRYNCDQCTKSFLSKEYLVRHTRTHDPSYVAPILKCEICGKSYKSRSEYNRHVKSHEGFPCHQCGQIFKYAEKLKYHMYTHTGEKPYNCTFCEKKFRRKIYLTEHLRTHTKERPFECSVCGKGFTQRSPLKIHMRNHTGERPYSCMLCAEAFISKHRLTLHLQNVHCCSG
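The coding and protein sequences can be checked against each other by translating the coding sequence. The sketch below assumes the standard genetic code, reading frame 1, and that the mixed upper/lower case in the coding sequence carries no coding meaning (it is upper-cased before translation). The function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks, ignore case
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last five codons of the coding sequence in this record.
print(translate("ATGGATCGTGTACATTTACAAagg"))
print(translate("TGTTGTTCCGGATAA"))
```

Because whitespace is removed before translation, the coding sequence can be passed in even if it has been wrapped across several lines.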

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-