Cfor012559.1
Basic Information
- Insect
- Cylas formicarius
- Gene Symbol
- -
- Assembly
- GCA_029955315.1
- Location
- JARGXY010000006.1:35076686-35094243[-]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. Its two conserved cysteines and two conserved histidines coordinate a zinc ion. The domain follows the pattern #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, the numbers give the residue count (a range in parentheses where it varies), and the positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either histidine or cysteine. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
#   of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
1   48    0.0012     0.094   13.6   1.5         1      23        10      32        10      32  0.98
2   48      0.51        41    5.3   0.4         1      23        41      64        41      64  0.95
3   48   1.8e-06   0.00015   22.4   0.7         1      23        70      93        70      93  0.97
4   48       0.3        24    6.0   1.3         3      23       104     126       102     126  0.92
5   48       3.6   2.9e+02    2.6   1.7         3      23       133     154       131     154  0.95
6   48    0.0012     0.094   13.6   2.8         1      23       162     184       162     184  0.98
7   48   5.9e-05    0.0047   17.7   0.2         1      23       190     212       190     212  0.98
8   48   0.00066     0.053   14.3   0.2         1      17       218     234       218     235  0.94
9   48     0.013       1.1   10.2   0.1         1      23       290     313       290     313  0.97
10  48   0.00056     0.045   14.6   0.3         1      23       323     347       323     347  0.95
11  48   0.00047     0.038   14.8   1.3         1      23       354     376       354     376  0.94
12  48    0.0083      0.67   10.9   1.1         1      23       383     405       383     405  0.98
13  48   1.4e-05    0.0011   19.6   0.5         1      23       411     433       411     433  0.98
14  48    0.0041      0.33   11.8   1.0         3      23       516     537       514     537  0.95
15  48    0.0051      0.41   11.5   0.8         2      23       566     588       566     588  0.96
16  48      0.81        66    4.6   6.2         2      23       593     612       592     613  0.92
17  48       4.7   3.8e+02    2.2   1.9         1      23       618     641       618     641  0.79
18  48     0.042       3.4    8.7   1.0         1      23       648     672       648     672  0.95
19  48   2.8e-06   0.00023   21.8   3.4         1      23       677     699       677     699  0.98
20  48   0.00017     0.014   16.2   0.6         1      23       706     728       706     728  0.98
21  48     4e-06   0.00032   21.3   0.7         1      23       734     756       734     756  0.98
22  48   0.00038      0.03   15.1   0.4         1      23       762     785       762     785  0.95
23  48    0.0021      0.17   12.7   0.4         2      23       874     896       874     896  0.96
24  48       0.4        32    5.6   3.2         2      23       901     920       901     921  0.86
25  48      0.04       3.2    8.7   0.9         1      23       926     949       926     949  0.97
26  48    0.0016      0.13   13.1   1.1         2      23       957     980       956     980  0.90
27  48   4.7e-05    0.0038   18.0   4.7         1      23       985    1007       985    1007  0.99
28  48   0.00024     0.019   15.7   0.6         1      23      1016    1038      1016    1038  0.98
29  48   3.8e-06   0.00031   21.4   2.0         1      23      1044    1066      1044    1066  0.98
30  48   0.00038      0.03   15.1   0.4         1      23      1072    1095      1072    1095  0.95
31  48   0.00027     0.022   15.6   0.5         2      23      1191    1213      1191    1213  0.95
32  48     0.047       3.8    8.5   0.3         1      12      1217    1228      1217    1230  0.90
33  48      0.13        10    7.2   0.8         1      23      1243    1266      1243    1266  0.96
34  48    0.0019      0.15   12.9   5.9         3      23      1275    1297      1273    1297  0.90
35  48   1.1e-05   0.00091   19.9   3.1         1      23      1302    1324      1302    1324  0.97
36  48   0.00064     0.052   14.4   0.6         2      23      1334    1355      1333    1355  0.96
37  48   0.00024      0.02   15.7   1.9         1      23      1361    1383      1361    1383  0.93
38  48   6.2e-05     0.005   17.6   0.5         1      23      1389    1412      1389    1412  0.95
39  48    0.0094      0.76   10.7   1.3         1      23      1468    1491      1468    1491  0.92
40  48    0.0027      0.22   12.4   0.8         1      23      1497    1519      1497    1519  0.98
41  48    0.0033      0.26   12.2   0.4         2      23      1543    1565      1543    1565  0.95
42  48     0.047       3.8    8.5   0.3         1      12      1569    1580      1569    1582  0.90
43  48      0.17        14    6.7   1.4         1      23      1595    1618      1595    1618  0.97
44  48     5e-05     0.004   17.9   1.4         1      23      1625    1649      1625    1649  0.95
45  48   3.4e-05    0.0028   18.4   3.3         1      23      1656    1678      1656    1678  0.97
46  48   3.8e-05    0.0031   18.2   0.5         1      23      1684    1706      1684    1706  0.98
47  48     8e-05    0.0064   17.2   1.8         1      23      1712    1734      1712    1734  0.97
48  48    0.0038       0.3   12.0   1.0         1      23      1740    1763      1740    1763  0.94
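Each hit in the Hmmscan Out table above carries 13 whitespace-separated fields, so the values can be regrouped into records even when the text arrives as one run of tokens. The following is a minimal Python sketch of that regrouping, not part of the database's own tooling. The names `DomainHit` and `parse_domain_table` are hypothetical, and the i-Evalue cutoff of 0.01 is an arbitrary assumption chosen only for illustration.

```python
from typing import NamedTuple

FIELDS_PER_ROW = 13  # one hmmscan domain hit = 13 whitespace-separated values


class DomainHit(NamedTuple):
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_domain_table(text: str) -> list[DomainHit]:
    """Group a whitespace-separated token stream into 13-field domain hits."""
    tokens = text.split()
    if len(tokens) % FIELDS_PER_ROW != 0:
        raise ValueError("token count is not a multiple of 13")
    hits = []
    for i in range(0, len(tokens), FIELDS_PER_ROW):
        row = tokens[i:i + FIELDS_PER_ROW]
        hits.append(DomainHit(
            int(row[0]), int(row[1]),
            float(row[2]), float(row[3]), float(row[4]), float(row[5]),
            *(int(v) for v in row[6:12]),  # hmm/ali/env coordinates
            float(row[12]),
        ))
    return hits


# First three hits copied from the table above (header line excluded).
sample = (
    "1 48 0.0012 0.094 13.6 1.5 1 23 10 32 10 32 0.98 "
    "2 48 0.51 41 5.3 0.4 1 23 41 64 41 64 0.95 "
    "3 48 1.8e-06 0.00015 22.4 0.7 1 23 70 93 70 93 0.97"
)
hits = parse_domain_table(sample)

# Assumed cutoff: keep hits whose independent E-value is at most 0.01.
confident = [(h.ali_from, h.ali_to) for h in hits if h.i_evalue <= 0.01]
print(confident)
```

Of the three sample hits, only the third passes the assumed cutoff. A different threshold, or filtering on score instead, may suit other uses better.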
Sequence Information
- Coding Sequence
- ATGCTCACCCATTCCGGCGAGAGAAGGTTCAAGTGCACAGTGTGTGCGAGGGCTTTCGCTCATAAATCGGTGCTCAGCAAACATAAGTTGGTGCACAGCGACACCGCTTGCCACAGACGCTTCACTTGCGATGTTTGCGGATTGTTGTTCCTGGACAAAAAGTCCGTCGAGTTGCACAAACGTAACCTCCACCTGAACGACTACCCTTTCTGCTGCGACCAATGCGGAAAAGCGTTCGTTTCGAGGGGGGCATTGAACCGTCACATCAAGATGGCGCACGAACAAACGCCCGAGCCCGTCCTCATACCGTGCACCTTCGACAACTGTCCGGAGGTGTTCCGGTGTAAGTCGACTCTCAAGTACCACTTGGTGAAGCACGAGCTGAAAACCTACGGCTGCGAGTACTGCCTGGAGGTGTTCCTACACCCGCATCGATTGAAGGTCCACGTGAGCAAATATCACGGCGCTAAAACGAAGCCCCTTTTCACTTGTCACCTGTGCTCGAAGCAGGTGACGACCAAGTTGAGTTTACGGGATCACCTCCGGATTCATACCGGGGAGAAACCGTTCGTGTGCGAGTGGTGTGGACAAAACTTTGCCACCAGTCAGTACTTGAAGGTCCACAGGCTGGTGCACACGAAGGAGAAACCGCACACTTGTCAGATTTGCGGCAAGGGGTTCACCCAGAGGAGTTCGTTGAACGTTTGCGTCGCGAGCGAACTCTGTCCCTGTTTCGAATGCAAAGCGCGTTTCGTCTCGGAACCCTCGACAACCTCAAAGTATTTCGTAAGGTTCCCCGTCACCTTCAAGGTCGTGAACAATGACGTTAAGAAGGCTAGACCTACGACTGCGATCAAAAGGTACAAATACCCTTGCGTGCTGTGCCCTCGACGCTTCATGTACGAAGAGGGCCTCATCGAGCATAAGCGCTCCGCTCACGACGACGACGAAACTCCGCGACAGCTGTTCGCTTGTCCGCAGGATTCCTGCGGCAAAGTCTACCGGTCCAAGTGGACCATGAGGGAACACATGCTCAAGCACGACCCTTGCAACACTCAGTTCGCTTGCGCGCAGTGCTCGAAGGTTTATACCCATCCGAAATCTCTCAAGAGGCACGAGCTCGAACACAAGAACGGCAAGAAGCACTACACGTGCGGCACATGCTCGAAACGTATATCGAACCTGCAGAGTTTTAAGAATCACCTGTTGATTCACACTGGGGATTTGCCGTTCGTGTGCGAGCAGTGCGGCAGGCCGTTCCGTCTCAGGAGGAACTTGGTGACGCACATGCTCAGACATGCCGAAGGGAGGGGCTACGTGTGCAAGGTGTGCGGCAAGAAGATTTGCGTCGGAAACGCGCACTGTCGATGTTTCGAATGCAAAGTTTATTATAAGAGCGTGGAACCCAACGAACCCCAACACGACGCCCACGAGTTCGTACCTAGGGCGTGCATCAGGGTCATCGAGGACGCCAAGTCAGTCCTTACTTTATTCCCCCAAGCCCACGCAATTACCCGCCTCTGTTGCAGGGAGAAAAAATTCGGGTGTCCCATGTGTGCGAAACATTTCGTCAACAGGTCGTGCCTCGAAAGTCACGTGATCGTTGCGCACAACGACGCCAAACCTGATCTCGTACGCGACGATCCCGAGGACTATTATTTGCTTAATGATCCTGTCCGCGATCTCGCAAACCGGACGTGCCGAGAGTGCTCCAAAGTATTTCCCAACGTGCGCAGTTACAAGAGTCACGTGCGCAACGAGCACAAGACGCAGGATCAGTGCGAGTTTTGCGGCAAAATGTTCACCCATCTCCATGTGCATCTAAGGACGGTTCATCACAGGAAGTACAAGTTCAAGTGCGAGTTGTGCCATCGCGGTTTCCTCTACGAGAGCGGCTTCAACTTTCACGTGCAGACTGTGCACGAGAAGGACCGACAAGTGTTCGCGTGCTCGGAAAAAGATTGCGGCAGATGGTTCATGTCCAACCAGTCGCTCAAA
GACCACCGGAAAAAACATGAAGGCGTTCAGTACCGTTGCGAGCAATGCCCGAAGGTGTTTAACCACCCTTCGTCGTTCAAGCGCCACCTCAAAAATCATAAAAGCGACAAAAAGGTGTTCCCCTGCGACGTGTGCGAGAAGAAGGTCAGCACCTTGCATAGTTTAAGAGATCACATGCTGATTCACTCGGGCGATATGCCTTTCGTGTGCGAAAATTGCGGACGTCGGTTTCGCGTCAAGAAGAATCTGGTGACGCACATGGTGACGCACACGAAAGAGAGACCTTTCCCTTGTAAAGTGTGCGGTCTTAGTTTTACTCAACGCGGGCCGATGAAGAGGCACTTTTTGAAGGCCCACCCTGACGAAAAAGTTTGCGTCGGAAACGCCCACTGTCCATGTTTCGAATGCCGAGTTTATTACAACAGTGGCGTGCCGAACGACCACCACGGCGCCCATTTCTTCCTCCCAAAAACGTCCATCAGGATCATCGAGGATGCCAAGGAGAAATACAtagaaatgtataaaaaaagtcGCTTGGCAACCGCGCCCGACGTCAAACCCTTCGCGTTGGGTTTCGACGAGGAATCGTCCTGTGACCCGGTCCTTGAACTTGTCAACAGGACTTGCAAGGAGTGCGATAGGATTTTTCCCGATGTGCGCAAATACGAAAGACATGTGCGCAACGAACACAAGACCCAAGAGCAGTGTGAGTTTTGCGGAAAAATGTTCACCCACCTCCAGATACATCTAGCCACGGTCCACCACAAGAATTACAAGTTCAAGTGCAAATTGTGCGACCGTGGCTTTCTCTACGAGAGCGGCTTCAACTATCACGTGCAGACCGTGCACGAGAAGAGCCGACAGGTGTTTGCGTGCGCGGAAAAAAACTGTGGCAAATTGTTCACGTCGAACCGGTCCCTCAAAGACCACCGCCAAAAGCACGAAGGCGTCGAGTATTCTTGCGAGCAGTGCCAGAAAGTGTTTTGTCATCCGGTGTCGTTCAGGATTCACCTCAGAACTCATAAAGCCAAACCTGACAAGAAGATTTTTTCTTGCAACTTGTGCGATAAGAAGGTGAACAGCTTGCGCAGCTTGAGGGATCACATGCTGATCCATTCGGGTGGTTTGCCATTCGTTTGCGAACATTGTGGACGTAGGTTTCGCGTCAAGAGGAATCTGGTGACGCACATGGTGACCCACACTAAGGAGAGGCCTTTTCCTTGTAAGGTGTGCGGTCTGAGTTTTACTCAACGCGGACCGATGAAGAGACATTTCTTAAAGGCACATCCGGACGACAAAGTCTGCGTCGGAAACGCACACTGTCGATGTTTTGAGTGCCGAGCTTATTACAACAGCGGCGTGCCGAACAAACAGGACATCCATCCCTTCGTTGCGAAAACATCCGTCGGGATCATCGAGGACGCCGAGCAAAAAATACTTTCTCCTTTGTTCAGTGAGCGTAACCTTAAAAGACACGTTGCGATCACACGTGACGGTAAATCCGGTCGCCAACTCGAAGATCTAGACGACTATTACTCGATGAACGACCCCGCCCTCGAGCTGGCGAGTAGGACCTGCAGGGAGTGCGGCAAAGTCTTTTCCGATGTGCGCAGCTACAAGAATCATGTGCGCGACGATCACAAGGGGCTGCATCAGTGCGAGTATTGCGGCAAAATGTTCGCGCAGATCTACGGACACTTAAGCGCCGCTCATCGCAGAACATACAAGTTCGAGTGCGAGTTGTGCGACCGAGGTTTTCTCTACGAGGGCGCGTTGCACCATCACGTGCAGAGTGTGCACGAACAAGCCCGACAGGTATGTGCGTGTGCCGAAAAACACTGCGGCAAAGTGTTCACCTCGAATCAGTCCCTCAAGGACCATCATAAGAAACACGAAGGTGTCGAGTACTCTTGCGAGCAATGCCCGAAGATGTTTAACCACCCTTCGTCGTTCAAGCGGCACCTGAAAGATCATAGAGACAAACTCAACAAAAGGGTTGT
CCCTTGCGACGTGTGCGACAAGAAGGTGTGCAGCTTAAAAAGCTTGAGGGAACACATGCTGATCCACTCGGGCGATATGCCGTTCGTCTGTGAACACTGTGGACGTAGGTTTCGCTTGAGGAAGAATCTGATGACGCACAGTGTGGTGCACACTAAAGAAAGGCCTTTCCCGTGTAATGTGTGCGGCGTTAGTTTTACTCAACGCGGGTCGATGAAGAGGCATTTCTTGAAGGCACATCCAGACCAGAAGTTAGTGTGCATCGGCAACGCATACTGTCAATGTTTCGAATGCAGAGTTTATTACAAGAATTCCGATCCCTCAGAAAACTTCGACGGCACCCATCCTTTTGCGCCGCCAGCGTCGATCAGGGTCATCAATATCCAGTCGCCcagGCCGAAAAACTTCTTGTGCCGCATCTGCTGGAGACCTTTCGTTAAAAGGAAAAATTACGAAAGACATTTTAAGCTCGAGCACCCGGATGCGCACCCGTATCCCTGCGAACTGTGCACGAACGAATTCCACTCTTACGATCAGTTGTTGTGGCACATGCGCGAGCACGGGATCGTCCCCGACGGCGAGAATGACTACTATTCGATTCGTGATGCCACCATGCAGCTGACAAACAGGACGTGCAAGGAGTGCGGCATGACCTTTGTCGACGTGCGCAAATACAAGGAACATGTCCGCGAAGATCACAAGAGGTTGCACCAGTGCGAATATTGCGGAAAAATGTTCGCGCAAATCTACGGTCATCTAAGTTCTGCCCATCGCAGGAAGTACAAGTTCAAGTGCGAGGTGTGCGAGCGTGGGTTCCTCTACGAAGGCGCGTTGCACCATCACGTGCAGAGAATCCACGAAGAAGCCCGCCAGTTGTTCGCGTGTACGGCAAAGGGCTGCAAAAAAGTGTTCACGTCGAACTATTATCTCAAAGATCACCTGAAGAGACACGAAGGTCCCAAGGCCGAGTACTCTTGCAAGGAGTGCTCGAAGGTGTTTACCCACCCGATCTCGTTCAAGCGCCACCTCAAAGACCACAAAGACAAAAAGGTGTACCCTTGTGAGGTGTGCAACAAGACGGTGAACAGCGCTCAAAGCTTGAGACATCACATGCTGATCCACACGGGCGACATGCCCTACGTGTGCGAACACTGTGGACGCAGGTTTCGCTTGAGGAAGAATCTGGTGACGCACAGGGTGGTGCACACCAAAGAGAGGCCTTTCGCGTGCAAAGTGTGCGGTGCCAGCTTTACCCAACACGGACCGATGAAGAAACATTTCTTGAAAGCGCATCCCGATGAGAAATTTTAA
- Protein Sequence
- MLTHSGERRFKCTVCARAFAHKSVLSKHKLVHSDTACHRRFTCDVCGLLFLDKKSVELHKRNLHLNDYPFCCDQCGKAFVSRGALNRHIKMAHEQTPEPVLIPCTFDNCPEVFRCKSTLKYHLVKHELKTYGCEYCLEVFLHPHRLKVHVSKYHGAKTKPLFTCHLCSKQVTTKLSLRDHLRIHTGEKPFVCEWCGQNFATSQYLKVHRLVHTKEKPHTCQICGKGFTQRSSLNVCVASELCPCFECKARFVSEPSTTSKYFVRFPVTFKVVNNDVKKARPTTAIKRYKYPCVLCPRRFMYEEGLIEHKRSAHDDDETPRQLFACPQDSCGKVYRSKWTMREHMLKHDPCNTQFACAQCSKVYTHPKSLKRHELEHKNGKKHYTCGTCSKRISNLQSFKNHLLIHTGDLPFVCEQCGRPFRLRRNLVTHMLRHAEGRGYVCKVCGKKICVGNAHCRCFECKVYYKSVEPNEPQHDAHEFVPRACIRVIEDAKSVLTLFPQAHAITRLCCREKKFGCPMCAKHFVNRSCLESHVIVAHNDAKPDLVRDDPEDYYLLNDPVRDLANRTCRECSKVFPNVRSYKSHVRNEHKTQDQCEFCGKMFTHLHVHLRTVHHRKYKFKCELCHRGFLYESGFNFHVQTVHEKDRQVFACSEKDCGRWFMSNQSLKDHRKKHEGVQYRCEQCPKVFNHPSSFKRHLKNHKSDKKVFPCDVCEKKVSTLHSLRDHMLIHSGDMPFVCENCGRRFRVKKNLVTHMVTHTKERPFPCKVCGLSFTQRGPMKRHFLKAHPDEKVCVGNAHCPCFECRVYYNSGVPNDHHGAHFFLPKTSIRIIEDAKEKYIEMYKKSRLATAPDVKPFALGFDEESSCDPVLELVNRTCKECDRIFPDVRKYERHVRNEHKTQEQCEFCGKMFTHLQIHLATVHHKNYKFKCKLCDRGFLYESGFNYHVQTVHEKSRQVFACAEKNCGKLFTSNRSLKDHRQKHEGVEYSCEQCQKVFCHPVSFRIHLRTHKAKPDKKIFSCNLCDKKVNSLRSLRDHMLIHSGGLPFVCEHCGRRFRVKRNLVTHMVTHTKERPFPCKVCGLSFTQRGPMKRHFLKAHPDDKVCVGNAHCRCFECRAYYNSGVPNKQDIHPFVAKTSVGIIEDAEQKILSPLFSERNLKRHVAITRDGKSGRQLEDLDDYYSMNDPALELASRTCRECGKVFSDVRSYKNHVRDDHKGLHQCEYCGKMFAQIYGHLSAAHRRTYKFECELCDRGFLYEGALHHHVQSVHEQARQVCACAEKHCGKVFTSNQSLKDHHKKHEGVEYSCEQCPKMFNHPSSFKRHLKDHRDKLNKRVVPCDVCDKKVCSLKSLREHMLIHSGDMPFVCEHCGRRFRLRKNLMTHSVVHTKERPFPCNVCGVSFTQRGSMKRHFLKAHPDQKLVCIGNAYCQCFECRVYYKNSDPSENFDGTHPFAPPASIRVINIQSPRPKNFLCRICWRPFVKRKNYERHFKLEHPDAHPYPCELCTNEFHSYDQLLWHMREHGIVPDGENDYYSIRDATMQLTNRTCKECGMTFVDVRKYKEHVREDHKRLHQCEYCGKMFAQIYGHLSSAHRRKYKFKCEVCERGFLYEGALHHHVQRIHEEARQLFACTAKGCKKVFTSNYYLKDHLKRHEGPKAEYSCKECSKVFTHPISFKRHLKDHKDKKVYPCEVCNKTVNSAQSLRHHMLIHTGDMPYVCEHCGRRFRLRKNLVTHRVVHTKERPFACKVCGASFTQHGPMKKHFLKAHPDEKF
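The consensus pattern quoted in the Description field can be written as a regular expression and run against a protein sequence such as the one above. The sketch below is an illustration in Python, not the method used to produce the Hmmscan Out table. It assumes the # positions may be any residue, because the description only says they matter for fold stability, so the expression is more permissive than the Pfam profile. The names `C2H2_PATTERN` and `find_c2h2_candidates` are hypothetical.

```python
import re

# #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
# Assumption: '#' is treated like X (any residue).
C2H2_PATTERN = re.compile(
    r"."        # '#'
    r"."        # X
    r"C"
    r".{1,5}"   # X(1-5)
    r"C"
    r".{3}"     # X3
    r"."        # '#'
    r".{5}"     # X5
    r"."        # '#'
    r".{2}"     # X2
    r"H"
    r".{3,6}"   # X(3-6)
    r"[HC]"     # final position: His or Cys
)


def find_c2h2_candidates(protein: str) -> list[tuple[int, int, str]]:
    """Return (start, end, text) for non-overlapping matches, 1-based inclusive."""
    return [
        (m.start() + 1, m.end(), m.group())
        for m in C2H2_PATTERN.finditer(protein)
    ]


# First 40 residues of the protein sequence above.
n_terminus = "MLTHSGERRFKCTVCARAFAHKSVLSKHKLVHSDTACHRR"
for start, end, text in find_c2h2_candidates(n_terminus):
    print(start, end, text)
```

On this 40-residue fragment the expression reports a single candidate spanning residues 10 to 32, which coincides with the alignment coordinates of the first hit in the Hmmscan Out table. A plain pattern match of this kind carries no score or E-value, so it is best treated as a quick sanity check rather than a replacement for the profile search.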
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -