Carc023244.1
Basic Information
- Insect
- Coenonympha arcania
- Gene Symbol
- -
- Assembly
- GCA_036785405.1
- Location
- CM072078.1:269716-296261[-]
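The Location value above packs a sequence accession, a coordinate range, and a strand into one string. A minimal Python sketch for splitting it into fields follows. The name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


# Matches strings shaped like "CM072078.1:269716-296261[-]".
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a 'seqid:start-end[strand]' string into its fields."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["seqid"], int(match["start"]), int(match["end"]), match["strand"]
    )


loc = parse_location("CM072078.1:269716-296261[-]")
# Span length under the assumed 1-based inclusive convention.
span = loc.end - loc.start + 1
print(loc, span)
```

The strand is kept as the literal `+` or `-` character so that downstream code can decide how to represent it.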
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The zinc finger is described by the pattern #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers give how many residues occur at that position (ranges in parentheses). Positions marked # are important for the stable fold of the zinc finger, and the final position can be either His or Cys. The domain is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
#   of  c-Evalue  i-Evalue  score  bias  hmm coord from  hmm coord to  ali coord from  ali coord to  env coord from  env coord to  acc
1   28  2.3       1.7e+02   3.8    0.3   2               23            39              59            39              59            0.96
2   28  0.45      33        6.0    1.7   1               20            67              86            67              88            0.94
3   28  0.45      33        6.0    1.7   1               20            163             182           163             184           0.94
4   28  0.45      33        6.0    1.7   1               20            259             278           259             280           0.94
5   28  0.45      33        6.0    1.7   1               20            355             374           355             376           0.94
6   28  0.45      33        6.0    1.7   1               20            451             470           451             472           0.94
7   28  0.45      33        6.0    1.7   1               20            547             566           547             568           0.94
8   28  0.45      33        6.0    1.7   1               20            643             662           643             664           0.94
9   28  0.45      33        6.0    1.7   1               20            739             758           739             760           0.94
10  28  0.45      33        6.0    1.7   1               20            835             854           835             856           0.94
11  28  0.45      33        6.0    1.7   1               20            931             950           931             952           0.94
12  28  0.45      33        6.0    1.7   1               20            1027            1046          1027            1048          0.94
13  28  0.45      33        6.0    1.7   1               20            1123            1142          1123            1144          0.94
14  28  0.45      33        6.0    1.7   1               20            1219            1238          1219            1240          0.94
15  28  0.45      33        6.0    1.7   1               20            1315            1334          1315            1336          0.94
16  28  0.45      33        6.0    1.7   1               20            1411            1430          1411            1432          0.94
17  28  0.45      33        6.0    1.7   1               20            1507            1526          1507            1528          0.94
18  28  0.45      33        6.0    1.7   1               20            1603            1622          1603            1624          0.94
19  28  0.45      33        6.0    1.7   1               20            1699            1718          1699            1720          0.94
20  28  0.45      33        6.0    1.7   1               20            1795            1814          1795            1816          0.94
21  28  0.45      33        6.0    1.7   1               20            1891            1910          1891            1912          0.94
22  28  1.2       91        4.7    4.9   1               23            1987            2009          1987            2009          0.96
23  28  2.4       1.8e+02   3.8    0.6   1               23            2033            2056          2033            2056          0.95
24  28  0.002     0.15      13.4   0.7   2               23            2067            2088          2066            2088          0.95
25  28  0.0051    0.38      12.1   1.4   1               23            2094            2116          2094            2116          0.95
26  28  0.00061   0.046     15.0   2.4   1               23            2122            2144          2122            2144          0.96
27  28  1.7e-05   0.0013    19.9   2.9   1               23            2150            2172          2150            2172          0.97
28  28  0.018     1.3       10.4   0.0   1               21            2178            2198          2178            2199          0.95
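One common way to read the domain table above is to keep only the hits whose independent E-value (i-Evalue) falls at or below a cutoff. The sketch below parses five rows copied from the table (domains 24 to 28) and filters them. The helper names `parse_rows` and `filter_hits` and the cutoff of 0.05 are illustrative assumptions, not values taken from this record.

```python
from typing import Dict, List

# Column names follow the table header; the identifiers are this sketch's own.
COLUMNS = [
    "idx", "of", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
]

# Rows for domains 24 to 28, copied from the table above.
ROWS = """\
24 28 0.002 0.15 13.4 0.7 2 23 2067 2088 2066 2088 0.95
25 28 0.0051 0.38 12.1 1.4 1 23 2094 2116 2094 2116 0.95
26 28 0.00061 0.046 15.0 2.4 1 23 2122 2144 2122 2144 0.96
27 28 1.7e-05 0.0013 19.9 2.9 1 23 2150 2172 2150 2172 0.97
28 28 0.018 1.3 10.4 0.0 1 21 2178 2198 2178 2199 0.95
"""


def parse_rows(text: str) -> List[Dict[str, float]]:
    """Turn whitespace-separated table rows into dictionaries of floats."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != len(COLUMNS):
            continue  # skip blank or malformed lines
        hits.append({name: float(value) for name, value in zip(COLUMNS, fields)})
    return hits


def filter_hits(hits: List[Dict[str, float]], max_i_evalue: float) -> List[Dict[str, float]]:
    """Keep hits whose independent E-value is at or below the cutoff."""
    return [hit for hit in hits if hit["i_evalue"] <= max_i_evalue]


hits = parse_rows(ROWS)
# 0.05 is an illustrative cutoff chosen for this sketch only.
kept = filter_hits(hits, max_i_evalue=0.05)
print([int(hit["idx"]) for hit in kept])
```

A different cutoff, or filtering on the conditional E-value instead, would only require changing the argument or the key used in `filter_hits`.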
Sequence Information
- Coding Sequence
- ATGTGCTTCGAGGAGTGCGGCTCGCGCGCCGAGCTGCTGCGACACTTCCAGTCGCACGGCGCGCAGGCGCAGGCGACGGAGctggcgccgccgcccgcgctcggCGACTCGCCGCGCTGTCTGCGATGTCAGAAGGTGGTGCAGGCGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGTGCTCCTGCTCATCATCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGGTGAGGCTGTGCAGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGGTGAGGCTGTGCAGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGGTGAGGCTGTGCAGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGGTGAGGCTGTGCAGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGGTGAGGCTGTGCAGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGGTGAGGCTGTGCAGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGTGCTCCTGCTCATC
GTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGGTGAGGCTGTGCAGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGGTGAGGCTGTGCAGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGGTGAGGCTGTGCAGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGGTGAGGCTGTGCAGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGGTGAGGCTGTGCAGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGGTGAGGCTGTGCAGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGGTGAGGCTGTGCAGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACA
CGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGGTGAGGCTGTGCAGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGGTGAGGCTGTGCAGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGGTGAGGCTGTGCAGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGGTGAGGCTGTGCAGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGGTGAGGCTGTGCAGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGGTACTCAAACACGGTGAGGCTGTGCAGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACCACCAGATACTCAAACACGGTGAGGCTGTGCAGTGCTCCTGCTCATCGTCTGTACTATACGTCGCAGTGGGCGGAGCACTGGCGGCGGCACTACGCGCGCGACGTGCGGCCCTACCACTGCGCCGTGTGCTTCCGCGCCTTCCGCGACCACC
ACCAGATACTCAAACACGGGTTGTCGCACACCGTGGAGGAGTTACGCGCGGACAACGACGGCGCGGAGGAACCGAATCCAGACGCGTCGCAGAAACGCTTCGCGTGCGACGTGTGCGCTGAAGGCTTCGTCTATATGAGGTGCCTGCTGTCGCACCGCACGCGCGCGCACccggaggcggcggcgcgcgcgctgcgactgcgctgcgcggcgtgcgcgcgctcgtTCGCGCACGCCAACTCGCTGCGGCGCCACCTGCGCGTGCACTCGGGCGAGCGCAACTTCCTGTGCAGCGTGTGCGGCAAGGCGCTGTCGTCGCGCGAGCACCTCAAGTTCCACCTGCGCATCCACTCGGGCTACAAGCCGCACGCGTGCGCCACGTGCGGCAAGGCGTTCGTCAAGAAGTGCAACCTCACGCTGCACGAGCGCGTGCACTCGGGCGAGAAGCCGCACGTGTGCTCGCACTGCGGCAAGGCCTTCTCGCAGCGCTCCACGCTCGTCATACACGAGAGGTACCACAGCGGCGCACGGCCGTACTCGTGCGAGCTGTGCGGGCGCGGCTTCGTGGCCAAGGGGCTGCTGTCCATGCACCTCAAGAGCGCGTGCGTCAACTTGCCCGCGCCAAACCAAGCTAGCCTCCCAGCTGCCGCCCTGGTGCGGTGCCACAGGTAA
- Protein Sequence
- MCFEECGSRAELLRHFQSHGAQAQATELAPPPALGDSPRCLRCQKVVQASQWAEHWRRHYARDVRPYHCAVCFRAFRDHHQILKHVLLLIICTIRRSGRSTGGGTTRATCGPTTAPCASAPSATTTRYSNTVRLCSAPAHRLYYTSQWAEHWRRHYARDVRPYHCAVCFRAFRDHHQILKHVLLLIVCTIRRSGRSTGGGTTRATCGPTTAPCASAPSATTTRYSNTVRLCSAPAHRLYYTSQWAEHWRRHYARDVRPYHCAVCFRAFRDHHQILKHVLLLIVCTIRRSGRSTGGGTTRATCGPTTAPCASAPSATTTRYSNTVRLCSAPAHRLYYTSQWAEHWRRHYARDVRPYHCAVCFRAFRDHHQILKHVLLLIVCTIRRSGRSTGGGTTRATCGPTTAPCASAPSATTTRYSNTVRLCSAPAHRLYYTSQWAEHWRRHYARDVRPYHCAVCFRAFRDHHQILKHVLLLIVCTIRRSGRSTGGGTTRATCGPTTAPCASAPSATTTRYSNTVRLCSAPAHRLYYTSQWAEHWRRHYARDVRPYHCAVCFRAFRDHHQILKHVLLLIVCTIRRSGRSTGGGTTRATCGPTTAPCASAPSATTTRYSNTVRLCSAPAHRLYYTSQWAEHWRRHYARDVRPYHCAVCFRAFRDHHQILKHVLLLIVCTIRRSGRSTGGGTTRATCGPTTAPCASAPSATTTRYSNTVRLCSAPAHRLYYTSQWAEHWRRHYARDVRPYHCAVCFRAFRDHHQILKHVLLLIVCTIRRSGRSTGGGTTRATCGPTTAPCASAPSATTTRYSNTVRLCSAPAHRLYYTSQWAEHWRRHYARDVRPYHCAVCFRAFRDHHQILKHVLLLIVCTIRRSGRSTGGGTTRATCGPTTAPCASAPSATTTRYSNTVRLCSAPAHRLYYTSQWAEHWRRHYARDVRPYHCAVCFRAFRDHHQILKHVLLLIVCTIRRSGRSTGGGTTRATCGPTTAPCASAPSATTTRYSNTVRLCSAPAHRLYYTSQWAEHWRRHYARDVRPYHCAVCFRAFRDHHQILKHVLLLIVCTIRRSGRSTGGGTTRATCGPTTAPCASAPSATTTRYSNTVRLCSAPAHRLYYTSQWAEHWRRHYARDVRPYHCAVCFRAFRDHHQILKHVLLLIVCTIRRSGRSTGGGTTRATCGPTTAPCASAPSATTTRYSNTVRLCSAPAHRLYYTSQWAEHWRRHYARDVRPYHCAVCFRAFRDHHQILKHVLLLIVCTIRRSGRSTGGGTTRATCGPTTAPCASAPSATTTRYSNTVRLCSAPAHRLYYTSQWAEHWRRHYARDVRPYHCAVCFRAFRDHHQILKHVLLLIVCTIRRSGRSTGGGTTRATCGPTTAPCASAPSATTTRYSNTVRLCSAPAHRLYYTSQWAEHWRRHYARDVRPYHCAVCFRAFRDHHQILKHVLLLIVCTIRRSGRSTGGGTTRATCGPTTAPCASAPSATTTRYSNTVRLCSAPAHRLYYTSQWAEHWRRHYARDVRPYHCAVCFRAFRDHHQILKHVLLLIVCTIRRSGRSTGGGTTRATCGPTTAPCASAPSATTTRYSNTVRLCSAPAHRLYYTSQWAEHWRRHYARDVRPYHCAVCFRAFRDHHQILKHVLLLIVCTIRRSGRSTGGGTTRATCGPTTAPCASAPSATTTRYSNTVRLCSAPAHRLYYTSQWAEHWRRHYARDVRPYHCAVCFRAFRDHHQILKHVLLLIVCTIRRSGRSTGGGTTRATCGPTTAPCASAPSATTTRYSNTVRLCSAPAHRLYYTSQWAEHWRRHYARDVRPYHCAVCFRAFRDHHQILKHVLLLIVCTIRRSGRSTGGGTTRATCGPTTAPCASAPSATTTRYSNTVRLCSAPAHRLYYTSQWAEHWRRHYARDVRPYHCAVCFRAFRDHHQILKHVLLLIVCTIRRSGRSTGGGTTRATCGPTTAPCASAPSATTTRYSNTVRLCSAPAHRLYYTSQWAEHWRRHYARDVRPYHCAVCFRAFRD
HHQILKHGLSHTVEELRADNDGAEEPNPDASQKRFACDVCAEGFVYMRCLLSHRTRAHPEAAARALRLRCAACARSFAHANSLRRHLRVHSGERNFLCSVCGKALSSREHLKFHLRIHSGYKPHACATCGKAFVKKCNLTLHERVHSGEKPHVCSHCGKAFSQRSTLVIHERYHSGARPYSCELCGRGFVAKGLLSMHLKSACVNLPAPNQASLPAAALVRCHR
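The pattern given in the Description, #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], can be approximated with a regular expression and applied to the protein sequence. This is a rough sketch and not the profile HMM search that produced the Hmmscan output. It assumes that both X and # stand for any residue, so the X3-#-X5-#-X2 stretch between the second Cys and the first His collapses to 12 residues. The name `find_c2h2` is hypothetical.

```python
import re
from typing import List, Tuple

# #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
# Assumption: X and # are both "any residue", so X3-#-X5-#-X2 becomes 12 residues.
C2H2_RE = re.compile(r"[A-Z]{2}C[A-Z]{1,5}C[A-Z]{12}H[A-Z]{3,6}[HC]")


def find_c2h2(protein: str) -> List[Tuple[int, int, str]]:
    """Return (start, end, text) for non-overlapping matches.

    Coordinates are 0-based with an exclusive end.
    """
    return [
        (m.start(), m.end(), m.group())
        for m in C2H2_RE.finditer(protein.upper())
    ]


# A 23-residue stretch copied from the protein sequence above.
peptide = "FLCSVCGKALSSREHLKFHLRIH"
print(find_c2h2(peptide))

# Control: the first Cys is replaced by Ala, so the pattern should not match.
control = "FLASVCGKALSSREHLKFHLRIH"
print(find_c2h2(control))
```

Because `finditer` reports non-overlapping matches only, and because # is not restricted to particular residues here, the hits from this sketch can differ from the domain boundaries in the Hmmscan table.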
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -