Ccar053212.1
Basic Information
- Insect
- Catonia carolina
- Gene Symbol
- -
- Assembly
- GCA_035578175.1
- Location
- JAQMRL010000015.1:13258785-13287668[-]
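A location string of this shape can be split into its parts programmatically. The sketch below is a minimal parser, assuming the layout is `contig:start-end[strand]` and that coordinates are 1-based and inclusive; neither assumption is stated in this record. The names `Location`, `parse_location`, and `span_length` are hypothetical.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Assumed layout: <contig>:<start>-<end>[<strand>], strand being "+" or "-".
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


def span_length(loc: Location) -> int:
    """Genomic span in bases, assuming 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("JAQMRL010000015.1:13258785-13287668[-]")
print(loc.contig, loc.strand, span_length(loc))
```

The span covers the whole locus on the contig, so it is expected to be longer than the coding sequence whenever the gene contains introns.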
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The zinc finger is described by the pattern #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, numbers in brackets indicate the number of residues, and the positions marked # are those important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
#   of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to   acc
1   35     1e-07   1.6e-05   26.8   1.8         1      23       110     132       110     132  0.99
2   35    0.0038       0.6   12.4   3.9         1      23       138     160       138     160  0.98
3   35   2.7e-05    0.0043   19.2   1.0         1      23       166     188       166     188  0.98
4   35   0.00015     0.024   16.8   6.5         1      23       194     216       194     216  0.98
5   35   0.00036     0.057   15.6   4.4         1      23       222     245       222     245  0.97
6   35   1.8e-05    0.0029   19.7   2.6         1      23       252     274       252     274  0.98
7   35   1.3e-05    0.0021   20.2   0.2         1      23       308     330       308     330  0.98
8   35    0.0077       1.2   11.5   5.2         1      23       335     358       335     358  0.97
9   35   3.4e-07   5.4e-05   25.2   2.6         1      23       426     448       426     448  0.99
10  35     0.017       2.7   10.4   4.9         1      23       454     476       454     476  0.95
11  35      0.75   1.2e+02    5.2   1.7         1      23       482     504       482     504  0.95
12  35   6.9e-05     0.011   17.9   6.4         1      23       510     532       510     532  0.98
13  35   0.00035     0.056   15.7   5.0         1      23       538     560       538     560  0.98
14  35   0.00032     0.051   15.8   4.0         1      23       566     589       566     589  0.97
15  35   8.4e-05     0.013   17.6   4.8         1      23       596     618       596     618  0.98
16  35   2.6e-06   0.00041   22.4   1.6         1      23       716     738       716     738  0.99
17  35    0.0038       0.6   12.4   4.9         1      23       744     766       744     766  0.96
18  35      0.04       6.3    9.2   2.3         1      23       772     794       772     794  0.98
19  35   0.00013     0.021   17.0   2.6         1      23       800     822       800     822  0.98
20  35   0.00046     0.073   15.3   3.5         1      23       828     851       828     851  0.97
21  35   8.4e-05     0.013   17.6   4.8         1      23       858     880       858     880  0.98
22  35   2.6e-06   0.00041   22.4   1.6         1      23       978    1000       978    1000  0.99
23  35    0.0033      0.52   12.6   4.5         1      23      1006    1028      1006    1028  0.96
24  35   5.5e-05    0.0088   18.2   0.9         1      23      1034    1056      1034    1056  0.98
25  35   0.00046     0.073   15.3   3.5         1      23      1062    1085      1062    1085  0.97
26  35   0.00016     0.025   16.8   3.0         1      23      1092    1114      1092    1114  0.98
27  35   1.9e-07     3e-05   26.0   4.5         1      23      1213    1235      1213    1235  0.99
28  35    0.0033      0.52   12.6   4.5         1      23      1241    1263      1241    1263  0.96
29  35   1.8e-06   0.00028   22.9   1.2         1      23      1269    1291      1269    1291  0.98
30  35   0.00053     0.085   15.1   2.7         1      23      1297    1318      1297    1318  0.97
31  35   8.1e-07   0.00013   24.0   0.8         1      23      1324    1346      1324    1346  0.98
32  35   1.1e-05    0.0017   20.4   3.0         1      23      1352    1374      1352    1374  0.97
33  35   9.7e-07   0.00015   23.7   0.7         1      23      1380    1402      1380    1402  0.98
34  35   0.00041     0.065   15.5   3.5         1      23      1408    1430      1408    1430  0.98
35  35   3.7e-05    0.0059   18.8   2.2         1      23      1436    1458      1436    1458  0.98
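The residue pattern quoted in the Description entry, #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], can be turned into a regular expression for a quick scan of a protein sequence. The sketch below is illustrative only. It assumes the # positions may be matched by any residue, since the description does not say which residues are allowed there, and the names `C2H2_PATTERN` and `find_c2h2` are hypothetical. A regex scan is a much cruder test than the profile HMM behind the hmmscan hits listed above, so the two sets of hits need not coincide.

```python
import re

# #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], with "#" and "X" both treated
# as "any residue" (an assumption made for this sketch).
C2H2_PATTERN = re.compile(
    r"[A-Z]{2}"      # "#" then X
    r"C[A-Z]{1,5}C"  # first cysteine pair
    r"[A-Z]{3}"      # X3
    r"[A-Z]"         # "#"
    r"[A-Z]{5}"      # X5
    r"[A-Z]"         # "#"
    r"[A-Z]{2}"      # X2
    r"H[A-Z]{3,6}"   # first histidine and its spacer
    r"[HC]"          # final His or Cys
)


def find_c2h2(protein: str):
    """Return (start, end, match) tuples with 1-based inclusive coordinates.

    Matches are non-overlapping, as produced by re.finditer.
    """
    seq = protein.upper()
    return [
        (m.start() + 1, m.end(), m.group())
        for m in C2H2_PATTERN.finditer(seq)
    ]


# A short fragment taken from the protein sequence of this entry,
# padded with three alanines so the match does not start at position 1.
for start, end, motif in find_c2h2("AAAYKCNICEKEFTQRSNLERHRLTHTG"):
    print(start, end, motif)
```

Because `re.finditer` reports non-overlapping matches and the spacers are matched greedily, tandem fingers separated by short linkers should be checked by eye if exact boundaries matter.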
Sequence Information
- Coding Sequence
- atggaaggcggTGAATGTGTTAAAAGCGGGACAGAGACAATGTTATCAGGAACGCACAAAACTGCACTCATAAAATATTCCAATGTACCCAATCAACCCGATTTTGTGACTGATTTTGTTTTTCAGGACGCAGCCAGCGGTTCTGAAGAGGTGAAATTGGAAGTCGAAAAATCTAGTGGAGAACTGGTGTCGATTTATATAGACTATGGTTGTTCTAGCAGTGATAAAGACAATGATTGTGGTGCAACACGTGTCACCACAAGCAGTGATTCTGAATTAAATAGCGCAACGTATCGTAAGGTATCAACGCGTGGTGGTGTTAATCCATATAAATGTAACATCTGTGAAAAAGAGTTCACTCAGAGAAGTAATTTGGAACGTCACCGGTTAACACATACCGGTGCTAAACCGTTTAAATGCGATTTGTGCCGAAATGAATTCACGCAGAAATGTTCGCTGAATACTCATATGTTATTGCATACCGATTCGAAGCCGTTCAAGTGCGATTTATGCCGGAAAGAATTTACGCAAAAAGTGGCGCTGAGTAGtcacatgttatcacatagtggcgCTAAACCGTTTAAATGCGATTTGTGTCGTCACGAATTCAGACAGAAATCTCATCTAACCACGCACATGTTGTCACATAGTGGCGCTAAGCCGTTTAAATGCGATTTGTGTCGTGCCGAATTCAGACAGAAACATCATCTAGCCAAGCATATGTTGTCGTCACATAGtggtaatgttaatttatttacatgccATATATGTCCACAGCGGTTCAGCGACAAAAATGAACTGAACTACCATTTGTCAACGCATAGCGATGCCATGTCTTTTAAGTGCGATCTCTTCCAAAAGAACTTGTGTTCACAAACTGTTTTGAAAACTTATTTGTTGACGCATAATAGTGCTAAGCCATTTAAATGCGATGTCTGCGGAAAGGAATTTCCCATCAAAAGCTATTTAAATCAGCACTTATTAATTCATAGCGGTAATTCGTTCAAATGCGATTTTTGTGAAAATGAATACAAGCAGAAATGTTCTCTTACCAGACACATGAAATTACAGCATGATGACGCAGCCAGCGGTTCTGAAGAGGTGAAATTGGAAGTCGAAAAATCTAGTGAAGAACTGGTGTCGATTAATAGAGACTACAGTTATTCTAGCGGTGATAAAGACAGTCATTCTGGTGCAACACGTGTCACCACAAGTAGTGATTCTGAATTAAATAGCGGAACGTATCGTAAGATATCGCGTGGTGGTGTTACGCCATTTAAATGTAACGTCTGTAATAAAGAGTTCACTCAGAGAGGTAATTTGAAACGTCACCGGTTAACACATACCGGTGCTAAACCGTTTAAATGCGAATTGTGCCAAAAGGAATTCACGCAGAAACGTTCGCTGATGTGTCATATGTTTTTACATACCGATTCAAAGCCGTTCAAGTGCGATTTATGCGGGAACGAATATACGAAAAAATTAGCGCTGAGTTGTCACATGTTATTACATAGTGACATTATGCTGTTTAAATGTGATTTGTGCAGTCACGAATTCAGACGGAAATCTCATCTAACCACGCACATGTTGTCACATAGTGGCGCTAAGCCGTTTAAATGCGATTTGTGTAGTCACGAATTCAGACAGAAACCTCATCTAACCACGCACATGTTGTCACATAGTGGCGCTAAGCCGTTTAAATGCGATTTGTGTAGTCACGAATTCAGACAGAAACCTCATCTAGCCAAGCATATGTTGTCGTCACATAGtggtaatgttaatttatttacatgccATATATGTCCACAGCGGTTCAGCCACCAAAATGAACTGAACTACCATTTGTCAACGCATAGTGGTGCTAAGCCAAATGAATACAGTGATTGTGATGCAACACGTGTCACCATAAGTAGTGATTCAAAATTAAGTAGCTCAACGTGTCGTAAGGACGCAGCCAGCGGTTCTGAAGAGGTGAAATTGGAAGTCGAAAAATCTAGT
GAAGAACTGGTGTCGATTAATAGAGACTACAGTTATTCTAGCGGTGATAAAGACAGTGATTCTGGTGCAACACGTGTCACCACAAGTAGTGATTCTGAAATAAATAGCGCAACGTATCGTAAGATATCGCGTGGTGGTGTTACGCCATTTAAATGTAACGTCTGTAATAAAGAGTTCACTCAGGGAGGTAATTTGAAACGTCACCGTTTAACACATACCGGTGCTAAACCGTTTAAATGCGAATTGTGCAAAAAGGAATTCACGCAGAAAAGTTCGCTGATGTGTCATATGTTTTTACATACCGATTCAAAGCCGTTCAAGTGCGATTTATGCGGGAACGAATATACGAAAAAATCAGGGCTGAGTTGtcacatgttatcacatagtggcATTAAGCCGTTTAAATGTGACTTGTGCAGTCACGAATTCAGCAAGAAATCTGCTCTAACCACACACATGTTGTTACATAGTGGCGCTAAGCCGTTTAAATGCGATTTGTGTAGTCACGAATTCAGACAGAAACCTCATCTAGCCAACCATATGTTGTCGTCACATAGtggtaatgttaatttatttacatgccATATATGTCCACAGCGGTTCAGCCACCAAAATGAACTGAACTACCATTTGTCAACGCATAGTGGTGCTAAGCCAAATGAATATAGTGATTGTGATGCAACACGTGTCACCATAAGTAGTGATTCAAAATTAAGTAGCTCAACGTGTCGTAAGGACGCAGCCAGCGGTTCTGAAGAGGTGAAATTGGAAGTCGAAAAATCTAGTGAAGAACTGGTGTCGATTAATAGAGACTACAGTTATTCTAGCGGTGATAAAGACAGTCATTCTGGTGCAACACGTGTCACCACAAGTAGTGATTCTGAAATAAATAGCGCAACGTATCGTAAGATATCGCGTGGTGGTGTTACGCCATTTAAATGTAACGTCTGTAATAAAGAGTTCACTCAGGGAGGTAATTTGAAACGTCACCGGTTAACACATACCGGTGCTAAACCGTTTAAATGCGAATTGTGCCAAAAGGAATTCACGCAGAAAAGTTCGCTGATGTGTCATATGTTTTTACATACCGATTCAAAGCCGTTCAAGTGCGATTTATGCGGGAACGAATATACGAAAAAATCAGGGCTGAGTCGtcacatgttatcacatagtggcATTAAGCCGTTTAAATGTGATTTGTGCAGTCACGAATTCAGACAGAAACCTCATCTAGCCAACCATATGTTGTCGTCACATAGtggtaatgttaatttatttacatgctATATATGTCCACAGCGGTTCAGCCACCAAAATGAACTGAACTACCATTTGTCAACGCATAGTGGTGCTAAGCCAAATGAATATAGTGATTGTGATGCAACACGTGTCACCATAAGTAGTGATTCAAAATTAAGTAGCTCAACGTGTCGTAAGGATAACGCAAGCGGTTCTGAAGAGGTGAAATGGGAAGTCGAAAAATCTAGTGGAGAACTGGTGTCGATTAATATAGACTATAGTTATTCTAGCGGTGATATAGACAGTGATTGTGGTGCAACACGTGTCACCATAAGTAGTGATTCTGAATTAAATAGCGCAACGTATCGTAAGGTATCATCGCGTGGTGGTGTTACGCCATTTAAATGTAACGTCTGTAATAAAAAGTTCACTCAGAGAAGTAATTTGAAACGTCACCGGTTAACACATACCGGTGCTAAACCGTTTAAATGCGAATTATGCCAAAAGGAATTCACGCAGAAAAGTTCGCTGATGTGTCATATGTTTTTACATACCGATTCGAAGCCGTTCAAGTGCGATTTATGCGGAAAAGAATTTACGAAAAAATCGTCGCTGAGTAGTCACATATTATCACATAGTGGTGCTAAGCCGTTTAAATGCAATGTTTGCCAAAAGGAATTCAAGAAAAATTCGCTGAACAAACATATGTTATTGCATGCCGATTCGAAGCCATTCAAGTGCGATTTATGCGGGAAAGAATT
TACGCAAAAAGGGTCGCTGAGTAGtcacatgttatcacatagtggcgCTAAGCCGTTTAAATGCGAATTGTGCCGAAAGGAATTCACGCAGAAAAGTTCGCTGAAAAGTCATATGTTATTTCATGCCGATTCGAAGCCTTTCATGTGCGATTTATGCGGGAAAGAATTTACGCAAAAAGCAACGCTGAGTAGgcacatgttatcacatagtggcgCTAAGCCGTTTAAATGTGATTTGTGTCCGAACGAATACACACAGAAATCTCATCTAACCAAGCACAAGTTATCACATAGTGGCGCTAAACCGTTTAAATGCAATGTTTGCCGAAAGGAATTCATGCAGAAATATTCGCTGAACACTCATATGTTATTGCATACCGATTCGAAGTTTTTAAGTGCGATTTATGCGAGAAATAATTTAGCCAAAAATCGAATCTGA
- Protein Sequence
- MEGGECVKSGTETMLSGTHKTALIKYSNVPNQPDFVTDFVFQDAASGSEEVKLEVEKSSGELVSIYIDYGCSSSDKDNDCGATRVTTSSDSELNSATYRKVSTRGGVNPYKCNICEKEFTQRSNLERHRLTHTGAKPFKCDLCRNEFTQKCSLNTHMLLHTDSKPFKCDLCRKEFTQKVALSSHMLSHSGAKPFKCDLCRHEFRQKSHLTTHMLSHSGAKPFKCDLCRAEFRQKHHLAKHMLSSHSGNVNLFTCHICPQRFSDKNELNYHLSTHSDAMSFKCDLFQKNLCSQTVLKTYLLTHNSAKPFKCDVCGKEFPIKSYLNQHLLIHSGNSFKCDFCENEYKQKCSLTRHMKLQHDDAASGSEEVKLEVEKSSEELVSINRDYSYSSGDKDSHSGATRVTTSSDSELNSGTYRKISRGGVTPFKCNVCNKEFTQRGNLKRHRLTHTGAKPFKCELCQKEFTQKRSLMCHMFLHTDSKPFKCDLCGNEYTKKLALSCHMLLHSDIMLFKCDLCSHEFRRKSHLTTHMLSHSGAKPFKCDLCSHEFRQKPHLTTHMLSHSGAKPFKCDLCSHEFRQKPHLAKHMLSSHSGNVNLFTCHICPQRFSHQNELNYHLSTHSGAKPNEYSDCDATRVTISSDSKLSSSTCRKDAASGSEEVKLEVEKSSEELVSINRDYSYSSGDKDSDSGATRVTTSSDSEINSATYRKISRGGVTPFKCNVCNKEFTQGGNLKRHRLTHTGAKPFKCELCKKEFTQKSSLMCHMFLHTDSKPFKCDLCGNEYTKKSGLSCHMLSHSGIKPFKCDLCSHEFSKKSALTTHMLLHSGAKPFKCDLCSHEFRQKPHLANHMLSSHSGNVNLFTCHICPQRFSHQNELNYHLSTHSGAKPNEYSDCDATRVTISSDSKLSSSTCRKDAASGSEEVKLEVEKSSEELVSINRDYSYSSGDKDSHSGATRVTTSSDSEINSATYRKISRGGVTPFKCNVCNKEFTQGGNLKRHRLTHTGAKPFKCELCQKEFTQKSSLMCHMFLHTDSKPFKCDLCGNEYTKKSGLSRHMLSHSGIKPFKCDLCSHEFRQKPHLANHMLSSHSGNVNLFTCYICPQRFSHQNELNYHLSTHSGAKPNEYSDCDATRVTISSDSKLSSSTCRKDNASGSEEVKWEVEKSSGELVSINIDYSYSSGDIDSDCGATRVTISSDSELNSATYRKVSSRGGVTPFKCNVCNKKFTQRSNLKRHRLTHTGAKPFKCELCQKEFTQKSSLMCHMFLHTDSKPFKCDLCGKEFTKKSSLSSHILSHSGAKPFKCNVCQKEFKKNSLNKHMLLHADSKPFKCDLCGKEFTQKGSLSSHMLSHSGAKPFKCELCRKEFTQKSSLKSHMLFHADSKPFMCDLCGKEFTQKATLSRHMLSHSGAKPFKCDLCPNEYTQKSHLTKHKLSHSGAKPFKCNVCRKEFMQKYSLNTHMLLHTDSKFLSAIYARNNLAKNRI
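The relationship between the coding sequence and the protein sequence can be checked by translating the former. The sketch below is a minimal translator, assuming the standard genetic code applies and that the coding sequence starts in frame; the record states neither. The names `CODON_TABLE` and `translate` are hypothetical. Lower-case (soft-masked) bases and line breaks inside the sequence are normalised before translation.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_at_stop: bool = True) -> str:
    """Translate a coding sequence in frame 0.

    Whitespace is removed and case is ignored. Codons containing
    characters other than T, C, A, G are translated as "X". A trailing
    partial codon is ignored.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and stop_at_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the coding sequence in this entry.
print(translate("atggaaggcggTGAATGT"))
```

Comparing the output of `translate` on the full coding sequence with the listed protein sequence is a simple consistency check on the record.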
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -