Clat022679.1
Basic Information
- Insect
- Cantharis lateralis
- Gene Symbol
- -
- Assembly
- GCA_963170105.1
- Location
- OY720628.1:37476681-37495061[+]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 45 0.45 23 6.4 2.7 1 23 76 98 76 98 0.98 2 45 1e-05 0.00052 21.1 1.8 1 23 104 126 104 126 0.98 3 45 0.013 0.65 11.3 5.0 1 23 132 154 132 154 0.98 4 45 0.0023 0.12 13.6 0.7 1 23 160 182 160 182 0.98 5 45 0.00043 0.022 15.9 1.8 1 23 188 210 188 210 0.98 6 45 0.0022 0.11 13.7 4.1 1 23 216 238 216 238 0.98 7 45 0.0017 0.088 14.1 5.4 1 23 324 346 324 346 0.98 8 45 0.00039 0.02 16.1 1.1 1 23 352 374 352 374 0.97 9 45 0.00039 0.02 16.1 2.6 1 23 380 402 380 402 0.98 10 45 0.0052 0.27 12.5 1.5 3 23 410 430 408 430 0.96 11 45 0.0022 0.12 13.7 1.4 3 23 438 458 437 458 0.99 12 45 0.064 3.3 9.1 1.6 2 23 577 598 576 598 0.97 13 45 0.00014 0.0074 17.4 0.6 3 23 606 626 604 626 0.98 14 45 0.0048 0.25 12.7 3.9 1 23 632 654 632 654 0.97 15 45 0.0032 0.17 13.2 2.1 2 23 661 682 660 682 0.97 16 45 0.0072 0.37 12.1 1.0 3 23 690 710 688 710 0.98 17 45 0.0031 0.16 13.2 2.2 1 23 716 738 716 738 0.98 18 45 0.00049 0.026 15.7 0.6 1 23 744 766 744 766 0.99 19 45 0.00047 0.024 15.8 3.9 1 23 772 794 772 794 0.97 20 45 0.0046 0.24 12.7 3.4 1 23 800 822 800 822 0.99 21 45 7.7e-05 0.004 18.3 1.4 1 23 828 850 828 850 0.98 22 45 0.033 1.7 10.0 4.1 1 23 856 878 856 878 0.98 23 45 0.001 0.054 14.7 5.1 1 23 884 906 884 906 0.99 24 45 0.0017 0.088 14.1 3.4 3 23 914 934 912 934 0.97 25 45 0.00098 0.051 14.8 4.3 1 23 940 962 940 962 0.98 26 45 0.0012 0.062 14.5 1.7 1 23 968 990 968 990 0.98 27 45 0.00079 0.041 15.1 1.8 1 23 996 1018 996 1018 0.99 28 45 0.026 1.4 10.3 1.0 1 23 1024 1046 1024 1046 0.97 29 45 0.0085 0.44 11.9 5.6 1 23 1052 1074 1052 1074 0.98 30 45 0.15 7.8 7.9 2.3 3 23 1082 1102 1080 1102 0.97 31 45 0.00063 0.033 15.4 3.9 1 23 1210 1232 1210 1232 0.97 32 45 3.2 1.7e+02 3.8 0.3 1 23 1238 1260 1238 1260 0.95 33 45 3e-05 0.0016 19.6 4.7 1 23 1266 1288 1266 1288 0.98 34 45 0.0024 0.12 13.6 2.0 1 23 1294 1316 1294 1316 0.98 35 45 0.0055 0.29 12.5 6.0 1 23 1322 1344 1322 1344 0.98 36 45 8.8e-05 0.0046 18.1 0.7 1 23 1350 1372 1350 1372 0.98 37 45 0.00027 0.014 16.6 1.2 1 23 1378 1400 1378 1400 0.96 38 45 0.0061 0.32 12.3 0.2 3 23 1408 1428 1406 1428 0.97 39 45 0.00027 0.014 16.6 2.5 1 23 1434 1456 1434 1456 0.98 40 45 0.073 3.8 8.9 1.2 2 23 1531 1553 1530 1553 0.97 41 45 1.5 77 4.8 0.5 1 23 1559 1582 1559 1582 0.96 42 45 0.037 1.9 9.8 0.4 1 20 1595 1614 1595 1616 0.96 43 45 0.023 1.2 10.5 3.4 2 23 1756 1778 1756 1778 0.92 44 45 0.42 22 6.5 0.1 2 23 1797 1818 1796 1818 0.85 45 45 0.0071 0.37 12.1 3.5 1 23 1829 1852 1829 1852 0.97
Sequence Information
- Coding Sequence
- ATGGATGTCAAGGTGGAAGTCATCGACAGCTCCAATGATATCGCAGCTAAAAGTTCAATTACCTCCGAAGTTGAAAACGAAGATGTGGATTTCGAATTGGAATCGTTGCATCATTCCATTGagataaaagaagaaattttaacgGATCCAGATTGCGAATTACAATTGCCTGCAATCAAATGTATTTCTGAGCAGAAATGGACCCGTCGTACCGGCAAGACGCTGTTCGAGTGTAATTTTTGCGACTATAAACGTTCAAGTGAACAGCATGTGATCGATCACAGTCGAACTCATACTGGGGAGAGACCTTTCGAGTGTTCtgtttgcgattttaaatctACCACCAAATCTAATTTGAAGAGACATTTACGAATTcacactggcgagaaaccatttgagTGTACTTTTTGCGGTTATAAATGTTCTCTGGCGCATCGTTTGAAGGCACATTTGcgaactcataccggcgagagACCATTCGAGTGTAGtctttgcgattataaatcttcGGCAAAATCTGATTTGAACGTGCATTCGCGAACTCATACCGGTGAAAAACCATTTGAGTGTACtctttgcgattataaatcttcGTCAAAGGCTGGTTTGCACAACCATATAcgaactcataccggcgagaaaccgttcgaATGTActgtttgtgattataaatgtgcTCAAAAAGCTCATTTGTTGAGTCATTCCCGAACTCATACCGAAACTAACGTGGCTAAAATTCCTGCCGACAACATGGAAGTCGAACTAGAAAATTTACACCATTCGGTGGATATTAAGGACGACAATCTGGTTTTACAATGTCAACAGGATGGTACCGAgatAACCCGCAGTCATTCCCATGATGGTTTGCAAATTAAGACGGAAGAAGAAATGGAAGTCAATTCGGAAGTTTTACATCATTCCATCGACCTTAAAGAAGAAACTGGGGTGGCCAAAACTGGCAAAAAGCTGTTtcagtgtaacatttgcgattataattGCTCGAAAAAATGGAACtttacaaaacatttatttactcATACGGGAGAGAAGCCCTATGCGTGTGACGTTTGCGATTATAGAAGCACTCATAAAGGAACTTTAAAGCGTCATATCCAAGCTCATACGGGTACGAAACCGTATGCTTGTAACTtgtgcgattataaatgtacttATAACGGAAACCTGAAAActcatttactaactcatatgGGCGAAAAACCGATAGCATGTGATCtgtgcgattataaatgcaaGGTAAAAGTACAGTTAACACGACATTTACTAAAGCATGCCGGTCCAAAGCCGCATGGATGTAACGTTTGCGATTTTAGATGTTATCGTTTGCCAAATTTGAATGCGCATTTGCGAACTCATACAGGCGAGAAATCGTTTGAGTGTTACGATTGCGGTTATAAATGTGCTCAGAAAAAAAAGTATCCGTGGTGGCAAGCTCCTGGTAGTGGATGTGACGGTGATAAAATTCCAGCCGACAACATGGAAGTCGAACTAGAAAATTTACACCATTCGattgatattaaagaagaCAATCTGATTTTCGAATATCAACAGGATAGTACAGagaaaaccCCAAGTCAGCTGgccagttattttaaaatcaaggtGGAAGAAGAAGTGGAAAACAACACGAAGGTTTTGCTTGATTCCATCGACCTTAAAGAAGAAGCTTCGATGACGGAAACTGAAGTGGATCGTGGCAAAAATCTGATTCAATGCagcatttgcgattataattGTACCGCACAGGCGCggtttaaaaatcatttattaaccCACACGGGGGAGAAGCCCTATGGGTGTGacgtttgcgattataaaactACTCAGAAGGCATATTTAAAACGTCATATCCAAACGCATACGGGCACGAAGCCGTATGCGTGTAATTTGTGCGAATATAAATGTTCGCATAAGGGAAACCTGAAAGCTCACTCGCTGACTCACATCGGCGAGAAAACGATCGAGTGTGATCAAtgcgattataaatgcaaaGTGAAAGTGCAGTTGACTCGTCATTTGCTGAAGCATTCCGGTCCGAAGCCGTATGGGTGTAACGTTTGCGATTTTAGATGTTATCGTTTGCCAAATTTGAATGCGCATTTACTAACTCACACTGGCGAGAAATCGTTTGAGTGTTACGATTGCGGTTATAAATGTGCTCAGAAATACCGCTTAAAagaacatttattaattcatacGGGTGAGAAACCGTACGAGTGTAATATCTGCGAGTATAAATGTGCTGTCCGAGGAGCCCTGAGAATTCATTTAAGAACTCATATGCTGGATAAACCGTACGAGTGTAACGATTGTGGATATAAATGTTCGCATAAGGGACAATTAAGGCAACATTCACTAATTCATTCGGATCATAAACCGTATGAGTGTTACATATGCGATTATAAAAGTACTCGGAAAGTATGTCTGCGACTGCATTTACAAACACATGCTGGGGAGAAACCGTTCGTGTGcgacatttgcgattataaatgtgccCGAAAGgcaatgttaaatttacatatgCGAAATCACACTGGCGAGAAACAATATCattgtaacatttgcgattacaaGTGTACCGTCAGGGCCCGCTTACGGGATCATTTATTAACACACACTGGTGAGAAACCATTTAAGTGTAACCTTTGCGATTATACTTGTTCGGTGAAAGGATTGTTACGACATCATCTGCAAACGCATACGGGTGTGAAACCGTATGGGTGCAAACTTTGCGAttacagatcttcaactaaaaGCCATTTAAAGCAACATTTACTGAGACACTCGGGCGAGAAACCGCACGAGTGTAAGTTTTGCGATTATAGATCTTCGAGCAAAACATCCCTGAAAGTCCATTTGAGAACTCATACGGGCGATAAACCGTACGAGTGCAATATTTGCCAGTATAAATGCTCTGCGAAGTCAACTCTGAAAGTCCATTTACTGATTCATTCGGGGGTGAAGCCCTACGAGTGTGAAACCTGCGGTTATCGGTGTGCACTCAAAACTCTCTTAAACCGTCATGCACGAACTCATGCGGAGGATAAACCGTATGCGTGCAACATCTGCGATTACAAGTGTGGAATAAAGGCAAGTTTAAAGCAGCATTTACTAAATCATTCTGGCAAGAAACCGCACGAGTGcaacatttgcgattttaaatgttctgcGAAGGGAAATTTAAGAcatcatttactaactcattcGGACGAGAAACCGTTCGGCTGTAACCTTTGCAGTTACAAGTCTGCGAGAAAGGAACATGTAAAGCGGCATTTACTGAGCCATACGGGCGAGAAACCTGTcgagtTCTTAATAGTCGGAGTTGTTGATGAGGCTGGCGTGGAGCTAGAAGGAGGTGGTGTTGTAGTGGGCGACGTCGTGGTGGTCGTTGGTGTAGTGGTCGTCGGTGTGGTGGTTGTCGGGGTGGTGGTTGTCGGTGTGGTGGTCGTTGGTGTAGTGGTCGTCGATGTAGTGGGAACTCCACccgttcaaattaaaatcgaacAAATGGAATTCAAGTTGGAAGAACTGCATCATTCGATTGACATTAAAGAAGAGCCTTCAAATGCAGTATTTCAGTCCGTCACAGAAACAAGCGAGAAATTATTCGCCTGTGAATATTGCGGTCTCAAATGTCAACGTAAGAAGAATTTACGCCTTCATTTACGAAGACACACCGGTGAGAAATTATTCAAGTGCATAGTTTGCGAATATAAATCTCCAATTAAAAAGGATTTAATGATTCATACCAGGATCCATACTGGGGAAAAACCCTTCcgatgtgaaatttgtgatttttgtttcGCACGAAaggatcatttaaaaaatcatctgGTAACCCATGTCGACGAGAAGCTGTTTTCGTGCGAAATTTGTCTTTCGAAATTTAAACGgaaggaatttttaaagagtCATTTGAGAATTCACACCGGCGAGAAATTATTCTCGTGTGAACTTTGCGACTACAAATGTACGCATAAAGTATCGCTGACGATTCACCGTAAAACCCATACCGGTGAGAAATTATTCACGTGCGAAATATGTGGGCATAAAACTGCTCGCAGTGGAGATTTGAAAGTGCACGTAAGAAGCCATTCCGGAGTGAAACCATTTTtgtgtgaaatttgtaattatagaACTTCACGTAGAGGAAGTTTAAACATTCATCGGAGAACTCACTCTGCCGAAAAGCCCTTCGGGTGTGATATTTGTGATTTTACTAGTAAACGGAAGGCAGCTTTACAACAGCATTATGAAATTCACACCGGCCGAACGCCGTTTGTGTGTAATTTTTGTGATCGTAAATGTGCAAATCGAGACAGCTTAAAAGTTCATCTAAGACTTCATAACGCAATGACGTCACCAAAATTGATTACCCGTGGCATTAAAAGGACAAGATCAGGTTTATCAAAGTCTGAAAGAGCGCAAGTTGTTGCGGACGGGTATTTTGCAGGTCCCTGGGGATCGTCTCTAACCGATATTGGGCTAAACTTAGTTCATCAGTATTTACAAGACGAACCAGCCGAAGAAATGGCCGACTACATGGAAGCCACAAGACTTCCTCTtatcaaatgtaaaatttgcgacTTCGTAACCGAATCTTCGAACGTTATGACTCATCACTACCGAACCCCCCACTTCATCCGGTCCAAATATCGGTGCAATTTTTGTTCGGAAAAATCGACCGATTCTCAAATCATCGAGGATCATATGCGAAGTATTCATGGAATCGACGTTCTTGTGGAAAACCCACCGAAAAAATACCAGTGCTCCAATTGCACTTTCGAGGGTACCCAAAAAGGGCAACTGAGACGGCACCTGGTTGAGTGCGAAAGGAAGTTTAACTTggctaaaaatttaacgattCCGTTAAGCTGCCCGTTACCAGCGTACATATATAACCGTTTTAAAAAGGAGAAGAAACAGAAAAATCAAGCAGTAGATAAGCCCAGCACTTCGGATGTGCAGCCCAGCACTTCGGACGTGCAGCCCAGCACTTCGGGTGTGCAGCCCAGCACTTCGTACGTGCAGCCCAGCACTTCGGATGTGCAGCCCAGCACTTCGGATGTGCAGCCCAGCACTTCGGATGTGCAGCCCAGCACTGCGGATGTGCAGCCCTCGCTCGTAGAATGCGAGATCTGCGACGACCACATCCTAAACATGGAGTCCTTGTACAACCACATGAAGGAAGTGCACAAAGAAAATGTCTCTaaagaaaaaatgataaaacagAAACCAGGGAATAGTTGCACCCAGTGCCACCTATGTTTCTATAGCAAAGGTGGGCTGGAGCGGCATTACCTTGGTTCTCACCGATTGCTGACGAGTAAGCTGAAGAAAGCTTACAAGAAGAAAAAAGATTGTGGTCGTTGCCCGATTTGTCGAGAGTCCTTTGGTAGATCTGGATTAATTGACCACGTGTCTGAAAAGCATGACAGAAACCTGCAAGTCTTAAGTATTTCTTACAGATGCAGAGAATGCGACGAAATTTTTACCCACATGAAAGATTTTTATAAGCACACCCGCTTAGAGCATGGTTCAACTCAATAA
- Protein Sequence
- MDVKVEVIDSSNDIAAKSSITSEVENEDVDFELESLHHSIEIKEEILTDPDCELQLPAIKCISEQKWTRRTGKTLFECNFCDYKRSSEQHVIDHSRTHTGERPFECSVCDFKSTTKSNLKRHLRIHTGEKPFECTFCGYKCSLAHRLKAHLRTHTGERPFECSLCDYKSSAKSDLNVHSRTHTGEKPFECTLCDYKSSSKAGLHNHIRTHTGEKPFECTVCDYKCAQKAHLLSHSRTHTETNVAKIPADNMEVELENLHHSVDIKDDNLVLQCQQDGTEITRSHSHDGLQIKTEEEMEVNSEVLHHSIDLKEETGVAKTGKKLFQCNICDYNCSKKWNFTKHLFTHTGEKPYACDVCDYRSTHKGTLKRHIQAHTGTKPYACNLCDYKCTYNGNLKTHLLTHMGEKPIACDLCDYKCKVKVQLTRHLLKHAGPKPHGCNVCDFRCYRLPNLNAHLRTHTGEKSFECYDCGYKCAQKKKYPWWQAPGSGCDGDKIPADNMEVELENLHHSIDIKEDNLIFEYQQDSTEKTPSQLASYFKIKVEEEVENNTKVLLDSIDLKEEASMTETEVDRGKNLIQCSICDYNCTAQARFKNHLLTHTGEKPYGCDVCDYKTTQKAYLKRHIQTHTGTKPYACNLCEYKCSHKGNLKAHSLTHIGEKTIECDQCDYKCKVKVQLTRHLLKHSGPKPYGCNVCDFRCYRLPNLNAHLLTHTGEKSFECYDCGYKCAQKYRLKEHLLIHTGEKPYECNICEYKCAVRGALRIHLRTHMLDKPYECNDCGYKCSHKGQLRQHSLIHSDHKPYECYICDYKSTRKVCLRLHLQTHAGEKPFVCDICDYKCARKAMLNLHMRNHTGEKQYHCNICDYKCTVRARLRDHLLTHTGEKPFKCNLCDYTCSVKGLLRHHLQTHTGVKPYGCKLCDYRSSTKSHLKQHLLRHSGEKPHECKFCDYRSSSKTSLKVHLRTHTGDKPYECNICQYKCSAKSTLKVHLLIHSGVKPYECETCGYRCALKTLLNRHARTHAEDKPYACNICDYKCGIKASLKQHLLNHSGKKPHECNICDFKCSAKGNLRHHLLTHSDEKPFGCNLCSYKSARKEHVKRHLLSHTGEKPVEFLIVGVVDEAGVELEGGGVVVGDVVVVVGVVVVGVVVVGVVVVGVVVVGVVVVDVVGTPPVQIKIEQMEFKLEELHHSIDIKEEPSNAVFQSVTETSEKLFACEYCGLKCQRKKNLRLHLRRHTGEKLFKCIVCEYKSPIKKDLMIHTRIHTGEKPFRCEICDFCFARKDHLKNHLVTHVDEKLFSCEICLSKFKRKEFLKSHLRIHTGEKLFSCELCDYKCTHKVSLTIHRKTHTGEKLFTCEICGHKTARSGDLKVHVRSHSGVKPFLCEICNYRTSRRGSLNIHRRTHSAEKPFGCDICDFTSKRKAALQQHYEIHTGRTPFVCNFCDRKCANRDSLKVHLRLHNAMTSPKLITRGIKRTRSGLSKSERAQVVADGYFAGPWGSSLTDIGLNLVHQYLQDEPAEEMADYMEATRLPLIKCKICDFVTESSNVMTHHYRTPHFIRSKYRCNFCSEKSTDSQIIEDHMRSIHGIDVLVENPPKKYQCSNCTFEGTQKGQLRRHLVECERKFNLAKNLTIPLSCPLPAYIYNRFKKEKKQKNQAVDKPSTSDVQPSTSDVQPSTSGVQPSTSYVQPSTSDVQPSTSDVQPSTSDVQPSTADVQPSLVECEICDDHILNMESLYNHMKEVHKENVSKEKMIKQKPGNSCTQCHLCFYSKGGLERHYLGSHRLLTSKLKKAYKKKKDCGRCPICRESFGRSGLIDHVSEKHDRNLQVLSISYRCRECDEIFTHMKDFYKHTRLEHGSTQ
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -