Mrel059751.1
Basic Information
- Insect
- Mantis religiosa
- Gene Symbol
- lin-13
- Assembly
- GCA_030765055.1
- Location
- CM060956.1:107405478-107433748[+]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 36 0.00015 0.067 16.0 2.8 1 23 159 181 159 181 0.97 2 36 0.046 21 8.1 3.2 1 23 210 233 210 233 0.93 3 36 6.5e-05 0.029 17.1 1.9 2 23 240 262 239 262 0.97 4 36 0.0034 1.5 11.7 0.8 1 23 326 348 326 348 0.95 5 36 0.0086 3.8 10.4 0.1 1 23 456 478 456 478 0.96 6 36 0.12 53 6.9 3.1 1 23 495 518 495 518 0.93 7 36 0.0016 0.7 12.8 0.1 1 23 581 604 581 604 0.96 8 36 0.26 1.2e+02 5.8 1.1 1 23 645 667 645 667 0.97 9 36 0.0059 2.6 11.0 0.2 3 23 734 755 732 755 0.94 10 36 2.7 1.2e+03 2.6 1.9 1 23 777 800 777 800 0.94 11 36 0.56 2.5e+02 4.7 1.7 2 19 807 826 806 831 0.84 12 36 1.1 5.1e+02 3.8 0.0 2 23 840 862 840 862 0.91 13 36 0.11 49 6.9 0.3 2 23 869 891 868 891 0.91 14 36 8.5e-06 0.0038 19.9 2.8 1 23 899 921 899 921 0.97 15 36 2.3 1e+03 2.8 0.0 7 23 985 1003 979 1003 0.82 16 36 0.00069 0.31 13.9 1.9 1 23 1006 1029 1006 1029 0.97 17 36 0.00016 0.072 15.9 0.5 2 23 1036 1058 1035 1058 0.96 18 36 0.011 4.7 10.2 6.2 1 21 1087 1107 1087 1108 0.95 19 36 8.4e-05 0.037 16.8 2.4 1 23 1128 1151 1128 1151 0.96 20 36 4.3 1.9e+03 1.9 0.1 3 23 1156 1178 1154 1178 0.88 21 36 2.9 1.3e+03 2.5 2.0 2 23 1216 1238 1216 1238 0.88 22 36 1.1 4.8e+02 3.8 0.2 2 23 1245 1267 1245 1267 0.95 23 36 1 4.4e+02 3.9 3.5 2 23 1276 1297 1275 1298 0.92 24 36 0.089 40 7.2 0.3 1 23 1302 1325 1302 1325 0.94 25 36 0.069 31 7.6 0.1 3 23 1380 1401 1378 1401 0.90 26 36 1.9e-05 0.0085 18.8 0.2 1 23 1430 1452 1430 1452 0.98 27 36 0.002 0.89 12.4 1.4 1 23 1463 1487 1463 1487 0.90 28 36 4.1 1.8e+03 2.0 1.8 2 23 1595 1617 1594 1617 0.91 29 36 1.7 7.4e+02 3.2 0.5 2 23 1624 1646 1623 1646 0.94 30 36 2.5 1.1e+03 2.7 1.2 1 23 1657 1679 1657 1679 0.92 31 36 0.013 5.8 9.9 5.4 1 23 1695 1717 1695 1718 0.95 32 36 0.032 14 8.6 2.3 1 23 1735 1757 1735 1757 0.98 33 36 2.3e-06 0.001 21.7 2.4 1 23 1841 1863 1841 1863 0.98 34 36 3e-05 0.013 18.2 0.3 1 23 1896 1918 1896 1918 0.98 35 36 0.79 3.5e+02 4.3 0.0 1 23 1929 1952 1929 1952 0.94 36 36 0.0047 2.1 11.3 2.9 1 22 1956 1978 1956 1978 0.96
Sequence Information
- Coding Sequence
- ATGGTAGAAGGAAAAGTGGTCCAAGGAGTGCTCCTCCAGCACAACAATGCACTGCCGCACACCAGCAATACGGCGTTCATTGTGCCATACCCACCATATTCACCAGATCTTGCTCCATTTGATTATGCTCTATTCGACATCATGAAGTTGCTGCATGGGAAAAGGTTCAGCAACAATGAAGAATTGCAGGCAGCCGTCCACGAGTTTCTCCCAGTACCATCAGAGGCAACCAATGGGAGATCAAGTAGCCCTGTATCAGGACCAGCAACTCCATTGGAAGATGATATAGATGAGGAGGTAATGGATGTGGAAATGGAAGACAGCCAGGATGTCATTATTAACAGCAATTCTGTGGAATGTGAAGTGCTAGAAGATGTGGTGATGGAGGATTCTCTCGATTCCAGAAAGGACAAGAAAACCCCAGCAGAAGTAAAGCCAAGCAGGTTTGAGGATATCCAGGAAGAGAGCAGTTTACATACATGCAGCATCTGCAGTAAGGCGTTCTCCAGTAAGGGTCATCTTGCATTACATAACAAGATACACAGCAAAGAATCTGCATCTTCGAAGTCAGCACCAGATGCCTCTGCTGCTGCTGGAACTTCCTGGAAGAGTGGTTACTATCGCCCATACCAGTGTGACCTGTGCAGCAAATCATATTCAACAGCTAAGCATCGTTGGGGCCATGTCAGTGTGTGTCATCGTGGTGACCCTCTGGTGACATGTCCAGTGTGCTCCAGAGTCTTTTCCACCTGCTCTAATCTTGGAGAACATAAAAGGACAAAACATGGTATGGAGGACGACAATGAGGTTCCAGAGGAAAGTGTACCAGTTGTCAAAGCAAAGTCACTATCTAGCACTAGACATATGTCACCAACATCCTTCCCAAGCAATAACATAATAAAATCATCATCACCCACAGCCAGTACCCAAGAAGCAGCACATATCTCTTTGGATGAAGTAAACACAGCAGTTCTACACACTTGTGCGCAGTGTCTCAAAGTTTTCAAAACAGAGGGTGAACTGGAACAGCATTCTGTCACACATTTACAGTCTTCTTCAAAATGTTCATCCCCAGAACCCAGTCGAAAATCGCTTTCTAGTTTTACCAGTGAAATGAGTTTACTGAAGCAAACTTTGTTGCAGAATCGTTCTCTTGTAACTGAAGGAAAGAAGAGGCAGTCATCTGTGGAAGAAACAGAATTCTCTCCAAACAAGAAAGCTGCAGTTGAAAATCCTACTGAGGCTGCTCCAACTGGTGGCAGCAGACgaaaaaattcaaagccacgaAAAATTGAGCAGCGCAATGAAGAGAATGCAAGAGAAAATTTAGAACTAAGTAGTGAAGACTCTATGGCAAATCAAGGTTATGTATGTTCTGTTTGTTCGCAGATGTTTTCGTCTATTATTGAGCTTCAAGAACACAGTATTATCCATGAAGAGGAAGCTGTGGCAACAACTTCAGAAAATTGTGTTGCTAAGCCTTTCTCTTGCCTACTCTGTGAAAAGGATTTCTCACTTCGCACATCACTGAGTAGACATTTTAATGCATGTCATGGCATTGATCCTGCTGAAATTATGGATATCTCAAAATACCAACGTACACCAAAGAAAACAGTTCCCCAAGCCAAAGAGGAGGAGCAGGTTTCTGCTATAGATGAAGATGATCTTTTTGATGTTGATGCAGAGGATGAAGGAAATGATACTAGAGgaacagaaaacaaatcacaacaGGGTGGCAGCTTTGCCTGTGAGGTTTGCAGCCGAGAATTTGGTGATCGTGCCTCATTGTGGTTACACTTACGCTACACACACAAAGAGTATGCTGCATATGCCTGTGGAGTTTGCTTACAGATTTGTGGAGATAACACACACATGTATGAACATTGGACTACCCATCATCCTCCAGATCGTGCATCCACAGAACAAAGAAGGTACAGCTGCCAAATGTGTGGACGACAACATGATTCACGCAAGAAACTTCTGGCTCATGTTCTCGTTCACAACTTGGATGATGGCTCAGGTGGCACATATGATCCAGAGCTGCTTGTCACTTTAAACACTGGCTTCTATAACTTTGAAGGGCGAGATAATCGTGACAGTCCTGCAACAgtcaatgaatttaattcaacATTGGAAGCAGAATTTGGCTCTGCAGAATGTAGTGGAGGAGAATACACTGGAAACCCAGCCTTGTTTGGCTGCGAGTTGTGTTACAAATCTtttcccactgaagatggtcttgTGAAACATAAAAAAGGAGCCCATAAGTTGGACCCTAGAGCAAGCTCTTCAACACCAAACAACTCAAAGGGTTCATATCAGTTGTATTTTGTTTGTGAGCTCTGTGGAAGCTCACATAGAAGCAAATCTGAGCGTTGGAGACATGTGTTCAGGACACATGGGGGAGAGCAGACACTAACTTGTGACAGACCAGGATGTGGAAAGGTGTTTCCAACTAGGGCTTTGAGACAAGACCACTGCACCAATCATCATCGTCTTCAAGGAGCAACACCTAATGTCTGTGAGATATGTGGCAAACTTTGGGGTACTCGTGTTGACTTTTGGAAACACCTCATGGGTGTGCATCCAGACTGTGTTCCTTTGACCTGTGGTGTCTGCCTCAAGATCTTTTGTTCAGTTCCAGAACTTCAGAATCATGTAAGTGCAAACCATATGCCTCTGACAGGTGGCGATTTTTGCTGTGACATTTGTGGTCGTCCTTACTCCAATAGATCAAAGCTTTCACGCCACCGCAGAATTCATTTGGTTGGTGAATCAGCAGAAATTTCATATTATCCAGAAATTCTGCTTGGTACTCCCGGAACAGTCACAAAACAATCTGCCTTTCCTACTTTAACACCAAAATTTGTCCAAAATTCTGCAGGCGAGTTGCAAGGACTTAAAAGCAAGCACAAATCTCCAGGATTACCACCCCTCTACTGTGATGCCTGTCCAGATGCAGTTTTTGCAGACATATTTAACTTAGCTGAACATCGTCGCACAATCCATGGCCTTTTTCCATGTGATCTTTGTTCCAAGTTTTATGGTCGCACATCGCATTTGTGGAAGCATGTCAAACGAGTACATAATAACCATCCTGAATTGACTTGCCCAATATGTCAACGTATATCTGCAAGTAAATCCCATTTGGAAACTCATATTGCTACCAAACATCGATCATCGGAAACAAAAACTTCTACTCCTAAGACACAAGAACTAGAAGTTCCTTCACTGAAAGCACCACATATACCGACATTGTATCCTTGCCATAAGTGTTCCAGGCGATTCTATAAACGACACCTACTGACAAAACATCAACTTCATTGTGCAGCAAGACGTCATCGTGAAAAACCAAAGTCAGAAGAAGGCAGCATAACGGAGCATCCTTGTGATAATTGTGCAAAAGTGTTTTCTTCTCATTCTAAGCTGCGAGATCATCAACGGGTAATCCATGCTCCACAGTATTGTGAACTATGCCCCAACATAATATATAATTCCAAAATAGATCTCATGCAGCATATTAAAGATTCTCATGCAGATGATCCTAATTTTGCATGCTCTCATCCTGGATGCAGTCGCACAATGAGATCACGTGCAGATCGTGACCGCCATGTAGCAATGCATCGTTCGTTCAAGTATCCACCTACATGTTCCCTTTGTGGCGAGAACTGTACAAATCGTGTTAAGACATGGAGTCATTACTCAGCTCGTCATAAATCTTCAATTTCAAATGTATGTGGTGTTTGTTATGATCATTTCCCTACCACTAGTGTTTTAATAGAGCATGTACGTACACTTCATACAAAAGTGCTTAGAAATAAGGATACCTGCAAAATCTGTGTGAAAACATACTCCAGTTTTTATAAAGTACTATTGCATCATGCAAATCATCACCCAGATTACCATATCTGTCGTCAATGTCTTGAGGTATTTCCTGAAAAGCAGGAACTAGAGAATCATGTTGAAAAAGAACATACAAaaaaagaggaggaagaggaggaaaacaatgatgaggaggaagaagaagataaaTCACAACAGACACCAAAGGATACTAGTGTTTCCGGTGAGAACGTTGAAGAACTGCCATTACCAATTGCTGTAACGATCAAGGAAGAGAAAAGGGATGAGATAGATTGTAGCCATTGTGGAGAAGAATTTGCCACAGAGGATGAACGTGACCAACATATTGCATTGACACATTCTGATGAAGGTACTGCAACTAAGCGCCGACTTAATTCTGATGATGGGGGCAATCCCAGTTCTGTTAAGCGTCCCCGTCGTTCATATGATTGTGACAAGTGTCCTGAGGTATTCCGCTCTCCGTCAGATCTAGTGGAGCACAAGAAGCGGCATCAAAAGACACCAGTGGCTGAACCTAAGCCTTATCATTGTAATCCGTGTGAAAAATATTTTAGCAACAAATCATCCTATTGGAGGCATATTAACTCTCCAGCACACCATGCAGCAAAGTTACAGATGCAACAGAAATTGTCTGATGAACATTCCACTCCAAGAAGTGAAGGTGAAGCAAATAAAAGTACTGCTATGGTGTACATCCCTGAATCCAACCTTAGGGAGACATTAATGAGAAGAGTAAAAATGGAGTTGGACATGCTGGATGATAGCGACCAAGATAGCAAGAGCTTAACTGAGAGTGAAGTTGATAAGGAGGCAAGTTTAGAGACTGAAAGCATCAAGGAGTCACCAGTAACTATAAAACCTGCTAGAAAGCGTCCAGAATCACGCAAAGTATATCGTGGGGGACCAGATGTGCCATGCCAGTGTCAATTGTGTGGAAAGCAATGGCCTGCAATGAAGCATTTGTGGCAGCATCTGATTCGTAATCACCGTCATGAGGCTGCAGTAACATGTGGAGTTTGCCTTGAGGTCTGCCAAGACTATCGCAGTCTCTCTGCACACTTAACCTCACAGCATCCCGATAACTTTATAGGTGAAGGAAACAACTTCACTTGTCGTATCTGTGGGCGCTACCACAATGCAAGATCAAAGCTTATTCAACATGCTACCATTCATATTATTCTAGGCCCAGCTCCAGAACAGCAACCACAATCAAATGTTCATTCATGCCGCTTCTGTTTCAAAGCTTTTACCAATGAAAATACCCTCCAAGAGCATGTTAAAATGCACCATTCGTCTGGGAGTGTATCAAAGGCTGTACGTCCTACTGAACCTTCCCCACATAAATGTGATGTATGCTTCAAAGTATGTGGCAATGAAGGAGCATTGGTAAGTCACAAGAAAAGTCACCAGAGGGGAACAACAGAATACAGCCGTCCAAAAGCAGATGTAGATGTGGTAGAGACAGACGACGATGACGAGAACGACAATGAATCTACTACGGAACATGGTAGCAAAGAAGAGAATGATGAAGAggaagatgatgatgaggaggatgatgacgatgatgatgatgacgacgatgaagatgaagaggatgacgatgaggatggaGAACAAAGTGAACAAACATCTGTTACCAGTGAACCCAAACAACTTTATAGGTGTGATATGTGCCCAAAGACATTTACAGATAAAGATAATTTCAAAGATCACACAAAGAACCATTCTCAAATTGCTGACTTGCATATGCAAATGAGGAGAAAAAGACCTGTAGTGGCTATACCTGAGTTGCCAAAACAAGAAGTAGAGAAGCAAACCTTTTATGTGTGTAGCATTTGTAGTCAGGTGTTCAAAACACAAGCAGCACTTTCAGTGCATCAAACCAAACACCAACAACAAAGATCGCTTAGCACTGGTAAATTCCCTTGTGCTTCTTGCAGTGAGGGATTTCCTACAATTGCTGCACTCTCAGAGCATGTTAAAGCAAGCCATGCCAAAATGTTCACCTGTACCAGATGCAACAAGAAATCCTTTTCTTCGTATGTGAATCTCACCAAACATTATCGCACTTGTCAACCACAGGCACTGCATTCTCAAAACAAAAGTCAGCTTGAGAGCTCAACTAACTTGTCTGAAGATAGTCAGGACTCTTCTGGAGATGAAACTAGCTGCACCAAAAGCAGCACACATACAACTCCAATATATTCCAACACAAGTATTAATAACAAGAGTATTGCAGACCAAGGAGCCAAAAGGACAGAAGCCCAGAAAAGTTTAGATTTAGTGATGGGAATCCCAAATATTGGTTTGAGAAGACTAAGTAGCAGTTCTGATGACTCAGATGAGGAAAACATCACAACACAGACTTCTACTATTAACAATATTGAAACGCTAACAGAAGTGGAATCTGACAACAAATTGCCAGAAAACAGCAATCCTGACGTTTCCGATCATGAAATCAAAACTTCTAATAATAGCGTGGTTTCCTCAGAAGCTAGAGAGGATTCTGCAAATATGGGTCAGGAAGAATGCATAACAGTCCCTGATAGTGCTAATAGCAACCAAGAAATTATAAAATCCGTGATTACACTAAGTGATGATGAATCTCAAACTTCTGTGAAATCTCTAAAGCAGAAGGAGGACATATTTGTGGTGTCACTGAGTGATGACTCAGAAATGTCCGTTAAAGACTGTAAAGAAGATACAGCAAGTGAAAACTGTGTAGTTTCAGAAGCGATTAATAGTGAATCAAAGCAAGAACAGACGGCACCAGAGGAATCTGAGCTGGAACCTGTTAGTGATAGGACTGAAAGTGATAGGGATATAATAGACGTGGCAGCACCTGATGCTGAAAAACAAAGTAGTGTATGTGAAGTTAAAACCGAATCAGAAGATGAACCAAAACCAGAAGGAACAAAGGGTAAAGACTGTTTGTTTTCTTTAGAAAATGCACTGGATAAGTTAGAGGCTGAAAATAGTGAAAAGTGA
- Protein Sequence
- MVEGKVVQGVLLQHNNALPHTSNTAFIVPYPPYSPDLAPFDYALFDIMKLLHGKRFSNNEELQAAVHEFLPVPSEATNGRSSSPVSGPATPLEDDIDEEVMDVEMEDSQDVIINSNSVECEVLEDVVMEDSLDSRKDKKTPAEVKPSRFEDIQEESSLHTCSICSKAFSSKGHLALHNKIHSKESASSKSAPDASAAAGTSWKSGYYRPYQCDLCSKSYSTAKHRWGHVSVCHRGDPLVTCPVCSRVFSTCSNLGEHKRTKHGMEDDNEVPEESVPVVKAKSLSSTRHMSPTSFPSNNIIKSSSPTASTQEAAHISLDEVNTAVLHTCAQCLKVFKTEGELEQHSVTHLQSSSKCSSPEPSRKSLSSFTSEMSLLKQTLLQNRSLVTEGKKRQSSVEETEFSPNKKAAVENPTEAAPTGGSRRKNSKPRKIEQRNEENARENLELSSEDSMANQGYVCSVCSQMFSSIIELQEHSIIHEEEAVATTSENCVAKPFSCLLCEKDFSLRTSLSRHFNACHGIDPAEIMDISKYQRTPKKTVPQAKEEEQVSAIDEDDLFDVDAEDEGNDTRGTENKSQQGGSFACEVCSREFGDRASLWLHLRYTHKEYAAYACGVCLQICGDNTHMYEHWTTHHPPDRASTEQRRYSCQMCGRQHDSRKKLLAHVLVHNLDDGSGGTYDPELLVTLNTGFYNFEGRDNRDSPATVNEFNSTLEAEFGSAECSGGEYTGNPALFGCELCYKSFPTEDGLVKHKKGAHKLDPRASSSTPNNSKGSYQLYFVCELCGSSHRSKSERWRHVFRTHGGEQTLTCDRPGCGKVFPTRALRQDHCTNHHRLQGATPNVCEICGKLWGTRVDFWKHLMGVHPDCVPLTCGVCLKIFCSVPELQNHVSANHMPLTGGDFCCDICGRPYSNRSKLSRHRRIHLVGESAEISYYPEILLGTPGTVTKQSAFPTLTPKFVQNSAGELQGLKSKHKSPGLPPLYCDACPDAVFADIFNLAEHRRTIHGLFPCDLCSKFYGRTSHLWKHVKRVHNNHPELTCPICQRISASKSHLETHIATKHRSSETKTSTPKTQELEVPSLKAPHIPTLYPCHKCSRRFYKRHLLTKHQLHCAARRHREKPKSEEGSITEHPCDNCAKVFSSHSKLRDHQRVIHAPQYCELCPNIIYNSKIDLMQHIKDSHADDPNFACSHPGCSRTMRSRADRDRHVAMHRSFKYPPTCSLCGENCTNRVKTWSHYSARHKSSISNVCGVCYDHFPTTSVLIEHVRTLHTKVLRNKDTCKICVKTYSSFYKVLLHHANHHPDYHICRQCLEVFPEKQELENHVEKEHTKKEEEEEENNDEEEEEDKSQQTPKDTSVSGENVEELPLPIAVTIKEEKRDEIDCSHCGEEFATEDERDQHIALTHSDEGTATKRRLNSDDGGNPSSVKRPRRSYDCDKCPEVFRSPSDLVEHKKRHQKTPVAEPKPYHCNPCEKYFSNKSSYWRHINSPAHHAAKLQMQQKLSDEHSTPRSEGEANKSTAMVYIPESNLRETLMRRVKMELDMLDDSDQDSKSLTESEVDKEASLETESIKESPVTIKPARKRPESRKVYRGGPDVPCQCQLCGKQWPAMKHLWQHLIRNHRHEAAVTCGVCLEVCQDYRSLSAHLTSQHPDNFIGEGNNFTCRICGRYHNARSKLIQHATIHIILGPAPEQQPQSNVHSCRFCFKAFTNENTLQEHVKMHHSSGSVSKAVRPTEPSPHKCDVCFKVCGNEGALVSHKKSHQRGTTEYSRPKADVDVVETDDDDENDNESTTEHGSKEENDEEEDDDEEDDDDDDDDDDEDEEDDDEDGEQSEQTSVTSEPKQLYRCDMCPKTFTDKDNFKDHTKNHSQIADLHMQMRRKRPVVAIPELPKQEVEKQTFYVCSICSQVFKTQAALSVHQTKHQQQRSLSTGKFPCASCSEGFPTIAALSEHVKASHAKMFTCTRCNKKSFSSYVNLTKHYRTCQPQALHSQNKSQLESSTNLSEDSQDSSGDETSCTKSSTHTTPIYSNTSINNKSIADQGAKRTEAQKSLDLVMGIPNIGLRRLSSSSDDSDEENITTQTSTINNIETLTEVESDNKLPENSNPDVSDHEIKTSNNSVVSSEAREDSANMGQEECITVPDSANSNQEIIKSVITLSDDESQTSVKSLKQKEDIFVVSLSDDSEMSVKDCKEDTASENCVVSEAINSESKQEQTAPEESELEPVSDRTESDRDIIDVAAPDAEKQSSVCEVKTESEDEPKPEGTKGKDCLFSLENALDKLEAENSEK
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01410930;
- 90% Identity
- -
- 80% Identity
- -