Alag030444.1
Basic Information
- Insect
- Acanthococcus lagerstroemiae
- Gene Symbol
- ZFAT
- Assembly
- GCA_031841125.1
- Location
- CM063113.1:62980810-62992404[-]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 36 6.2e-05 0.01 17.3 0.1 2 23 207 228 206 228 0.97 2 36 0.011 1.8 10.2 1.6 2 23 232 253 231 253 0.97 3 36 0.042 6.7 8.4 0.7 2 20 265 283 264 285 0.91 4 36 8.2e-06 0.0013 20.1 0.5 1 23 290 313 290 313 0.98 5 36 1.2 1.9e+02 3.8 0.1 2 13 335 346 334 355 0.78 6 36 1e-05 0.0016 19.7 0.2 1 21 361 381 361 382 0.94 7 36 0.00076 0.12 13.9 0.6 1 23 389 411 389 411 0.98 8 36 0.00043 0.069 14.7 0.1 1 23 421 444 421 444 0.96 9 36 0.058 9.3 7.9 3.6 2 23 453 475 452 475 0.91 10 36 9.6e-05 0.015 16.7 0.6 1 23 483 506 483 506 0.98 11 36 0.0024 0.39 12.3 0.4 2 23 538 560 537 560 0.94 12 36 0.0025 0.4 12.2 0.1 1 20 601 620 601 622 0.94 13 36 1.5e-05 0.0024 19.2 0.3 1 23 631 653 631 653 0.98 14 36 0.02 3.2 9.4 0.7 1 20 662 681 662 683 0.95 15 36 0.52 84 4.9 1.6 3 22 687 706 686 708 0.88 16 36 0.0035 0.56 11.8 1.4 1 20 753 772 753 773 0.96 17 36 2.3e-05 0.0038 18.6 2.4 1 23 781 804 781 804 0.98 18 36 0.00077 0.12 13.8 0.9 1 23 809 831 809 831 0.95 19 36 2.2 3.6e+02 3.0 2.0 5 21 840 856 837 857 0.91 20 36 0.17 28 6.4 0.2 1 20 1148 1167 1148 1170 0.93 21 36 2.4e-05 0.0039 18.6 2.1 1 23 1175 1197 1175 1197 0.97 22 36 0.00048 0.078 14.5 1.4 2 20 1206 1224 1205 1226 0.94 23 36 1.2 1.9e+02 3.8 0.4 1 23 1231 1254 1231 1254 0.92 24 36 0.19 31 6.3 0.1 2 23 1276 1296 1275 1296 0.91 25 36 9.6e-05 0.015 16.7 0.0 1 21 1302 1322 1302 1323 0.95 26 36 1.6e-06 0.00026 22.3 0.5 1 23 1330 1352 1330 1352 0.98 27 36 0.0055 0.89 11.1 0.4 1 23 1362 1385 1362 1385 0.97 28 36 4.8e-05 0.0077 17.6 0.9 1 23 1393 1416 1393 1416 0.98 29 36 0.00046 0.074 14.6 1.8 1 23 1424 1447 1424 1447 0.98 30 36 0.0014 0.23 13.0 0.7 1 20 1534 1553 1534 1554 0.96 31 36 0.084 13 7.4 0.1 1 23 1562 1585 1562 1585 0.89 32 36 0.012 1.9 10.1 0.5 1 20 1590 1609 1590 1609 0.97 33 36 0.42 67 5.2 1.6 1 23 1683 1706 1683 1706 0.91 34 36 0.006 0.97 11.0 1.0 2 23 1795 1817 1795 1817 0.96 35 36 0.017 2.8 9.6 0.1 2 23 1819 1841 1819 1841 0.97 36 36 0.4 65 5.3 0.2 1 20 1945 1964 1945 1966 0.94
Sequence Information
- Coding Sequence
- ATGCTGGAATTTTTGCGCGGCCGCTATCGTGTCGCCTTTCTTTATCTGAATTGTAATCCACATTACGTTGGTCTTTGTAGGTGCGATGGTCGCAAAATTGTGAACTGGGCGGCGACCGACGTGGGCGGAATGCGAAACTATTCGGTACCGGCGGCGGCGTTACTGTCATTTGCGTCCGTGAGTGGCAACGAATCGAACCATCTGGTCGATAGAAAATGCGCCGTATGCGGGGCCGAGACTTGTGCGAGGAGCGGCCGAGGCTTGAATGGCGAACTCCTGTGCCCAGGTTGTGTCGACGAGCAGACAAACGTGGCGAAAAAGCAACCATCCAAAGAGACCGAAGCGGCCGCAACGGAGGTGGTGGCTGTGTCCAACATTCCCGAACATTTACAACGTTTCGTCGTCGCGCGTCCCGGCCAACCGACCTATATGCGGTGCGATAGATGCGATTTCTTGTGCAGGCCATTCGctcttcttttcgaaaaacactaTCTCGCTCACGAGTTCGAAAATcaggcggcggcggtggcggtggcgtcAGGAGTAGTCGTCTCTCCCGACGAAGAGTTGGCCAGCAGCGTGACCGCGGCGTCGCATTTTGACGGCGACGATGATATTCAGTGCGCCATCTGCGATAAAGTGTTTCTGCGTAAAGGTAGCCTAATCGCGCATCTGCGGAGACACGACGAGCTGACGTGTAAACTGTGCTTCGCCACTTTCAAAAGCGTAGCGCTGCTCAAGTCTCATAAGAAGCGCCACGTACGTCGAAAGTTGATTAACAATACGCTGATTTATTGCGACGTAtgtaaaaagtatttcaaaacgAAGAAAGGCTACGCCTATCATCGAGAGAAAATCTGCGTGCAATACAAGTGTTACGTGTGCAACAAAACGTTCGCGATGAATTCGGCGCTGCAGGATCACGTGAGACGGGTACACGACGTAGAGCAGAACCTGTTGAATGAGGTGCGAGTGCCGCGCGACAACGAGACCCTCATGAGCGTGTGCACGATATGCGGCAAGGCGGTGACGCACGCGGTGCTGCCGCGTCACATGGTGCTGCACGACGAAGCGCGTCCGTTTGCCTGCGATTTGTGCGGCAAACGCTTCAAGCGGAAGAACTCGTTACAGGATCACATTATGATCGAAAtgggaatgaaaaattacgtGTGCGAAGTATGCGGTATGAAGTTCCTGAAACAGGGTTACCTTAATAAACACGTGCGCTACCATAAACTGAATAACGGCGAATTTCAGGGTTTCCAATGCGAAGTGTGCGGCAAACGTTTCCCGGAAAAATGGCGACTCGGCGTCCACCAGCGAGCGGTTCACAAAGGAGGAGAATTCGCGAGCTGTCGGTGCGACATATGCGACAAGAAGTTCGCCGAACGCTGGATGGTGCGCAAGCACAAGCATCTCGAGCATAACGGCGAAGAGTTTAAAGAGTACGTATGCGACATTTGCGGTAAAACGTTCCAGGAGAAGTGGATGATCAAGTCGCACCAGCGAAGCACTCATCGCGGAGGCGTTTTGAGACAGTATTGCGACTTGTGCGGCCGTGTCGACCATACGGCCAAAGACTGCGCACACCGACACGCTCTGCAGGAGGTGAAATGCGTATTGTGCGGCGAGTCGTTCTCGTCCACCTACTTTCTTAAGGAACACGTAACCTCGACTCATAGTTTTGAAGTGAAAGCCGAAATGAAGGCCGGCGATGGCGTTAACGATGGCGACGGCGACGGTGACGACGTGAAGTCATTTTCGGGCAACGACCCGTTAGAACGGAAGCAGAATGTGCTCTACATTTGCGCCAACTGCGGTCTCCATTTCGCCGATAAAAACGACTTGAGAGCGCACGAAATGGAGTTCAAGTGCGGCGGAGAAGGCAAGTACGAGTGCACGGAATGCGGCATCGTGTACACGTCCAAATTCAGCTTGAGCAACCACTTGATGATTCACAAGAAAGCGTCGACGCTAGTTCCGTTCAAATGCGCTTTATGCGACAGGGACTTTTTTACGCATGAGGCGCTAGACAAGCATCAGTCGTTGCCGAAGATCTGCTTGCTATGCGACAAGGTTTATCCGTGCAACGATCACTTGAAAAATCACGTGTTCGGCGAACACAAAGACGACGAGCTCGACGACACGAGCTCTACAAACGTGGTAGAGTCGCACGACGACGACgatggcggcggcggcggcggtggtggtggtggtggtggtggccaACTGGACGCGGTGGCAGCGGAGAAACAGTTCGAGTGTAAGCTGTGTCGCAAGAAGTTTGCCCGCAAGCAGGCCATGAAGAATCATTTATTCGCCGAGATGAACTTACGTCGGTACGTGTGCGAATTCTGCGACAAAAGTTACAACTACTATTCCCATCTGAAAGAGCATATCATCACCAATCATGGCGAAAAGGAGTTTGTGTGCGGCTATTGTGGCAAAGACTTTCCGACCAAAAAACGATTTCGCGATCACGTTACTCTACACTCGGAAGAGAAACCATTTCGCTGTACCTGCGGATTAGCTTTTAAACTCAGTCGATATCTAAACAAGCATAAGAAACATTTAAAGCTCAACTGGAACATTTTAATTCGAATCAAAAAGGTTGATCGATTCGACTACTGTTACCCGTGTAAAGCATCGCGAAAGCCGCAGTTGCGTGATCGACTTTTGTATTCGACCACGAATGGCCGGTTTCAATTGACAGTGAAATTCGATCTCACCAAGTTTCTGGAACCAATAATACGACGTACGTATCGGTTTGCCTTTAATCTTGCGACTTTGATCGGAATGGTATATAAGTATAAACGAACGATCATTAGATTGCGCCAACATTTAGCTGGCTGGCGAGTCGAAAAGTGCGAATTTACTCGGtgGTCTAATCGCAGCAATCAGTACGTGCACGTGGACGTGGAAAGCTCATCTAGTTCCACTCTCGCGCCTGCCAGTAATAGCCCATTCGTAGCCGAAGATGACTTACGAATCGAACATACCGTATCCGAGTACGAGCTGGAACAGGAGCTCGAGTGCGATCGCGAGGAGGTCGAGGTCGATCTGTTTTGTGTAAGTAAACGCGGTGACGAACGCGAGAATATGAGCAACGAGAGGCTTGCCAAATCCGTTAGTCCAACTGTTGACCAAGCTGTCGTCGACGAAGCCTCTTCGTTTCGCGCTCCCGCCTCCTGCTCCTCGACTAATGACGCTGTCGACTACGATCGGTTCGTTGTCGTGTATAGCAACGGTATAAAGTATATGCAATGCGACCGGTGCACATTCATATGTCGTCCGCTAATCAGCGTATTTTCGAATCACTATCGTTCGCACGCTGACACCGGCAcgtccgccgccgccgccgccgcctccgagCATCCGAGGTGGTCGCCTGATACAGCCGAAGAGCGCCATCAGCGTGTCGCTGGTAACAGTCGCACGTCGTTTCGGTGCGCCGTATGCGGCAAACTGCTGGCCGACGAAGCCGCGCTAACGAACCACGCCAAGTGTCACGAGCCGCGAGTATTCGCGTGCAAACTGTGTGGCTTGAAATTTCGCACCAAATTCGAGCTACAGTCGCACCGACAGACTCATCCGGTGAGGTCGAGCGTGCCCATACGGTGCAATTTCTGCGGCAAGGATTTCAAAACAAAGTCCGGTTACAATCACCATCGCGCCAATGTTTGCGTCAAATACCGATGTGATTTGTGCGGCGAGCTATTTCTGGTCAAGACGTTGTTTTTGGATCATTCGAGGAAGGTGCATAACGTCGAGGTGGTCGACCTGCTGTCGGGTCAGCCGCAGAAGAGAGAACCGGCCGCGGCGAGCGTGTGTCCGGTTTGCGGTAAGAGTATCAGCGAGAACGCCATGAGCCGCCATTTGCAACTGCATACCGAAGAGAAGCCGTACGTGTGCGACCTGTGCGGCAAACAGTTTCGGGTCAAATGGTCGCTGCGGGAACACATTATGGTCGAAATCGGCATGAAAGACTACGTATGCGAGATTTGCGGCAAGAAATTCGTCATACAGGCTTACCTCAACAAACATATGCGTTTTCACTTAATGTCCGATGGCAAACTGGAAGGTTATCAGTGTGAAATGTGTGGAAAAAAATACGCCGAAGAATGGCGCGTCAAAGAGCACCAGCGCAATTCGCACCAAAATCGCCAACCGCGAAAATACAAGTGCGACGTGTGCGACAAAAAGTACTCGCGTAAGTGGTTAGTGCGGTCGCACAAGAGGGTCGCGCATAAAGAATTTTTCGACGACGAGTTTCAGTGCGAGCACTGCGATAAGAAGTTCGTCGAGAAGTGGATGATCAAATTGCATCTGCAAACGTTGCATAATGTGCGTTTTCGACAGCAGCGAGAACGACAAATTGAAGATGGACAAACTGCTGCTGCTGCCGCTGCCGCCGTCGTCGTTGGAACGACCGACGAGCTAGAAAAACTCCTCGAATATAATAACGctatcaaaaacgaattaacggcggcggcggcgggtGCTGctgctactactactactactactactactattacCTCCGATGCTGTTTCTGGCGAAGAAAAAGACGAGCAACAGCGGGCCGCTCCCCGCAAATCGTTCGAATGCGAACTGTGTAAGAAAACCTTTGCCACCAAGCAGGGCATGAAGAATCATTTGTATGCCGAGTTGAATTTACGAAAGTACGCGTGTGACGTTTGCGGCAAACGATATAACTGGTGGATGGGCTTAAAAGAGCATTCGATTATGAGCCACGGTGAAAAGGAGTTCGTGTGCAGCGTATGCGAAAAAGACTTCCCCACCAAGAAGCGTTACCAGGATCACATGTGGCTACCGCGAAGTAAAAAACCATTCCTCGGAGATACCAGCGTTTCGGCTTTGCGACCACCAGCTCAGTGTGTACCGGACGGCAACGATCTCGATAcaaatacgagtacgagtaaCGCCTCCGCCGTCAATCGGCCAGTAGGCCGCCTAACGACGACTGCCGGAACGAGTGAAAGCCGTAACAAGTTTCACATCACCGAGATCGACGACGTCCAGTATTTCAAGTGCTCCGTCTGTTCGTTCGTCAGTCGATCTTACAATTCTAATTTCACGGCTCATCTCAAATTGCATCCGACCGAAATTGCTGCCGTCGCGTCCACTGCCAGTAGTAAGTCGACGAATGCGCAGAAGAAAAATGTCTCGACGAACCCGCTTACATCGGTGATGGTCATACAGGTGAAACCCAAAAAGAAGCGAGTGGCTAGTCGATCGGCAGAAACGGACGACGAGATACGAATGGAaacaaaaacgcgaaaaaaaatgACTGGCGGTGACGATGATACGCGAAAAGCAGCCGATGATGCGAACGCCGATCGGAATAGAGTGCCGGACCGGACTTGCGAACAATGCCGAATGTCTTTCAAGTCTTCCTCTGGCTACGGCAACCATCTTCGTCTCAGCCATCCGAAATGCGGTATCTGTAATCACGCGCTTCCCTCGCCCGGCTATCTAGTCGACCACTTGCGCTCCGTGCACGCCCTCAAAATACCCGGCGCCGACGGCGATTCTTCGCGCACGTCGTCGGATCAGTCGCTGCAGCGCGGAGCCTCTGCCGTAAATGGCCAACGCGGACCGTTGTCTCCTGTTCGGAATGCGGAATCGACAGTCGCCGACGGCGGCGTTGCTGGCAACTGTGACGACGCAACGCACTCCGCTCTGAACGGCGACCGCGACGAAAGTACAGTCACGGTGGAAGGACGTCGAAAGGAGGCGGCTGCGGCTACGGCTAAGGTAGCTCACGACCAGTCCGATGCGTCGCCGCGAAAATTGGCGGAAGATGGCGCGTATCCGTGTCATTTATGCTCGCAGTTGTTTCCCTACGAGTGGATGTTGAATATTCATCGAACGAACTAG
- Protein Sequence
- MLEFLRGRYRVAFLYLNCNPHYVGLCRCDGRKIVNWAATDVGGMRNYSVPAAALLSFASVSGNESNHLVDRKCAVCGAETCARSGRGLNGELLCPGCVDEQTNVAKKQPSKETEAAATEVVAVSNIPEHLQRFVVARPGQPTYMRCDRCDFLCRPFALLFEKHYLAHEFENQAAAVAVASGVVVSPDEELASSVTAASHFDGDDDIQCAICDKVFLRKGSLIAHLRRHDELTCKLCFATFKSVALLKSHKKRHVRRKLINNTLIYCDVCKKYFKTKKGYAYHREKICVQYKCYVCNKTFAMNSALQDHVRRVHDVEQNLLNEVRVPRDNETLMSVCTICGKAVTHAVLPRHMVLHDEARPFACDLCGKRFKRKNSLQDHIMIEMGMKNYVCEVCGMKFLKQGYLNKHVRYHKLNNGEFQGFQCEVCGKRFPEKWRLGVHQRAVHKGGEFASCRCDICDKKFAERWMVRKHKHLEHNGEEFKEYVCDICGKTFQEKWMIKSHQRSTHRGGVLRQYCDLCGRVDHTAKDCAHRHALQEVKCVLCGESFSSTYFLKEHVTSTHSFEVKAEMKAGDGVNDGDGDGDDVKSFSGNDPLERKQNVLYICANCGLHFADKNDLRAHEMEFKCGGEGKYECTECGIVYTSKFSLSNHLMIHKKASTLVPFKCALCDRDFFTHEALDKHQSLPKICLLCDKVYPCNDHLKNHVFGEHKDDELDDTSSTNVVESHDDDDGGGGGGGGGGGGGQLDAVAAEKQFECKLCRKKFARKQAMKNHLFAEMNLRRYVCEFCDKSYNYYSHLKEHIITNHGEKEFVCGYCGKDFPTKKRFRDHVTLHSEEKPFRCTCGLAFKLSRYLNKHKKHLKLNWNILIRIKKVDRFDYCYPCKASRKPQLRDRLLYSTTNGRFQLTVKFDLTKFLEPIIRRTYRFAFNLATLIGMVYKYKRTIIRLRQHLAGWRVEKCEFTRWSNRSNQYVHVDVESSSSSTLAPASNSPFVAEDDLRIEHTVSEYELEQELECDREEVEVDLFCVSKRGDERENMSNERLAKSVSPTVDQAVVDEASSFRAPASCSSTNDAVDYDRFVVVYSNGIKYMQCDRCTFICRPLISVFSNHYRSHADTGTSAAAAAASEHPRWSPDTAEERHQRVAGNSRTSFRCAVCGKLLADEAALTNHAKCHEPRVFACKLCGLKFRTKFELQSHRQTHPVRSSVPIRCNFCGKDFKTKSGYNHHRANVCVKYRCDLCGELFLVKTLFLDHSRKVHNVEVVDLLSGQPQKREPAAASVCPVCGKSISENAMSRHLQLHTEEKPYVCDLCGKQFRVKWSLREHIMVEIGMKDYVCEICGKKFVIQAYLNKHMRFHLMSDGKLEGYQCEMCGKKYAEEWRVKEHQRNSHQNRQPRKYKCDVCDKKYSRKWLVRSHKRVAHKEFFDDEFQCEHCDKKFVEKWMIKLHLQTLHNVRFRQQRERQIEDGQTAAAAAAAVVVGTTDELEKLLEYNNAIKNELTAAAAGAAATTTTTTTTITSDAVSGEEKDEQQRAAPRKSFECELCKKTFATKQGMKNHLYAELNLRKYACDVCGKRYNWWMGLKEHSIMSHGEKEFVCSVCEKDFPTKKRYQDHMWLPRSKKPFLGDTSVSALRPPAQCVPDGNDLDTNTSTSNASAVNRPVGRLTTTAGTSESRNKFHITEIDDVQYFKCSVCSFVSRSYNSNFTAHLKLHPTEIAAVASTASSKSTNAQKKNVSTNPLTSVMVIQVKPKKKRVASRSAETDDEIRMETKTRKKMTGGDDDTRKAADDANADRNRVPDRTCEQCRMSFKSSSGYGNHLRLSHPKCGICNHALPSPGYLVDHLRSVHALKIPGADGDSSRTSSDQSLQRGASAVNGQRGPLSPVRNAESTVADGGVAGNCDDATHSALNGDRDESTVTVEGRRKEAAAATAKVAHDQSDASPRKLAEDGAYPCHLCSQLFPYEWMLNIHRTN
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -