Cpal013367.1
Basic Information
- Insect
- Carterocephalus palaemon
- Gene Symbol
- Znf296
- Assembly
- GCA_944567795.1
- Location
- CALYMS010000187.1:160544-181325[-]
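The Location field packs the contig accession, a coordinate range, and the strand into one string. A minimal sketch of splitting it apart follows. The names `Locus` and `parse_locus` are hypothetical helpers, not part of any database API, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """Hypothetical container for one parsed Location field."""
    contig: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "contig:start-end[strand]".
_LOCUS_RE = re.compile(
    r"^(?P<contig>\S+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_locus(text: str) -> Locus:
    """Split a Location string into contig, start, end, and strand."""
    match = _LOCUS_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


locus = parse_locus("CALYMS010000187.1:160544-181325[-]")

# Genomic span of the locus, ASSUMING 1-based inclusive coordinates.
span = locus.end - locus.start + 1
print(locus, span)
```

The "[-]" suffix is read as the minus strand, so under that reading the coding sequence corresponds to the reverse complement of the contig over this range.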
Transcription Factor Domain
- TF Family
- zf-GAGA
- Domain
- zf-GAGA domain
- PFAM
- PF09237
- TF Group
- Zinc-Coordinating Group
- Description
- Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
- Hmmscan Out
-
 #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
 1  31    0.0039        25    5.2   0.0        24      48       148     172       130     177  0.86
 2  31      0.76     5e+03   -2.1   0.1        26      44       446     464       439     468  0.85
 3  31    0.0029        19    5.6   0.1        20      48       540     568       523     573  0.83
 4  31     0.028   1.8e+02    2.4   0.1        23      46       571     594       568     602  0.81
 5  31     0.028   1.8e+02    2.5   0.0        23      46       620     643       617     651  0.81
 6  31     0.028   1.8e+02    2.5   0.0        23      46       669     692       666     700  0.81
 7  31     0.028   1.8e+02    2.5   0.0        23      46       718     741       715     749  0.81
 8  31     0.028   1.8e+02    2.5   0.0        23      46       767     790       764     798  0.81
 9  31     0.028   1.8e+02    2.5   0.0        23      46       816     839       813     847  0.81
10  31     0.028   1.8e+02    2.5   0.0        23      46       865     888       862     896  0.81
11  31     0.028   1.8e+02    2.5   0.0        23      46       914     937       911     945  0.81
12  31     0.028   1.8e+02    2.5   0.0        23      46       963     986       960     994  0.81
13  31     0.028   1.8e+02    2.5   0.0        23      46      1012    1035      1009    1043  0.81
14  31     0.028   1.8e+02    2.5   0.0        23      46      1061    1084      1058    1092  0.81
15  31     0.028   1.8e+02    2.5   0.0        23      46      1110    1133      1107    1141  0.81
16  31     0.028   1.8e+02    2.5   0.0        23      46      1159    1182      1156    1190  0.81
17  31     0.028   1.8e+02    2.5   0.0        23      46      1208    1231      1205    1239  0.81
18  31     0.028   1.8e+02    2.5   0.0        23      46      1257    1280      1254    1288  0.81
19  31     0.028   1.8e+02    2.5   0.0        23      46      1306    1329      1303    1337  0.81
20  31     0.028   1.8e+02    2.5   0.0        23      46      1355    1378      1352    1386  0.81
21  31     0.028   1.8e+02    2.5   0.0        23      46      1404    1427      1401    1435  0.81
22  31     0.028   1.8e+02    2.5   0.0        23      46      1453    1476      1450    1484  0.81
23  31     0.028   1.8e+02    2.5   0.0        23      46      1502    1525      1499    1533  0.81
24  31     0.028   1.8e+02    2.5   0.0        23      46      1551    1574      1548    1582  0.81
25  31     0.028   1.8e+02    2.5   0.0        23      46      1600    1623      1597    1631  0.81
26  31     0.028   1.8e+02    2.5   0.0        23      46      1649    1672      1646    1680  0.81
27  31     0.028   1.8e+02    2.5   0.0        23      46      1698    1721      1695    1729  0.81
28  31     0.028   1.8e+02    2.5   0.0        23      46      1747    1770      1744    1778  0.81
29  31     0.028   1.8e+02    2.5   0.0        23      46      1796    1819      1793    1827  0.81
30  31     0.028   1.8e+02    2.5   0.0        23      46      1845    1868      1842    1876  0.81
31  31     0.028   1.8e+02    2.5   0.0        23      46      1894    1917      1891    1925  0.81
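The Hmmscan Out values come in groups of 13 fields per domain hit (hit number, total hits, c-Evalue, i-Evalue, score, bias, then the hmm, ali, and env coordinate pairs, and acc). A minimal sketch of reading them into records and filtering by i-Evalue follows. `DomainHit` and `parse_hits` are hypothetical names, the sample holds only the first four hits from this record, and the i-Evalue cutoff of 100 is an arbitrary illustration, not a recommended significance threshold.

```python
from dataclasses import dataclass

FIELDS_PER_HIT = 13  # number of columns per domain hit in the record


@dataclass
class DomainHit:
    """Hypothetical record for one row of the per-domain table."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hits(text: str) -> list[DomainHit]:
    """Group whitespace-separated tokens into 13-field domain hits."""
    tokens = text.split()
    if len(tokens) % FIELDS_PER_HIT != 0:
        raise ValueError("token count is not a multiple of 13")
    hits = []
    for i in range(0, len(tokens), FIELDS_PER_HIT):
        t = tokens[i:i + FIELDS_PER_HIT]
        hits.append(DomainHit(
            int(t[0]), int(t[1]),
            float(t[2]), float(t[3]), float(t[4]), float(t[5]),
            *(int(x) for x in t[6:12]),
            float(t[12]),
        ))
    return hits


# First four hits copied from the record above (header row omitted).
SAMPLE = """
1 31 0.0039 25 5.2 0.0 24 48 148 172 130 177 0.86
2 31 0.76 5e+03 -2.1 0.1 26 44 446 464 439 468 0.85
3 31 0.0029 19 5.6 0.1 20 48 540 568 523 573 0.83
4 31 0.028 1.8e+02 2.4 0.1 23 46 571 594 568 602 0.81
"""

hits = parse_hits(SAMPLE)

# Keep hits below an ILLUSTRATIVE i-Evalue cutoff (assumption, not a standard).
kept = [h.index for h in hits if h.i_evalue < 100]
print(kept)
```

Because the parser only splits on whitespace, it works equally on the values laid out one hit per line or run together on a single line, as long as the header row is removed first.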
Sequence Information
- Coding Sequence
- ATGAGTTCTATCCTgcaaaaaactgaaataaaaatcGAAGCCCTCTGCCGAACATGCTTGTCGAAGGAGATTGAATTGCTGTCGGTATTCGATGCGTGCCCTGGGGACGAGGAAGTCACACTGGACAGCGTCATCGCAACAATTACCGGCGTTAAGATCGTGGCCGGCGATGGCTTGCCCGCGACGGTGTGCCGAGAGTGTTCGGAGCAGGCGCACAGCGCGTTCTCGTtccgcgggcgcgcgcggcgggccgaCGACGCGCTGCGCGGCCTGGTGCGCGTCAAGATGGAGGAAGCGGAGCCGGCTGTGGATGTCAAGATCGAGGAGTTCAGCGGCGATATCCATGATGACTACCTGGATATGGACTTCACCATGACACTGCCCAATGGTGACGAACACCCGCCTGATTTTGCGGCGGACGCGGTAGGCGAGTGCGACGAGGGCACATACTGTCCGGTGTGCGGCGCTAGTTTTGAGGACGCGGCGGGCCTGACGCGCCACGTGTGGCAGCGGCACGCGGACCTCATGGGCCCGAAGAAGCGGGGGCGGCCCAAGCAACTGCTCACTAGCACGATCCTGAGCCGCATGTCGGGGcgagcgggcgcgggggcggcggcgggggcggcgggggcggcgggggcggcgccgCAAGCGTGCGCGCTGTGCCGCCTCGCGGTCGACTCGCACGACGACCTGGCGACGCACATGATGCTGCACAAGGACGAGAAAGTTCTAAGCTGTCTGTGCTGCAAGAAGATGTACCTCCAGCGAGAGGACTTCGACCGTCACAGCTGCGCGCCCGCAGGCCACGAGGAGCACCAGGTTGACCACGGCGCGGCATGCGCAGTTGAGATCGCTCTACAGGAGCTACTGGCCCGCGGAGACTCGgTGGAGGTGTGCGACGGCTGCGGCGGCGTGttcggcggcgcgggcgagctggcgcgcCACCGCGACGCGGAGCACCCCGAGCGGTCGCTGCGCTGCGGGCACTGCCTCAAGGTGTTCGCGTCcctgcgcggcgcggcgcggcaccGGCGCTCGTGCGCGCTGGTGGAGCGCGCGCacgcgtgcgcggcgtgcgggCTGCGCTTCCAGCACGCGATCACGCTGAACAAGCACATCCTGCGCTCGCACGCCGGCCTGCCGGTCGCCGTGCGCttccccgcgcccgcccccgcgcccgcccccgcctccgcgcccgcgcccgcgcccgccggcacTCGCGCCGCACGGGCGGGCGTCGCGCTGGTGTGCGACACGTGCGGACGCACTTTCCGCAGGAAAGAGTTGCTATTGAGGCACGCCAAGCTTCACCAGCCTGACGCCAAGAGCTTTGAATGCGACGTGTGCAAGAAGCGGTTCAACCGGAGGAACAACTTGCGGTCCCACATGCGCACGCAcgaggcgggcgcgcgggcggcgggcgcggcgggggcggcggcggcggggggagCGGCCAGCTCGTGCCTGTGCCTGTACTGCGGCCGCGGGTTCTCCAACTCCTCGAACCTTATAGTGCACATGCGCCGCCACACCGGCGAGAAGCCGTACAAGTGTGACTTCTGCGGCAAAGGCTTCCCGCGCTCGTCGGACCTGCAATGCCACCGGCGCTCCCACACCGGCGAGAAGCCCTGCGTGTGCGGCGTCTGCGGAAAAGGGTTCTCCCGCAGCAACAAGCTGTCGCGGCACATGCGCGTGCACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACGGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACGGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACC
GGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACGGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACGGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACGGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACGGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACGGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACGGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACGGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACGGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACGGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACGGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACCGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACGGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACGGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACGGGCGA
CAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACGGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACCGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACGGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACGGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACGGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACGGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACGGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACGGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACGGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACGGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACGGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagATGCCTGTACACACCGGCCTCAAGCCGTACAAGTGTCCGTACTGCGAGAAGGCGTTCTCGCAGAGCAACGACCTCACGCTGCACGTGCGCCGGCACACGGGCGACAAGCCGTAcgtgtgcgagctgtgcggcgaccggttcatacagGGCACAGCGCTGCACAACCACCGTCGCGCGCACGGCCACTTCCCGCCGGCAGggggcgcgcccgcgccgccctacgccgcgcgcgcgctgcccgactGA
- Protein Sequence
- MSSILQKTEIKIEALCRTCLSKEIELLSVFDACPGDEEVTLDSVIATITGVKIVAGDGLPATVCRECSEQAHSAFSFRGRARRADDALRGLVRVKMEEAEPAVDVKIEEFSGDIHDDYLDMDFTMTLPNGDEHPPDFAADAVGECDEGTYCPVCGASFEDAAGLTRHVWQRHADLMGPKKRGRPKQLLTSTILSRMSGRAGAGAAAGAAGAAGAAPQACALCRLAVDSHDDLATHMMLHKDEKVLSCLCCKKMYLQREDFDRHSCAPAGHEEHQVDHGAACAVEIALQELLARGDSVEVCDGCGGVFGGAGELARHRDAEHPERSLRCGHCLKVFASLRGAARHRRSCALVERAHACAACGLRFQHAITLNKHILRSHAGLPVAVRFPAPAPAPAPASAPAPAPAGTRAARAGVALVCDTCGRTFRRKELLLRHAKLHQPDAKSFECDVCKKRFNRRNNLRSHMRTHEAGARAAGAAGAAAAGGAASSCLCLYCGRGFSNSSNLIVHMRRHTGEKPYKCDFCGKGFPRSSDLQCHRRSHTGEKPCVCGVCGKGFSRSNKLSRHMRVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQMPVHTGLKPYKCPYCEKAFSQSNDLTLHVRRHTGDKPYVCELCGDRFIQGTALHNHRRAHGHFPPAGGAPAPPYAARALPD
Similar Transcription Factors
Sequences clustered by similarity using MMseqs2
- 100% Identity
- iTF_00277508;
- 90% Identity
- iTF_00277508;
- 80% Identity
- iTF_00277508;