Fatl026040.1
Basic Information
- Insect
- Flexamia atlantica
- Gene Symbol
- -
- Assembly
- GCA_035578135.1
- Location
- JAQJVK010000003.1:61571014-61591205[-]
Transcription Factor Domain
- TF Family
- CG-1
- Domain
- CG-1 domain
- PFAM
- PF03859
- TF Group
- Unclassified Structure
- Description
- CG-1 domains are highly conserved domains of about 130 amino-acid residues containing a predicted bipartite NLS and named after a partial cDNA clone isolated from parsley encoding a sequence-specific DNA-binding protein [2]. CG-1 domains are associated with CAMTA proteins (for CAlModulin -binding Transcription Activator) that are transcription factors containing a calmodulin -binding domain and ankyrins (ANK) motifs [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 17 0.67 3.1e+04 -3.0 0.0 36 55 7 26 3 27 0.74 2 17 0.97 4.6e+04 -3.6 0.0 24 55 61 92 57 93 0.85 3 17 0.43 2e+04 -2.4 0.0 23 55 126 158 121 160 0.88 4 17 0.029 1.3e+03 1.4 0.0 23 56 192 225 180 228 0.89 5 17 0.28 1.3e+04 -1.8 0.0 22 55 282 315 261 317 0.79 6 17 0.025 1.2e+03 1.5 0.0 24 56 350 382 339 385 0.88 7 17 0.47 2.2e+04 -2.6 0.0 24 55 416 447 410 449 0.89 8 17 1.3 6.2e+04 -4.0 0.0 25 55 483 513 480 514 0.83 9 17 0.099 4.6e+03 -0.4 0.0 23 56 547 580 539 582 0.87 10 17 0.65 3.1e+04 -3.0 0.0 24 55 695 726 692 728 0.84 11 17 0.027 1.3e+03 1.4 0.0 24 56 761 793 750 796 0.88 12 17 0.012 5.7e+02 2.6 0.0 23 56 851 884 837 887 0.89 13 17 1.3 6.1e+04 -4.0 0.0 25 55 919 949 915 950 0.84 14 17 0.031 1.5e+03 1.2 0.0 23 56 983 1016 976 1019 0.88 15 17 0.32 1.5e+04 -2.0 0.0 24 55 1050 1081 1043 1083 0.86 16 17 0.025 1.2e+03 1.5 0.0 24 56 1116 1148 1105 1151 0.88 17 17 0.56 2.6e+04 -2.8 0.0 24 55 1182 1213 1177 1215 0.87
Sequence Information
- Coding Sequence
- ATGTCTCCATTGCCGGCCAAGTGCCGTGTTTACTATAAGCCACGTGGTGCAGCCAGATTTTTCAATACAGACGGGCATGTCTCCATTGCCGGCCAAGTGCCGTGTTTACTACAAGCCACGTGGTGTAGCCAGGTCAGTGCGGCGTCGCTCGGTTTATCACGTTGTATCTGTTATGCAGATTATTCAATACAGACGGACATGTCTCCATTGCCGCCCAAGTGCCGTGTTTACTACAAGCCACGTGGTGTAGCCAGATTTGTCAATACAGACGGACATGTCTCCATTGCCGGCCAAGTGCCGTGTTTACTACAAGCCACGTGGTGTAGCCAGGTCAGTGCGGCGTCGCTCGGTTTATCACGTCGTATCTGTTATGCAGATTTTTCAATACAGACGGACATGTCTCCATTGCCGCCCAAGTGCCGTGTTTACTACAAGCCACGTGGTGTAGCCAGATTTGTCAATACAGACGGACATGTCTCCATTGCCGCCCAAGTGCCGTGTTTACTACAAGCCACGTGGTGTAGCCAAGTCAGTGCGGCGTCGCTCGGTTTATCACGTTGTATCTGTTATGCAGATTTTTCAATACAGACGGGCATGTCTCCATTGCCGCCCAAGTGCCGTGTTTACTACAAGCCACGTGGTGTAGCCAGATTTTTCAATACAGACGGACATGTCTCCATTGCCGCCCAAGTGCCGTGTTTACTACAAGCCACGTGGTGTAGCCAGATTTTCAATACAGACGGGCAAGTCTCCATTGCCGCCCAAGTGCCGTGTTTACTACAAGCCACGTGGTGTAGCCAGGTCAGTGTGGCGTCGCTCGGTTTATCACGTTGTATCTGTTGTGCAGATTTTTCAATACAGACGGGCATGTCTCCACTGCCGGCCAAGTGCTGTGTTTACTACAAGCCACGTGATGTAGCCAGATTTGTCAATACAGACGGGCATGTCTCCATTGCCGCCCAAGTGCCGTGTTTACTACAAGCCACGTGGTGTAGCCAGGTCAGTGCGGCGTCGCTCGGTTTATCACGTTGTATCTGTTATGCAGATTTGTCAATACAGACGGACATGTCTCCATTGCCGGCCAAGTGTCGTGTTTACTACAAGCCACGTGGTGTAGCCAGATTTTTCAATACAGACGGACATGTCTCCATTGCCGGCCAAGTGACGTGTTTACTACAAGCCACGTGGTGTAGCCAGGTCAGTGCGGCGTCGCTTGGTTTATCACGTTGTATCTGTTATGCAGATTTGTCAATACAGACGGACATGTCTCAATTGCCGGCCAAGTGCCGTGTTTACTGCAAGCCACGCGGTGTAGCCAGATTTGTCAATACAGACGGACATGTCTCGATTGCCGGCCAAGTGCCGTGTTTACTACAAGCCACGCGGTGTAGCCAGGTCAGTGCGGCTTCGCTCGGTTTATCACGTTGTATCTGTTATGCAGATTTGTCAATACAGACGGACATGTCTCCATTGCCGGCCAAGTGCCGTGTTTACTACAAGCCACGCGGTGTAGCCAGATTTGTCAATACAGACGGACATGTCTCCATTGCCGGCCAAGTGCCGTGTTTACTACAAGCCACGTGGTGCAGCCAGGTCAGTGCGGCGTCGCTCGGTTTATCACGACGTATCTGTTATGCAGATTTTTCAATACAGACGGGCATGTCTCCATTGCCGGCCAAGTGCCGTGTTTACTACAAGCCACGTGGTGTAGCCAGATTATTCAATACAGACGGACATGTCTCCATTGCCGCCCAAGTGCCGTGTTTACTACAAGCCACGTGGTGTAGCCAGGTTAGTGCGGCGTCGCTCGGTTTATCACGTTGTATCTGTTATGCAGATTTTTCAATACAGACGGACATGTCTCCATTGCCGCCCAAGTGCCGTGTTTACTACAAGCCACGTGGTGTAGCCAGGTCAGTGCGGCTTCGCTCGGTTTATCACGTTGTATCTGTTATGCAGATTTTCAATACAGACGGGCAAGTCTCCATTGCCGCCCAAGTGCCGTGTTTACTACAAGCCACGTGGTGTAGCCAGGTCAGTGCGGCGTCGCTCGGTTTATCACGTTGTATCTGTTATGGAGATTTTTCAATACAGACGGGCATGTCTCCACTGCCGGCCAAGTGCTGTGTTTACTACAAGCCACGTGATGTAGCCAGATTTGTCAATACAGACGGACATGTCTCCATTGCCGCCCAAGTGCCGTGTTTACTACAAGCCACGTGGTGTAGCCAGGTCAGTGCGGCGTCGCTCGGTTTATCACGTTGTATCTGTTATGCAGATTTGTCAATACAGACGGACATGTCTCCATTGCCGGCCAAGTGTCGTGTTTACTACAAGCCACGTGGTATAGCCAGATTTTTCAATACAGACGGACATGTCTCCACTGCCGGCCAAGTGACGTGTTTACTACAAGCCACGTGGTGTAGCCAGATTTTCAATACAGACGGGCAAGTCTCCATTGCCGCCCAAGTGCCGTGTTTACTACAAGCCACGTGGTGTAGCCAGGTCAGTGCGGCGTCGCTCGGTTTATCACGTTGTATCTGTTATGCAGATTTTTCAATACAGACGGGCATGTCTCCACTGCCGGCCAAGTGCTGTGTTTACTACAAGCCACGTGATGTAGCCAGATTTTTCAATACAGACGGACATGTCTCCATTGCCGCCCAAGTGCCGTGTTTACTACAAGCCACGTGGTGTAGCCAGGTCAGTGCGGCGTCGCTCGGTTTATCACGTTGTATCTGTTATGCAGATTTGTCAATACAGACGGACATGTCTCCATTGCCGGCCAAGTGTCGTGTTTACTACAAGCCACGTGGTATAGCCAGATTTGTCAATACAGACGGACATGTCTCAATTGCCGGCCAAGTGCCGTGTTTACTACAAGCCACGCGGTGTAGCCAGGTCAGTGCGGCGTCGCTCGGTTTATCACGTTGTATCTGTTATGCAGATTTTTCAATACAGACGGGCATGTCTCCATTGCCGCCCAAGTGCCGTGTTTACTACAAGCCACGTGGTGTAGCCAGATTTTTCAATACAGACGGACATGTCTCCATTGCCGCCCAAGTGCCGTGTTTACTACAAGCCACGTGGTGTAGCCAGGTCAGTGCGGCTTCGCTCGGTTTATCACGTTGTATCTGTTATGCAGATTTGTCAATACAGACGGACATGTCTCCACTGCCGGCCAAGTGCTGTGTTTACTACAAGCCACGTGATGTAGCCAGATTTGTCAATACAGACGGGCATGTCTCCATTGCCGCCCAAGTGCCGTGTTTACTACAAGCCACGTGGTGTAGCCAGGTCAGTGCGGCGTCGCTCGGTTTATCACGTTGTATCTGTTATGCAGATTTGTCAATACAGACGGACATGTCTCCATTGCCGGCCAAGTGTCGTGTTTACTACAAGCCACGTGGTGTAGCCAGATTTTTCAATACAGACGGACATGTCTCCATTGCCGGCCAAGTGACGTGTTTACTACAAGCCACGTGGTGTAGCCAGGTCAGTGCGGCGTCGCTTGGTTTATCACGTTGTATCTGTTATGCAGATTTGTCAATACAGACGGACATGTCTCAATTGCCGGCCAAGTGCCGTGTTTACTACAAGCCACGCGGTGTAGCCAGATTTGTCAATACAGACGGACATGTCTCCATTGCCGGCCAAGTGCCGTGTTTACTACAAGCCACGCGGTGTAGCCAGGTCAGTGCGGCTTCGCTCGGTTTATCACGTTGTATCTGTTATGCAGATTTGTCAATACAGACGGACATGTCTCCATTGCCGGCCAAGTGCCGTGTTTACTACAAGCCACGCGGTGTAGCCAGTTCCGACATGATGCGACCATCCAGCTTTAACACTAAAAGCTCCGGAGTCGACGGTTATATCTGGTTCCCTTACATACGTGATGATGGCGAGGAGCGAACCGCGGGCCGAGTCGTGACTCAACTGGTAATCGACCCGCGATTTCCATATCTGGGTCTATGCGACTTGAAAGTTTCAATGCGGCGGGGCTGGCCGGGCGCCAGGCGGGGGCGATTAGGCCGAGAGGTAAAGAAGGGGGTGATTAAAACTCGGGCCAATCTTATTAGCGGCGGGGCTTTGTTCAATTTTGGTAATTTGCAAACGGCCCAAGACTGGAGACAGCGACCGGGGCTGGCGGCCCGCCACCCCGCGTATACAGCGGCGTGTCGATAG
- Protein Sequence
- MSPLPAKCRVYYKPRGAARFFNTDGHVSIAGQVPCLLQATWCSQVSAASLGLSRCICYADYSIQTDMSPLPPKCRVYYKPRGVARFVNTDGHVSIAGQVPCLLQATWCSQVSAASLGLSRRICYADFSIQTDMSPLPPKCRVYYKPRGVARFVNTDGHVSIAAQVPCLLQATWCSQVSAASLGLSRCICYADFSIQTGMSPLPPKCRVYYKPRGVARFFNTDGHVSIAAQVPCLLQATWCSQIFNTDGQVSIAAQVPCLLQATWCSQVSVASLGLSRCICCADFSIQTGMSPLPAKCCVYYKPRDVARFVNTDGHVSIAAQVPCLLQATWCSQVSAASLGLSRCICYADLSIQTDMSPLPAKCRVYYKPRGVARFFNTDGHVSIAGQVTCLLQATWCSQVSAASLGLSRCICYADLSIQTDMSQLPAKCRVYCKPRGVARFVNTDGHVSIAGQVPCLLQATRCSQVSAASLGLSRCICYADLSIQTDMSPLPAKCRVYYKPRGVARFVNTDGHVSIAGQVPCLLQATWCSQVSAASLGLSRRICYADFSIQTGMSPLPAKCRVYYKPRGVARLFNTDGHVSIAAQVPCLLQATWCSQVSAASLGLSRCICYADFSIQTDMSPLPPKCRVYYKPRGVARSVRLRSVYHVVSVMQIFNTDGQVSIAAQVPCLLQATWCSQVSAASLGLSRCICYGDFSIQTGMSPLPAKCCVYYKPRDVARFVNTDGHVSIAAQVPCLLQATWCSQVSAASLGLSRCICYADLSIQTDMSPLPAKCRVYYKPRGIARFFNTDGHVSTAGQVTCLLQATWCSQIFNTDGQVSIAAQVPCLLQATWCSQVSAASLGLSRCICYADFSIQTGMSPLPAKCCVYYKPRDVARFFNTDGHVSIAAQVPCLLQATWCSQVSAASLGLSRCICYADLSIQTDMSPLPAKCRVYYKPRGIARFVNTDGHVSIAGQVPCLLQATRCSQVSAASLGLSRCICYADFSIQTGMSPLPPKCRVYYKPRGVARFFNTDGHVSIAAQVPCLLQATWCSQVSAASLGLSRCICYADLSIQTDMSPLPAKCCVYYKPRDVARFVNTDGHVSIAAQVPCLLQATWCSQVSAASLGLSRCICYADLSIQTDMSPLPAKCRVYYKPRGVARFFNTDGHVSIAGQVTCLLQATWCSQVSAASLGLSRCICYADLSIQTDMSQLPAKCRVYYKPRGVARFVNTDGHVSIAGQVPCLLQATRCSQVSAASLGLSRCICYADLSIQTDMSPLPAKCRVYYKPRGVASSDMMRPSSFNTKSSGVDGYIWFPYIRDDGEERTAGRVVTQLVIDPRFPYLGLCDLKVSMRRGWPGARRGRLGREVKKGVIKTRANLISGGALFNFGNLQTAQDWRQRPGLAARHPAYTAACR
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -