Basic Information

Gene Symbol
-
Assembly
GCA_028455855.1
Location
JAPTHL010000059.1:764814-769922[+]

Transcription Factor Domain

TF Family
CUT
Domain
Homeobox|CUT
PFAM
PF02376
TF Group
Helix-turn-helix
Description
The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein (eg Swiss:P10180).
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 23 0.026 49 3.6 0.0 38 55 370 387 337 391 0.73
2 23 0.26 5e+02 0.4 0.0 39 55 441 457 409 460 0.87
3 23 0.35 6.7e+02 -0.0 0.0 39 55 476 492 474 496 0.91
4 23 0.021 41 3.9 0.0 14 55 525 562 512 566 0.70
5 23 0.26 5e+02 0.4 0.0 39 55 616 632 584 635 0.87
6 23 0.36 7e+02 -0.1 0.0 39 55 651 667 649 670 0.91
7 23 0.32 6e+02 0.1 0.0 39 55 686 702 681 706 0.91
8 23 0.17 3.2e+02 1.0 0.0 38 55 790 807 758 810 0.72
9 23 0.36 7e+02 -0.1 0.0 39 55 826 842 824 845 0.91
10 23 0.35 6.7e+02 -0.0 0.0 39 55 861 877 859 881 0.91
11 23 0.021 41 3.9 0.0 14 55 910 947 897 951 0.70
12 23 0.24 4.7e+02 0.5 0.0 39 55 1001 1017 969 1021 0.87
13 23 0.17 3.2e+02 1.0 0.0 38 55 1070 1087 1038 1090 0.72
14 23 0.36 7e+02 -0.1 0.0 39 55 1106 1122 1104 1125 0.91
15 23 0.36 7e+02 -0.1 0.0 39 55 1141 1157 1139 1160 0.91
16 23 0.35 6.7e+02 -0.0 0.0 39 55 1211 1227 1207 1230 0.91
17 23 0.29 5.6e+02 0.2 0.0 39 55 1281 1297 1276 1306 0.90
18 23 0.17 3.2e+02 1.0 0.0 38 55 1315 1332 1283 1335 0.72
19 23 0.35 6.7e+02 -0.0 0.0 39 55 1351 1367 1347 1370 0.91
20 23 0.087 1.7e+02 1.9 0.0 12 55 1398 1437 1388 1440 0.71
21 23 0.36 7e+02 -0.1 0.0 39 55 1456 1472 1454 1475 0.91
22 23 0.36 7e+02 -0.1 0.0 39 55 1491 1507 1489 1510 0.91
23 23 0.35 6.7e+02 -0.0 0.0 39 55 1526 1542 1524 1546 0.91

Sequence Information

Coding Sequence
ATGGCGAGTCTTTTAAGACAGCTAATATCGTCATATGCGTCTTACCATAGGCGTACTGCTACAATTAAAAATGGAAAATTACCAGATATTGATAAAAAGATAGATATTGTTGGCTTAATTCATACAGATCTCATTAAACAATCATTTTATGGGAATAAATCGAGTATAAACGGTGCTTTAATTGGTCCAACCTTATGTCAAAAAATTTTACAAGCTGGAATTGGTGTTACTATCAATTATTTCCCACAAGAGGAAGAAACACCACAGGAAATTGGTATTTCAGAAGATATAGCCTCATCATCTGATAAAATATCACTCTCAAATACTGCACATGCTATCCGTGATCGTCCTATACAAGACGAAGGATTACCAGCCGACTTTGATACGTCTGTTAAAGTAGAAAACACGTCTTTATTATGCAATAAAAAAATGCTACACTTAGATCCATTTGCTGGTTGTGAAACTCCACGTAATCCAATCAATCAACCAGTGAATGATTCAACACATACCGAATCATCATCAGTTAATGAGTGGTCTACAAAATCATACTTGAAATCAATAAGTGGCGTATCAAAGAAGTCACTATCGTGCTTCATCTCAGTAACTACCGTAGCTACCATGACTACCTTGTTTTTCAAAACACGACGTTGGCCAATAATTATTACAACTGGTAGTTGGTTTATTGGTAGTTTATATTTATATAAACGCCTAACACGAACATATGAACCAAAAATGGACTCATTTAAAGAACAGCATACATCAGACATTAATCACAACTATGAAATTAATTCAGAATTATCAATACTTTCAAAAGAAAATGAAACGGGAAATAGTTTTGGAAACGAATTACCTGAATCACACGAAGATGCGGAAGAATGTGAAAAAAGAGACGAAAATGATAATGAATTGGATGTATCACAAGAAAATGTTGACGATTTACAATTAGATGATGAATTAAACTACAAACAAAATAATCAGACTGACATTGATAGTAAATATATCCCTCCACTTTCAACAATTTTGGAAGAGCCGGAAGCTTTTGAGCAGTTAGAGGAAGAATTCAACGACAATGAATATAATCAGGCAGACATTGATAATAAATATATCCCTACACTATCAACTATTTTGGAAGAACCGAAAGCTTTTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATCAGGCAGACATTGATAATAAATATATCCCTACACTTTCAACTATTTTGGAAGAATCGGAAACTATTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATGAGGCAGACATTGATAATAAATATATCCCTACACTATCAACTATTTTGGAAGAACCGGAAGCTTTTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATGAGGCTGACAATGATAGTAAATATATCCCTACACTTTCAACTATTTTGGAAGAACCGGAAGCTTTTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATCAGGCAGACATTGATAATAAATATATCCCTACACTTTCAACTATTTTGGAAGAATCGGAAGCTTTTGAGCAGTTAGAGGAAGAATTCAACGACAATGAATATAATCAGGCAGACATTGATAATAAATATATCCCTACACTATCAACTATTTTGGAAGAACCGAAAGCTTTTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATCAGGCAGACATTGATAATAAATATATCCCTACACTTTCAACTATTTTGGAAGAATCGGAAACTATTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATGAGGCAGACATTGATAATAAATATATCCCTACACTATCAACTATTTTGGAAGAACCGGAAGCTTTTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATGAAGCTGACAATGATAGTAAATATATCCCTACACTTTCAACTATTTTGGAAGAACCGGAAGCTTTTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATGAGGCAGACATTGATAATAAATATATCCCTACACTATCAACTATTTTGGAAGAACCGGAAGCTTTTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATCAGGCAGACATTGATAATAAATATATCCCTACACTTTCAACTATTTTGGAAGAATCGGAAACTATTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATGAGGCTGACAATGATAGTAAATATATCCCTACACTTTCAACTATTTTGGAAGAACCGGAAGCTTTTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATCAGGCAGACATTGATAATAAATATATCCCTACACTTTCAACTATTTTGGAAGAACCGGAAGCTTTTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATGAAGCTGACAATGATAGTAAATATATCCCTACACTTTCAACTATTTTGGAAGAACCGGAAGCTTTTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATGAGGCTGACAATGATAGTAAATATATCCCTACACTTTCAACTATTTTGGAAGAACCGGAAGCTTTTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATCAGGCAGACATTGATAATAAATATATCCCTACACTTTCAACTATTTTGGAAGAATCGGAAGCTTTTGAGCAGTTAGAGGAAGAATTCAACGACAATGAATATAATCAGGCAGACATTGATAATAAATATATCCCTACACTATCAACTATTTTGGAAGAACCGAAAGCTTTTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATCAGGCAGACATTGATAATAAATATATCCCTACACTTTCAACTATTTTGGAAGAATCGGAAACTATTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATGAGGCAGACATTGATAATAAATATATCCCTACACTTTCAACTATTTTGGAAGAACCGGAAGCTTTTGAGCAGTTGGAGGAAGAATTCAACGAGAATGAATATAATGAGGCTGACAATGATAGTAAATATATCCCTACACTTTCAACTATTTTGGAAGAACCGGAAGCTTTTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATCAGGCAGACATTGATAATAAATATATCCCTACACTTTCAACTATTTTGGAAGAACCGGAAGCTTTTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATGAAGCTGACAATGATAGTAAATATATCCCTACACTTTCAACTATTTTGGAAGAACCGGAAGCTTTTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATGAGGCTGACAATGATAGTAAATATATCCCTACACTTTCAACTATTTTGGAAGAACCGGAAGCTTTTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATCAGGCAGACATTGATAATAAATATATCCCTACACTTTCAACTATTTTGGAAGAATCGGAAGCTATTGAGCAGTTGGAGGAAGAATTCGACGACAATGTATATAATGAGGCTGACAATGATAGTAAATATATCCCTACACTTTCAACTATTTTGGAAGAACCGGAAGCTTTTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATCAGGCAGACATTGATAATAAATATATCCCTACACTTTCAACTATTTTGGAAGAATCGGAAGCTATTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATGAGGTTGACATTGATAATAAATATATCCCTACACTTTCAACAATTTTGGAAGAACCGGAAGCTTTTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATCAGGCAGACATTGATAATAAATATATCCCTACACTATCAACTATTTTGGAAGAACCGGAAGCTTTTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATGAGGTTGACATTGATAATAAATATATCCCTACACTATCAACTATTTTGGAAGAACCGGAAGCTTTTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATTTAATGAGGCTGACAATGATAGTAAATATATCCCTACACTTTCAACTATTTTGGAAGAACCGGAAGCTTTTGAGCAGTTGAAGGAAGAATTCAACGACAATGAATATAATCAGGCAGACATTGATAATAAATATATCCCTACACTATCAACTATTTTGGAAGAACCGGAAGCTTTTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATGAGGCTGACAATGATAGTAAATATATCCCTACACTTTCAACTATTTTGGAAGAACCGGAAGCTTTTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATGAAGCTGACAATGATAGTAAATATATCCCTACACTTTCAACTATTTTGGAAGAACCGGAAGCTTTTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATGAGGCTGACAATGATAGTAAATATATCCCTACACTTTCAACTATTTTGGAAGAACCGGAAGCTTTTGAGCAGTTGGAGGAAGAATTCAACGACAATGAATATAATCAGGCAGACATTGATAATAAATATATCCCTACACTTTCAACTATTTTGGAAAAATCGGAAGCTATTGAGCAGTTGGAGGAAGAATTCAACGACTATGTATATAATGAGGCTGACAATGATAGTAAATATATCCCTACACTTTCAACAATTTTGGAAGAGCCTGAAACTGCTGACAACACAAATGATGAATCCGATGAATCAGAAGAATCAGATGATTCTGTTTCCATTCTTCCTAATTACATGCATTGTTGCATCGTTTGA
Protein Sequence
MASLLRQLISSYASYHRRTATIKNGKLPDIDKKIDIVGLIHTDLIKQSFYGNKSSINGALIGPTLCQKILQAGIGVTINYFPQEEETPQEIGISEDIASSSDKISLSNTAHAIRDRPIQDEGLPADFDTSVKVENTSLLCNKKMLHLDPFAGCETPRNPINQPVNDSTHTESSSVNEWSTKSYLKSISGVSKKSLSCFISVTTVATMTTLFFKTRRWPIIITTGSWFIGSLYLYKRLTRTYEPKMDSFKEQHTSDINHNYEINSELSILSKENETGNSFGNELPESHEDAEECEKRDENDNELDVSQENVDDLQLDDELNYKQNNQTDIDSKYIPPLSTILEEPEAFEQLEEEFNDNEYNQADIDNKYIPTLSTILEEPKAFEQLEEEFNDNEYNQADIDNKYIPTLSTILEESETIEQLEEEFNDNEYNEADIDNKYIPTLSTILEEPEAFEQLEEEFNDNEYNEADNDSKYIPTLSTILEEPEAFEQLEEEFNDNEYNQADIDNKYIPTLSTILEESEAFEQLEEEFNDNEYNQADIDNKYIPTLSTILEEPKAFEQLEEEFNDNEYNQADIDNKYIPTLSTILEESETIEQLEEEFNDNEYNEADIDNKYIPTLSTILEEPEAFEQLEEEFNDNEYNEADNDSKYIPTLSTILEEPEAFEQLEEEFNDNEYNEADIDNKYIPTLSTILEEPEAFEQLEEEFNDNEYNQADIDNKYIPTLSTILEESETIEQLEEEFNDNEYNEADNDSKYIPTLSTILEEPEAFEQLEEEFNDNEYNQADIDNKYIPTLSTILEEPEAFEQLEEEFNDNEYNEADNDSKYIPTLSTILEEPEAFEQLEEEFNDNEYNEADNDSKYIPTLSTILEEPEAFEQLEEEFNDNEYNQADIDNKYIPTLSTILEESEAFEQLEEEFNDNEYNQADIDNKYIPTLSTILEEPKAFEQLEEEFNDNEYNQADIDNKYIPTLSTILEESETIEQLEEEFNDNEYNEADIDNKYIPTLSTILEEPEAFEQLEEEFNENEYNEADNDSKYIPTLSTILEEPEAFEQLEEEFNDNEYNQADIDNKYIPTLSTILEEPEAFEQLEEEFNDNEYNEADNDSKYIPTLSTILEEPEAFEQLEEEFNDNEYNEADNDSKYIPTLSTILEEPEAFEQLEEEFNDNEYNQADIDNKYIPTLSTILEESEAIEQLEEEFDDNVYNEADNDSKYIPTLSTILEEPEAFEQLEEEFNDNEYNQADIDNKYIPTLSTILEESEAIEQLEEEFNDNEYNEVDIDNKYIPTLSTILEEPEAFEQLEEEFNDNEYNQADIDNKYIPTLSTILEEPEAFEQLEEEFNDNEYNEVDIDNKYIPTLSTILEEPEAFEQLEEEFNDNEFNEADNDSKYIPTLSTILEEPEAFEQLKEEFNDNEYNQADIDNKYIPTLSTILEEPEAFEQLEEEFNDNEYNEADNDSKYIPTLSTILEEPEAFEQLEEEFNDNEYNEADNDSKYIPTLSTILEEPEAFEQLEEEFNDNEYNEADNDSKYIPTLSTILEEPEAFEQLEEEFNDNEYNQADIDNKYIPTLSTILEKSEAIEQLEEEFNDYVYNEADNDSKYIPTLSTILEEPETADNTNDESDESEESDDSVSILPNYMHCCIV

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-