Basic Information

Gene Symbol
-
Assembly
GCA_944452905.1
Location
CALYCC010000037.1:1073139-1079102[+]
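The Location field packs contig, start, end and strand into one string. A minimal sketch of splitting it into fields follows. The `Location` type and `parse_location` helper are hypothetical names introduced here, and the span calculation assumes 1-based inclusive coordinates, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for the parts of a Location string."""
    contig: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "contig:start-end[strand]".
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a Location string into contig, start, end and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


loc = parse_location("CALYCC010000037.1:1073139-1079102[+]")
# Genomic span, assuming 1-based inclusive coordinates (an assumption).
span = loc.end - loc.start + 1
print(loc, span)
```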

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA-binding regions adjacent to α-helical regions that fold together as coiled coils.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 25 0.044 54 4.8 0.3 28 49 554 575 544 580 0.76
2 25 0.0032 3.9 8.5 2.8 28 61 596 629 592 635 0.82
3 25 0.002 2.4 9.2 7.5 27 62 651 686 648 695 0.61
4 25 0.0026 3.2 8.8 8.0 26 63 713 750 711 757 0.70
5 25 0.0032 3.9 8.5 1.7 32 61 775 804 771 810 0.86
6 25 0.0039 4.7 8.2 8.0 26 62 825 861 823 869 0.72
7 25 0.013 16 6.6 5.3 32 61 887 916 883 918 0.67
8 25 0.00044 0.54 11.3 5.6 25 64 908 947 906 948 0.90
9 25 0.01 13 6.9 1.8 26 61 990 1025 983 1029 0.67
10 25 0.00096 1.2 10.2 4.0 25 64 1003 1042 1002 1043 0.93
11 25 0.018 22 6.1 7.8 26 62 1053 1089 1037 1092 0.67
12 25 0.0054 6.5 7.8 6.5 25 61 1094 1130 1085 1134 0.79
13 25 0.003 3.7 8.6 5.0 26 64 1109 1147 1106 1151 0.92
14 25 0.0031 3.8 8.5 1.8 25 62 1198 1235 1197 1238 0.88
15 25 0.0022 2.7 9.0 4.4 26 63 1213 1250 1212 1252 0.90
16 25 0.007 8.6 7.4 2.6 26 62 1227 1263 1225 1266 0.89
17 25 0.0018 2.2 9.3 3.1 27 62 1256 1291 1254 1294 0.90
18 25 0.0017 2 9.4 4.0 26 62 1269 1305 1267 1308 0.91
19 25 0.012 15 6.7 4.1 26 62 1283 1319 1280 1326 0.90
20 25 0.015 18 6.4 6.3 27 61 1312 1346 1295 1350 0.78
21 25 0.0021 2.6 9.1 4.4 26 62 1325 1361 1322 1364 0.91
22 25 0.0058 7.1 7.7 3.1 26 62 1339 1375 1336 1378 0.89
23 25 0.028 34 5.5 2.1 26 61 1353 1388 1352 1392 0.88
24 25 0.014 17 6.4 2.0 27 61 1368 1402 1366 1402 0.90
25 25 0.00027 0.33 11.9 2.1 26 60 1395 1429 1392 1432 0.88
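Each row of the hmmscan output above has 13 whitespace-separated fields. The sketch below reads such rows into records and selects domains by independent E-value. The `DomainHit` class, the `parse_hit` helper and the cutoff of 1.0 are illustrative assumptions, not part of the record or of HMMER. The three sample rows are copied from the table.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    """Hypothetical record for one per-domain row of the hmmscan output."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line: str) -> DomainHit:
    """Convert one whitespace-separated row into a DomainHit."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


# Rows 1, 8 and 25, copied from the table above.
rows = [
    "1 25 0.044 54 4.8 0.3 28 49 554 575 544 580 0.76",
    "8 25 0.00044 0.54 11.3 5.6 25 64 908 947 906 948 0.90",
    "25 25 0.00027 0.33 11.9 2.1 26 60 1395 1429 1392 1432 0.88",
]
hits = [parse_hit(row) for row in rows]

# Illustrative cutoff only; the record does not state a threshold.
I_EVALUE_CUTOFF = 1.0
best = [h.index for h in hits if h.i_evalue < I_EVALUE_CUTOFF]
print(best)
```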

Sequence Information

Coding Sequence
ATGGCTCCCGCAGCGGTGGCTATATTTTTAATCTCTCTATTGGCCGAGACATTCTCGGCCCCAAGTGGATGCGCCGAGTGCCAAACATGGGAATCGCACAGTGGCTCTCATAGTCAGAGAGGATTTAGCAGCCAGACAACTGAAGACGATTTATCTCAGAGGTCTGGAAAGTTGGAAGATTTAACACAACAAGCCGAATTCGAGCTGAACAGATCCCGCAATAAATTAGCCTTTGATAATACAAGGCCTGGGAATTGGACCGACGTAAATCATTATGGGACATCCGATGGTCATGGAAGAGTATATGAAGAACAAGGCCAACGTGTGGATGGGCCAACCCGAATCAGATATATGAGGAAAAATTTCACTTCCAGTTATAGCAGCGGAGGCTTGGGCTCCTTTGGAGAAACTAATTTGGGACGTATATACCCGCATATTAGTCAAGATGCCAGCCAGTCATTGAATTTAGATCAGACGCAAAATTCGGCTTATGATCAATTCGCCATCCGAAGAAATTCTCACAACACGCAGGACTCTGTACATTCTATGGAAAGGGTTAACAGCCGTGACGATGCATCCAGGTATTACGGAACTCATGGCAGTAGCGGTCGGCTGAGAGAAACTAGTTCTGACCAATCAGCACAGCAAGGACTAGACGCGCTGGGTCAAATAAGGCCAGGAAATTGGAGCACAGTTAACACTTATAGAACTGATGGGGGTAATGGCAGGGTTTACGAAGAACGAAGGCAGATTGTAACTGGGCCGAGACAGGTTATTTCCTATAGAAGAAATTATACCACAAGTTACAGCTCTGGCGGAGATATTCCAACTTTTGGTGCGGGAAGCGACGAAACAAGGAACATCGAAAGCAATGTGCAGCAGCAGCAAAGGCAGTTTGATAGCTACGGAAGAGAGCTTCAAGAAACTAGTGGTGGTTCAACAAATGGCGGTTACACTCGGTACTATTCTGGACATCATATAACGCCTGGGCAAGCAAACTACAGATATGTATCAAGGCCTGGTAACTATGAGCCACTAGAGTCGAATCCTCATCAAACGTACCAGCAGACGACTAATTTTGGAAATCAGCACGGGTCTCAGTTTAGAACTGTAAGTGGAACGGAACAAGTTAACGAAAGAATTCCAATTTCTGGAGGTTCGACCGGCAGTTATGTTAGTAGTAGTTCATATGGCACTCGCAGAACTCCAGAAGTGTCATCTGACGGAAGATTGTCTGACTGGGAATCACAAAGAGGGCAGGTTGTTTATGGTCGTCATCCAGACGCAGGTTATACGCGTGTTTCTGGATCGCAAGATTTGAATGGAAGGGGAATTTCTGGTGGTAGTTTGAATAATCGAGAAGAATCTTATTACACTAGAACGCATGGTACTGAGATACCTTTAGGCCAAATTCGGCATTATGATCAATTTCAAACCTTCTCCACTTCAGGCTCCCCCTCTGCTTCCTCAATAAGTCGCCCTGACATGCATGTGACAACTATTCAATCTGGTCATGGGGTACAGAGTGGTCATCTAGTTACACAGGGACTTGATTTAGGACAAATATCACAAATTCCTGATTGTGATGAAGGTACTAATGGACATAGTTCATATAAACAATCTTACAGCCATAGAGTCTATAGGGGGGCTAACGAACCTCAGCACCTTACTCAACAAGCGGAAGATCTTACCCAGCAAACGGAAGATCTTACCCAACAAACAGAGGATTTCGGACAGCAATCTTTTTTGAAGCCTGGTAAATTGCAAGTTGAGAATGAACACTTTCAAGATCTGACTCAGCAAACTCAGGATCTTACCCAGCAAACTCAGGATCTTACCCAGCAAACCCAGGATCTTACCCAGCAGACTGAAGATTTTACACAACAAAGTCAAGATTTCGGTCAACAATCATCTTCGAGACCGGGTAAATTGGAATTTGGGAGTCAACAAGTTGAAGATCTGACTCAGCAAACTCAGGATCTTACCCAACAAACCGAAGA
TCTTACCCAGCAAACTCAGGATCTTACCCAGCAAACCCAGGATCTTACCCAGCAGACTGAAGATTTTACACAACAAAGTCAAGACTTCGGTCAACAATCATCTTCGAGACCGGGTAAATTGGAAGTTGGGAGTCAACAAGTTCAAGATCTGACTCAGCAAACTCAGGATCTTACCCAACAAACCGAAGATCTTACCCAGCAAACTCAGGATCTTACCCAGCAAACCCAGGATCTTACCCAGCAGACTGAAGATTTTACACAACAAAGTCAAGATTTCGGTCAGCAATCATCTTCGAGACCGGGCAAATTGGAAGTTGGGAGTCAACAAGTTGAAGATCTTACCCAGCAAACTCAGGATCTTACCCAGCAAACCCAGGATCTTACCCAACAGACTGAAGATTTTACACAACAAAGTCAAGATTTCGGTCAGCAATCATCTTCGAGACCGGGTAAATTGGAAGTTGGGAGTCAACAAGTTCAAGATCTGACTCAGCAAACTCAGGATCTTACCCAGCAAACTCAGGATCTTACCCAACAAACCGAAGATCTTACCCAACAAACTCAGGATCTTACCCAACAGACTGAAGATTTTACACAACAAAGTCAAGATTTCGGTCAGCAATCATCTTCGAGACCGGGTAAATTGGAAGTTGGGAGTCAACAAGTTCAAGATCTGACTCAGCAAACTCAGGATCTTACCCAGCAAACTCAGGATCTTACCCAGCAAACTCAGGATCTTACCCAACAAACCGAAGATCTTACTCAACAAACTCAGGATCTTACCCAACAGACGGAAGATCTTACTCAACAAACAGAAGGAATGCAACAAGATTATTCCCCACTTTGGAACCATGAACGTTGGCGAACTAATGACCCACATTATGCACCTCGGCCAATTGAGCATGAAGAAGTTGTGAGACCACAAAACTCAGGTATCGCTTATCAACAGCCCAGggatcttacacaacaaacacaggatctgactcaacaaacggaagattttggccaacaaactgaggatctgactcaacaaacggtggatctcggccaagaaaatgaggatctcactcaacaaacggtagatcttggccaacaaaccgaggatctgactcaacaaacggaagatcttggtcaacaaacggttgatcttggtcaagaaaatgaggatctgactcaacaaacggttgatcttggtcaagaaaatgaggatctgactcaacaaacggtagatcttggccaacaaaccgaggatctgactcaacaaacggaagatcttggtcaacaaaccgaggatctgactcaacaaacggttgatcttggtcaagaaaatgaggatctgactcaacaaacggTTGATCTTGGTCAAGAAAATGAGGATCTGACTCAACAAACGGTAGATCTTGGCCAACAAACCGAGGATCTGACTCAACAAACGCAAGACCTATCGCAGCAAGGGCATATACCAGCTGACCCTCCACTCTGGCACCACGAAACATCTCCAATTATCCGCCCAGATTATGAACCGCCTACATTATCCCCGTTAAGTGTTCACCAGGAAGTTGAGCCTCCCAAGACCTCAGATATTACTAGTCAACAGACAGAGGATCTCGCTCAACAAACGGTAGATTTCGGTCAAGAAAATGAGGATCTCACTCAACAAACGGTGGATCTTGGCCAAGAAAATGAGGATCTCACTCAACAAACGGTGGATCTTGGCCAAGAAAATGAGGATCTCACTCAACAAACGATGGATCTTGGCTATGAAAATGAGGATCTCACTCAACAAACGGTGGATCTTGGCCAAGAAAATCAGGATCTCACTCAACAAACGGTAGATCTTGGCCAAGAAAATGAGGATCTCACTCAACAAACGGTGGATCTTGGCCAAGAAAATGAGGATCTCACTCAACAAACGGTGGATCTTGGCTATGAAAATGAGGATCTCACTCAACAAACGGTGGATCTTGGCCAAGAAAATCAGGATCTCACTCAACAAACGG
TAGATCTTGGCCAAGAAAATGAGGATCTCACTCAACAAACGGTGGATCTTGGCCAAGAAAATGAGGATCTCACTCAACAAACGGTGGATCTTGGCTATGAAAATGAGGATCTCACTCAACAAACGGTGGATCTTGGCTATGAAAATGAGGATCTCACTCAACAAACGGTTAATCTTGGCCAAGAAAATGAGGATCTCACTCAACAAACGGTAGATCTTGGCCAAGAAAATGAGGATCTTACTCAACAGAATGAAGGCTTATCGCAACAAATTCAAGGTACTGATGATTATCTACAAATACCAGCAGACCGTGATTTTGGCCAGGAATCTTCTTGGAAGCCAGGACAAGTAGAGTTTGGAAGTCAACAAACAGAAAACCTTAACCAGCAAACGCAAGGTTTCGGTGAAGAAACCGGTGGCCAAATACAACAAACTGAAAATGGTGATTATGACCGGGGTCAACTAACGAGCGATTCAGGATTTGGACAGCACACTTCTTGGTACTCTCAGTACCCAGAAAATTCAGGTCAACAAACGGAAGATCTTAATCAGCAAACGCAAGGTTTTGGTCAAGAAGCCAATGGCCAAATACAACAAACTGAAAATGGTGATTATGACCGGGGTCAACTAACGAGCGATTCAGGATTTGGACAGCAGACTTCTTGGTACTCTCAGTACCCAGAAAATTCAGGTCAACAAACGGAAGATCTTAATCAGCAAACGCAAGGTTTTGGTCAAGAAGCCAATGGTCAAATACAACAAACTGAAAATGGTGATTATGACCGGAGTCAAATAACGAGCGACTCAGGATTCGGACAGCAGACTTCTTGGAACTCTCACCACCTGGAAAATGCAGGTCAACGAACTGAAACCTTTGATCAAGAAAATCAGTTTGGTGGACAACAAGCAATCATCCATCCCGGACAAGCAACAGGACGAGCACCGAAACCTGCACCAAAACCAAGACGTCCAAGACCCACAAATTCCCATCACACTCAACAGATTAATATAGAGATGGAAGAACCAGCTGTACCTAATGCCGACAGTCATACGCGACAACACAATGATCAGGAAAATAGTGAAAAATGGGTATCAACAAATGTTCCTTCCAGTTCACAAAGAGGCGATCAAGGTATTAACGCAAACTCTATCGAATCGGAAGAAGCGGATATTCAAATTGAATCAGAGATACCCAAAGTGCCCGAACATCAAGTGGAATATGTCCCGCCGTATCCAGATTCATCTAGCCAGGGAATCACCAGTGAGAATCATTTCAGACAAACTCAGACTACTAAAACCAAGACAAGTCGCAGAGGCGGGCATCCCGGTGCGCAATACCAAGGTCCGCAAGGATGGCATTCTCCTCGTGAGCCACAAGTTGTTCAAGAGCCGACAATGGTACTTATCGACAGACGTCAACTTGGGCAAGAAAACTCAGGTTACTCAAGCGCCAGGCAAGTTGGAGAAGATTTTGAGCAAGATTTGACTAGTGTGCCTAAGAAAATTCAACCTCTTGGTGCAGACATAGAATCTAGACAAGCAAGCAGTCGGTCAGATAGAATAATCTTTCCCGACTCTCCGGAAATCTCTTTTGAACCTCGGATCTTGGAAGCATTCGGAGCTAAAGGACCATATGGAGAACATGATCCGGACATATTTGATTCGGCCAAACCAAACACTGACCCTGTAGTTTTAACACCACCCGAAGAGGGCAATGATTGGGATATTCGCGAGGTTGATTCGCGAGTTacatccacgacgactctgccgccaacaacaacgccaacaacaacaacaacaacaacaacaacaacaacaacaacaacaacatcgacaacgccacttcctccaacttcaaCTCCACCTCCGCCAACTCCGGCTCCTGGATTCTGGAAAAAGTTTGGCAACACTTTGGCCAGCACCGTAGACAAAGCCAAGGAAAAGGCACGAGACTGGTTCGGCTAA
Protein Sequence
MAPAAVAIFLISLLAETFSAPSGCAECQTWESHSGSHSQRGFSSQTTEDDLSQRSGKLEDLTQQAEFELNRSRNKLAFDNTRPGNWTDVNHYGTSDGHGRVYEEQGQRVDGPTRIRYMRKNFTSSYSSGGLGSFGETNLGRIYPHISQDASQSLNLDQTQNSAYDQFAIRRNSHNTQDSVHSMERVNSRDDASRYYGTHGSSGRLRETSSDQSAQQGLDALGQIRPGNWSTVNTYRTDGGNGRVYEERRQIVTGPRQVISYRRNYTTSYSSGGDIPTFGAGSDETRNIESNVQQQQRQFDSYGRELQETSGGSTNGGYTRYYSGHHITPGQANYRYVSRPGNYEPLESNPHQTYQQTTNFGNQHGSQFRTVSGTEQVNERIPISGGSTGSYVSSSSYGTRRTPEVSSDGRLSDWESQRGQVVYGRHPDAGYTRVSGSQDLNGRGISGGSLNNREESYYTRTHGTEIPLGQIRHYDQFQTFSTSGSPSASSISRPDMHVTTIQSGHGVQSGHLVTQGLDLGQISQIPDCDEGTNGHSSYKQSYSHRVYRGANEPQHLTQQAEDLTQQTEDLTQQTEDFGQQSFLKPGKLQVENEHFQDLTQQTQDLTQQTQDLTQQTQDLTQQTEDFTQQSQDFGQQSSSRPGKLEFGSQQVEDLTQQTQDLTQQTEDLTQQTQDLTQQTQDLTQQTEDFTQQSQDFGQQSSSRPGKLEVGSQQVQDLTQQTQDLTQQTEDLTQQTQDLTQQTQDLTQQTEDFTQQSQDFGQQSSSRPGKLEVGSQQVEDLTQQTQDLTQQTQDLTQQTEDFTQQSQDFGQQSSSRPGKLEVGSQQVQDLTQQTQDLTQQTQDLTQQTEDLTQQTQDLTQQTEDFTQQSQDFGQQSSSRPGKLEVGSQQVQDLTQQTQDLTQQTQDLTQQTQDLTQQTEDLTQQTQDLTQQTEDLTQQTEGMQQDYSPLWNHERWRTNDPHYAPRPIEHEEVVRPQNSGIAYQQPRDLTQQTQDLTQQTEDFGQQTEDLTQQTVDLGQENEDLTQQTVDLGQQTEDLTQQTEDLGQQTVDLGQENEDLTQQTVDLGQENEDLTQQTVDLGQQTEDLTQQTEDLGQQTEDLTQQTVDLGQENEDLTQQTVDLGQENEDLTQQTVDLGQQTEDLTQQTQDLSQQGHIPADPPLWHHETSPIIRPDYEPPTLSPLSVHQEVEPPKTSDITSQQTEDLAQQTVDFGQENEDLTQQTVDLGQENEDLTQQTVDLGQENEDLTQQTMDLGYENEDLTQQTVDLGQENQDLTQQTVDLGQENEDLTQQTVDLGQENEDLTQQTVDLGYENEDLTQQTVDLGQENQDLTQQTVDLGQENEDLTQQTVDLGQENEDLTQQTVDLGYENEDLTQQTVDLGYENEDLTQQTVNLGQENEDLTQQTVDLGQENEDLTQQNEGLSQQIQGTDDYLQIPADRDFGQESSWKPGQVEFGSQQTENLNQQTQGFGEETGGQIQQTENGDYDRGQLTSDSGFGQHTSWYSQYPENSGQQTEDLNQQTQGFGQEANGQIQQTENGDYDRGQLTSDSGFGQQTSWYSQYPENSGQQTEDLNQQTQGFGQEANGQIQQTENGDYDRSQITSDSGFGQQTSWNSHHLENAGQRTETFDQENQFGGQQAIIHPGQATGRAPKPAPKPRRPRPTNSHHTQQINIEMEEPAVPNADSHTRQHNDQENSEKWVSTNVPSSSQRGDQGINANSIESEEADIQIESEIPKVPEHQVEYVPPYPDSSSQGITSENHFRQTQTTKTKTSRRGGHPGAQYQGPQGWHSPREPQVVQEPTMVLIDRRQLGQENSGYSSARQVGEDFEQDLTSVPKKIQPLGADIESRQASSRSDRIIFPDSPEISFEPRILEAFGAKGPYGEHDPDIFDSAKPNTDPVVLTPPEEGNDWDIREVDSRVTSTTTLPPTTTPTTTTTTTTTTTTTTTSTTPLPPTSTPPPPTPAPGFWKKFGNTLASTVDKAKEKARDWFG
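The coding sequence and the protein sequence can be checked against each other by translation. The following sketch assumes the standard genetic code and treats the lowercase stretches of the coding sequence as ordinary bases. The `translate` helper is a hypothetical name, and only the first 30 and last 15 nucleotides of the coding sequence are used to keep the example short.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()  # case is ignored, so lowercase bases translate normally
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 15 nucleotides of the Coding Sequence field.
cds_start = "ATGGCTCCCGCAGCGGTGGCTATATTTTTA"
cds_end = "GACTGGTTCGGCTAA"

print(translate(cds_start))
print(translate(cds_end))
```

Both fragments agree with the corresponding ends of the Protein Sequence field. Running the same function on the full coding sequence, with the line breaks removed, would extend the check to the whole protein.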

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-