Basic Information

Gene Symbol
-
Assembly
GCA_907164935.1
Location
NC:2096046-2102670[-]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 39 8e-05 0.035 14.5 2.0 32 60 38 66 23 69 0.82
2 39 5.3 2.3e+03 -0.9 0.2 28 58 91 121 89 127 0.75
3 39 0.17 73 3.9 6.0 26 62 172 208 167 211 0.80
4 39 0.032 14 6.2 1.3 27 62 208 243 205 245 0.88
5 39 0.016 7.1 7.1 3.9 26 63 249 286 246 288 0.85
6 39 0.085 37 4.8 6.4 21 62 289 330 286 333 0.91
7 39 0.47 2e+02 2.5 8.6 25 63 335 373 331 375 0.87
8 39 0.14 62 4.1 4.5 26 63 371 408 368 410 0.88
9 39 0.87 3.8e+02 1.6 5.3 23 52 410 439 404 451 0.71
10 39 0.034 15 6.1 0.5 33 64 462 493 458 494 0.91
11 39 0.01 4.5 7.8 5.4 24 64 474 514 472 529 0.69
12 39 0.00035 0.15 12.5 1.5 24 65 502 543 494 543 0.90
13 39 0.00053 0.23 11.9 4.3 26 62 539 575 534 578 0.59
14 39 3.3e-06 0.0014 19.0 7.0 21 64 590 633 589 634 0.95
15 39 0.0089 3.9 8.0 2.9 22 65 626 669 625 669 0.94
16 39 0.021 8.9 6.8 7.7 24 65 657 697 656 697 0.84
17 39 0.00051 0.22 12.0 9.7 25 63 692 730 691 732 0.84
18 39 0.024 10 6.6 8.9 35 63 730 758 726 760 0.88
19 39 0.0017 0.75 10.3 1.9 29 62 766 799 762 801 0.89
20 39 0.12 51 4.4 5.1 22 59 808 852 807 858 0.76
21 39 0.00097 0.42 11.1 2.7 24 64 873 913 870 914 0.94
22 39 0.7 3e+02 1.9 6.5 26 62 917 953 912 956 0.86
23 39 1 4.5e+02 1.4 7.3 28 65 951 991 948 997 0.68
24 39 0.069 30 5.1 6.7 25 57 1007 1039 989 1051 0.55
25 39 0.0021 0.92 10.0 4.7 24 64 1041 1081 1031 1082 0.84
26 39 0.0016 0.69 10.4 1.1 21 62 1052 1093 1051 1096 0.88
27 39 0.039 17 5.9 4.6 33 60 1109 1136 1100 1144 0.77
28 39 0.021 9.3 6.8 2.2 31 64 1152 1185 1142 1186 0.89
29 39 8.9 3.9e+03 -1.6 11.9 21 64 1191 1248 1190 1249 0.72
30 39 0.0016 0.7 10.4 4.7 35 64 1247 1276 1242 1277 0.92
31 39 0.45 2e+02 2.5 6.5 30 62 1277 1309 1275 1314 0.72
32 39 0.036 16 6.0 7.9 28 63 1303 1338 1301 1340 0.87
33 39 0.22 97 3.5 3.2 28 59 1345 1376 1339 1378 0.66
34 39 0.028 12 6.4 9.3 28 63 1373 1408 1359 1410 0.59
35 39 0.002 0.88 10.0 7.1 22 62 1409 1449 1408 1452 0.86
36 39 0.78 3.4e+02 1.8 4.4 32 64 1440 1472 1436 1482 0.64
37 39 0.96 4.2e+02 1.5 5.0 29 62 1465 1498 1449 1501 0.63
38 39 0.0068 2.9 8.4 10.1 24 60 1495 1531 1474 1536 0.56
39 39 6.8 2.9e+03 -1.2 0.7 36 52 1723 1739 1696 1746 0.57

Sequence Information

Coding Sequence
ATGATATCGTATTTGAGATTAAAACTTGCGGAAACCGAGGAGGACGTCAGCTGTCCCGCCATATATCGCCTGAGAGCGAAACTTCGTGAATTAATGAAAGGCGGTCAATCGAGGGACTGCGATGATCTACAAGCTGAGAACGATCGTCTTCTAGCTGAAATAGCTGAGTTACGACACCAGTTGTCTGGCATGGAGGAAAAGGAATTAACTGGGAGGATGGTGCAACCGAGATCGGTCGAAACAACCACCGTTCCAGAATACATTGACGTATCTGAGTTGCTGCAGAAGCTTAAAGATTGCGAAGGGACTGTGTCTGATTTGAGAGAACAATTAGCAGAAAAGGACAATCTCGTTAATTCCTTGAAGGAAGGAATGAGTGATCAGAAAGAATTATTGGATGAAATTGCGGCACTGAAAGCGGAACTCCAAAAGAAAGACGACAAgATCAAGGAACTATTAGACGAGCTAAGGCAATCAGAAATAAACTTAGAAGGTTTGAACAATTTGAAATCACAATTAGAAGATCTTAAGCCAAAATTATATGATCTAGAATTAGAAAGAGATCAGTTATTAGAAGAGCTTGCCAAACTTCAAAACGAACTTGCTCATTGTAACGCTATGAAAGAGGACCTGGAATCAAAAATAGAGTccttaaaaaatgacaaagaaaaattattgaagGATCTAGATGATTCTAGAGATCAAATTTCAGCCCTGTCTAAACAATTGGACGATGAAAGATCAGCCAGAAGCGCTCTTGAAAATAACCTTAACAATTGTCAAGatgaaatgaagaaattgCAAGACGAAAATTCGAACCTGAAGAATCGATTAGACACCGCGGAGAAAGATTTAGAAGCCTCGAAGAAGCTCGCCGAAGAGATGGAAATGTTGAAGACGGAGCTggagaagatgaagaagatgAACGAGGAGTTGGCAAATTTGAACGACACCTTGAAGAACGACTACCAGAGCATGAAGCAAGCGTTAGGCAACCTGGAAGCTGAAATTAACAGACTGGAGGAGCAATTGAATAAAACAGCCGAAGAACGGGACGCGTTGTTGAAGGAGAACGAGGACGCGAAGAGGCGGCTCGAAGAAGCGATCGCGAAGAACGAGAGTCTGAAATCCGAACTGGATAACGTCACCGAACAACTCGGCAAATTGAAATTAGAGAAGGATAAACTGGAAGAGGCTCTCAACGATACGAGGCTGAAGAACGACGCCCTGAAACGGGATATCCAGGCTTTGGAAAATGACCTTGAACGTGCAAAACAGGAGATGGAGGATCTGAAAAGATCCGAGGAGGCGTTGAAAGCTGCGGACAAGGGAAAAGATTCGGAACTACGGAAATTCGAGGAGGATTTGAAAAATGCGATAACTGAAAGGGATCGTTTGATGAACGAAAATGGCGATTTGAAGGACAGGAACGCTGAACTAGAAGGAAAACTCAAGGACGTCGTGGACCAGCTGGAACAAATGAAATCGCGGAACGCCGATTTACTGGCTGAGATCGATCGGCTGAAAAAGGAGCTCGAGAAATCTGGCCAGGAAATCGATCAATTGAAGTCTGAAATTACTTCGCTGAAGAACGCCATCGATAAATGTACGGACGAGTTGGACAAACTGCAAACAGAAAATAGGGATCTTAAATCAGAGAATCAGGctatcaaaaatgaaatcgaaAAGTGCAGAAGTGAAAGAGATAATCTGAAGAGTGAGCTCGAAGAGGCAAAAGCACAATTAAGGGACGTCAGTGATGAATTGCAGAAATTAAAAACTAGAAACGATTCGTTGgaggaaaatttgaaaaaattgaccgaTGAAAATGACAATTTGAAATCAGAGGTTGGAGCATTAAGGGACGAGGCGGAGAAGCTGAAGAAAGCAGCCGAGGAACTGCAGGCGGCGGCTACCAGAGGGAAAGATGAAGAAGATCGGTTGAAAAGCGAAATTGATGGGATTAAGCGTGAAAGAGACGATTTCCTGGAAAAATTAAACGAGCTGAAGAACGAGCATTCTGCTTTACAAAACGAGGCGAACGAGTTGAAGAAACAATTGAACGATTGCAGGgcggaaaatgaaaaattgaaagcgGAGAAAGATCAGTTAAAAGCGGAGAAAGATCAGTTAAAAGCTGAAAACGATAGTTTGAAAGCTGAGTTGAATTCgtgtaaaaaggaaaatgaagaaCAGAAGACTGAAAATGAGAAACTGCAAAAACAAATTCAATCGTTAAACGacgaattaaataaattgaagagTCAATTGGAAAATGCAGAAAATACGATTAAGGATCTTCAGCCTCTGGTTGATCGTTTGCAAAACGAAAATGatcgtttgaaaaatgatttaagAATGTTAGAGAATGAAGCAGCAGATTTGAAGGCAAAATTGCTGAATGAAATGAATGACAATCAAAAAATGCGAAACGATTTGATGATCCTGGAAAATCAAGTAAATGATTTGAATGAGAAACTGAAGGGAATTATAGGGGAGAAGGAAAAATTGGAGCAAGAAAATCAAATGCTAAAGGCAAACTTGTTAGAAGCGAATAATGAATTGTCTCAGGCGAAAGGGGATAGTCAGGACTTGAAATTcgaaattgataatttaaagAATAAGATTTCTGAATTAGAGGCGAAAATTGCTAAATCAGAGGCGGATGTTGAACATTGGAAATTAGAGAATTGTAAGCTTCAGATGGAAATCGATAAATTGAAAGCTGAACTTGAAAAGGCGTTGAAGGATTTGAATGAGTTACAGGCTCAGAAAAAAGCACAAGAAGAAGAGTTAGCCCGCCTTCGAACTGTAAACGCTGCTCTAGATAAACAGATCTCTGATCTAAATTCACAAATGGATCAGCAAAGAAGTGCCTTAGAATCAGAAAAATCAGCCAAGGAGAAAAATGACTTGGAAATGAAAGCTCTGAAAGAGGAATTAGGAAGTTTGAAAAACGAATTAGAAAAACTTAGAATGGACAATAACAATTGCAATcacgaaatagaaaatttaagaaaGCAACTTTCAGCGTTGCAAAGCGAATTAAATTCTTGCAGAGAAGAAATGTCTGCGTTAAGAGCCACCAATGATGCACTGAAGACTGAATTAAACGGATTGAAAGCTGAGAAACATAATCTACAAGACGAATACGATAAATTGAAAGCCAAGGCggataatttagaaaaagaaatagcgGATCTTcgagaaggaagaaagaaactCGAAGATGAATACAGCAAACTTAGAGGAGAGGGTGATGGACAGAAGGTTGAAATTGATAAACTCAAGTCAGACTTGGCAGCCGCAGAGAAGGTGAAGatggatttaaaaaattgtcaagaagaaaatgaaaaattacaagCGCAAGTGAAGGATTTGGAAAGATGTAGAAACGAAGTTGATAGACTGAATGCTGAACTGGATGATCTGAAGAAGGCACTCGCAGCTGCCGAGGCTAAAGTGAAATCCTTGGACGATCAACTCTCGTCTCTTAGGAATGAAAGAGAGGAATTGATCAATGAACTTAATCATCATCGAGACAATCTTAACAAtcttaaaaatgaaatggatAAACAGAAAGCTGAGAGGGATTTGGCTGCGAGGGAGTTGGCGGACCTGGAAGAGGAACTGGATGCCCTCAAAGCTACCTTAGATAAGATTCGCGGTGAGAAAGAATCATTGCTAAAGGATAATGAGAAGTTGAACGAAGAATTAAAAACAgtgaaaaatgataatgaaaagttgaaaaatgataatgaaaagttgaaaaatgataatgtaaagttgaaaaatgaaatggatGAATTGAAGTCAAAATTGAAAGAGGCAGAGAATCGGATAAATGATCTGGAGAATCAGAACAAGAACTTGAAGAATACGTTAACTGAAGTTCAGACTGCGCTAAAAGAACTCGAAGCTgtgaaaaaacaattagaagaTGAATTGGATAAACTGAGAGCAGAGCTCGATAAATTGAAGTCTGACAACGAAAGACTGAAGGACGAGCTAACAGAAGAGCAGAACAAATCGAATAGATTGAAGGACGATTTGGAGAAACTGAAAACGGACAATGATAAATTGAAGAATGATTATCAAAACTTGAAGGGTGATTTaagtaaattgaaaaaagaaaaggaggaaTTGGAAAAATCTGATGCAGCAGCGAAGAaagaaattgaaaaattacgcAGGGAATTGAAAAACTTGAAGGATGATTTGATGAAACTGAAAACGGACAATGATAAATTGAAGAATGATTATCAAAACTTGAAGGaagaaaatgaaaaattacgcagggaattgaaaaatttgaaggaTGATTTGATGAAACTGAAAACGGACAATGATAAATTGAAGAATGATTATCAAAACTTGAAGGGTGATTTAGGTAaattgaaaaaggaaaaggaggaaTTGGAAAAATCTGATGCGGCAGCGAAGAGTGGTATAGATaggatgaaaaaggaaaatgacgAACTAAGAGCTGAGAATGTGAAATTAGCTAACCAATTGAATACTTGCCaagaggaaaatgaaaaattacgtAGGGAATTGGAAAACTTGCAGGTGGTGAATGATGCATTAAAGAAAGCAGCAGTGAAACACGTGGAACCAAAAGAAACGGAAATGGACAAAGATATTGTGGATGACTGTGGGGACTTCATTAAGGCAAACGATTTATTAaacgaaaaatttcaaaagcaGAACGAAgGTGTACAACGTGTTCGAGACTACATCATGTATTTAGAAGGTAAAGTTAAAGAGGAGCCAAAAATGGCTGATCAATTAGAGGAACCTGAAATCGATCCACAAAGAAGGGAAGATATTGCAAATTTGTTAAACAAGTCTAAATTGTTGTCTGAAAATATCAATCACACTGAAAAGGAAATACAACAGATTGGTGCAACCGCATTCGATCCTGATTCTTGGCTAAATtcGTTAACATTGACACAGTTAGCAGATCTTCACGACAAGATCTGCCTGTTGACATCGGATATGGTGCAGCATGATAACGGAGCAGTAACATGTGAACCAACAGCACCGACAGCATCAACAGTACCTTGCCCTGGGACGAATCCATTAAGAGCAAATTATAATACTTTGAACCAGCGAATAGCTGCTTTGCAAAAGCAGATCGCAGATAAACAAATAGAGGCAGGTTGGAAGCTGCAAGAGTTGAGACGAGCTCTACGACAGGAACAGGCTAATCTACTTCGAATCTCCGACGAAATGAATTTAGCGAGAAGACGAAATTTAGCTCTTCAGTTCAACATGGAcgattaa
Protein Sequence
MISYLRLKLAETEEDVSCPAIYRLRAKLRELMKGGQSRDCDDLQAENDRLLAEIAELRHQLSGMEEKELTGRMVQPRSVETTTVPEYIDVSELLQKLKDCEGTVSDLREQLAEKDNLVNSLKEGMSDQKELLDEIAALKAELQKKDDKIKELLDELRQSEINLEGLNNLKSQLEDLKPKLYDLELERDQLLEELAKLQNELAHCNAMKEDLESKIESLKNDKEKLLKDLDDSRDQISALSKQLDDERSARSALENNLNNCQDEMKKLQDENSNLKNRLDTAEKDLEASKKLAEEMEMLKTELEKMKKMNEELANLNDTLKNDYQSMKQALGNLEAEINRLEEQLNKTAEERDALLKENEDAKRRLEEAIAKNESLKSELDNVTEQLGKLKLEKDKLEEALNDTRLKNDALKRDIQALENDLERAKQEMEDLKRSEEALKAADKGKDSELRKFEEDLKNAITERDRLMNENGDLKDRNAELEGKLKDVVDQLEQMKSRNADLLAEIDRLKKELEKSGQEIDQLKSEITSLKNAIDKCTDELDKLQTENRDLKSENQAIKNEIEKCRSERDNLKSELEEAKAQLRDVSDELQKLKTRNDSLEENLKKLTDENDNLKSEVGALRDEAEKLKKAAEELQAAATRGKDEEDRLKSEIDGIKRERDDFLEKLNELKNEHSALQNEANELKKQLNDCRAENEKLKAEKDQLKAEKDQLKAENDSLKAELNSCKKENEEQKTENEKLQKQIQSLNDELNKLKSQLENAENTIKDLQPLVDRLQNENDRLKNDLRMLENEAADLKAKLLNEMNDNQKMRNDLMILENQVNDLNEKLKGIIGEKEKLEQENQMLKANLLEANNELSQAKGDSQDLKFEIDNLKNKISELEAKIAKSEADVEHWKLENCKLQMEIDKLKAELEKALKDLNELQAQKKAQEEELARLRTVNAALDKQISDLNSQMDQQRSALESEKSAKEKNDLEMKALKEELGSLKNELEKLRMDNNNCNHEIENLRKQLSALQSELNSCREEMSALRATNDALKTELNGLKAEKHNLQDEYDKLKAKADNLEKEIADLREGRKKLEDEYSKLRGEGDGQKVEIDKLKSDLAAAEKVKMDLKNCQEENEKLQAQVKDLERCRNEVDRLNAELDDLKKALAAAEAKVKSLDDQLSSLRNEREELINELNHHRDNLNNLKNEMDKQKAERDLAARELADLEEELDALKATLDKIRGEKESLLKDNEKLNEELKTVKNDNEKLKNDNEKLKNDNVKLKNEMDELKSKLKEAENRINDLENQNKNLKNTLTEVQTALKELEAVKKQLEDELDKLRAELDKLKSDNERLKDELTEEQNKSNRLKDDLEKLKTDNDKLKNDYQNLKGDLSKLKKEKEELEKSDAAAKKEIEKLRRELKNLKDDLMKLKTDNDKLKNDYQNLKEENEKLRRELKNLKDDLMKLKTDNDKLKNDYQNLKGDLGKLKKEKEELEKSDAAAKSGIDRMKKENDELRAENVKLANQLNTCQEENEKLRRELENLQVVNDALKKAAVKHVEPKETEMDKDIVDDCGDFIKANDLLNEKFQKQNEGVQRVRDYIMYLEGKVKEEPKMADQLEEPEIDPQRREDIANLLNKSKLLSENINHTEKEIQQIGATAFDPDSWLNSLTLTQLADLHDKICLLTSDMVQHDNGAVTCEPTAPTASTVPCPGTNPLRANYNTLNQRIAALQKQIADKQIEAGWKLQELRRALRQEQANLLRISDEMNLARRRNLALQFNMDD

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-