Basic Information

Gene Symbol
-
Assembly
GCA_036983795.1
Location
CM072789.1:1932173-1943567[-]
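
The Location field packs the sequence accession, the coordinates, and the strand into one string. A minimal Python sketch for splitting it apart is shown here. The `Locus` type and `parse_location` helper are hypothetical names introduced for illustration, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """Hypothetical container for one Location field."""
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "CM072789.1:1932173-1943567[-]".
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Locus:
    """Split a Location string into accession, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(
        seqid=match["seqid"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


locus = parse_location("CM072789.1:1932173-1943567[-]")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = locus.end - locus.start + 1
print(locus, span)
```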

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA-binding regions adjacent to α-helical regions that fold together as coiled coils.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 38 0.00011 0.12 12.9 1.7 30 58 38 66 25 70 0.89
2 38 7 7.8e+03 -2.5 0.1 28 47 93 112 91 117 0.55
3 38 2.2 2.4e+03 -0.8 1.6 37 62 136 161 132 171 0.51
4 38 0.012 13 6.4 9.6 24 59 175 210 168 215 0.75
5 38 0.0063 7.1 7.3 10.1 27 63 213 249 210 250 0.94
6 38 0.024 27 5.4 9.3 25 64 253 292 251 299 0.73
7 38 0.036 41 4.8 5.6 34 61 300 327 295 331 0.87
8 38 0.36 4.1e+02 1.6 9.3 25 57 347 379 336 387 0.73
9 38 0.82 9.2e+02 0.5 10.3 29 60 351 382 347 402 0.53
10 38 0.016 18 6.0 5.3 21 63 399 441 399 443 0.92
11 38 0.0011 1.2 9.7 7.1 21 62 462 503 461 506 0.89
12 38 0.00039 0.44 11.2 2.4 32 61 501 530 497 534 0.87
13 38 2.8e-05 0.032 14.8 1.1 25 65 515 555 513 555 0.94
14 38 2.8e-05 0.031 14.8 1.8 25 59 543 577 532 583 0.51
15 38 8.3e-05 0.093 13.3 5.6 33 63 572 602 570 610 0.53
16 38 0.00018 0.2 12.2 8.9 24 60 605 641 604 643 0.92
17 38 0.027 30 5.3 5.4 33 62 635 664 634 666 0.81
18 38 0.0008 0.9 10.2 3.8 26 64 670 708 667 709 0.88
19 38 0.00028 0.31 11.6 11.1 24 64 703 743 702 744 0.92
20 38 0.029 32 5.2 5.5 33 62 733 762 732 764 0.81
21 38 0.0016 1.8 9.2 3.7 26 60 768 802 765 805 0.76
22 38 0.0004 0.45 11.1 5.3 22 63 813 861 812 863 0.77
23 38 0.0021 2.4 8.8 2.3 27 64 881 918 878 925 0.79
24 38 0.31 3.5e+02 1.9 4.3 31 57 934 960 918 967 0.61
25 38 0.039 44 4.8 9.6 27 63 979 1015 962 1017 0.69
26 38 0.0091 10 6.8 1.7 26 65 1006 1045 1004 1052 0.75
27 38 0.0079 8.9 7.0 4.7 23 52 1055 1084 1045 1097 0.61
28 38 0.26 2.9e+02 2.1 6.1 30 57 1111 1138 1102 1162 0.61
29 38 0.00061 0.69 10.5 3.8 25 63 1158 1196 1154 1198 0.81
30 38 0.0057 6.5 7.4 7.9 24 65 1213 1254 1212 1254 0.90
31 38 1.3e-05 0.015 15.8 10.3 22 63 1239 1280 1238 1282 0.93
32 38 0.03 34 5.1 3.7 28 60 1287 1319 1280 1324 0.75
33 38 2.6e-05 0.029 14.9 10.2 24 63 1332 1371 1327 1373 0.91
34 38 0.047 53 4.5 3.1 29 60 1379 1410 1371 1415 0.76
35 38 0.11 1.3e+02 3.3 11.6 29 65 1414 1450 1399 1450 0.85
36 38 0.012 14 6.4 6.2 28 57 1441 1470 1436 1476 0.74
37 38 0.0011 1.2 9.7 3.9 34 65 1479 1510 1474 1510 0.91
38 38 0.017 19 5.9 3.5 25 47 1505 1527 1504 1530 0.72
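
The rows above are whitespace-separated, with thirteen fields per domain hit. The following Python sketch parses a few of them and filters by independent E-value. The `DomainHit` type, the `parse_hit` helper, and the 0.1 cutoff are assumptions made for illustration; the record does not say which threshold, if any, was applied.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """Hypothetical record for one row of the per-domain table."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line: str) -> DomainHit:
    """Parse one whitespace-separated row into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]), float(fields[4]), float(fields[5]),
        *(int(value) for value in fields[6:12]),
        float(fields[12]),
    )


# Four rows copied from the table above (domains 1, 2, 13, and 31).
rows = """\
1 38 0.00011 0.12 12.9 1.7 30 58 38 66 25 70 0.89
2 38 7 7.8e+03 -2.5 0.1 28 47 93 112 91 117 0.55
13 38 2.8e-05 0.032 14.8 1.1 25 65 515 555 513 555 0.94
31 38 1.3e-05 0.015 15.8 10.3 22 63 1239 1280 1238 1282 0.93
"""

hits = [parse_hit(line) for line in rows.splitlines() if line.strip()]

# Illustrative cutoff only; not a threshold taken from this record.
I_EVALUE_CUTOFF = 0.1
confident = [hit for hit in hits if hit.i_evalue <= I_EVALUE_CUTOFF]
best = max(hits, key=lambda hit: hit.score)
print([hit.index for hit in confident], best.index)
```

The same parser can be applied to all 38 rows; only the sample string needs to change.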

Sequence Information

Coding Sequence
ATGATATcgcatttgaaattaaaactggCGGAAACCGAGGAGGACGTCAGCTGTCCTGCCATATATCGTTTAAGGGCAAAGCTTCGCGAATTGATGAAAGGTGGTCGGGATTTGCCGAAGAACTGCGATGATCTGCAAGTTGAAAACGATCGTCTTTTAGCTGAAATAGCTGAGTTACAACGCCAGTTGGCCAACCTTGACGAAAGGGAAATAACCGAGCGAATGTTGCCAGCGAAATCGGTCGAAACAACCACTGTGCCGGAATACATAGATGTATCTGAGTTGTTGCGAAAGCTCAAAGATTGTGAAAATACTGTATTCGGTTTGACACAACAGTTAGCAGAAAAAGACGATCTCATTGATTCATTGAATAAGGGACTTGAAGGCACGATCAGTCAAAAAGATTTATTGGATGAGATCGCGGCACTGAAAGCGGAACTTCAAGAAAAAGATGACAAGATCCAGGAACTATTAAACGAACTAAGACAAgcagaaataaatttgctcgagttaaataatttgaaatcacaACTGGATGACCTGAAATCAGAATTAGAGGATTTAGAATTGGAGAAGAACCAACTGTTGGAAGAGCTGGCTAAATTACAAAACGAACTTGCGCACTGTAACGCAATAAAGGAAAGTCTAGAAAAGCAGCTAGAAGTTTTAAGGAATGACaacgaaaaattattgaaagaacTGGATAATGCGAAGGAACAACTCCTGGCACTGACTAATCAGTTGGAAGAGGAAAGGGCGGCCAGAAATGCCttagaagaaaatttgaaaaattgccaaGATGAACTCGAAAGGTTACAGAAAGATAATACGAATCTGAGGGATCAGCTGGAGGCtgcaaaagaagaaaataataaactgcGTGAAGATGTTGAAGCGGCGAAGAAGCTGGCTGAAGAGAACGAGAGGTTGAAAGCGGACCTGGAGAAGATGAAGAAAGAGAACGAggaattgatgaatttgaacAACGTCTTAAAGAGCGATTACGACAGCATGAAGCAAGCATTGGATAACCTAGAAGCAGAAATTAACAGACTGCAGAACGAATTGAATAAGGCTGAAGAAGAACGCAAAGCGTTGCTAGACGAGAACAGCAACATTAAGAAGCAACTTGAAGAAGCAATGGCGAGGAACGAGAGTCTAAAAGCTGAATTGGATAATGTCGGTGAACAActcaacaaaatgaaattagagAAGGATAAACTGCAAGAGGCTCTGAACGATATGAAGCTTGAGAACGATGCGTTGAAACAAGATGTGCGGGCTTTGCAAAGCGACCTTGATCATGCGAGAAAGGAAGCGGAAGACCTAAGAGGCGCTGGAGACGCATTAAGAGCCGCGGACAAAGACAAACTGTCCGAGCTTCAAAAGCTCAAAGATgaattggacaatttgacAACCGAAAAGGATCGCTTAACGAACGAAAATATCGATTTGAAGGCCAGAAATGCGGAGCTCGAGAAAAAACTCAAGGACGCAATGGAACAGGTGGAACAAATGAAATCGGAGAATGCCGATTTACTGGCCGAGATCGATCGTCTAAAAAAGGAGCTCGACAAAGCTGCGAACGAAGTCGATCGATTGAAATCTGAAATAGGTTCTTTGAAGGACGCTGTCGACAAGTGTATGGACGAATTGGAGAAATTGCAAACCGAAAATGGCGATCTTAAATCAGAGAATGAAGCTGTTAAAAGTGAAATTGAGAAGTGCAAAGCTGAGAGGGACGCTTTGAAACAGGAAAATTCTACTTTGCAAAACGAGATTGACGAGTTGAGGAAACAACTGAACGATTGTAAAACAGAGAACGAAAACTTGAAAGCGCAGAAAAATCAATTGGAAGCTGAAAATGATAAGTTGAGAGAAGAGTTGAACGCTTGCAAACAAGAAAATGAGGCGATGAAGGctgaaagtgaaaaattacGAGAACAGGTACAATCGTTGAATGACGAATTGAGTAAGCTACGGAATCAGCTGGATATTGC
AGAACgtaaaattcaggaattcgagCCTTTGGTCGATCGTTTGCAAAAGGAAAATGATAAATCGCAAAACGAGATTGACGAGTTGAGAAAACAACTGAACGATTGTAAAACAGAGAACGAAAACTTGAAAGCGCAGAAAAATCAATTGGAAGCTGAAAATGATAAGTTGAGAGAAGAGTTGAACGCTTGCAAACAAGAAAATGAGGCGATGAAGGctgaaagtgaaaaattacGAGAACAGGTACAATCGTTGAATGACGAATTGAGTAAGCTACGGAATCAGCTGGATATTGCAGAACGTAAAATTCAGGAACTCGAGCCTTTGGTCGATCGTTTGCAAAAGGAAAacgataaattgcaaaatgatcTGAAAGCGTTAGAGGATGATGCAAAAAACTTAAGATCAAGGCTAGATGGCGGAATGAGTGACAATGAAAGAATGCGAAACGACATGGCGATATTAGAAAATCAAGTAGGAGATTTGAATGAGAAATTAAAGGGAGCTAAAGCAGAAAATGACGCTTTGCAGCAAGAGAATCAAACGCTACGAGCAAAACTATTAGAACTGGATGACGAATTGTCTCAAGCGAAAGCAGAATGCGCGGATTTGAAGGCGGAAATTgctgatttaaataatttaatttccgaattacgagcaaaaattgctaaattggAAGAGGATGTAGAACATTGGAAACTGGAGAATTGTAAGCTGCAGATGGaaatagataaattaaaaGCGGATCTCGAGAAAGCATTAAAAGATTTATCCGAATGCCAGGCGCTGAAAAAAGCACAAGAAGCAGAGTTGAACCGGCTGCAGATCGAAAAAGCTGAGTTGAATAAACAAATCGCCGGTCTAACTGCGCAGATAGAACAACAGAAGAAAGCTGCTGAATTAGAAAAATCCGCCAAGGACGAAAGTGAGGCAAAACTCAAAGCTTTGCGGGAAGAGTTGGACGCATTGAAGAAGGAGTTAGAGAAACTTCGAATGGAGAATAACGATTACAAGAACGAAAtggataatttgaaaagacaGCTTTCTACGTTAAACGGTCAGTTAGATTCGTGCAAAGAAGAGATCGCTGCGTTGAGAGCCACAAATGATGCGTTGAAGACTGAATTAAACGCATTGAGTGGTCTAAAAGACGAATACGATCAACTAAAAGCTAAAGTGAACAGTTTGGAAAATGAAATCGCGAGTCTTCAAGAAAACGCAAGGAATTTGGAACAGGAACGCAACAAACTTAGAGGAGAGGGTGACGGACagagaattgaaattgataaactgAAATCAGACTTGGATGCCGAAAAAGCAGCCGCAGGGAAACTGAAGttagatttggagaattgtcAAGCAGAAAACGACAGATTACGGGCACAGTTgaaagatttagaaaaatgtaaaagcgAGATTGATCGGTTAAATGCCGAAGTTGGTGAACTGAAGAAAGCACTAGCGGCCGCTGAAGCTAAAGCGAAGTCGTTGGAAGATCAACTCTCGAACCTCAAAGATGAAAAGCAACAGTTGATTAATGAACTCGACAATCTTCGTGGAGATCTGAGCGATCTTAggaatgaaatagaaaaacaGACAGCCGCAAAGGACAAGGCGTTAAAGGAGTTGGCTGACGTTAAAGAGGAGCTGAATGCTCTGAAGTCAACGTTGGATAAAATGCGcaatgaaaatgaaacgttACTGAACGAGAATGAAAAGTTGAAGTCGAAATTGGCAGAATTAAATGGACAGTTGGAAGCGTTAAGAAATGAGAACGAAAagttaaagaaagaaaatgagaaCTTAAAGAACGAGATTGCAAAATTGACTGCGGAATTAGCTACgatgacaaataaattaaaagaagcgGAGGATCAGTTGAACGCGCTAAAGAATGAGAACGATACTCTGAAGAATACGATAGCTGAACAAGAGAAAACAATAAAAGACCTCGAAGCAGCAAAAATACAATTAGAGCAGGCTATCAACGAGTTGAAGCCGA
AATTGGCAGAATTAAATGAACAGTTAGAAACGTTaagaaacgaaaatgaaaagttaaagaaagaaaatgaggATTTAAAGAACCAGACTGCAAAATTGACTTCGGAATTAGATGCaatgacaaataaattaaaaggcGCGGAGGATCAGTTGAACGCGCTAAAGAATGAAAACGATACTCTGAAGAATACGATAGCTAAACAAGAGAAAGCAATAAAAGAGCTCGAAGCAGCAAAAATACAATTAGAGCAGGCTATGAAGGAGTTGAAGTCGGAAAATGAACGACTGAAAGGCAAGCTAGAAGACGCGCAAAACACagcgaataaattgaaaaacgatTTGGAGAAGCTGAAAACAGACAAcgcgaaattgcaaaatgaattAGGTAAATTAAaagaggagaaggagaagTCTGATGCAGCGGCGAAAGGTGATGCagatagaataaaaaaagaaaatgagaaattgagaGCTGAAAACGAGAAATTGATGGACGAGTTGAACACTTGTCGAGCAGAAAACGAAGAGTTACGTAAACAATTGGAAAAGTTACAGggagaaaatgataaattgaaaagagCTGCAGGTCtgacaattattataatatcacttaaattatttaaacatacatTTGTTATTACTAGAACAATCAGTACCCGTCAGTATGGTACTTGA
Protein Sequence
MISHLKLKLAETEEDVSCPAIYRLRAKLRELMKGGRDLPKNCDDLQVENDRLLAEIAELQRQLANLDEREITERMLPAKSVETTTVPEYIDVSELLRKLKDCENTVFGLTQQLAEKDDLIDSLNKGLEGTISQKDLLDEIAALKAELQEKDDKIQELLNELRQAEINLLELNNLKSQLDDLKSELEDLELEKNQLLEELAKLQNELAHCNAIKESLEKQLEVLRNDNEKLLKELDNAKEQLLALTNQLEEERAARNALEENLKNCQDELERLQKDNTNLRDQLEAAKEENNKLREDVEAAKKLAEENERLKADLEKMKKENEELMNLNNVLKSDYDSMKQALDNLEAEINRLQNELNKAEEERKALLDENSNIKKQLEEAMARNESLKAELDNVGEQLNKMKLEKDKLQEALNDMKLENDALKQDVRALQSDLDHARKEAEDLRGAGDALRAADKDKLSELQKLKDELDNLTTEKDRLTNENIDLKARNAELEKKLKDAMEQVEQMKSENADLLAEIDRLKKELDKAANEVDRLKSEIGSLKDAVDKCMDELEKLQTENGDLKSENEAVKSEIEKCKAERDALKQENSTLQNEIDELRKQLNDCKTENENLKAQKNQLEAENDKLREELNACKQENEAMKAESEKLREQVQSLNDELSKLRNQLDIAERKIQEFEPLVDRLQKENDKSQNEIDELRKQLNDCKTENENLKAQKNQLEAENDKLREELNACKQENEAMKAESEKLREQVQSLNDELSKLRNQLDIAERKIQELEPLVDRLQKENDKLQNDLKALEDDAKNLRSRLDGGMSDNERMRNDMAILENQVGDLNEKLKGAKAENDALQQENQTLRAKLLELDDELSQAKAECADLKAEIADLNNLISELRAKIAKLEEDVEHWKLENCKLQMEIDKLKADLEKALKDLSECQALKKAQEAELNRLQIEKAELNKQIAGLTAQIEQQKKAAELEKSAKDESEAKLKALREELDALKKELEKLRMENNDYKNEMDNLKRQLSTLNGQLDSCKEEIAALRATNDALKTELNALSGLKDEYDQLKAKVNSLENEIASLQENARNLEQERNKLRGEGDGQRIEIDKLKSDLDAEKAAAGKLKLDLENCQAENDRLRAQLKDLEKCKSEIDRLNAEVGELKKALAAAEAKAKSLEDQLSNLKDEKQQLINELDNLRGDLSDLRNEIEKQTAAKDKALKELADVKEELNALKSTLDKMRNENETLLNENEKLKSKLAELNGQLEALRNENEKLKKENENLKNEIAKLTAELATMTNKLKEAEDQLNALKNENDTLKNTIAEQEKTIKDLEAAKIQLEQAINELKPKLAELNEQLETLRNENEKLKKENEDLKNQTAKLTSELDAMTNKLKGAEDQLNALKNENDTLKNTIAKQEKAIKELEAAKIQLEQAMKELKSENERLKGKLEDAQNTANKLKNDLEKLKTDNAKLQNELGKLKEEKEKSDAAAKGDADRIKKENEKLRAENEKLMDELNTCRAENEELRKQLEKLQGENDKLKRAAGLTIIIISLKLFKHTFVITRTISTRQYGT
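
As a consistency check, the coding sequence can be translated and compared with the protein sequence. The sketch below uses the standard genetic code (an assumption; the record does not name a translation table) and a hypothetical `translate` helper. It is applied only to short excerpts from the start and end of the coding sequence. Letter case is ignored, since the record does not say what the lowercase stretches mark.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; stop codons become '*', unknown codons 'X'."""
    sequence = cds.upper()
    if len(sequence) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(sequence[i:i + 3], "X")
        for i in range(0, len(sequence), 3)
    )


# First 48 nucleotides and last 18 nucleotides of the coding sequence above.
five_prime = "ATGATATcgcatttgaaattaaaactggCGGAAACCGAGGAGGACGTC"
three_prime = "CGTCAGTATGGTACTTGA"

print(translate(five_prime))   # compare with the start of the protein sequence
print(translate(three_prime))  # compare with the end, plus the stop codon
```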

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00964227;
90% Identity
-
80% Identity
-
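
A clustering of this kind could be reproduced roughly as follows. This is a sketch under stated assumptions, not the pipeline used for this record: it only builds `mmseqs easy-cluster` command lines for the three identity levels listed above and groups the two-column cluster TSV (representative, member) that MMseqs2 writes. The file names, output prefixes, helper names, and placeholder identifiers are hypothetical, and any coverage or sensitivity settings used by the database are unknown.

```python
from collections import defaultdict

# Identity levels listed in the section above.
IDENTITY_LEVELS = (1.0, 0.9, 0.8)


def easy_cluster_command(fasta, out_prefix, tmp_dir, min_seq_id):
    """Build (but do not run) an `mmseqs easy-cluster` command line."""
    return [
        "mmseqs", "easy-cluster", fasta, out_prefix, tmp_dir,
        "--min-seq-id", f"{min_seq_id:.1f}",
    ]


def group_clusters(tsv_text):
    """Group a representative<TAB>member TSV into {representative: [members]}."""
    clusters = defaultdict(list)
    for line in tsv_text.splitlines():
        if not line.strip():
            continue
        representative, member = line.split("\t")
        clusters[representative].append(member)
    return dict(clusters)


# Hypothetical input file and output prefixes.
commands = [
    easy_cluster_command("tfs.fasta", f"clu_{round(level * 100)}", "tmp", level)
    for level in IDENTITY_LEVELS
]
for command in commands:
    print(" ".join(command))

# Placeholder identifiers, not taken from the database.
example_tsv = "repA\trepA\nrepA\tmemB\nrepC\trepC\n"
print(group_clusters(example_tsv))
```

Running the commands requires a local MMseqs2 installation, for example through `subprocess.run(command, check=True)`.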