Basic Information

Gene Symbol
-
Assembly
GCA_951802685.1
Location
OX637836.1:11562744-11576712[-]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 43 0.13 1.4e+02 3.1 0.1 25 50 56 81 52 87 0.61
2 43 0.26 2.8e+02 2.1 0.6 39 62 101 124 97 127 0.65
3 43 0.52 5.7e+02 1.1 0.2 40 56 200 216 183 220 0.87
4 43 0.033 35 5.0 1.7 27 59 342 374 319 380 0.66
5 43 0.6 6.5e+02 0.9 3.2 24 46 360 382 358 404 0.74
6 43 0.12 1.3e+02 3.1 1.9 26 53 407 438 391 447 0.70
7 43 0.032 35 5.0 1.6 24 62 444 482 441 485 0.85
8 43 0.0055 6 7.5 5.1 25 63 480 518 476 520 0.88
9 43 0.014 15 6.2 8.0 25 59 536 570 525 576 0.63
10 43 0.24 2.6e+02 2.2 1.8 33 60 565 592 564 594 0.89
11 43 0.0032 3.4 8.2 2.2 29 60 589 620 574 632 0.59
12 43 1.2e-06 0.0013 19.2 2.2 23 63 646 686 644 688 0.93
13 43 0.0011 1.2 9.7 2.7 28 58 693 723 686 730 0.52
14 43 0.0054 5.8 7.5 0.2 22 63 736 777 728 779 0.75
15 43 7.8e-05 0.085 13.4 4.5 26 60 782 816 777 818 0.92
16 43 1.6e-05 0.017 15.6 3.5 25 64 816 855 815 856 0.89
17 43 0.0018 1.9 9.0 1.3 26 64 866 904 864 905 0.95
18 43 8.8e-05 0.095 13.2 2.2 25 64 921 960 911 961 0.85
19 43 0.0011 1.2 9.7 0.4 28 64 952 988 948 989 0.90
20 43 0.00038 0.41 11.2 5.6 20 61 972 1013 965 1017 0.90
21 43 0.066 71 4.0 3.3 36 63 1016 1043 1011 1045 0.78
22 43 2.2e-05 0.023 15.2 1.1 24 64 1046 1086 1044 1086 0.96
23 43 0.00072 0.78 10.3 2.4 25 64 1103 1142 1092 1143 0.72
24 43 0.00016 0.17 12.4 4.5 24 63 1151 1190 1148 1192 0.93
25 43 0.00014 0.15 12.6 4.4 26 62 1181 1217 1180 1226 0.92
26 43 0.004 4.3 7.9 0.4 24 58 1228 1262 1221 1268 0.84
27 43 0.014 15 6.2 8.8 24 62 1284 1322 1267 1332 0.82
28 43 0.0088 9.4 6.8 2.6 30 60 1325 1355 1319 1366 0.61
29 43 0.076 82 3.8 2.4 24 64 1340 1380 1339 1387 0.91
30 43 0.07 75 3.9 9.4 27 63 1392 1428 1379 1430 0.82
31 43 0.00061 0.65 10.5 5.5 18 62 1432 1476 1428 1479 0.67
32 43 0.93 1e+03 0.3 0.2 30 60 1479 1509 1473 1514 0.73
33 43 0.0001 0.11 13.0 3.3 24 61 1511 1548 1510 1551 0.91
34 43 0.00061 0.66 10.5 11.9 23 64 1580 1621 1566 1622 0.87
35 43 0.0003 0.33 11.5 8.3 25 64 1621 1660 1619 1661 0.92
36 43 1.6 1.7e+03 -0.4 3.5 45 63 1676 1694 1660 1714 0.50
37 43 0.4 4.3e+02 1.5 2.0 36 62 1695 1721 1688 1731 0.63
38 43 0.17 1.8e+02 2.7 1.9 26 65 1706 1745 1702 1745 0.87
39 43 0.0011 1.2 9.7 4.9 24 63 1739 1778 1737 1780 0.89
40 43 0.024 26 5.4 0.4 28 65 1788 1825 1783 1825 0.88
41 43 0.024 26 5.4 4.3 34 58 1871 1895 1837 1899 0.63
42 43 0.0034 3.6 8.2 1.7 24 60 1882 1918 1881 1922 0.91
43 43 3 3.2e+03 -1.3 0.7 29 60 1991 2022 1980 2026 0.64

Sequence Information

Coding Sequence
ATGGAGGTGTGTCGATGCGGATGCGCTTCATCGACGTCCCTAGAATCGTTCAATCCGCCGAACGAGCCATGCTGTTGCTGCAGTTACAATCCCTTCAGCGACAATTCCAAAGAATCAGAGATCTATGATCTCTCGTTCGCCCTGAGGAAGCTCGGCGTAATGAAGTGTCAGATGAAGAAGTGGCGAATGGAACGACTTCAGCTCGAGAGCGAAAATAGGTCGTTGAAACAAACCCTGCAGTCATTCGGTGTAAATGCGGATGAGATATTGAAGCCTGATCCACTGCTAGTGCACTCCAGAGAAGAAATCGAAAGGCTGCAAAATGCAAACGCGAGTCTCGAAGATAAAGTGAGGGATCTGGAGGAAATTCTCGCTGAACGAGATTGCTGCGACGATCCTGACGCCACGATTCACTTCCTCAGAGGGAAGATGAGACATCTCAGGGAGCGTTTCGCGCTTGAAAAGAAAGACATGCGGGACATAATATCGAATTTGAAGTTGAAGCTGGCGCAAACCGAGGAGGACGTCAGTTGTCCGGCGATATATCGTCTTCGGGCGAAGTTGCGTGACTTGATGAAGGGTCAAACGAACCTTGGCGAAAACGATCATCTCCTTGCCGAGATAGACGAGCTGCGACGCGGGTTGGGTGGCCGCGAGAAAAGGGAAGTAGTAGAGACGAGGGCAATGGATACGGCCACTGTGCCTGAATACGCTGACGTCTCGGATCTATtgaaaaaattgaacgacTGCGAAGACGTTGTGGGCGACCTGGAGAGGAAGTTAGAAGAGAAGGACGACAAGATCGCCGACCTCCAGAACAGACTGAAGCAGTTCGACCTACTGCAGGCCGAACTCGAAGATCTTAAACGTAAACTGGGAGAACGCGAATCGGAGAATGGCAAGCTCCAGGAAGAATACGACGATAGAACTTCCATCCTGCAGTTGCAACACGACGGCAGAACTTCCATGTTGCAGATACGAATGGACGAATTGTTGGCCGAAATAATGGGTCTGAATCAGACCATCGGCACCTTGAACAAAAACATCGACGATTTGAATCAAGAGATCGCCGATCTGAAGTCGCGAATAGCCGATCTGGAGGCCCAACTGGAGCAGTGTAACAAGCGAAACGCCGAACTGGAAGAGTGTTGCCAGGACAGGGACTCCGTTTTACAGAGGAACGGGGCCCTGGAGCAAGAGCTCGCGTCAGCAATGGCCAGGATAAAAGACCTCGAGGCACAGGTGGACGCTTTGAAGGCGCAGATCGAGGCAGAGGTGGACGCTTTGAAGGCGCAGATCGAGGCACTGACCGGGCATCTGGAGAACGAGAAGGCGGCGAGGAGCTCGCTGGAAAAAGAACTGCAGGATAGCCGGGGTGAGATCGAAAGTCTGCGCAAGGAGAATTCGGATCTGAAGGGCCAGATAGACGCAGCGAACGCTGAGAACGATGAACTTCGCGACGCGTTAAGGAAATTGTCCGAGGAGGACGAACAGCTGAAGGCGCAGATCGAGCAACTGCAGAACAATATCAAGGACCTGGAGGGCCTGAATAACGCCCTGAGGGGCGACTATGACAACATGAAGCAAGCGCTGGATAATTTGGAGGCGGAGATCAACAGACTGGAGGAGGAATTAAGGAGGGCAGAGGATGAACGGCAGGCACTGTTGAATGAGATCGACAGCACTAGAAAGCAGCTGGAGGAGGCGTTGGCAGTGAACGAGACGCTGAAGTCTGAACTGGACGAGGCCGGCGAGCAACTCGGCAAACTGAGAGCGGAGAAGGACGAGCTGCAGAAGAGTGTCGACGCGCTCCGGCACGAGAACGATTCGTTCAAACGGGATATGAGGACGTTAAGGGAGGATCTCGATGACGCGCGGAGGCAGGTGGACGAGCTGAAAGCATCTGGCGACGCGCTGAGAGCCGCGGACGACGATAAGAAGTCTGAAGTCGAGAGGCTGACACAGGAGGTGGAGGGTTTGATGTCTGAAAGGGATCGCCTGACGAGCGAGAACGCCGATCTGAAAGCCGAAAACGCGGAACTGGGGCGCAAACTGGATGATGCCGTGAAGGAGTTGGAACAATTGAGAGCGGAAAATGCTGATTTACGAGCGGAGGTGGATCGTTTGGGGCAGGAAttggagagagcgagaggCGAGCTGGATCAATTGAAATCGGAAATGGCTGCCGTGAAGGACGCGCTCGATAAGTGCGTCGACGAGTTGGAAAAGTTGAAGGGTGAAAACAGTGATCTTAAATCGCTGAACGAGGCCTTGAAGACGGAGATTCAAGGGGTTAGGGGCGAAGGTGATGGTTTGGCTGTCGAGTTGAACTCTCTGAAGGGCGAGAGGGATCAATTGAAGGACCAACTGAGCGAGTGTAACGCGGAGAACGAGAGATTGAAAACTGAGAAGAGCCAGTTGGAGGGCGAAATCGATAGCTTGAAGACGGAGAAGGGTCAATTGGAGGCCGAAGTCGAGAAGCTGAAGGGAGAGAATGCCGGGCTGAAGGATGAGATTGAGAAGCTGCGAGGACAATTGCAATCATTGAACAATGAAATGAATGCATTGAAGACGCAGCTTGAGGAGGCCGGGAACAAAATTCAGTCACTGGAGTCATCGCTCTCGAGTCTTCAAGGTGAAAATgagaaatttcgaaatgatttgaGTTCGCTGCAGGACGAGATAAACAATTTGACAGCACAATTGGAGAAGGAAGCCGCTGATAATCTAAAGTTGAAGAACGATCTCAAGGTGCTCGAGGACCAGGTGCAAGATCTGAGCAAGCAGGTGGAAAATGCCAGGGCAGAAAACGAGGCCTTGAAGAAGGAGATTCAAGGGGTTAGAGGTGAAGGTGACGGTTTGGCGGCCGAATTGAACGCTCTGAGGGATGAAAAGTCCGCTttgacaggcgagagggatcGATTGAAGGAACAACTAAGCGGATGCAACGCGGAAATCGAGAGATTGAAAACAGAGAAGGACCAATTGGAAGCCGAAGTCGACAGGCTGAAGGGAGAGTTAAATACCTCCAAAGAAGAAAATGGCAGGCTGAAGGATGAAATGGAAAAACTGCGAGGACAATTGCAGTCATTGAACGATGAAATGAATGCATTGAAGGCGCAGCTTGAGGAGGCCAGagacaaaattcaatcacTGGAGTCAACGGTCTCGAGCCTTCAAGGCGAGAATGAGAAGTTTCGAAATGATTTGAGCTCTCTGCAGGGCGAGGTGCAAAATTTGAGAGCGCAATTGGAGAAggaagctggcgataatctgAAGTTGAGAAACGATCTGAAGCAACTGGAGGACCAAGTGCAAGATTTGGCCAAGCAGTTGCAAAATTCCAAGGCGGAAAACGAGGCCCTGAAGTCGGAGATTCAAGGGGTCAGAGGCGAAGGTGACGGTTTGGCGGTCGAATTGAACGCTCTAAGGGACGAAAATTCCGTTCTGAAGGATGAGAGAGATCGATTGAGGAATCAATTTAACGAGTCCAACGCGGAGAACGAGAGACTGAGAGCTGAGAAGGGCCAATTGGAGGCTGAAATCGAGAGACTTAAGGGAGAATTAAATACTTGCAAAGAAGAAAATGCCAGGCTGAAAGACGAAATTGAACAACTGCGAGGACAGTTGCAGTCATTGAACGATGAAGTTAATGGATTGAAGGCGCAGCTTGAGGAGGCCAGGAACAAAATTCAGACACTCGAGTCAACGGTCTCGAGTCTTCAAGGTGAAAATgagaaatttcgaaatgatttgaGCTCTCTGCAGACCGAGATAAAGAATTCGAGAGCGCAATCGGACAGGGACGCCGATGAAAATCTGAAGCTGAAGAATGATCTGAAGATGCTGGAGGACCAGGTGAAGGATTTGATCAAGAAATTGGAAAATGCCAGGGCAGAAAACGACGCCTTGAAGGAGGAAAATCAAAATTTGAAAGCACAGGTGAACGATATGAATCAAGAATTAGCCGGTTTAAGAACAGAGTGCGCGGACCTGAAGTCAGAGATCGCGGATTTGAAGAAGTCAATCGAAGAATTGAAAGCAAAGATTGCTAAACTGGAGGAAGATTTGGAGTACTGGAAATTGGAGAACTGTAAATTTCAGATGGAGGTGGACAAAGTGAAAGCTGAACTTCAGAAAGCTCTGAAGGACTTGAGTGAAGCTCAGTCTCTGAAACAGCAGCTAGAGGCAGAGGTGAATCGTCTCAAGGGCGAGAAGGCCGAGCTTGAGAAAAAACTTGCAGACGCAATGACACAGCTCGAGCAACTGAAGAAAGCACTCGAAGCGGAAAAATCGGCGAAAGATAAGGCCGAGTCACAAGTCGCGGCCCTAAGGGCGGAACTCGATGCATTGAAAGCGGAACTAGAAAAGCTGAGGGCCGAGAACAGCAAGTGCAAAGGTGAGCTGGACGATCTTGCAAGACAGCTGTCAGCCACGAAGAACGAACTTGACGCGTGCAAACAAGAGTTGTTCGCATTGAGGGAGGCAAACAACGCTCTGAAGTCGGATTTCGACGCGTTGAAGGGCATAAAGGACGAGCACAATAAATTAAAGGCGGACGCGGATGCAATGAAAGCGGAGAACGCGAATCTTCAACAAGAGAGACAAAGGCTGCAAGACGAGTACGCAAAGTTGAAAGCGGAGGCTGATGGTCAAAAGCTGGAACTGGACAAGCTGAGATCGGACTTGGCTTCGGAGAGAGCAGCCGCAGAAAAGCTTAGAACAGATCTGCAGAACCGGCAAGCTGAAAACGCCAAGCTGCAGTCAGAACTGAACGAACTGAAAGCTGAATTGGACAGATTGAAAAACGAGAGCGCCAGTTTGAGAAACGAATTGAACGAGCTGAAGAAAGCGCTCGAAGCCAAGATCAAGTCCTTGGAGGATCAACTATCGAACCTTCTGGCCGAGAAGGAGGAGCTGCTGAAGGAACTGGATCGTCTGCGCGACGAGGCGGCCAGTCTGAAAAACGAAGTCGAGAAACAGAAAGCAGCCAAGGAAGCAGCGTTGAGCGAACTGGCGGGTCTGAAGAAGGAGCTGGCCGACCTAGAGAATGTCTTGGCCGGCCTCAGAGCGACGCTGGACAAGGCTAACAGTGAAAACGAAGCGCTGCGCAAGGACAACGAGAAGATGAAAGCAGACCTGGCGAACGCGAGCAAAGATCTGGAAGCGCTtagaaaccagaacacgcagTTGGCGGCTGATTTAGCTGGGGCGAAGGAGAAATTAACAGACTTGGAGAAACAGTACGAAGACTTGAAGAAGGAGAACGGCGATCTGAAGTACAAAGCCGCCCGCCTCGAAGACACGTTGGCAGAGCTAGAAGCTTTGAAGAAACAATTGGACGGTGCTAATGCGGAGCTGGATAAGCTGCAGTCAGGATTGGATAGATTGAATTCGGAAAATGCAAAACTGCGGGACGATTTCAATAGGGCGACGGACGAAGCGAATAAGTTGAAGGACAGTTTGGATAAACTGCGAAACGAATACAACGGGCTGCAGGATGATTTGTCTAAGGCGCAGGATGAATTATCTAAGACACAGAATGATTTGTCTAAGCTGCAGGACGAAGCTGCGAAGCAGAAGGCACTTGGGGCATCGCAAGTAGGGGATCTGGATAAATTGCAGAAGGAAAATAACGATCTGCGGGATCAAATCGCGAAGCTGAAGAGTCAGTTGCACGATTGCCGCGAGGAGAGCGACGGGCTGCACAAGGAACTGGGAAATTTGCAGAGGAAGAATGCCAAGCTGATGGAGGCAGCAGAGAGGCCGATGGAGCCAAAGGAGACGGAAATGGACAAAGATATTCTGGATGAATGCGGCGACTTCATTAAGGCGAACGAATTACTGGGGGACAAGTTTGAAAAGCAGAACGaagGCGTTCAACGTGTTCGAGATTATATATCATATTTGGAAGGTAAAGGCGAGGAGCCCAAAATGGCTGATCAATTAGAAGAATCGCATATTGAACCGCAATTAAGGGAAGACATTGCTGACTTGTTGAAGAAATCTCAGGCGTTATCGAAATCTATTTATGAAACGGAGCAGACAATACAAAATCTTACTGATATCCTGAAAAAACCTGCGGAGGCCGTGGCTCCAGGAGCCTTCGATCCCGATTCTTGGCTGAGTGCGTTGACGTTGACGCAGTTAGCGGAGCTTCACGATAAGATTTGCCTGTTGACATCGGACATGGTGCAGCAGGATAGCAAAGCGGTGCCATGCAGGTCAGCCCCCGGGCCGGCGAATCCGCAAGCGGATTACAATGTTTTGAACAAGCGGATAGCCGCTTTGCAGAAACAGATAGCAGAGAAGCAAATGGAGGCAGCGTGGAAGATGCAGGAATTGAAGCGGTCTCTTCGAATGGAGCAGGCTAACCTAATCGAAATCTCCGACTATATGAATTTGGAGAGAAAACGAAACTTGGCCCTTCAGTTCACCATTGATGAAGCACCATAG
Protein Sequence
MEVCRCGCASSTSLESFNPPNEPCCCCSYNPFSDNSKESEIYDLSFALRKLGVMKCQMKKWRMERLQLESENRSLKQTLQSFGVNADEILKPDPLLVHSREEIERLQNANASLEDKVRDLEEILAERDCCDDPDATIHFLRGKMRHLRERFALEKKDMRDIISNLKLKLAQTEEDVSCPAIYRLRAKLRDLMKGQTNLGENDHLLAEIDELRRGLGGREKREVVETRAMDTATVPEYADVSDLLKKLNDCEDVVGDLERKLEEKDDKIADLQNRLKQFDLLQAELEDLKRKLGERESENGKLQEEYDDRTSILQLQHDGRTSMLQIRMDELLAEIMGLNQTIGTLNKNIDDLNQEIADLKSRIADLEAQLEQCNKRNAELEECCQDRDSVLQRNGALEQELASAMARIKDLEAQVDALKAQIEAEVDALKAQIEALTGHLENEKAARSSLEKELQDSRGEIESLRKENSDLKGQIDAANAENDELRDALRKLSEEDEQLKAQIEQLQNNIKDLEGLNNALRGDYDNMKQALDNLEAEINRLEEELRRAEDERQALLNEIDSTRKQLEEALAVNETLKSELDEAGEQLGKLRAEKDELQKSVDALRHENDSFKRDMRTLREDLDDARRQVDELKASGDALRAADDDKKSEVERLTQEVEGLMSERDRLTSENADLKAENAELGRKLDDAVKELEQLRAENADLRAEVDRLGQELERARGELDQLKSEMAAVKDALDKCVDELEKLKGENSDLKSLNEALKTEIQGVRGEGDGLAVELNSLKGERDQLKDQLSECNAENERLKTEKSQLEGEIDSLKTEKGQLEAEVEKLKGENAGLKDEIEKLRGQLQSLNNEMNALKTQLEEAGNKIQSLESSLSSLQGENEKFRNDLSSLQDEINNLTAQLEKEAADNLKLKNDLKVLEDQVQDLSKQVENARAENEALKKEIQGVRGEGDGLAAELNALRDEKSALTGERDRLKEQLSGCNAEIERLKTEKDQLEAEVDRLKGELNTSKEENGRLKDEMEKLRGQLQSLNDEMNALKAQLEEARDKIQSLESTVSSLQGENEKFRNDLSSLQGEVQNLRAQLEKEAGDNLKLRNDLKQLEDQVQDLAKQLQNSKAENEALKSEIQGVRGEGDGLAVELNALRDENSVLKDERDRLRNQFNESNAENERLRAEKGQLEAEIERLKGELNTCKEENARLKDEIEQLRGQLQSLNDEVNGLKAQLEEARNKIQTLESTVSSLQGENEKFRNDLSSLQTEIKNSRAQSDRDADENLKLKNDLKMLEDQVKDLIKKLENARAENDALKEENQNLKAQVNDMNQELAGLRTECADLKSEIADLKKSIEELKAKIAKLEEDLEYWKLENCKFQMEVDKVKAELQKALKDLSEAQSLKQQLEAEVNRLKGEKAELEKKLADAMTQLEQLKKALEAEKSAKDKAESQVAALRAELDALKAELEKLRAENSKCKGELDDLARQLSATKNELDACKQELFALREANNALKSDFDALKGIKDEHNKLKADADAMKAENANLQQERQRLQDEYAKLKAEADGQKLELDKLRSDLASERAAAEKLRTDLQNRQAENAKLQSELNELKAELDRLKNESASLRNELNELKKALEAKIKSLEDQLSNLLAEKEELLKELDRLRDEAASLKNEVEKQKAAKEAALSELAGLKKELADLENVLAGLRATLDKANSENEALRKDNEKMKADLANASKDLEALRNQNTQLAADLAGAKEKLTDLEKQYEDLKKENGDLKYKAARLEDTLAELEALKKQLDGANAELDKLQSGLDRLNSENAKLRDDFNRATDEANKLKDSLDKLRNEYNGLQDDLSKAQDELSKTQNDLSKLQDEAAKQKALGASQVGDLDKLQKENNDLRDQIAKLKSQLHDCREESDGLHKELGNLQRKNAKLMEAAERPMEPKETEMDKDILDECGDFIKANELLGDKFEKQNEGVQRVRDYISYLEGKGEEPKMADQLEESHIEPQLREDIADLLKKSQALSKSIYETEQTIQNLTDILKKPAEAVAPGAFDPDSWLSALTLTQLAELHDKICLLTSDMVQQDSKAVPCRSAPGPANPQADYNVLNKRIAALQKQIAEKQMEAAWKMQELKRSLRMEQANLIEISDYMNLERKRNLALQFTIDEAP

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01067355;
90% Identity
iTF_01065989;
80% Identity
-