Basic Information

Gene Symbol
-
Assembly
GCA_951802735.1
Location
OX637919.1:10012351-10026136[+]
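
The Location value above can be split into its parts programmatically. The following is a minimal sketch, not part of the record: the helper name parse_location is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re

# Pattern for strings shaped like "SEQID:START-END[STRAND]".
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text):
    """Split a location string into sequence ID, coordinates and strand."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start = int(match["start"])
    end = int(match["end"])
    return {
        "seqid": match["seqid"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Assumption: 1-based, inclusive coordinates.
        "span": end - start + 1,
    }


location = parse_location("OX637919.1:10012351-10026136[+]")
print(location)
```

Under the inclusive-coordinate assumption the gene spans 13786 bases on the forward strand of OX637919.1.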

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA-binding regions adjacent to α-helical regions that fold together as coiled coils.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 42 0.13 1.3e+02 3.1 0.1 25 50 56 81 52 87 0.61
2 42 0.27 2.6e+02 2.1 0.6 39 62 101 124 97 127 0.65
3 42 0.69 6.6e+02 0.8 0.2 40 56 200 216 183 220 0.87
4 42 0.5 4.8e+02 1.3 2.7 32 62 276 306 264 309 0.74
5 42 0.032 30 5.1 0.0 27 63 321 357 319 359 0.90
6 42 0.64 6.1e+02 0.9 3.5 24 46 360 382 359 403 0.70
7 42 1.9 1.8e+03 -0.6 8.4 24 49 433 458 387 474 0.60
8 42 0.0064 6.1 7.3 5.3 25 63 469 507 465 509 0.88
9 42 0.013 13 6.3 7.3 25 57 525 557 514 565 0.58
10 42 0.00088 0.84 10.1 1.5 26 59 575 608 559 614 0.63
11 42 4.3e-05 0.041 14.3 1.2 23 63 635 675 633 677 0.89
12 42 0.0013 1.2 9.6 4.2 32 63 672 703 668 719 0.65
13 42 0.0054 5.2 7.5 0.2 22 63 725 766 716 768 0.75
14 42 6.6e-05 0.063 13.7 3.8 26 60 771 805 768 807 0.92
15 42 9.1e-06 0.0087 16.4 2.9 24 64 804 844 803 845 0.90
16 42 0.0092 8.8 6.8 2.4 26 64 855 893 854 893 0.95
17 42 3.5e-05 0.033 14.6 0.4 25 65 910 950 908 950 0.92
18 42 0.00044 0.42 11.0 7.3 20 61 961 1002 954 1006 0.88
19 42 0.091 87 3.6 3.7 36 63 1005 1032 1000 1034 0.78
20 42 2.3e-05 0.022 15.1 1.3 24 64 1035 1075 1033 1076 0.96
21 42 0.0027 2.5 8.5 3.5 29 64 1089 1131 1082 1132 0.68
22 42 0.019 18 5.8 0.8 28 62 1123 1157 1119 1160 0.81
23 42 0.00017 0.16 12.4 4.4 24 63 1140 1179 1138 1181 0.93
24 42 0.00017 0.16 12.4 3.7 27 62 1171 1206 1170 1215 0.92
25 42 0.0042 4 7.9 0.3 24 59 1217 1252 1214 1257 0.88
26 42 0.0087 8.4 6.9 7.9 23 62 1272 1311 1256 1314 0.84
27 42 0.00048 0.46 10.9 4.8 23 65 1293 1335 1291 1342 0.70
28 42 0.044 42 4.6 1.5 24 64 1329 1369 1328 1370 0.92
29 42 0.036 35 4.9 8.6 27 63 1381 1417 1368 1419 0.81
30 42 0.00041 0.4 11.1 5.7 18 63 1421 1466 1417 1468 0.83
31 42 0.75 7.2e+02 0.7 0.2 30 59 1468 1497 1460 1503 0.65
32 42 0.0001 0.096 13.1 3.3 24 61 1500 1537 1499 1540 0.91
33 42 0.00047 0.45 11.0 4.5 22 62 1512 1552 1511 1554 0.90
34 42 0.0031 3 8.3 11.0 23 63 1569 1609 1555 1611 0.84
35 42 0.0005 0.48 10.9 9.7 24 64 1609 1649 1605 1650 0.89
36 42 0.64 6.1e+02 0.9 1.7 36 62 1656 1682 1643 1685 0.66
37 42 0.19 1.8e+02 2.6 2.3 26 65 1695 1734 1690 1734 0.86
38 42 0.0011 1 9.8 4.0 24 63 1728 1767 1727 1770 0.89
39 42 0.064 61 4.1 0.5 28 64 1777 1813 1772 1814 0.86
40 42 0.014 14 6.2 3.6 34 58 1860 1884 1826 1889 0.64
41 42 0.0088 8.4 6.9 0.8 24 59 1871 1906 1869 1911 0.90
42 42 3.1 3e+03 -1.3 0.7 29 60 1980 2011 1969 2015 0.64
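
The per-domain rows above are whitespace-separated with 13 fields each, so they can be read into typed records and filtered. The following is a minimal sketch, assuming that column layout; the names DomainHit and parse_hit are hypothetical, the three sample rows are copied from the table, and the i-Evalue cutoff of 0.1 is an arbitrary illustration, not a threshold used by the database.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int       # "#": domain number
    total: int       # "of": number of domains reported
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line):
    """Convert one whitespace-separated table row into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]),
        float(fields[4]), float(fields[5]),
        *(int(value) for value in fields[6:12]),
        float(fields[12]),
    )


# Three rows copied verbatim from the table above.
ROWS = [
    "11 42 4.3e-05 0.041 14.3 1.2 23 63 635 675 633 677 0.89",
    "15 42 9.1e-06 0.0087 16.4 2.9 24 64 804 844 803 845 0.90",
    "42 42 3.1 3e+03 -1.3 0.7 29 60 1980 2011 1969 2015 0.64",
]

hits = [parse_hit(row) for row in ROWS]

# Hypothetical cutoff, chosen only to demonstrate the filter.
I_EVALUE_CUTOFF = 0.1
selected = [hit for hit in hits if hit.i_evalue < I_EVALUE_CUTOFF]

for hit in selected:
    aligned_length = hit.ali_to - hit.ali_from + 1
    print(hit.index, hit.i_evalue, hit.score, aligned_length)
```

With this cutoff, domains 11 and 15 are kept and domain 42 is dropped. A different threshold, or filtering on score instead, may suit other purposes better.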

Sequence Information

Coding Sequence
ATGGAGGTGTGTCGATGCGGATGCGCTTCATCGACGTCCCTAGAATCGTTCAATCCGCCGAACGAGCCATGCTGTTGCTGCAGTTACAATCCCTTCAGCGACAATTCCAAAGAATCAGAGATCTATGATCTCTCGTTCGCCCTGAGGAAGCTCGGCGTAATGAAGTGTCAGATGAAGAAGTGGCGAATGGAACGACTTCAGCTCGAGAGCGAAAATAGGTCTTTGAAACAAACCCTGCAGTCATTCGGTGTAAATGCGGATGAGATATTGAAGCCTGATCCACTGCTAGTGCACTCCAGAGAAGAAATCGAAAGGCTGCAAAATGCAAACGCGAGTCTCGAAGATAAAGTGAGGGATCTGGAGGAAATTCTCGCTGAACGAGATTGCTGCGACGATCCTGACGCCACGATTCACTTCCTCAGAGAGAAGATGAGACATCTCAGGGAGCGTTTCGCGCTTGAAAAGAAAGACATGCGGGACATAATATCGAATTTGAAGTTGAAGCTGGCGCAAACCGAGGAGGACGTCAGTTGTCCGGCGATATATCGTCTTCGGGCGAAGTTGCGTGACTTGATGAAGGGTCAACCGAACCTTGGCGAAAACGATCATCTCCTTGCCGAGATAGACGAGCTGCGACGCGGGTTGGCTGGCCGCGAGAAAAGGGAAGTAGTAGAGACGAGGGCAATGGATACGGCCACTGTGCCTGAATACGCTGACGTCTCGGATCtattgaaaaaattgaacgactgCGAAGACGTTGTGGGCGACCTGGAGATGAAGTTAGAAGAGAAGGACGACAAGATCGCCGACCTTCAGAACAGACTGAAGCAGTTCGACCTACTGCAGGCCGAACTCGAAGATCTTAAACGTAAACTGGGAGAACGCGAATCGGAGAATGGCAAGCTCAAGGAAGAATACGACGATAGAACTTCCATCCTGCAGTTGCAACACGACGGCAGAACTTCCATGTTGCAGATACAAATGGACGAATTGTTGGCCGAAATAATGGGTCTGAATCAGACCATCGGCACCTTGAACAAAAACATCGACGATTTGAATCAAGAGATCGCCGATCTGAAGTCGCGAATAGCCGATCTGGAGGCCCAACTGGAGCAGTGTAACAAGCGAAACGCCGAACTGGAAGAGTGTTGCCAGGACAGGGACTCCGTTTTAGAGAGGAACGGGGCCCTGGAGCAAGAGCTCGCGTCAGCAATGGCCAGGATAAAAGACCTCGAGGCACAGGTGGACGCTCTGAAGGCGCAGATCGAGGCACTGACCGGGCAACTGGAGAACGAGAAGGCGGCGAGGAGCTCGCTGGAAAAGGAACTGCAGGATAGCCGGGGTGAGATCGAAAGTCTGCGCAAGGAGAATTCGGATCTGAAGGGCCAGATAGACGCAGCGAACGCTGAGAACGATGAACTTCGCGACGCGTTAAGGAAATTGTCCGAGGAGGACGAACAGCTGAAGGCGCAGATCGAGCAACTGCAGAACAATATCAAGGACCTGGAGGGCCTGAATAACGCCCTGAGGGGCGACTACGATAACATGAAGCAAGCGCTGGATAATTTGGAGGCGGAGATCAACAGACTGGAGGAGGAATTAAGGAGGGCAGAGGATGAACGGCAGGCATTGTTGAATGAGATCGACAGCACTAGAAAGCAGCTGGAGGAGGCGTTGGCAGTGAACGGGACGCTGAAGGCTGAACTGGACGAGGCCGGCGAGCAACTCGACAAACTGAGAGCGGAGAAGGACGAGCTGCAGAAGAGTGTCGACGCGCTCCGGCACGAGAACGATTCGTTCAAACGTGATATGAGGACGTTAAGGGAGGATCTCGATGACGCGCGGAGGCAGGTGGACGAGCTGAAAGCATCTGGCGACGCGCTGAGAGCCGCGGACGACGATAAGAAGTCTGAACTCGAGAGGCTGACACAGGAGGTGGAGGGTTTGATGTCTGAAAGGGATCGCCTGACGAGCGAGAACGCCGATCTGAAAGCCGGAAA
CGCGGAACTGGGGCGAAAACTGGATGATGCCGTGAAGGAGTTGGAACAATTGAGAGCGGAAAATGCTGATTTACGCGCGGAGGTGGATCGTTTGGGGCAGGAATTGGAGAGGGCGAGAGGCGAGCTGGATCAATTGAAATCGGAAATGGCTGCCGTGAAGGACGCGCTCGATAAGTGCGTCGACGAGTTGGAAAAGTTGAAGGGTGAAAACAGTGATCTTAAATCGCTGAACGAGGCCTTAAAGACGGAGATTCAAGGGGTTAGGGGCGAAGGTGATGGTTTGGCGGTCGAGTTGAACTCTCTGAAGGGCGAGAGGGATCAATTGAAGGACCAACTGAGCGAGTGTAACGCGGAGAACGAGAGATTGAAAACTGAGAAGAGCCAGTTGGAGGGCGAAATCGATAGCTTGAAGACGGCGAAGGGTCAATTGGAGGCCGAAGTCGAGAAGCTGAAGGGAGAGAATGCCGGGCTGAAGGATGAGATTGAGAAGCTGCGGGGACAATTGCAATCGTTGAACGATGAAATGAATGCGTTGAAGACGCAGCTTGAGGAGGCCGGGAACAAAATTCAGTCACTGGAGTTATCGCTCTCGAGTCTTCAAGGTGAAAATgagaaatttcgaaatgatttGAGCTCGCTGCAGGACGAGATAAACAATTTGAGAGCACAATTGGAGAAGGAAGCCGCTGATAATCTAAAGTTGAGGAACGATCTCAAGGTGCTCGATGACCAGGTGCAAGATCTGACCAAGCAGGTGGAAAATGCCAGGGCAGAAAACGAGGCCTTGAAGTCGGAGATTCAAGGGGTTAGAGGTGAAGGTGACGGTTTGGCGGCCGAATTGAACGCTCTGAGGGATGAAAAGTCCGCTTTGACGGGCGAGAGGGATCGATTGAAGGAACAACTAAGCGAATGCAACGCGGAAATCGAGAGATTGAAAACAGAGAAGGACCAATTGGAAGCCGAAGTCGACAGGCTGAAGGGAGAGTTAAATACCTCCAAAGAAGAAAATGGCAGGCTGAAGGATGAAATGGAAAAACTGCGAGGACAATTGCAGTCATTGAACGATGAAATGAATGCATTGAAGGCGCAGCTTGAGGAGGCCAGAGACAAAATTCAGTCACTGGAGTCAACGGTCTCGAGCCTTCAAGGCGAGAATGAGAAGTTTCGAAATGATTTGAGCTCTCTGCAGGGCGAGGTGCAAAATTTGAGAGCGCAATTGGAGAAGGAAGCTGGCGATAATCTGAAGTTGAGAAACGATCTGAAGCAACTGGAGGACCAAGTGCAAGATTTGGCCAAGCAGTTGCAGAATTCCAGGGCGGAAAACGAGGCCCTGAAGTCGGAGATTCAAGGGGTCAGAGGCGAAGGTGACGGTTTGGCGGTCGAATTGAACGCTCTAAGGGACGAAAATTCCGTTCTGAAGGATGAGAGAGATCGATTGAGGAATCAATTTAACGAGTCCAACGCGGAGAACGAGAGACTGAGAGCTGAGAAGGGGCAATTGGAGGCTGAAATCGAGAGACTGAAGGGAGAATTAAATACttgcaaagaagaaaatgcCAGGCTGAAAGACGAAATTGAACAACTGCGAGGACAGTTGCAGTCATTGAACGATGAAATTAATGGATTGAAGGCGCAGCTTGAGGATGCCAGGAACAAAATTCAGACACTCGAGTCAACGGTCTCGAGTCTTCAAGGTGAAAATgagaaatttcgaaatgatttGAGCTCTCTGCAGACCGAGATAAAGAATTCGAGAGCGCAATCGGACAGGGACGCCGATGAAAATCTGAAGCTGAAGAATGATCTGAAGATGCTGGAGGACCAGGTGAAGGATTTGAtcaagaaattggaaaatgcCAGGGCAGAAAACGACGCCTTGAAGGAGGAAAATCAAAATTTGAAAGCACAGGTGAATGATATGAATCAAGAATTAGCCGGTTTAAGAACAGAATGCGCGGACCTGAAGTCAGAGATCGCGGATTTGAAGAAGTCAATCGAAG
AATTGAAAGCAAAGATTGCTAAACTGGAGGAAGATTTGGAGTACTGGAAATTGGAGAACTGTAAATTTCAGATGGAGGTGGACAAAGTGAAAGCTGAACTTCAGAAAGCTCTGAAGGACTTGAGTGAAGCCCAGTCTCTGAAGCAGCAGCTAGAGGCAGAGGTGAGTCGTCTCAAGGGCGAGAAGGCCGAGCTTGAGAAAAAACTTGCGGACGCAATGACACAGCTCGAGCAACTGAAGAAAGCACTCGAAGCGGAAAAATCGTCGAAAGATAAGGCCGAGTCACAAGTCGCGGCCCTAAGGGCGGAACTCGATGCATTGAAAGCGGAACTAGAAAAGCTGAGGGCCGAGAACAGCAAGTGCAAAGGTGAGCTGGACGATCTTGCAAGACAGCTGTCAGCCACGAAGAACGAACTTGACGCGTGCAAACAAGAGCTGTTCGCATTGAGGGAGGCAAACAACGCTCTGAAGTCGGATTTCGACGCGTTGAAGGGCATAAAGGACGAGCACAATAAATTAAAGGCGGACGCGGATGCAATGAAAGCGGAGAACGCGAATCTTCAACAAGAGAGACAAAGGCTGCAAGACGAGTACGCCAAGTTGAAAGCGGAGGCTGATGGTCAAAAGCTGGAACTGGACAAGCTGAGATCGGACTTGGCCTCGGAGAGAGCAGCCGCAGAAAAGCTTAGAACAGATCTGCAGAACCGGCAAGCTGAAAACGCCAAGCTGCAGTCAGAACTGAACGAACTGAAAGCTGAATTGGACAGATTGAAAAACGAGAGCGCCGGTTTGAGAAACGAATTGAACGAGCTGAAGAAAGCGCTCGAAGCCAAGATCAAGTCCTTGGAGGATCAACTATCGAACCTTCTGGCCGAGAAGGAGGAGCTGCTGAAGGAACTGGATCGTCTGCGCGACGAGGCGGCCAGTCTGAAAAACGAAGTCGAGAAACAGAAAGCAGCCAAGGAAGCAGCGTTGAGCGAACTGGCGGGTCTGAAGAAGGAGCTGGCCGACCTAGAGAATGTCTTGGCCGGCCTCAGGGCGACGCTGGACAAGGCTAACAGTGAAAATGAAGCGCTGCGCAAGGACAACGAGAAGATGAAAGCAGACCTGGCGAACGCGAGCAAAGATCTGGAAGCGCTTAGAAACCAGAACTCGCAGTTGGCGGCTGATTTAGCTGGGGCGAAGGAGAAATTAACAGACTTGGAGAAACAGTACGAAGACTTGAAGAAGGAGAACGGCGATCTGAAGTACAAAGCCGCCCGCCTCGATGACACGTTGGCAGAGCTAGAAGCTTTGAAGAAACAATTGGACGGTGCTAATGCGGAGCTGGATAAGCTGCAGTCAGGATTGGATAGATTGAATACGGAAAATGCAAAACTGCGGGACGATTTCAATAGGGCGACGGACGAAGCGAATAAGTTGAAGGACAGTTTGGATAAACTGCGAAACGAATACAACGGGCTGCAGGATGATTTGTCTAAGGCGCAGGATGAATTATCTAAGACACAGAATGATTTGTCTAAGCTGCAGGACGAAGCTGCGAAGCAGAAGGCACTTGGGGCATCGCAAGTAGGGGATCTGGATAAATTGCAGAAGGAAAATAACGATCTGCGGGATCAAATCGCGAAGCTGAAGAGTCAGTTGCACGATTGCCGCGAGGAGAGCGACGGGCTGCACAAGGAACTGGGAAATTTGCAGAGGAAGAATGTCAAGCTGATGGAGGCAGCAGAGAGGCCGATGGAGCCAAAGGAGACGGAAATGGACAAAGATATTCTGGATGAATGCGGCGACTTCATTAAGGCGAACGAATTACTGGGGGACAAGTTTGAAAAGCAGAACgaagGCGTTCAACGTGTTCGAGattatatatcatatttggaaGGTAAAGGCGAGGAGCCCAAAATGGCTGATCAATTAGAAGAATCGCATATTGAACCGCAATTAAGGGAAGACATTGCTGACTTGTTGAAGAAATCTCAGGCGTTATCGAAATCTATTTAT
GAAACGGAGCAGACAATACAAAATCTTACTGATATCCTGAAAAAACCTGCGGAGGCCGTGGCTCCAGGAGCCTTCGATCCCGATTCTTGGCTGAGTGCGTTGACGTTGACGCAGTTAGCGGAGCTTCACGATAAGATTTGCCTGTTGACATCGGACATGGTGCAGCAGGATAGCAAAGCGGTGCCATGCAGGTCAGCCCCCGGGCCGGCGAATTCGCAAGCGGATTACAATGTTTTGAACAAGCGGATAGCCGCTTTGCAGAAACAGATAGCAGAGAAGCAAATGGAGGCAGCGTGGAAGATGCAGGAATTGAAGCGGTCTCTTCGAATGGAGCAGGCTAACCTAATCGAAATCTCCGACTATATGAATTTGGAGAGAAAACGAAACTTGGCCCTTCAGTTCACCATTGATGAAGCACCATAG
Protein Sequence
MEVCRCGCASSTSLESFNPPNEPCCCCSYNPFSDNSKESEIYDLSFALRKLGVMKCQMKKWRMERLQLESENRSLKQTLQSFGVNADEILKPDPLLVHSREEIERLQNANASLEDKVRDLEEILAERDCCDDPDATIHFLREKMRHLRERFALEKKDMRDIISNLKLKLAQTEEDVSCPAIYRLRAKLRDLMKGQPNLGENDHLLAEIDELRRGLAGREKREVVETRAMDTATVPEYADVSDLLKKLNDCEDVVGDLEMKLEEKDDKIADLQNRLKQFDLLQAELEDLKRKLGERESENGKLKEEYDDRTSILQLQHDGRTSMLQIQMDELLAEIMGLNQTIGTLNKNIDDLNQEIADLKSRIADLEAQLEQCNKRNAELEECCQDRDSVLERNGALEQELASAMARIKDLEAQVDALKAQIEALTGQLENEKAARSSLEKELQDSRGEIESLRKENSDLKGQIDAANAENDELRDALRKLSEEDEQLKAQIEQLQNNIKDLEGLNNALRGDYDNMKQALDNLEAEINRLEEELRRAEDERQALLNEIDSTRKQLEEALAVNGTLKAELDEAGEQLDKLRAEKDELQKSVDALRHENDSFKRDMRTLREDLDDARRQVDELKASGDALRAADDDKKSELERLTQEVEGLMSERDRLTSENADLKAGNAELGRKLDDAVKELEQLRAENADLRAEVDRLGQELERARGELDQLKSEMAAVKDALDKCVDELEKLKGENSDLKSLNEALKTEIQGVRGEGDGLAVELNSLKGERDQLKDQLSECNAENERLKTEKSQLEGEIDSLKTAKGQLEAEVEKLKGENAGLKDEIEKLRGQLQSLNDEMNALKTQLEEAGNKIQSLELSLSSLQGENEKFRNDLSSLQDEINNLRAQLEKEAADNLKLRNDLKVLDDQVQDLTKQVENARAENEALKSEIQGVRGEGDGLAAELNALRDEKSALTGERDRLKEQLSECNAEIERLKTEKDQLEAEVDRLKGELNTSKEENGRLKDEMEKLRGQLQSLNDEMNALKAQLEEARDKIQSLESTVSSLQGENEKFRNDLSSLQGEVQNLRAQLEKEAGDNLKLRNDLKQLEDQVQDLAKQLQNSRAENEALKSEIQGVRGEGDGLAVELNALRDENSVLKDERDRLRNQFNESNAENERLRAEKGQLEAEIERLKGELNTCKEENARLKDEIEQLRGQLQSLNDEINGLKAQLEDARNKIQTLESTVSSLQGENEKFRNDLSSLQTEIKNSRAQSDRDADENLKLKNDLKMLEDQVKDLIKKLENARAENDALKEENQNLKAQVNDMNQELAGLRTECADLKSEIADLKKSIEELKAKIAKLEEDLEYWKLENCKFQMEVDKVKAELQKALKDLSEAQSLKQQLEAEVSRLKGEKAELEKKLADAMTQLEQLKKALEAEKSSKDKAESQVAALRAELDALKAELEKLRAENSKCKGELDDLARQLSATKNELDACKQELFALREANNALKSDFDALKGIKDEHNKLKADADAMKAENANLQQERQRLQDEYAKLKAEADGQKLELDKLRSDLASERAAAEKLRTDLQNRQAENAKLQSELNELKAELDRLKNESAGLRNELNELKKALEAKIKSLEDQLSNLLAEKEELLKELDRLRDEAASLKNEVEKQKAAKEAALSELAGLKKELADLENVLAGLRATLDKANSENEALRKDNEKMKADLANASKDLEALRNQNSQLAADLAGAKEKLTDLEKQYEDLKKENGDLKYKAARLDDTLAELEALKKQLDGANAELDKLQSGLDRLNTENAKLRDDFNRATDEANKLKDSLDKLRNEYNGLQDDLSKAQDELSKTQNDLSKLQDEAAKQKALGASQVGDLDKLQKENNDLRDQIAKLKSQLHDCREESDGLHKELGNLQRKNVKLMEAAERPMEPKETEMDKDILDECGDFIKANELLGDKFEKQNEGVQRVRDYISYLEGKGEEPKMADQLEESHIEPQLREDIADLLKKSQALSKSIY
ETEQTIQNLTDILKKPAEAVAPGAFDPDSWLSALTLTQLAELHDKICLLTSDMVQQDSKAVPCRSAPGPANSQADYNVLNKRIAALQKQIAEKQMEAAWKMQELKRSLRMEQANLIEISDYMNLERKRNLALQFTIDEAP
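
The relationship between the coding sequence and the protein sequence can be spot-checked by translating codons. The following is a minimal sketch, assuming the standard genetic code (the record does not state which translation table was used); the function name translate is hypothetical, and only the first and last few codons of the coding sequence are used here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). Assumption: this is the table
# that applies to the record.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop."""
    # Lowercase bases occur in the record, so normalise case first.
    sequence = cds.upper()
    if len(sequence) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(
        # Codons containing anything other than A/C/G/T become 'X'.
        CODON_TABLE.get(sequence[i:i + 3], "X")
        for i in range(0, len(sequence), 3)
    )


# First six and last four codons of the coding sequence above.
print(translate("ATGGAGGTGTGTCGATGC"))
print(translate("GAAGCACCATAG"))
```

The first fragment translates to MEVCRC and the last to EAP followed by a stop codon, which agrees with the start and end of the protein sequence listed above.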

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01067355;
90% Identity
iTF_01065989;
80% Identity
-