Basic Information

Gene Symbol
-
Assembly
GCA_963969485.1
Location
OZ018301.1:6874060-6887419[+]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 44 0.14 88 3.1 0.1 25 50 56 81 52 87 0.61
2 44 0.46 2.9e+02 1.5 0.3 39 62 101 124 97 127 0.67
3 44 2.6 1.6e+03 -0.9 0.2 40 55 200 215 184 219 0.84
4 44 0.57 3.5e+02 1.2 2.4 35 62 279 306 264 309 0.74
5 44 0.027 17 5.4 0.1 27 64 321 358 318 360 0.89
6 44 0.35 2.2e+02 1.9 3.1 24 53 360 382 354 394 0.53
7 44 1.1 7e+02 0.2 5.5 33 60 442 469 398 473 0.65
8 44 0.0049 3 7.8 6.4 25 64 462 501 458 502 0.90
9 44 0.0076 4.7 7.2 4.3 23 64 509 550 507 558 0.75
10 44 0.016 9.9 6.2 5.5 20 60 534 574 530 575 0.92
11 44 0.0053 3.3 7.7 5.6 30 61 558 589 551 614 0.60
12 44 3.7e-07 0.00023 21.0 4.4 23 63 628 668 627 670 0.95
13 44 0.003 1.9 8.5 4.8 28 61 675 708 670 712 0.68
14 44 0.0058 3.6 7.6 0.3 22 63 718 759 710 761 0.76
15 44 7.6e-05 0.047 13.6 3.7 26 60 764 798 761 800 0.92
16 44 8.1e-06 0.005 16.7 3.6 24 65 797 838 796 838 0.92
17 44 4.2e-05 0.026 14.4 1.2 26 64 848 886 847 886 0.96
18 44 2.7e-05 0.017 15.0 0.5 22 65 900 943 898 943 0.90
19 44 0.0013 0.82 9.6 6.1 20 62 954 996 944 999 0.88
20 44 0.007 4.4 7.3 3.9 21 59 997 1035 993 1036 0.92
21 44 3.8e-05 0.023 14.6 1.7 24 64 1028 1068 1025 1068 0.95
22 44 0.00056 0.35 10.8 0.6 25 64 1085 1124 1075 1125 0.73
23 44 0.00018 0.11 12.4 4.4 24 63 1133 1172 1131 1174 0.93
24 44 0.00014 0.088 12.7 2.1 26 62 1163 1199 1162 1209 0.92
25 44 0.0043 2.7 8.0 1.0 24 59 1210 1245 1203 1251 0.87
26 44 0.027 17 5.4 3.7 24 64 1252 1292 1248 1293 0.82
27 44 0.00098 0.61 10.0 4.2 23 62 1265 1304 1263 1307 0.88
28 44 0.00049 0.31 11.0 3.7 24 64 1294 1334 1293 1343 0.89
29 44 0.041 25 4.8 1.3 24 64 1322 1362 1321 1363 0.92
30 44 0.0051 3.1 7.8 7.5 26 63 1373 1410 1368 1412 0.90
31 44 0.00048 0.3 11.0 4.3 19 58 1415 1454 1410 1461 0.55
32 44 0.61 3.8e+02 1.1 0.4 31 60 1462 1491 1458 1496 0.81
33 44 0.00015 0.091 12.7 3.7 24 61 1493 1530 1492 1533 0.91
34 44 5.7e-05 0.035 14.0 6.4 25 61 1550 1586 1548 1590 0.90
35 44 0.00015 0.094 12.6 9.4 25 63 1603 1641 1598 1643 0.92
36 44 0.51 3.2e+02 1.3 0.7 37 63 1650 1676 1638 1678 0.74
37 44 2 1.3e+03 -0.6 1.8 40 57 1681 1698 1656 1713 0.60
38 44 0.42 2.6e+02 1.6 0.6 26 65 1688 1727 1684 1727 0.85
39 44 0.0074 4.6 7.2 6.9 24 62 1721 1759 1720 1762 0.87
40 44 0.52 3.2e+02 1.3 2.0 28 62 1756 1790 1753 1793 0.79
41 44 0.49 3e+02 1.4 1.2 32 63 1795 1826 1785 1835 0.68
42 44 0.027 17 5.4 3.9 35 58 1854 1877 1819 1881 0.65
43 44 0.0036 2.2 8.2 1.6 24 60 1864 1900 1863 1904 0.91
44 44 1.8 1.1e+03 -0.4 0.4 24 60 1946 1990 1943 2004 0.62

Sequence Information

Coding Sequence
ATGGAGGTGTGCCGATGCGGATGCGCTTCATCGACGTCTCTAGAATCGTTCAATCCGCCGAACGAGCCATGCTGTTGCTGCAGTTACAATCCCTTCAGCGACAATTCCAAAGAATCAGAGATCTATGATCTCTCGTTCGCCCTGAGGAAGCTCGGCGTAATGAAGTGTCAGATGAAGAAGTGGCGAATGGAACGACTTCAGCTCGAGAGCGAAAATAGGTCTCTGAAACAAACCCTGCAGTCATTCGGTGTAAATGCGGATGAGATATTGAAGCCTGATCCACTGCTGGTGCACTCCAGAGAAGAAATCGAAAGGCTGCAAAATGCAAACGCGAGCCTCGACGATAAAGTGAGGGATCTGGAGGAAATTCTCGCTGAACGAGATTGCTGCGACGATCCGGACGCCACGATTCACTTCCTCAGAGAGAAGATGAGACATCTCAGGGAGCGTTTCGCGCTTGAAAAGAAAGAATTGCGGGACATAATATCGAATTTGAAGTTGAAGCTGGCGCAGACCGAGGAGGACGTCAGTTGCCCGGCGATATATCGCCTTCGGGCGAAGTTGCGTGACCTGATGAAGGGTGAAACGAGCCTCGGCGAAAACGACCATCTTCTTGCCGAGCTAGACGAGCTTCGACGGGGGTTGGCCGGTCGGGAGAAACGGGAAGCAGTCGAGACGAGGGCAATGGAAACGGCCACGGTGCCTGAATACGTTGACGTCTCGGATGTattgaaaaaattgaacgactgCGAAAACGTTGTGGGCGACCTGGAGAGGAAGTTAGAAGAGAAAGACGAGAAGATCGCGGACCTCCTGGACAGGCTAAAGCAGTTCGACATACTGCAGGCCGAACTCGAAGATCTTAAACGTAAACTGGCAGAACGCGAATCGGAGAATGGCAAGCTCAAGACAGAATACGACGACAGAGCGTCGAACCTGCAGTTACAACACGAGGGCAGAACCTCCATGCTGCAGATACAAATGGACGAATTGTTAGCCGAAATAATGGGTCTGAATCAAACCATCGGCACCTTGAACAAAAACATCGACGATTTGAATCAAGAGATCGCCGCGCTGAAGTCGCGAATAGCCGATCTGGAGAACCAACTGGAGGAGTGTAACAAGCGAAACGCCGAGCTGGAGGAGTGTTGCCAGGACAGGGAGTCCGTTTTAGAGAGGCACGGGGCCGCAATGGCCACGATAAAAGACCTCGAGGCAGAGGTGGACGCCTTGAAGGCGCAGATCGCGGCGCTGACCGGGCAGCTGGCGAACGAGATGGCGGCGAGAGGCACGCTGGAAAAAGAACTGGAGGACAGCCGGGGTGAGATTGAAAGTCTGCGCAAGGAGAATTCGGATCTGAAGGGTCGGATAGACGCGGCGAACGCCGAGAACGAGGAACTTCGCGACGCGTTAAGGAAATTGTCCGAGGAGGACGAACAGCTGAAGGCGCAGATCGAGCAACTGCGGAACAACATCAAGGACCTGGAGGATCTGAATAACGCCCTGAGGGGCGACTACGACAGCATGAAACAGGCGCTGGATAATTTGGAGGCAGAGATCAACAGACTGGAGGCGGAAGGAAGGAGGGCAGCGGATGAAAGACAGGCATTGTTGAATGAGATCGACGGCGTTAGAAGGCAGCTGGAGGAGGCGCTGGCCGAGAACGAGACGCTGAAAGCCGAACTGGACGAGGCCGGCGAGCAACTCGAGAAACTGAGAGCGGAGAAGGACGAGCTGCAGAAGAGTGTCGACGCGCTTCGGCACGAGAACGATTCGTTCAAACGGGATATGAGGGCGTTAAGGGAGGATCTCGATGACGCTAGGAGGCAGGTGGACGAGCTGAAAGCGTCTGGCGACGCGCTGATGGCCGCGGACGGCGATAAGAAGTCTGAAATCGAGAGGCTGACGCAGGAGGTGGAGGGTCTGAGGTCTGAAAAGGATCGCCTGACCAGCGAGAACGCCGAATTGAAAGACGAAAACGCGGAACTCGGACGAAAACTGGAGGATGCCGTGAAGGAGTTGGAACAATTGAGAGAGAAAAATGCTGATTTACGGGCGGAGGTGGATCGTTTGGCGCAGGAATTGGAGAGGGCGCGGGGCGAGCTGGATCAATTGAAATCGGAAATGGCTGCGGTGAAGGACGCGCTCGATAAGTGCGTCGACGAGTTGGAAAAGTTGAAGGGTGAAAACAGTGATCTTAAATCGCTGAACGAGGCCTTGAAGACGGAGATTCAAGGGGTCAGGGGTGAAGGTGACGGTTTGGCGGTCGAGTTGAACACTCTGAAGGGCGAGAGGGATCAATTGAAGGGCCAACTGAGCGAGTGTAACGCGGAGAACGAGAGACTGCAAACAGAGAAGAGCCAGTTGGAGGGCGAAATCGATAGCTTGAAGACGGAGAAGGGTCAATTGGAGGCCGAGGTCGAGACGCTGAAGGGAGAGAATGCCGGGCTGAAGGACGAGATCGAGAAGCTGCGAGGACAATTGCAATCGTTGAACGATGAATTGAATGCGTTGAAGACGCAGCTCGAAGAGGCTGGGAACAAAATTCAGGCGCTGGAGTCAACGGTCTCCAGTCTTCAAGGTGAAAATGAGAAATTTCGGAATGATTTGAGCTCTCTCCGGGGCGAGGTAGAAACTTTGAAGGCGCAATTGGAGAAGGAAGCTGGCGATAATCTAAAGTTGAGGAACGATCTCAAGATGCTCGAGGACCAGGTGCAAGATCTGACCAAGCAGGTGGAAAACGCCAGGGCAGAGAACGAGGCCTTGAAGACGGAGATTCAAGGTGTAAGAGGCGAAGGTGACGGTTTGGCGGCCGAATTGAACGCTCTGAGGGATGAAAGGTCCGCTTTGACGGGCGAGAGGGATCGATTGAAGGAACAACTGGGCGAATGCAACGCGGAAATCGAGAGATTGAAAACAGAGAAGAACCAATTGGATGCCGAAGTCGACAGGCTCAAGGGCGAGTTAAACACCtctaaagaagaaaatgacaGGCTGAAGGACGAAATGGAAAAACTGCGAGGACAATTGCCGTCACTGAACGACGAAATGGATGCACTGAAGGCGCAGCTCGAGGAGGCCAGAAACAAAATTCAATCACTGGAGTCAACGGTCTCGAGTCTCCAAGGTGAAAAtgagaaatttcgaaacgatttgaGCTCTCTCCAGGGCGAGGTGGACAATTTGAAAGCGCAATTGGAGAAGGAAGCTGGCGATAATCTGAAGTTGAGAAACGATCTGAAGCAACTGGAGGACCGTGTGCAAGATTTGGCCAAGCAATTGGACAACTCCAGGGCAGAGAACGAGGCACTGAAGTCGGAGATTCAAGGGGTTAGAGGCGAAGGCGACGGTTTGGCGGTCGAATTGAACGCTCTAAGGGACGAGAATTCGGTTCTGAAGGACGAGAGAGATCGATTGAGGAATCAATTTAACGAGTCCAACGCGGAGAACGAGAGACTGAGAGCCGAGAAGGGCCAATTGGAGGCCGAGATCGAGAGACTGAAGGGAGAATTAAATACttgcaaagaagaaaatggcAGGCTGAAAGACGAAATAGAACAACTGCGAGGACAAGTGCAGTCGCTGAACGATGAAATTAATGGATTGAAGACGCAGCTCGACGAGGCCAGAAACAAAATTCAGACACTGGAGTCAGCGGTCTCGAGTCTTCAAGGTGAAAacgagaaatttcgaaatgattTAAGCTCTCTGCAGACCGAGCTAAAGAATTCGAGGTCGCAATCGGACAGGGAAGCTGACGAAAATCTGAAGTTGAAGAACGATCTGAAGATGCTGGAGGACCAGGTGAACGATTTGATCAAGAAGTTGGAGAACGCCAGGGCAGAAAACGACGCCTTGAAGGCGGAAAATCAAAATTTGAAGGCACAGGTGAACGACATGAATCAAGAATTAGCCGGTCTAAGAACAGAGTGCGCGGACCTGAAGTCAGAGATCGCGGAATTGAAGAAGTCAATCGAAGAATTGAAGGCAAAGATTGCCAAACTGGAGGAAGATTTGGAGTACTGGAAATTGGAGAACTGTAAATTTCAGATGGAGGTGGACAAAGTAAAAGCTGAACTTCAGAAAGCTCTGAAGGACTTGAGTGAAGCTCAGGCTCTGAAACAGCAGCTGGAGGCAGAGGTGAATCGTCTCAAGGGCGAGAAGGCCGAGCTTGACAAAAAACTTGCAGACGCAACGGCACAGCTCGAGCAACTGAAGAAAGCACTCGAAGCGGACAAATCGGCGAAAGACAAGGCCGAGTCGCAAGTCGCGGCACTAAGGGCGGAACTCGATGCGTTGAAAGCGGAACTGGAAAAGCTGAGGGCCGAGAACAGCAAGTGCAAAGGTGAGCTGGACGATCTGGCCAGGCTGCTGTCGGCCACGAAGAACGAGCTTGACGCGTGCAAACAGGAGCTGTTCGCGTTGAGGGAGGCGAACAACGCTCTGAAGTCGGATTTGGACGCGTTGAAAGGCGTAAAGGACGAGCACAATAAATTAAAGGCGGATGCGGATGCAATGAAAGCCGAGAATGCGAATCTCCAGCAAGAGAGACAAAGACTGCAGGACGAGTACGCCAAGCTGAAAGCGGAGGCGGACGGTCAAAGGGTGGAATTGGACAAGCTGAGATCGGACTTGGCGTCGGAGAGGGCAGCCGCGGAAAAGCTGCGATCGGATCTGCAGAACCGGCAAGCCGAGAACGCGAAGCTGCAGTCAGAAGTGAACGAACTGAGAAACGAATTGGACAGATTGAAAAACGAGAGCGCCAGTttgagaaacgaaatgaacgagCTGAAGAAAACGCTCGAAGCTAAGATCAAGTCCTTGGAGGATCAACTTTCGAACCTCCTGGCGGAGAAGGAGGAGCTGCTGAAGGAACTGGCTCGTCTGCGCGACGAGGTGGCCAATCTGAAGAACGAgatcgagaaacagaaagcCGCCAAGGAGGCAGCCTTGAGCGAACTGGCGGGCCTGAGGAAGGAGCTGGCCGATCTAGAAAATGTTTTGGCCGGCCTCAGAGCGACGCTGGACAAGGCGAACGGTGACAACGAAGCGCTGCGCAAGGACAACGAGAGGATGAAAGCGGACCTGGCGAACGCGAGCAAAGATCTGGAAGCGCTTAGAGGCCAGAACTCGCAGTTGGCTGCCGACTTAGCTGGGGCGAAAGAGAAATTGACAGACTTGGAGAAACAGCTCGAAGACTTGAAGAAGGAGAACGGCGACCTGAAGTACAAAGCTGCCCGCCTCGAGGACACGTTGACGGAGCTCGAAGCCTTGAAGAAGCAATTGGAGGGCGCTAACACGGAGCTGGAGAAACTACGGTCCGGATTGGATAGATTGAATTCGGAGAACGCAAAGCTGCGGGACGATTTCAACAGGGCGACGGACGAAGCGAATAAGTTGCAGGCCAGTTTGGACAAACTGCGAAACGACTACAACGGGCTGCAGGACGAACTATCTAAGACGCAGGATAAATTATCCAAGACGCAGAATGATTTGTCTAAGCTGCAGGACGAAGATGCGAAGCAGAAAGCGCTCGGAGCGTCGCAAGTAGGTGATCTGAATAAATTGCAGAAGGAGAATAACGATCTGCGGGATCAAATCGCGAAGCTGAAGAGCCAGTTGCACGACTGCCGCGAGGAGAGTGATGGGCTGCACAAGGAGCTGGGAAATTTGCAGAGGAAGAATGCCAAGCTGATGGAGGCAGCAGAGAGGCCGATGGAGCCAAAGGAGACGGAAATGGACAAAGATATTCTGGATGAATGCGGCGACTTCATTAAGGCGAACGAATTACTGGGGGACAAGTTTGAAAAGCAGAACGaagGTGTTCAACGCGTTCGAGATTATATATCATATTTGGAAGGTAAGGCCGAGGAGCCGAAAATGGCGGATCAATTGGAAGAATCGCATATTGAACCCCAATTAAGGGAAGACATTGCTGACTTGTTGAAGAAATCACAGGCGTTATCGAAATCTATTTATGAAACGGAGCAGACAATACAAAATCTTACTGATATCATGAAAAAACCAGCGGAGGCCGTGGCTCCTGGAGCCTTCGATCCCGATTCTTGGCTGAGTGCGTTGACGTTGACGCAGTTAGCGGAGCTTCACGATAAGATTTGCCTGTTGACATCGGACATGGTGCAGCAGGATAGCAAAGCGGTGCCATGCAGTTCGGCCCCCGGGCCGGCGAATCCGCAAGCGGATTACAATGTTTTGAACAGGCGGATAGCCGCTTTGCAGAAACAGATAGCAGAGAAGCAAATGGAGGCAGCGTGGAGGATGCAGGAATTGAAGCGGTCCCTTCGAATGGAGCAGGCTAACCTAATTGAAATCTCCGACTATATGAATTTGGAGAGAAAGCGAAACTTGGCCCTGCAGTTCACCATCGATGAATCGCCATAG
Protein Sequence
MEVCRCGCASSTSLESFNPPNEPCCCCSYNPFSDNSKESEIYDLSFALRKLGVMKCQMKKWRMERLQLESENRSLKQTLQSFGVNADEILKPDPLLVHSREEIERLQNANASLDDKVRDLEEILAERDCCDDPDATIHFLREKMRHLRERFALEKKELRDIISNLKLKLAQTEEDVSCPAIYRLRAKLRDLMKGETSLGENDHLLAELDELRRGLAGREKREAVETRAMETATVPEYVDVSDVLKKLNDCENVVGDLERKLEEKDEKIADLLDRLKQFDILQAELEDLKRKLAERESENGKLKTEYDDRASNLQLQHEGRTSMLQIQMDELLAEIMGLNQTIGTLNKNIDDLNQEIAALKSRIADLENQLEECNKRNAELEECCQDRESVLERHGAAMATIKDLEAEVDALKAQIAALTGQLANEMAARGTLEKELEDSRGEIESLRKENSDLKGRIDAANAENEELRDALRKLSEEDEQLKAQIEQLRNNIKDLEDLNNALRGDYDSMKQALDNLEAEINRLEAEGRRAADERQALLNEIDGVRRQLEEALAENETLKAELDEAGEQLEKLRAEKDELQKSVDALRHENDSFKRDMRALREDLDDARRQVDELKASGDALMAADGDKKSEIERLTQEVEGLRSEKDRLTSENAELKDENAELGRKLEDAVKELEQLREKNADLRAEVDRLAQELERARGELDQLKSEMAAVKDALDKCVDELEKLKGENSDLKSLNEALKTEIQGVRGEGDGLAVELNTLKGERDQLKGQLSECNAENERLQTEKSQLEGEIDSLKTEKGQLEAEVETLKGENAGLKDEIEKLRGQLQSLNDELNALKTQLEEAGNKIQALESTVSSLQGENEKFRNDLSSLRGEVETLKAQLEKEAGDNLKLRNDLKMLEDQVQDLTKQVENARAENEALKTEIQGVRGEGDGLAAELNALRDERSALTGERDRLKEQLGECNAEIERLKTEKNQLDAEVDRLKGELNTSKEENDRLKDEMEKLRGQLPSLNDEMDALKAQLEEARNKIQSLESTVSSLQGENEKFRNDLSSLQGEVDNLKAQLEKEAGDNLKLRNDLKQLEDRVQDLAKQLDNSRAENEALKSEIQGVRGEGDGLAVELNALRDENSVLKDERDRLRNQFNESNAENERLRAEKGQLEAEIERLKGELNTCKEENGRLKDEIEQLRGQVQSLNDEINGLKTQLDEARNKIQTLESAVSSLQGENEKFRNDLSSLQTELKNSRSQSDREADENLKLKNDLKMLEDQVNDLIKKLENARAENDALKAENQNLKAQVNDMNQELAGLRTECADLKSEIAELKKSIEELKAKIAKLEEDLEYWKLENCKFQMEVDKVKAELQKALKDLSEAQALKQQLEAEVNRLKGEKAELDKKLADATAQLEQLKKALEADKSAKDKAESQVAALRAELDALKAELEKLRAENSKCKGELDDLARLLSATKNELDACKQELFALREANNALKSDLDALKGVKDEHNKLKADADAMKAENANLQQERQRLQDEYAKLKAEADGQRVELDKLRSDLASERAAAEKLRSDLQNRQAENAKLQSEVNELRNELDRLKNESASLRNEMNELKKTLEAKIKSLEDQLSNLLAEKEELLKELARLRDEVANLKNEIEKQKAAKEAALSELAGLRKELADLENVLAGLRATLDKANGDNEALRKDNERMKADLANASKDLEALRGQNSQLAADLAGAKEKLTDLEKQLEDLKKENGDLKYKAARLEDTLTELEALKKQLEGANTELEKLRSGLDRLNSENAKLRDDFNRATDEANKLQASLDKLRNDYNGLQDELSKTQDKLSKTQNDLSKLQDEDAKQKALGASQVGDLNKLQKENNDLRDQIAKLKSQLHDCREESDGLHKELGNLQRKNAKLMEAAERPMEPKETEMDKDILDECGDFIKANELLGDKFEKQNEGVQRVRDYISYLEGKAEEPKMADQLEESHIEPQLREDIADLLKKSQALSKSIYETEQTIQNLTDIMKKPAEAVAPGAFDPDSWLSALTLTQLAELHDKICLLTSDMVQQDSKAVPCSSAPGPANPQADYNVLNRRIAALQKQIAEKQMEAAWRMQELKRSLRMEQANLIEISDYMNLERKRNLALQFTIDESP

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2