Basic Information

Gene Symbol
-
Assembly
GCA_951802695.1
Location
OX637853.1:6590114-6604154[+]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 45 0.14 1.5e+02 3.0 0.4 25 50 56 81 50 88 0.59
2 45 0.25 2.7e+02 2.1 0.6 39 62 101 124 97 127 0.65
3 45 0.63 6.9e+02 0.8 0.2 40 56 200 216 183 220 0.87
4 45 0.92 1e+03 0.3 4.6 33 62 277 306 264 309 0.69
5 45 0.029 32 5.1 0.0 27 63 321 357 319 359 0.90
6 45 0.4 4.4e+02 1.4 3.3 24 60 360 399 359 403 0.70
7 45 1.3 1.4e+03 -0.2 7.1 37 57 411 431 387 463 0.58
8 45 0.04 44 4.7 2.0 24 62 433 471 430 474 0.85
9 45 0.0054 5.9 7.4 5.2 25 63 469 507 465 509 0.88
10 45 0.023 25 5.4 8.1 25 56 525 556 514 565 0.56
11 45 0.0011 1.2 9.7 2.3 28 58 577 607 562 621 0.55
12 45 4.7e-07 0.00052 20.4 2.6 23 63 635 675 633 677 0.95
13 45 0.00026 0.28 11.7 5.5 24 63 664 703 663 705 0.94
14 45 0.0014 1.5 9.3 3.0 28 58 682 712 677 719 0.52
15 45 0.005 5.5 7.5 0.2 22 63 725 766 716 768 0.75
16 45 7.1e-05 0.078 13.5 4.4 26 60 771 805 768 809 0.92
17 45 1e-05 0.011 16.2 2.9 25 64 805 844 804 845 0.89
18 45 0.00045 0.49 10.9 1.8 26 64 855 893 853 894 0.96
19 45 0.00048 0.52 10.8 2.5 25 64 910 949 900 950 0.80
20 45 0.0026 2.9 8.4 0.9 28 62 941 975 937 978 0.88
21 45 0.00064 0.7 10.4 6.7 20 61 961 1002 954 1006 0.88
22 45 0.015 17 6.0 10.1 21 64 976 1019 976 1034 0.78
23 45 2.4e-05 0.026 15.0 1.5 24 64 1035 1075 1032 1076 0.96
24 45 0.0029 3.2 8.3 3.8 29 64 1089 1131 1082 1132 0.68
25 45 0.027 30 5.2 0.1 29 55 1124 1150 1118 1153 0.82
26 45 0.00023 0.25 11.8 4.8 24 63 1140 1179 1137 1180 0.93
27 45 9.7e-05 0.11 13.0 4.4 26 64 1170 1208 1169 1215 0.92
28 45 0.0018 2 9.0 0.2 24 59 1217 1252 1214 1257 0.89
29 45 0.0088 9.6 6.8 8.1 23 62 1272 1311 1256 1314 0.84
30 45 0.00025 0.28 11.7 3.6 23 65 1293 1335 1291 1335 0.92
31 45 0.044 48 4.5 2.0 29 59 1313 1343 1308 1355 0.64
32 45 0.23 2.6e+02 2.2 1.9 24 64 1329 1369 1328 1378 0.92
33 45 0.021 23 5.5 6.8 27 63 1381 1417 1377 1419 0.88
34 45 0.00056 0.61 10.6 5.5 18 62 1421 1465 1417 1468 0.67
35 45 0.49 5.3e+02 1.2 0.8 31 61 1469 1499 1463 1503 0.77
36 45 4.1e-05 0.044 14.2 3.2 24 61 1500 1537 1499 1540 0.91
37 45 0.00071 0.77 10.3 12.2 23 64 1569 1610 1554 1611 0.87
38 45 0.00022 0.24 11.9 7.4 25 63 1610 1648 1607 1650 0.90
39 45 0.44 4.8e+02 1.3 0.2 39 63 1659 1683 1651 1685 0.84
40 45 3.7 4.1e+03 -1.7 6.9 26 63 1695 1732 1663 1734 0.69
41 45 0.00086 0.94 10.0 3.8 24 63 1728 1767 1726 1769 0.89
42 45 0.023 25 5.4 0.4 28 65 1777 1814 1772 1814 0.88
43 45 0.33 3.6e+02 1.7 3.9 40 63 1845 1882 1798 1884 0.74
44 45 0.0027 2.9 8.4 1.5 24 60 1871 1907 1868 1911 0.91
45 45 2.8 3.1e+03 -1.3 0.7 29 60 1980 2011 1969 2015 0.64

Sequence Information

Coding Sequence
ATGGAGGCGTGTCGATGCGGATGCGCTTCATCGACGTCCCTAGAATCGTTCAATCCGCCGAACGAGCCATGCTGTTGCTGCAGTTACAATCCCTTCAGCGACAATTCCAAAGAATCAGAGATCTATGATCTCTCGTTCGCCCTGAGGAAGCTCGTCGTAATGAAGTGTCAGATGAAGAAGTGGCGAATGGAACGACTTCAGCTCGAGAGCGAAAATAGGTCTTTGAAACAAACCCTGCAGTCATTCGGTGTAAATGCGGATGAGATATTGAAGCCTGATCCACTGCTAGTGCACTCCAGAGAAGAAATCGAAAGGCTGCAAAATGCAAACGCGAGTCTCGAAGATAAAGTGAGGGATCTGGAGGAAATTCTCGCTGAACGAGATTGCTGCGACGATCCTGACGCCACGATTCACTTCCTCAGAGAGAAGATGAGACATCTCAGGGAGCGTTTCGCGCTTGAAAAGAAAGACATGCGGGACATAATATCGAATTTGAAGTTGAAGCTGGCGCAAACCGAGGAGGACGTCAGTTGTCCGGCGATATATCGTCTTCGGGCGAAGTTGCGTGACTTGATGAAGGGTCAAACGAACCTTGGCGAAAACGATCATCTCCTTGCCGAGATAGACGAGCTGCGACGCGGGTTGGCTGGCCGCGAGAAAAGGGAAGTAGTAGAGACGAGGGCAATGGATACGGCCACTGTGCCTGAATACGCTGACGTCTCGGATCTAttgaaaaaattgaacgactGCGAAGACGCTGTGGGCGACCTGGAGAGGAAGTTAGAAGAGAAGGACGACAAGATCGCCGACCTCCAGAACAGACTGAAGCAGTTCGACCTACTGCAGGCCGAACTCGAAGATCTTAAACGTAAACTGGCAGAACGCGAATCGGAGAATGGCAAGCTCAAAGAAGAATACGACGATAGAACTTCCATCCTGCAGTTACAACACGACGGCAGAACTTCCATGTTGCAGATACAAATGGACGAATTGTTGGCCGAAATAATGGGTCTGAATCAGACCATCGGCACCTTGAACAAAAACATCGACGATTTGAACCAAGAGATCGCCGATCTGAAGTCGCGAATAGCCGATCTGGAGGCCCAACTGGAGCAGTGTAACAAGCGAAACGCCGAACTGGAAGAGTGTTGCCAGGACAGGGACTCCGTTTCAGAGAGGAACGGGGCCCTGGAGCAAGAGCTCGCGTCCGCAATGGCCAGGATAAAAGACCTCGAGGCACAGGTGGACGCTTTGAAGGCGCAGATCGAGGCACTGACCGGGCAACTGGAGAACGAGAAGGCGGCGAGGAGTTCGCTGGAAAAGGAACTGCAGGATAGCCGGGGTGAGATCGAAAGTCTGCGCAAGGAGAATTCGGATCTGAAGGGCCAGATAGACGCAGCGAACGCTGAGAACGATGAACTTCGCGACGCGTTAAGGAAATTGTCCGAGGAGGACGAACAGCTGAAGGCGCAGATCGAGCAACTGCAGAACAATATCAAGGACCTGGAGGGCCTGAATAACGCCCTGAGGGGCGACTACGACAACATGAAGCAAGCGCTGGATAATTTGGAGGCGGAGATCAACAGACTGGAGGAGGAATTAAGGAGGGCAGAGGATGAACGGCAGGCACTGTTGAATGAGATCGACAGCACTAGAAAGCAACTGGAGGAGGCGTTGTTAGTGAACGAGACGCTGAAGGCTGAACTGGACGAGGCCGGCGAGCAACTCGACAAACTGAGAGCGGAGAAGGACGAGCTGCAGAAGAGTGTCGACGCGCTCCGGCACGAGAACGATTCGTTCAAACGGGATATGAGGACGTTAAGGGAGGATCTCGATGACGCGCGGAGGCAGGTGGACGAGCTGAAAGCATCTGGCGACGCGCTGAGAGCCGCGGACGACGATAAGAAGTCTGAAATCGAGAGGCTGACACAGGAGGTGGAGGGTTTGATGTCTGAAAGGGATCGCCTGACGAGCGAGAACGCCGATCTGAAAGCCGAAAACGCGGAACTGGGGCGAAAACTGGAGGATGCCGTGAAGGAGTTGGAACAATTGAGAGCGGAAAATGCTGATTTACGAGCGGAGGTGGATCGTTTGGGGCAGGAATTGGAGAGAGCGAGAGGCGAGCTGGATCAATTGAAATCGGAAATGGCTGCCGTGAAGGACGCGCTCGATAAGTGCGTCGACGAGTTGGAAAAGTTGAAGGGTGAAAACAGTGATCTTAAATCGCTGAACGAGGCCTTGAAGACGGAGATTCAAGGGGTTAGGGGCGAAGGTGATGGTTTGGCGGTCGAGTTGAACTCTCTGAAGGGCGAGAGGGATCAATTGAAGGACCAACTGAGCGAGTGTAACGCGGAGAACGAGAGATTGAAAACTGAGAAGAGCCAGTTGGAGGGCGAAATCGATAGCTTGAAGACGGAGAAGGGTCAATTGGAGGCCGAAGTCGAGAAGCTGAAGGGAGAGAATGCCGGGCTGAAGGATGAGATTGAGAAGCTGCGAGGACAATTGCAATCATTGAACGATGAAATGAATGCGTTGAAGACGCAGCTTGAGGAGGCTGGGAACAAAATTCAGTCACTGGAGTCATCGCTCTCGAGTCTTCAAGGTGAAAATgagaaatttcgaaatgatTTGAGCTCGCTGCAGGACGAGATAAACAATTTGAAAGCACAATTGGAGAAGGAAGCCGCTGATAATCTAAAGTTGAGGAACGATGTCAAGGTGCTCGAGGACCAGGTGCAAGATCTGAACAAGCAGGTGGAAAATGCCAGGGCAGAAAACGAGGACTTGAAGTCGGAGATTCAAGGGGTTAGAGGTGAAGGTGACGGTTTGGCGGCCGAATTGAACGCTCTGAGGGATGAAAAGTCCGCTTTGACGGGCGAGAGGGATCGATTGAAGGAACAACTGAGCGAATGCAACGCGGAAATCGACAGATTGAAAACAGAGAAGGACCAATTGGAAGCTGAAGTCGACAGGCTGAAGGGAGAGTTAAATACCTCCAAAGAAGAAAATGGCAGGCTGAAGGATGAAATGGAAAAACTGCGAGGACAATTGCAGTCATTGAACGATGAAATGAATGCATTGAAGGCGCAGCTTGAGGAGGCCAGAGACAAAATTCAGTCACTGGAGTCAACGGTCTCGAGCCTTCAAGGTGAGAATGAGAAGTTTCGCAATGATTTGAGCTCTCTGCAGGGCGAGGTGCAAAATTTGAGAGCGCAATTGGAGAAGGAAGCTGGCGATAATCTGAAGTTGAGAAACGATCTGAAGCAACTGGAGGACCAAGTGCAAGATTTGGCCAAGCAGTTGCAAAATTCCAGGGCGGAAAACGAAGCCCTGAAGTCGGAGATTCAAGGGGTCAGAGGCGAAGGTGACGGTTTGGCGGTCGAATTGAATGCTCTAAGGGACGAAAATTCCGTTCTGAAGGATGAGAGAGATCGATTGAGGAATCAATTTAACGAGTCCAACGCGGAGAACGAGAGACTGAGAGCTGAGAAGGGCCAATTGGAGGCTGAAATCGAGAGACTGAAGGGAGAATTAAATACTTGCAAAGAAGAAAATGCCAGGCTGAAAGACGAAATTGAACAACTGCGAGGACAGTTGCAGTCATTGAACGATGAAATGAATGCATTGAAGGCGCAGCTTGAGGAGGCCAGAGACAAAATTCAGACACTGGAGTCAACGGTCTCGAGTCTTCAAGGTGAAAATgagaaatttcgaaatgatTTGAGCTCTCTGCAGACCGAGATAAAGAATTCGAGAGCGCAATCGGACAGGGACGCCGATGAAAATCTGAAGCTGAAGAATGATCTGAAGATGCTAGAGGACCAGGTGAAGGATTTGATCAAGAAATTGGAAAATGCCAGGGCAGAAAACGACGCCTTGAAGGAGGAAAATCAAAATTTGAAAGCACAGGTGAACGATATGAATCAAGAATTAGCCGGTTTAAGAACAGAGTGCGCGGACCTGAAGTCAGAGATCGCGGATTTGAAGAAGTCAATCGAAGAATTGAACGTGAAGATTGCTAAACTGGAGGAAGATTTGGAGTACTGGAAATTGGAGAACTGTAAATTTCAGATGGAGGTGGACAAAGTGAAAGCTGAACTTCAGAAAGCTCTGAAGGACTTGAGTGAAGCTCAGTCTCTGAAACAGCAGCTAGAGGCAGAGGTGAATCGTCTCAAGGGCGAGAAGGCCGAGCTTGATAAAAAACTTGCAGACGCAATGACACAGCTCGAGCAACTGAAGAAAGCACTCGAAGCGGAAAAATCGGCGAAAGATAAGGCCGAGTCACAAGTCGCGGCCCTAAGGGCGGAGCTCGATGCATTGAAAGCGGAACTGGAAAAGCTGAGGGCCGAGAACAGCAAGTGCAAAGGTGAGCTGGACGATCTTGCAAGACAGCTGTCAGCCACGAAGAACGAACTTGACGCGTGCAAACAAGAGTTGTTCGCATTGAGGGAGGCAAACAACGCTCTGAAGTCGGAATCCGACGCGTTGAAGGGCATAAAGGACGAGCACAATAAATTAAAGGCGGACGTGGATGCAATGAAAGCGGAGAACGCGAATCTTCAACAAGAGAGACAAAGACTGCAAGATGAGTACGCCAAGTTGAAAGCGGAGGCTGATGGTCAAAAGCTGGAACTGGACAAGCTGAGATCGGACTTGGCTTCGGAGAGAGCAGCCGCAGAAAAGCTTAGAACAGATCTGCAGAACCGGCAAGCTGAAAACGCCAAGCTGCAGTCAGAACTGAACGAACTGAAAGCTGAATTGGACAGATTGAAAAACGAGAGTGCCAGTTTGAGAAACGAATTGAACGAGCTGAAGAAAGCGCTCGAAGCCAAGATCAAGTCCTTGGAGGATCAACTATCGAACCTTCTGGCCGAGAAGGAGGAGCTGCTGAGGGAACTGGATCGTCTGCGCGACGAGGCGGCCAGTCTGAAAAACGAAATCGAGAAACAGAAAACCGCCAAGGAAGCAGCCTTGAGCGAACTGGCGGGTCTCAAGAAGGAGCTGGCCGACCTAGAGAATGTCTTGGCCGGCCTCAGGGCGACGCTGGACAAGGCTAACAGTGAAAATGAAGCGCTGCGCAAGGACAACGAGAAGATGAAAGCAGACCTGGCGAACGCGAGCAAAGATCTGGAAACACTTAGAAACCAGAACTCGCAGTTGGCGGCTGATTTAGCTGGGGCGAAGGAGAAATTAACAGACTTGGAGAAACAGTACGAAGACTTAAAGAAGGAGAACGGCGATCTGAAGTACAAAGCCGCCCGCCTCGATGACACGTTGGCAGAGCTAGAAGCTTTGAAGAAACAATTGGACGGTGCTAATGCGGAGCTGGATAAGCTGCAGTCAGGATTGGATAGATTGAATTCGGAAAATGCAAAACTGCGGGACGATTTCAATAGGGCGACGGACGAAGCGAATAAGTTGAAGGACAGTTTGGATAAACTGCGAAACGAATACAACGGGCTGCAGGATGATTTGTCTAAGGCGCAGGATGAATTATCTAAGACGCAGAATGATTTGTCTAAGCTGCAGGACGAAGCTGCGAAGCAGAAGGCACTTGGGGCATCGCAAGTAGGGGATCTGAATAAATTGCAGATGGAAAATAACGATCTGCGGGATCAAATCGCGAAGCTGAAGAGTCAGTTGCACGATTGCCGCGAGGAGAGCGACGGGCTGCACAAGGAACTGGGAAATTTGCAGAGGAAGAATGCCAAGCTGATGGAGGCAGCAGAGAGGCCGATGGAGCCAAAGGAGACGGAAATGGACAAAGATATTCTGGATGAATGCGGCGACTTCATTAAGGCGAACGAATTACTGGGGGACAAGTTTGAAAAGCAGAACgaagGCGTTCAACGTGTTCGCGATTATATATCATATTTGGAAGGTAAAGGCGAGGAGCCCAAAATGGCTGATCAATTAGAAGAATCGCATATTGAACCGCAATTAAGGGAAGACATTGCTGACTTGTTGAAGAAATCTCAGGCGTTATCGAAATCTATTTATGAAACGGAGCAGACAATACAAAATCTTACTGATATCCTGAAAAAACCTGCGGAGGCCGTGGCTCCAGGAGCCTTCGATCCCGATTCTTGGCTGAGTGCGTTGACGTTGACGCAGTTAGCGGACCTTCACGATAAGATTTGCCTGTTGACATCGGACATGGTGCAGCAGGATAGCAAAGCGGTGCCATGCAGGTCAGCCCCCGGGCCGGCGAATCCGCAAGCGGATTACAATGTTTTGAACAAGCGGATAGCCGCTTTGCAGAAACAGATAGCAGAGAAGCAAATGGAGGCAGCGTGGAAGATGCAGGAATTGAAGCGGTCTCTTCGAATGGAGCAGGCTAACCTAATCGAAATCTCCGACTATATGAATTTGGAGAGAAAACGAAACTTGGCCCTTCAGTTCACCATTGATGAAGCACCATAG
Protein Sequence
MEACRCGCASSTSLESFNPPNEPCCCCSYNPFSDNSKESEIYDLSFALRKLVVMKCQMKKWRMERLQLESENRSLKQTLQSFGVNADEILKPDPLLVHSREEIERLQNANASLEDKVRDLEEILAERDCCDDPDATIHFLREKMRHLRERFALEKKDMRDIISNLKLKLAQTEEDVSCPAIYRLRAKLRDLMKGQTNLGENDHLLAEIDELRRGLAGREKREVVETRAMDTATVPEYADVSDLLKKLNDCEDAVGDLERKLEEKDDKIADLQNRLKQFDLLQAELEDLKRKLAERESENGKLKEEYDDRTSILQLQHDGRTSMLQIQMDELLAEIMGLNQTIGTLNKNIDDLNQEIADLKSRIADLEAQLEQCNKRNAELEECCQDRDSVSERNGALEQELASAMARIKDLEAQVDALKAQIEALTGQLENEKAARSSLEKELQDSRGEIESLRKENSDLKGQIDAANAENDELRDALRKLSEEDEQLKAQIEQLQNNIKDLEGLNNALRGDYDNMKQALDNLEAEINRLEEELRRAEDERQALLNEIDSTRKQLEEALLVNETLKAELDEAGEQLDKLRAEKDELQKSVDALRHENDSFKRDMRTLREDLDDARRQVDELKASGDALRAADDDKKSEIERLTQEVEGLMSERDRLTSENADLKAENAELGRKLEDAVKELEQLRAENADLRAEVDRLGQELERARGELDQLKSEMAAVKDALDKCVDELEKLKGENSDLKSLNEALKTEIQGVRGEGDGLAVELNSLKGERDQLKDQLSECNAENERLKTEKSQLEGEIDSLKTEKGQLEAEVEKLKGENAGLKDEIEKLRGQLQSLNDEMNALKTQLEEAGNKIQSLESSLSSLQGENEKFRNDLSSLQDEINNLKAQLEKEAADNLKLRNDVKVLEDQVQDLNKQVENARAENEDLKSEIQGVRGEGDGLAAELNALRDEKSALTGERDRLKEQLSECNAEIDRLKTEKDQLEAEVDRLKGELNTSKEENGRLKDEMEKLRGQLQSLNDEMNALKAQLEEARDKIQSLESTVSSLQGENEKFRNDLSSLQGEVQNLRAQLEKEAGDNLKLRNDLKQLEDQVQDLAKQLQNSRAENEALKSEIQGVRGEGDGLAVELNALRDENSVLKDERDRLRNQFNESNAENERLRAEKGQLEAEIERLKGELNTCKEENARLKDEIEQLRGQLQSLNDEMNALKAQLEEARDKIQTLESTVSSLQGENEKFRNDLSSLQTEIKNSRAQSDRDADENLKLKNDLKMLEDQVKDLIKKLENARAENDALKEENQNLKAQVNDMNQELAGLRTECADLKSEIADLKKSIEELNVKIAKLEEDLEYWKLENCKFQMEVDKVKAELQKALKDLSEAQSLKQQLEAEVNRLKGEKAELDKKLADAMTQLEQLKKALEAEKSAKDKAESQVAALRAELDALKAELEKLRAENSKCKGELDDLARQLSATKNELDACKQELFALREANNALKSESDALKGIKDEHNKLKADVDAMKAENANLQQERQRLQDEYAKLKAEADGQKLELDKLRSDLASERAAAEKLRTDLQNRQAENAKLQSELNELKAELDRLKNESASLRNELNELKKALEAKIKSLEDQLSNLLAEKEELLRELDRLRDEAASLKNEIEKQKTAKEAALSELAGLKKELADLENVLAGLRATLDKANSENEALRKDNEKMKADLANASKDLETLRNQNSQLAADLAGAKEKLTDLEKQYEDLKKENGDLKYKAARLDDTLAELEALKKQLDGANAELDKLQSGLDRLNSENAKLRDDFNRATDEANKLKDSLDKLRNEYNGLQDDLSKAQDELSKTQNDLSKLQDEAAKQKALGASQVGDLNKLQMENNDLRDQIAKLKSQLHDCREESDGLHKELGNLQRKNAKLMEAAERPMEPKETEMDKDILDECGDFIKANELLGDKFEKQNEGVQRVRDYISYLEGKGEEPKMADQLEESHIEPQLREDIADLLKKSQALSKSIYETEQTIQNLTDILKKPAEAVAPGAFDPDSWLSALTLTQLADLHDKICLLTSDMVQQDSKAVPCRSAPGPANPQADYNVLNKRIAALQKQIAEKQMEAAWKMQELKRSLRMEQANLIEISDYMNLERKRNLALQFTIDEAP

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01067355;
90% Identity
iTF_01065989;
80% Identity
-