Basic Information

Gene Symbol
-
Assembly
GCA_958009045.1
Location
OY253744.1:18182746-18190940[-]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 37 0.096 80 3.6 0.2 40 64 109 133 103 134 0.88
2 37 0.71 5.9e+02 0.8 1.2 34 57 151 178 142 198 0.70
3 37 0.02 17 5.7 0.2 30 55 223 248 215 251 0.73
4 37 5 4.2e+03 -1.9 0.1 27 47 370 390 363 398 0.49
5 37 0.35 2.9e+02 1.7 0.1 32 52 449 469 441 476 0.77
6 37 0.011 9.2 6.6 6.5 22 64 477 519 476 520 0.87
7 37 0.00087 0.73 10.1 3.0 24 61 500 537 498 547 0.84
8 37 0.039 32 4.8 5.1 25 64 550 589 548 590 0.91
9 37 0.0056 4.7 7.5 4.3 22 63 568 609 567 611 0.92
10 37 0.97 8.1e+02 0.3 0.6 31 63 605 637 598 645 0.83
11 37 2.2e-06 0.0018 18.4 8.6 23 62 646 685 644 688 0.92
12 37 0.061 51 4.2 1.8 27 55 685 713 682 719 0.58
13 37 0.0031 2.6 8.3 3.4 28 61 735 768 729 772 0.81
14 37 6.8e-06 0.0057 16.8 6.2 22 64 771 813 770 814 0.90
15 37 0.0057 4.7 7.5 1.5 35 63 812 840 809 842 0.90
16 37 1 8.6e+02 0.3 1.0 41 61 846 866 844 888 0.53
17 37 0.002 1.7 9.0 0.8 24 64 892 932 891 933 0.89
18 37 0.79 6.6e+02 0.6 0.1 26 56 943 973 940 979 0.75
19 37 0.0018 1.5 9.1 0.2 27 60 979 1012 975 1015 0.88
20 37 0.34 2.9e+02 1.8 5.9 25 63 1033 1071 1021 1073 0.88
21 37 5.3e-06 0.0044 17.2 6.3 28 62 1078 1112 1076 1115 0.90
22 37 0.06 50 4.2 1.3 30 62 1108 1140 1106 1150 0.62
23 37 0.11 93 3.4 0.5 26 55 1146 1182 1144 1188 0.85
24 37 0.011 9.4 6.5 0.7 30 63 1192 1225 1185 1227 0.76
25 37 0.0021 1.8 8.9 1.9 34 63 1231 1260 1226 1262 0.87
26 37 0.00016 0.14 12.4 2.2 33 63 1268 1298 1266 1300 0.93
27 37 0.3 2.5e+02 2.0 0.5 29 63 1306 1340 1301 1342 0.77
28 37 0.37 3.1e+02 1.7 5.0 27 62 1332 1367 1314 1370 0.78
29 37 0.00079 0.66 10.2 3.8 25 62 1372 1409 1369 1412 0.86
30 37 0.0032 2.7 8.3 0.8 27 56 1416 1445 1410 1450 0.58
31 37 0.079 66 3.8 4.0 23 64 1447 1488 1444 1489 0.90
32 37 0.0076 6.3 7.1 4.4 29 63 1502 1536 1497 1538 0.90
33 37 0.13 1.1e+02 3.2 0.8 38 62 1546 1570 1539 1573 0.87
34 37 0.0047 3.9 7.8 2.4 26 60 1576 1610 1573 1622 0.61
35 37 3.1 2.6e+03 -1.3 0.1 41 59 1629 1647 1619 1651 0.71
36 37 3.6e-06 0.003 17.7 1.5 27 62 1664 1699 1654 1702 0.91
37 37 3.7 3.1e+03 -1.5 1.3 35 56 1953 1975 1939 2002 0.60

Sequence Information

Coding Sequence
ATGGACTGCGATCATGATTCGAGATCGTGCTGTCTTTGCGGCTGTGAATCTCCCATGACAAGCGATATAGATGAGCCTCCCAGTGAGCCATGCTGTTGCTGCAATTACAATGCCTTCGGGAACGATTGCGACGAATCGGAAATTCATGATCTCTCATTCGCACTTCGTAAGCTCAGGGTGATGAAGTGTCAGATGAAGAAATGGCGAATGGAACGTCTTCAGTTGGAGAGCGACTGTCGATCGATGAAACAGACTCTTCAATCATTTGGTGTCAATACCGAGGAAGGCCTCAAAACCGACCCTCTGATTGTTCACTTTCGCGGTGAGAACATGAGATTGCAAAATATCAACGAGCTTCTCGATGAAAAGATTAAAGATCTTGAGGAAAGCTTAGCTGAGCGCGATCTGTGCGAGGATCCCTGCGCACGTGTACAATTAATGAGGGACAAGATTAGAAAACTGAAGGATCAATGTTttgccgagaaaaaaaaATTAAAAGCCACTGTAACGGAGCTGCGAATAAAATTAGCCGAAGCCGAGGACGACACATCGTGCGCTGCGTTAAATCATCTGAGGGCAAAGTTACGAGAGCTGATGAAGGGTGGCCAAAAAACTGATCAGCAGGTCTCTAAAGTGGttgaaaaatcgatcgagacaCTGTCCGATCTGTCGACGAGTTGCGACGCGCTTAAAGCGGAGAATGAGCGATTGCTCGCCGAGCTCGATGTCTGTCGGAATGCTTTGAGAGATAAGGGAGTTGATAGCATTATGATTTCTCCCAAAAGGGCTGAGACGCCTGAACTTGGAATCTTAAAATCAACCAATGGCAAAGAGTATGAAACTTTGATTGCCGATTTAAGGAAAAAATTGTCCGAGAACGAAGTTACGATAGCGGATTTGCGTAATCAACTTGACGATCGAACGACAACCGTCGCGAGATTGTCGAAAGATCTCGATAAAGTAATGGTCAGTCACAGGGCTCTTATGGCCGAGGTTACCGACGCTAAGAACGAACTTATGAAACGTGACAAAAAGGTCGAGGAACTTTTGCTAAGTGTTGGCAGATCTAGTCTCGAGATATTGGGCATGGAAGATTTGAGAAAAGAGATGGATTCGATGAAACCAAAGGTTTACGGTCTTGAGATCGAGAGAGCTCAGCTTTTGGACGAATTACACAAAATTCGTCTTGTCGTATCCGATCGAAACGATCAGATAATCAACATTTTGGAACACCGAGACAAGAGTATCGGCGATTATAAAACTAAAATGGATGCGATGCAAGCGAGAATAGATGCTCTTATTGGTCAAGAAGGTATCACGAGAAAAGAGGCGGAACATTCGACAATTCAAATAACCGAACTCGAGGACGAAATTTCGAGGCTCAAAGCGCGAATAGACGAGTTGGAAGGTAGCTGCAATGATTACAATAAACTCAAGAATGATTTCAACGAAATGAACGAGATGTTGGAAAAattgcgcaacgaaaatgtcgcTCTGCGCGACGAGACGGAAAAATTGACTGAAGAAACGACAAAGCTGAAGGACCAGACCGATAAGCTCAACGCAGAGATTGATAAGCTACGCGTGGAAAATGACAAACTCGTCGTCACGGTTGACGGGTTGAAAAAAGATTTAGCCGAtgagagaatgaaaaatgaCATGTTGGATAAAGAGCTAAGCCGCACGAAGGcggagaatgaaaaattgaaagataatgaaaataatatgagAGAAAGAGCGGACAATCTGGCGAAGCAATTACAGTCTTTGAACGATGAAAATTCGGCAGTGAAGAGTGAGTTGAACGCGACGAAAGGTGAATTGGCGAGCCTGATGGACGAGATGAAAAAATTGCGTGAAGATTTTAACAGACTCGTGGAAAAGAGTGACACTCTTGAGGTTGAACTCGACAAAGCGATGAAAGAATTGTCGGAAATGAGGGAGAAAAATGATTCACTGAGGACAAACGTGGAGGAGCTTGAGAGAGAAAACGATAGACTGAAAAAGGAGGTGAATAAATTGCGCGAAGAGCTGGAGAATATTAAAAGGGATATGGAAAAATGTCGAAAAGAGAATGCCGACGTGGAAAAAGAATTGGAACGATTGAGACGAGACAATGCCTCGTTGAAGTCGGATCTCGACGGTAAAACTAACGAGAGCGATTCTTCGAAGGCCGAGTCGATTGAATTAAAGAACGATTTATCGAAGCTCGAAAATGAGCTCGAAGCATTGCGGCGAGAGAACTACGGCCTGAGATCTCGCGCGGAGGCCCTTGAATCAGAAAATAATGACTTGAACAGCGCTGCGATGAAACTTAGAGACGAGTTAGAAAACGCTCAGGGCAAATTAGTGAAACTTGAGAATGAGAATAGTAGTTTACGGGAGGAAGTTGCTAAACTGCAGGAGGCTAATGCAGCTTTGAAAGCTAATCTCGACCGGCTCAATTCCCTTAACGATCAACTAAGAGAGCAAGTGTCTACGCTCACTCGCGAACTTGACGAGGCAAAGAGAAAGTTCAATGAAGCAATTGTCGAGAATGATGCTTTGAAAACGGAGATGCATAATTTGAAAACAGCGAATGATAATTTACGATCTGACATGGTCAAGTTGAACGCCGATTTGGAAATGATTcgagaaaaattggaaaaagtgGAGAACGAAAACGCCATTATTAAAGAGGATATGGAAAAATTACTGATGGATAATTCGAGTCTGAAAGACAATGTCGACAGGCTCGTGGAGGAAATAGACGAACTGAGGAGGAAAGGGGACGCATGTAAAGCGGAGAACGAAAGACTCACAGCAGCCGGTGACGGTCAGATAGAGAATCAAAAGCGTCTAGAGCACGAGATCGATGATTTGAGACACGAGAGAGCGGGAATGATAACGGACATGGAGAGACTGAAGAACGAACTCGGTGAAGCGATGAAAACGATCGGGAGTCTGGAATCTCGCATTCGCGATTTGCAGTCCAACAAGGATCAGCTTCAAAAAGACGTGGATGCATTGAAAGCCCTATCTGACGCACTGAAGGATGATTGGAATTCCGAGAAATCGGCGAGAGACGCAGCGGAGAAAGAACTTTTAGCTCTGAAAAACGAGCTGAAGGAGTTGCTCGGAAGATTAGACGATTTAAACAACGAGAAGGATAATCTAGTTAGGGATAAAGAGAACTTGAAGAAACAATTGAACGACGCCAAGGATCAAATCACCGAGAGCACCGGGGATAAAGATGAGTTTGAGAATGAAGCGAAACGTTTGACCTCCGAGTTGGAAAAACTGCGTAAGGAGAACGAGACGCTACGGCGAGAAGTATCTCAGCTTCAGGAAAACGTTGATCATTGGAAAATGGAGAATTGCAAGCTAAAGATGGAGGTCGAAAAGCTCAAAGGTGACATTGATAAATTGACAAAAGAGATTGAAAATTGTGGCGCTAGAGTCAAGGAACTCGAGAATCAATTGGTCGGGTTGGAGAGTGAAAAGGTAAAAAATTTCGGTACCATGGAACAATTAAAGAACGATATATCAAAGTTAACTGATGAAGTTAATATCGAGAAATCGGCGAAGGAAAGTGCTCTGAATCGTGCGAGAGCTCTGGAGGATGAAATAGAAGCGTTGAAAGCCGAACTAGCTAAGTGGAGAGATGAGAATGATCGTCTTGGACGAGAGTTCGAAGCTTCAACAAAACGTGAAGCTACGTTGGAGAGCGAATTGGCTAAACTGCGCGCTGAAATGTCAGCACTGCGAGAACAAAATGACAAAATGAAGTCTGACCTCGAGAGTGCAAAAGCCGACAAAGAGAAGATCGAAAAATTGTCGTCCGAGAACGCGGGCTTGAAGAGAGATCTGGAAAATCTTAAGAACTCGAATGATCAGCTAAAAGCAGAGCTTCAATCGTGCAATACCGAGAGTGACAAATTAAAATCCAATCTCACCGAATCCAAAGCTGAAATTGCCAAGCTCAAAAGCGACCTCGAACAATGTCGCTCGGAAGCCGACGCCCTGAGGGCTGATTTGAGCAGAGCCAATGACGAATCGAAGAGACTGAAGAATGAGCTCGATACGCTCAAGCCGGAATTGGAAAATCTACGAAAAGCACTGGCTAGTGCGGAGGCAAAGGTCAATGCTCTCGAGAGCCAACTGGCAGCTTTACAAGCCGAAAGAGCTAAGCTTGTCGAAGATCTGGAAGGTTTAAGACATGAAATGGCCAAGCTACGCAATGAACTTGACTCGGAGAAATTAGCGAAAGAAAATCTTGGCAAAGAAGTTGCGCAATTGAAATCAGAGCTTGGAACATTGAGGCTCGAGCTTGACAGAATGCGAGCCGATAACGATGCGGCCAAGGTAGAATCAAGTGGTCTGAAGAAGGAAATCGACAAATTGAAGTCTGATCTCGCCAGGCTCCAGGCCGAATTGTCCGAAAAAGAGGGTGAACTGAAATCGACGAAAGCCGAGAACGAGAGATTGAAAACGTCCCTCGATGCTGCCGAGACCGGGGCAAAACTTTTAGAGAATCAGCTGGAGGCTCTTGGACTCGAGAAGAAAAAGCTGGAGAACGAAGTCGCTGAACTCGTTGCCGAGGTTATGAGGCTTgggaaaaatttgtcgaatgaaaaatcattaaaagacGCGGTGATGGCGGAACTCGAGGCGCTGAAAAATACACTAGCTAGTCTGCGAGGAGAGTTGGAAAAATGTCGACAGGAAATGGAAATAATGAAGAAGGAGGTCGATCGAACGGAGATCGAGATGGGAAAATTGACGAACGAAACGAATAAACTGAAAGATGAAAATAATCGTATGCTAAATGAACTTGAGAGTTCGAAGGCTGATGTTGGCAAATTGCGCGCGGATTTGAAAGAAATGAACGGACTAaagggaaatttcgaaaaattacgttTGGACAACGAGTCATTGACGAATGATCGTGAAAAATATGTACGTGACGCGGATGAGGAACGAAGAAAGTCGAGGATATCCGACGCTGAGTCGAGTAACTTGAGAAACGAACTCGATCGTTTGAAGAGGGAAAACGAAAGGTTAAAGGCCGAGATTGGCGAATGCCAGAACGAGAGCGCTAAGCTCAGGAATAACATCGACGAGTTCAAAGACGCATCGGAGACCGCGAAGCGAACGGAAGAAGCAAAATACAAAGCCCCGGAACCTCGAAAAGCGGATTTGCCGAAGGAGCTTGAGGAAAAAGGTAGGGATTCGAAAATGGAGGATCGAACAACGAAAGCATCGGGTGATGTTCCAACTATTGTCCTGAAATCTGCGAAAGTTCCGTTTATGAACGACGATGGCATTTCCAAAACGAATCTCGATGAGCGCGGCGATTACGCGAGATCGAACGAGTTGCTGGATAAAAAACTTGATAAGCAGAatcaagGTGTGAAACGCGTTCGCGAGTACATCGGTCATATCGATGGCAAAGTTCCAACGAGACCTTCAATGGCTCGTGAACCCGACGACGAGCTGGACATCAGCGATTTTATCGAGCCTATCGTCGTACAAACGTTGCTGAAGTCTCGCGAACTTTCCGAGAATATTTACAACGTCGAGCTGGAGATACAAAAGATCGGCGACAGGCTCAACGATTGCAGCAAGGATAACGTTAGGTTAAAAAAGTTGCTCCAGGATCTTGGTGCCGAAGATCTCGAAGTTCAGGAATTTCACGAACAAATGGGCGACGCTTTCGACGCTGAGTCATGGCTGAGATCGTTGACGCTGACACAGTTGGCAGAACTTCATGATCGTATTTGCACGTTGACATCGGGAATGATGGAGAAAGACGGTAGTCAAGACGTGGCGGGTAAGAGGGATAAGCCGAAAAGCTCGCAAGTCCGAGTGGAAAAACAAGATCCACTCAGGGAGAATTACGAAATTCTCAACAAGCGTATAGCGGCATTGCAGAAAGAAATAGCCGAGAAGGAGGCGGATGCGGCGAGAAAGGTTTACGAGATGAGGCAATCTTTGCGTCAGGAGCAAGCTCACATGATAAGAATCTCGGATGAGATGAATCGCGAGAGAAAACGAAATTTAGCTCTTCATCTAACGATAGACGGTTCGCCTTGTTGGATGAACAATTGCAACGCTGAATGTAGCGGTGATTTTCAACAGTTGACTGCCAGGAACGTTTCGAACAATGTACCGAATGAATtaggaaaaaaacgatga
Protein Sequence
MDCDHDSRSCCLCGCESPMTSDIDEPPSEPCCCCNYNAFGNDCDESEIHDLSFALRKLRVMKCQMKKWRMERLQLESDCRSMKQTLQSFGVNTEEGLKTDPLIVHFRGENMRLQNINELLDEKIKDLEESLAERDLCEDPCARVQLMRDKIRKLKDQCFAEKKKLKATVTELRIKLAEAEDDTSCAALNHLRAKLRELMKGGQKTDQQVSKVVEKSIETLSDLSTSCDALKAENERLLAELDVCRNALRDKGVDSIMISPKRAETPELGILKSTNGKEYETLIADLRKKLSENEVTIADLRNQLDDRTTTVARLSKDLDKVMVSHRALMAEVTDAKNELMKRDKKVEELLLSVGRSSLEILGMEDLRKEMDSMKPKVYGLEIERAQLLDELHKIRLVVSDRNDQIINILEHRDKSIGDYKTKMDAMQARIDALIGQEGITRKEAEHSTIQITELEDEISRLKARIDELEGSCNDYNKLKNDFNEMNEMLEKLRNENVALRDETEKLTEETTKLKDQTDKLNAEIDKLRVENDKLVVTVDGLKKDLADERMKNDMLDKELSRTKAENEKLKDNENNMRERADNLAKQLQSLNDENSAVKSELNATKGELASLMDEMKKLREDFNRLVEKSDTLEVELDKAMKELSEMREKNDSLRTNVEELERENDRLKKEVNKLREELENIKRDMEKCRKENADVEKELERLRRDNASLKSDLDGKTNESDSSKAESIELKNDLSKLENELEALRRENYGLRSRAEALESENNDLNSAAMKLRDELENAQGKLVKLENENSSLREEVAKLQEANAALKANLDRLNSLNDQLREQVSTLTRELDEAKRKFNEAIVENDALKTEMHNLKTANDNLRSDMVKLNADLEMIREKLEKVENENAIIKEDMEKLLMDNSSLKDNVDRLVEEIDELRRKGDACKAENERLTAAGDGQIENQKRLEHEIDDLRHERAGMITDMERLKNELGEAMKTIGSLESRIRDLQSNKDQLQKDVDALKALSDALKDDWNSEKSARDAAEKELLALKNELKELLGRLDDLNNEKDNLVRDKENLKKQLNDAKDQITESTGDKDEFENEAKRLTSELEKLRKENETLRREVSQLQENVDHWKMENCKLKMEVEKLKGDIDKLTKEIENCGARVKELENQLVGLESEKVKNFGTMEQLKNDISKLTDEVNIEKSAKESALNRARALEDEIEALKAELAKWRDENDRLGREFEASTKREATLESELAKLRAEMSALREQNDKMKSDLESAKADKEKIEKLSSENAGLKRDLENLKNSNDQLKAELQSCNTESDKLKSNLTESKAEIAKLKSDLEQCRSEADALRADLSRANDESKRLKNELDTLKPELENLRKALASAEAKVNALESQLAALQAERAKLVEDLEGLRHEMAKLRNELDSEKLAKENLGKEVAQLKSELGTLRLELDRMRADNDAAKVESSGLKKEIDKLKSDLARLQAELSEKEGELKSTKAENERLKTSLDAAETGAKLLENQLEALGLEKKKLENEVAELVAEVMRLGKNLSNEKSLKDAVMAELEALKNTLASLRGELEKCRQEMEIMKKEVDRTEIEMGKLTNETNKLKDENNRMLNELESSKADVGKLRADLKEMNGLKGNFEKLRLDNESLTNDREKYVRDADEERRKSRISDAESSNLRNELDRLKRENERLKAEIGECQNESAKLRNNIDEFKDASETAKRTEEAKYKAPEPRKADLPKELEEKGRDSKMEDRTTKASGDVPTIVLKSAKVPFMNDDGISKTNLDERGDYARSNELLDKKLDKQNQGVKRVREYIGHIDGKVPTRPSMAREPDDELDISDFIEPIVVQTLLKSRELSENIYNVELEIQKIGDRLNDCSKDNVRLKKLLQDLGAEDLEVQEFHEQMGDAFDAESWLRSLTLTQLAELHDRICTLTSGMMEKDGSQDVAGKRDKPKSSQVRVEKQDPLRENYEILNKRIAALQKEIAEKEADAARKVYEMRQSLRQEQAHMIRISDEMNRERKRNLALHLTIDGSPCWMNNCNAECSGDFQQLTARNVSNNVPNELGKKR

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-