Basic Information

Gene Symbol
-
Assembly
GCA_963667235.1
Location
OY763902.1:8313013-8325993[+]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 42 0.14 94 3.1 0.1 25 50 56 81 52 87 0.61
2 42 0.46 3.1e+02 1.5 0.3 39 62 101 124 97 127 0.67
3 42 1.3 8.5e+02 0.1 0.2 40 57 200 217 183 220 0.87
4 42 0.041 27 4.9 0.3 28 62 322 356 319 375 0.72
5 42 2.3 1.5e+03 -0.7 0.3 24 57 360 393 358 401 0.76
6 42 0.022 15 5.7 1.7 24 62 416 454 401 457 0.88
7 42 0.0074 4.9 7.2 3.6 26 64 453 491 448 492 0.87
8 42 0.024 16 5.6 5.5 23 57 499 540 497 548 0.67
9 42 0.021 14 5.8 6.3 20 60 524 564 519 566 0.92
10 42 0.0025 1.7 8.7 4.6 32 59 557 584 541 596 0.59
11 42 3.7e-07 0.00025 21.0 3.4 23 63 618 658 617 660 0.95
12 42 0.0011 0.76 9.8 4.1 28 59 665 696 661 702 0.56
13 42 0.0065 4.4 7.4 0.3 23 63 709 749 700 751 0.72
14 42 5.8e-05 0.039 14.0 4.1 26 60 754 788 749 790 0.92
15 42 2.1e-06 0.0014 18.6 5.3 24 64 787 827 786 828 0.91
16 42 0.0001 0.069 13.2 1.1 26 64 838 876 835 876 0.95
17 42 6.2e-05 0.041 13.9 0.5 22 65 890 933 888 933 0.90
18 42 0.00069 0.46 10.5 7.8 20 56 944 980 937 987 0.65
19 42 0.023 16 5.6 2.7 36 64 988 1016 983 1017 0.88
20 42 4.9e-05 0.033 14.2 1.7 24 64 1018 1058 1016 1058 0.95
21 42 0.00062 0.42 10.7 0.7 25 64 1075 1114 1065 1115 0.73
22 42 0.00015 0.1 12.6 4.2 24 63 1123 1162 1120 1164 0.93
23 42 0.00017 0.12 12.5 1.9 26 60 1153 1187 1152 1192 0.90
24 42 0.0034 2.2 8.3 0.3 24 60 1200 1236 1197 1240 0.87
25 42 0.013 8.8 6.4 8.1 24 62 1256 1294 1239 1304 0.85
26 42 0.002 1.4 9.0 5.1 22 64 1275 1317 1269 1318 0.87
27 42 0.027 18 5.4 6.2 24 64 1284 1324 1281 1336 0.80
28 42 0.14 97 3.1 3.0 24 64 1312 1352 1311 1353 0.92
29 42 0.0069 4.6 7.3 7.5 26 63 1363 1400 1360 1402 0.92
30 42 0.0012 0.79 9.8 3.4 26 64 1405 1443 1402 1444 0.79
31 42 0.59 3.9e+02 1.1 0.4 31 60 1452 1481 1446 1486 0.81
32 42 0.00029 0.19 11.7 5.1 26 61 1485 1520 1482 1536 0.74
33 42 0.0017 1.1 9.3 10.2 23 63 1552 1592 1538 1594 0.80
34 42 4.3e-05 0.029 14.4 6.5 25 63 1593 1631 1591 1633 0.93
35 42 8.8 5.9e+03 -2.6 5.5 42 63 1645 1666 1629 1703 0.59
36 42 0.29 2e+02 2.1 0.6 26 64 1678 1716 1673 1717 0.78
37 42 0.0026 1.7 8.7 3.1 24 61 1711 1748 1709 1752 0.79
38 42 0.84 5.6e+02 0.6 1.5 32 64 1757 1789 1749 1790 0.79
39 42 0.6 4e+02 1.1 1.8 22 63 1761 1802 1759 1811 0.58
40 42 0.009 6 7.0 4.0 34 60 1836 1862 1806 1865 0.79
41 42 0.018 12 6.0 4.2 24 60 1847 1883 1841 1887 0.90
42 42 3.4 2.3e+03 -1.3 0.3 41 59 1954 1972 1926 1987 0.49

Sequence Information

Coding Sequence
ATGGAGGTGTGTCGATGCGGATGCGCTTCTACGACGTCTCTAGAATCGTTCAATCCGCCGAACGAGCCATGCTGTTGCTGCAGTTACAATCCCTTCAGCGACAATTCCAAAGAATCAGAGATCTATGATCTCTCGTTCGCCCTGAGGAAGCTCGGCGTAATGAAGTGTCAGATGAAGAAGTGGCGAATGGAACGACTTCAGCTCGAGAGCGAAAATAGGTCTTTGAAACAAACCCTGCAGTCATTCGGTGTAAATGCGGATGAGATATTGAAGCCTGATCCACTGCTAGTGCACTCCAGAGAAGAAATCGAAAGGCTGCAAAATGCAAACGCGAGCCTCGACGATAAAGTGAGGGATCTGGAGGAAATTCTCGCTGAACGAGATTGCTGCGACGATCCTGACGCCACGATTCACTTCCTCAGGGAGAAGATGAGACATCTCAGGGAACGTTTCGCGCTTGAAAAGAAAGACATGCGGGACATAATATCGAATTTGAAGTTGAAGCTGGCGCAGACGGAGGAGGACGTCAGTTGTCCGGCGATATATCGTCTTCGGGCGAAGTTGCGTGACTTGATGAAGGGTCAAACGAGCCTTGGCGAAAACGACCATCTTTTTGCCGAGTTGGACGAGCTTCGACGCGAGTTGGGTGGTCGCGAGAAACGGGAAGTAGTCGAGACGAGGGCAATGGAAACGGCCACAGTGCCTGAATACGTTGACGTCTCGGATCTattgaaaaaattgaacgacTGCGAGGACGTCGTGGGCGACCTGGAGAGGAAGTTAGAAGAGAAGGACGAGAAGATCGCGGACCTCCTTAACAGACTAAAGCAGTTCGACATACTGCAGGCGGAACTCGAAGATCTTAAACGTCAACTGGCGGAACGCGAATCGGAGAATGGCAAGCTCAAGACAGAATACGACGACAGAACGTCGAACCTGCAGTTACAACACGCCGGCAGAACTTCCATGCTGCAGATACAAATGGACGAATTGTTAGCCGAAATAATGGGTCTGAATCAAACCATCGGCACCTTGAACAAAAACATCGACGATTTGAATCAAGAGATCGCCGTTCTGAAGTCGCGAATAGCCGATCTGGAGTCCCAACTGGAGGAGTGTTGCCAGGACAGGGATTCCGTTTTAGAGAGGCAGGGGGTCCTGGAGCAAGAGCTCGCGTCCGCAATGGCCACGATAAAAGACCTCGAGGCGCAGATCGTGGCACTGACCGGGCAACTGGAGAACGAGAAGGCGGCGAGAAGCACGCTGGAAAAAGAACTGGAGGACAGCCGGGGTGAGATCGAAAGCCTGCGCAAGGAGAATTCGGATCTGAAGGGTCAGATAGACGCAGCGAGCGCCGAGAACGATGAACTTCGCGACGCGTTGAGGATATTGTCCGAGGAGGACGAACAGCTGAAGGCGCAGATCGAGCAACTGCGGAACAACATCAAGGACCTGGAGGACCTGAATAACGCCCTGAGGGGCGACTACGACAACATGAAACAGGCGCTGGATAATTTGGAGGCAGAGATCAACAGACTGGAGGAGGAGGGAAGGAGAGCAGCGGATGAACGGCAGGCATTGTTGAGTGAGATCGACGGCGTTAGAAAGCAGCTCGAGGAGGCGTTGGCCGAGAACGAGACGCTGAAAGCTGAACTGGACGAGGCCGGCGAGCAACTCGAGAAACTGAGAGCGGAGAAGGACGAGCTGCAGAAGAGTGTCGACTCGCTTCGGCACGAGAACGATTCGTTCAAACGGGACATGAGGGCGTTAAGGGAGGATCTCGATGACGCGAGGAGGCAGGTGGACGAGCTGAAAGCATCTGGCGACGCGCTGATGGCCGCGGACGGCGATAAGAAGTCTGAAATCGAGAGGCTGACACAGGAGGTGGAGGGTTTGAGGTCTGAAAAGGATCGCCTGACGAGCGAGAACGCCGATCTGAAAGCCGAAAACGCGGAACTGGGACGAAAACTGGAGGATGCCGTGAATGAGTTGGAACAATTGAGAGCGGAAAATGCTGATTTACGCGCGGAGGTGGATCGTTTGGCGCAGGAATTGGAGAGAGCGCGTGGCGAGCTGGATCAATTGAAGTCGGAAATGGCTGCCGTGAAGGACGCGCTCGATAAGTGCGTCGACGAGTTGGAAAAGTTGAAGGGTGAAAACAGTGATCTTAAATCGCTGAACGAGGCCTTGAAGGCGGAGATTCAAGGGGTTAGGGGCGAGGGTGATGGTTTGGCGGTCGAGTTGAACTCTCTGAAGGGCGAGAGGGATCAATTGAAGGGCCAACTGAGCGAGTGTAACGCGGAGAACGAGAGACTGAAAACAGAGAAGAGCCAGTTGGAGGGCGAAGTCGATAGCTTGAAGACGGAGAAGGGTCAATTGGAGGCCGAGGTCGAGAAGCTGAAGGGAGAGAATGCCGGGCTGAAGGATGAGATCGAGAGGCTGCGAGAACAATTGCAATCATTGAACGATGAAATGAACGCACTGAAGACGCAGCTTGAAGAGGCTGCGAACAAAATTCAGGCACTGGAATCGATGGTCTCGAGTCTTCAAGGTGAAAATGAGAAATTTCGGAATGATTTGAGCTCTCTCCGGGGCGAGGTAGAAAACTTGAAAGCGCAATTGGAGAAGGAAGCTGGCGATAATCTAAAGTTGAGGAACGATCTCAAGATGCTCGAGGACCAGGTGCAAGATCTGACCAAGCAGGTTGATAATGCCAGGGCAGAGAACGAGGCCTTGAAGACGGAGATTCAAGGGGTTAGAGGCGAAGGTGACGGTTTGGCGGCCGAATTGAACGCTCTGAGGGATGAAAGGTCCGCTTTGACGGGCGAGAGGGATCGATTGAAGGAACAACTGAGCGAATGCAACGCGGAAATCGAGAGATTGAAAACAGAGAAGAACCAATTGGAAGAGGAGGTCGACAGGCTGAAGGGCGAGTTAAATACCTCTAGAGAAGAAAATGGCAGGCTGAAGGATGAAATGGAAAAACTGCGAGGACAATTGCAGTCACTGAACGACGAAATGGACGCACTGAAGGCGCAGCTTGAGGAGGCCAGGAACAAAATTCAGTCACTGGAGTCAACGGTCTCGAGTCTTCAAGGTGAAAACgagaaatttcgaaatgattTGAGCTCTCTGCAGGGCGAGGTGGAAAATTTGAAAGCGCAATTGGAGAAGGAAACTGGCGATAATCTGAAGTTGAGAAACGATCTGAAGCAACTGGAGGACCGGGTGCAAGATTTGGCCAAGCAATTGGACAATTCCAGGGCAGAAAACGAGGCGTTGAAGTCGGAGATTCAAGGGGTTAGGGGCGAGGGTGACGGTTTGGCGGTCGAATTGAACGCTCTAAGGGATGAGAATTCGATTCTGAAGGACGAGAGAGATCGATTGAGGAATCAATTTAACGAGTCCAACGCGGAGAACGAGAGACTGAGAGCTGAGAAGGGTCAATTGGAGGCTGAGATCGAGAGACTGAAGGGAGAATTAAATACTTGCAAGGAAGAAAATGGCAGGCTGAAAGACGAAATTGAACAACTGCGAGGACAATTGCAGTCATTGAACGACGATATTAATGGATTGAAGACGCAGCTTGACGAGGCCAGAAACAAAATTCAGACATTGGAGTCAACGGTCTCGAGTCTTCAAGGTGAAAACgagaaatttcgaaatgattTAAGCTCTCTGCAGACCGAGATAAAGAATTCGAGAGCGCAAGCGGACCGGGAAGCTGATGAAAATCTGAAGTTGAAGAATGATCTGAAGATGCTGGAGGACCAGGTGAACGATTTGATCAAGAAATTGGAAAACGCCAGGGCAGAAAACGACGCCTTGAAGGCGGAAAATCAAAATTTGAAGGCGCAGGTGAACGATATGAATCAAGAATTGGCCGGTCTAAGAACAGAGTGCGCGGACCTGAAGTCAGAGATCGCGAATCTGAAGAAGTTAATCGAGGAATTGAAAGCAAAGATTGCTAAACTCGAGGAAGATTTGGAGTACTGGAAATTGGAGAACTGTAAATTTCAGATGGAGGTAGACAAAGTAAAAGCCGAACTTCAGAAAGCTCTGAAGGACTTGAGTGAAGCTCAgGCTCTGAAACAGCAGCTGGAGGCAGAGGTGAATCGTCTCAAGGGCGAGAAGGCCGAGCTTGACAAAAAACTTGCAGACGCAACGACACAGCTCGAGCAACTGAAGAAAGCACTCGAAGCGGACAAATCGGCGAAAGATAAGGCCGAGTCGCAAGTCGCGGCCCTAAGGGCGGAGCTCCATGCATTGAAAGCGGAACTAGAAAAGCTGAGGGCCGAGAACAGCAAGTGCAAAGGTGAGCTGGACGATCTGGCCAGAGTGCTGTCGGCCACGAAGAACGAGCTTGACGCGTGCAAACAAGAGCTGTTCGCATTGAGGGAGGCGAACAACGCTCTGAAGTCAGATTTGGACGCGTTGAAAGGCGTAAAGGACGAGCACAATAAATTAAAGGCGGATGCGGATGCAATGAAAGCGGAGAACGCGAATCTTCAACAAGAGAGACAAAGACTGCAAGACGAGTACGCCAAGTTGAAGGCGGAGGCTGACGGTCAAAAGGTGGAACTGGACAAGCTGAGATCGGACTTGGCGTCGGAGAGAGCAGCCGCGGAAAAGCTGAGAGCAGATCTGCAGAACCGGCAGGCCGAGAACGCCAAGCTGCAGTCAGAACTGAACGAACTGAGAAGTGAATTGGACAGATTGAAAAACGAGAGCGGCAGTTTgcgaaacgaaatgaacgaGCTGAAGAAAACGCTCGAAGCGAAGATCAAGTCCTTGGAGGATCAACTTTCGAGCCTCCTGGCCGAGAAGGAGGAGCTGCTGAAGGAACTGGGTCGTCTGCGCGACGAGGTGGCCAATCTGAAGAACGAGATCGAGAAACAGAAAGCCGCCAAGGAGGCAGCCTTGAGCGAATTGGCGGGCCTGAAGAAGGAGCTGGCCGACCTAGAAAATGTTTTGGCCGGCCTCAGAGCGGCGCTGGACAAGGCCAACGGCGATAACGAAGCGCTGCGAAAGGACAACGAGAAGATGAAGGCAGACCTGGCGAGCGCGAGCAAAGATCTGGAAGCGCTTAGGGGCCAGAACACGCAGTTGGCAGCCGATTTAGCTGGGGCGAAAGCGAAATTGACGGACTTGGAGAAACAGTTCGAAGACTTGAAGAAGGAGAACGGCGATCTGAAGTACAAAGCCGACCGCCTCGAAGACACGTTGACGGAGCTCGAAGCTTTGAAGAAACAATTGGACGGCGCCAATACGGAGCTGGACAAACTCAAACTGGAAAACGCGAAACTGCGGGACGATTTCAATAGGGCGACGGACGAAGCGAATAAGCTGAAGGACAGTTTGGACAAACTGCGAAACGACTACAACGGGCTGCAGGACGAATTATCTAAGACGCAGGATAAATTAACTAAGACGCAGAATGATTTGTCTAAGCTGCAGGACGAAGATGCGAAGCAGAAAGCGCTCGGAGCAACGCAAGTAGGGGATCTGAATAAATTGCAGAAGGAAAATAACGAGCTGCGGGATCAAATCGCGAAGCTGAAGAGTCAGTTGCACGATTGCCGCGAGGAGAGCGATGGACTGCACAAGGAGCTGGGAAATTTGCAGAGGAAGAATGCCAAGCTGATGGAGGCAGCAGAGAGGCCGATGGAGCCAAAGGAGACGGAAATGGACAAAGATATTCTGGATGAATGCGGCGACTTCATTAAGGCGAACGAATTACTGGGGGACAAGTTTGAAAAGCAGAacgaagGCGTTCAACGGGTTCGAGattatatatcatatttggaaGGTAAAGCCGAGGAGCCCAAAATGGCGGATCAATTGGAAGAATCGCATATTGAACCCCAATTAAGGGAAGACATTGCTGGCTTGTTGAAGAAATCTCAGGCGTTATCGAAATCTATTTATGAAACGGAGCAGACAATACAAAATCTTAGTGATATCCTGAAAAAACCAGCGGAGGCCGTGGCTCCAGGAGCCTTCGATCCCGATTCTTGGCTGAGTGCGTTGACGTTGACGCAGTTAGCGGAGCTTCACGATAAGATTTGCCTGTTGACATCGGACATGGTGCAGCAGGATAGCAAAGCGGTGCCATGCAGGTCAGCCCCCGGGCCGGCGAATCCGCAAGCGGATTACAATGTTTTGAACAAGCGGATAGCCGCTTTGCAGAAACAGATAGCAGAGAAGCAAATGGAGGCAGCGTGGAAGATGCAGGAATTGAAGCGGTCCCTTCGAATGGAGCAGGCTAACCTAATTGAAATCTCCGACTATATGAATTTGGAGAGAAAACGAAACTTGGCCCTTCAGTTCACCATTGATGAATCACCATAG
Protein Sequence
MEVCRCGCASTTSLESFNPPNEPCCCCSYNPFSDNSKESEIYDLSFALRKLGVMKCQMKKWRMERLQLESENRSLKQTLQSFGVNADEILKPDPLLVHSREEIERLQNANASLDDKVRDLEEILAERDCCDDPDATIHFLREKMRHLRERFALEKKDMRDIISNLKLKLAQTEEDVSCPAIYRLRAKLRDLMKGQTSLGENDHLFAELDELRRELGGREKREVVETRAMETATVPEYVDVSDLLKKLNDCEDVVGDLERKLEEKDEKIADLLNRLKQFDILQAELEDLKRQLAERESENGKLKTEYDDRTSNLQLQHAGRTSMLQIQMDELLAEIMGLNQTIGTLNKNIDDLNQEIAVLKSRIADLESQLEECCQDRDSVLERQGVLEQELASAMATIKDLEAQIVALTGQLENEKAARSTLEKELEDSRGEIESLRKENSDLKGQIDAASAENDELRDALRILSEEDEQLKAQIEQLRNNIKDLEDLNNALRGDYDNMKQALDNLEAEINRLEEEGRRAADERQALLSEIDGVRKQLEEALAENETLKAELDEAGEQLEKLRAEKDELQKSVDSLRHENDSFKRDMRALREDLDDARRQVDELKASGDALMAADGDKKSEIERLTQEVEGLRSEKDRLTSENADLKAENAELGRKLEDAVNELEQLRAENADLRAEVDRLAQELERARGELDQLKSEMAAVKDALDKCVDELEKLKGENSDLKSLNEALKAEIQGVRGEGDGLAVELNSLKGERDQLKGQLSECNAENERLKTEKSQLEGEVDSLKTEKGQLEAEVEKLKGENAGLKDEIERLREQLQSLNDEMNALKTQLEEAANKIQALESMVSSLQGENEKFRNDLSSLRGEVENLKAQLEKEAGDNLKLRNDLKMLEDQVQDLTKQVDNARAENEALKTEIQGVRGEGDGLAAELNALRDERSALTGERDRLKEQLSECNAEIERLKTEKNQLEEEVDRLKGELNTSREENGRLKDEMEKLRGQLQSLNDEMDALKAQLEEARNKIQSLESTVSSLQGENEKFRNDLSSLQGEVENLKAQLEKETGDNLKLRNDLKQLEDRVQDLAKQLDNSRAENEALKSEIQGVRGEGDGLAVELNALRDENSILKDERDRLRNQFNESNAENERLRAEKGQLEAEIERLKGELNTCKEENGRLKDEIEQLRGQLQSLNDDINGLKTQLDEARNKIQTLESTVSSLQGENEKFRNDLSSLQTEIKNSRAQADREADENLKLKNDLKMLEDQVNDLIKKLENARAENDALKAENQNLKAQVNDMNQELAGLRTECADLKSEIANLKKLIEELKAKIAKLEEDLEYWKLENCKFQMEVDKVKAELQKALKDLSEAQALKQQLEAEVNRLKGEKAELDKKLADATTQLEQLKKALEADKSAKDKAESQVAALRAELHALKAELEKLRAENSKCKGELDDLARVLSATKNELDACKQELFALREANNALKSDLDALKGVKDEHNKLKADADAMKAENANLQQERQRLQDEYAKLKAEADGQKVELDKLRSDLASERAAAEKLRADLQNRQAENAKLQSELNELRSELDRLKNESGSLRNEMNELKKTLEAKIKSLEDQLSSLLAEKEELLKELGRLRDEVANLKNEIEKQKAAKEAALSELAGLKKELADLENVLAGLRAALDKANGDNEALRKDNEKMKADLASASKDLEALRGQNTQLAADLAGAKAKLTDLEKQFEDLKKENGDLKYKADRLEDTLTELEALKKQLDGANTELDKLKLENAKLRDDFNRATDEANKLKDSLDKLRNDYNGLQDELSKTQDKLTKTQNDLSKLQDEDAKQKALGATQVGDLNKLQKENNELRDQIAKLKSQLHDCREESDGLHKELGNLQRKNAKLMEAAERPMEPKETEMDKDILDECGDFIKANELLGDKFEKQNEGVQRVRDYISYLEGKAEEPKMADQLEESHIEPQLREDIAGLLKKSQALSKSIYETEQTIQNLSDILKKPAEAVAPGAFDPDSWLSALTLTQLAELHDKICLLTSDMVQQDSKAVPCRSAPGPANPQADYNVLNKRIAALQKQIAEKQMEAAWKMQELKRSLRMEQANLIEISDYMNLERKRNLALQFTIDESP

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01067355;
90% Identity
iTF_01065989;
80% Identity
-