Basic Information

Gene Symbol
-
Assembly
GCA_010645135.1
Location
WIUT01007039.1:1318-10334[+]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins form homo- or heterodimers. Each monomer contains a highly basic DNA-binding region adjacent to an α-helical region, and the helices of the two monomers fold together as a coiled coil.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 42 0.0097 15 6.5 0.3 30 58 24 52 16 55 0.80
2 42 2.3 3.5e+03 -1.1 0.4 29 63 78 112 75 117 0.74
3 42 0.77 1.2e+03 0.4 1.8 35 61 118 144 106 148 0.86
4 42 0.022 32 5.4 1.5 24 62 162 200 156 204 0.91
5 42 0.00045 0.67 10.8 4.2 24 63 228 267 225 268 0.93
6 42 0.0018 2.6 8.9 5.9 31 63 284 316 270 318 0.72
7 42 0.00058 0.88 10.4 5.5 35 64 319 348 316 349 0.92
8 42 0.013 20 6.1 3.6 26 64 373 411 363 412 0.91
9 42 0.3 4.5e+02 1.7 10.1 16 63 384 431 376 433 0.89
10 42 0.13 1.9e+02 2.9 4.4 31 61 413 443 406 468 0.58
11 42 6e-05 0.09 13.6 8.3 23 65 482 524 476 524 0.91
12 42 0.32 4.9e+02 1.6 3.8 28 61 515 551 513 571 0.78
13 42 0.00037 0.55 11.1 2.8 33 62 572 601 556 603 0.83
14 42 0.0035 5.3 7.9 2.0 34 59 598 623 593 628 0.85
15 42 0.002 2.9 8.7 2.1 28 61 637 670 632 674 0.90
16 42 0.0013 2 9.3 0.1 36 63 673 700 669 702 0.86
17 42 0.07 1.1e+02 3.8 1.7 36 60 698 722 697 726 0.70
18 42 0.0012 1.9 9.4 4.4 24 63 707 749 706 751 0.76
19 42 0.00019 0.29 11.9 1.0 22 63 722 763 721 772 0.77
20 42 0.11 1.7e+02 3.1 0.0 33 63 768 798 763 800 0.79
21 42 0.012 18 6.2 0.6 27 63 783 819 779 821 0.87
22 42 9.6e-06 0.014 16.1 9.5 22 65 827 870 822 870 0.94
23 42 0.00059 0.89 10.4 5.0 34 65 853 884 852 890 0.92
24 42 0.00013 0.19 12.5 1.1 22 63 890 931 888 933 0.92
25 42 0.011 16 6.3 6.3 25 58 949 982 937 986 0.68
26 42 0.0046 7 7.5 3.7 27 64 979 1016 976 1017 0.82
27 42 0.00051 0.76 10.6 0.8 24 57 1011 1044 1008 1048 0.90
28 42 0.00048 0.73 10.7 5.6 22 64 1037 1079 1036 1080 0.88
29 42 0.0067 10 7.0 4.6 26 63 1104 1141 1079 1143 0.82
30 42 0.24 3.6e+02 2.0 0.7 33 61 1146 1174 1143 1178 0.76
31 42 0.0016 2.4 9.0 0.8 24 59 1175 1210 1174 1212 0.90
32 42 0.0057 8.6 7.2 7.9 24 61 1231 1268 1228 1285 0.84
33 42 0.00013 0.2 12.5 5.4 25 63 1288 1326 1285 1328 0.86
34 42 0.0093 14 6.6 9.7 26 65 1345 1384 1331 1384 0.83
35 42 5.9e-05 0.089 13.6 5.9 29 64 1383 1418 1382 1419 0.92
36 42 0.44 6.6e+02 1.2 3.2 31 59 1420 1448 1417 1457 0.80
37 42 0.089 1.3e+02 3.4 2.1 29 60 1463 1494 1457 1499 0.67
38 42 0.00017 0.25 12.1 5.7 24 63 1535 1574 1532 1576 0.92
39 42 0.015 23 5.9 2.1 22 57 1575 1610 1573 1614 0.73
40 42 0.016 24 5.8 1.4 24 54 1612 1642 1608 1644 0.81
41 42 0.00034 0.51 11.2 9.7 24 60 1633 1669 1632 1672 0.92
42 42 2.2 3.3e+03 -1.0 0.1 36 60 1731 1758 1710 1762 0.75
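The rows above each carry thirteen whitespace-separated fields. A minimal Python sketch of reading them and keeping the better-supported hits is given here. It assumes the fields follow the usual HMMER per-domain layout (hit number, total hits, conditional and independent E-values, score, bias, then HMM, alignment, and envelope coordinates, and accuracy). The names `DomainHit`, `parse_hit`, and `I_EVALUE_CUTOFF` are hypothetical, and the 0.1 cutoff is an arbitrary illustration, not a threshold used by this record.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """One row of the domain table (field names are illustrative)."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# One converter per column, in the order the columns appear in a row.
_CASTS = (int, int, float, float, float, float,
          int, int, int, int, int, int, float)


def parse_hit(line: str) -> DomainHit:
    """Convert one whitespace-separated table row into a DomainHit."""
    fields = line.split()
    if len(fields) != len(_CASTS):
        raise ValueError(f"expected {len(_CASTS)} fields, got {len(fields)}")
    return DomainHit(*(cast(value) for cast, value in zip(_CASTS, fields)))


# A few rows copied from the table above.
rows = """\
1 42 0.0097 15 6.5 0.3 30 58 24 52 16 55 0.80
2 42 2.3 3.5e+03 -1.1 0.4 29 63 78 112 75 117 0.74
11 42 6e-05 0.09 13.6 8.3 23 65 482 524 476 524 0.91
22 42 9.6e-06 0.014 16.1 9.5 22 65 827 870 822 870 0.94
35 42 5.9e-05 0.089 13.6 5.9 29 64 1383 1418 1382 1419 0.92
"""

hits = [parse_hit(line) for line in rows.splitlines() if line.strip()]

# Hypothetical cutoff on the independent E-value, chosen only for illustration.
I_EVALUE_CUTOFF = 0.1
strong = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]

for h in strong:
    print(f"hit {h.index}: residues {h.ali_from}-{h.ali_to}, "
          f"score {h.score}, i-Evalue {h.i_evalue}")
```

Filtering on the independent E-value rather than the conditional one is a design choice of this sketch; which value is appropriate depends on how the table was generated.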

Sequence Information

Coding Sequence
ATGAAAGGTCAAGTGGCCGAGCAGCAGGTCTCCAAAGTTGTGGAGAAATCGATTGAAACCTTGGTGGACCTCTCGAAGAGCTGCGACGATCTACGTTTGGAAAACGAACGTCTTTTGGACGAGTTGGCTAACTTGCGTCGTGTGTTGGCTGATTACGAAGGGAAGGAAGTACCGGAATCAATCCTGAGAACAGTCGAAACGGTCACCGTGCCGGAATACGTTGACATTTCAGATCTACTGGACAAGCTGAACAATTGCGAGGATACCGTGGCTGACCTAAGAAAACagttagaagaaaaagacaagCTGATTGATACGTTGAACAAAGAATTGGAATCGATAGTCAGTCAAAAAGATCTAGAAGCCATGAAAGAGGATCTTAGAAGGAAGGAAGACAAGatCGCGGAACTGTTGAACGGCCTAAGACAATCAGAAATTGACCTGCTCGAGCTCTCCACTCTAAAATCCAAACTAAAAGACTTCAAAGAGAAAGTGGCCGACTTACAGTCGAAACTCGACAAGGCGAACCAAGATATTGACGATCTGATAGCTGAGATAGCCAATCTGAGGAACGAGTTGGAAGACTGTAACAAACGAAACGCGGAGCTGCAGGAGTATTGCATGGACAAGGACGCTCTTTCGAAGAAGCTGCGCGACCTGGAGGAGGATCTCGCGGCTGCGAAACTCACAATAACCGATCTCGAGAAAGAGGCGGACTTCTTGAGGAGAGACAAGGAGAATTTGTTGAACGAGCTAGACGAGGCGAGGAAACAGGTGGAGGCGTTGACTGAGCAACTGGAGGACGAGAGGGCGGCCAGGAACGCGTTGGAGAAGGAACTGGAGGATAGCCGAAATGAGATTGAAAAGTTGCAGAAGGAGAATTCGGATCTGAAGGATCAGATCGGCGCTGAGAGGAAGGAGAACGATAAACTTCGCCAGGCATTGGAAGCGTCGAAAGAGCTGGCCGACGAGAACGAAAAGTTAAAGGCTCGGCTGGAGCAGCTGAAGAACGAGAACGACAGCCTGACGCAGAGCATGAAGGAGCTGAACGATTTGAATAATCAGCTGAGAAACGACTACGATAGTATGAAACGGACGATGGGTAATTTTCAAGCGGAGATCGACAAACTGGCGGACGAGTTGGCCAACGCCGAACAGAAACGCGACGCGTTGTTGAACGAGAATAACACTATCAGAAAGCagctcgaacgaacgattgcGGAAAACGAGAGTCTGAGAGCCGAACTGGACGAGGCTGGCGAACAACTCGACAAACTGAGATCGGAGAAAAGCGAGCTGTTTAAGAGCCTCGACGAGATGAAGCTCGAGAACGATTCGTTGAAGCGGGATATGAAGGCTTTAAGGGACGACCTTGAGGATTCTAGGGGGCAAGTGAAGGAGCTGAAAGCCGCTGGCGATGCGTTAAGGGCGGCGGATAAGGATAAGAAACTCGAACTCGCCGAACTGGAACAACGAGTGGAGAGCTTCAAGTCCGAGAAGGATCGCTTGACGAAGGAGAACGACGACTTGAGAAACAGAAACATGGAATTGCAACGGAGATTGGAAGAGCTGGATCAGATAAAGGGGGAAAATGCAGATTTACTTGCTGAAATGGATCGTTCGAGAAAAGAGTTGGATAAAACCTTGGAGGACATTGATCAGTTAAAATCCGAAATAGGTTCCCTGAAGGACGGACTGGAAAATTGCGTGGGCGAAATGGAAAAACTGAAAACCGAGAACAATGACCTGAAGAAGGAGAACGAGTCCCTGAAGTCCGAAATTCAGGGAATTGCCAATCGCTTgatgaaagaaaacgacaGTTTGAAAGATGAAATTGCGGAATTGGAGAAAAAGCTGACGGAATTGGATGAACTGAAGGGAGAAAATGCCGATTTGCTCGGCGAACTAGATcgtttgaaacagaaattggAGGAAACCTGGAAGGAGGTTGACCAATTAAAATCCGAGGCAAGTTCGTTGAAGGACGCGCTCGACAAGTG
TGTAGACGAGATGGGGAAGTTACGAACTGAGAATGATGATCTTAAATCGGAAAATCAAGCTTTGAAGTCCGATATTCAAGGACTCGGCGATCGTTTAACGAAGGGCGACGCCGATTTGAAAGCGAGAAACGAGGAACTGCGACAAAAATTAGGAGAGTTGGACAAACTGAGGTCGGAAAACGCGGATTTGCACGGCGAGGTCGATCATTTGAGACGCGAGGTGGAAAAACTTTTAGTGGTTATCGATCAATTGAAATCCGAGGTAGCTTCTTTGAAAGACGCGCTGGATAAGTGTGTCGGCGAGATGGAGAAGCTGAGAAGCGAGAACAATGGTTTGAAGTTTGAAATTCAGGGGATGAAAGGTGAAGGCGATAGTCTAGCCGTGGAGTTAAATAATCTGAAGAACGAGATTTCCACTTTGAAAGAGGATAGGGATCAATTGAGCAAGCAATTGAGCGACAATAAGACGGACAACGAGAAACTGCGAGCGGACAGCGAGAAACTACGAGCGGAAAAGGCTCGAGTTGAAGCCGAAAACGAGAAACTGAGAGAAGAGATAAATTCCTGCAAGCAGgagaatgataaattaaaagacGAACTTGCAGAATTACGAGAACAGTTGCAATCGTTGAACGacgaattgaataaattaaagtcAGACCTCGATAAATCCGAGGAGAAAATTCGGTCTCTGGAACCGTTGGTCTCTCGTTTACAGAGTGAAAAcgataaattacgaaatgatTTGACAGATTTGGGGAACGAGGCGAACGATTTGAAAGCAAAGATGCACAAAGAAACTGCCGACAACGAAAAAATGCGGAACGACTTGAAGATATTGGAGGATCAGGTGCAAGATCTGAATAAGAAGTTGAACAATGCCAGGACAGAAAACGATGCATTGAAACAGGAGAATCAAGATCTCAAAGCAAAGTTATTGAATACGGATCAAGATTTATGGAATTTGAAAGCGGAATGTGCCGAACTGAAACAAGAGATTGCTGACCTGAAGAAATTAGTTGACGagttaaaggaaaaaatcgcTAAATTGGAAGCAGACGTGGATCattggaaaatggaaaattgtaagCTTCAGTTAGAGATTGATAAATTGAGAGCTGATCTTGAGGGAGCGTTGAAAGACGTGAGCGAGTGTAAggCTAGAAATAACGCACTACAAGAAGAGTTAAACCatttgaagaatgaaaaagataagCTTGATCAAAAGCTAGCTGATTTAACGTCGCAATTCGAACGACAAAAGAAGACATTGGAAGcagaaaaatcggcgaaagATAAGGGTGATTCAGAAATCATAGACCTCAAAGATGAACTTGATGCATTGAAAAAGGAACTAGAAAAATTGAGAGCAGACAATAATAGATACAGAAACGAACTGGACGATCTAGGTAGACAGCTTTCAGCGGCAAGGAACGAACTAGAAAAGTACAAAGACGAGATCCTCGCGTTAAGAGACGCCAACAACGCGTTAAAATCAGAGCTGGATCTCCTGAAAAGTTTAAAGGATGAGTAcagtaaattgaaaattgatctaGACGCTCTGAAAGCGGAAAATGCGAACCTTCTGCAGGATAGAAAAAGTCTCGAAGACGAGTATGGCAAGTTGAGAGGAGAAGGCGATGGTCAGAAAATAGAGATCGATAGACTAAGATCTAACTTAGACGCAGAAAAGGAAGCTACGGAAAGATTAAGAGTAGATCTACAAAATTGCCAAGCTGAGAATGATAAGCTGCAAAATGAATTAAACGCGATGACAGACGAGCtgggaaaattaataaaggaaAACGATCGTACGAAGAACGAGATCGAGGAGTTGAAGAAGGCGTTGGCAGCTGCAGAAGCGAAGGTAAAATCTCTGGAAGATGAATTTTCCGATTTATTAGCCGAGAAAGAAGAGTTGATCAACGAACTTGATCGTCTCCGGCAAGAGCTGAACAATCTTCGAAACGAGGCAGAGAAGCAGATGACCGCGAAAGACG
CAGCCGCGAAGGAGCTAGCCGCCCTGAAGGACGAACTGGCTGCCCTAAAAGCGGCGTTGGACGAGACCCGCCGCGAAAACGAAACGCTGAAGAATGAGAACGAGAGGCTGAATGCGGAATTAGCAAAGTTGAATAAGCAATTGGAAGCCTTGAAAGATGAGAATGCAAAGCTGGAAGACGAAAATGCGAATCTGAAGGATGAGAATGCGAAATTGACGGCGGAGTTAGCCGAAACGAAGAACAAATTGACGCAGGCGGAGAAACAGCTGAACGATTCGAGGAATGAGAACGATGACCTGAGGAAGAAACTGGCTGATCTCGAGAGCACGGTGAAGGAGCTGGAACCTCTAAAGAAACAATTAGAAGATGCTAAGAACGTGCTAGATAGATTGAGCCCGGAGCtagataaattgaaatcggAGAATGCAAAATTGCAGGAAGATTTGAATAACGCGACGAATGAATCGAACAGGCTGACGAACGATTTGGACAGATTGAAAGTCGATTATAAATTAGCACTTGCTGAACTGGCTGACTTGAGGAATGAGAAAGATATGCAAAAAGAACGCGACGACGCGTCAGAAGAAGAACTAATCAAAATAAAGGAAGAGAATGTGAAACTGAAAAGCGATTTGGATGATTGTCAAGTGGAGAATGAAAGATTGCGCAAAGAATTGCAAGacttgaaattgcaaaatgcaGGATTGCAGAACGATTTAAACAAGGCGAAGAACGAAGCGAGTAAATTAAAGAGCGATTTGGATAAATTGAAAAGCGATTACGACGACTTACGGTCAGAGTTAGATAAACTAAGGAATGAGAAGAATGGACATAAGGAACGCGATGCTACGTTAAACGGGGATCTTGATAAATtgaggaaagaaaatgatgaGTTGAAAGATGGGAATGAGAAACTGAAAAGCCAATTATTCGATTGCCAAGCAGAGAATGAAAGACTGCATGAGGAACTGGAAAAGTTGAAGAAGGAAAATGCTGAACTGAAGCAAGCGGAGAAGATAGAGCCTAGGAAAATGGACATGGATAAAGATATTCTAGAAGACTGCAGCGACTTCATTAAAGCGAACGAATTACtagagaagaaatttgaaaagcaAAACGAAGGTGTGCAACGCATTcggaattatattaaatatttggaagGTAAAATTAAAGAACCTAAAATGGCAGACAAGCCAGAGGAGGAACCAGAAATTGATTCAGAATTGAGAAAAGACATTGCAGACTTATTAAGAAAATCGCATAATCTGTCCAATGATATTTATCAAACGGAacaggaaataaaaaatgtcgGGgTTAAACCTTACGAAGGAACCTTCGATCCTGACAGTTGGCTGAATTCGTTGACACTGACACAGTTGGCAGAGCTTCACGATAAGATCTGTCAGTTGACATCGGACATGGTACACCAGGACAGCAAAGCAGTGCCTTGCGATTCATCTGGACCAGCTATTCCATCACAGGCGGATTACATTATTTTGAACCAGCGTATAGCTGCTTTGCAGAAACAGATAGCGAAGAAGCAAATAGAGACAGACTGGAAGCTACAAGAGCTGAGGAGAGTTCTTCGGAACGAGCAGGCTAACTTAATTCGAATCTCCGATCAGATGAACTTAGAGAGAAAACGCAATTTGAATTTACAGTACAGCATGGATGATTCACCTTAA
Protein Sequence
MKGQVAEQQVSKVVEKSIETLVDLSKSCDDLRLENERLLDELANLRRVLADYEGKEVPESILRTVETVTVPEYVDISDLLDKLNNCEDTVADLRKQLEEKDKLIDTLNKELESIVSQKDLEAMKEDLRRKEDKIAELLNGLRQSEIDLLELSTLKSKLKDFKEKVADLQSKLDKANQDIDDLIAEIANLRNELEDCNKRNAELQEYCMDKDALSKKLRDLEEDLAAAKLTITDLEKEADFLRRDKENLLNELDEARKQVEALTEQLEDERAARNALEKELEDSRNEIEKLQKENSDLKDQIGAERKENDKLRQALEASKELADENEKLKARLEQLKNENDSLTQSMKELNDLNNQLRNDYDSMKRTMGNFQAEIDKLADELANAEQKRDALLNENNTIRKQLERTIAENESLRAELDEAGEQLDKLRSEKSELFKSLDEMKLENDSLKRDMKALRDDLEDSRGQVKELKAAGDALRAADKDKKLELAELEQRVESFKSEKDRLTKENDDLRNRNMELQRRLEELDQIKGENADLLAEMDRSRKELDKTLEDIDQLKSEIGSLKDGLENCVGEMEKLKTENNDLKKENESLKSEIQGIANRLMKENDSLKDEIAELEKKLTELDELKGENADLLGELDRLKQKLEETWKEVDQLKSEASSLKDALDKCVDEMGKLRTENDDLKSENQALKSDIQGLGDRLTKGDADLKARNEELRQKLGELDKLRSENADLHGEVDHLRREVEKLLVVIDQLKSEVASLKDALDKCVGEMEKLRSENNGLKFEIQGMKGEGDSLAVELNNLKNEISTLKEDRDQLSKQLSDNKTDNEKLRADSEKLRAEKARVEAENEKLREEINSCKQENDKLKDELAELREQLQSLNDELNKLKSDLDKSEEKIRSLEPLVSRLQSENDKLRNDLTDLGNEANDLKAKMHKETADNEKMRNDLKILEDQVQDLNKKLNNARTENDALKQENQDLKAKLLNTDQDLWNLKAECAELKQEIADLKKLVDELKEKIAKLEADVDHWKMENCKLQLEIDKLRADLEGALKDVSECKARNNALQEELNHLKNEKDKLDQKLADLTSQFERQKKTLEAEKSAKDKGDSEIIDLKDELDALKKELEKLRADNNRYRNELDDLGRQLSAARNELEKYKDEILALRDANNALKSELDLLKSLKDEYSKLKIDLDALKAENANLLQDRKSLEDEYGKLRGEGDGQKIEIDRLRSNLDAEKEATERLRVDLQNCQAENDKLQNELNAMTDELGKLIKENDRTKNEIEELKKALAAAEAKVKSLEDEFSDLLAEKEELINELDRLRQELNNLRNEAEKQMTAKDAAAKELAALKDELAALKAALDETRRENETLKNENERLNAELAKLNKQLEALKDENAKLEDENANLKDENAKLTAELAETKNKLTQAEKQLNDSRNENDDLRKKLADLESTVKELEPLKKQLEDAKNVLDRLSPELDKLKSENAKLQEDLNNATNESNRLTNDLDRLKVDYKLALAELADLRNEKDMQKERDDASEEELIKIKEENVKLKSDLDDCQVENERLRKELQDLKLQNAGLQNDLNKAKNEASKLKSDLDKLKSDYDDLRSELDKLRNEKNGHKERDATLNGDLDKLRKENDELKDGNEKLKSQLFDCQAENERLHEELEKLKKENAELKQAEKIEPRKMDMDKDILEDCSDFIKANELLEKKFEKQNEGVQRIRNYIKYLEGKIKEPKMADKPEEEPEIDSELRKDIADLLRKSHNLSNDIYQTEQEIKNVGVKPYEGTFDPDSWLNSLTLTQLAELHDKICQLTSDMVHQDSKAVPCDSSGPAIPSQADYIILNQRIAALQKQIAKKQIETDWKLQELRRVLRNEQANLIRISDQMNLERKRNLNLQYSMDDSP
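The protein sequence should be the translation of the coding sequence listed before it. The Python sketch below shows one way to check that, assuming the standard genetic code (NCBI translation table 1), which this record does not state explicitly. The helper name `translate` is hypothetical. Whitespace is removed and letters are upper-cased first, since the coding sequence is wrapped across several lines and contains lower-case stretches whose meaning the record does not explain.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Join wrapped lines and normalise case before reading codons.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        # Codons with ambiguous bases are reported as X.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last codons of the coding sequence in this record.
cds_start = "ATGAAAGGTCAAGTGGCCGAGCAG"
cds_end = "ATGGATGATTCACCTTAA"

print(translate(cds_start))  # MKGQVAEQ, the first residues of the protein
print(translate(cds_end))    # MDDSP, the last residues of the protein
```

Passing the full coding sequence to `translate` and comparing the result with the protein sequence would extend the same check to the whole entry.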

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01420680;
90% Identity
iTF_01420680;
80% Identity
-