Basic Information

Gene Symbol
-
Assembly
GCA_013123115.1
Location
WUUM01000019.1:4712038-4732329[-]
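
The location string above appears to follow a `contig:start-end[strand]` layout. The sketch below splits it into fields; the names `Location` and `parse_location` are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Assumed layout: <contig>:<start>-<end>[<strand>], with strand "+" or "-".
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


loc = parse_location("WUUM01000019.1:4712038-4732329[-]")
# Genomic span covered by the locus, assuming 1-based inclusive coordinates.
span = loc.end - loc.start + 1
print(loc, span)
```

The span covers the whole locus on the contig, so it is expected to be longer than the coding sequence if the gene contains introns.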

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA-binding regions adjacent to α-helical regions that fold together as coiled coils.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 41 0.14 1e+02 2.8 0.6 25 50 62 87 57 94 0.62
2 41 0.049 37 4.2 2.7 33 56 108 131 105 133 0.73
3 41 1.4 1.1e+03 -0.4 0.2 36 60 156 180 147 184 0.80
4 41 1.4e-05 0.011 15.6 1.3 30 61 222 253 214 256 0.91
5 41 0.8 6e+02 0.4 1.4 27 50 279 302 277 321 0.81
6 41 2.6 2e+03 -1.3 0.1 37 59 327 349 306 352 0.66
7 41 0.011 8.1 6.4 1.4 25 57 363 395 361 402 0.88
8 41 0.1 77 3.2 1.6 25 59 423 457 416 463 0.63
9 41 1.9 1.4e+03 -0.8 4.9 25 57 437 479 433 490 0.48
10 41 0.001 0.77 9.6 3.3 24 63 488 527 486 529 0.92
11 41 0.22 1.7e+02 2.1 10.8 26 64 532 570 529 578 0.77
12 41 0.036 27 4.7 6.0 23 56 581 614 572 623 0.71
13 41 0.017 13 5.7 7.5 23 63 623 663 621 665 0.88
14 41 0.0032 2.4 8.0 0.8 26 63 668 705 665 713 0.83
15 41 9.4e-06 0.007 16.2 7.5 24 64 743 783 742 784 0.93
16 41 0.0026 2 8.3 8.8 24 65 771 812 771 819 0.93
17 41 0.0079 5.9 6.8 6.7 22 64 790 832 788 833 0.93
18 41 0.0061 4.6 7.1 5.9 26 64 829 867 818 874 0.71
19 41 0.0036 2.7 7.9 7.9 23 64 871 912 866 913 0.78
20 41 0.00016 0.12 12.2 2.7 29 62 898 931 897 934 0.91
21 41 1.3e-05 0.0095 15.7 8.0 27 65 910 948 907 948 0.93
22 41 0.00039 0.29 11.0 10.9 26 65 923 962 919 962 0.87
23 41 4.2e-05 0.032 14.1 2.3 24 63 970 1009 967 1010 0.94
24 41 0.0036 2.7 7.9 5.6 25 63 1027 1065 1014 1067 0.80
25 41 0.76 5.7e+02 0.4 0.2 31 56 1061 1086 1056 1090 0.68
26 41 0.0058 4.3 7.2 2.3 25 64 1083 1122 1080 1123 0.91
27 41 0.15 1.2e+02 2.6 6.2 30 63 1137 1170 1128 1175 0.66
28 41 0.0049 3.7 7.4 6.6 25 64 1181 1220 1173 1221 0.77
29 41 0.0015 1.1 9.1 1.3 32 64 1230 1262 1222 1263 0.90
30 41 0.00017 0.13 12.1 4.4 24 59 1257 1292 1254 1296 0.92
31 41 0.022 17 5.3 2.7 24 53 1313 1342 1310 1344 0.87
32 41 0.0045 3.3 7.6 7.4 21 59 1331 1369 1331 1374 0.85
33 41 5.4e-05 0.041 13.7 5.8 27 63 1372 1408 1368 1410 0.91
34 41 0.033 25 4.8 6.7 19 60 1413 1454 1411 1463 0.65
35 41 0.0069 5.2 7.0 0.6 33 64 1455 1486 1452 1487 0.89
36 41 0.008 6 6.8 2.1 31 63 1488 1520 1485 1525 0.84
37 41 0.0036 2.7 7.9 5.6 31 63 1526 1558 1516 1560 0.69
38 41 0.027 21 5.1 4.1 22 61 1566 1605 1565 1608 0.87
39 41 3.4e-05 0.025 14.4 8.4 20 65 1606 1651 1603 1651 0.93
40 41 0.00063 0.47 10.3 11.0 24 64 1617 1657 1617 1671 0.91
41 41 5.2 3.9e+03 -2.2 0.6 31 55 1845 1855 1837 1880 0.45
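
Each row of the domain table above carries 13 whitespace-separated values: domain index, domain count, conditional and independent E-values, score, bias, three coordinate pairs (HMM, alignment, envelope) and accuracy. A minimal sketch for reading such rows and keeping the better-supported hits follows. The names `DomainHit` and `parse_domain_rows` are hypothetical, the four sample rows are copied from the table, and the 0.01 independent E-value cutoff is only an illustrative choice, not a threshold used by the database.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_domain_rows(text: str) -> List[DomainHit]:
    """Parse 13-column domain rows; header and malformed lines are skipped."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 13 or not fields[0].isdigit():
            continue
        hits.append(
            DomainHit(
                int(fields[0]),
                int(fields[1]),
                float(fields[2]),
                float(fields[3]),
                float(fields[4]),
                float(fields[5]),
                *(int(f) for f in fields[6:12]),
                float(fields[12]),
            )
        )
    return hits


# Rows 4, 15, 21 and 41 copied from the table above.
SAMPLE = """\
4 41 1.4e-05 0.011 15.6 1.3 30 61 222 253 214 256 0.91
15 41 9.4e-06 0.007 16.2 7.5 24 64 743 783 742 784 0.93
21 41 1.3e-05 0.0095 15.7 8.0 27 65 910 948 907 948 0.93
41 41 5.2 3.9e+03 -2.2 0.6 31 55 1845 1855 1837 1880 0.45
"""

hits = parse_domain_rows(SAMPLE)
# Illustrative cutoff only: keep domains with independent E-value below 0.01.
strong = [h.index for h in hits if h.i_evalue < 0.01]
print(strong)
```

Filtering on the independent E-value rather than the conditional one is a conservative choice here, since the independent value does not assume the sequence is already a true family member.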

Sequence Information

Coding Sequence
ATGGAGTCTCAATATTCATCGTCGTTTTGTCGGTGTGGATGCAACGCATCGTCGTCACGTGAATCGATTGATCCACCGAATGAGCCATGTTGTTGCTGCAGTTACAATCCCTTCAGCGACAGGTCGAAAGACTCCGAGATCTACGACCTCTCGTTTGCCCTGAGGAAGCTCACCGTAATGAAGTGTCAGATGAAGAAGTGGCGAATGGAACGACTTCAGCTCGAGAGCGAAAATAGGTCTCTGAAACAAGCCCTCCAGTCATTCGATGTGAACGCGGATGAGATATTGAAGCCTGACCCGTTGTTGGTGCACTCCCGAGAAGAAATCGAAAGGCTGCAAAATGAAAACGCGGCGCTCGAAGACAAAGTGAGGGATCTAGAAGAAACTCTTGCCGAACGAGATTGCTGTGACGATCCTTGCGCGTCGGTTCAATGCATCAGAGCGAAGATGGCACATATGCGGGAGCGTTTTGCTCTCGAAAAGAAAGAATTAAGGGACATGATATCgcatttgaaattaaaactcGCGCAGACCGAGGAGGACGTCAGCTGTCCAGCGATATATCGTTTGAGGGCGAAGCTGCGCGAACTCATGAAAGGAGGCCAAACAGCAGATCAGCAGGTCTCAAAAGTAGTAGAGAGGTCGATAGAAACGTTGGTGGACCTGTCGAAAAGTTGCGAGGATCTGCGTGTTGAAAATGAACGACTCCTGGCCGAAGTGGCTGAGTTACGGCGCGCTTTGGCAGCTCTCGAGGAGAAAGAAATTCCTGTGGTGACGGAACGATTGTTACCGCAACAAACGGTCGAAACAACTACGGTTCCGGAGTACGTTGATATATCCGAGTTGTTGGCGAAATTAGAAGATTGCGAGAAAGTTGTTTCCGATTTGAGAAAGAAACTAGAGGAGAAGGATACGCTTATCGATTCGCTAAATAAAGAACTTGGAAAGATGATCGATCAGCAACAgttattcgacgaaattgcTGCCATGAAAGCGGAACTTCAGAGCAAAAATGACAAGatcAGAGACCTCTTGAATGATTTGAGGCAGTCGGAAATAGATCTGCTCGGTTTGGGAACATTAAGGTCTGAGATTGATAACTTGAAACCACAGTTATCTAATCTCGAAGTGGAACGGGACAAACTTTTAGACGAACTAACTGAACTGCGAAAAGCACTGGCCGACAGGAACAATCAAATAATCGACATACTGCAACAGAAGAATCAGCTAGAGAAAGTTTACGATGATAAGATTGCGGAAATGTTAGCTCAAGAGGGACACCTGAACAAACAGATCGACGATTTGAAGATGCAAATAGCGAACTTGCAGAACGAACTCGACGAGTGCAATAAGCGAAACGCGAATCTCGAAGAGTGTTGCTCGGACAGAGACGatctttcgaaaaaattgcaagCCATGGAGGAAGAGATGACGTCCCTGAGGAACGAGCTCGCAACGGCGAAAGCCACGATAGAAAATCTCGAAGCAGAAATGGGCGCTCTGAGGGAGGACAAGgagaaattgttgaaagaaTTAAACGATACCAAGGAACAAGTCGTGGCGCTGACCGGTCAACTGGCGGAAGCGACAGCGGCCAAGGACGCTTTACAGAAGGAGCTAGAAAATAACTTGGACGAGATGGAAAAGCTACGGAAGGAGAATTTGGAACTGAAGGATCAGCTCGATAGCGCGAGAGAAGAAAACAGGAAGCTCCAAGAGGCGGCGGAAGCGTTGAAGAACATGGTCGAGGAGAACGAAAAGTTGAAAGCGGCGCTCGAGGATTTGAAGAGGGAAAACAAGGAATTGGCGGACGAGATCGCGAAGCTGAACGATTTAAATAACGTCTTGAAGAGCGATTACGCAAACATGAAACAGGCGTTGGATAATCTGGAGGCAGAGGTCAATAGACTCGAGGAACAATTAAATAAGACGATGCAGGAGCGCGATGCGCTGCTGGAAGAAAACGGCAACATCAAGAAACAGCTGGAAGACGCGACAGC
GGAGAACGAAGATTTGAAAGCCGAATTGAATAACATCGGTGAACAGTTCGATAAATTGACATCGGAGAGGGATCAGCTAAAAAAGGATCTTGATGCGATGAAGGTCGAGAATGATTCGCTGAAAGGGGATATAGAGGCGTTCAAAAGAGACATTGAAAACGCGAAGAACGAAGCGGACAAGCTGAGAGCCGATGCTGATGCGTTAAGAGCAGCGGATGAGAACGTGAAATCCGAAATGGAAGCACTGGAGGCGCAAGTGGAAAATCTGAAAACCGAGAAAAGTCGTTTGGAGAACGAGAACGCGAATTTGAAAGCGAAAAATATAGAGCTGGAACAGAAACTCGACGACTACGCGAAAGAAGttgagaaaatgaaattagaaaatgatCGACTGCTCGCTGAATTGGAACGTCTGAACGCAGAATTGGAGCGTTTGAATGGCGAATTGGCGAAAAGTAAGAACGAAATGGAAGGATTAAAATCGGAAATGGCTGCTTTGAAGGAAGCCATCGACAAGTGCGTGGACGAGATGGAGAAACTGAAAACCGAAAACAACGATCTTAGATCGAAGAACGACGCCCTCGAggcagaaaatgaaaaacgtaaAGCCGAAATCGATAAAAAGGATAGTTTGCTCGACGAATTAAGTAAGTTGAAGCAGGAACAGACTGCTTTGCAGAACGAGAGGGATCAATTGAGCAAACAGCTGGACGATTACATCGCTGATAACGAAAAGCTGAGGACAGAGAAGAGTCAGTTGGAGGCAGAGAATGAGAAGCTGAGAGGTGAGGTAAACACGTGCAGGGAAGAGAACCAGAAACTGAAAGCTGAGATTGAACAATTGCAAAGTCAATTGAAATCGTTGAACGACGAATTAAACAAGCTGAAGCCTCAGCTCGACGACGCCAAGAATAAAATTCGGTCCCTCGAGGACCAGTTGACTCGTTTACAAGGCGAAAATGATAGATTggtaaacgatttgaaagctCTGGAAAGCGAAGTGAACGACTTGAAGTCGAAACTAGCGGAGGAGACGAGCGCCAACCAAAAAATGAGGGACGACTTAGCGATGTTGGAGGACAAGCTACAGAATTTAAGCGCGGAACTCGACAAGGCCAGGGCTGAAAATAACGCTTTGAAGGAGGAGAATAAAAGCCTTCGAGGCGAGATGTTGGACCAGGAACAAATGTTATCGAGCTTGAAAGCAGAGGACGCTGACATGAAATCCGAGATCGCCAATTTAAAGAAGTTAATTGCCGATTTGCAAGCACAGATTGCCAAGTTGGAGGCTGACGTCGAACATTGGAAATTGGAGAACTGTAAGGTTCAGATGGAACTGGATAAATTAAAGGCTGACCTCGAGAAAGCGTTGAAAGATTTAAGCGAATGCCAGACATTGAAGAAGGCTTGCGAAGAAAAGTTAAATCAACTTCAGAACGAAAAAGCCGACCTTGATAAGAAGCTTGCCGATTTGACGTCGCAAATTGAGCAGCAAAAGAAAGATCTCGAAGCGATAAAGTCAACGAAAGATAAAGACGATGCAGAACTATTGGCGTTGAAGGCAGAGCTCGAAGCACTGAAGAAAGAATTAGAGAACCTCAGAGCAGAGAATAACAAGTGCAAGAACAACGTAGACGATTTGACGAGGCAGCTCTCAGCGCTGAAGAGCGACCTCGATGCGTGCAGAGAAGAAGTCTCTGCGCTGAGAGATCATAACGATAAATTGAAGGCTGAAGTAAACGCGTTGACCGTCGAGAAGAACAATCTGAAAGATGAACGCGATAAGCTGCAGTCCGAAGTAGACGCTCTGAAAGCGGAGAACGCGAGTCTCGATCAGGGTCGAAAGAGATTAGAAGAAGAGTCTAACAAATTGAGGGGCGAAGGTGACGGGCAGAGACTGGAGATCGATAAATTGAAGTCGGATCTAGCTGCAGAGAAGGCAGCGTCCGACAAGCTACAGTCAGACTTACGGAATTGCCAATCAGAAAATGACAAATTGC
AAGAGCAGTTAAGCAAAATGAAGCAAGACCTAGAGAGGCTCAACGAGGAGAGTGGTCGGTTGAACAAAGAGatcgaagaattaaaaaaggCTCTCGCAGCTGCTGAAGCTACGGCGAAGACGTTGGAAGGTCAACTCTCAGATTTACAAGCTGAGAAACAAGAATTACTGAACGAGCTCTACCGTCTTCGCGACGAGTTGGGCAATCTTAGGGAAGAAATGGAAAAACAGACAGCCGCGAGAGACGCGGCCGTGAAAGAGTTGGCCGAGTTGAAGCAGGAACTGGACGATTTAATAGAAGCGTTGAACAAAGTTCGCGCTGAAAATCAAGCATTGCAAAACGAAAACGAGAAGCTGAGAACAGACATAGCGAAAATGAACGATGAACTGGAAACGTTGAGAAACGAGAACGCGAGAATGACAACGGACTTAGCTACGATGAAAAACAAACTAACGGAGGCAGAAGGTCGGATCAACGATCTGAAGAACGAGAGtgataatttgaaaaagaagatTGCCGATCTTGAGAATACAGTGAAGGAGTTGGAAGCTTTGAAGAAACAATTAGAAGATTGTAGGGCAGAGGTGGCCAGATTGATTTCGGAGTTAGATAAATTGAAGTCGGAGAAAGAAAGGCTGCAGAACGAGCTGGACAACGCGGGAAATGAGTTGAACAGCTTAAAAGATGATTTGGAAAAGTTGAAAACAGCGAACGATACGTTGAAGGCCGATCTGGAGAAGCTGAAGACAGCCAATGATACGTTAAAGGGCGATTTGGAAAAGCTGAAGACCGCCAACGATAAATTGAAGAATGAGAAGGACAAACAGAACGACTATGATGCGACGTTGAAGGCTGACATagagaaaatgaagaaagaaaACGATGAGCTCAGATCTGAGAATGCCAGGTTGAAAAGCGAGCTGCAAAATTGCCGCGAAGAAAATGAGAAGTTGCGTGGAGAATTGGAAAACCTGAAGGATGAACATGCCACGCTAAAAGCACATTCAGAGCGCAAATTGGAGATGAAGGAGACGGAAATGGACAAAGACATTGTGGACGATTGCGGTGACTTCATTAAGACAAACGATTTTCTGAAgacgaaatttgaaaaacagaaCGAAGGCATACAACGTGTACGAGATTATATCAGTTATTTGGAAGGTAAAGTTAAAGATGAACCCAAAATGGCAGAAAAGATGGACGACGATGTGAGCATCGATCCGAAAATGAAACAAGACATAGCAGATTTGTTgaataaatcgaaaacattgtccgaaaatatttatcgaacagaacaggaaatacaaaatattgctaagatattgaaacaatttcagGATGCAGGCCTTGGCGAGAGACCCACAATAGCTTTCGATGCTGATTCTTGGCTGAATTCATTGACGTTGACGCAGTTGGCGGATCTTCACGATAAGATTTGCCTACTGACATCGGATATGGTGCAACAAGACTCTCCAAAGGCTGTCAGCTGCGGCACGGGTAGTCCATTAAAAGTGGATTGCTACAATACTTTGAATCAAAGGATAGCAGCGTTGCAGAAGCAGATAGCGGAGAAACAAATAGAAGCAGGATGGAGGCTGCAAGAACTGAGACGAGCTCTTCGGCAGGAACAAAataatctcattttaatctcTGATCAAATGAATTTAGAGAGAAAACGCAATTTGGATATTCAGTTAAGAATGGACGATGCACCTTAA
Protein Sequence
MESQYSSSFCRCGCNASSSRESIDPPNEPCCCCSYNPFSDRSKDSEIYDLSFALRKLTVMKCQMKKWRMERLQLESENRSLKQALQSFDVNADEILKPDPLLVHSREEIERLQNENAALEDKVRDLEETLAERDCCDDPCASVQCIRAKMAHMRERFALEKKELRDMISHLKLKLAQTEEDVSCPAIYRLRAKLRELMKGGQTADQQVSKVVERSIETLVDLSKSCEDLRVENERLLAEVAELRRALAALEEKEIPVVTERLLPQQTVETTTVPEYVDISELLAKLEDCEKVVSDLRKKLEEKDTLIDSLNKELGKMIDQQQLFDEIAAMKAELQSKNDKIRDLLNDLRQSEIDLLGLGTLRSEIDNLKPQLSNLEVERDKLLDELTELRKALADRNNQIIDILQQKNQLEKVYDDKIAEMLAQEGHLNKQIDDLKMQIANLQNELDECNKRNANLEECCSDRDDLSKKLQAMEEEMTSLRNELATAKATIENLEAEMGALREDKEKLLKELNDTKEQVVALTGQLAEATAAKDALQKELENNLDEMEKLRKENLELKDQLDSAREENRKLQEAAEALKNMVEENEKLKAALEDLKRENKELADEIAKLNDLNNVLKSDYANMKQALDNLEAEVNRLEEQLNKTMQERDALLEENGNIKKQLEDATAENEDLKAELNNIGEQFDKLTSERDQLKKDLDAMKVENDSLKGDIEAFKRDIENAKNEADKLRADADALRAADENVKSEMEALEAQVENLKTEKSRLENENANLKAKNIELEQKLDDYAKEVEKMKLENDRLLAELERLNAELERLNGELAKSKNEMEGLKSEMAALKEAIDKCVDEMEKLKTENNDLRSKNDALEAENEKRKAEIDKKDSLLDELSKLKQEQTALQNERDQLSKQLDDYIADNEKLRTEKSQLEAENEKLRGEVNTCREENQKLKAEIEQLQSQLKSLNDELNKLKPQLDDAKNKIRSLEDQLTRLQGENDRLVNDLKALESEVNDLKSKLAEETSANQKMRDDLAMLEDKLQNLSAELDKARAENNALKEENKSLRGEMLDQEQMLSSLKAEDADMKSEIANLKKLIADLQAQIAKLEADVEHWKLENCKVQMELDKLKADLEKALKDLSECQTLKKACEEKLNQLQNEKADLDKKLADLTSQIEQQKKDLEAIKSTKDKDDAELLALKAELEALKKELENLRAENNKCKNNVDDLTRQLSALKSDLDACREEVSALRDHNDKLKAEVNALTVEKNNLKDERDKLQSEVDALKAENASLDQGRKRLEEESNKLRGEGDGQRLEIDKLKSDLAAEKAASDKLQSDLRNCQSENDKLQEQLSKMKQDLERLNEESGRLNKEIEELKKALAAAEATAKTLEGQLSDLQAEKQELLNELYRLRDELGNLREEMEKQTAARDAAVKELAELKQELDDLIEALNKVRAENQALQNENEKLRTDIAKMNDELETLRNENARMTTDLATMKNKLTEAEGRINDLKNESDNLKKKIADLENTVKELEALKKQLEDCRAEVARLISELDKLKSEKERLQNELDNAGNELNSLKDDLEKLKTANDTLKADLEKLKTANDTLKGDLEKLKTANDKLKNEKDKQNDYDATLKADIEKMKKENDELRSENARLKSELQNCREENEKLRGELENLKDEHATLKAHSERKLEMKETEMDKDIVDDCGDFIKTNDFLKTKFEKQNEGIQRVRDYISYLEGKVKDEPKMAEKMDDDVSIDPKMKQDIADLLNKSKTLSENIYRTEQEIQNIAKILKQFQDAGLGERPTIAFDADSWLNSLTLTQLADLHDKICLLTSDMVQQDSPKAVSCGTGSPLKVDCYNTLNQRIAALQKQIAEKQIEAGWRLQELRRALRQEQNNLILISDQMNLERKRNLDIQLRMDDAP
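
The coding sequence and the protein sequence above can be cross-checked by translating one into the other. The sketch below assumes the standard genetic code, which the record does not state, and folds the mixed upper/lower case of the coding sequence to upper case before lookup. The function name `translate` is hypothetical, and the two fragments are the first 18 and last 12 nucleotides of the coding sequence as listed.

```python
# Standard genetic code (assumed), built in TCAG order.
_BASES = "TCAG"
_AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: _AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(_BASES)
    for j, b in enumerate(_BASES)
    for k, c in enumerate(_BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks, fold case
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    # Codons with ambiguous bases (e.g. N) are reported as 'X'.
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3))


head = translate("ATGGAGTCTCAATATTCA")  # first 18 nt of the coding sequence
tail = translate("GATGCACCTTAA")        # last 12 nt of the coding sequence
print(head, tail)
```

The translated fragments can be compared with the start (`MESQYS`) and end (`DAP`) of the protein sequence; a full check would translate the complete coding sequence after rejoining its lines and compare everything before the terminal stop.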

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00359966;
90% Identity
iTF_00359966;
80% Identity
-