Basic Information

Gene Symbol
-
Assembly
GCA_900474275.1
Location
UCOH01012378.1:13576-20770[+]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 45 3.2 3.3e+03 -1.4 0.5 33 57 62 86 54 90 0.51
2 45 0.18 1.9e+02 2.5 0.2 38 62 105 129 101 132 0.78
3 45 1.4 1.5e+03 -0.3 0.1 38 58 157 177 150 181 0.85
4 45 0.00055 0.57 10.6 2.7 25 57 216 248 212 256 0.89
5 45 1.2 1.3e+03 -0.1 1.5 29 63 318 352 312 354 0.83
6 45 3.1 3.2e+03 -1.4 0.6 42 62 369 389 368 398 0.63
7 45 0.004 4.2 7.9 2.9 28 63 429 464 425 466 0.88
8 45 0.11 1.2e+02 3.2 1.2 35 62 474 501 464 503 0.79
9 45 0.0018 1.9 9.0 5.0 26 63 507 544 504 546 0.92
10 45 0.00074 0.78 10.2 7.4 27 65 550 588 544 588 0.93
11 45 0.04 42 4.7 5.8 25 63 590 628 586 630 0.73
12 45 2.5 2.7e+03 -1.1 9.8 26 59 661 694 634 695 0.81
13 45 0.047 49 4.4 6.3 24 60 687 723 684 732 0.82
14 45 0.54 5.7e+02 1.0 5.5 31 65 750 784 736 784 0.76
15 45 0.0063 6.6 7.2 5.1 23 63 784 824 782 825 0.82
16 45 5.8e-08 6.1e-05 23.4 4.5 24 65 813 854 810 854 0.95
17 45 0.001 1.1 9.8 2.1 35 63 852 880 849 889 0.67
18 45 0.0016 1.6 9.2 4.3 22 64 895 937 885 938 0.87
19 45 3.8e-06 0.004 17.6 3.8 24 65 918 959 917 959 0.95
20 45 0.00073 0.77 10.2 4.9 23 56 938 971 933 980 0.62
21 45 0.13 1.4e+02 3.0 10.3 22 59 1021 1058 1014 1060 0.88
22 45 0.00077 0.81 10.1 6.7 22 60 1042 1080 1041 1082 0.94
23 45 0.18 1.8e+02 2.6 5.6 25 60 1115 1154 1111 1168 0.71
24 45 0.0043 4.5 7.8 4.3 32 65 1157 1190 1147 1190 0.90
25 45 0.0038 4 7.9 9.5 26 63 1172 1209 1167 1218 0.88
26 45 0.16 1.7e+02 2.7 1.7 33 62 1228 1257 1217 1265 0.71
27 45 0.073 76 3.8 4.5 25 60 1262 1297 1260 1302 0.88
28 45 0.046 48 4.5 8.1 28 63 1300 1335 1293 1350 0.60
29 45 0.44 4.6e+02 1.3 0.1 22 63 1350 1391 1345 1393 0.72
30 45 9.4e-06 0.0098 16.3 0.4 24 65 1394 1435 1391 1435 0.91
31 45 3.1e-05 0.032 14.6 0.5 24 63 1415 1454 1412 1456 0.93
32 45 0.0091 9.6 6.7 1.5 24 62 1443 1481 1440 1495 0.84
33 45 5.3e-06 0.0055 17.1 3.7 25 64 1500 1539 1499 1540 0.95
34 45 0.099 1e+02 3.4 0.4 36 62 1546 1572 1541 1575 0.84
35 45 0.00018 0.19 12.2 1.7 31 65 1583 1617 1573 1617 0.88
36 45 0.00088 0.93 10.0 3.7 33 65 1585 1617 1583 1638 0.62
37 45 0.00099 1 9.8 6.9 26 64 1638 1676 1635 1677 0.91
38 45 0.0061 6.4 7.3 6.2 25 61 1665 1701 1664 1705 0.94
39 45 0.3 3.2e+02 1.8 6.8 26 65 1680 1719 1677 1719 0.90
40 45 0.00015 0.16 12.4 6.5 11 63 1707 1759 1705 1761 0.95
41 45 2.9e-05 0.031 14.7 1.9 34 64 1765 1795 1762 1796 0.93
42 45 0.065 68 4.0 2.1 22 60 1809 1847 1808 1852 0.91
43 45 0.00079 0.83 10.1 2.6 31 61 1881 1911 1871 1914 0.86
44 45 5.9 6.1e+03 -2.3 0.0 32 59 1948 1975 1945 1976 0.80
45 45 6.7 7e+03 -2.5 2.8 37 55 2117 2135 2106 2172 0.67

Sequence Information

Coding Sequence
ATGAATGTTTCTCATGATCAATTTTGTCATTGTGGCTGTGATGTTCCGGGTAATTTTAAACCAGAGGAACCACCACATGAACCATGTTGTTGTTGTGATTATAATCCATTCAGTGATAATTCTAAGGATGCAGAAATCTACGATCTTCCTTTTGCTCTGCGAAAGCTTTCGGTAATGAAGTGCCAAATGAAAAAGTGGCGAATGGAACGACTTCAATTGGAGAGTGAAACTCGATGTTTGAAGCAAGCTCTTCAACAATAtggcATTAATATCGACGAGGCTTTGAAACCTGATCCTCTAGTAATTCACTATCGAGAGGAAAATTTAagacttcaaaatattaatgagGAGTTGGAGGATAAAGTCGCAGACCTCACAGGGACTTTGGATGAACGAGAATACTCTGAAGATCCTTGCGATTCAGTAAAATATATCAGAGGGCAAATGAAGGACCTTCGAATACGTTTTGCTGATGAAAAGAAACAATTAAGAGACAAAATAGCAGCCTTGAATTTGAAGCTTGCAGAAATGGATGACGATACTTCATGTTCAGCCTTAAATCGACTCAGAGCAAAACTTCGTGAATTGATGAAAGGTGGACAGAAGGCTGACCAGCAAGTTTCGATGGTTGTGGAAAAATCTATAGAAACAATGGTGGAGCTTTCGAAAAGCtgcgaaaatttaaaattcgagaaTGAGAGACTTTTGACTCAATTAGAAGATCTTCGCAAAAAATTGGGACAAAGAGAACCTTCAAGAACAACATTAAAAGAAGATGAAATGGGACTTGAACTATCAGAAATAATTGAAGGACAAGATGTTTCTGAGTTAtatgacaaaattaaaaaatgcgaGGCTCTTGTTGCTGATTTAAGAAGACAACTTGCTGAGAAGGATCAAATGATTAACGAactagagaaaaatatttttactcaaaAGAGAGTTTCTGGACAATTGGAGGAAGATTTGGATAAAATGACTGTTAGCCATAAAGCATTGATGAAAGAAGCTGCTGCTATGAaggaagaattgaaaaagaggGATGAAAAGgtCACGGAtcttttagggaatttaagaAGTTCAGCTATAAATCTTCTCGGAATGGAGAAACTGACCAGTGAAATTGAGTTGCTTAAACCTCAGCTCTATAATCTTGAAACAGAAAGAGATCAACTTTTAGGCGATCTAACTAAAATTCGGGGCCTTGTATCAGAGAGAAACGatcaaattgtaaaaattttagagcAGAAAGATAAATTGTCTAGCGAATGTAATGCAAAATTACTTGCTATGCAGACAAGGGTAGATGCCTTATTAGATCAAGaagctcttttaaaaaaaaaaatagaagaccTAAAATCAGAAGTGAAAGAATTGGAAGATGAAATTACAGGGTTGCGAGTAAGAATATTGGAATTAGAACAAAAAGGcaatgaaaatgaagatctaattaaaaagattcatGATCTGGAGGATCAGcttgataaattaaatgcaGAACTTTCTTTAGCCCAAAATCAAGTAAAAGAACTTGAAGGTGAGgtaagtgatttaaaatctgaaaaggAGAAATTACTGAAGAATTTGATTGAAGCTCAGactgaaattgaagaaataaagaagGATTTACAGGCTGAAAGAATAGCTAAAGAAAGTCTTTCTAAAGATCTGGAACCTTGCAAGGatgaacttgaaaaattgaaattggaaaataaaactttggAATCGGAAAATCAGAGTTTAAaggataatttgaaaaaattaggaGATGAAATCAAGTTCttgaaagaagaaaacgaAATGTTAAAGGCAGAGAATAGTAATATTATTGCGGAGAATCAAAAGTTTCaagaagagatagaaaaactgaaagcagaaaatgaaaaagcgCAAGTCGAGattgataaattgaaaattgaagaaaagaatttagaagcagagaaaaatgttattgaaAATGAGTTGAATGTCTTAAAAGGTGAAATAGAAAAGTTAAAAGTTGATCTAAAGAGCTTAGAAGACGAAAATGCAGTACTTAAAGGAGAACTAAATAacattaaagaagaaaaagataaattagaaaaagattttattgctCTTCAagcagaaaatgaaaaattgcgaAGTGATTTAGAGATGGAGAAAACCAAAAAGGAAGCTGCTTTGAAAGAACTCGAGGACTATAAGAAACAGTTGAGTTTAATAAAAGATGagttggaaaatttaaaagttcaatTGGAAAAATCATCAGCTGAATTGGCAACTCTTGCGtatcagaataaaaaattaaaagatgagCTGGAAAATGTAAACCgagatttagaaaaatcaGTGGCTGAATTAACAGCTCTTAAGGAGCAAAAGGATAAATTGGAAGCAGCTTTACAAAAAATGAGAGcagatttggaaaatttgacAAATGAAAATAAGGATCTTAAGGCAAAAATTGATCATCTTAATTCTGagattgagaaatttaaagccgaaaatgataaattattaaatgaaattagtaAATTAAAGGAAGAAGTTGACcagttgaaaaatgaaaataaaaagcttCAGGATGAATATCAACATTTGaaagatgaaaatgtaatattaaaaaatgagaatgcaCAGTTGAAAGATAGTTTAGATAAATCTGTAATAGAATTGTCAGCAATTAAAGAGCAAAATGAAAAGATGAAGGCTTTCgttgaaaatatgaaagacGAAATTGAAACTTTAAGAAAAGATAATGAGGAACTTAAGGCAAAAATGAATTACCTTAATTCCGAAATAGAAAAACTGAATGTCGAGAATAAGAAATTATCGAAAGAAAATGATGATTTGAAATCCGATGTTGATCAGCTGAAAGATGAGAATAAGAAACTGAAAGATGAAAATACAAATCTAAAAgacaacttaaaaaaatcagaagctgatttggaaatattgaagactgattttgataaattgaaaacagaaaatgaagaattaaagaaagatCTAGAAAAAGGGGAAGCCGAGGctgaaaatatgaataatgaatTGAAGAAATTGGCAGCAGAATTGAATAAggtgagaaaagaaaaagataactTGGAAGAAAAGTTAGCAAACCTTGAAAgtgatttagaaaaattaaaagcggagaatattaatttgaatactGAATTGAAAAAAGCTAAAGAAGATAATGAGGCGTTAAGAAATGAAACGGATTCTTTAAGGAACGAAAATTCGAAACTTAAAGCAGCAGGAGATAttgaaaaggaagaaaagaaTAAGGCAAATGAAGAGATTAGATCACTGAAAgatcaaaatgaaaagataTCTCGTGATCTAGATAATGCTAGAtctgaaattaattctttaaaggGGCAagttgataatttaaaatctgaaatggagtccataaaagaaaaactacaATCAAAAGAAGATGAATTAAATGCTCTGAAAAATGAGTTGAAAGATTTCCTCTCGAAGTTTGATGAACTCAAGGATGAAAATAATCGttttaagatagaaagggagaGTCTTAGAAAGCaattaaatgatttgaagAACGAATTGTCAAGATTAGAGGCAGAAAATTTAGATCTGAAAGAAGAACTCGctaaattaaaatcagaatTAGTTAAGATTGTAGctgaaaatgataatttgaaaaaagaacttGCTGAAGCGTTGAAGGATTATGATCAATGTCAGACAGATGGTTGCAAGTTGAAGAATGAAATTGGTAAGTTAAAGGCAGCTGttgagaatttgaagaaagaatTGGGCAATTCTGAAGCAAAGATAAAAGAACTTGAAGGTCGTCTTTTAAGTAAGGAGAGCGAGAAAAATAGTTTGGAACGAGAAATCGAaggattgaagaaaaatttgtcagCTGAAAAAGCTGAGAAAGAGgcagttttgaaaaaactgGAAGATCTCAATGAAGAACTTAAGGTTTTGAAAGCAGAACTGGCTAGGTTAAGATCTGAAAATGATCAACACAAGGCAGAAATAgatgatttgaaaaaacaattagTATTGACTAAAGATGAATTGAATAAAGCTAAGAGTGAAATTGGAGCTTTGAAAGATGACAACAACAAATTGAAGGGCCATGGTGATACTTTAGCTAATGAAAGTGAAAAAGTGAGAGGAGAGTTGGGTAAGTGGAAAGCTGAAGTTGAAGCTCAACGAAATGAAAATGCTAAATTGAAAGGTGAGGTTGATAAACTGGGGGCAGAAAATactagatttaaaaatgaaattgaacgTTTGAATAATGAAGTTAGTGGtctaaaaaacataaatagtAAACTCCAAAATGAATTAGGAGCTGCAAAAAGTGAGGTTGATAAGTTGAAATCCGAATTGTCTGCTTCCAGAGCTGAAAATGAGATATTGAAAGGGGAAATGAATAGTTTGAAAAGTGAAGTTGGCAgattaaatgaagaaatgaataagataAAGCCGGAAATtgagagattgaaaaaatccTCAGCTGATGGTGaagctaaaattaaaaatttagaaggTCAAGTTGCATCTTTAGAATCAGAAAAAGGAACACTTgtgaaagaattaaatagtTTGAGGGACGAACAGGACACATTGAAAAAGGAATTAACAGCTGAGAAAAATGCCAAGGAGGCTGCACTTAAAGAAGTAGAAGGTCTTAAATCAGAATTATCGATGATGAGGGctgaattagaaaaatatagaacAGAAAATGATAAGTCAAAATCTGATTTGGAGGCATCAAAGAAGAAAGTTGATGAATTAACGAacgaaaatggaattttaaaagcagatatcgaaaaattaagaaatgaaTTGAATGTTTTGAAGAGTGAAAATACGAAGCTTAAAGATGAGTTggataaagtgaaaaataatctAGAGAAAGTGACAGCAGAGTATAATGCTTATAAAGcagaaaaggaaaaggaaattaataatCTCAAAACAGAGAATGATCGATTAAAGAAAGAACTTGCAGATGTACAagagaaaacgaaaaatttggAAGATCAGCTGAAGAGTCTTCAGGATGCTAAAGCGAAATTAGAAGCTGATGCTGTAGATTTAAAATCACAGTTGGAAAAGTTGAGACAAGATGTAGcggttgaaaaaagaaataaagaagcTGCTCGAAAAGAACTCGAagctttgaaaaatactttaggTGACTTGAGAAACGAGATGggaaaaattaaggaagaAAATGAGAAGTTGATAGTGGAGAATGcaagtttgaaaaaacaaattgatAAGTTGAGAAGTGATGTCGAAAAATCGAAAGGGGATTTGGATAAACTTGAGGCTGAGAATGGGAGACTGAGAGATGAGATTAAtaagttgaaagaagatttAGAGAAGTTGAGAGCagaattagagaaaataaaagcaGAGAAAGATAAGATGAAGGATGaacttgaaaaatcgaaagaaGATGCTGCAAAATTACGAGATGATTTACAAAAAGCGAAGGCAGATGCTGATAgattaatttctgaaaataaaaagctgAAGAATGATTTGGATGAGGTAAAGGCAGGATCTGATATGTTGAAAGCGAGGGTTAGTAAAGGAAGTGTAGAATCTAATAAGTGGAAGGAAGAATTGGAAAAGTTGAAACTTGAACTTGCGAAAGCAAAAGATGAAGCTAATCAATGTAAAGCGGAGGTAGGTAGGTTGAAATCCGAAATTGAAAGATTGAAGGCAGAAATggataaaatgaaaactcaGGATCGAAAATCAAAAGTTAGCATTGATCGTCCCAAAGCTGAAACACCTAAAGCTGAATCTAGAAGGATATcagatcttcctgaaaaatatgaatcaaAAGTTTCTGAAGACAAATGTGGGGACTTTATTAAgtctaatgaaattttgaggaagagGATACAGGAACAAAACAAGGcTATTGATCGGGTTAGAGAATTTATTGGTTTCGTGGAAGGTAAAAATCCATCGAAGCCTTCGATGGCTGAAGATTATAGTGCTGATGGCGACACTGGTGGTTGGATAGTACCAGCGGTAAATGAAGTTATGGAAAAGTCACAAAAactatcagaaaatattttcaaagctGAGAAAGAAATACAACATCTCGAAGCGTTTCTCCAACGAAAGagCGTTGGAGATTCGGAAGCGTTGGTTCCAGAAGGTCCCAGAGGAGATTTTAATGCCGATTCCTGGTTGGGAtcgTTGACCTTGACTCAATTGGCAGAACTACATGATAAGATTTGTTTGCTTACTGCTTCAATGGTTGACGCTGACAGTAATTCTATCGGTTCAGAAAGAGTACTTCCTGAGAATCAATTAGGAAACGAACAAAGTCCTCAAAAATCGAACAATCCTTTGGAAAGAGATTATGAAACTCTGAATAAAAGAATTGCAGCTTTGCAGAagcaaatttctgaaaaagaatTAGAGGCTGCGAGAAAGATGCATGAGATGAAGAATATTCTTCGTCAAGAGCAAATGCAGTTGATTAAGATTTCAGAAGAAATGACTAAGGAGAAGAAACGAAATTTAGCACTTCAGATGACAATTGATCAAGATCCTTGTTGGACGAAAAATTGCAATGCAGAATGTAGAGGAGATTTCGATAATATCAGAGAATATTAA
Protein Sequence
MNVSHDQFCHCGCDVPGNFKPEEPPHEPCCCCDYNPFSDNSKDAEIYDLPFALRKLSVMKCQMKKWRMERLQLESETRCLKQALQQYGINIDEALKPDPLVIHYREENLRLQNINEELEDKVADLTGTLDEREYSEDPCDSVKYIRGQMKDLRIRFADEKKQLRDKIAALNLKLAEMDDDTSCSALNRLRAKLRELMKGGQKADQQVSMVVEKSIETMVELSKSCENLKFENERLLTQLEDLRKKLGQREPSRTTLKEDEMGLELSEIIEGQDVSELYDKIKKCEALVADLRRQLAEKDQMINELEKNIFTQKRVSGQLEEDLDKMTVSHKALMKEAAAMKEELKKRDEKVTDLLGNLRSSAINLLGMEKLTSEIELLKPQLYNLETERDQLLGDLTKIRGLVSERNDQIVKILEQKDKLSSECNAKLLAMQTRVDALLDQEALLKKKIEDLKSEVKELEDEITGLRVRILELEQKGNENEDLIKKIHDLEDQLDKLNAELSLAQNQVKELEGEVSDLKSEKEKLLKNLIEAQTEIEEIKKDLQAERIAKESLSKDLEPCKDELEKLKLENKTLESENQSLKDNLKKLGDEIKFLKEENEMLKAENSNIIAENQKFQEEIEKLKAENEKAQVEIDKLKIEEKNLEAEKNVIENELNVLKGEIEKLKVDLKSLEDENAVLKGELNNIKEEKDKLEKDFIALQAENEKLRSDLEMEKTKKEAALKELEDYKKQLSLIKDELENLKVQLEKSSAELATLAYQNKKLKDELENVNRDLEKSVAELTALKEQKDKLEAALQKMRADLENLTNENKDLKAKIDHLNSEIEKFKAENDKLLNEISKLKEEVDQLKNENKKLQDEYQHLKDENVILKNENAQLKDSLDKSVIELSAIKEQNEKMKAFVENMKDEIETLRKDNEELKAKMNYLNSEIEKLNVENKKLSKENDDLKSDVDQLKDENKKLKDENTNLKDNLKKSEADLEILKTDFDKLKTENEELKKDLEKGEAEAENMNNELKKLAAELNKVRKEKDNLEEKLANLESDLEKLKAENINLNTELKKAKEDNEALRNETDSLRNENSKLKAAGDIEKEEKNKANEEIRSLKDQNEKISRDLDNARSEINSLKGQVDNLKSEMESIKEKLQSKEDELNALKNELKDFLSKFDELKDENNRFKIERESLRKQLNDLKNELSRLEAENLDLKEELAKLKSELVKIVAENDNLKKELAEALKDYDQCQTDGCKLKNEIGKLKAAVENLKKELGNSEAKIKELEGRLLSKESEKNSLEREIEGLKKNLSAEKAEKEAVLKKLEDLNEELKVLKAELARLRSENDQHKAEIDDLKKQLVLTKDELNKAKSEIGALKDDNNKLKGHGDTLANESEKVRGELGKWKAEVEAQRNENAKLKGEVDKLGAENTRFKNEIERLNNEVSGLKNINSKLQNELGAAKSEVDKLKSELSASRAENEILKGEMNSLKSEVGRLNEEMNKIKPEIERLKKSSADGEAKIKNLEGQVASLESEKGTLVKELNSLRDEQDTLKKELTAEKNAKEAALKEVEGLKSELSMMRAELEKYRTENDKSKSDLEASKKKVDELTNENGILKADIEKLRNELNVLKSENTKLKDELDKVKNNLEKVTAEYNAYKAEKEKEINNLKTENDRLKKELADVQEKTKNLEDQLKSLQDAKAKLEADAVDLKSQLEKLRQDVAVEKRNKEAARKELEALKNTLGDLRNEMGKIKEENEKLIVENASLKKQIDKLRSDVEKSKGDLDKLEAENGRLRDEINKLKEDLEKLRAELEKIKAEKDKMKDELEKSKEDAAKLRDDLQKAKADADRLISENKKLKNDLDEVKAGSDMLKARVSKGSVESNKWKEELEKLKLELAKAKDEANQCKAEVGRLKSEIERLKAEMDKMKTQDRKSKVSIDRPKAETPKAESRRISDLPEKYESKVSEDKCGDFIKSNEILRKRIQEQNKAIDRVREFIGFVEGKNPSKPSMAEDYSADGDTGGWIVPAVNEVMEKSQKLSENIFKAEKEIQHLEAFLQRKSVGDSEALVPEGPRGDFNADSWLGSLTLTQLAELHDKICLLTASMVDADSNSIGSERVLPENQLGNEQSPQKSNNPLERDYETLNKRIAALQKQISEKELEAARKMHEMKNILRQEQMQLIKISEEMTKEKKRNLALQMTIDQDPCWTKNCNAECRGDFDNIREY

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01390819;
90% Identity
iTF_01394028;
80% Identity
-