Basic Information

Gene Symbol
-
Assembly
GCA_951812415.1
Location
OX638310.1:131516885-131528826[-]
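
The Location value packs contig accession, start, end, and strand into one string. A minimal sketch of splitting it apart follows; the names `Locus` and `parse_location` are hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """Pieces of a 'contig:start-end[strand]' location string."""
    contig: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Contig is everything before the colon; strand is '+' or '-' in brackets.
_LOCUS_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Locus:
    """Parse a location string; raise ValueError if it does not match."""
    match = _LOCUS_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


locus = parse_location("OX638310.1:131516885-131528826[-]")
print(locus.contig, locus.strand, locus.length)
```

The `[-]` suffix marks the gene as lying on the reverse strand, so any sequence extracted from the contig at these coordinates would need to be reverse-complemented before comparison with the coding sequence.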

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA-binding regions adjacent to α-helical regions that fold together as coiled coils.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 23 0.73 1.1e+03 0.7 1.0 38 62 198 222 191 225 0.74
2 23 0.17 2.5e+02 2.7 1.0 23 60 279 331 275 336 0.73
3 23 1.8 2.7e+03 -0.6 0.5 23 38 399 414 394 420 0.78
4 23 0.22 3.2e+02 2.3 0.7 33 59 430 456 429 460 0.84
5 23 1.6 2.4e+03 -0.4 2.9 21 47 527 553 527 556 0.79
6 23 0.057 83 4.2 0.9 24 52 559 587 557 592 0.84
7 23 0.16 2.3e+02 2.8 2.1 27 49 595 617 591 620 0.84
8 23 0.041 60 4.7 0.7 40 63 645 668 637 672 0.88
9 23 6.4 9.3e+03 -2.3 10.3 18 62 682 730 673 732 0.79
10 23 0.00098 1.4 9.9 9.1 18 60 750 792 749 795 0.93
11 23 4.9e-08 7.1e-05 23.7 3.4 26 60 800 834 798 838 0.93
12 23 2.6 3.8e+03 -1.1 3.7 34 63 854 883 841 885 0.84
13 23 0.0088 13 6.8 0.8 25 63 898 936 894 938 0.84
14 23 2.2 3.3e+03 -0.9 4.0 23 57 932 966 932 974 0.83
15 23 0.016 23 6.0 2.6 27 63 972 1008 969 1010 0.85
16 23 1.8 2.6e+03 -0.5 0.0 38 60 1011 1033 1008 1037 0.70
17 23 0.43 6.3e+02 1.4 3.8 31 63 1072 1104 1062 1111 0.84
18 23 0.015 22 6.1 7.0 33 63 1128 1158 1126 1160 0.85
19 23 0.9 1.3e+03 0.4 0.6 27 39 1174 1186 1163 1190 0.67
20 23 0.15 2.2e+02 2.9 2.8 28 65 1204 1241 1201 1241 0.87
21 23 0.22 3.3e+02 2.3 3.6 25 52 1246 1273 1245 1277 0.86
22 23 0.39 5.7e+02 1.5 0.2 24 63 1366 1405 1363 1408 0.60
23 23 0.78 1.1e+03 0.6 2.1 21 50 1515 1544 1510 1552 0.84
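
Most of the 23 domain hits above have large i-Evalues, so it can help to pick out the few that stand out. The sketch below parses three rows copied from the table and ranks them by i-Evalue. The field names in `DomainHit` and the 0.01 cutoff are illustrative assumptions, not values used by the database.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """One row of the per-domain hmmscan table (13 whitespace-separated fields)."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line: str) -> DomainHit:
    """Convert one table row into a DomainHit, checking the field count."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]), float(fields[4]), float(fields[5]),
        *(int(x) for x in fields[6:12]),
        float(fields[12]),
    )


# Rows 10-12 copied verbatim from the table above.
ROWS = """\
10 23 0.00098 1.4 9.9 9.1 18 60 750 792 749 795 0.93
11 23 4.9e-08 7.1e-05 23.7 3.4 26 60 800 834 798 838 0.93
12 23 2.6 3.8e+03 -1.1 3.7 34 63 854 883 841 885 0.84
"""

hits = [parse_hit(line) for line in ROWS.splitlines() if line.strip()]

I_EVALUE_CUTOFF = 0.01  # assumption: an illustrative threshold only
significant = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]
best = min(hits, key=lambda h: h.i_evalue)

print([h.index for h in significant], best.ali_from, best.ali_to)
```

Among these three rows, only domain 11 (alignment positions 800 to 834, score 23.7) falls under the illustrative cutoff.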

Sequence Information

Coding Sequence
ATGGCATTTTCAACGCAAAAGgactttttaaaaattggttttttaaatGACTGCAACGATTTGAATACGGAGAGTCGAAGACGACATCAAGTCTCTTTAAAAACCTATACGTCGTGTACAGACTCGCGTGCATGGTCGTTGAGTCAAGAAAaacaagcgaaatatttaacAGATATAGTCGCTTGTGGAACAAGGTTCGTCGAAGATGGTGACGGGTTTGAAAGTGGAGACTCTGAAAATAATCTATCTGGAGATATGAACAGTGCACCAGGACCAAGCAGAGGAATTAATAGACCAGGTGCTAGTAGACCAGGTTCTAGTAGGCCTGATACTGCTAGAGCAATTACTGGTGCTGGTGGTTCAGGACCAGGAGATCGTAGACCTGGATCTAGTTCTGGACAAGGTACGGTAACTGGACAAGGGGCTGAGACTGGGAATGGCGGTGGATTCGACACAACTTCATATATATCAAGTGCGTATGGCGGACCAGGCAACTTCACTAGTGGTGACCAAGGAAATGGTGGTGGAATTTGTGCCGACGACAGTCTAATTGAAGCTGCTTGTATGAAGCTGGATGAAATCAATGCCATAGTACGAGAAAATTCCGAATTGAAAAAGAAACTAGAATTATCTGAAGATAAACTTGACCGACTAGACAacgaaaacaataaaatcaaagcTATCATAACGGATTCGACAGCCGGCAGTTTTGGTCCTGTAGAGGACGAAATCAAGGTACTGAAAGCTAAAGGCGAACTGAATACTGACAATgtattaaaacttttaaaaaagtaCATCGAAGACTCAAAAGTACAAAATCGTAAATGCGATATGATGTTAGTAGAAAATGAATCATTGAAAAGTCGAATCGAGAAACTCGAGGCACGCGTTAAGATCTCCGTTGTACCTGCGAAAAAGGAAACTAGTGCTGATGTGAATTATCTTCAGTCGACGGTCAACGATTTGCGCAATGAAATGTTACAATTGCGGGCCATCGAAGAGGATAGGGCTAAGGTTAAAGTTGAGTTCGATAATCTTAAGCAACAAATAAAACACATTGGTAGTCCGGACGACTTGTATAATATGAAGAAAGCATCAGTTGCCATTCGCGAAGTAAAATGTGATAGAGACCGTCTTCGAGATTTAATTAATTCGATGGTTGGTATTGAAGACGATCTGAAGAAAATTAAAGAGCGCGCAGAGAAGTCTGACAAATTGGAAAAACAAGTGGCTGAGTTGAGCATAAGACTTGAAAATGCTTCGAGTAATCAGATGGCGGGCGGGTTGAGATGTGAtgaaatgaaatcggaaatcgACAagttaaagaaagaaaaagctGAAGCACAGCTAAAACTCAAACATATGACTTCAGAAGATTCCGATTTCAGTCAAGCACAGGAAAAACTTAAGGAATTCGATCGAGTTTCGTGCGAACTGCATCTTTACaaaGAAATTATTCGAGAATTGGAATCTGAGAGAAAGAAATGTCCGCAGCAGTTTGAACAAGCTGGAGGAACGTCTCAAGATGCACATATTCAGGATTTACTTGCAATAACTAAAGTCACAACCATTTACATACAGAAAGAGACAGAAGCACGTAAAAAACTTGAAGAAGAGCTGAAAAAATGTAAAGCTGAGCTTGGGCGATTGAGCAAAGCAAACGTTGGAGGAAGTCGAGAGAAATTCGAAGACTTACAAGCAAAACTCAAGACTTGTGAGACGGAGATAGGTCGACTTAAAAAGAAGAATGAAGCGCTTTTAGGTCAAATTTCGAACGAAGATGTTGAGGACTTGCTTGATCAGTTAAACGCTTGCAAACGTGAAAACGAAAAGCTTCAAAACCAGGTCTCAAATGCAGGTGACAAACAGCAATTCGATTCAGTTTTTGGCGAACTAAACAGTTGCAAAGAGTTGTTAAATGCTAGCAAAGGCGAGAAAGATAAATTGAACGATAGAATAGCCAATTTGCAGAAAGAGCTCGATGAATGCCGAAAGAACAA
TAGTGCAGCAGCACTTCTTGAAGAGAAGGAAGACGCGCTTAAAGACTGTCAGGATAAAGTTAAGGACTTAGAAAATCAATTAAGAAAGATTACAGAAGAATTGGAGGAAAGCGCAAATAATTGTGCTGAACAACTTAAACAGTTGAAGGAACATTTTGAAAAAGAGAAGACAGACTTACAGGAGAAGCTGGACGATGAGCTACATTTTCAAAATACAAGCAGCACTAATTTGACATCGGACCTTAGCAATTGCCGAGACGAATTACAGAAATTGAGAGATCAATTAAAAGAAGAAGCCGATAAATGTCAAGCGGAAAAAGAACAATTACAGCAACAAGTTATTCAACTAAATGAAGCACAGAAAAACTTGGGTCAAGCTGGTAATACAATTTCCGAAGAtattaaaaatcttgaaaagaaATTGGCGGATTGTCAACGCGAAAACGAAGCTCTACAGAAAGAACGTGACTCGCTGACCAATGAACTTAACGAATGCAAAGCTGGTAATGACAATGAACTTTCGAGATTACGAAATGAACGTGACGAACAAATTAAAGCCAGCAAAGCGTGTGAAGATGAAAAAAATAGCCTCGCAAATCAACTTTTATCAGCGAATAACGATTTAAGCCAGTTGAAATCGCAGCTAGTACAACAAAAAGGACAGGGAGATGAAGACTGTGAGAGAGTCAAAGCGGCTTTAAAAGATAATCAAGATAAGTACAGCCAGTTAGAAAGCCGATACGATTCATTGCAACAGCATGACGCCAGCATTCAAAAGGCTCTAGATGAGTTGCGCAAAGAGTACGAGGTTAAAACTCTGGAATGGACGAAGGAGTTGAACGAACAAAAAAGTCTGCTTGCTCAAGCAAAGGCTTCCTTAGAAGCTCAACGAGGTGAAATATTTAGACTTAACGAGCTCCTTATACAGAACTCATGCAAAGACCTCGAGGATAAGCTGGCAAGTATGCTTGCAGAGAATAACCATCTCAAGGCCGAAAATGATAAAGCAAAACAGTCTCTTAAATCTAAGGAGGACGAAATTTCATCACTCCAGCAGAGAGTTGATAGCATTAATTCTCAGCTGGCGAAATGTGAAGAAGCGATTAAATCTACCGACTGTTCAAAGTATATTAATGAAATCGAAAGTTTGAAAAATGCTATTGCGATTCATCTTGACGAAATTAAACAACAAGCCGCACTACTGGAAAGCGTGAAAAGAGAGATCGAAGCAGCAAAAGCAAACAATGCATCTCTTCAATCAGAAATAGACGAAATAAAGGTCGAAAAGGAGGCTGCCAACCAAGCATTGAAAGCCAAGGAGGcagaaatgaacaaaataagAAATGATTTTGCCAACGCTTCTCAGCCACAGGACAATGATCAGCTTAGAAAATGTCTTGAAGAACAGCAACGTCTTGAAGATGAAAATAAGCGGCTTCAAGAAGAGTTCAAGCAGCTTGAAAATTTGGTAAAAGATATTGAAGAGGAGCGAAAGAAAAATACGCAGTTGGAATCGCTCATTGAGGAACTCGAACAAAGAATTAAAGACCTCCAAAGTCACCAGACCACTGTTGTTCCCGGCGCCGATATTAACGTTCAAAGTATTATTGATAAATTAAAAGATCAGCTTCAAatggaaacaaataaaaataacgcTCTCACGACCGAGCTCCAAACGTGCAAATCATCCCTAGGTAACTTACAAGAGAAGTTGAAAGAACTGGCGATTTGTAGAGATCAACTGCAGACAACGCAATCTCAACTGAAGGAAAAAGAAGCAGAGGTTGAAAACCTGAAGAGAGAAATCGAAAGTACAAAACTTACAGCCACAGTAAAACCAGTTGCTTCAGTGGGTCCAAGCGTTAGGCCCGTTCCTTCAGTAGATCCAACTTCTAAACCTGTGGCATCATCTGAGTCTATAGTAAAAGTACAACAAGTTACAGAATCAGTAGTATTGGGATCGGCACAATCTTTCGTTGGTCCACCCGTTATTGCGGTTG
GTTCTGGCCACGGCAATATGAATCCGAATTTAATGTTGGCATCTACAATTAGTGAAATTCAAAATTCGCATTGCgctaatttagaaaaaatgcaGAAAATGTACGAAGAGCGAATGAGAGAAATGATGATCAATCATGATAAGGAAATTCAATCCTTACGAATGACCAATGAAGCCTTACGAAAGACTCTAGTTGACATAAAGGCCCAAATCGCTGCCTTAGGTGGAATTGGTGATATCGGAAAATTGAATCAAATCAACAAGGAATTAGAAGAACTTAAACTAGGCGTTTGTAAATGTAACCAGGAGTCCGACATTAACGTATGTGGACAAGATACCAAGCTCAATAAAATATGCACGAAAATACTGAAGACGTCAATCGATTGCCTTAGTCTACGAGAATTGAAATATTTGCATTGTGCTGTTTATAGGGCTGGTTGCAAAATGAAACCAATGATAACTGCCGATGCAAATGAAATAGGTCGTTGTTGCCAATGCACAGGCTTCGTTTGTAGTCTACCAAGTATTCCGAAAGAAAGTGAGAAGAAGTTTATAGAAAGAATAAATTGCCTTGAAAATGAATTGGAAAATGCCAATGATGTTTTACAACGTCTGAGAGAACGCTATGAATCAAAAGCTAACTGCAGTCGCCGATGTTGTGCAGGACAAACGGAAAGGCAAGCAAGTGATCCGACGGTACAACAACTATTCATCACGCCAATGTATTGTGAAAAGTAG
Protein Sequence
MAFSTQKDFLKIGFLNDCNDLNTESRRRHQVSLKTYTSCTDSRAWSLSQEKQAKYLTDIVACGTRFVEDGDGFESGDSENNLSGDMNSAPGPSRGINRPGASRPGSSRPDTARAITGAGGSGPGDRRPGSSSGQGTVTGQGAETGNGGGFDTTSYISSAYGGPGNFTSGDQGNGGGICADDSLIEAACMKLDEINAIVRENSELKKKLELSEDKLDRLDNENNKIKAIITDSTAGSFGPVEDEIKVLKAKGELNTDNVLKLLKKYIEDSKVQNRKCDMMLVENESLKSRIEKLEARVKISVVPAKKETSADVNYLQSTVNDLRNEMLQLRAIEEDRAKVKVEFDNLKQQIKHIGSPDDLYNMKKASVAIREVKCDRDRLRDLINSMVGIEDDLKKIKERAEKSDKLEKQVAELSIRLENASSNQMAGGLRCDEMKSEIDKLKKEKAEAQLKLKHMTSEDSDFSQAQEKLKEFDRVSCELHLYKEIIRELESERKKCPQQFEQAGGTSQDAHIQDLLAITKVTTIYIQKETEARKKLEEELKKCKAELGRLSKANVGGSREKFEDLQAKLKTCETEIGRLKKKNEALLGQISNEDVEDLLDQLNACKRENEKLQNQVSNAGDKQQFDSVFGELNSCKELLNASKGEKDKLNDRIANLQKELDECRKNNSAAALLEEKEDALKDCQDKVKDLENQLRKITEELEESANNCAEQLKQLKEHFEKEKTDLQEKLDDELHFQNTSSTNLTSDLSNCRDELQKLRDQLKEEADKCQAEKEQLQQQVIQLNEAQKNLGQAGNTISEDIKNLEKKLADCQRENEALQKERDSLTNELNECKAGNDNELSRLRNERDEQIKASKACEDEKNSLANQLLSANNDLSQLKSQLVQQKGQGDEDCERVKAALKDNQDKYSQLESRYDSLQQHDASIQKALDELRKEYEVKTLEWTKELNEQKSLLAQAKASLEAQRGEIFRLNELLIQNSCKDLEDKLASMLAENNHLKAENDKAKQSLKSKEDEISSLQQRVDSINSQLAKCEEAIKSTDCSKYINEIESLKNAIAIHLDEIKQQAALLESVKREIEAAKANNASLQSEIDEIKVEKEAANQALKAKEAEMNKIRNDFANASQPQDNDQLRKCLEEQQRLEDENKRLQEEFKQLENLVKDIEEERKKNTQLESLIEELEQRIKDLQSHQTTVVPGADINVQSIIDKLKDQLQMETNKNNALTTELQTCKSSLGNLQEKLKELAICRDQLQTTQSQLKEKEAEVENLKREIESTKLTATVKPVASVGPSVRPVPSVDPTSKPVASSESIVKVQQVTESVVLGSAQSFVGPPVIAVGSGHGNMNPNLMLASTISEIQNSHCANLEKMQKMYEERMREMMINHDKEIQSLRMTNEALRKTLVDIKAQIAALGGIGDIGKLNQINKELEELKLGVCKCNQESDINVCGQDTKLNKICTKILKTSIDCLSLRELKYLHCAVYRAGCKMKPMITADANEIGRCCQCTGFVCSLPSIPKESEKKFIERINCLENELENANDVLQRLRERYESKANCSRRCCAGQTERQASDPTVQQLFITPMYCEK
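
The protein sequence should be the translation of the coding sequence. A minimal sketch of that check follows, assuming the standard genetic code (the record does not say which table was used) and a hypothetical helper named `translate`. Lowercase bases are uppercased before lookup, and whitespace is removed so that a sequence wrapped over several lines is handled.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: the
# record's protein was produced with this table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is stripped and case is ignored; codons containing
    characters other than A, C, G, T are translated as 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 39 bases and last 15 bases of the coding sequence above.
head = translate("ATGGCATTTTCAACGCAAAAGgactttttaaaaattggt")
tail = translate("TATTGTGAAAAGTAG")
print(head, tail)
```

The two fragments translate to the first 13 and last 4 residues of the listed protein; running the full coding sequence through the same function and comparing the result with the protein string would complete the check.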

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-