Basic Information

Gene Symbol
-
Assembly
GCA_021155775.2
Location
CM037746.1:19673282-19678588[+]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 19 0.0015 1.9 9.4 0.0 37 62 291 316 287 318 0.90
2 19 0.0021 2.6 9.0 5.4 25 64 585 624 584 625 0.88
3 19 0.0062 7.8 7.5 8.6 27 63 615 651 614 660 0.74
4 19 0.0043 5.4 8.0 3.0 29 61 666 698 662 702 0.78
5 19 0.0039 4.9 8.1 2.4 30 61 709 740 704 744 0.77
6 19 0.0058 7.3 7.6 3.5 30 61 751 782 746 788 0.72
7 19 0.0043 5.4 8.0 3.0 29 61 802 834 798 838 0.78
8 19 0.0043 5.4 8.0 3.0 29 61 844 876 840 880 0.78
9 19 0.002 2.5 9.1 2.5 25 58 889 922 882 929 0.61
10 19 0.15 1.9e+02 3.1 0.7 25 57 931 963 930 971 0.89
11 19 0.0055 6.9 7.6 6.1 23 63 1027 1067 1025 1069 0.87
12 19 0.029 36 5.3 2.8 25 55 1057 1087 1056 1089 0.83
13 19 0.22 2.7e+02 2.5 0.1 33 54 1111 1132 1109 1136 0.66
14 19 0.34 4.3e+02 1.9 0.1 33 54 1157 1178 1155 1183 0.59
15 19 0.37 4.6e+02 1.8 0.1 33 54 1203 1224 1201 1227 0.56
16 19 0.24 3e+02 2.4 0.0 33 55 1249 1271 1247 1274 0.79
17 19 0.12 1.5e+02 3.3 0.2 32 54 1294 1316 1290 1320 0.76
18 19 0.35 4.3e+02 1.9 0.1 33 54 1341 1362 1339 1366 0.57
19 19 0.14 1.8e+02 3.1 0.3 32 54 1386 1408 1382 1411 0.75

Sequence Information

Coding Sequence
ATGGCGCCAGCAGCGGTGGCTGTTTTCGTAATATCGCTGTTCGTAGAATCGTTCTCAGCACCGAACGGGTGTATCAGATGTGTGACATCAGATACTACCGGTGGCTACCAGACGCAGAGTGGTTGGGTAAATAGCAACAACTTATCGCAGAGGTCAGCAAATTTGGAAGATTTGACACAACAAGTAGAGGGTGAACTCGGTGGACCCCACAATCAGTTAGCCTTTGATAATACCAGGCCTGGAAACTGGAGGGACGTGAAGCAGTATCGAACAGCTGACGGTCATGGTAGGGTGTACGAAGAGCAAGGTCAACAAGTTCAAGGCCCAGTCCGAGTgagatattacaaaaaaaatttcacctcgaGCTACAGCAGCGGAAACCCAGGTGGTTTTGAAGTACCGTCTTTATCTGGATTTGATTCTGACAACATTCGATACGGCAGTCAATCGGCAAACCAGGGCAGTTTTGGCCAGGAACAGAACTCTGCTTACGATCAATCTGCAATCCGTGAAAATTCATACGCTTCGCAGGGTTCTTTACATTCGGCAAATCAATTCAGCAGCCAGGATGAAAGACTAGGAAGTCAACAGCGACGCGTAACGAGCAGTCAGTTTGGTGAAAATACATCCGGATTCAACAGTCGATCGACGCAGCAAGGATTGTCTGAACAGGAATCACTGAGGCCAGGAAATTGGACCACAGCTAACACTTATAAAACTGACGGAGGTAACGGCAGGGTATACGAAGAACGAGGACAAGTTGTTACAGGACCGCAAAAAATTCGTTTCTACAGGAAAAATTATACCTCGACTTACAGCTCGGATGGCGGGATTCCAAATTTGATTTCAGGGACTGATGGAGCTACAACCTTTGAAAGAGAAGTACAGCAACTGCAGAAACAGATTGATTCTATGGGACGAGAGGTTCATCAAACTGGTCAACATTTCTCAAATAGCGATTACGCGCAGCAGACTTCTAGAAATTACGGACAGTCAAGTCCGACTGTAGACCGTACAAACTTCAGACATGTGTCAACGACAGGTAACTACGGGTCACAGAACTATGATAACTTGGGATTAAATTCTCAACAAATACGGCACGGTGCTTATGACAGCGAAAATCAACGTGTATATCAGACTGGGAGCAGTATTAATAGGCAAATAGAAAGTGGAAACCAAAGGGGTCAAACTTATGGCCACATGACCGGCAGGTACGACAATACCAATGGACATGGTGCTGGGGGAGTACAAAGACCGATTTACCCAGAATCAAATACTCGGCAAACTCAGCGTCAGTATCTTTCTCAGTCTGAGCAGGAAGAGTATATACGCAATCACGGGTCTCAAAGTTGGAACCAAGATCAAGTGTCCACTGGTAATTTGAATCGTGAGCAGCAAATTCATCATTCTTCATCACATAACGCAGGAACAGGACCAGGCCGCATTCAGCATTATGATCAATTTCAAACTACCTCTTCGTCTTCGTCCCAGTATCCGAATATTGACAGACGAAACCTCCAAGCTGATAGTAATAGAGAAACACAACAGACGCATGGTGTGAGCCAGGATAAAACCGTCGGTAATAATTATCACAGGAATTACAGAATTCAATCTGGAAGACTGAGTACACAGGGTATTGATTTAGGGCGACTGGCACACGGACCTGATTGTGTAGATACTGCAAGTGGACATACCTCGTATGAGCAATCCCAATATCATACACAATATAGACGGAATACTGAAGATTTTGATCAACAAACCCAACATATCAACCAGCACACGGAAGATCTTACCCAGCAAACAGAAGATCTTACCCAACAAATAGAAGATCTTACCCAACAAACACAGGATCTTACCCAGCAAACGGAGGATCTTACCCAGCAAACACAGGATCTTACTCAGCAAACAGAGGATTTAACTCAGCAAACACAAGATCTTACCCAACAGACGGAGGATTTTAGTCAACAATCACAGGGTGTTATTGAGCAATCGGGAGACCTTACTCAGCAAACACAGGATCTTACCCAACAAACGGAGGATTTAACCCAGCAAACACAAGATCTTACCCAACAGACGGAGGATTTTAGTCAACAATCGCAGGGTGTTATTGAGCAATCGGGATACCTTACTCAGCAAACACAGGATCTTACCCAACAAACGGAGGATCTTACCCAGCAAACACAGGATCTGACCCAACAGACGGAGGACTTTAGTCAACAATCACAGGGTGTTATTGAGCAATCGGGAGACCTTACTCAGCAAACACAGGATCTTACCCAACAAACGGAGGATCTTACCCAGCAAACACAGGATCTGACCCAACAGACGGAGGACTTTAGTCAACAATCACAGGGTACGGAGGATTTTAGTCAACAATCACAGGGTGTTATTGAGCAATCGGGAGACCTTACTCAGCAAACACAGGATCTTACCCAACAAACGGAGGATCTTACCCAGCAAACACAGGATCTGACCCAACAGACGGAGGACTTTAGTCAACAATCACAGGGTGTTATTGAGCAATCGGGAGACCTTACTCAGCAAACACAGGATCTTACCCAACAAACGGAGGATCTTACCCAGCAAACACAGGATTTGACCCAACAGACGGAGGACTTTAGTCAACAATCGCAGGGTGTTATTGAGCAATCGGGAGACCTTACCCAGCAAACAGAGGATCTTGCCCAGCAAACACAGGATCTTACTCAACAAACAGAGGATCTTACCCAGCAAACACAAGACTTTACCCAACAGACGGACGACTTTACGCAACAATCACAGGATTTCACACAGCAAACGGAAGAACTCCCTCAACAAACACAAGGATATATTCAACAATCGCAAGATCTGACGCAACAGATAGAAGATCTTCCTCAACAAACGGAAGGATTTATTGCAGAATCAGGAAACCTTGGCCAGCAAGCTGAGGATCTTACCCAACAAACCGTAGATGTGACACAAAGTCTTCAACCAGAATCTGGTCACTCCATAGATGTAGGGGAGCCAAACTATGTGCCGCATGTATCAAATTTTAAAGATGtgcatgaattatttttcgatcatGAAAAAGAAGACAATAGACGTCAACAGATAGAGAAGATCTCTCCACAAGAGGAAGATCTCACCCAACAAACACAAGATCTTAACCAAGAAACAGAGGACCTTACCCAGCAAACACAGGACCTTACCCAGCAGACGGAGGACTTTAATCAACAATCGCAGGATCTCACTCAAGAAACGGAAGACCTCTCACAGCAAATTACTGGTGGAAATTCAGACTTTGGACAGCAGACTAATTGGGGGCTCGATCATTTTCAAGTTGGAGATTCACAGGCCGAAGATCTGAACCAAAGAACACAGGGTCTCACTCAAGAAACAGAAGACCTCTCACAGCAAATTACTGGTGGAAATTCAGACTTTGGACAGCAGACTAATTGGGGGCTCGATCGTTTTCAAGTTGGAGATCCACAGGCCGAAGATCTGAACCAAAGAACACAGGGTCTCACTCAAGAAACGGAAGACCTCTCACAGCAAATTACTGGTGGAAATTCAGACTTTGGACAGCAGACTAATTGGGGGCTCGATCGTTTTCAAGTTGGAGATCCACAGGCCGAAGATCTGAACCAAAGAACACAGGGTCTCACTCAAGAAACGGAAGACCTCTCACAGCAAATTACTGGTGGAAATTCAGACTTTGGACAGCAGAATAATTGGGGGCTCGATCGTTTTCAAGTTGGAGATCCACAGGCCGAAGATCTGAACCAAAGAACACAGGGTCTCACTCAAGAAACGGAAGACCTCTCACGGCAAATTACTGGTGGAAATTCAGACTTTGGACAGCAGATTAATTGGGGGCTTGATCGTTTTCAAGTTGGAGATCAACAGGCCGAAGACCTGAACCAAAGAACACAGGGTCTCACTCAAGAAACGGAAGACCTCTCACAGCAAATTACTGGTGGAAATTCAGACTTTGGACAGCAGAATAATTGGGGGCTCGATCGTTTTCAAGTTGGAGATCCACAGGCCGAAGATCTGAACCAAAGAACACAGGGTCTCACTCAAGAAACGGAAGACCTCTCACAGCAAATTACTGGTGGAAATTCAGACTTTGGACAGCAGACTAATTGGGGGCTTGATCGTTTTCAAGTTGGAGATCAACAGGCCGAAGACCTGAACCAAAGAACACAGGGTCTCACTCAAGAAACGGAAGACCTCTCACAGCAAATTACTGGTGGAAATTCAGACTTTGGACAGCAGACTAATTGGGGGCTCGATCGTTTTCAAGTTGGAGGTCAACAGGCCGAAGATCTGAACCAACAAGCAACAAGTAACTCAGGAATTTACAATggaaatttaaatcaattcgGACAAGATCAAGTATTGGACTACCCAGCGCAAGTAGGAATACAGCCTGCACCAAAACCAAAACCAAAACGTCCAAAACACAGAACAATCCACATACAAGAATTCTCCACAGAACAGGAACAGCCATCAACGGCTAATATACAAAATGGTGCACCAGAAATCATTGTGACTTCCCCATCAAACAGAGGAGATCAACCCAGTCATGAAACAATTTCCAATACAGAATTATACAGTAGTGAACAGTCACATCGAGTTCAGCCGACTAAAACTGGGGGCCGTCGGAGGAATAGGTATTACGGCATTCAACATAAACCTCAAGTTAACCAAGGACAACATGGGCAGGATTCGCAAGTAGAGCATGTTCCAAACGAGGTAACGACCAGCCAAGAGCCGACTCTAAGATACACCCATGTAAATCAACTTGACAAACAGAATTCAGGCGATTCAAATGTCAATCCAAAAAATGTACAACCGATTCCTCAGATTGGAACCAGAATTCTAGAAGCATACGGAGCAAATGGACCATACAACAGTGATCATGAACCAGATTTATTTAATACTGTTAAACCAAATCCAAGTGCAACATTACCACCTGTCTATGGCGACAAGGAGCCTTTCGAAATTATTTGGTCCTACGCagttccaaaaatttttaccaatacCGCCGCTCCAACTACATCCACAGAGCCCACAACTACATCCACAGAGCCCACTACTATGACCACCGAGGTCTCAATTCCAACAACCACTGAGCCTCCAATTGCTACATCCACAACTGTCCCTCCGCCAACAACAACTGCTGCTCCCTCATTATGGCGCAGATTTAGAAACAGAGTTAGTAACACAATAGACAAAGCCAGAGAACGCGCAGCGAGTATCTTTGGTTAA
Protein Sequence
MAPAAVAVFVISLFVESFSAPNGCIRCVTSDTTGGYQTQSGWVNSNNLSQRSANLEDLTQQVEGELGGPHNQLAFDNTRPGNWRDVKQYRTADGHGRVYEEQGQQVQGPVRVRYYKKNFTSSYSSGNPGGFEVPSLSGFDSDNIRYGSQSANQGSFGQEQNSAYDQSAIRENSYASQGSLHSANQFSSQDERLGSQQRRVTSSQFGENTSGFNSRSTQQGLSEQESLRPGNWTTANTYKTDGGNGRVYEERGQVVTGPQKIRFYRKNYTSTYSSDGGIPNLISGTDGATTFEREVQQLQKQIDSMGREVHQTGQHFSNSDYAQQTSRNYGQSSPTVDRTNFRHVSTTGNYGSQNYDNLGLNSQQIRHGAYDSENQRVYQTGSSINRQIESGNQRGQTYGHMTGRYDNTNGHGAGGVQRPIYPESNTRQTQRQYLSQSEQEEYIRNHGSQSWNQDQVSTGNLNREQQIHHSSSHNAGTGPGRIQHYDQFQTTSSSSSQYPNIDRRNLQADSNRETQQTHGVSQDKTVGNNYHRNYRIQSGRLSTQGIDLGRLAHGPDCVDTASGHTSYEQSQYHTQYRRNTEDFDQQTQHINQHTEDLTQQTEDLTQQIEDLTQQTQDLTQQTEDLTQQTQDLTQQTEDLTQQTQDLTQQTEDFSQQSQGVIEQSGDLTQQTQDLTQQTEDLTQQTQDLTQQTEDFSQQSQGVIEQSGYLTQQTQDLTQQTEDLTQQTQDLTQQTEDFSQQSQGVIEQSGDLTQQTQDLTQQTEDLTQQTQDLTQQTEDFSQQSQGTEDFSQQSQGVIEQSGDLTQQTQDLTQQTEDLTQQTQDLTQQTEDFSQQSQGVIEQSGDLTQQTQDLTQQTEDLTQQTQDLTQQTEDFSQQSQGVIEQSGDLTQQTEDLAQQTQDLTQQTEDLTQQTQDFTQQTDDFTQQSQDFTQQTEELPQQTQGYIQQSQDLTQQIEDLPQQTEGFIAESGNLGQQAEDLTQQTVDVTQSLQPESGHSIDVGEPNYVPHVSNFKDVHELFFDHEKEDNRRQQIEKISPQEEDLTQQTQDLNQETEDLTQQTQDLTQQTEDFNQQSQDLTQETEDLSQQITGGNSDFGQQTNWGLDHFQVGDSQAEDLNQRTQGLTQETEDLSQQITGGNSDFGQQTNWGLDRFQVGDPQAEDLNQRTQGLTQETEDLSQQITGGNSDFGQQTNWGLDRFQVGDPQAEDLNQRTQGLTQETEDLSQQITGGNSDFGQQNNWGLDRFQVGDPQAEDLNQRTQGLTQETEDLSRQITGGNSDFGQQINWGLDRFQVGDQQAEDLNQRTQGLTQETEDLSQQITGGNSDFGQQNNWGLDRFQVGDPQAEDLNQRTQGLTQETEDLSQQITGGNSDFGQQTNWGLDRFQVGDQQAEDLNQRTQGLTQETEDLSQQITGGNSDFGQQTNWGLDRFQVGGQQAEDLNQQATSNSGIYNGNLNQFGQDQVLDYPAQVGIQPAPKPKPKRPKHRTIHIQEFSTEQEQPSTANIQNGAPEIIVTSPSNRGDQPSHETISNTELYSSEQSHRVQPTKTGGRRRNRYYGIQHKPQVNQGQHGQDSQVEHVPNEVTTSQEPTLRYTHVNQLDKQNSGDSNVNPKNVQPIPQIGTRILEAYGANGPYNSDHEPDLFNTVKPNPSATLPPVYGDKEPFEIIWSYAVPKIFTNTAAPTTSTEPTTTSTEPTTMTTEVSIPTTTEPPIATSTTVPPPTTTAAPSLWRRFRNRVSNTIDKARERAASIFG

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-