Basic Information

Gene Symbol
-
Assembly
GCA_963924015.1
Location
OZ001258.1:4701414-4706957[+]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 23 0.0066 7 7.4 11.4 25 60 544 579 534 584 0.56
2 23 0.0037 3.9 8.2 10.7 25 63 551 589 547 594 0.89
3 23 0.0045 4.8 7.9 4.6 32 64 607 639 606 640 0.89
4 23 0.0023 2.4 8.9 4.3 25 60 628 663 627 665 0.90
5 23 0.0073 7.7 7.2 5.5 32 63 677 708 675 710 0.78
6 23 0.0032 3.3 8.4 6.4 25 60 712 747 708 749 0.75
7 23 0.002 2.1 9.1 7.0 25 61 761 797 760 800 0.80
8 23 0.0064 6.7 7.4 4.2 25 54 796 825 794 828 0.87
9 23 0.0026 2.7 8.7 5.0 32 64 838 870 834 871 0.86
10 23 0.03 31 5.3 3.8 25 50 866 891 865 895 0.69
11 23 0.0054 5.6 7.7 8.3 25 61 908 944 907 946 0.80
12 23 0.0019 2 9.1 8.8 25 63 929 967 928 973 0.93
13 23 0.0011 1.2 9.9 7.6 25 64 985 1024 984 1025 0.92
14 23 0.045 48 4.7 3.8 25 50 1020 1045 1019 1050 0.70
15 23 0.0011 1.2 9.9 7.6 25 64 1062 1101 1061 1102 0.92
16 23 0.019 20 5.9 4.1 31 58 1096 1123 1095 1127 0.49
17 23 0.0077 8.1 7.2 5.2 32 62 1139 1169 1135 1172 0.81
18 23 0.0015 1.6 9.5 5.0 25 61 1174 1210 1172 1212 0.87
19 23 0.0063 6.6 7.5 5.6 26 61 1224 1259 1222 1262 0.68
20 23 0.0009 0.95 10.2 9.1 25 63 1244 1282 1242 1288 0.93
21 23 0.0023 2.4 8.9 5.8 25 64 1300 1339 1299 1340 0.82
22 23 0.022 24 5.7 2.8 25 63 1356 1394 1355 1396 0.90
23 23 0.096 1e+02 3.7 3.6 25 61 1398 1434 1391 1438 0.65

Sequence Information

Coding Sequence
ATGGCTCCCGCGGCGGtggcaatatttttaatctctcTATTCGTGGAAGGATTATCAGCTCCAAGTTGGTGCGGAGATTGTCAAACATGGGAATCGCACAGAGGCTCTCAAACTCATAGAGGATTTGGCAGGGaaataaatcaagaaaatttgTCTCAACGGTCAGAAAATTTGGAAGATTTAACACAACAAGCGGAAACCGAGTTCAACAGATCTCccaatcaatttaattttgataatacgAGGCCTGGAAATTGGACTGATGTTAATCATTACAGAACATCTGATGGTCATGGAAGAGTATACGAAGAACAAGGCCAGCGTGTAGATGGATCAAGTCGAATTAGATTCTCtaaaagaaatttcacttCCAGCTATAGTAGTGGAAACTTAGGTTCCTTTGGAGAAACTAATCTAGGACGTATATATCCTAACATAAGACAAGATGAGAGCCAGTTATTGAACCGTGAATCTTTAGATCAGTCACAAAATTCAGCTTATGATCGATTCGCCACTGGACGAAATTCTCATACTACACAGGACTCTTTACATTCTACCGAACGGGTGAACAGCCATAACGATGCATCCAGATATTATGAAAATCTTGGCAATAGTCAGATAAGCAAAATTACTTCTGGTCAATCAGTGCAGCAAGGAATTAATGCATTGGATCAAACAAGACCAGCAAATTGGAGCACGGTTAATACGTTTAGAACCAATGATGGTAATGGCAGGGTTTACGAAGAACGAGGGCAGTTTGTAACCGGGCCGAAGCGGGTTCATTTTTATGTAAGAAATTACACTTCAAGTTATGCCTCTGACGGAGGTATTCCGACTCTTGATTTAGGGGGTGAAGATACAAGGAATGTCGAAAGCAGCATACGGCAGCAGCAGAGACTATTCGATAGTTACGGAAGAGAGCTTCATCAAAGTAATGAGGGTTCAACAAATGGTGGTTACACTCAGCATTATCCTGGACATTATACATCACCCAGTCAAACGACGAGACAAACAAACTACAGATATGTATCGAGACCTGGTAGCTACGAATTACAAAATCAGAATGCTTTAGattcaaattctcaccaaactTACCAGCACACATCTAACTCAGGAAATCAGCATGTGTCTCAGTCTAGCAGTAGTTCTTTTGATGGATTGGGACAAGTTAATGGAAGAAATCCAAATTCTGCATATTCAACAGGCAGTTATACCAGTAATAGCAGATACAATCATCTGGGAACTGTGGAAATACCTTCTTCGGGACACACTGGACACAGGATTCCATATTACAATCAATTTCAAACCACCTCTGATTCCTCCTCTGCTGCAATATCTCATCCTGACACCGATTTAAGAACTATTCAATCTGGTAGCGATCAAGAAACACAGCGTACGCTTAATTCACACAACAGCTTTGATCAAACTACTAAGAATAATCAAAAGAGGATTTATAGGGTACAGAATGGGCAACTAGTTACACAGGGAATTGATTTGGGACAAATAGCACAAGCTCCTGATTGTGCAGAAGGTACAAATGGATATAGCTCATATGAACAGTCCTACCGTAGGGTCTATAGAGGGGCTCCCGAATCTCATGATTTTTCGCAACAAGTGCAAGATCTTACCCAGCAAACAGAGGATCTTACCCAGCAAACGGAAGATCTCACCCAGCAAACGGAAGATCTCACCCAGCAAACGGAAGATCTTACCCAGCAGACGCAGGATCTTACACAACAAACAGAAGATCTAACACAACAAAATCAAGATTTTGGACAGCAATCTTTTTGGAGACCAGGTAAATTGGAAGTTGGTAGTCAACAGGTTGAAGATCTCACTCAGCAAACAGAGGATCTTACTCAACAAACGGAAGATCTTGCCCAGCAGACAGAAGGTCTTACTCAGCAATCAGAGGATCTTACCCAACAAACGGAAGGGCTAACCCAGCAAACGGAAGATCTTACACAACAAAATCAGGATTTCGGACAGCAATCTTATTGGAGACCAGGTAAATTGGAAGTTGGTAGTCAGCAGGTTGAAGATCTGACTCAGCAAACAGAGGATCTTACTCAACAAACAGAAGATCTTACTCAGCAAACAGAAGATCTTACTCAGCAATCAGAGGATCTTACCCAACAAACGGAAGGGCTCACCCAGCAAACGGAAGATCTTACTCAGCAGACGGAAGATCTTACACAACAAACAGAAGATCTTACACAACAAAATCAGGATTTCGGACAGCAATCTTATTGGAGACCAGGTAAATTGGAAGTTGGTAGTCAGCAGGTTGAAGATCTGACTCAGCAAACAGAGGATCTTACTCAACAAACCGAAGATCTTACTCAACAAACAGAAGACCTTACTCAGCAAACAGAGGATCTTACCCAACAAACGGAAGGGCTCACCCAGCAAACGGAAGATCTTACCCAGCAGACAGAAGATCTTACACAACAGAATCAGGATTTCGAACAGCAATCTTCTTGGAGACCAGGTAAATTGGAAGTTGGTAGTCAGCAGGTTGAAGATCTTACTCAGCAAACAGAGGATCTTACTCAGCAAACAGAAGATCTTACCCAACAAACGGAAGGGCTCACCCAGCAAACGGAAGATCTTACCCAGCAGACGGAAGATCTTACACAACAAACAGAAGATCTTACACAACAAAATCAGGATTTCGGACAGCAATCTTATTGGAGACCAGGTAAATTGGAAGTTGGTAGTCAGCAGGTTGAAGATCTGACTCAGCAAACAGAGGATCTTACTCAACAAACAGAAGATCTTACTCAGCAAACAGAAGACCTTACTCAGCAAACAGAGGATCTTACCCAACAAACGGAAGGGCTCACCCAGCAAACAGAAGATCTTACCCAGCAGACAGAAGATCTTACACAACAGAATGAGGATTTCGTACAACAATCTTCTTGGAGACCAGGTAAATTGGAAGTTGGTAGTCAGCAGGTTGAAGATCTGACTCAGCAAACAGAGGATCTTACTCAACAAACAGAAGATCTTACTCAGCAATCAGAGGATCTTACCCAACAAACGGAGGGGCTCACCCAGCAAACAGAAGATCTCACCCAGCAGACGGAAGATCTTTCACAACAAACAGAAGATCTTACACAACAAAATCAGGATTTCGGACAGCAATCTTATTGGAGACCAGGTAAATTGGAAGTTGGTAGTCAGCAGGTTGAAGATCTGACTCAGCAAACAGAGGATCTTACTCAACAAACAGAAGATCTTACTCAGCAATCAGAGGATCTCACCCAACAAACGGAGGGGCTCACCCAGCAAACAGAAGATCTCACCCAGCAGACGGAAGATCTTTCACAACAAACAGAAGATCTTACACAACAAAATCAGCATTTCGGACAGCAATCTTATTGGAGACCAGGTAAATTGGAAGTTGGTAGTCAGCAGGTTGAAGATCTTACGCAGCAAACAGAGGATCTTACTCAACAAACAGAAGATCTTACTCAGCAAACAGAAGATCTTACTCAGCAATCAGAGGATGTTACCCAACAAACGGAAGGGCTCACCCAGCAAACAGAAGATCTGACCCAGCAGACGGAAGATCTTACACAACAAACAGAAGATCTTACACAACAAAATCAGGATTTCGGGCAGCAATCTTATTGGAGACCAGGTAAATTGGAAGTTGGTGGTCAGCAGGTTGAAGATCTGACTCAGCAAACAGAGGATCTTACTCACCAAACGGAAGATCTTACCCAACAAACGGAAGGGCTCACCCAGCAAACGGAAGATCTTACACAACAAACAGAAGATCTCACTCAGCAAACAGAGGATCTTACCCAACAAACGGAAGATCTTACACAACAAAATCAGGATTTCGGACAGCAATCTTATTGGAGACCAGGTAAATTGGAAGTTGGTAGTCAGCAGGTTGAAGATCTGACTCAGCAAACAGAGGATCTTACTCAACAAACAGAAGATCTTAGTCAACAAACACAGGATCTTACTCAGCAAACTGAAGATTTTGGTCAACAAACACAGGATCTTACTCAACAAACTGAAGATCTTGGTCaacaaattgatgattttgGTCAACAAACACAGGATCTTACTCAGCAAACTGAAGATTTTGGTCAACAAACACAGGATCTTACTCAACAAACTGAAGATCTTGGtcaacaaattgaaaatcttggtCAACAAACCGAAGATTTTGGTCAACAAACACAGGATCTTACTCAACAAACTGAAGATCTTGGtcaacaaattgaaaatcttggtCAACAAACCGAAGATTTTGGTCAACAAACACAGGATCTTGGTCAACAAACTGAAGATCTTAATCAGCAGACGTTAGGTATTTTCCAAGAAACTGATGGCCAATCACAGCaacctgaaaatttcaatagttGGGGGGAACAGATACCAAGTAGCCCAGGATTTGGACGGGAGTCTTCTTGGAACTCTGACAATCTGGAAATTGGGGGTCAAcaaactgaaaatttcaatcaagaaAATCAATTTGGTGACCAACAAACAATCATCCATCCCGAACAATCGAACAGACCAGCACCAAAGCCTGCACCTAAACCAAGACGTCAAAGACCTGGGAATTTCCATCATACTCAACAGATTAATATAGAGATTGAAGAACCAACTGTATCTAATGCACATAGTCATACAGCGCAACATAACAATCAgcaaaatagtgaaaaatggGTATCAACGGGTGTTCCTCCCATTCCACAAAGAGGTGATCAAGACTCAACCAGCCAGCAAACTACAAGTGGGAATCAATTCAGACAAACTCAACCTACTAAAACTAAGACAAGTCGCCGAAGAGGGAATGACCGTGCTCAATACCAAGGTCCACAAGGATGGCATTCTCGTGATTTGTCGATTAGTCAAGATCCATTGATGAGACTTGTTGATAGACGTATGAACTCAGATGATTCAAACTTGCCCCAATCAGCAAATACTGGGCAAGTTAGACAAGATTTTCAACAACATTTGACTAATCCTAATGAAACTGAACAGCTCGAATCTAGGCAAACAGTTCAACCTCTTGGTGCGGCTATAGAATCAAGACAACGGAGCAGTGATCAATCAGATAGAATTGTCTTTCCCGACTCTTCAGAAGCCTCTTTCAGCCCTAGAATTTTAGAGGCATTTGGGGCGAATGGACCATACGGCGAACATGATTTAGGTATATTTGATTCTGCCAAAAAGTATCCTGACACTACAACAGTTTTAACACCCCCTGAAAATGGAAATGATTGGGATATTCGTGAAGTTGATCGGACAGTTACAACCACAACTGAGGCTCCAACTCCTCTACCAACAACTACTACAACAACACCAGCTCCGCCTCCAACTCCGGCTCCTGGATTTTGGAAAAAACTGGGTAACACCTTCAGTACTACGCTAGAAAAAGCCAAGGAGAAGGCGAGCGGCTGGTTTGGCTAA
Protein Sequence
MAPAAVAIFLISLFVEGLSAPSWCGDCQTWESHRGSQTHRGFGREINQENLSQRSENLEDLTQQAETEFNRSPNQFNFDNTRPGNWTDVNHYRTSDGHGRVYEEQGQRVDGSSRIRFSKRNFTSSYSSGNLGSFGETNLGRIYPNIRQDESQLLNRESLDQSQNSAYDRFATGRNSHTTQDSLHSTERVNSHNDASRYYENLGNSQISKITSGQSVQQGINALDQTRPANWSTVNTFRTNDGNGRVYEERGQFVTGPKRVHFYVRNYTSSYASDGGIPTLDLGGEDTRNVESSIRQQQRLFDSYGRELHQSNEGSTNGGYTQHYPGHYTSPSQTTRQTNYRYVSRPGSYELQNQNALDSNSHQTYQHTSNSGNQHVSQSSSSSFDGLGQVNGRNPNSAYSTGSYTSNSRYNHLGTVEIPSSGHTGHRIPYYNQFQTTSDSSSAAISHPDTDLRTIQSGSDQETQRTLNSHNSFDQTTKNNQKRIYRVQNGQLVTQGIDLGQIAQAPDCAEGTNGYSSYEQSYRRVYRGAPESHDFSQQVQDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQTQDLTQQTEDLTQQNQDFGQQSFWRPGKLEVGSQQVEDLTQQTEDLTQQTEDLAQQTEGLTQQSEDLTQQTEGLTQQTEDLTQQNQDFGQQSYWRPGKLEVGSQQVEDLTQQTEDLTQQTEDLTQQTEDLTQQSEDLTQQTEGLTQQTEDLTQQTEDLTQQTEDLTQQNQDFGQQSYWRPGKLEVGSQQVEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQTEGLTQQTEDLTQQTEDLTQQNQDFEQQSSWRPGKLEVGSQQVEDLTQQTEDLTQQTEDLTQQTEGLTQQTEDLTQQTEDLTQQTEDLTQQNQDFGQQSYWRPGKLEVGSQQVEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQTEGLTQQTEDLTQQTEDLTQQNEDFVQQSSWRPGKLEVGSQQVEDLTQQTEDLTQQTEDLTQQSEDLTQQTEGLTQQTEDLTQQTEDLSQQTEDLTQQNQDFGQQSYWRPGKLEVGSQQVEDLTQQTEDLTQQTEDLTQQSEDLTQQTEGLTQQTEDLTQQTEDLSQQTEDLTQQNQHFGQQSYWRPGKLEVGSQQVEDLTQQTEDLTQQTEDLTQQTEDLTQQSEDVTQQTEGLTQQTEDLTQQTEDLTQQTEDLTQQNQDFGQQSYWRPGKLEVGGQQVEDLTQQTEDLTHQTEDLTQQTEGLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQNQDFGQQSYWRPGKLEVGSQQVEDLTQQTEDLTQQTEDLSQQTQDLTQQTEDFGQQTQDLTQQTEDLGQQIDDFGQQTQDLTQQTEDFGQQTQDLTQQTEDLGQQIENLGQQTEDFGQQTQDLTQQTEDLGQQIENLGQQTEDFGQQTQDLGQQTEDLNQQTLGIFQETDGQSQQPENFNSWGEQIPSSPGFGRESSWNSDNLEIGGQQTENFNQENQFGDQQTIIHPEQSNRPAPKPAPKPRRQRPGNFHHTQQINIEIEEPTVSNAHSHTAQHNNQQNSEKWVSTGVPPIPQRGDQDSTSQQTTSGNQFRQTQPTKTKTSRRRGNDRAQYQGPQGWHSRDLSISQDPLMRLVDRRMNSDDSNLPQSANTGQVRQDFQQHLTNPNETEQLESRQTVQPLGAAIESRQRSSDQSDRIVFPDSSEASFSPRILEAFGANGPYGEHDLGIFDSAKKYPDTTTVLTPPENGNDWDIREVDRTVTTTTEAPTPLPTTTTTTPAPPPTPAPGFWKKLGNTFSTTLEKAKEKASGWFG

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-