Basic Information

Gene Symbol
-
Assembly
GCA_947538915.1
Location
OX384535.1:4071667-4077516[+]
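
The location string above appears to follow a sequence:start-end[strand] layout. The following is a minimal Python sketch for splitting it into fields. It assumes 1-based inclusive coordinates, which the record does not state, and the helper name `parse_location` is hypothetical.

```python
import re

# Pattern inferred from the "seqid:start-end[strand]" layout seen in this record;
# it is an assumption, not an official format definition.
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text):
    """Split a location string into sequence ID, start, end, strand, and span."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start = int(match.group("start"))
    end = int(match.group("end"))
    return {
        "seqid": match.group("seqid"),
        "start": start,
        "end": end,
        "strand": match.group("strand"),
        # Span length under the assumed 1-based inclusive convention.
        "span": end - start + 1,
    }


location = parse_location("OX384535.1:4071667-4077516[+]")
print(location["seqid"], location["strand"], location["span"])
```

If the coordinates turn out to be 0-based or half-open, only the `span` calculation needs to change.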

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins form homo- or heterodimers that contain highly basic DNA-binding regions adjacent to α-helical regions that fold together as coiled coils.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 19 0.00047 0.53 11.1 6.8 28 64 583 619 580 620 0.91
2 19 0.13 1.4e+02 3.3 0.3 32 54 657 679 653 686 0.76
3 19 0.02 23 5.9 0.7 32 56 699 723 695 728 0.78
4 19 0.00065 0.74 10.7 5.6 25 63 741 779 740 781 0.86
5 19 0.5 5.6e+02 1.4 0.3 32 55 804 827 800 833 0.79
6 19 0.017 19 6.2 0.4 32 56 846 870 842 876 0.78
7 19 0.017 19 6.2 0.4 32 56 888 912 884 918 0.78
8 19 0.00058 0.65 10.8 3.5 32 63 930 961 926 967 0.84
9 19 0.024 27 5.7 0.3 32 58 986 1012 982 1017 0.84
10 19 0.031 35 5.3 2.0 32 60 1028 1056 1024 1059 0.73
11 19 3.1 3.5e+03 -1.1 0.1 34 47 1072 1085 1069 1098 0.64
12 19 0.0058 6.6 7.6 4.0 25 64 1138 1177 1131 1178 0.92
13 19 0.013 14 6.5 4.0 25 61 1166 1202 1159 1206 0.71
14 19 0.0075 8.5 7.3 3.2 26 64 1195 1233 1192 1234 0.86
15 19 0.016 19 6.2 4.4 25 61 1236 1272 1229 1276 0.71
16 19 0.013 14 6.5 6.1 25 61 1278 1314 1271 1318 0.68
17 19 0.016 18 6.2 7.4 25 58 1320 1353 1313 1357 0.54
18 19 0.00051 0.58 11.0 7.9 25 65 1403 1443 1402 1443 0.95
19 19 0.012 14 6.6 4.1 25 49 1424 1448 1423 1466 0.84
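
Each row of the table above holds 13 whitespace-separated values per domain hit. The following is a minimal Python sketch for reading such rows and keeping the hits below an i-Evalue cutoff. The field names in `DomainHit` are labels chosen for this sketch, the three sample rows are copied from the table, and the cutoff of 1.0 is only an illustrative assumption, not a recommended threshold.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    index: int       # "#": ordinal of this domain hit
    total: int       # "of": total number of domain hits
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float     # bit score
    bias: float      # bias correction
    hmm_from: int    # start coordinate on the profile HMM
    hmm_to: int      # end coordinate on the profile HMM
    ali_from: int    # alignment start on the protein
    ali_to: int      # alignment end on the protein
    env_from: int    # envelope start on the protein
    env_to: int      # envelope end on the protein
    acc: float       # mean posterior accuracy of the alignment


def parse_hit(line: str) -> DomainHit:
    """Convert one whitespace-separated table row into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]), float(fields[4]), float(fields[5]),
        *(int(value) for value in fields[6:12]),
        float(fields[12]),
    )


def filter_hits(hits: List[DomainHit], max_i_evalue: float) -> List[DomainHit]:
    """Keep hits whose independent E-value is below the given cutoff."""
    return [hit for hit in hits if hit.i_evalue < max_i_evalue]


# Three rows copied verbatim from the table (hits 1, 2, and 4).
SAMPLE_ROWS = [
    "1 19 0.00047 0.53 11.1 6.8 28 64 583 619 580 620 0.91",
    "2 19 0.13 1.4e+02 3.3 0.3 32 54 657 679 653 686 0.76",
    "4 19 0.00065 0.74 10.7 5.6 25 63 741 779 740 781 0.86",
]

hits = [parse_hit(row) for row in SAMPLE_ROWS]
for hit in filter_hits(hits, max_i_evalue=1.0):
    print(hit.index, hit.ali_from, hit.ali_to, hit.i_evalue)
```

Named fields keep the three coordinate pairs (hmm, ali, env) from being confused with one another when the rows are processed further.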

Sequence Information

Coding Sequence
ATGGCTCCCGCGGCGGTAGCAATATTTTTAATCTCTCTGTTCCTTGAGGCATTCTCGGCTCCGAGTGGGTGCGGCGATTGCCAAACATGGGAATCGCACAGTGGCTCTCAATCCCAGAGAGGATTTAGCAGGCACACAAATCGAGACGATTTATCTCAGAGATCAGGAAACCTGGAAGATTTAACACAACAAGCCGAATTCGAGCTAAACAGATCCCACAATCAATTAGCTTTTGACGATACAAGGCCTGGAAATTGGACTGACGTGAATCATTACGGAACATCCGATGGTCACGGAAGAGTATATGAGGAACAAGGCCAGCGGGTGGATGGACCGACTCGAATTAGATATATGAGGAAAAATTTTACTTCCAGTTGGAGCAGCGGAGGCTTAGGTTCCTTCGGAGAAACTAATTTGGGACGTATATATCCTCACGTTAGTCAAGACGCCAGCCAGTTATCTAACCGCGAAGCTTTAGATCAAACGCAACATTCAGCTTATGATCAATTTGCCATCCGAAGAAATTCTCACAACACGCAAGACTCTTTACATTCTACAGAAAGGGTTAACAGCCGCAACGATGCATCCAGATATTCCGAAACTCATGGCAGTAGCGGTCGGTTTAGACAAACTAGTTCTGACCAATCAGCACAGCAAGGACTAAACATGTTGGATCAAACAAGACCAGGAAATTGGAGCACAGTTAACACTTATAGAACTGATGGGGGTAATGGCAGGGTTTATGAAGAACGAAGGCAAATTATAACAGGACCGAGGCAGGTTCTTTCCTATAGAAGAAATTATACCTCGAGTTACAGCTCTGGCGGAGATATTCCAGCTTTTGGTGCAGAAAGCGACGAAACACGGAATATCGAAAGCAGCGTTCAGCAGCAACAAAGACAATTTGATAGTTACGGAAGAGAGCTTCATGAAACTACTGGTGGTTCGATAAATGGTGGTTACACTCGGTACTATTCTGGACATCATACAACGCCTAGCCAAACGACGGGTCAAACAAACCACAGATATGTATCAAGGCCTAGTAACTACGAGCCTCTACATTCAAATTCTCATCAAACGTACCAGCACACAACTGGCTTTGGAAATCAGCATGGGTCTCAGTCTAGCCGTAGTTCTGTTAGTAGAACGGGACAAGTTAGCGAAAGAATTCCAATTTCTGGAGGTTCAACAAGTAATTATGCTAGTAGTAGTTCATACAATACTGAACAGTCTGGAAGATTGCCTGACTTGGCATCACAAACACGGCAGATTGCTTATGGTCGATATTCGGACGCAGATTATATGCGTATTTCTGGATCGCAAGATTCGAGCCAGAAGGAGATTTCTAGTAGTAATTCAAATAATCAACAAGAATCTTATTACACTGGGTCACATGGTAGTGAGAGACCTTTGGGCCAAATTCGGCATTACGATCAATTTCAAACCTCCTCCACGTCCGGCTCCCGCGCTGCTTCCTCAATAAGTCGCCCTGACATGGATGTGACGACTATTCAAGCTGGTAGTAATCAAGGAGCACAGCGTAGGTTCAATACACAAAATAGCTTGGAGCAAACTGTCAATGATAACTATGACCGGAGTTATGGGGTACAAAGTGGTCATCTAATTACACAGGGAATTGATTTAGGACAAATGTCACAAGTTCCTGATTGTGCAGAAGGTACTAGTGGACATAGCTCATATGAACAATCCTACAGCCATAGAGTCTATAGAGGAGCTACCGAACCTCAGCATCTTACTCAACAAGTAGAGGATCTTACCCAACAAACAGAGGATCTGACCCAACAAACACAGGATCTCACCCAACAAACAGAGGATCTTACCCAACAAACACAGGATCTTACCCAACAAACAGATGATCTTACCCAGCAGACCGAAGATTTTACACAACAAGGTCAAAATTTCGGTCAACGACCATCTTCGAGACCCGGTAAATTGGAAGTTGGAAGTCAACGAGTCGAAGATCTGACTCAACAAACTCA
GGATCTTACCCAGCAAAACGTAGATTTCACACAACAAAGTCAAAATTTCGGTCAGCGACCATCTTCGAGACCCGGTAGATTGGAAGTTGGAAGTCAACAAGTCGAAGATCTGACTCAACAAACTCAGGATCTTACCCAGCAAAACGAAGATTTTACGCAACAAAGTCAAGATTTTGGTCAGCGACCATTTTCGAGACCCGGTAAATTGGAAGTTGGAAGTCAGCAAGTCGAAGATCTGACTCAACAAACACAGGATCTTACCCAACAAACAGAGGATCTTACCCAACAAACACAGGATCTTACCCAACAAACAGATGATCTTACCCAGCAGACCGAAGATTTTACACAACAAGGTCAAAATTTCGGTCAACGACCATCTTCGAGACCCGGTAGATTGGAAGTTGGAAGTCAACAAGTCGAAGATCTGATTCAACAAACTCAGGATCTTACCCAGCAAAACGTAGATTTTACACAACAAAGTCAAAATTTCGGTCAGCGACCATCTTCGAGACCCGGTAGATTGGAAGTTGGAAGTCAACAAGTCGAAGATCTGACTCAACAAACTCAGGATCTTACCCAGCAAAACGAAGATTTTACGCAACAAAGTCAAGATTTCGGTCAGCGACCATCTTCGAGACCCGGTAGATTGGAAGTTGGAAGTCAACAAGTCGAAGATCTGACTCAACAAACTCAGGATCTTACCCAGCAAAACGAAGATTTTACGCAACAAAGTCAAGATTTCGGTCAGCGACCATCTTCGAGACCCGGTAAATTGGAAGTTGGAAGTCAGCAAGTCGAAGATCTGACCCAACAAACAGAGGATCTTACCCAACAAACACAGGATCTTACCCAACAAACAGATGATCTTACCCAGCAAAACGAAGATTTTACACAACAAAGTCAAGATTTCGGTCAGCGACCATCTTCGAGACCCGGTAAATTGGAAGTTGGAAGTCAACAAGTCGAAGATCTGACTCAACAAACTCAGGATCTTACCCAGCAAAACGTAGATTTTACACAACAAAATCAAAATTTCGGTCAGCGACCATCTTCGAGACCCGGTAAATTGGAAGTTGGAAGTCAGCAAGTTGAAGATCTGACTCAACAAACTCAGGATCTTACCCAGCAAACAGCGGATCTGACACAACAAAGTCAAGATTTCGAACAGCAATCTTCTTGGAGACCAGGTCAATTGGAAGTTGGTAGTCAGCACGTTGAAGATCTCACCCAGCAAACGGAAGATCTTACTCAACAAACAGTAGAAGGAATGCAACAAGATTATTCCCCACTTCGGAACCACGAACATTGGCAAACTGATGACCCAAATTATGCACCTCTGCCAATTGAGCATAAAGAAGTTGAGAGGCCGCAAAACTTAGGTATTGCTAATCAACAGCCCAAGGATCTTACACAACAAACACAGGATCTGACTCAACAAACAGACGATTTTACTCAACAAACGGAAGATCTTGGCCAACAAACCGAGGATCTCACTCAACAAACGGAAGATCTCGGCCAACAAACCGAGGATCTCACTCAACAAACGGAAGATCTTGGACAACAAACCGTGGATCTCACTCAACAAACGGGAGATCTTGGCCAACAAACCGTGGATCTCACTCAACAAACGGAAGATCTTGGCCAACAAACCGAGGATCTTACGCAACAAACGGAAGATCTTGGCCAACAAACCGAGGATCTCACTCAACAAACGGAAGATCTCGGCCAACAAACCGAGGATCTCACTCAACAAACGGAAGATCTTGGACAACAAACCGTGGATCTCACTCAACAAACGGGAGATCTTGGCCAACAAACCGTGGATCTCACTCAACAAACGGAAGATCTTGGCCAACAAACCGAGGATCTTACGCAACAAACGGAAGATCTTGGCCAACAAACCGAGGATCTCACTCAACAAACGGAAGATCTCGGCCAACAAACCGTGGATCTCACTCAACAAACGGAAGATCTTGGCCAACAAACCGAGGATCTCACTC
AACAAACGGAAGATCTCAGCCAACAAACCGAGGATCTCACTCAACAAACGGAAGACCTATCGCAGCAAGGTATTGTACCAGCTGACCCTCCACTTTGGCACCATGAAACATCTCCGATTATCCGCCCAGATTATGAACCACCCACATTATCCCCGTCAAGTGTTCACCAAGAAGCTGAACCTCCCAAAATCGCAGATATCACTAGTCAACAGACAGAGAATCTTGCTCAACAAACGGTAGATCTTGGTCAAGAAAATGAGGATCTCACTCAACAAACGGAAGATCTTGGACAACAAACTGAGGATCTTACTCAACAGAATGAAGAATTATCACAGCAAATTCAAGGACAAGTAGAGGTCGGAAATGAACAAACAGAAAACCTTAACCAGCAAACAGAAGGTTTCGGTCTAGAAACTGGTGGCCAAGTACAACAAACTGAAAATATTGATTACGACGGGGGTCAAATAACAAGCAATTCAGGATTTGGACAGCAAAATTCTTGGAACTTTCAGAACCTAGAAGATTCAAGTCAACAAACAGAAGGTTTAAATCAGCAGACACAAGGTTTTGGTCAAGAAGCCAATGGCCAAATACAACAAACTGAAAATATTGCATATGACCAGCAGACTTCTTGGAACTCTCAGCACTTGGACAATGCAGGTCAACAGACTGAAACTTTTAATCAAGAAAATCAATTTGGCGGACAACAACCATTCATCTATCCTGGACAAGCAACAGAACGAGCACGAAAACCTGCACCAAAACCAAGACGTCCAAGACCCACGAATTCCCATCGCACTCAACAGATTAATATAGAGATGGAAGAACCAACTGTATCTAATGCCGAAAGTCATACGCAGCAACATAATAATCAGGGAAATAATGAAAAATGGACATCAACAAATGTTCCTTCCACTCCACAAAGAGGTGATCAAGGTATTAACGCAAACTCTAACGAATCAGAAGAAACAAGTATTCAAATTGAATCAGAGATACCCAAGGTGCCCGAACATCAAGTAGAATATGTCTCGACATACCCGGATTCATCTAGCCAGGAAGCCAGACGTGAAAATCAATTCAGACAAACTCAACCTACTAAAACTAAGACAAGTCGCCGGGGAGGGCATCCAGGTGCGCAATACCAAGGTCCACAAGGATGGCATTCTCCTCGTGACTCGCAAGTTAGTCAAGAGCCGACTATGATATTTATCGATAGACGTCAACTTGGGCACAAAAACTCAGGTGATTCAAGCTTCCAGCAATCAACAAGCACCGGGCAAGTTACAGAAGATCTTCAACAACATTTCACTAGCATTGAAAAAGGGGATCGGCTTGAATCTGCGCAAACAGTTCAAACAGTTCAACCTCTTGGTGCGGACATAGAATCTAGACGAGCGCAAAGTAGTCAGTCAGATAGAATAGTCTTTCCCGACTCTCCGGAAATCTCTATCAAACCTCGAATCTTAGAGGCATTCGGAGCTAAAGGACCATATGGAGAACATGATTTGGATATATTTGATTCTGCCAAACCAAATACTGACCCTGTCGTCTTAACACCACCCAGAGAGGGCAATGATTGGGATATTCGTGAGGTTGATTCAAGAGTTACAACCACAACTGAGATCCCAACTACTTTGCCATCAACAACAACAACATCAACAACACCACTTCCTCCGCCTCCACCTCCGCCCACTGCAGCTCCTGGATTTTGGAAAAAGTTTGGTAACACTTTGGCTAGCACTGTAGACAAAGCCAGGGATAAGGCAAGAGACTGGTTTGGCTAA
Protein Sequence
MAPAAVAIFLISLFLEAFSAPSGCGDCQTWESHSGSQSQRGFSRHTNRDDLSQRSGNLEDLTQQAEFELNRSHNQLAFDDTRPGNWTDVNHYGTSDGHGRVYEEQGQRVDGPTRIRYMRKNFTSSWSSGGLGSFGETNLGRIYPHVSQDASQLSNREALDQTQHSAYDQFAIRRNSHNTQDSLHSTERVNSRNDASRYSETHGSSGRFRQTSSDQSAQQGLNMLDQTRPGNWSTVNTYRTDGGNGRVYEERRQIITGPRQVLSYRRNYTSSYSSGGDIPAFGAESDETRNIESSVQQQQRQFDSYGRELHETTGGSINGGYTRYYSGHHTTPSQTTGQTNHRYVSRPSNYEPLHSNSHQTYQHTTGFGNQHGSQSSRSSVSRTGQVSERIPISGGSTSNYASSSSYNTEQSGRLPDLASQTRQIAYGRYSDADYMRISGSQDSSQKEISSSNSNNQQESYYTGSHGSERPLGQIRHYDQFQTSSTSGSRAASSISRPDMDVTTIQAGSNQGAQRRFNTQNSLEQTVNDNYDRSYGVQSGHLITQGIDLGQMSQVPDCAEGTSGHSSYEQSYSHRVYRGATEPQHLTQQVEDLTQQTEDLTQQTQDLTQQTEDLTQQTQDLTQQTDDLTQQTEDFTQQGQNFGQRPSSRPGKLEVGSQRVEDLTQQTQDLTQQNVDFTQQSQNFGQRPSSRPGRLEVGSQQVEDLTQQTQDLTQQNEDFTQQSQDFGQRPFSRPGKLEVGSQQVEDLTQQTQDLTQQTEDLTQQTQDLTQQTDDLTQQTEDFTQQGQNFGQRPSSRPGRLEVGSQQVEDLIQQTQDLTQQNVDFTQQSQNFGQRPSSRPGRLEVGSQQVEDLTQQTQDLTQQNEDFTQQSQDFGQRPSSRPGRLEVGSQQVEDLTQQTQDLTQQNEDFTQQSQDFGQRPSSRPGKLEVGSQQVEDLTQQTEDLTQQTQDLTQQTDDLTQQNEDFTQQSQDFGQRPSSRPGKLEVGSQQVEDLTQQTQDLTQQNVDFTQQNQNFGQRPSSRPGKLEVGSQQVEDLTQQTQDLTQQTADLTQQSQDFEQQSSWRPGQLEVGSQHVEDLTQQTEDLTQQTVEGMQQDYSPLRNHEHWQTDDPNYAPLPIEHKEVERPQNLGIANQQPKDLTQQTQDLTQQTDDFTQQTEDLGQQTEDLTQQTEDLGQQTEDLTQQTEDLGQQTVDLTQQTGDLGQQTVDLTQQTEDLGQQTEDLTQQTEDLGQQTEDLTQQTEDLGQQTEDLTQQTEDLGQQTVDLTQQTGDLGQQTVDLTQQTEDLGQQTEDLTQQTEDLGQQTEDLTQQTEDLGQQTVDLTQQTEDLGQQTEDLTQQTEDLSQQTEDLTQQTEDLSQQGIVPADPPLWHHETSPIIRPDYEPPTLSPSSVHQEAEPPKIADITSQQTENLAQQTVDLGQENEDLTQQTEDLGQQTEDLTQQNEELSQQIQGQVEVGNEQTENLNQQTEGFGLETGGQVQQTENIDYDGGQITSNSGFGQQNSWNFQNLEDSSQQTEGLNQQTQGFGQEANGQIQQTENIAYDQQTSWNSQHLDNAGQQTETFNQENQFGGQQPFIYPGQATERARKPAPKPRRPRPTNSHRTQQINIEMEEPTVSNAESHTQQHNNQGNNEKWTSTNVPSTPQRGDQGINANSNESEETSIQIESEIPKVPEHQVEYVSTYPDSSSQEARRENQFRQTQPTKTKTSRRGGHPGAQYQGPQGWHSPRDSQVSQEPTMIFIDRRQLGHKNSGDSSFQQSTSTGQVTEDLQQHFTSIEKGDRLESAQTVQTVQPLGADIESRRAQSSQSDRIVFPDSPEISIKPRILEAFGAKGPYGEHDLDIFDSAKPNTDPVVLTPPREGNDWDIREVDSRVTTTTEIPTTLPSTTTTSTTPLPPPPPPPTAAPGFWKKFGNTLASTVDKARDKARDWFG
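
The protein sequence should correspond to a translation of the coding sequence. The following is a minimal Python sketch of that check. It assumes the standard genetic code (NCBI translation table 1), which the record does not state, and the function name `translate` is hypothetical. The demonstration uses only the first 21 nucleotides of the coding sequence above.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, to_stop=True):
    """Translate a coding sequence in frame 1; unknown codons become 'X'."""
    # Remove whitespace so that sequences wrapped across lines are handled.
    seq = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[i:i + 3], "X")
        if amino_acid == "*" and to_stop:
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the coding sequence in this record.
print(translate("ATGGCTCCCGCGGCGGTAGCA"))  # MAPAAVA
```

Comparing `translate(full_cds)` with the full protein sequence would confirm that the two fields agree, provided the assumed genetic code is the one used for this record.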

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01411235;
90% Identity
-
80% Identity
-