Basic Information

Gene Symbol
-
Assembly
GCA_964006375.1
Location
OZ023265.1:141707694-141753055[-]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 26 1.4 6.2e+03 -0.3 0.9 27 40 113 126 94 138 0.70
2 26 0.094 4.2e+02 3.4 0.4 29 59 146 176 141 181 0.63
3 26 5.1 2.3e+04 -2.2 0.3 36 53 184 201 183 210 0.73
4 26 0.098 4.4e+02 3.3 7.8 24 60 204 240 201 243 0.85
5 26 0.24 1.1e+03 2.1 2.3 27 56 257 286 255 296 0.89
6 26 0.21 9.1e+02 2.3 9.5 20 56 320 356 302 365 0.59
7 26 0.27 1.2e+03 2.0 3.0 33 62 377 406 372 415 0.63
8 26 0.00041 1.8 11.0 4.7 26 65 496 535 495 535 0.94
9 26 0.00016 0.71 12.3 1.6 26 59 548 581 546 587 0.90
10 26 6.1e-08 0.00027 23.2 3.6 26 65 590 629 589 629 0.94
11 26 6.4e-05 0.28 13.6 3.8 25 62 641 678 636 681 0.77
12 26 0.00032 1.4 11.3 9.3 30 63 688 721 683 727 0.61
13 26 0.21 9.2e+02 2.3 8.0 30 63 716 749 715 758 0.90
14 26 1.3 5.7e+03 -0.2 0.7 22 40 747 765 745 781 0.63
15 26 0.0044 20 7.7 4.8 30 64 784 818 772 819 0.88
16 26 7.7e-06 0.034 16.5 1.1 23 62 812 851 810 854 0.91
17 26 0.00036 1.6 11.1 7.0 26 62 843 879 841 882 0.84
18 26 4.5e-06 0.02 17.2 4.1 24 63 883 922 880 923 0.94
19 26 0.42 1.8e+03 1.3 7.0 15 53 993 1031 975 1036 0.76
20 26 1.1 4.9e+03 -0.0 9.1 16 55 994 1033 992 1050 0.55
21 26 1.7 7.7e+03 -0.7 8.3 26 64 1025 1063 1022 1068 0.86
22 26 0.0009 4 9.9 4.6 25 64 1088 1127 1087 1128 0.92
23 26 0.048 2.1e+02 4.3 0.6 27 49 1144 1166 1141 1178 0.85
24 26 0.0034 15 8.0 2.9 25 49 1211 1235 1207 1241 0.88
25 26 2.8 1.2e+04 -1.3 5.3 27 63 1365 1401 1357 1402 0.90
26 26 1.2 5.4e+03 -0.2 0.2 32 59 1419 1446 1404 1452 0.74

Sequence Information

Coding Sequence
ATGTCCAATAATCTAGAAGAGGGCGCGCATAGTGCGGAGTTGAGCCGTCTTGCTGTAATTTCTCGGAAAAGTCAGGATCATTGTCATTCGAAAAAATTGCTGCAAGCGTTGGAACTAGTAGCTGTACTCAAGCTACTGAAGAAGCCAGAAGCTTTGGTTCGCGAAACGTCTCGAAACTCACCAAGTGAATTCGACCACGACAAAACAGCCCCGAAACCATTTAGCATTCCTACAGAAATTAGTGATATCACAAATTACGGCAAGCAGTATCATGAGTGGGAAAGCATACAAACACAAGCACCTGAAAAAGAGGATACTTCTCAAAAAGACGATCTCATACGTGAATTGCAAGATAAGCTTCAAAAACTGGAAAAAGAACTCGCTGAAGCCAACGACATTATATGCAAACTAAAAGGTGGTGTCAGTGCACAAGATATATTAGAAAATCAAATCAACCAACTCAAGAACAGAATAAGTGAATTAGAAAAACAACTAGCTGACGCTAATAATACCATACAGAAAATGAAAGGAGATCTAAGTGCCGGTGAAAGAATGAGTGAAGAAAATAGAGCTCTCAGAAGGCATTCTTCTGAAGTGCAAGAAGTAGCTAAGGAGCAAATACAGGAATTAGTGGGCCATTTGCAAAATGCCAAAGATAGAATTTCAGAATTAGAAGCTGAAAATGAACTACTGAAAAAACAACTGCAACAACTACAGCAGGTTCCAATTGGTTACAGCGCACCGGCAGGGTTGCCACCTAGTGATAGTGTTGAGAGGCTTCGAAAGCAACTTGCAGATATTGAACAAGAAAAAGCAAATCTAGAAAAACTAATGCAAGACTTAATGAAAGAACCAGGAACGGCCCCCGAAGAGGAGCTCATGAAACTTAAGATTGATTATGCCAACTTGGAAGCACAGCTGGAAAAAGAGATTCAATTAAGAAAAGCGGCAGAAGACAGGTTAAGGGAATTGGAGGCCAGACTACAAAGTAGTGAAGATATGGACAGACTAATTAAAGAAAACCAGCTTCTTAAAGAAGAGAACGCGAACTTGCAAGATCAGCTTAGAATAGCAGAAGAAAAGATACGAGATCTTGAAAGTGCTATGAAGGGCTCTAGTTCAAAAGAGGAACTTCAAAGAGTATTAGACGAATTAAGTAAACTTAAAGCTGAAAAGGCTAATTTGGAAAGAAAACTAGCCGAGGCAGAAGGCAAACTTGCCGATACAGAAGGCAAACTAGCCGATGCAGAAGGCAAACTGGCCAACGCAGAAGGCAAACTGGCCGACGCAGAAGGCAAACTGGCCGCCACAGAAGGCAAACTGGCTGACATGAAAAGCAAACTTGCCGACACAGAGGGCAAACTGGCTGACACAGAAGGCAAACTTAACGAcgctgtacaaaaaaatttacagGGCAAAGCTGAGTTTGAAGCGATGAGAAATAATGTTGCCGTATTAGGAGGTAGCGCAACCGAATTAGGCAGTCAAAACTTAGCATTGCAAAATAGAGCAGATGAGCTATTAAAGGAAAATAATAAGTCGAAAGCTGAGATCGATAGACTGCAAGCTGAACTTGATGCAATGAGGAAAAAGATGGAGGCACTAGAACGGGACGCTAGCAGTGCAACCGATCTAGGAGCTCAAAAGGCATCTTTACAAAAAGACCTAGACAATACGCAAAAAGAGAATGCCGATCTCAAAGCTGAGTTAGATAAACTGAAGAATCAGCTTGCACAGTTAGGTGGTGGTGCTTCAGAATTAGGTAGTCAGAATTTGGCATTACAAAATAAAGTAGATGATTTACAAAAAGAAAATTCCAGTTTGAAATCTGAGATTGATAGGTTGAAAGATGAAATGGAAGCAATGAAGAAACAGATGTCACAATTAGGCCAGCAAGCCAATAATGCAACTGAACTAGAAGCAAAGAATTCAAATTTACAGAAGGATTTAGACAAAATGCAAAAAGAAAACGCTGATCTGAAAGCAGAATTAGATAAACTGAGAAATCAGCTGGCTCTATTAGGTGGCAGTGCAACAGAACTAGATGGTAAAAACGCTGCATTACAAAAAGATGTAGATCAAGCGCGCAAAGAAAAGGCTGACCTAGAAAACGAAAATAAACGGTTACAAGCAGAACTTGATCAACTAAGGAAAAAACTTGATGAAGCAAAACAGGAGAATGACCTTCTCAAAAAGGACAATGACAACTTAAAGAAAGCTAACGAAGATCTTACAAGCAAGAACAAAAAAGCAAATGATGAGTTAGAGAAGCAGGCGAAGGAAAAAGATGCCAGACAGGAGAAAGGTAGTCTCAAAGAAGATCTGGAAACAAGCAATCAACTTGAATTCCTCCGTTACGAGTCTGAAAAGGCCAAAGCTGAAAACGACAAGCTTCAAgatgaaaataataaagttaaatCCGAACTCAACGCAATGAAACAGGACAATGACAAATTAAAGAAGGACATTGAGAACTACACCAAGGAAAATGCAAATCTCAAGAGTGAAGTAGATAATCTCAAGAAAGATATTAGTAACTTAAAGGATGACAATGATAATCTTAAGAAAGAAAATGAGCGCTTAAGAAAAGAGAATGAGAAccagaaaaaagaaatcgaagaTCTGAAGAAAGAAATTGATAATCTTAAAAAAGAAATTGATAATCTTAAGAAAGAAATTGAAGATCTCAAAAAGAAGAACGGTGACCTCCAGAAGGAAGCTGAAGGTTTGCAAAATGAGGTTTCAAGCCTCTTAGACAAGTTGAAAACGCCTGGTGTATCTCCACAGGATATGTTTAAGGATGATTTAGACAAGGAGCGCAGAGAAAAAGCAGCACTTCAAAGCCTTATCGACGATCTTCAGAACCAACTAAAGAATGCACCAAAAGAAATAGTAGTTCCTGGAGCACCCGATGAAATTAACAAAGCTCTTGAAGAcatggctaaaaaattaaatgacgAACAGCAAAGGGCTGATGCAGCGGAAGAAGCACTCAAGAAACACAAATTAGAAGCAGAGGCATTGAAAAATGAAGTTGGAGAATTGAAGAAACTAATAGACGATCTGACAGCAGAGAAAGACAAATTACAGAAGCTGCTTGCTGAAGCCAATGAAGACAAGGCGAAGCTGCAGGCTAAGTTGGACGATGCGCTCAAGGAATTAGAAAAATTGCGAAAGGAGTTGGCAGCTTATAAAGATGAACCTGTGCCTATAGTGTCTGAGAAAGCAGTAGACGTGAGAGCTGAAGAACCGGATTACGAACAGAAAATCAAAGACTTATTGAAACAATTAGAAGACCTGAAAAAGGAAAAGGATAACTTGGCGGGTGAACTCGAAAGACTGAAAGGTGAAAAAGGAGAtttggaagacaaaataaaagcCTTACAGGAAGCATTAGACCAAGAGCGCGCAGGAGACCAAGATGCAATCAAAGAAGGTCTTCAGAAACGTATTGACGAATTGGAAGCCTTACTAGCGAAACTGAAAAAAGAACACCAAGATACAATAGCGTCCCTCAAGGATCAGTATGAAAGAGATTTGAAAAATCTACACGAAAGAAGTGAAACTGGAATTGAGAAGCTACAAGGTTCTCAGGACGAGCAAATGAAAGATATGATTAAGTACTATGAGGATCAGATCAATGCACTTAAAGAACAGCTAAAGGCTTTGGAAAAAGAGAATGCAGAGTTGAAGAAGAAAGAGGCCAAACCAGAGGTCAAAGCAGAGGTCAAACCAGTGGTCAAGTCGGAGGACAAGGGCTTAGAGATGGCCACTTTCCAAAACGTCTGCATTTGCTCACATTTCCCAACATTGATCCCTATAATGCCACCATTGTCCGAATCGGACATGGCAGTGGTTGCACGTTCTGCCCCATGTACTGCTACTTCTACTCAGGACGACGATAGCTTCCACGCTCTCTTGAAAAAAATATCAGAACAGGGAATGGAGTCCCTGTCTTACCAGGAACTTCACATACTGCACACTAGAACATGTAACGCCACAGCTCGTGTATTACAGGCCAACGGGAAGCAGCTGGCGCCGTGTGCTTCAGGGGACGAAAAACTGGAGCTGGTAAGGCGGATATCCACTTTGGAGAGTGAGCTGATGAGGAAGCAGAAGCATGCACAGCAGAAAGTTACTGCCTTACAAATGAGCATCCAGCTAGAAAAGAAAAAACTAGAAGAGTTGAAAAGAACGTTAGAAATAGAAAAACAGAAGAATTCAGATTTACAGAGCAAAATAGGTCTTCACACAAAAGCCGTTGTTAATTTGCAAGTAGAGAGAGACCTCCTGCATCGCCAGAATAGTCACCATGAAGAAAAGTTAGACCATTATATGCGTctggttgaccaagaaagggatAGAGTGAAGAGACTTCAGGATGAACTTGAAAGGGAGCGACATCAAAGTGACCATCTGGCATCTCAATTGGAAAGAGCTAGAGCTAACAAGAAAAAAGACACGGAATCCCTAAAAGTATGTGACCTTAATGTATTTTTCCGCAATATTTCTTTGTTTGTTTCAACATTCCCCTTTTGGTTTCAGTCAATGAAAGGCATGCTGGAGAAGACATTGAACAATGAAATGAAATTGAGGATGCAATTGGAGGATAGAGAAGGAAGTAAACCTAAGATACAGGGACGAGGTCAGTGTCCGTGTATGCTTGGAAATAGCATGGCTCCCAGTAAAGACATTAGCATCACATCGGGCCAGAAAAAGACAAGGACATGTGCTTTTCGATCGGACTTTGCAATGCCagtaaatataattttgattCGATTAATTAACAATATTACACTTTTCTATCATATCGTGGCCAGCATCCACGTCATCGCAAGGACTACCTGTGCGTTATCCACGGCCACGTAA
Protein Sequence
MSNNLEEGAHSAELSRLAVISRKSQDHCHSKKLLQALELVAVLKLLKKPEALVRETSRNSPSEFDHDKTAPKPFSIPTEISDITNYGKQYHEWESIQTQAPEKEDTSQKDDLIRELQDKLQKLEKELAEANDIICKLKGGVSAQDILENQINQLKNRISELEKQLADANNTIQKMKGDLSAGERMSEENRALRRHSSEVQEVAKEQIQELVGHLQNAKDRISELEAENELLKKQLQQLQQVPIGYSAPAGLPPSDSVERLRKQLADIEQEKANLEKLMQDLMKEPGTAPEEELMKLKIDYANLEAQLEKEIQLRKAAEDRLRELEARLQSSEDMDRLIKENQLLKEENANLQDQLRIAEEKIRDLESAMKGSSSKEELQRVLDELSKLKAEKANLERKLAEAEGKLADTEGKLADAEGKLANAEGKLADAEGKLAATEGKLADMKSKLADTEGKLADTEGKLNDAVQKNLQGKAEFEAMRNNVAVLGGSATELGSQNLALQNRADELLKENNKSKAEIDRLQAELDAMRKKMEALERDASSATDLGAQKASLQKDLDNTQKENADLKAELDKLKNQLAQLGGGASELGSQNLALQNKVDDLQKENSSLKSEIDRLKDEMEAMKKQMSQLGQQANNATELEAKNSNLQKDLDKMQKENADLKAELDKLRNQLALLGGSATELDGKNAALQKDVDQARKEKADLENENKRLQAELDQLRKKLDEAKQENDLLKKDNDNLKKANEDLTSKNKKANDELEKQAKEKDARQEKGSLKEDLETSNQLEFLRYESEKAKAENDKLQDENNKVKSELNAMKQDNDKLKKDIENYTKENANLKSEVDNLKKDISNLKDDNDNLKKENERLRKENENQKKEIEDLKKEIDNLKKEIDNLKKEIEDLKKKNGDLQKEAEGLQNEVSSLLDKLKTPGVSPQDMFKDDLDKERREKAALQSLIDDLQNQLKNAPKEIVVPGAPDEINKALEDMAKKLNDEQQRADAAEEALKKHKLEAEALKNEVGELKKLIDDLTAEKDKLQKLLAEANEDKAKLQAKLDDALKELEKLRKELAAYKDEPVPIVSEKAVDVRAEEPDYEQKIKDLLKQLEDLKKEKDNLAGELERLKGEKGDLEDKIKALQEALDQERAGDQDAIKEGLQKRIDELEALLAKLKKEHQDTIASLKDQYERDLKNLHERSETGIEKLQGSQDEQMKDMIKYYEDQINALKEQLKALEKENAELKKKEAKPEVKAEVKPVVKSEDKGLEMATFQNVCICSHFPTLIPIMPPLSESDMAVVARSAPCTATSTQDDDSFHALLKKISEQGMESLSYQELHILHTRTCNATARVLQANGKQLAPCASGDEKLELVRRISTLESELMRKQKHAQQKVTALQMSIQLEKKKLEELKRTLEIEKQKNSDLQSKIGLHTKAVVNLQVERDLLHRQNSHHEEKLDHYMRLVDQERDRVKRLQDELERERHQSDHLASQLERARANKKKDTESLKVCDLNVFFRNISLFVSTFPFWFQSMKGMLEKTLNNEMKLRMQLEDREGSKPKIQGRGQCPCMLGNSMAPSKDISITSGQKKTRTCAFRSDFAMPVNIILIRLINNITLFYHIVASIHVIARTTCALSTAT

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-