Basic Information

Gene Symbol
-
Assembly
GCA_963932425.1
Location
OZ010626.1:1642212-1647653[-]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 18 0.00022 0.15 12.1 2.8 21 60 590 629 590 632 0.92
2 18 7.8 5.5e+03 -2.5 3.0 43 43 672 672 641 702 0.47
3 18 0.17 1.2e+02 2.8 5.0 25 60 755 790 753 794 0.74
4 18 0.00093 0.65 10.1 3.6 25 61 811 847 810 849 0.83
5 18 0.11 79 3.4 7.7 28 56 863 891 855 904 0.66
6 18 0.012 8.4 6.5 8.3 32 64 896 928 889 941 0.49
7 18 0.65 4.6e+02 0.9 8.7 25 58 959 993 951 998 0.54
8 18 0.004 2.8 8.0 7.4 27 62 990 1025 987 1033 0.83
9 18 0.44 3.1e+02 1.5 0.3 32 50 1044 1062 1039 1069 0.59
10 18 0.0005 0.35 10.9 5.0 28 63 1082 1117 1078 1124 0.65
11 18 0.00037 0.26 11.3 4.1 30 63 1140 1173 1136 1180 0.61
12 18 0.0005 0.35 10.9 5.0 28 63 1194 1229 1190 1236 0.65
13 18 0.0005 0.35 10.9 5.0 28 63 1250 1285 1246 1292 0.65
14 18 0.0005 0.35 10.9 5.0 28 63 1306 1341 1302 1348 0.65
15 18 0.0005 0.35 10.9 5.0 28 63 1362 1397 1358 1404 0.65
16 18 0.0029 2 8.5 8.8 25 60 1422 1457 1414 1462 0.60
17 18 0.014 9.8 6.3 9.3 26 63 1458 1495 1449 1497 0.73
18 18 0.0026 1.8 8.6 7.9 25 61 1485 1521 1484 1532 0.74

Sequence Information

Coding Sequence
ATGGCTCCTGCGGTGGTGGGGATTTTCATAATCTCACTATTAGCTGAATCGTATTCTGCTCCATATAAGTGTCTTGATTGCCAAACCATCAATAGACAGAGCGATTATCGAACAAGTAGCGGATCTGGTAGATGGGTTGATCAAGACAACTTATCGCAAGTAGCACAGAATTTGGAAGATCTAACCGGGTCTCAAAATCGAATAGCCTTTGATGATTCGAGACCTGGAAACTGGACGGATGTTAATCGTTACAGAACGCCTGATGGCCATGGCCAAGTTTTTGAAGAAGAAGGCCAACGTGTCGACGGGCCAACGCGGCTCAAGTTTTATAGAAGAAACTATACTTCTAGCTACAGCAGTGGAGGAAACAACGGTGACCTCGGAACACCTGATCTGGGAAGTTTTGATTCCTGGGACAGAAGCAGACAGTCAGTGAATCAAGGTGGTTTTGCTCAGGGGCAACAGTCAAATTACGATCGATCTGCCACACGAGAAAATTCCTATGCCTCACATGGCTCTCAGCAATCTTTAGATGGATTTGACAGCCAAACTGGCGCATTCCGTGGTCAACAAAATCGTGGAAGTGGCGGTCGTCGTGGAGAAAATGCATCAAGATTCAATGGTCAATCTGTGGAACGTTATCAAGATCGTCAGAGTACTAATCAATTTGCAGATAGGGAATCCGAATTCATTGGTCAGTCTACGCAACAAGGATTGTCTTCACTAGATCAACGAAGACCAGGAAATTGGACCGAGGTTGATAGTTACAGAACCGATGGTGGTCGTGGTCGAGTTTACGAAGAACAAGGACAATTTGTTACAGGACCAAAGAGAGTCCGTTTCTACAAAAAGAATTACACTTCAAGCTACAGCTCGGATGGGAGCACATTGCCCACTAATCTAGGATTTACTGGCCAGGATGATTTTGAAAGACAATTTCAACTGGCACAAAGAGAATTCGGCAGCGTAGGAAGCGGCATTCATAGATCTACTCAAGATTCAGTAACTGACAACTATGCACAGCGTACTTCCGGCCATCAGGCTAATGGTTTAAATACTCAGCAAACTGAACAGTATGTGAGGGgattcgaaaatcaacatGTTTCTCGTGGTACTCCAGGTTCAGTACCTGACAATTACGGCCATGGTTTAAATACTCAGCAAACTGAACAGTATGTGAGgggatttgaaaatcaacatGTCTCTGGTGGATCGGTTAGAGGAAATGAACGCGGTCACAGTAACGGATATAATAATCGAGGAACACATGGCATGATAACACAAGAATCGGTAAGCACTAGACCCATTTATACAGGGTCAGAGACACGACAATCTTACTACAATCGTCTCACGCAGTCTGAACAAGCAGAATATAGGCGTGTATATGGAACTGAGGGTTTGGACCAAAGCCAAATAACACGGGGTAATGTAAATAATGCCCAACAGTCTTATCACACTGGATCATATGACACTGGAATGCGACCAGGCCAATTACAATATTACGATCAATTCCAATCCACATCGGGGTCGACTTATCATCCTCAAATTGATAGTTCTCAATATGTCGGTAGTAGTCAAGGAAGTCAACGAACGCACAACGCCAATTTTGATCAAACGAGGAATAGAAATTATGGTGGGTCTCAGAGAGTTCAATCAGGACAACTAGTGACGCATGGAATTGATTTGGGACAAATTGCTCAAGGCCCTGATTGTATGGATGGCGGAAACGGTTTTACGACGTACGAAGAATCGCGCTACCATAAACTAAATAGACGCGATGAAGAGTCTGACAGTTTGTCTCAACAAACGGAAGATCTTACTCAACAAACGGAAGATCTTACTCAACAAACCGAGGATCTTACTCAGCAAACACAAGACTTTGGGCAGGAGTCTTCATGGAGGCCTGGTAAATTGCGAACCGAAAGTCAACAAACTCAGGATCTCACACAACATACCGAAGATCTTACTCAGCAAACACAAGACTTTGGGCAGGAGGCTTCATGGAGACCTGGTAAATTGAGAACCGAAAGCCAACAAACTCAAGATCTCACACAACATACCGAAGATCTTACTCAGCAAACACAAGACTTTGGGCAGGAGTCTTCATGGAGGCCTGGTAAATTGCGAACCGAAAGTCAACAAACTCAAGATCTAACTCAACACACGGAGGATCTTACACAACAAACGCACGACCTTACCCAACAAACACAAGATTTTGGGCAAGAAGCTTTGTGGAAACCTAGTAGATTAGAAATTCAAAGCCAGCAGACTGAGGATCTAACCCAACAAACAGAAGATCTGACTCAACAAACGGAGGATCTAACTCAACACACAGAACATCAAACTCTACACACCCGGGATCTTACCCATCAAATGCAAGAGTTTGGGCAGGAAGCTTTGTGGCAACCTGGTGAATTAGAAATCGAAGGTCAGCAAACTGAGGATTTAACTCAACAAACAGAAGATCTAACTCAACAAACTGAAGATCTAACTCAACAAACAGAGGATCTTACTCAGCAAACACAAGACTTCGGACAAGAAGTTTCATGGAAACCTGGTAAATTAGAAATCGAAAGTCAGCACACTGAGGATTTAACTCAACAAACGGAGGATCTAACTCAACAAACAGAAGATCTGACTCAACAAACAGAGGATCTTACTCAACAAACAGAAGATGATCTGACTCAACAAACAGAAGATCTAACTCAACATACGCAGGATCTCACACAACAGACTGAGGATTTAACTCAACAAACAGAAGATTTAACTCAACAAACAGAGGATCTTACTCAGCAAACACAAGACTTTGGACAAGAAATTTCATGGCAACCTGGTAAATTAGAAATCGAAAGTCAGCACACTGAGGATTTAACTCAACAAACGGAGGATCTTACTCAACAAACAGAGGATCTAACTCAACAAACAGAAGATGATCTGACTCAACAAACAGAAGATCTAAGTCAACATACGCAGGATCTTACACAACAAACTGAGGATTTAACTCAACAAACAGAAGATTTAACTCAACAAACAGAAGATCTTACTCAACAAACAGAGGATCTTACTCAGCAATCTCAAGACTTTGGACAAGAAGTTTCATGGCAACCTGGTAAATTAGAAATCGAAGGTCAACAAACGGAGGATCTAACTCAACAAACAGAAGATCTGACTCAACAAACACAAGACTTTGGACAAGAAATTTCATGGAAGCCTGGTAAACTAGAAATCGAAAGTCAGCACACTGAGGATCTAACTCAACAAACAGAGGATCTTACTCAACAAACAGAAGATCTGACCCAACAAACAGAAGATCTTACTCAACAAACAGAGGATCTCACTCAGCAAACACAAGACTTTGGACAAGAAGTTTCATGGAAACCTGGTAAACTAGAAATCGAAAGTCACCACACTGAGGATCTAACTCAACAAACAGAGGATCTTACTCAACAAACAGAAGATCTGACCCAACAAACAGAAGATCTTACTCAACAAACAGAGGATCTCACTCAGCAAACACAAGACTTTGGACAAGAAATTTCATGGAAGCCTGGTAAACTAGAAATCGAAAGTCAGCACACTGAGGATCTAACTCAACAAACAGAGGATCTTACTCAACAAACAGAAGATCTGACCCAACAAACAGAAGATCTTACTCAACAAACAGAGGATCTCACTCAGCAAACACAAGACTTTGGACAAGAAGTTTCATGGAAACCTGGTAAACTAGAAATCGAAAGTCAGCACACTGAGGATCTAACTCAACAAACAGAGGATCTTACTCAACAAACAGAAGATCTGACCCAACAAACAGAAGATCTTACTCAACAAACAGAGGATCTCACTCAGCAAACACAAGACTTTGGACAAGAAGTTTCATGGAAACCTGGTAAACTAGAAATCGAAAGTCAGCACACTGAGGATTTAACTCAACAAACGGAGGATCTTACTCAACAAACAGAGGATCTAACTCAACAAACAGAAGATCTGACTCAACAAACAGAGGATCTTACTCAGCAAACACAAGACTTTGGACAAGAAGTTTCATGGAAACCTGGTAAACTAGAAATCGAAAGTCAGCACACTGAGGATTTAACTCAACAAACGGAGGATCTTACTCAACAAACAGAGGATCTAACTCAACAAACAGAAGATCTGACTCAACAAACAGAGGATCTTACTCAGCAAACACAAGACTTTGGACAAGAAGTTTCATGGAAACCTGGTAAACTAGAAATCGAAAGTCAGCACACTGAGGATTTAACTCAACAAACGGAGGATCTTACTCAACAAACAGAGGATCTAACTCAACAAACAGAAGATCTGACTCAACAAACAGAAGATCTTACTCAACAAACGGAGGATCTAACTCAACACACGGAGGATCTTACACAACAAACAGAAGATCTGACTCAACAAACAGAAGATCTAACTCAACACACGGAGGATCTTACACAACAAACTGAAGATCTAACTCAACAAACTGAAGATTTAACGCAACAAACAGAAGACCTTACTCAACAAACTGAAGATTTAACGCAACAAACAGAAGACCTTACCCAACAAACGGACAGTTTTGACCAACataatcaaaatattaatcaaggAATCGAGTTTGGTGACCATCAAACACGTGCTCACCCTGCTCAAATAATCACTGAAGCCCCAAAGCCTGCACCGAAACCAAAACGTGTGCGACCTGGCATTCCACATCCCACTGAACAGCTTGATGTGGTACTCGAAGGGCCGAGTGGACCAGATGAAATACCGCAAAATCTAGAAGAAACTGCAGTTCCTCCCAAAATAACCAGAGGTGACCAACCTGTTGATGACAATGTTAAGGAACTGgaagaaagtgaaagagaaatcaAACTTCGACCATCGCCAGCTGTTCAAGAAATTCAGCCATGGAATAGCCAACAAACTACAAGCCAAACTAACCGTAAACGAGTACCTGCAACTAAAGGAGGTAGACGAAAAGCTCATCCTTCCCACTGGTACCCCCCCAGTCAAGGTAACCAACCTGTTCAAGAACCAAAGATAGTTGCTCAACGCATAGTAGCGCAGGAGTCCTCAAGTCCAACTGACACAGATGTACCTTTTGAGATTGGAACTAGACTTCTGGAGGCATATGGAGCAAATGGACCCTATGACAGGGATCATCATCCAGATATATTTGATGCTGCCAAACCAAATCCTAGTGCAACTTTTAAACCTGAGGGTGATAGGGATCCTTGGAATATTCGCGAGAGGCCTGAAATAGTTCCAGGAGTGATAACAGTAAGAACTACTACTACAActacaacgacaacgacgacaacgcctccacctccacccccTGAACCAGAAACAACTGAAGCTCCTGGATTCTGGCGTAGAATCGGAAATAAGTTCACTGATACCATAGATAAAGCCAAGGAAAAGGCAAAAGAATTGTTTGGCTAA
Protein Sequence
MAPAVVGIFIISLLAESYSAPYKCLDCQTINRQSDYRTSSGSGRWVDQDNLSQVAQNLEDLTGSQNRIAFDDSRPGNWTDVNRYRTPDGHGQVFEEEGQRVDGPTRLKFYRRNYTSSYSSGGNNGDLGTPDLGSFDSWDRSRQSVNQGGFAQGQQSNYDRSATRENSYASHGSQQSLDGFDSQTGAFRGQQNRGSGGRRGENASRFNGQSVERYQDRQSTNQFADRESEFIGQSTQQGLSSLDQRRPGNWTEVDSYRTDGGRGRVYEEQGQFVTGPKRVRFYKKNYTSSYSSDGSTLPTNLGFTGQDDFERQFQLAQREFGSVGSGIHRSTQDSVTDNYAQRTSGHQANGLNTQQTEQYVRGFENQHVSRGTPGSVPDNYGHGLNTQQTEQYVRGFENQHVSGGSVRGNERGHSNGYNNRGTHGMITQESVSTRPIYTGSETRQSYYNRLTQSEQAEYRRVYGTEGLDQSQITRGNVNNAQQSYHTGSYDTGMRPGQLQYYDQFQSTSGSTYHPQIDSSQYVGSSQGSQRTHNANFDQTRNRNYGGSQRVQSGQLVTHGIDLGQIAQGPDCMDGGNGFTTYEESRYHKLNRRDEESDSLSQQTEDLTQQTEDLTQQTEDLTQQTQDFGQESSWRPGKLRTESQQTQDLTQHTEDLTQQTQDFGQEASWRPGKLRTESQQTQDLTQHTEDLTQQTQDFGQESSWRPGKLRTESQQTQDLTQHTEDLTQQTHDLTQQTQDFGQEALWKPSRLEIQSQQTEDLTQQTEDLTQQTEDLTQHTEHQTLHTRDLTHQMQEFGQEALWQPGELEIEGQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQTQDFGQEVSWKPGKLEIESQHTEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDDLTQQTEDLTQHTQDLTQQTEDLTQQTEDLTQQTEDLTQQTQDFGQEISWQPGKLEIESQHTEDLTQQTEDLTQQTEDLTQQTEDDLTQQTEDLSQHTQDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQSQDFGQEVSWQPGKLEIEGQQTEDLTQQTEDLTQQTQDFGQEISWKPGKLEIESQHTEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQTQDFGQEVSWKPGKLEIESHHTEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQTQDFGQEISWKPGKLEIESQHTEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQTQDFGQEVSWKPGKLEIESQHTEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQTQDFGQEVSWKPGKLEIESQHTEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQTQDFGQEVSWKPGKLEIESQHTEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQTQDFGQEVSWKPGKLEIESQHTEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQHTEDLTQQTEDLTQQTEDLTQHTEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQTDSFDQHNQNINQGIEFGDHQTRAHPAQIITEAPKPAPKPKRVRPGIPHPTEQLDVVLEGPSGPDEIPQNLEETAVPPKITRGDQPVDDNVKELEESEREIKLRPSPAVQEIQPWNSQQTTSQTNRKRVPATKGGRRKAHPSHWYPPSQGNQPVQEPKIVAQRIVAQESSSPTDTDVPFEIGTRLLEAYGANGPYDRDHHPDIFDAAKPNPSATFKPEGDRDPWNIRERPEIVPGVITVRTTTTTTTTTTTTPPPPPPEPETTEAPGFWRRIGNKFTDTIDKAKEKAKELFG

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-