Basic Information

Insect
Euura lappo
Gene Symbol
-
Assembly
GCA_018257835.1
Location
JAEUYN010001381.1:67591-73071[-]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 13 0.00062 0.84 10.5 6.7 22 64 653 695 648 702 0.71
2 13 0.0046 6.3 7.7 8.8 26 64 723 761 721 764 0.79
3 13 0.0013 1.8 9.4 5.8 30 64 786 820 781 824 0.55
4 13 0.01 14 6.5 7.8 26 64 841 879 839 880 0.90
5 13 0.0048 6.6 7.6 8.2 28 64 902 938 898 941 0.68
6 13 0.0087 12 6.8 8.1 26 64 959 997 957 998 0.90
7 13 0.0024 3.3 8.5 6.7 30 64 1022 1056 1017 1060 0.57
8 13 0.0033 4.5 8.1 8.3 26 64 1077 1115 1075 1118 0.81
9 13 0.0027 3.6 8.4 3.2 26 58 1136 1168 1134 1173 0.76
10 13 2.8 3.8e+03 -1.3 0.7 26 41 1188 1203 1186 1206 0.61
11 13 0.0012 1.6 9.6 8.5 26 64 1213 1251 1211 1252 0.91
12 13 0.002 2.6 8.9 8.1 26 62 1272 1308 1270 1311 0.88
13 13 0.0011 1.5 9.7 8.8 25 58 1344 1377 1330 1384 0.56

Sequence Information

Coding Sequence
ATGGTTCCCTCGACGGTGGCGCTTCTCGTCATCTCCTTATTCGCTGGAGCGTATTCGGCACCCCAGCTATGCATATCATGTACGACATCGGGATTGCTTCGTCAAGATCAGTCGTTGCAGAGATCAGAAAACTTAGAGGATTTGACGCAACAAGCGGCAACCGAATTGCACAGAATACCTACTCAGCTAGCCTTCGACAATAGCAAGCCTGGAAACTGGGAAGAACAAAACCAGTACAGAACTCCCGATGGTCATGGTCAAGTGTTCGAAGAACAGGGCCAGAGGGTAGACGGAGGAAGACGAATCAGGTATTACAGGAAAAATTTCACCTCTAGTTACTCCAGCGGTAACGCCGGCGGTCTCGCAGGAACTGATGTGGCAGGATTCGATTCTTCCGGTCGCCAGTCAACAACCCACAATTACTTAGGCCAGAGACAAAACGGAGCTTTTGATCAATCTACTATCCGTGGAAATTCCTACCTCACCTCTGGCCAACACAGCTTGGGAGGATATGATTCTTATGGCCGCCCATTGACGGGCTACCAGAGCACTCACGATCAAACCTCTACTGGCGGCAATACATATGTCTCTCAGGATTCTAGTCGGACTGTGGGAAGACTCACCAGCGAGAACAGTCACGGTTCCTATGGCCGCCTGTCATCGGGTCATTCAGCTGTGGATCAAGGGCAAAGTGCGGGTTACGACCAAACTATTACTCGTGGTAATTCTCACGTTGTCGAGGACTCTAGTCAGACCACGGGAGGTCTTACAGGCCAGAATACTTACAGACGCTACGATTCTTATGGCCGACCGTTGTCAGGCTGGCAAACCGTGGGTCATTCTGCTGTGAATCAGGGGCAGGGTGCAGATTACGAACAAACCATTACTCGTGGTAGTTCTCATGTTGTCGAAGGCTCTAGTCAGATCACGCAAGGTCTAGGAAGCTACGATTCTTATGGGCGACCGTTGTCAGGTGGGCAAACTGTGGGTGATGTACACGATCGAACGGTTACTCAAGATAATTCACACATCTATCAAAGTTCTGGGGTCAATGGACGAGAAACGCAGACAGGATCTGCAGGGTGGGATCGAACGAGACCTGGAAACTGGAGCGAGTTTGATACCTACAGGACAGACGGGGGTCGTGGAATAGTTCACGAACAGCAAGGACAGTATGTGACTGGACCGAAGCAAGTTCGTTTCTACGCAAAGAATTACACTTCGAGCTACACTTCTGGGGGAGGTGTCCCAAGTACCGGCGCAGGATTGGACTCAACGAGCATCCTTGAAAGGGAAATGCAGAACCTCAACAGAGAGATTCAGCAAGGTGATCAAGTGATTGGTGCTGGATCTACTTTCCCTCAAAGCCCTATTGGACGTAATGTACGTCCGATTGTCATCAGACCCGTTAGTACCTACCCGACGCAACATCCAATTGCTTCAGGCTCGAATTCTTACCAAACATACAGTCATCATACACAGAGAACTGAGGATGCATCGAGTCTGCCTGGAGTAGTACCACTTTCCTCTGATTCACAAGCTCTGCAGAATTACTATGACCGCCTTACACAGACACAGCAGTCGCAGTATGTTCAAACTTATGGAACTCGTGGTTTGAGCCAAAATCAAATATCTCTGAGCAACGTTCTTCAACAACCAGGACAAGTACGGCATTACGAAGAATTCCAGACTTCATCCACGTCCTCGTCGCAGCATCCGGTAGTCGATAGGAGCGTACTTCAAGTCAGTAGTAATAATGGAAGACAGCAAACGTATAACACGCAAAATAGCTTTGGCCGAACTTCCGGTTACGATCAGGGCTCCGTTTATACGGTTCCATCAGGACAACTAGTTACGCACGGGATTGATTTGGGACAGGTTGCACAAGCTCCCGACTGTGACGAAGGTACAAGTGGATATTCCCAATATCAACAGACCCAACGTCTCAGAACATACCGACGTGCCTCTCAATCTGATGATCTCGCACAACAAACTGAAGATCTTACTCAACAAACGCAGGATCTTACTCAGCAGACGGAAGATCTGACCCAACAGACAGAAGACCTCACTCAACAAACGGAAGACCTTACCCAACAATCGCAACACGTGGACCAATTTGGACAGCACTCTTCGTGGAAACCGGGTAAATTGGAATTTGGAAGTCAGCAGGGCCAAGATCTTACCCAACAAACAGAGGATCTGACTCAACAAACACAGGATCTTACTCAACAAACAGAAGATCTGACGCAACAGACTGAGGACCTCACTCAACAAACGCAACAGGTGAACCAATTTGGACAGCACTCTTCGTGGAAACCGGGTAAATTGGAATTTGGAAGTCAGCATGGCCAAGATCTTACTCAACAAACACAGGATTTAACTCAACAAACACAGGATCTTACTCAACAAACAGAAGATCTGACGCAACAGACTGAGGACCTCACTCAACAAACGCAACAGGTGGACCAATTTGGGCAGCACTCTTCGTGGAAACCGGGTAAATTGGAATTTGGAAGTCAGCAAAGCCAAGATCTTACTCAAAAAACAGAGGATTTAACTCAACAAACACAGGATCTTACTCAACAAACAGAAGATCTGATGCAACAGACTGAGGGCCTCACTCAACAAACGCAACAGGTGGACCAATTTGGACAGCACTCTTCGTGGAAACCGGGTAAATTGGAATTTGGAAGTCAGCAGGGCCAAGATCTTACCCAACAAACACAGGATTTGACTCAACAAACACAGGATCTTACTCAACAAACAGAAGATCTGACGCAACAGACTGAGGACCTCACTCAACAAACGCAACAGGTGGACCAATTTGGACAGCACTCTTCGTGGAAACCGGGTAAATTGGAATTTGGAAGTCAGCAAAGCCAAGATCTTACTCAACAAACAGAGGATTTAACCCAACAAACACAGGATCTTACTCAACAAACAGAAGATCTGATGCAACAGACTGAGGGCCTCACTCAACAAACGCAACAGGTGAACCAATTTGGACAGCACTCTTCGTGGAAACCGGGTAAATTGGAATTTGGAAGTCAGCAGGGCCAAGATCTTACCCAAAAAACACAGGATTTGACTCAACAAACACAGGATCTTACTCAACAAACAGAAGATCTGACGCAACAGACTGAGGACCTCACTCAACAAACGCAACAGGTGGACCAATTTGGACAGCACTCTTCGTGGAAACCGGGTAAATTGGAATTTGGAAGTCAGCAGGGCCAAGATCTTACCCAACAAACAGAGGATCTTACCCAACAAACACAGGATCTTACTCAACAAACAGAAGATCTGACGCAACAGACTGAGGACCTCACTCAACAAACGCAACAGGTGGACCAATTTGGACAGCACTCTTCGTGGAATCCGGGTAAATTGGAATTCGGAAGTCAGCAAAGCCAAGATCTTACTCAACAAATACAGGATTTAACTCAACAAACAGAGGACCTTACTCAGCAAACAGAGGATCTGACTCAGCAAACACAACATGTGGACCAATTTGGGCAGCACTCTTCGTGGAAACCGGGAAAATTGGAATTCGGAAGTCAGCAAAGCCAAGATCTTACTCAACAAACAGAGGACCTTACTCAGCAAACAGAGGGAAAATTGGAATTTGGAAGTCAGCAAAGCCAGGATCTTACTCAACAAACACAGGATTTAACTCAACAAACAGAGGACCTTACTCAGCAAATACAGGATCTTACTCAACAAACAGAGGATCTCACTCAGCAAACACAACAAGTAGACCAATTTGGGCAGCACTCTTCGTGGAAACCGGGGAAATTGGAATTTGGAAATCAGCAAAGCCAAGATCTTACTCAACAAGTGGAGGATCTGACTCAACAAACACAGGATTTGACTCAACAAACAGAAGATCTGACGCAACAGACTGAGGACCTCGCTCAACAAACACAACACATGGACCAATTTGGACAGCACTCTTCTTGGAGACCGGGTAAATTGGAATTTGGAAGTCAGCAAGACCAAGATCTAGGTCAACAAACTGAAGATTTAACGCAACAAACAGAAGATCTTACGCAACAAACCGAAGATCTGACTCAACAAACGGAAGACCTTACTCAACAAACACAGGATCTTACTCAACAAACAGAAGATCTGACGCAACAGACAGGACAACATTGGCAAATTGGTGAGCCAAATTATGTTCCATTTGTGGAAAGGTCACCCGAAATTGTACAGGTCTTACAGTATCCTGGTGTTCAACAAAATGAGGACCTAACTCAACAGACTCAAGACTCAGGTGACTTTGGACAGCAAACTTCGTGGAATTATGGCCAACTGGAATCTGGGAATCACCAAGTGGAAGACCTCACTCAACAAACTCAAAACTCAGGTGACTTTGGACAGCAAACTTCGTGGAATTATGGCCAATTTGAATCTGGGAATCACCGAGTGGAAGACCTCACTCAACAAACTCAAGACTCAGGTGACTTTGGACAGCAAACTTCGTGGAATTATGGCCAATCGCAGACTGGACACCATCAAGTAGAAGACCTCACTCAACAAACCCAAACCTCAGGAAACTTTGGACAGCAATCTTCCTGGGACTCGCATAATCAAGGAAGTGGATCCAGCCACAAACCCATCAGCCCTGGGCATGGTGCTGTACAGGAACGAAGGCCTGCACCGAAGCCTCAAGGTCATCCAAGACCGACGAGCTCATACTACATCTATCCAGTTACTATTCAGGTAACAGAGCAACCCAAGATCGACCCCATTCGGTTAAACTACGCCCAAGAGAACAATCAAAAATGGGAGACTGCCGAGGATCTGCATCACAGTGAAAAAGAAGTCACTGTCCCTGCTGCACCAAAGAGAGGCGATCAAGAGGTAGACCCTGATTCCACCGAGGACGAAGAGTACCTAGCCGGAGTTTCAACCGAACCAACCACACCTAAATCACAGGATGAAACAAAACGATTCACAAAGTCAGGACGCCGAGGCGGTTATCGCGGACCACAATACCGTCCCCAAGTCATCGAGCCATTGGCGCATTACGTCCAAAAGAGTGAACAGCAGACGGTACAAAAATCAGAACAGCTAGAAGCACGGCCACTATCTGTCATCAGTCCTCGTATTCTAGAAGCGTACGGAGGGAACGGACCTTACGACGCAGACCATAGGCCCGATTTCGAATCGGTTAAGCCAAATCCCAACCTAATTATAACGCCACCTGAAACAGATGCTTGGGACGTTTATGCACGGGGCTTATCAGCTCAGAAACCACCGACAACAACGACACCAGAACCAACAACGACAGAAGAAATTACCACAACTCCAGTACCGGCTTCTACAACCCCAGCGCCTGGATTCTGGGGCAGACTTGGACACAAGATTACCAATACTTACGAGAAGGCGAAGGAAAAGGCAAAAGTACTCATCGGCTGA
Protein Sequence
MVPSTVALLVISLFAGAYSAPQLCISCTTSGLLRQDQSLQRSENLEDLTQQAATELHRIPTQLAFDNSKPGNWEEQNQYRTPDGHGQVFEEQGQRVDGGRRIRYYRKNFTSSYSSGNAGGLAGTDVAGFDSSGRQSTTHNYLGQRQNGAFDQSTIRGNSYLTSGQHSLGGYDSYGRPLTGYQSTHDQTSTGGNTYVSQDSSRTVGRLTSENSHGSYGRLSSGHSAVDQGQSAGYDQTITRGNSHVVEDSSQTTGGLTGQNTYRRYDSYGRPLSGWQTVGHSAVNQGQGADYEQTITRGSSHVVEGSSQITQGLGSYDSYGRPLSGGQTVGDVHDRTVTQDNSHIYQSSGVNGRETQTGSAGWDRTRPGNWSEFDTYRTDGGRGIVHEQQGQYVTGPKQVRFYAKNYTSSYTSGGGVPSTGAGLDSTSILEREMQNLNREIQQGDQVIGAGSTFPQSPIGRNVRPIVIRPVSTYPTQHPIASGSNSYQTYSHHTQRTEDASSLPGVVPLSSDSQALQNYYDRLTQTQQSQYVQTYGTRGLSQNQISLSNVLQQPGQVRHYEEFQTSSTSSSQHPVVDRSVLQVSSNNGRQQTYNTQNSFGRTSGYDQGSVYTVPSGQLVTHGIDLGQVAQAPDCDEGTSGYSQYQQTQRLRTYRRASQSDDLAQQTEDLTQQTQDLTQQTEDLTQQTEDLTQQTEDLTQQSQHVDQFGQHSSWKPGKLEFGSQQGQDLTQQTEDLTQQTQDLTQQTEDLTQQTEDLTQQTQQVNQFGQHSSWKPGKLEFGSQHGQDLTQQTQDLTQQTQDLTQQTEDLTQQTEDLTQQTQQVDQFGQHSSWKPGKLEFGSQQSQDLTQKTEDLTQQTQDLTQQTEDLMQQTEGLTQQTQQVDQFGQHSSWKPGKLEFGSQQGQDLTQQTQDLTQQTQDLTQQTEDLTQQTEDLTQQTQQVDQFGQHSSWKPGKLEFGSQQSQDLTQQTEDLTQQTQDLTQQTEDLMQQTEGLTQQTQQVNQFGQHSSWKPGKLEFGSQQGQDLTQKTQDLTQQTQDLTQQTEDLTQQTEDLTQQTQQVDQFGQHSSWKPGKLEFGSQQGQDLTQQTEDLTQQTQDLTQQTEDLTQQTEDLTQQTQQVDQFGQHSSWNPGKLEFGSQQSQDLTQQIQDLTQQTEDLTQQTEDLTQQTQHVDQFGQHSSWKPGKLEFGSQQSQDLTQQTEDLTQQTEGKLEFGSQQSQDLTQQTQDLTQQTEDLTQQIQDLTQQTEDLTQQTQQVDQFGQHSSWKPGKLEFGNQQSQDLTQQVEDLTQQTQDLTQQTEDLTQQTEDLAQQTQHMDQFGQHSSWRPGKLEFGSQQDQDLGQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQTQDLTQQTEDLTQQTGQHWQIGEPNYVPFVERSPEIVQVLQYPGVQQNEDLTQQTQDSGDFGQQTSWNYGQLESGNHQVEDLTQQTQNSGDFGQQTSWNYGQFESGNHRVEDLTQQTQDSGDFGQQTSWNYGQSQTGHHQVEDLTQQTQTSGNFGQQSSWDSHNQGSGSSHKPISPGHGAVQERRPAPKPQGHPRPTSSYYIYPVTIQVTEQPKIDPIRLNYAQENNQKWETAEDLHHSEKEVTVPAAPKRGDQEVDPDSTEDEEYLAGVSTEPTTPKSQDETKRFTKSGRRGGYRGPQYRPQVIEPLAHYVQKSEQQTVQKSEQLEARPLSVISPRILEAYGGNGPYDADHRPDFESVKPNPNLIITPPETDAWDVYARGLSAQKPPTTTTPEPTTTEEITTTPVPASTTPAPGFWGRLGHKITNTYEKAKEKAKVLIG

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-