Basic Information

Insect
Apis dorsata
Gene Symbol
-
Assembly
GCA_000469605.1
Location
NW:15346-25383[+]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA-binding regions adjacent to α-helical regions that fold together as coiled coils.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 44 0.29 1.3e+02 1.8 1.8 33 55 25 47 22 50 0.69
2 44 4 1.8e+03 -1.9 0.4 40 57 77 94 71 100 0.66
3 44 4.2e-06 0.0019 17.3 0.7 30 63 126 159 120 161 0.89
4 44 0.46 2.1e+02 1.1 3.8 28 64 182 218 180 219 0.90
5 44 0.51 2.3e+02 1.0 2.2 29 64 262 297 260 298 0.89
6 44 5.5 2.5e+03 -2.3 7.6 27 59 320 348 312 355 0.58
7 44 1.2 5.3e+02 -0.2 11.6 32 59 380 407 358 413 0.48
8 44 0.62 2.8e+02 0.7 4.5 33 58 412 437 405 440 0.74
9 44 0.011 4.7 6.4 1.5 29 60 436 467 433 469 0.89
10 44 0.0011 0.51 9.5 5.4 25 63 467 505 465 507 0.92
11 44 0.023 10 5.3 7.3 21 61 519 559 512 563 0.76
12 44 3.9e-05 0.017 14.2 6.1 28 63 579 614 574 616 0.91
13 44 0.0069 3.1 7.0 2.8 24 63 610 649 609 650 0.56
14 44 7.1e-06 0.0032 16.5 4.5 24 65 638 679 636 679 0.95
15 44 0.16 71 2.6 1.2 33 51 675 693 674 698 0.81
16 44 0.051 23 4.2 6.3 24 63 705 744 702 746 0.93
17 44 0.004 1.8 7.7 2.1 28 62 737 771 730 774 0.58
18 44 9.6e-06 0.0043 16.1 3.6 22 63 773 814 772 816 0.94
19 44 0.0065 2.9 7.1 7.0 23 60 851 888 817 893 0.92
20 44 0.064 29 3.9 2.7 28 60 891 923 887 926 0.85
21 44 0.0038 1.7 7.8 11.4 23 63 907 947 905 949 0.94
22 44 0.068 30 3.8 9.6 31 64 943 976 940 991 0.80
23 44 0.0026 1.2 8.3 4.1 26 63 994 1031 991 1032 0.94
24 44 3e-05 0.013 14.6 3.3 25 62 1049 1086 1045 1089 0.92
25 44 0.0039 1.8 7.8 1.9 30 63 1082 1115 1078 1117 0.89
26 44 5.5 2.4e+03 -2.3 0.1 33 58 1127 1152 1117 1155 0.76
27 44 0.13 56 2.9 3.3 28 63 1157 1192 1151 1194 0.87
28 44 0.00051 0.23 10.6 1.2 18 62 1196 1240 1192 1243 0.88
29 44 0.23 1e+02 2.1 0.1 32 60 1245 1273 1242 1278 0.76
30 44 0.00027 0.12 11.5 0.8 27 59 1278 1310 1274 1312 0.90
31 44 0.0016 0.71 9.0 3.6 24 63 1331 1370 1328 1372 0.89
32 44 0.002 0.88 8.7 8.3 21 63 1349 1391 1345 1393 0.93
33 44 0.022 9.6 5.4 7.7 25 64 1388 1427 1386 1428 0.94
34 44 0.23 1e+02 2.1 7.5 24 64 1415 1455 1412 1462 0.87
35 44 0.021 9.1 5.5 11.3 25 64 1444 1483 1436 1491 0.66
36 44 0.004 1.8 7.7 6.5 22 62 1469 1509 1468 1519 0.85
37 44 0.0087 3.9 6.7 4.9 25 60 1514 1549 1511 1556 0.78
38 44 2 8.9e+02 -0.9 3.2 37 50 1550 1563 1543 1588 0.44
39 44 0.7 3.1e+02 0.5 3.4 26 59 1588 1621 1572 1625 0.77
40 44 0.0011 0.47 9.6 2.3 24 58 1621 1655 1617 1657 0.91
41 44 0.0054 2.4 7.3 5.2 33 62 1651 1680 1650 1683 0.91
42 44 5.4e-05 0.024 13.7 7.6 16 65 1711 1760 1710 1760 0.94
43 44 0.0015 0.68 9.1 4.7 35 63 1744 1772 1740 1773 0.89
44 44 5.9 2.6e+03 -2.4 0.9 36 50 1957 1971 1930 1976 0.53
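
The per-domain rows above are whitespace-delimited with 13 fields each, so they can be loaded with a few lines of Python. The sketch below is illustrative only: the `DomainHit` class, the `parse_hits` and `significant` helpers, and the 0.01 i-Evalue cutoff are assumptions introduced here, not part of this record or of HMMER. The sample rows are copied from the table (rows 1, 3, 12 and 14).

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    """One row of the per-domain hmmscan table shown above."""
    index: int       # domain number ("#")
    total: int       # number of domains reported ("of")
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hits(text):
    """Parse whitespace-delimited rows; skip headers and malformed lines."""
    hits = []
    for line in text.strip().splitlines():
        fields = line.split()
        if len(fields) != 13 or not fields[0].isdigit():
            continue
        hits.append(DomainHit(
            int(fields[0]), int(fields[1]),
            float(fields[2]), float(fields[3]),
            float(fields[4]), float(fields[5]),
            *(int(x) for x in fields[6:12]),
            float(fields[12]),
        ))
    return hits


def significant(hits, max_i_evalue=0.01):
    """Keep hits at or below an i-Evalue cutoff (0.01 is an assumed threshold)."""
    return [h for h in hits if h.i_evalue <= max_i_evalue]


SAMPLE_ROWS = """
1 44 0.29 1.3e+02 1.8 1.8 33 55 25 47 22 50 0.69
3 44 4.2e-06 0.0019 17.3 0.7 30 63 126 159 120 161 0.89
12 44 3.9e-05 0.017 14.2 6.1 28 63 579 614 574 616 0.91
14 44 7.1e-06 0.0032 16.5 4.5 24 65 638 679 636 679 0.95
"""

hits = parse_hits(SAMPLE_ROWS)
for hit in significant(hits):
    print(hit.index, hit.ali_from, hit.ali_to, hit.i_evalue)
```

Filtering on the independent E-value rather than the conditional one is a design choice for this sketch; a different cutoff or a score-based filter may suit other analyses better.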

Sequence Information

Coding Sequence
ATGTGTTTTTGTGTAGGTGTAAATGCGGATGAGATATTGAAGCCCGATCCACTGCTGGTGCACTCCCGGGAAGAAATCGAAAGGCTGCAAAATGCAAACGCGGCGCTCGAAGATAAAGTGAGGGATCTGGAGGAAACTCTCGCCGAACGAGACTGTTGCGACGATCCTGGCGCCACGATTTTCTTCCTCAGGGAGAAGATGAGATATTTTAGGGAGCGTTTCGCGCTTGAAAAGAAAGAATTGCGAGACACGATATCGGAATTGAAGTTGAAATTGGCGCGGATCGAGGAGGACGTCAGCTGCCCGGCGATACATCGTCTAAGGGCGAAGCTTCGTGACCTCATGAAAGGGGAGCCAGTCGAGACGTTGGTGTCCATCTCGAAAAGCTGCGAAGATCTGCGCTCCGAGAACGAGAGCCTTAAGGCCGAAGTGGCCGATTTGAAACGCGCGTTGGCCCAGtgcgaaaaaaagataaaagagataGAGGAGCCGAGGGAAGTGAAAGTGGTCGAGACGGTCACTGTGCCTGAATACATCGATATCGAAGATTTGATgaacaaattgaataattgcgAGGAAATGGTGGCCAATCTGACCAAAGAGTTGGAGGAGAAGAACAATCAAATCGACGCGTTGAACAAAGAATTGGAGCAGATGTTCATTTTGAAAAGTCTGGAGGAGCAAATAGCGATAATGAAACAGGATCTCAAGAAGAAAGACGAGAAGATTTCGGATCTTTTGAACACGTTAAGACAGTCGGAGATAAACTTGCTCGGTTTGGACGGTTTGAAAGCTAAGCTGGAACGTCTGGAGCCGGAATTGTACGAACTGAGGGAAACGAAAGAGGAAATAATGGACGAGTTAAATATGATGCGCGACACATTGAAGGAGAGGAACGATCAAATAATCCAGATATTGGAAGATAAGGCTAAGCTGGAGGAATATTACAAGAACAAGGTTAACGAGTTAGAGGCGAGGTTCGAAGAGCAGGAGAGTTTAAGCGAGGAGCTCGAGAATCTGAGGAATGAGTTGGAAAAGCAAAGGATAAAGAACAAAGAGCTCGAGGCGTGTTGCGAGGACATGGATGTCCTCGAGAACAAGCTGGCCGAGTTGGAGAAGGAGCTGGAGGATAACAGAGGCGAGCTCGAGAGATTGCAAAAGGAGAATTTGAATCTGAAGGATGAGATAGAGGTGGAAAGGATGGAGAACGATAAACTTCGCGACCGGTTGGAAGAGTCGAAAAAGTTGATGGAGGacaacgaaaatttaaaggcTCAGCTGGAGCAGCTGCGAGGCGAGAACAATGATCTGATGGGGCAGAAGAAGGCGCTCGAGGATTTAAATAAACAGTTAAACGAGGATAACGAGAGCATGAAGCGAACGATGGGCAATTTGGAGGCGCGAATCGACAATCTCTCGAATGAATTGAGCAACGTGGAGCGAGAACGGGACGCGTTGCTGGACGAGAATCAGAGTGTGAAGAGGGAGCTGGAACGAACGTTGACGGAGAACGAAAATCTGAAGACGGAGTTGGACAAGGCCGATGAACAACTTGACAAACTGAAAGCGGAGAAGAACGAGTTGCAGAGGAATTTCGACACGATGAAACTCGAGAACGAGACGTTGAAGGAGGACGTAAAGGCTTTGAAGGACGACCTTGAAGAGTCGAAGAGAGAGATGAAAGCGGTCGGTGATGCGTTAAAGGATAAGGAAGAGTTAAAGGACGTCGAGTTTCGAGAACTGCAACAAAATATGCAGAACTTGAAGACTGAAAATGGCGAGTTGAAGAAAGAGAACAAcgatttgaagaaagaaaacgacgATTTGAAAACGAGAGCTTCGGAATTGGAACACAAGTTGGACGACGTTAAGAAAGAATTGGATGAAGTGGAGTCGCAGAATGCCGATTTACGTGCCAAAATCGATAATTTGGAAAAGGAATTGGAAAAAGACAAGAAAGAGATCGAGCAATTAAAATCAGAGATATCTTCGTTGAAGAA
TGCTCTGGATAAATGCGTGGACGAGATGGAAAAACTGAAGGTAGAGAACGAGAAACTTAagacggagagagagaagatcgAAACGACCTGGTCGGACGAAAATACGAGTTTGAAAGCTAAAAACACGGAATTGGAACAAAATTTGACCACCGCTGTAAAGGAATTGGACAAAATAAGATCGGAAAACGCCGATTTGCTTAGCGAGTTGAATCGTTTGAAGCAAGAATTGGAAAGTGGTAAGAAGGAGATCGATCAATTGAAATCCGAGATAGGTTCTATGAAAGACGCTCTCGAGAAGTGTGTTGACGAGATAGAAAAGCTAAAAACTGAGAACAAGGACCTTAAATCGGAAGTTGAAGGCTTGGAATCTGAGAGGGATCGCTTGACGAACGAAGTTGCCGATCTGAAAccaaaaatttcagaattgcAACAAAAATTAACCGACGCTAGCAAAAAGTTGGATGAAGCGAAAGTAGAGGATAGCGATTTGCGTGCCGAAGTGGATCGTTTGAAAAAGGAATTGGAAAATGCTGGGAAGGAGATCGATCAATTGAAAACCGAGATGAACTCCTTGAAAAGCGGTCTCGACAAATGTGTGGACGAGatggaaaagttgaaaaatgagAACAGCGAGCTGAAATCTCAAGTTCACGGACTTAGGGGCGAGGGGGACAGTTTAGCCAGTGAATTGACTAACATGAAGGGCGAAAATTCTGCTTTGAAAGATGAGAAAGATCAATTGAACAAGCAATTGGCGGAGAATAAGACGGAAaatgaaagattgaaaaagcAGAATGATGAATTAGAAACTGAGAatacgaagataaaaaaagagttgGAATCGTGCAAGGgtgaaaataacaatttgaaggacgaaaataacaaattgaagGATGAACTTGAAAAATTACGAGGACAATTGAAATCGTTGAACgatgaaacgaataaattaaaacgcgAGCTCGAGGAAGCTGAACATAAAATTCAGATTTTGGAACCGCAGTTGTCGCGTGTGCAAAGCGAGAACGAGAAATCGCAAAGTGAATTGGCGGTTTTGAGAAACGAGGCGAACGAATTAAAAGCTAAATTAGATAGAGAAACGCTCGACAATGTAAATATGCGAAACGCTATGAAAATATTGGAGGATCAAATGCACgatttgaataagaaattgGACAATTGTAGGGCGGAGAACGATGCTTTGAAACAGGAAAATAAAGATCTTAAAACGAAGTTATCGGACACGGAACAAGttgtattgaatttgaaaacggaatgtgataatttgaaagaagatATTACCGATTTGCAGAGAAAGATCGAACAACTCAAGCAGAAAATCACTGATCAGGAAGCGGAGATAGACCATTGGAAAGTGGAGCATTGCAAATTCGAAttggataatgaaaaattaaaagctgatctcgagaaaatattgaaagatttgAACGAATGTcaggtTGCGAAAAAAGTCATTGAATCAGATCTAATCAAACTGAAGAACGAGAAAGACgatctgaataaaaaattgaccGATTTAACGTCGCAATTGGACCAACAGAAGAAAACATTGGAAGCGGAAAAATCGGCCAAAGATAAGGGTGATGTACAAATCGCTTCCTTGAATTCCGAACTCGAAGCGTTGAAGAAAGAATTAGAGAAATTGAGAGCGGATAACAGCAAGTACAGAAGTGAGATAGACGATCTAGGAAAACAACTCGTATCGGCGAAGAACGAGCTGAAGGATTGCAGAGATGAAGTCGTTGTGTTGAAAAACGCCAATAACGCGTTAAGGTCTGAATTGGATCCTCTGAGGAGTTTGAAGGATGATTACAATAGGCTGACAACCGAATTGGACGATCTGAAATCTGAGAACACGAAACTTCTCCAAGACAAGAGAAGTTTGGAGGACGAGTTCGGGAAATTGAGAGGAGAGGGTGATGGGCAGAGGGTGGAGATCGACAGATTGAAGACGACTTTGGACGCGGAGAAAACAGCCG
CGGAAAAATTAAGATCGGATCTTCAAAGTTGCAAGactgaaaatgataaattgcaAGCGCAAATCAACGAGATGAAGAGGAATCTGGATAAAATGGAAACGGAGAATGATCGTCTGAAGAGAGAGCTCGACGAGTCGAAGAAGAAACTCGAGGATATGGAGGCGAAGATGAAGTCTCTGGAAAATCAGCTATCGAATTTGTCGGTGGAGAAAGAGGAGTTGGTCAAGGAACTCTATCGTACCCGTGAAGATTTGAACAATCTCAGAAACGAATTGGAGAAACAAACGGCTGTCAAAGATACAATGGCAAAGGAGTCGACCAATCTGAAGGAGGAGCTGGAAGCGTTGAAAGCGGAATTGAATAAGACTCGTGATGAAAACGACAAGCTGAAGAACGAGAATGACAAGCTGAATGCGGAAATAGCGAGATTGAATAAACAATTGGACGCACTGAAAGATGAAAGTGCGAATCTGAAGAATGAGATCGAGAATCTGAACGAAAGGAACGCGGAATTGTCTAAGGAATTAGCCGGGGCGaaagataatttgaaagatatgGAGACTCAATTGAACAATCTGAAAAGAGAGAATgacgatttgaaaaataaaatcaatagtcTCGAGGATAAGATGCAAGAGGTTGatgatttgaagaaaaagttgGAAGAGACTAAAAAGGAATTGGATAAGCCGAGCCTGGAATTGGATACATTAAAATCaaccaataaaaaattagaagacgATTTAAATAATGCGAGAAACGAATcgttaaatttgaagaatgaTTTGGATAAGTtgcaaaatgattataataatttgcaaacTGAACTTACTGATTTGAAAATGGAGAGAGATACATTTAGAGAACGTGCTGCCGCGTTGGAAAAAGATCTGGTCAGGGTGAAGAGGGAAAATGACGAATTGAAAGATCAAAACGCGAAATTTAAAACAGAATTAGACGATTGCCAGGAGGAGAACAATAGGTTGCTTAAAGAATTGGAGAAGTTGAAATCGGAGAATGTGAAACTGCAAGACAACTTGATCAATGCGAAAAGCGAAGGGGATAGATTAAAAGAAGATttgaacaaattgaaaaaagattataccGACTTGCGGGGCGATTTGACCAAGGCGAGAGAAGACAGGGAcataagaaaggaaaaggatgTGGAGTTGGACAAAGAGATTGACGAGTTGAAGACCACAAATGCAAAATTGAAAAGCGATTTATACGATTGCCAGAAAGAGAACGAAAAGTTGCGCAAGCAACTGACTAAATTAAAAGCGGAAAATGACAAACTGAAAAAAGCTTTAGATAAAGATGTGGATCAAAAAGAAACGGAAATTGACAAAGATATTCTGGATGATTGCAGCGAGTATGTTAAAGCAAACAAAATACtgcagaaaaattttgaaaaccagTTAAAAGgtGTACAACGTCTTCGAGACTACATCAATTATTTAGACGGTAAGGGTACAGAACCCAAAATGGCGGATAAATTGGAAGAATCAGAAATAAATCCAAATACAACAAATGACATTGAGATTGAGGATTTGTTGAAGATGTCTCAAGATTTaagtatcaatattaataaggCAGAAcaagaaatacaaaatcttgctaaattattaaaacctgTGgaACTTGGTGCCTTCGATCCTGATGGTTGGTTAAATTCATTGACACTTACGCAATTGGCAGAACTTCATGACAAGATATGTGAATTGACATCAGACATGGTGCAACAAGATTCCAAAGCTGTTCCATGTGAAGCACCCCAACGATCAGAGAATTCGAcggatattttgaataaacgtATAGCTGCTTTGCAGAAACAGATTGCGGAGAAGCAAATGGAGGCAGGATGGAAGCTTCAGGAGCTGAAGAGAGCTCTTAGAAATGAGCAGgctaatttaattcgattgtcCAATGAGATGAATTTGGAGAGACAACgcaattttaatcttcaatcTAGGTTGGATTCATCTTGA
Protein Sequence
MCFCVGVNADEILKPDPLLVHSREEIERLQNANAALEDKVRDLEETLAERDCCDDPGATIFFLREKMRYFRERFALEKKELRDTISELKLKLARIEEDVSCPAIHRLRAKLRDLMKGEPVETLVSISKSCEDLRSENESLKAEVADLKRALAQCEKKIKEIEEPREVKVVETVTVPEYIDIEDLMNKLNNCEEMVANLTKELEEKNNQIDALNKELEQMFILKSLEEQIAIMKQDLKKKDEKISDLLNTLRQSEINLLGLDGLKAKLERLEPELYELRETKEEIMDELNMMRDTLKERNDQIIQILEDKAKLEEYYKNKVNELEARFEEQESLSEELENLRNELEKQRIKNKELEACCEDMDVLENKLAELEKELEDNRGELERLQKENLNLKDEIEVERMENDKLRDRLEESKKLMEDNENLKAQLEQLRGENNDLMGQKKALEDLNKQLNEDNESMKRTMGNLEARIDNLSNELSNVERERDALLDENQSVKRELERTLTENENLKTELDKADEQLDKLKAEKNELQRNFDTMKLENETLKEDVKALKDDLEESKREMKAVGDALKDKEELKDVEFRELQQNMQNLKTENGELKKENNDLKKENDDLKTRASELEHKLDDVKKELDEVESQNADLRAKIDNLEKELEKDKKEIEQLKSEISSLKNALDKCVDEMEKLKVENEKLKTEREKIETTWSDENTSLKAKNTELEQNLTTAVKELDKIRSENADLLSELNRLKQELESGKKEIDQLKSEIGSMKDALEKCVDEIEKLKTENKDLKSEVEGLESERDRLTNEVADLKPKISELQQKLTDASKKLDEAKVEDSDLRAEVDRLKKELENAGKEIDQLKTEMNSLKSGLDKCVDEMEKLKNENSELKSQVHGLRGEGDSLASELTNMKGENSALKDEKDQLNKQLAENKTENERLKKQNDELETENTKIKKELESCKGENNNLKDENNKLKDELEKLRGQLKSLNDETNKLKRELEEAEHKIQILEPQLSRVQSENEKSQSELAVLRNEANELKAKLDRETLDNVNMRNAMKILEDQMHDLNKKLDNCRAENDALKQENKDLKTKLSDTEQVVLNLKTECDNLKEDITDLQRKIEQLKQKITDQEAEIDHWKVEHCKFELDNEKLKADLEKILKDLNECQVAKKVIESDLIKLKNEKDDLNKKLTDLTSQLDQQKKTLEAEKSAKDKGDVQIASLNSELEALKKELEKLRADNSKYRSEIDDLGKQLVSAKNELKDCRDEVVVLKNANNALRSELDPLRSLKDDYNRLTTELDDLKSENTKLLQDKRSLEDEFGKLRGEGDGQRVEIDRLKTTLDAEKTAAEKLRSDLQSCKTENDKLQAQINEMKRNLDKMETENDRLKRELDESKKKLEDMEAKMKSLENQLSNLSVEKEELVKELYRTREDLNNLRNELEKQTAVKDTMAKESTNLKEELEALKAELNKTRDENDKLKNENDKLNAEIARLNKQLDALKDESANLKNEIENLNERNAELSKELAGAKDNLKDMETQLNNLKRENDDLKNKINSLEDKMQEVDDLKKKLEETKKELDKPSLELDTLKSTNKKLEDDLNNARNESLNLKNDLDKLQNDYNNLQTELTDLKMERDTFRERAAALEKDLVRVKRENDELKDQNAKFKTELDDCQEENNRLLKELEKLKSENVKLQDNLINAKSEGDRLKEDLNKLKKDYTDLRGDLTKAREDRDIRKEKDVELDKEIDELKTTNAKLKSDLYDCQKENEKLRKQLTKLKAENDKLKKALDKDVDQKETEIDKDILDDCSEYVKANKILQKNFENQLKGVQRLRDYINYLDGKGTEPKMADKLEESEINPNTTNDIEIEDLLKMSQDLSININKAEQEIQNLAKLLKPVELGAFDPDGWLNSLTLTQLAELHDKICELTSDMVQQDSKAVPCEAPQRSENSTDILNKRIAALQKQIAEKQMEAGWKLQELKRALRNEQANLIRLSNEMNLERQRNFNLQSRLDSS
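
As a quick consistency check, the coding sequence can be translated and compared with the protein sequence. The sketch below assumes the standard genetic code (NCBI translation table 1) and treats lowercase bases the same as uppercase; the `translate` helper and the table-building constants are illustrative names, not part of this record. Only the first 30 bases of the coding sequence are used here; the full sequence would need its lines joined first, which `translate` does by removing whitespace.

```python
from itertools import product

# Standard genetic code (assumed: NCBI table 1), codons ordered with T, C, A, G
# at each position and the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is removed and case is ignored; codons containing
    characters outside A/C/G/T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the coding sequence above.
cds_start = "ATGTGTTTTTGTGTAGGTGTAAATGCGGAT"
print(translate(cds_start))
```

The output can be compared with the first ten residues of the protein sequence listed above.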

Similar Transcription Factors

Clustering by sequence similarity using MMseqs2

100% Identity
iTF_00142247;
90% Identity
-
80% Identity
-