Avir031899.1
Basic Information
- Insect
- Agapostemon virescens
- Gene Symbol
- -
- Assembly
- GCA_028453745.1
- Location
- JAQQRL010005302.1:1-4254[-]
Transcription Factor Domain
- TF Family
- TF_bZIP
- Domain
- bZIP domain
- PFAM
- AnimalTFDB
- TF Group
- Basic Domians group
- Description
- bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 27 6.7 9.6e+03 -2.6 0.0 43 60 18 35 15 38 0.73 2 27 0.0003 0.43 11.3 2.1 30 60 77 107 73 111 0.89 3 27 2.6 3.7e+03 -1.3 5.0 28 63 133 168 131 170 0.91 4 27 1.5 2.1e+03 -0.5 3.2 32 59 162 191 155 203 0.67 5 27 0.014 20 6.0 1.4 24 46 222 244 212 259 0.63 6 27 0.82 1.2e+03 0.3 4.8 31 57 275 301 256 309 0.51 7 27 3.7 5.3e+03 -1.8 7.4 25 62 283 337 279 340 0.61 8 27 1e-05 0.015 16.0 2.2 24 63 341 380 338 382 0.94 9 27 0.00082 1.2 9.9 6.9 24 64 369 409 368 410 0.95 10 27 0.021 30 5.5 6.9 24 57 383 416 381 420 0.86 11 27 0.0016 2.2 9.0 2.7 28 64 408 444 406 445 0.89 12 27 0.032 46 4.8 2.6 21 60 443 482 443 487 0.89 13 27 0.00013 0.18 12.5 0.6 24 60 495 531 494 533 0.93 14 27 5e-05 0.072 13.8 7.3 24 63 523 562 522 564 0.95 15 27 0.031 45 4.9 0.3 41 64 568 591 559 592 0.79 16 27 0.00033 0.47 11.2 0.7 22 62 584 624 583 627 0.86 17 27 0.00057 0.83 10.4 9.1 21 62 639 680 638 683 0.91 18 27 0.028 40 5.0 4.4 22 59 682 719 680 722 0.87 19 27 0.0014 2.1 9.2 6.6 27 60 722 755 717 759 0.91 20 27 8.1e-06 0.012 16.4 4.2 22 63 745 786 744 788 0.94 21 27 0.0095 14 6.5 4.2 22 57 787 822 786 826 0.83 22 27 9.9e-05 0.14 12.9 12.1 26 64 826 864 823 865 0.92 23 27 8e-05 0.12 13.2 8.2 22 60 836 874 836 876 0.93 24 27 3.4e-06 0.0049 17.6 5.9 22 63 864 905 863 906 0.92 25 27 0.29 4.2e+02 1.8 0.1 37 59 914 936 907 939 0.79 26 27 0.00039 0.56 11.0 3.0 33 64 938 969 933 970 0.88 27 27 0.029 42 5.0 2.3 24 64 971 1011 970 1012 0.92
Sequence Information
- Coding Sequence
- ATGGGAAAACGTAGTTGGTCCAACGAAAAACAAAATCTCGTTTCCGTCGGCAGATTAAGGGACACGATATCGCACTTGAAATTAAAACTCTCGCAGACTGAGGAGGACGTGAGCTGTCCTGCGATATATCGCCTGAGAGCTAAGCTCCGCGAACTGATGAAAGGAGGTCAAACAGCCGTCCAGCAGGTTCCGAAAGTCGTCGAAAGATCGATCGAAATGTTGGTGGATCTGTCGAACAGCTGCGACGATCTGCGTCTCGAGAACGAGCAGCTTTTAGCCGAGCTTGCCGAGTTACGTCGCCAATTAGCGGATCTTGAAAAAACGAGACCGAAAGAACCGTCGGCGGAGATGCTAAGAACAGCCGAAACAACGACGGTTCCTGAATACATAGACGTCTCGGAGCTGCTGCGAAAGCTCGAAGATTGCGAACTCGCTGTTTCCGAATTGAAACATCAACTAGACGAGAAAGATAAGCTCATCGAGGCACTGAAGAAACAACTTGAAAACATGGTAGATCAGCAGGCTCTGTTGGATGAGATCACGGCCATGAAGGAGGAACTCGCGAAAAGGGACAATAAGATGAGGGACCTTCTGAACGACATGAGGCAGTCTGAGATAAGTCTGTTGGGTCTGAACCAACTAAATTCGGAGCTGGAAGCGTTGAAGCCTCGATTATCCGAACTGGAAGGGGAGAGAGATTCGTTGACGGACGAAGTGTCGAAGCTGAGAAAATTATTAGCCGAGAGGAACGATCAAATAATTGAGATACTGGAGCACAAGAACAGGCTGGAGCAGGAGCTTGCTGAGAAGGAGGCGGAGTCTCAGCGGATTATCGACAGCTTGAAAAAGGAGATGGACGATTTGCTGGCGCGAGTAGCGAATTTGCAGGACGAGCTCGATCAGCGTGATAAGCGAATCACGGACCTCGAGAAATGTTGCTCCGAAAGGGACGAGCTTTTAAAGAAACTACAGGCCGCGGAGGACGAGTTGGCTTCGCTACGAAGCGAACTTGCGTCCGCGAAAGCAACGATAGAGGATCTTCAGGGCGAAGTGGATACCCTAAAGAAGGACAACGACAATCTGCTGAAAGAGCTGAATGAAATCAAGGAACAGATGAATGATTTGACCGACCAATTAGCAGAGGCGAGAGCCGCGAAACAGGCCTTAGAAAAAGACCTCGAATCTGTCCGGAAGGAGCTAGAACAACTACGAAATGAGAACTCCGATCTGAAGGGTCAGTTGGAAGAAGCAAGATTCGAGAACGACAAGCTCAGAAAAGAGAACGAAGCCTTCAAAACAGAGCTGGACAACGTGACTTCGGAGCTGGATAAACTGAAGAAAGCCAATGACGATCTGCAGAGGAGCTTGGACGCCGCGAAGCTGGAGAACGACAATTTGAAAAGCAGCCTCGAAGATGCTCGAAGAGAAGCAGACCGGTTGCGAGCCGACAAAGACGCGTTGGAGACAACGGACGCAGATGCGAAAGCTAAGATCGACGACCTCCAGTCCCGATTGACGGATCTGACAGCGGAAAGGGACCGGTTGGCTGGCGAGAACGCGGACATCAAGGCTAAGAATTCGGAACTGGAACGGAGATTGGACGACGCGACGAAAGCGCTGGAAAAACTGAAGGCAGAGAATGCCGATCTGCTGGAGGAGCTGGAGCGTTTGAGAGCGGAATTAGCGAGAGCCCGGAGCACGATCGATCAATTAAAAGAAGAGATGGGCTCGCTGAAGGAAGCGCTGGATAAGTGCGTGGACGAGATGGACAAGTTGAAAGTCGAAATTGGGGAGCTTCGGGCGAAGAACGAGGCTCTGAGAGCGGAGCTTGACGAATGCAAGGCCGACAGGGACTCGTTGAAGAACGTTTTGGGCCGAACCAAGGCGGAGTTGGACGGCGTAACCGACGAGCTGAACAAGCTGAAGGAGGAGCACGGGTTGCTTGAACAAAATTTCGACCGACTCCAGGCCGAAAGGGACAAACTGCAGGCGGAGCTGGAAACCCTGAAGAACGAGGCGAAGAAGCTCGAAGAAGAAATGAGCAGAGCGAAGCAGAGAGAAGCTGCGTTAAACGACGAGCTGGATCGCGCGAGGAGGGAGAACGACGCGTTAGCAGCGGAATTAAGCAAACTGAAAGACGAACATTTAGCTTTGCAGAACGAGAGAGATCGATTGAAGAGACAATTGGACGACGCGAACGCTGAGAACGAGAAGCTGAGAGACGAGCTGGCTCGGTTGAAGGACGAGCACGAGAAGCTGAAAAGAGACAGAGACGATTTACAGCAGGATAACGAGAAACTGAACGGCGAAGTTGAGCGACTGCGCGGAGAAAGGGACTCGTTAAACGACGAGCTGGATCGCGTTAAGAAGGAAAAAGATTCGACGGCAACGGAATTAAGCAAACTGAAAGAGGAACATTCGGCGTTGCGGGACGAGAGGGATCGATTGAAGAAACAATTGGACGACGCGAACGCTGAGAACGAGGAGCTGAGACAAGAGCTGGCTCGGTTGAAGGACGAAAACGAGAAGCTGAAAAGGGAAAAAGACGATCTGCGGCAGGATAACGAGAAACTGAACGAAGAAGTTGAACGGCTGCGCGGAGAAAAAGACTCGTTGAACAACGAAGTAAACAAGCTGCGGGAGGAGAACGCTCGGCTGCAGAAGGATCTGAGTGCTCTACAGAGCGAAGTGCATGATCTGAAGGCGAAGCTCGATGAGGAACGGAAGGCCAACGAAATATTGAAGAAGGATTTGATGATGCTAGACAGCGAGATACAGGATTTGAGGAAGGCTCTCGACGAGGCCAGGACGAAAAGTGCTGCCCTGACGGAGGAGAATCAAGAGCTTCGATCGAGGTTGCAGGATTTGCAACACGAGCTGGACAGCTCGAGGTCCGAGATCGAAGATTTGAAGAACCAAATCGCTGATTTGAAGGCGAAAATCGCTAAATTGGAGGAGGACCTGGAGCATTGGAAGTTGGAGAACTGTAAGATCAAGATGGAGCTGGATAAACTGAAGGATGACTTGGAGAAGGCGTTGAAGGATTTGAACGAGTGCAAG
- Protein Sequence
- MGKRSWSNEKQNLVSVGRLRDTISHLKLKLSQTEEDVSCPAIYRLRAKLRELMKGGQTAVQQVPKVVERSIEMLVDLSNSCDDLRLENEQLLAELAELRRQLADLEKTRPKEPSAEMLRTAETTTVPEYIDVSELLRKLEDCELAVSELKHQLDEKDKLIEALKKQLENMVDQQALLDEITAMKEELAKRDNKMRDLLNDMRQSEISLLGLNQLNSELEALKPRLSELEGERDSLTDEVSKLRKLLAERNDQIIEILEHKNRLEQELAEKEAESQRIIDSLKKEMDDLLARVANLQDELDQRDKRITDLEKCCSERDELLKKLQAAEDELASLRSELASAKATIEDLQGEVDTLKKDNDNLLKELNEIKEQMNDLTDQLAEARAAKQALEKDLESVRKELEQLRNENSDLKGQLEEARFENDKLRKENEAFKTELDNVTSELDKLKKANDDLQRSLDAAKLENDNLKSSLEDARREADRLRADKDALETTDADAKAKIDDLQSRLTDLTAERDRLAGENADIKAKNSELERRLDDATKALEKLKAENADLLEELERLRAELARARSTIDQLKEEMGSLKEALDKCVDEMDKLKVEIGELRAKNEALRAELDECKADRDSLKNVLGRTKAELDGVTDELNKLKEEHGLLEQNFDRLQAERDKLQAELETLKNEAKKLEEEMSRAKQREAALNDELDRARRENDALAAELSKLKDEHLALQNERDRLKRQLDDANAENEKLRDELARLKDEHEKLKRDRDDLQQDNEKLNGEVERLRGERDSLNDELDRVKKEKDSTATELSKLKEEHSALRDERDRLKKQLDDANAENEELRQELARLKDENEKLKREKDDLRQDNEKLNEEVERLRGEKDSLNNEVNKLREENARLQKDLSALQSEVHDLKAKLDEERKANEILKKDLMMLDSEIQDLRKALDEARTKSAALTEENQELRSRLQDLQHELDSSRSEIEDLKNQIADLKAKIAKLEEDLEHWKLENCKIKMELDKLKDDLEKALKDLNECK
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -