Htar018657.1
Basic Information
- Insect
- Herminia tarsipennalis
- Gene Symbol
- -
- Assembly
- GCA_945859575.2
- Location
- OX243843.1:50183-68298[-]
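The Location field packs contig accession, coordinates and strand into one string. A minimal sketch of splitting it apart is shown here; the names `Location`, `parse_location` and `span_length` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "OX243843.1:50183-68298[-]".
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a 'contig:start-end[strand]' string into its parts."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start = int(match.group("start"))
    end = int(match.group("end"))
    if start > end:
        raise ValueError("start coordinate is larger than end coordinate")
    return Location(match.group("contig"), start, end, match.group("strand"))


def span_length(loc: Location) -> int:
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("OX243843.1:50183-68298[-]")
print(loc.contig, loc.strand, span_length(loc))
```

Under that assumption the gene model spans 18116 bp on the minus strand, which is longer than the coding sequence listed further down, as expected for a locus with introns.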
Transcription Factor Domain
- TF Family
- TF_bZIP
- Domain
- bZIP domain
- PFAM
- AnimalTFDB
- TF Group
- Basic Domains group
- Description
- bZIP proteins bind DNA as homo- or heterodimers. Each monomer contains a highly basic DNA-binding region adjacent to an α-helical region, and the α-helices of the two monomers fold together as a coiled coil.
- Hmmscan Out
-
 #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
 1   7   3.7e-05     0.032   15.3   0.8        29      62       266     299       262     301  0.89
 2   7       1.8   1.5e+03    0.3   1.5         4      19       433     448       431     450  0.89
 3   7   0.00015      0.13   13.3   0.4        27      62       701     736       699     738  0.90
 4   7    0.0044       3.8    8.6   0.2        25      60      1129    1164      1128    1167  0.92
 5   7    0.0058         5    8.2   0.6        28      63      1528    1563      1523    1565  0.88
 6   7       8.9   7.6e+03   -2.0   2.8        20      39      1594    1613      1592    1624  0.88
 7   7    0.0027       2.3    9.3   0.0        30      59      1825    1854      1813    1856  0.73
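The hmmscan output lists seven candidate bZIP domain hits with widely varying E-values. The sketch below parses those rows and keeps the hits under an independent E-value cutoff. The names `DomainHit`, `parse_rows` and `filter_hits` are hypothetical, and the cutoff of 1.0 is an illustrative assumption, not a threshold taken from this record or from AnimalTFDB.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# Data rows copied from the Hmmscan Out field of this record.
ROWS = """\
1 7 3.7e-05 0.032 15.3 0.8 29 62 266 299 262 301 0.89
2 7 1.8 1.5e+03 0.3 1.5 4 19 433 448 431 450 0.89
3 7 0.00015 0.13 13.3 0.4 27 62 701 736 699 738 0.90
4 7 0.0044 3.8 8.6 0.2 25 60 1129 1164 1128 1167 0.92
5 7 0.0058 5 8.2 0.6 28 63 1528 1563 1523 1565 0.88
6 7 8.9 7.6e+03 -2.0 2.8 20 39 1594 1613 1592 1624 0.88
7 7 0.0027 2.3 9.3 0.0 30 59 1825 1854 1813 1856 0.73
"""


def parse_rows(text: str) -> List[DomainHit]:
    """Turn whitespace-separated hmmscan domain rows into DomainHit tuples."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if not fields:
            continue
        if len(fields) != 13:
            raise ValueError(f"expected 13 columns, got {len(fields)}: {line!r}")
        hits.append(
            DomainHit(
                int(fields[0]),
                int(fields[1]),
                *(float(x) for x in fields[2:6]),
                *(int(x) for x in fields[6:12]),
                float(fields[12]),
            )
        )
    return hits


def filter_hits(hits: List[DomainHit], max_i_evalue: float = 1.0) -> List[DomainHit]:
    """Keep hits whose independent E-value is below the (assumed) cutoff."""
    return [hit for hit in hits if hit.i_evalue < max_i_evalue]


for hit in filter_hits(parse_rows(ROWS)):
    print(hit.index, hit.ali_from, hit.ali_to, hit.i_evalue)
```

With this illustrative cutoff only hits 1 and 3 remain (alignment coordinates 266-299 and 701-736); a stricter or looser cutoff changes the selection.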
Sequence Information
- Coding Sequence
- atggacagtaacaaactacgaaatatttttaacaagaggagcacaacgctatcggagaaggagaaggagaaggagaaggagaaagagaaggagagagagggaGAGAACGTACTGCAAAGGAGTGCGGGGAGTTTGCCCCTAGAGGCGGCTGATTCCGGCCGGCTTAGCCGTTCCAGGACTTCGGTGACCTCTGCGTCGCTGGAGAGTTTGCTGCGCTCGGGCCAGTTCGGCTCTGAGTCATCCGTTAGCTCAGCGGTTAGCGACCTGGACCAGATGGTGGCTGGGTTAGGGAACGCTCCAGAGACCTCGACGGCTGGCGAACGCCGCGCTCAACATCAAAAGCGCGGCGTCTCTCCCAGTGATGACGAAAACCTTCCTGGGCTTGACCTTGAGGCCCCCAGGAAGATACATACATCCCAAAGGGGGGGATCTGTGGCGAGGTTTGCTCGCCCGAAAACACCCCCTGCAGTGACGGCTACTAGTGCGGCTACTAAGGCGGTCAATAAAACGTCTGGCGTATTTTCTAGGCGGGAGGCCTACAACAGGGGCCGCTCAGAGGGCGAAATAGTTGCTGAGGAGCGGGTGCAGCAGCTGTTGGGTGGGCTCTGCCCGTCGGGAGGGGTTAGCTCCCCGAAAACCACGGAAGAGCTCATGCGCTCGGCGACCACCTCCGTGGCGGTCATCACAAACGTGGCCGTAAAAAGTAAAAACCTCAAGGGCACCTTCGCCAAGGGCCTGAAGGAGTCCGCCACGGTTTTAAAGGAGGTGGTAGAGGCGTTTTTTGAGCGGTCCACTGCCAACGAGACGGAGACCAAGAGTCTGCGGGCGGATAACACCCGCCTGAAGGAAGAAATAAAGGTGCTCCGCGCTGAAGTAGCACAGTTGCGTGCCGATTTCAGATCGGCTAGAGGCGGTTCTGTCCCGGTCTTAGTCAGGACCGGGCCTGTCGGTTCTGTCCCGGCCCATAGTGGAGAGCGCCCATCTGACGACTCCTTTACGGAGGCGCAGATGAGGCAGCTCTCACGTCTTATCGGGGGCATCGTCGACGCTCGCCTTGAGGGCGTTAAGGCGCGTCTGCCCCCAGAACCGAGCTGTCGGCCGCCGCTCGCGGGGCCCAAGCCCCCTAAGGGGCCACTGATGGCACGCCCGGCGCCACCACCGCGTACAGCGGGACGCGCTGGGTCTCTTAGAGCAGCGGGCACCTCAAGCGCCCAAACTGCTCAAGTCCGTCCCCCGGCGCCTGCGCCTGCTCCCGCCTCTGCTCCCTCTGGTGAGGAATGGCAAACGGCAGGGAGCAAGAAATCGCGGCGCCGGCATAAGCAGAGGgccgctgccgcctcctctgtgcaggaggagggtcccgagcgcacttcgctcggggaatcccttgcggagccagtcggctccgtctgtcccacgtactctgggggggacaCCCAACGACGCACCCGCCCTGTGCCGAGCGGTAAATCCGGCACCGAGAAAAGCGCGAAGGTCGCGCTTCGATCTTCCAACCGCAGTGCGGTTCTTAAGAAGGGTCGAAAGACCACAAGGAGGGCTAAGGCGAGTGGGGATCCATCCCCGTTGCCGCCTAAAGTCCCGCGGAAGGGGTCTCTGACCGCTTCTGAGTCCTCCATTACGGACGTGGACGTAATGGAGGATACCGAAGTCCACGAAGGGGGTGTAGTGTGTGGCGAGCAGCCAGGTAGGCTCTGGGCTTCCCAGAAGCGTAACCGCAGCCGTTCGCCGCATCAGGAAGTGCCGGGGGTCTCTCAGACTCCTAGCAAAATTCCGACCGCCCGCAGAGGAGGAATTGGTAGTTCAGTGTTCTCCAGTAGGGAGAAATACCTACGCACGCGCACGGAGGGTGAGCGCACCGCCGAAGACCGGGTGGTGCAGCTCCTCGCCGAGGACCAACGGTCGGGGGGGGACCTTGAATCCCCTCCTACTACGTCGGAACTCAAGGCGAGGGCCGGGGATGCGGTAGCCGCGATTTAC
GAGGTGGCTACCAAATCCACCAATTTGAAGGGCCCCTTTGTGCGGGGCCTTAAAGAGGCAGGCGCAAAACTGCTTGTAGTGGTGGACGACCTGGCTGGTCGAACAGACGCCACCACCAGGGAGATACAGCAGTTGCGCTCAGAGAACGCCACTCTGCGGTCGGACATGGCGGAGCTCCGCGCCGAGCTCGCCAGATTGCGGGCTGACTTGGCCTCCTCCCAAGCCGGTCCACCCCGGGCTGTATTGACCAGTCCGGGCAAGACGCAGCCTTCCACCTCTCCTGGAGGCATTGAGGCGCAGTTGCGGCAGTTAATGGGCTCATTCGAGTCTAAGATCGCTGCTCGCATCGGGAGTCTGGAGGCAAGATTGCCTCCGGTGCCGACCTCCCGCACACCGCCGGTCGCAAAGCCGTATACGGCTACCGCTCCGACGCCCCTACCCTGTTCTGGGGGACGCGTTGGGACCTACGCGACACGCGCGGCGTCTGCGCCTAAGAAGACGACGGCACGGGCTACGGCCCCTCCAGCGGTCGAGAGAACGCCAGCCCAGTCCCGGGGAGGCCCCCAGCCAACCCAGGTTTTGACGGCAGCACCATCTCCCGCTCCGTCGGTCACTCCACGTGCCCGGGAAAACGAGGTTCGTGCCCCAGCGCCCGCACCTGCTCCTGGCCCCCTCGTCTCCGCTGACGAGGGCTGGCAGAAGgagacggtaacagatggagagggagaAGATGCTGTAAGGAGTGGGGGCAGATTGTCCCGAGAGGTGGCTGAATCCGGCCTTTTGAGCGGTTCCAGAACTTCAGTGACGTCTGCGTCGCTGGATAGCCTGTTGCGCTCGGGTCGGTTCGGCTCAGATTCCTCGCTTAGCTCAGTAGTAAGCGATATGGACTCGCTGGCGGGTGGGTCAAGTACCGGTCCACGAACGTTGACAGCTGAGGATAGCCGCGCTCAACAACAAAAGCGCGGCGCATCACCCGGATTTGACGAGAGTCTTCCTGAGGTTGACCTTGACGTCCCCAGGAAGGTTCATTCCTCCCAAAGGGGGGGGTCTGTAGCGAGGTTTACCCGCCCAAAGACACCCCCTGCAGTGACGGCCCCGACCGTTTCAACGGCCGATGTGTTTTCCAGGCGTGAGGCCTATAACAGAGGCCGCACAGAGGGAGAAATAGCTGCTGAGGAGAAAGTCCAGCAGCTGTTGGGTGGGCTCTGCTCGTCGGGAGCGGTTGGCACCCCAAAATCCATCGAGGAACTCGTCCAATCGGCGACCAAATCAGtggcggttattgcgaatgtggccacgaaaagcaaaaacttgaagggcacattcgcaaaaggtttgaaggaagcagcctccatcctaaaggaggtggtaaacacgttctccgaacgtactgctgccactgaggtGGAGATGGAGAAGATGCGGGCCGACAACACCCGACTGAAGGATGAACTAAAGACGGTCCGGGCAGAGATGTCTCAACTGCGTGCCGACTTCGGATCGCCTAGAGCCGGTTCTGTCCCGGCCCATAGTGGAGAGCGCCCACCTGACGACTCCTTTACGGAGGCGCAGATGAGGCAGCTCTCACGTCTTATCGGGGGCATCGTCGACGCTCGCCTTGAGGGCGTTAAGGCGCGTCTGCCCCCGGAACCGAGTTGCCGGCCGCCACTCGCGGCGCCAAAGCCCCCAAAGGGGCGATTGATGGCACTCCCGGCGCCACCAACCCAAACAGGGGGACGCGCCGGGTCTTTTGCGGCAGTGGTTGCCTCAGGACCAGTCGGCCGAAGTGCGCGGAGTGGAGATTCCCATCCCCCGCAGCGTGTGCCAAGAGGAGCACCCCGTGTTGCCGTAGTGGGCGCCTCAAGCGCCCCATCGGCCCAAGTCTGCCACCAAGCGCCAGCGTCTGCTCCCGCCTCCGCCCCCTCTGGAGAGGACGCAGCAAGGAGTGGGGGCAGTTTGCCCCGAGAAGTGGCGGAGGCCGGCCCTTTGAGCGGTTC
CAGAACTTCGGTGACGTCTGGGTCACTGGATAGCCTGTTGCGCTCGGGCCGGTTGGGCTCCGATTCATCGCTGAGCTCAGTCAGTGATATGGACTTGTTAGTGGGTCCACAGACCCTGTCAGCTGAGCACCGCCGCGCTCAACAACAAAAGCGCGGCGCATCTCCCGGAGGTGACGGAAACCTTCCTGAGGTTGACCTCGAGGTCCCCAGGAAGGTTCATTCTGCTCAAAGAGGGGGTTCCGTGGCGAGGTTCGCTCGCCCGAAAACACCCCCTGCAGTGACGGCACCGGTCGTTCCAACGTCCGGCGTGTTTTCTAGGCGGGACGCCTACAACAGAGGCCGCACGGAGGGGGAAATAGTTGCTGAGGAGCGGGTCCAGCAACTGTTGGGTAAGCTTCGCCCGTCGGAGGGAGTTAGCTCTCCGAAGACCATGGAGGAGCTAATGGAATCGGCGGCCACCTCCGTGAAGGTCATAGCGAACGTGGCTCAAAAGAGCAAGaaccttaagggcacgttcgccaagggcctgaagacgtcggcctcgacattaaaagaggtggtggaggcgatgtttgagcgcttcagtgccaacgaggtggagacacggaggttgcgggccgacaactcccgcttgGAGGAAGACCTGAAGGCACTTCGCTCCGAGTTAGCTCTGTTGCGTGCCGATCTTCGATCGACTGGAGCTGGTTCTGCCCCGGCCGTAGTCAGTGCTGTGTCGGTCGGTTCTGTCCAGGCCCATAGtggagccggaggcggagcacagagggagaaggaaaaggagaAAGAGAAGGAGAAGGAGAAAGAGGTGGAGGACGTAACGCGAAGGAGCGCGGGCAGTTTGCCTCAAGAAGCGGCCAATTCCGGCCGGCTAAGCGGTTCCAGGACTTCGGTGACCTCTGCGTCACTGGAGAGTTTGTTGCGCTCGGGCCGGTTCGGCTCGGAATCGTCGCTCAGCTCAGCGATGAGCGATTTGGACCAGTCGATGGGTGGACCAGTCATCGGTCCAGACGCGCCTACGGCTGATGAGTGCCGCGCTAGGCAGCAGAAGCGCGGTGCTTCGCCTGGAAGCGGTGAGAACCTTCCTGAGGTTGACGTTGGAGTCCCCAGGAAGGTTCACTCTTCCCAGAGAGGGGGATCTGTGACTAGATTTGCTCGGCCGCAGACACCCCCTGCAAAGACGCCGGTCGCAACAACGAGTGGCGTTTTTTCTAGGCGGGAGGCCTACAACCGAGGCCGCACAGAAGGAGAGTTAGTCGCTGAGGACCAAGTCCAGCAACTGATGGACGGGCTCCACTCGTCAGGAGGGATAATTGCCCCGAAAACCGTGACGGAGCTCGTGGAGTCGGCGACTGCTTCAATAGCGGTCATTGCGAAGggcaccttcgcaaaaggcttgaaggagtcagcctccgtccttaaagaggtggtggacaccttctccgaccgttgcATGGAGTTGCGGGCCAATGGCTCCCGCTTAGAGGCTGAAGTCGGGGCGCTCCGCGCCGAGATCAGTCAGTTGCGGGCCGAAGTTAAATCGGTCAAGGCCGGTTCTGTCCCGGCCAGTGTAGAGGCTGTGCCAACAGGTTCCGTCCTGAATGTGCCAGCCCAAACTAGAGAGCTGTCGCCTGGCGACTCCTTTACGGAGGCGCAAATGAGACAGCTCACAATTCTTTTCGGGAGCCTTATTGACGCGCACCTTGAGGGCATCAAGGCGCGTCTGCCCCCGGAGCCGAGCTGTCGGCCGCCGCTCGCAGTGCCCAAGCCGAAGCCTTCTCAGGGGCCACCAATGGGGCCGCCACCACCCCGTACAGGGGGACGGGCCGGGTCTTTTGCAGCAGCGGCTGCCTCAAGACCGGTTGAGCAAGGTGTGCGGCCGCGGAGGGGAGATCCCAAACCCCCGCCGCAGCGTGCGCCGAGAGGAGCCGTTGTAGATGCTTCATGCGCCCCAGCGGCTCAGGTCCATCACCCGGCGCCTGCGC
CTGCTCCCGCCACTGCCACCTCTTGTGAGGAATGGCAGTAG
- Protein Sequence
- MDSNKLRNIFNKRSTTLSEKEKEKEKEKEKEREGENVLQRSAGSLPLEAADSGRLSRSRTSVTSASLESLLRSGQFGSESSVSSAVSDLDQMVAGLGNAPETSTAGERRAQHQKRGVSPSDDENLPGLDLEAPRKIHTSQRGGSVARFARPKTPPAVTATSAATKAVNKTSGVFSRREAYNRGRSEGEIVAEERVQQLLGGLCPSGGVSSPKTTEELMRSATTSVAVITNVAVKSKNLKGTFAKGLKESATVLKEVVEAFFERSTANETETKSLRADNTRLKEEIKVLRAEVAQLRADFRSARGGSVPVLVRTGPVGSVPAHSGERPSDDSFTEAQMRQLSRLIGGIVDARLEGVKARLPPEPSCRPPLAGPKPPKGPLMARPAPPPRTAGRAGSLRAAGTSSAQTAQVRPPAPAPAPASAPSGEEWQTAGSKKSRRRHKQRAAAASSVQEEGPERTSLGESLAEPVGSVCPTYSGGDTQRRTRPVPSGKSGTEKSAKVALRSSNRSAVLKKGRKTTRRAKASGDPSPLPPKVPRKGSLTASESSITDVDVMEDTEVHEGGVVCGEQPGRLWASQKRNRSRSPHQEVPGVSQTPSKIPTARRGGIGSSVFSSREKYLRTRTEGERTAEDRVVQLLAEDQRSGGDLESPPTTSELKARAGDAVAAIYEVATKSTNLKGPFVRGLKEAGAKLLVVVDDLAGRTDATTREIQQLRSENATLRSDMAELRAELARLRADLASSQAGPPRAVLTSPGKTQPSTSPGGIEAQLRQLMGSFESKIAARIGSLEARLPPVPTSRTPPVAKPYTATAPTPLPCSGGRVGTYATRAASAPKKTTARATAPPAVERTPAQSRGGPQPTQVLTAAPSPAPSVTPRARENEVRAPAPAPAPGPLVSADEGWQKETVTDGEGEDAVRSGGRLSREVAESGLLSGSRTSVTSASLDSLLRSGRFGSDSSLSSVVSDMDSLAGGSSTGPRTLTAEDSRAQQQKRGASPGFDESLPEVDLDVPRKVHSSQRGGSVARFTRPKTPPAVTAPTVSTADVFSRREAYNRGRTEGEIAAEEKVQQLLGGLCSSGAVGTPKSIEELVQSATKSVAVIANVATKSKNLKGTFAKGLKEAASILKEVVNTFSERTAATEVEMEKMRADNTRLKDELKTVRAEMSQLRADFGSPRAGSVPAHSGERPPDDSFTEAQMRQLSRLIGGIVDARLEGVKARLPPEPSCRPPLAAPKPPKGRLMALPAPPTQTGGRAGSFAAVVASGPVGRSARSGDSHPPQRVPRGAPRVAVVGASSAPSAQVCHQAPASAPASAPSGEDAARSGGSLPREVAEAGPLSGSRTSVTSGSLDSLLRSGRLGSDSSLSSVSDMDLLVGPQTLSAEHRRAQQQKRGASPGGDGNLPEVDLEVPRKVHSAQRGGSVARFARPKTPPAVTAPVVPTSGVFSRRDAYNRGRTEGEIVAEERVQQLLGKLRPSEGVSSPKTMEELMESAATSVKVIANVAQKSKNLKGTFAKGLKTSASTLKEVVEAMFERFSANEVETRRLRADNSRLEEDLKALRSELALLRADLRSTGAGSAPAVVSAVSVGSVQAHSGAGGGAQREKEKEKEKEKEKEVEDVTRRSAGSLPQEAANSGRLSGSRTSVTSASLESLLRSGRFGSESSLSSAMSDLDQSMGGPVIGPDAPTADECRARQQKRGASPGSGENLPEVDVGVPRKVHSSQRGGSVTRFARPQTPPAKTPVATTSGVFSRREAYNRGRTEGELVAEDQVQQLMDGLHSSGGIIAPKTVTELVESATASIAVIAKGTFAKGLKESASVLKEVVDTFSDRCMELRANGSRLEAEVGALRAEISQLRAEVKSVKAGSVPASVEAVPTGSVLNVPAQTRELSPGDSFTEAQMRQLTILFGSLIDAHLEGIKARLPPEPSCRPPLAVPKPKPSQGPPMGPPPPRTGGRAGSFAAAAASRPVEQGVRPRRGDPKPPPQRAPRGAVVDASCAPAAQVHHPAP
APAPATATSCEEWQ
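The coding sequence and the protein sequence above can be checked against each other by translating the former. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); `translate` and `is_consistent` are hypothetical helper names, and only short fragments copied from the start and end of the coding sequence are used as input.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; unknown codons become 'X'."""
    seq = cds.upper().replace("U", "T")
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


def is_consistent(cds: str, protein: str) -> bool:
    """True if the CDS translates to the protein followed by one stop codon."""
    return translate(cds) == protein + "*"


# First seven codons and last three codons of the coding sequence above.
print(translate("atggacagtaacaaactacga"))
print(translate("TGGCAGTAG"))
```

The mixed upper and lower case in the coding sequence is handled by upper-casing the input before lookup, so it does not affect the translation.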
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -
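The record states only that MMseqs2 was used for clustering at 100%, 90% and 80% identity; the exact command and parameters are not given. As a hedged illustration, the sketch below builds `mmseqs easy-cluster` command lines with `--min-seq-id` set to each threshold. The function name `build_cluster_commands` and the file names are hypothetical, and nothing is executed.

```python
from typing import List, Sequence


def build_cluster_commands(
    fasta: str,
    out_prefix: str,
    tmp_dir: str,
    thresholds: Sequence[float] = (1.0, 0.9, 0.8),
) -> List[List[str]]:
    """Return one 'mmseqs easy-cluster' argument list per identity threshold."""
    commands = []
    for threshold in thresholds:
        percent = int(round(threshold * 100))
        commands.append(
            [
                "mmseqs",
                "easy-cluster",
                fasta,
                f"{out_prefix}_id{percent}",  # output prefix, e.g. clusters_id90
                tmp_dir,
                "--min-seq-id",
                f"{threshold:.1f}",
            ]
        )
    return commands


# Hypothetical input and output names, for illustration only.
for command in build_cluster_commands("tf_proteins.fasta", "clusters", "tmp"):
    print(" ".join(command))
```

Each list could be passed to `subprocess.run` on a machine with MMseqs2 installed. For this entry all three identity levels list no members, so no other sequence was grouped with Htar018657.1 at 80% identity or above.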