Thoc012026.1
Basic Information
- Insect
- Tetragonula hockingsi
- Gene Symbol
- -
- Assembly
- GCA_010645185.1
- Location
- WIUV01024707.1:1-2926[-]
Transcription Factor Domain
- TF Family
- TF_bZIP
- Domain
- bZIP domain
- PFAM
- AnimalTFDB
- TF Group
- Basic Domians group
- Description
- bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 23 0.0076 12 7.1 2.9 24 62 29 67 29 70 0.91 2 23 0.0038 5.7 8.1 4.2 24 63 95 134 93 135 0.92 3 23 0.0034 5.3 8.2 3.3 32 63 152 183 137 185 0.63 4 23 0.0012 1.8 9.7 6.8 35 64 186 215 179 224 0.67 5 23 0.055 84 4.3 1.4 26 57 240 271 238 275 0.89 6 23 0.063 96 4.1 3.5 39 62 288 311 270 327 0.52 7 23 1.6e-05 0.025 15.6 10.8 23 65 349 391 341 391 0.91 8 23 0.2 3.1e+02 2.5 5.0 40 62 397 419 380 441 0.53 9 23 0.00021 0.32 12.1 3.3 32 61 438 467 429 470 0.78 10 23 0.0016 2.4 9.3 2.3 30 59 461 490 459 499 0.85 11 23 0.0082 13 7.0 6.8 22 63 484 525 483 527 0.92 12 23 0.0018 2.7 9.1 1.8 28 61 504 537 500 541 0.89 13 23 0.012 19 6.4 0.3 35 62 539 566 527 568 0.68 14 23 0.01 15 6.7 1.8 36 60 565 589 563 594 0.73 15 23 4.6e-05 0.07 14.2 1.1 22 63 589 630 588 639 0.78 16 23 0.0055 8.5 7.5 0.1 33 64 635 666 632 667 0.90 17 23 0.00064 0.97 10.5 1.3 26 63 649 686 646 688 0.90 18 23 3.4e-06 0.0052 17.8 9.5 22 65 694 737 689 737 0.94 19 23 0.023 35 5.6 3.3 35 61 728 754 726 758 0.63 20 23 8.5e-05 0.13 13.3 1.3 22 63 757 798 756 800 0.92 21 23 0.0098 15 6.7 8.2 25 63 816 854 798 856 0.71 22 23 0.0009 1.4 10.1 4.4 24 64 843 883 842 884 0.89 23 23 0.0002 0.3 12.1 0.8 24 57 878 911 875 918 0.89
Sequence Information
- Coding Sequence
- atCGCGGAACTGTTGGACGGCCTAAGACTGTCAGAAATTAACCTGCTCGGGCTCTCCACTCTAAAATCCACACTAGAAGACTTCAAAGAGAAAATAGTCGACTTACAGTCGAAACTCGACAAGGCGAACCAAGATATTGACGATCTGAAATCGGAGTTAGCCAATCTGAGGAACGAGTTGGAAGACTGTAACAAGCGAAACGCGGAGCTGCAGGAGTATTGCTTTGACATGGACGCTCGTTCGAAGAAGCTGCGCGACCTGGACGAGAATCTCGCGGCTGCGAAACTCACAATAGCCGATCTCGAGAAAGAGGCGGACGTCTTGAGGAGAGACAAGGAGAATTTGTTGAACGAGCTAGACGAGGCGAGGAAACAGGTGGTGGCATTGACCGAGCAACTGGAGGACGAGAGGGCGGCCAGGATCGCGTTGGAGAAGGAACTGGAGGATAGCCGAAATGAGATTGAAAAGTTGCAGAAGGAGAATTCGGATCTGAAGGATCAGATCGGCGCTGAGAGGAAGGGGAACGATAAACTTCGCCAGGCATTGGAAGCGTCGAAAGAGCTGGCCGACGAGAACGAAAAGTTAAAGGCTCGGCTGGAGCAGCTGAAGAACGAGAACGACAGCCTGACGCGGAGCATGAAGGAGCTGAACGATTTGAATAATCAGCTGAGAAACGACTACGATAGTATGAAATGGGCGATGGATAATTTTCAAGCGGAGATCGACATACTGGCGGACGAGTTGGCCAACGCGGAACAGAAACACGACGCGTTGTTGAACGAGAATAATACTATCAGAAAGCAGCTCGAACGAGCGATTGCGGAAGACGTGAGTCTGAGAGCCGAACTGGACGAGGCTGGCGAACAACTCGACAAACTGAGATCGGAGAAAAGCGAGCTGTTTAAGAGCCTCGACGAGATGAAGCTCGAGAACGATTCGTTGAAGCGGGATATGAAGGCTTTAAGGGACGACCTTGAGGATTCTAGGGGGCAAGTGGAGGAGCTGAAAGCCGCTGGCGATGCTTTAAGGGCGGCGGATAAGAATAAGAAACTCGAACTCGCCGAACTGGAACAACGAGTAGAGAGCTTGAAGTCCGAGAAGGATCGCTTGACGAAGGAGAACGACGACCTGAGAAACAGAAACATGGAATTGCAACGGAGATTAGAAGAGCTGGATCAGATAAAGGGGGAAAATGCAGATTTACTTGCTGAAATGGATCGTTCGAGAAAAGAGTTGGATAAAACCTTGGAAGACGTTGATCAGTTAAAATCCGAAATAGGTTCCCTGAGGGACGGACTGGAAAATTGCGTGGCCGAAATGGAGAAACTGAAAACCGAGAACAATGACCTGAAGAAGGAGAACGAGTCCCTGAAGTCCGAAATTCAGGGCATTGCCAATCGGTTGATGAAAGAAAACGACAGTTTGAAAGATGAAATTGCGGAATTGGAGAAAAAGCTGACGGAATTGGATGAACTGAAGGGAGAAAATGCCGATTTGCTCGGCGAACTAGATCGTTTGAAACAGGAATTGGAGGAAACCTGGAAGGAGGTTGACCAATTAAAATCCGAGGCAAGTTCGTTGAAGTACGCGCTCGACAAGTGTGTAGACGAGATGGGGAAGTTACGAACTGAGAATGATGATCTTAAATTGGCAAATCAAGCTTTGAAGTCCGATATTCAAGGACTCGGCGATCGTTTAACGAAGGACGACGCCGATTTGAAAGCGAGAAACGAGGAACTGCGACAAAAATTAGGAGAGTTGGACAAACTGAGGTCGGAAAACGCGGATTTGCACGGCAAGGTCGATCATTTGAGACGCGAGGTGGAAAAACTTTTAGTGGATATCGATCAATTGAAATCCGAGGTAGCTTCTTTGAAAGACGCGCTGGATAAGTGTGTCGGCGAGATGGAGAAGCTGAGAAGCGAGAACAATGgtttgaagtttgaaattcAGGGGATGAAACGTGAAGGCGATAGTCTAGCCGTGGAGTTAAATAATCTGAAGAACGAGATTTCCACTTTGAAAGAGGAGAAGGATCAATTGAGCAAGCAATTGAGCGACAATAAGACGGACAACGAGAAACTGCGAGCGGACAGCGAGAAACTACGAGCGGAAAAGGCTCAAGTTGAAGCCGAAAACGAGAAACTGAGAGAAGAGATAAATTCCTGCAAGCAGgagaatgataaattaaaagacgAACTTGCAAAATTACGAGAACAGTCGCAATCGTTGAACGacgaattgaataaattaaaggCAGACCTCGATAAATCTGAGGAGAAAATTCGGTCTCTGGAACCGTTGGTCTCTCGTTTACAGAGTGAAAACgataaattacgaaatgatTTGACAGATTTGGGGAACGAGGCGAACGATTTGAAAGCAAAGATGCGCAAAGAAACTGCCGACAACGAAAAGATGCGGAACGACTTGAAGATATTGGAGGATCAGGTGCAAGATCTGAATAAGAAGTTGAACAATACCAGGACAGAAAACGATGCATTGAAACAGGAGAATCAAGATCTCAAAGCAAAGTTATTGAATACGGATCAAGATTTATCGAATTTGAAAGCGGAATGTGCCGAACTGAAACAAGAGATTGCTGAcctgaagaaattaattgacgagttaaaggaaaaaatcgcTAAATTGGAAGCAGACGTGGATCATTGGAAAATGGAGAATTGTAAGCTTCAGTTAGAGATTGATAAATTGAGAGCTGATCTTGAGGGAGCGTTGAAAGACGTGAGCGAGTGTAAg
- Protein Sequence
- IAELLDGLRLSEINLLGLSTLKSTLEDFKEKIVDLQSKLDKANQDIDDLKSELANLRNELEDCNKRNAELQEYCFDMDARSKKLRDLDENLAAAKLTIADLEKEADVLRRDKENLLNELDEARKQVVALTEQLEDERAARIALEKELEDSRNEIEKLQKENSDLKDQIGAERKGNDKLRQALEASKELADENEKLKARLEQLKNENDSLTRSMKELNDLNNQLRNDYDSMKWAMDNFQAEIDILADELANAEQKHDALLNENNTIRKQLERAIAEDVSLRAELDEAGEQLDKLRSEKSELFKSLDEMKLENDSLKRDMKALRDDLEDSRGQVEELKAAGDALRAADKNKKLELAELEQRVESLKSEKDRLTKENDDLRNRNMELQRRLEELDQIKGENADLLAEMDRSRKELDKTLEDVDQLKSEIGSLRDGLENCVAEMEKLKTENNDLKKENESLKSEIQGIANRLMKENDSLKDEIAELEKKLTELDELKGENADLLGELDRLKQELEETWKEVDQLKSEASSLKYALDKCVDEMGKLRTENDDLKLANQALKSDIQGLGDRLTKDDADLKARNEELRQKLGELDKLRSENADLHGKVDHLRREVEKLLVDIDQLKSEVASLKDALDKCVGEMEKLRSENNGLKFEIQGMKREGDSLAVELNNLKNEISTLKEEKDQLSKQLSDNKTDNEKLRADSEKLRAEKAQVEAENEKLREEINSCKQENDKLKDELAKLREQSQSLNDELNKLKADLDKSEEKIRSLEPLVSRLQSENDKLRNDLTDLGNEANDLKAKMRKETADNEKMRNDLKILEDQVQDLNKKLNNTRTENDALKQENQDLKAKLLNTDQDLSNLKAECAELKQEIADLKKLIDELKEKIAKLEADVDHWKMENCKLQLEIDKLRADLEGALKDVSECK
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01419393;
- 90% Identity
- iTF_01419393; iTF_01418115;
- 80% Identity
- -