Hcor021557.1
Basic Information
- Insect
- Hymenopus coronatus
- Gene Symbol
- -
- Assembly
- GCA_030762935.1
- Location
- CM060885.1:26324496-26329602[+]
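The Location field packs the sequence accession, coordinates, and strand into one string. A minimal sketch of splitting it apart follows; the function name `parse_location` and the assumption that the coordinates are 1-based and inclusive are illustrative and are not stated by the record.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "CM060885.1:26324496-26329602[+]".
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a 'seqid:start-end[strand]' string into its parts (hypothetical helper)."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        seqid=match["seqid"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("CM060885.1:26324496-26329602[+]")
# Span length on the assembly, ASSUMING 1-based inclusive coordinates.
span = loc.end - loc.start + 1
print(loc, span)
```

If the coordinates turn out to be 0-based or half-open, only the `span` arithmetic needs to change.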
Transcription Factor Domain
- TF Family
- TF_bZIP
- Domain
- bZIP domain
- PFAM
- AnimalTFDB
- TF Group
- Basic Domains group
- Description
- bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
- Hmmscan Out
-
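| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 32 | 0.017 | 83 | 6.0 | 0.0 | 30 | 64 | 10 | 44 | 3 | 52 | 0.64 |
| 2 | 32 | 2.9 | 1.4e+04 | -1.1 | 0.0 | 30 | 64 | 52 | 86 | 46 | 87 | 0.73 |
| 3 | 32 | 0.79 | 3.8e+03 | 0.7 | 0.0 | 30 | 61 | 66 | 97 | 60 | 101 | 0.82 |
| 4 | 32 | 0.00045 | 2.1 | 11.1 | 0.5 | 25 | 63 | 96 | 134 | 94 | 136 | 0.90 |
| 5 | 32 | 0.87 | 4.1e+03 | 0.6 | 0.1 | 24 | 53 | 158 | 187 | 151 | 198 | 0.61 |
| 6 | 32 | 0.039 | 1.9e+02 | 4.8 | 0.1 | 27 | 62 | 213 | 248 | 209 | 251 | 0.79 |
| 7 | 32 | 0.63 | 3e+03 | 1.0 | 0.1 | 30 | 64 | 251 | 285 | 244 | 286 | 0.85 |
| 8 | 32 | 4.1 | 1.9e+04 | -1.6 | 0.0 | 33 | 59 | 303 | 329 | 295 | 333 | 0.63 |
| 9 | 32 | 0.026 | 1.3e+02 | 5.4 | 0.0 | 22 | 56 | 338 | 372 | 337 | 381 | 0.85 |
| 10 | 32 | 1.5 | 7e+03 | -0.2 | 0.0 | 25 | 50 | 432 | 457 | 427 | 463 | 0.71 |
| 11 | 32 | 0.039 | 1.9e+02 | 4.9 | 0.1 | 19 | 61 | 522 | 565 | 514 | 569 | 0.61 |
| 12 | 32 | 0.12 | 5.6e+02 | 3.3 | 0.1 | 26 | 63 | 565 | 602 | 563 | 608 | 0.79 |
| 13 | 32 | 0.026 | 1.2e+02 | 5.4 | 0.0 | 28 | 63 | 648 | 683 | 637 | 691 | 0.56 |
| 14 | 32 | 0.59 | 2.8e+03 | 1.1 | 0.2 | 26 | 59 | 688 | 725 | 678 | 731 | 0.58 |
| 15 | 32 | 0.19 | 8.9e+02 | 2.7 | 0.2 | 27 | 60 | 749 | 782 | 738 | 787 | 0.82 |
| 16 | 32 | 0.26 | 1.2e+03 | 2.2 | 0.2 | 27 | 61 | 763 | 797 | 759 | 800 | 0.82 |
| 17 | 32 | 0.17 | 7.9e+02 | 2.8 | 0.1 | 30 | 61 | 811 | 842 | 805 | 846 | 0.69 |
| 18 | 32 | 0.038 | 1.8e+02 | 4.9 | 0.1 | 24 | 64 | 826 | 866 | 822 | 867 | 0.91 |
| 19 | 32 | 0.2 | 9.5e+02 | 2.6 | 0.0 | 27 | 59 | 871 | 903 | 868 | 909 | 0.85 |
| 20 | 32 | 0.52 | 2.5e+03 | 1.3 | 0.0 | 27 | 59 | 913 | 945 | 909 | 951 | 0.84 |
| 21 | 32 | 0.41 | 1.9e+03 | 1.6 | 0.0 | 26 | 59 | 933 | 966 | 930 | 972 | 0.86 |
| 22 | 32 | 0.42 | 2e+03 | 1.6 | 0.0 | 26 | 59 | 954 | 987 | 950 | 992 | 0.86 |
| 23 | 32 | 0.51 | 2.4e+03 | 1.3 | 0.0 | 27 | 59 | 997 | 1029 | 992 | 1035 | 0.83 |
| 24 | 32 | 0.4 | 1.9e+03 | 1.6 | 0.0 | 26 | 59 | 1017 | 1050 | 1013 | 1056 | 0.86 |
| 25 | 32 | 0.4 | 1.9e+03 | 1.6 | 0.0 | 26 | 59 | 1038 | 1071 | 1034 | 1077 | 0.86 |
| 26 | 32 | 0.39 | 1.9e+03 | 1.6 | 0.0 | 26 | 59 | 1059 | 1092 | 1054 | 1098 | 0.86 |
| 27 | 32 | 0.52 | 2.5e+03 | 1.3 | 0.0 | 27 | 59 | 1102 | 1134 | 1098 | 1140 | 0.84 |
| 28 | 32 | 0.86 | 4.1e+03 | 0.6 | 0.0 | 26 | 58 | 1122 | 1154 | 1119 | 1158 | 0.83 |
| 29 | 32 | 1.6 | 7.8e+03 | -0.3 | 0.1 | 19 | 52 | 1176 | 1210 | 1174 | 1235 | 0.82 |
| 30 | 32 | 1.9 | 9.1e+03 | -0.6 | 0.1 | 20 | 61 | 1247 | 1289 | 1245 | 1293 | 0.74 |
| 31 | 32 | 0.079 | 3.8e+02 | 3.9 | 0.3 | 26 | 57 | 1289 | 1320 | 1271 | 1325 | 0.62 |
| 32 | 32 | 0.067 | 3.2e+02 | 4.1 | 0.1 | 30 | 58 | 1362 | 1390 | 1343 | 1404 | 0.58 |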
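Each row of the Hmmscan output carries 13 whitespace-separated values in the column order given by its header. The sketch below shows one way such rows might be loaded and ranked by i-Evalue; the `DomainHit` class, the `parse_domain_row` helper, and the cutoff of 10 are illustrative assumptions, not part of the database or of HMMER itself. The three sample rows are copied from the record above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    number: int      # "#"  : domain index within the sequence
    total: int       # "of" : total number of reported domains
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_domain_row(row: str) -> DomainHit:
    """Parse one 13-field domain row (hypothetical helper)."""
    fields = row.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {row!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]),
        float(fields[4]), float(fields[5]),
        *(int(x) for x in fields[6:12]),
        float(fields[12]),
    )


# Sample rows taken from the record (domains 1, 2 and 4).
rows = [
    "1 32 0.017 83 6.0 0.0 30 64 10 44 3 52 0.64",
    "2 32 2.9 1.4e+04 -1.1 0.0 30 64 52 86 46 87 0.73",
    "4 32 0.00045 2.1 11.1 0.5 25 63 96 134 94 136 0.90",
]
hits = [parse_domain_row(r) for r in rows]

# Lowest i-Evalue first; the cutoff is an arbitrary illustrative value.
I_EVALUE_CUTOFF = 10.0
best = min(hits, key=lambda h: h.i_evalue)
passing = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]
print(best.number, best.i_evalue, [h.number for h in passing])
```

Sorting on i-Evalue rather than c-Evalue is a design choice for this sketch; either column could be used depending on how the hits are to be interpreted.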
Sequence Information
- Coding Sequence
- atgactgcacagaccgccACCGCCTACTTGAGTACACAGACTGCCGACTTGACGGCACAGACACCCGACTTGAgtacacagaccgccgacttgaggACACAGACAGCCGACTTGACAGCACAGATCaccgacttgactgcacagaccaccgacttgaTTGCACAGACCACCGACCTGATAGCACAGCCCaccgacttgactgcacagaccaCCGACCTGACAGCAGAGACTGCCAAATTCAGTGCACAGATCAccgacctgacagcacagaccgccgacatgactgcacagaccgccgaccttacagcacagaccgccgacttgacagcacagaccgccgaATTAAAAGCAGAGGCCACCGACTTGACTGCTCAGTCCGCCGAATTGAGTGCACAGATCACCTACTTGACAACACAGACCACCGACTTGAAAGCACAGACCACCGTATTAAGTGCACAGACCACCGAAATGAAGGCACAGACCACCGACCTGATAGCACAGACCACCGACATGACTGCACAAACCGCCGACCTGACAGCAAAGACCGCCGAATTGACAGCACAATCCGCCGACGCCACCGACCTGAAAGCACAGCGCACCGTCCTGATATCACATACCACCGACTTGGCTGCACAGACCAccgacctgacagcacagaccgccgaATTGATGGCACAGACCACCGACCTGACACCACAGGCCACCCACTTGACTGCAGAGACCACCGACTTGAGAGCACAGACCACCGATTTCAgtgcacagaccaccgacttgaCTGCTCAGACCACCGACTTGAGAGCACGGACCACCGACCTGAGTCCACAGACCACTTACTTGACAGCACAGAACACCGACTTGAGTACACAGACCACCAACTTGAGTACACAGACTGCCGACTTTAcagcacagaccaccgacttCACAGCACAGATCGCCGACTTGACAGCACAGACAGCCGACTTGAATCGCCGACTTGAGTACACATACCGTCGACTTGACATCACAGACAGCCGACTTAGAGCACAGACCGCCCACTTGACAACAAAGACATTCGACTTGACAGCACAGAACGCCGACTTGACAGCACAGACAGCCGACTTGAGTACACAGACCACATGCAtgacagcacagaccgccgacttgacagCGCATACCGACGACTTGAGTACACAGACCGATGACTTGAgtacacagaccgccgactttAGTATAGAGACTGCCGACTTGAGTACACAGACCGACGAATTGAGTACACAGACAACCAACTTTAgtacacagaccgccgacttgacagCACAGACAGCCAACATGAgtacacagaccgccgacttgagtTCACAGACCGACGACTTGAGTACACAGAACGCCTACTTAGTACACACCCATCGACATGAAtacacagaccgccgacttgagtacacagaccgccgacttgacaTCACAGACAGCCGAATTGACAGCACAGACAGCCGACTTGACAGCACAGAACGCCGACTTGAGGACACAGACAGCCAACTTAATTACACAGAATGCCGACTTGAgtacacagaccgccgacttgacagcacagaccgccgacttgagtACACAACCGCCTTCTTGAGTACACAGACAGCCGACTTGAATACAGAGACCGCCGACTTGACAGCACAGACAGCCGACTTGAATACACAGATTGCTGACTTGACAGCACAGACAGCGGACTTGAGTACACAGACAGCCGACTTGACAGCACAGACAGCCGATTTGAgtacacagaccgccgacttgagtacacagaccgccgacttgagtACACAGACCACCGACTTGAACCGCCCACTTAAGTACACAGACCTTCGACTTGAATACGCAGACAACCAAAGTGACAGCAAGGACCACCGATATGACTGCACAGACCGCCCTcctgacagcacagaccgccgacgTACGAGCACATACCACAACCTCActgcacagaccaccgacttgactgcacagaccgccgacttgagtgCA
CAGCCCACCGACCTGACAGCACAGATTACCGATTTGActgcacagaccaccgacttgaCGGCACAGACCACAGACATGACTGCACAGACCACTGACTTAAcagcacagaccaccgacttgaaccgccgacttgaggCACACACCAccgacctgacagcacagaccaCAGACTTGAATGCACAGACCACTgacctgacagcacagaccaccgacccaacagcacagaccaccgacttgaGAGCACTGACCGCCGACTTGACAGCACAGACCACCGATTTAAgtgcacagaccaccgacttgagaacacagaccaccgacttgactgcacagactGCCAACTTGAGTGCACGGACCGCCGAATTGAGTGCACAGACCATCGACTTGAGTGCACAGACCACCAACCCGATAGAACAGACCACCGACACCACCGACCTGACAGAAAAGACCGCCGACTTGACAGCAGAGACCACCGACTTGAGAGCACAGACCACCGATTTGAGTGCACAGACCATCGACTTGACTGCTCAGACCACCGACTTGAGAGCAAGGACCACCGACCTGAGTGCACAGACCACCTACTTGACAGCACAGAACACCGACTTGAGTACACAGACCACCAAATTGAGTACACAGACAGCCGACTTTACAGCACAGACCGCCGATTTGACAGCACAGATCGCCGACTTGACAGCTCAGACAGCCGACTTGAGTACACATACTCCCGACTTGAgtacacagaccgccgacttgacagCTCAGACAGCCGACTTGAGTACACATACTCCCGACTTGAGTACACAGATCGCCGACTTGACAGCTCAGACAGCCGACTTGAGTACACATACTCCCGACTTGAGTACACAGATCGCCGACTTGACAGCTCAGACAGCCGACTTGAGTACACATACTCCCGACTTGAGTACACAGATCGCCGACTTGACAGCTCAGACAGCCGACTTGAGTACACATACTCCCGACTTGAgtacacagaccgccgacttgacagCTCAGACAGCCGACTTGAGTACACATACTCCCGACTTGAGTACACAGATCGCCGACTTGACAGCTCAGACAGCCGACTTGAGTACACATACGCCCGACTTGAGTACACAGATCGCCGACTTGACAGCTCAGACAGCCGACTTGAGTACACATACTCCCGACTTGAGTACACAGATCGCCGACTTGACAGCTCAGACAGCCGACTTGAGTACACATACTCCCGACTTGAGTACACAGATCGCCGACTTGACAGCTCAGACAGCCGACTTGAGTACACATACTCCCGACTTGAgtacacagaccgccgacttgacagCTCAGACAGCCGACTTGAGTACACATACTCCCGACTTGAGTACACAGATCGCCGACTTGACAGCTCAGACAGCCGACTTGAGTACACATACTCCCGACTTGAGTACACAGATCGCCGACTTGACAGCTCAGACAGCCATCTTGAGTACACATACTCCCGACTTGAGTACACAGACCGCCTACTTAGTACACACGCATCGACATGAATACACTGACCGCCGACTTGAgtacacagaccgccgacttgacaTCACAGTCAGCCGACTTGACAGCACAGACAGCCGACTTGACAGAACAGAACGCCGACTTGACAGCATAGACAGCCGACTTGACAGCACAGAACGCCGACTTGACAACACAAACAGCCGACTTGAGTACACAGTCCACATACTTGatagcacagaccgccgacttgacagCACATACCGACGACTTGAgtacacagaccgccgacttgacagCACAGTGAGCCGAATTGAAAACACAGAGAGCCGACTTGAGTACACAGACGCCGACTTGAGTACACAGACCGACGAATTGAGTACACAGACAGCCAACTTGAGTACAAAGACCGCCGACTTGACAGCACAGACAGCCAACTTGAgtacacagaccgccgacttgagttcgcagaccgccgacttgactaCACAGACTCCCGACTTTAcacagaccgccga
cttgagtacacagaccgccgacttgagtACACAGACAGCCGACATGAATACAGAGACCGCCGACTTGACAGCACAGACAGTCGACTTGAGTACACAGATTGCTGacttgacagcacagaccgcGGACTTGAGTACACAAACAACCGACTTGACAGCACAGACTGCCGATTTGAgtacacagaccgccgacttgagtACACAGACCATCGACTTGAACCGCCCACTTAAGTACACAGACCTCCGACTTGAGTACGTAGACCACCGACGTGACAGCACAGACCACCGACATGACTGCACAGACCGCCCacctgacagcacagaccgccgacgTATGAGCACAGACCACAACCTCAATGCACATACCACCGACTTGACTTGCACAGATTaccgacttgactgcacagaccaccgacttgaGAGCACAGACCACAGACATGACTGCACAGACCGCCAAATTGAGGCACACACCAccgacctgacagcacagacTTGA
- Protein Sequence
- MTAQTATAYLSTQTADLTAQTPDLSTQTADLRTQTADLTAQITDLTAQTTDLIAQTTDLIAQPTDLTAQTTDLTAETAKFSAQITDLTAQTADMTAQTADLTAQTADLTAQTAELKAEATDLTAQSAELSAQITYLTTQTTDLKAQTTVLSAQTTEMKAQTTDLIAQTTDMTAQTADLTAKTAELTAQSADATDLKAQRTVLISHTTDLAAQTTDLTAQTAELMAQTTDLTPQATHLTAETTDLRAQTTDFSAQTTDLTAQTTDLRARTTDLSPQTTYLTAQNTDLSTQTTNLSTQTADFTAQTTDFTAQIADLTAQTADLNRRLEYTYRRLDITDSRLRAQTAHLTTKTFDLTAQNADLTAQTADLSTQTTCMTAQTADLTAHTDDLSTQTDDLSTQTADFSIETADLSTQTDELSTQTTNFSTQTADLTAQTANMSTQTADLSSQTDDLSTQNAYLVHTHRHEYTDRRLEYTDRRLDITDSRIDSTDSRLDSTERRLEDTDSQLNYTECRLEYTDRRLDSTDRRLEYTTAFLSTQTADLNTETADLTAQTADLNTQIADLTAQTADLSTQTADLTAQTADLSTQTADLSTQTADLSTQTTDLNRPLKYTDLRLEYADNQSDSKDHRYDCTDRPPDSTDRRRTSTYHNLTAQTTDLTAQTADLSAQPTDLTAQITDLTAQTTDLTAQTTDMTAQTTDLTAQTTDLNRRLEAHTTDLTAQTTDLNAQTTDLTAQTTDPTAQTTDLRALTADLTAQTTDLSAQTTDLRTQTTDLTAQTANLSARTAELSAQTIDLSAQTTNPIEQTTDTTDLTEKTADLTAETTDLRAQTTDLSAQTIDLTAQTTDLRARTTDLSAQTTYLTAQNTDLSTQTTKLSTQTADFTAQTADLTAQIADLTAQTADLSTHTPDLSTQTADLTAQTADLSTHTPDLSTQIADLTAQTADLSTHTPDLSTQIADLTAQTADLSTHTPDLSTQIADLTAQTADLSTHTPDLSTQTADLTAQTADLSTHTPDLSTQIADLTAQTADLSTHTPDLSTQIADLTAQTADLSTHTPDLSTQIADLTAQTADLSTHTPDLSTQIADLTAQTADLSTHTPDLSTQTADLTAQTADLSTHTPDLSTQIADLTAQTADLSTHTPDLSTQIADLTAQTAILSTHTPDLSTQTAYLVHTHRHEYTDRRLEYTDRRLDITVSRLDSTDSRLDRTERRLDSIDSRLDSTERRLDNTNSRLEYTVHILDSTDRRLDSTYRRLEYTDRRLDSTVSRIENTESRLEYTDADLSTQTDELSTQTANLSTKTADLTAQTANLSTQTADLSSQTADLTTQTPDFTQTADLSTQTADLSTQTADMNTETADLTAQTVDLSTQIADLTAQTADLSTQTTDLTAQTADLSTQTADLSTQTIDLNRPLKYTDLRLEYVDHRRDSTDHRHDCTDRPPDSTDRRRMSTDHNLNAHTTDLTCTDYRLDCTDHRLESTDHRHDCTDRQIEAHTTDLTAQT
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -