Eher095289.1
Basic Information
- Insect
- Euschistus heros
- Gene Symbol
- stc
- Assembly
- GCA_003667255.1
- Location
- RCWM01000502.1:112265-136938[+]
Transcription Factor Domain
- TF Family
- zf-NF-X1
- Domain
- zf-NF-X1 domain
- PFAM
- PF01422
- TF Group
- Zinc-Coordinating Group
- Description
- This domain is presumed to be a zinc binding domain. The following pattern describes the zinc finger. C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C Where X can be any amino acid, and numbers in brackets indicate the number of residues. Two position can be either his or cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 19 0.86 1.1e+05 -3.9 1.6 15 18 266 269 265 270 0.75 2 19 0.72 9.2e+04 -3.7 1.6 15 19 293 297 292 297 0.81 3 19 0.0046 5.9e+02 3.4 0.3 4 10 326 332 325 333 0.97 4 19 9.5e-07 0.12 15.1 16.3 4 19 340 355 338 355 0.94 5 19 0.2 2.6e+04 -1.9 0.6 5 10 380 385 380 385 0.95 6 19 0.0011 1.5e+02 5.3 15.7 3 18 393 408 391 409 0.91 7 19 0.76 9.7e+04 -3.7 1.5 9 13 415 419 414 420 0.71 8 19 0.58 7.4e+04 -3.3 0.5 6 10 438 442 437 442 0.88 9 19 0.061 7.8e+03 -0.2 0.8 1 5 448 452 448 457 0.91 10 19 1.2e-09 0.00015 24.4 11.1 1 19 472 494 472 494 0.88 11 19 0.00069 88 6.0 8.4 1 11 534 544 534 545 0.96 12 19 0.67 8.5e+04 -3.5 1.0 6 10 551 555 549 556 0.54 13 19 2.8e-11 3.6e-06 29.6 12.3 1 18 561 578 561 579 0.97 14 19 0.56 7.1e+04 -3.3 0.9 6 10 608 612 607 613 0.68 15 19 1 1.3e+05 -9.7 16.6 1 19 618 636 618 636 0.75 16 19 0.25 3.2e+04 -2.2 0.2 9 13 651 655 650 656 0.79 17 19 2.9e-07 0.037 16.8 8.9 1 17 669 684 669 684 0.96 18 19 0.36 4.6e+04 -2.7 1.5 5 10 688 693 688 693 0.89 19 19 0.0013 1.6e+02 5.1 16.2 1 18 700 719 700 720 0.82
Sequence Information
- Coding Sequence
- ATGTCCAACTGGGATAGACCCCAAAGTAGTGGTAGCTCCTATCAGGGATATGGTAGTTATTACAATCCTCAGTCGCAGCCAGataatacaaatttgtatggGAACTTATATCAGCAACCTTATTATAATGGTTCCTGTTATAATGGTCCACCACAGGCCTTCAGAAATGGATATCAAGGAAATTATCATACTCCTCCCACATCTGCTGAAGAGTTTTTTAGGATCATTCAAGAGCAACATCAAGAAAGCCCACGTTCAAGAAGTGTTGATTACGACACCCTTGGTCATGGGTGTAATATTCCAACTATGGTTTTTCCATCATCTAATACTAGTTCAgGTGGAGAACAAGTACAACAAGGTAAAAGAAAGATATATGAAAACAACAAAGGTGGAGGAAGAAGTGGAGGACGATACCAGCAAGGCGCTGTTCCAAGGCAGCAAAACAACTACAGGCGTGGTGGATACAATAACGTTAAAAAAAGTGTGCAAAATCAGAACTGGGCTGCTGTTCCAAATGGTGATGTCATCGGTGATGATGCAAAGGTAGAAGAAGTGAAAAATTCATCGAAGCGTGGAAAAGGTTTTGCTCAAAGGAATACAAACAGGTTTAATGGTTACCAaaaagatAGCGGAACATTTAAGCGTGATAACTTCAGTAAAGATTGGCGAGCCAGAGGGAAAGGCAAAGGAAatgaagAATGCCAGCGTGATCGTTTAACATCTCAGCTGTATTCTGGTGCTTTGGAATGTTTAGTCTGTTGTGAAAGGATGAGACAAGCAGATCCTGTTTGGTCTTGTCCAGCTTGCCATCACGTCCTTCATCTTCGTTGTACAGTTCGTTGGGCGTCTTCTTCAAGATCAGACAATGGTTGGAGATGCCctgcttgtcaaaatgttaccgATGCGATACCTACAGATTACAAGTGTATGTGTGGAAAAGTGATTAATCCGCAAtgggtcCGAGGAGAAACACCTCATACTTGTGGACAAGTATGTGGTAAAGATCGTGGTTGCCCACACAGCTGTACCCTCTTGTGCCATCCTGGTCCTTGTCCGCCGTGCTCTGCGAACATTGACAGgTATTGCGGTTGTAGCAGAACGAAACAGGTTGTTCAGTGTTCATCAAGGGTCGAATTAACTTGTGACTCTGTATGCGATAAGTTGCTCAACTGTAAAGTTCATCATTGTACAAAAGGATGCCATAATGGTACCTGTGACCCTTGTAACGAAACCATTCATCAGAGGTGTCATTGCGGACAAGAGGAAAGAGACGTTCCTTGTGATAGTGCTGACATCGAACCTGTCTACAGCTGCGACAAGTTATGCGATAAACTATTGGCCTGCGGACAGCATAAGAAGTCCTGCAAGGATCCGGTACCTCTTTGTGGAGCAGTTTGTGGAAAAGTGTTGATGTGTGGCGCTGCAGGTAGGAAACATACATGTGCTGTATTGTGCCACAGTGGTGACTGCCCCCCATGTCCTGAAACTACATCAGTTATGTGCCGTTGTGGAGCTATGGCTAAAGAAGTTCCTTGTGCAGGCTTGGATTCACGTCCTGATGATGCTCGTTGTTCTAAAAGATGTACAAAAaaaaGGAGTTGTGGAAAACATAAATGCAATCAAGCTTGTTGCATTGAGATAGACCATTTTTGCCCTCTTCCGTGCAACAAAATGTTGACTTGTGGAAGACATCGTTGCCAAGAACTGTGTCACCGAGGCAACTGTCGACCCTGCCTTGCTGCGAGTTTTACTGAATTGAGCTGTGAGTGTGGAAAAGCTGTCCTCTATCCGCCAGTCCCTTGCGGGACTAGACCTCCAGAATGCAAAGCACCTTGTTCTCGTGACCATCCGTGTGGCCATCCCCCTACTCATCACTGCCACTCGCAAAGTGAATGTCCTCCGTGTTCAGTTCTAACTACTAAATATTGTTTCGGAGCACATGAGGTagTTCCTTGCCATATTGGAGAATTTTCTTGTGGACGtgcttgtaataaaaaattagactGTGGCCATTCCTGTATAAAACTCTGCCATTTAGGACCTTGTACCCCAGAGGGAACTAGGTGCACCCAGCCGTGTACTGTACCGAGGTCTAGTTGTGGCCACGAATGTGCTGCCCCGTGTCATCCTGGCCAACCTTGCAATCCCCAGCCTTGTCAGTCAAagGTTGAAGTTCGTTGTGAATGTGGAAgaagaattgttaaaaaaagtTGTTCAGAAAATAGTTCCGAGTATCAACGCTTAGCAACTGCTCAGTTAGCATCTCAAATGGCTGACGTTCGTCTAGGAAAGACAGTTGATCTCAGATCTTCTTCAAAGCCAGTATATAaGTCTTCGCCTCCTCCAAACCACTTCCCCTAA
- Protein Sequence
- MSNWDRPQSSGSSYQGYGSYYNPQSQPDNTNLYGNLYQQPYYNGSCYNGPPQAFRNGYQGNYHTPPTSAEEFFRIIQEQHQESPRSRSVDYDTLGHGCNIPTMVFPSSNTSSGGEQVQQGKRKIYENNKGGGRSGGRYQQGAVPRQQNNYRRGGYNNVKKSVQNQNWAAVPNGDVIGDDAKVEEVKNSSKRGKGFAQRNTNRFNGYQKDSGTFKRDNFSKDWRARGKGKGNEECQRDRLTSQLYSGALECLVCCERMRQADPVWSCPACHHVLHLRCTVRWASSSRSDNGWRCPACQNVTDAIPTDYKCMCGKVINPQWVRGETPHTCGQVCGKDRGCPHSCTLLCHPGPCPPCSANIDRYCGCSRTKQVVQCSSRVELTCDSVCDKLLNCKVHHCTKGCHNGTCDPCNETIHQRCHCGQEERDVPCDSADIEPVYSCDKLCDKLLACGQHKKSCKDPVPLCGAVCGKVLMCGAAGRKHTCAVLCHSGDCPPCPETTSVMCRCGAMAKEVPCAGLDSRPDDARCSKRCTKKRSCGKHKCNQACCIEIDHFCPLPCNKMLTCGRHRCQELCHRGNCRPCLAASFTELSCECGKAVLYPPVPCGTRPPECKAPCSRDHPCGHPPTHHCHSQSECPPCSVLTTKYCFGAHEVVPCHIGEFSCGRACNKKLDCGHSCIKLCHLGPCTPEGTRCTQPCTVPRSSCGHECAAPCHPGQPCNPQPCQSKVEVRCECGRRIVKKSCSENSSEYQRLATAQLASQMADVRLGKTVDLRSSSKPVYKSSPPPNHFP*
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -