Rexs013115.1
Basic Information
- Insect
- Rhorus exstirpatorius
- Gene Symbol
- stc_1
- Assembly
- GCA_963564615.1
- Location
- OY751317.1:38302698-38315438[-]
Transcription Factor Domain
- TF Family
- zf-NF-X1
- Domain
- zf-NF-X1 domain
- PFAM
- PF01422
- TF Group
- Zinc-Coordinating Group
- Description
- This domain is presumed to be a zinc binding domain. The following pattern describes the zinc finger. C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C Where X can be any amino acid, and numbers in brackets indicate the number of residues. Two position can be either his or cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 15 2 2.4e+04 -4.4 1.6 15 19 554 558 553 558 0.81 2 15 0.14 1.7e+03 -0.4 0.1 4 10 587 593 586 594 0.94 3 15 2.7e-06 0.033 14.6 11.3 4 18 604 618 602 619 0.94 4 15 1.6e-06 0.019 15.4 15.4 1 18 655 672 655 673 0.91 5 15 0.58 6.9e+03 -2.4 0.6 1 5 682 686 682 691 0.83 6 15 9.5e-09 0.00011 22.5 14.0 1 19 711 729 711 729 0.98 7 15 0.62 7.3e+03 -2.5 0.9 5 10 758 763 758 763 0.95 8 15 0.0098 1.2e+02 3.3 12.3 4 18 776 790 769 791 0.86 9 15 1.6 1.9e+04 -3.8 2.0 5 10 820 825 820 825 0.93 10 15 0.0037 44 4.6 11.3 1 11 831 841 831 853 0.87 11 15 4.2e-09 5e-05 23.6 14.8 1 19 858 876 858 876 0.98 12 15 0.42 4.9e+03 -1.9 0.7 5 10 904 909 903 909 0.94 13 15 2 2.4e+04 -8.4 11.3 10 19 923 933 912 933 0.71 14 15 4e-07 0.0047 17.3 13.3 1 16 969 984 969 995 0.87 15 15 8.7e-06 0.1 13.0 14.2 1 19 1001 1020 1001 1020 0.97
Sequence Information
- Coding Sequence
- ATGGCCACCTGGGACGGATCCGACCAGGCAGGTCCCTCCGAAGATCCAAATTTTTATCTCCTGGCGCACGCTATCAACGCCGCGGCCGATCCCCGAACCTGGAATTATTATTCGACGTCCGGAAACGTCGATTATTCGCATCCTTCCGTGGGTCTCTACGCTCAAAATAATCTTCCCGATGCAGCGACGAATCGGGGATTTCGAAGCGTCCCTTATCGAGAACCCGTCGTTTACTCGCCGCCCGTTCACGAAAACGCAGCGCCGCCGCCGCCTCCCCTCGACGATAGATCTTGCTTTTTCGAGAACAATCGACAGTCCCAGAAACATCCCATGTACAACAATCGTCCCTTAAATTATAATCACAGCGAGATGGGAATAACGAAAACGAGATACGCGAGTCCTAAGCGCGAAACCAGCGCCGCGAAACGCTCCAATCTTCATCCCGGCGCGAGCGAATTCGTTCCGACGTCGGTAACGAAAAATATGCCGAAAAATTCGTCCCAAATGTCGTACAACGGTAGCAACGAACCTCATTACAACGATcatacgaatgaaaattcgccTTCCTCGCTGTCGTCCGCGTCTCATATATTCTTCGGCGATAATAGAAATCGCGCCAACGAACGACGCTACGACAATGCgaaaagtaataattcttatcGAGGCGCGTACAAGCCTCAGAGATCGCGAAATTACGAGCAAAATTATGCCGGCTCGAGGCTCGGCGGCGATTCGACGAACTCGGCGAGCCAGGAAAATTTCACGAACAATGCACAAACGGATGAGAGCTCTTatcgcggcggcggcggcgccAAGAATTCAAGAGCTCGCAATCAAGGCGCTCGTTTTACGAACGATCGATACACCGGCAATCGTCAGCAGCGCGAGAACAACGATAAATCAAGGCCGGATAGAAGAAAAATGCAGATCGATAGTTGGCAAAATTCTGACGAAACGTATCGCGATTCGTCGACGGCGAGCGCCGATGATACGCGCGATAATCATGGAGCGATGAATAATCGGGGAAAATGGACGAGAAACGACAGGAGATATCAGAACGAGAGATATTCAGGCGCGAATCGTTCGTATAACgataataaaaatcaaataaaagcTTCCGGCGATTACGACGAGCAGCAAAGCTTTGATTCTCCCGTATCGGTCAATTCGTCGATGGGCATGAGACGCGAGATTTCGCATAATCGCGACAGCGAGAGAAGCTACAGAAATGAGAATAGGGGACGCGCAAGGCCTCGAGACGGCTCGGCGAATTATGATAACGTTGGCCATGCACATTACAATCACGAAGGAGAGAATCACGAGAgggatattaataataaattcgaaAGATCGCgggatgataataaaaaattgagggATTCGCGCGACGAGGAGGCTCAGAGCTGGAGGCAAAAGGACAAAGTGCCGAACAGAGGTGGAATGACGAAGAAGTACAAGAAAATTGAATTCgATGACGATGCCAGTCAGAGAGAACGATTGACCGAGCTATTGAACCAAGGACACTTGGAGTGTCTCATTTGTTGCGACTACATAAGGCAAAACGATCCGGTTTGGTCCTGCAACAACTGCTATCACGTGCTGCATCTAAAGTGCTTGAAAAAATGGGCGCACTCGTCGCAAAGCGAAAATGGCTGGCGATGTCCGGCCTGCCAGAACGTTAGCCTCGTCGTTCCCGAAATTTATCGCTGCTTCTGCGGCAAAGCCCGAGCTCCAGAGTGGAATCGCCGCGACGTTGCGCACTCGTGCGGCGAAATTTGCGGTCGTCTGAGAGCCAAGAATAATTGCGTTCACAAGTGCACGCTGCTCTGTCATCCGGGCTCGTGTCCCGAATGCATAGCAATGGTAACGAGGCAATGCGGCTGCGAAAAAACATCGCAGAGCCTCAAGTGCAGCACAATGACCCTCGTTGTTTGCGACGCCGCGTGCGATAAAATTCTCAATTGTAAGATTCACAATTGCGAGAAAACGTGTCATCACGGCGATTGCGGTCCTTGCGACAAAACGCTGCATCAagAATGTTACTGCGGCAAGCACGAGCGTGACGTAACTTGCGACGTCGACGTGCCAGGGACTTACAGCTGCGatgaaatttgcgaaaaatttctcGAGTGCGGTAATCATCGTTGCGGTGAAATTTGTCATCCCGGTAGCTGCGATTCCTGTAAACTCCAGCCCGCAATGGTGACGCATTGTTGCTGCGGACAAACGCCATTGGCCGTCGAACGTAAAacttgcctcgatccgattccCACGTGCGACAAagtttgctcgaaaaaattgaaatgcgGCCAACCcaGCGATCCTCATACGTGCAAAATCGACTGTCATCCGGACGATTGTCCGGAATGCGAATTAACGACGAAGGTTCGCTGTCGCTGCGGCAATATGGATCGGGAGATCGCTTGCAAGGAGTTGCGCACCAAGGCCGATGACGCGCGTTGCGAGAAAAAATGCACGAAGAAACGTTCTTGCGGCAAGCACAAATGCAATCAGCTTTGTTGCATCGATATCGAGCATATGTGCCCCATGCCGTGCTCCAAGACGCTAAATTGCGGCAGACACCGATGCCAATTGTCCTGTCACAAAGGCAGATGTCAGCCCTGTCAGGAAATGAGTTTCGACGAGTTGCACTGCGAGTGCGGCACGTCGGTGATTTATCCTCCGGTACCTTGCGGCACGAGACGACCGACCTGCAATCGTCCGTGTACGCGTCAGCACTCATGCGGTCACGAGGTTCTTCATAATTGTCACAGCGATCCGACCTGTCCGCCTTGCACCGTGCTGACCCAGCGCTGGTGTTACGGCAAGCACGAGCTGAGAAAAGCCGTGCCCTGTCACGTCAATGAAATATCGTGCGGTCTGCCATGCAACAAGCCGATCACCTGCGGCAGGCACAAATGTATAACCCTATGTCATTCGGGTCCGTGCGAAAAATCGGGTCAAATTTGCGTACAACCGTGCACCATAGCGCGAGAAATGTGCGGTCACATTTGCGCATCGCCGTGCCACGAGGGAAAATGTCCCGACACCCCTTGCAAAGAAATGGTCAAGGTGACCTGCAACTGCGGCAACAGAACCATGACACGTGCATGCGCCGAAAATTCTCGTGATTTTCAGAGAATCGCGAGCGGTATATTGGCGAGTAAAATGGCCGATATGCAGCTGGGTCATTCGGTCGATCTTGAGGAAGTTTTTGGCCAGGGAGcgaaaaagcaaaatcaattgaagACCCTCGAGTGCAACGACGAGTGCAAGATAATAGCGAGAAATCGTCGAATAGCGCTCGGGCTTCAGATCGTAAATCCGGATGTGAGCGGCAAGCTAATGCCCCGTTATACGGAAACGATGAAACAATGGGCGAAGAAGGATCCTCACTTTTGTCAAATGGTGCACGATCGTCTCACCGAGCTCGTACAATTGGCCAAAGCATCGAAGCAAAAATCGCGCAGCTATTCGTTCGACAGTATGAATCGCGACAAGCGTAATTTCATACACGAGAGCTGCGAGCATTTTGGCTGCGAGAGTCAGGCATACGATCAGGAACCCAAGCGAAATGTCGTGGCTACCGCGGTCAAAGATAAGTGTTGGCTGCCAAGTTACAGCTTGTTAGAAATTGTGCAACGAGAAAACGGTCAGCGCAAAGTGCCACGACCGATGCTCAATTCGTCAAAACCAAAATCGACTCTACGATTATCGGAAGTGCTATCCTCAACTGCCAGGGCGAGTCAGAAGCTTCGGCCTGCGGCTTCGACCTCCGCCAAGTCTCCGGAGCCTGAAATCGattatttcgattttccaaacaGCAAAGCCGGCAAATGGGCCAACATTTAA
- Protein Sequence
- MATWDGSDQAGPSEDPNFYLLAHAINAAADPRTWNYYSTSGNVDYSHPSVGLYAQNNLPDAATNRGFRSVPYREPVVYSPPVHENAAPPPPPLDDRSCFFENNRQSQKHPMYNNRPLNYNHSEMGITKTRYASPKRETSAAKRSNLHPGASEFVPTSVTKNMPKNSSQMSYNGSNEPHYNDHTNENSPSSLSSASHIFFGDNRNRANERRYDNAKSNNSYRGAYKPQRSRNYEQNYAGSRLGGDSTNSASQENFTNNAQTDESSYRGGGGAKNSRARNQGARFTNDRYTGNRQQRENNDKSRPDRRKMQIDSWQNSDETYRDSSTASADDTRDNHGAMNNRGKWTRNDRRYQNERYSGANRSYNDNKNQIKASGDYDEQQSFDSPVSVNSSMGMRREISHNRDSERSYRNENRGRARPRDGSANYDNVGHAHYNHEGENHERDINNKFERSRDDNKKLRDSRDEEAQSWRQKDKVPNRGGMTKKYKKIEFDDDASQRERLTELLNQGHLECLICCDYIRQNDPVWSCNNCYHVLHLKCLKKWAHSSQSENGWRCPACQNVSLVVPEIYRCFCGKARAPEWNRRDVAHSCGEICGRLRAKNNCVHKCTLLCHPGSCPECIAMVTRQCGCEKTSQSLKCSTMTLVVCDAACDKILNCKIHNCEKTCHHGDCGPCDKTLHQECYCGKHERDVTCDVDVPGTYSCDEICEKFLECGNHRCGEICHPGSCDSCKLQPAMVTHCCCGQTPLAVERKTCLDPIPTCDKVCSKKLKCGQPSDPHTCKIDCHPDDCPECELTTKVRCRCGNMDREIACKELRTKADDARCEKKCTKKRSCGKHKCNQLCCIDIEHMCPMPCSKTLNCGRHRCQLSCHKGRCQPCQEMSFDELHCECGTSVIYPPVPCGTRRPTCNRPCTRQHSCGHEVLHNCHSDPTCPPCTVLTQRWCYGKHELRKAVPCHVNEISCGLPCNKPITCGRHKCITLCHSGPCEKSGQICVQPCTIAREMCGHICASPCHEGKCPDTPCKEMVKVTCNCGNRTMTRACAENSRDFQRIASGILASKMADMQLGHSVDLEEVFGQGAKKQNQLKTLECNDECKIIARNRRIALGLQIVNPDVSGKLMPRYTETMKQWAKKDPHFCQMVHDRLTELVQLAKASKQKSRSYSFDSMNRDKRNFIHESCEHFGCESQAYDQEPKRNVVATAVKDKCWLPSYSLLEIVQRENGQRKVPRPMLNSSKPKSTLRLSEVLSSTARASQKLRPAASTSAKSPEPEIDYFDFPNSKAGKWANI
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01129744; iTF_00266865; iTF_00266137;
- 90% Identity
- -
- 80% Identity
- -