Ccet013400.1
Basic Information
- Insect
- Crossocerus cetratus
- Gene Symbol
- stc
- Assembly
- GCA_963675795.1
- Location
- OY776668.1:19844148-19848011[-]
Transcription Factor Domain
- TF Family
- zf-NF-X1
- Domain
- zf-NF-X1 domain
- PFAM
- PF01422
- TF Group
- Zinc-Coordinating Group
- Description
- This domain is presumed to be a zinc binding domain. The following pattern describes the zinc finger. C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C Where X can be any amino acid, and numbers in brackets indicate the number of residues. Two position can be either his or cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 17 2 1.9e+04 -4.2 1.6 15 19 434 438 433 438 0.81 2 17 0.12 1.2e+03 -0.2 0.3 4 10 467 473 466 474 0.94 3 17 2.7e-06 0.026 14.7 10.0 4 18 484 498 482 499 0.94 4 17 2 1.9e+04 -4.2 1.2 6 10 525 529 525 529 0.96 5 17 1.5e-07 0.0014 18.7 13.1 1 18 535 552 535 553 0.97 6 17 0.24 2.3e+03 -1.2 1.7 7 13 557 563 557 564 0.84 7 17 3.1e-10 3e-06 27.2 14.3 1 19 591 609 591 609 0.98 8 17 0.43 4.2e+03 -2.0 1.4 5 10 638 643 638 643 0.96 9 17 7.2e-05 0.7 10.1 12.6 4 18 656 670 649 671 0.85 10 17 2 1.9e+04 -4.5 1.6 5 10 700 705 700 705 0.90 11 17 0.0039 38 4.5 12.6 1 18 711 732 711 733 0.78 12 17 1.6e-09 1.6e-05 24.9 12.1 1 18 738 755 738 756 0.97 13 17 0.33 3.2e+03 -1.6 0.6 5 10 784 789 782 789 0.95 14 17 2 1.9e+04 -7.0 10.0 10 19 803 813 795 813 0.69 15 17 0.4 3.9e+03 -1.9 2.8 9 18 831 843 829 844 0.71 16 17 2.4e-07 0.0023 18.0 11.6 1 16 849 864 849 875 0.85 17 17 2.8e-05 0.28 11.4 13.2 1 19 881 900 881 900 0.96
Sequence Information
- Coding Sequence
- ATGGCAACGTGGGACGGCTCATATTCCGCCCCCGATGATTATTATTACGGCGATCAGAACCAGAACAGAGCCGTCGAGGTTCCGACGCGCTGGCAAGAGTACTGCTTACCTAGCTATCCGAGCTATCTTCCCTCCTCGGTCAGCGAGATGGTACGCAATCCCTACTGCCCCAATAGCAGCGCGCCCACTGCCTACTTTCAAGAGGAGCCACCACCGGATCTGGTCGTCCACGACAGTCTGGAATTCTTCCCGACACCTGCTAGGTCGGCGCCCAACCCTCAGAGAACCAATAAACGTAAGCCCTTCAAGAACAATCGCAACCAGACCTTCTTCCAAGAGCAGAGTGCGCCTCAACCCACTGAGGATGTAGATTTGGCAGGGAACGCTCAGGAGCAAGTGTCTAGGTATCCTGGCCCGAGGAACCGATCGTCGGGAAGGTTTCCTGAGAGAGGCAGACCAGAGAGGTTCAATAACAGAGGCGGGAGTGCTAGTAAGAGCAGAGCACCGGGGTACGATCAGCCTCAAAGAGAGCCGATGCAGAAGAAGTTTTATAAGCAGGATAATAGGAATTATGGAAGATTCCAAGGGCATaggtattataataatagacaaccttccaattatccgcaacAGGAGACTTTCGATGCGAGTGGAAGTCAATTCGAAGGCACCAGTGGATGTTCCTATCCTGATGTGCAAGATGCATCTAGTTATTATGTTCAAAGCGTAGAAGCCACTGAGAATCTGGCTCAGGCAAGGGACGGCGAAAGCAGCCAGAATTTAAATAGATCCGAGAGGCAATTCAGAGATCCCAGCTTTCAGAGTTCCAGTAATAGACAGTTCCAAAATCCTTTGAGAGGTAATCGAAAGTATTCCGCTAACCCGAGACAAGAAGGGtatgaaaataattataaagaCAGGAAACACAACTACCCGAGGTACAATGGGAGCAGGGAGAATTATAGTAGAGAATATAAGTCGGATGTTTACATggaaaaagagaagaagacTCCGGAGAGTGAGGAGAAACTCGGTGGATCTTGGAGGAGTCAAAAGGAAAGCGAAGAAAAGAGTACCGTGCCTAAAAAGAGGCAAACTAGGAAGCAAATAGACGACAATGCCAGCCAAAGGGAAAGTTTGAGCGATCAGTTAAATAGAGGTATATTGGAGTGTTTGGTCTGTTACGAGAATATAAAGCAACATGATTATATTTGGTCATGTTCAAACTGTTACCATACGTTACACTTGAAGTGTGTTAAAAAGTGGGCCAAATCCTCTCAAGCAGATAACGGCTGGAGATGTCCGGCATGTCAGAATGTAACTGCGTCAATTCCCGAAGAATATTATTGCTTCTGTGGAAAGACAAGAGCACCCGAGTGGAATCGTAGAGACGTCGCGCATTCCTGCGGCGAAACCTGCGGGCGCACTCTTTCTAAAAATAATTGTGTTCATAAGTGTACGCTTCTCTGCCATCCGGGCTCCTGTCCCATGTGCATAGCAATGGTAACGAAAAACTGCGGCTGTGGAAAGACATCGCAGACCTTACAGTGTAATACTCACACGGCTCTGCTTTGCGAGTCTCTTTGCGAGAAGGAGTTGAATTGCGGCAGGCATGTCTGCGAGCGAAAGTGCCACGAGGGCGAATGTGGCCTTTGTGAGAAAACTGTGGAGCAAGTCTGCCATTGTGGTAAGAATAAGCGAGAAGTCACTTGCGACAAGACTGTATCATTTACTTATGCCTGCGACGATATTTGCAACAAGAGCTTGGATTGTGGCAATCACAATTGCACCGAGTTGTGCCATCCGGGTGATTGCAAACCTTGTTCTTTAACTCCGGACAAGATCGTGACCTGCTGCTGTGGGCAAACACCCTTAACAGAGGAACGTAAGAGTTGTTTAGATCCCATTCCGACTTGTGAAAAGATATGCTCAAAAAATTTAAGATGCGGCCAACCCAGTAATCCTCACAAATGCAAAGTAAACTGCCACCAAGGCGACTGCCCTGACTGTGATTTGTCGACGGACGTGAAATGCCGTTGCGGCAACATGGACCGAGAGATCGAATGCAAGGATCTGCGATCGAAAGCCGACGACGCTCGCTGCGAGAAGCGATGCATCAAGAAGAGGTCATGCGGCAAGCACAAGTGCAATCAACTCTGTTGCATTGACATCGAGCACGAATGCCCACTGCCATGCTCGAAGACGTTAACCTGCGGCAGGCACAAGTGCGAGCAGAGTTGTCATCGGGGCAGATGTCTACCATGTTACCGCAGTAGCTTCGAGGAGTTATATTGCGAGTGCGGGCACGAAGTAATATACCCTCCGATTCCGTGCGGAAGAAGGAGACCGACCTGCAATAGACCTTGCTCTCGAGAGCATGGCTGTGGACATGAAGTCTTGCATAATTGCCATAGCGAGCCTACATGTCCGCCATGCACCGTTCTCACTCAGAGACGCTGTCACGGACTCCATCAACTTCGGAAAGCTGTACCCTGCCACTTAGGAGACATTTCCTGCGGCCTGCCGTGTGGCAAACCCATTTCTTGCGGACGACACAAGTGCATCACTATATGTCACGCAGGCCCCTGCGAGAGACCGGGCCAGCAATGCACCCAGCCCTGTACGACTCCTAGGGAATTGTGTGGTCATATCTGTGCTGCTCCTTGCCACGAAGGGGTATGCCCGGAGACTCCTTGCAAGGAAATCGTCAAGGTAACATGCCAATGCGGGAATAGATCAATGTCACGACCTTGTGCGGAGAACTCGAAAGAATATCAGAGAATAGCGAGTAATATACTCGCGAGTAAAATGGCCGACATGCAACTGGGACATACGGTCGACTTGGAGGAAGTATTTGGCCAAGGAGCCAAAAAGCAAAACCAATTGAAGACTTTAGAGTGCAACGACGACTGTAAATTATTGGAACGAAACAGGCGATTGGCTCTCGGTCTTCAGATTGTTAACCCTGACATAAGTGGGAAACTGATGCCGAAGTATAGCGATTTCATGAAACAATGGGCCAAAAAGGACCCGATCTTCTGCCAATCGATCCACGATAAGTTGACTGAGCTAGTCAAGTTGGCCAAGACATCGAAACAAAAGTCCCGCAGTTACTCGTTCGATACGATGAACAGGGAGAAGCGGCACTTCGTCCACGAGATTTGTGAACATTTTGGTTGCGAGAGCCAAGCCTACGATCGCGAACCAAAACGAAATATTGTCGCCACTGCCGTTAGGGATAAATGCTGGCTGCCGAGCTACAGTCTGATGGAAATAGTGCAAAGGGAGAACGGCCAGAGGAAAGTTCCAGGACCAATGTTAAACTGCAAATCGGTACAATCACTGCCGTCAAGAAAGAACCAAGACAGACCATCAGGTTCCAAAGTACACGAAGCAGACGAGATAGATTATTTCGATTTCGGGGGCTAA
- Protein Sequence
- MATWDGSYSAPDDYYYGDQNQNRAVEVPTRWQEYCLPSYPSYLPSSVSEMVRNPYCPNSSAPTAYFQEEPPPDLVVHDSLEFFPTPARSAPNPQRTNKRKPFKNNRNQTFFQEQSAPQPTEDVDLAGNAQEQVSRYPGPRNRSSGRFPERGRPERFNNRGGSASKSRAPGYDQPQREPMQKKFYKQDNRNYGRFQGHRYYNNRQPSNYPQQETFDASGSQFEGTSGCSYPDVQDASSYYVQSVEATENLAQARDGESSQNLNRSERQFRDPSFQSSSNRQFQNPLRGNRKYSANPRQEGYENNYKDRKHNYPRYNGSRENYSREYKSDVYMEKEKKTPESEEKLGGSWRSQKESEEKSTVPKKRQTRKQIDDNASQRESLSDQLNRGILECLVCYENIKQHDYIWSCSNCYHTLHLKCVKKWAKSSQADNGWRCPACQNVTASIPEEYYCFCGKTRAPEWNRRDVAHSCGETCGRTLSKNNCVHKCTLLCHPGSCPMCIAMVTKNCGCGKTSQTLQCNTHTALLCESLCEKELNCGRHVCERKCHEGECGLCEKTVEQVCHCGKNKREVTCDKTVSFTYACDDICNKSLDCGNHNCTELCHPGDCKPCSLTPDKIVTCCCGQTPLTEERKSCLDPIPTCEKICSKNLRCGQPSNPHKCKVNCHQGDCPDCDLSTDVKCRCGNMDREIECKDLRSKADDARCEKRCIKKRSCGKHKCNQLCCIDIEHECPLPCSKTLTCGRHKCEQSCHRGRCLPCYRSSFEELYCECGHEVIYPPIPCGRRRPTCNRPCSREHGCGHEVLHNCHSEPTCPPCTVLTQRRCHGLHQLRKAVPCHLGDISCGLPCGKPISCGRHKCITICHAGPCERPGQQCTQPCTTPRELCGHICAAPCHEGVCPETPCKEIVKVTCQCGNRSMSRPCAENSKEYQRIASNILASKMADMQLGHTVDLEEVFGQGAKKQNQLKTLECNDDCKLLERNRRLALGLQIVNPDISGKLMPKYSDFMKQWAKKDPIFCQSIHDKLTELVKLAKTSKQKSRSYSFDTMNREKRHFVHEICEHFGCESQAYDREPKRNIVATAVRDKCWLPSYSLMEIVQRENGQRKVPGPMLNCKSVQSLPSRKNQDRPSGSKVHEADEIDYFDFGG
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00633995;
- 90% Identity
- -
- 80% Identity
- -