Basic Information

Gene Symbol
stc
Assembly
GCA_963971165.1
Location
OZ020192.1:98218107-98221181[-]

Transcription Factor Domain

TF Family
zf-NF-X1
Domain
zf-NF-X1 domain
PFAM
PF01422
TF Group
Zinc-Coordinating Group
Description
This domain is presumed to be a zinc binding domain. The following pattern describes the zinc finger. C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C Where X can be any amino acid, and numbers in brackets indicate the number of residues. Two position can be either his or cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 12 1.9 9.7e+04 -4.0 1.6 15 19 335 339 334 339 0.81
2 12 2.4e-06 0.12 14.8 14.1 3 19 382 398 381 398 0.93
3 12 1.9e-09 9.8e-05 24.7 13.2 1 19 434 452 434 452 0.98
4 12 8.1e-09 0.00041 22.7 16.0 1 18 492 509 492 510 0.97
5 12 3.2e-06 0.16 14.4 15.5 4 19 557 572 550 572 0.89
6 12 1.3 6.4e+04 -3.5 2.0 5 10 601 606 601 606 0.93
7 12 0.061 3.1e+03 0.7 11.8 1 11 612 622 612 634 0.90
8 12 2.9e-09 0.00015 24.1 10.3 1 18 639 656 639 657 0.98
9 12 1.4 7.2e+04 -3.6 0.8 6 10 686 690 685 690 0.91
10 12 0.0059 3e+02 4.0 16.5 1 18 696 712 696 713 0.87
11 12 4.2e-05 2.1 10.8 15.4 1 18 749 767 749 771 0.96
12 12 0.0098 5e+02 3.3 13.7 1 18 778 796 778 797 0.86

Sequence Information

Coding Sequence
ATGTCAAGTTCTTTAAACAAGGACGGAATTTTATTTCATGATGTTCAACCTGGATCTTCACGCGAAGAGCAAGCAAATAATGACAATAGAATATTTAGTGTTCAATCTAGTTTACAAGCTACTGCGCCAGAATATTTTCCTTTACACACAAATCAGGGTTCAGTAAAAAAGAAATCTTTAAGGCTGAACCAAGAGTTCCGAGGTCCAAATACTTACAAACCTAAATTCGGTAAAAATGGTAGACCATCTGATTTAAGACCAACAGCACAAGAATTTTACCCAGGAAACTCCAGTGCAGATGTTCAAAGATATCAGGAGAAAAAAAGTTATGGGAAAGATAGAAATTCCAGGTATACTAACTGGCGATCTACTAATAATGAATATAATGCAGGTCCAGAATCTAATAGTTATGGAAGGCCAAATACTGCGAGATATTCCATGAATTATGAGGAACATGGTCATAGGGGATTTAATAGTAACACTAATTATCATAATGGAAGGTATAAAAGAAATGAGGAGACATCTGGAAATAGCAGAGCAGCTAGTGGTCCTTCTTCATATCTCCAATTTGAGAGACATTCAACTGAAAATGCTAATAATGATTATAAGGATAAATCTCATTCTGAGCGAAAAAATGGGTATGTAACTAAGTCTTTCAGTAGACTAAAACAAAACGAGAAAAATTACTCTGATTTTGATGACAATGGTTATTCTGATAGATTTCCTGGTAGTAGCAGACATACAGATTATTATAATAACCAAGACAAAAGAAAAAACAGATTTAAAGGGTTTTCCACTaatgctttaaaaaatggttCTACTGCAGGACAGCGAGAAAGGTTGATTGAAATGATTAATAGCAGAGTATTAGAATGTATGGTGTGctgtgaaaaaattaaaaatgttgatAAGGTTTGGTCATGTTTACAGTGTTACCATATCATTCATATAAGTTGCATAAGTGCTTGGGCAGAATCATCAAAAGTTGAAGAAAACTGGCGATGTCCAGCTTGTCAAAATATTTACGCAGAGATTCCAAATCAATATACATGTTATTGTGGTAAAATTACAGATCCAAAACTTATACCAAATATTATTGCTCATGGATGTGACAATATGTgtttaagaaaagggaaaaattgTGAGCACAAGTGCAACATATTATGTCATCCAGGCCCTTGTCCGGAATGCAGTGTTATGGTTTCCAAACCATGTGGATGTGGATTGACCCAACAAGTTGTTAAATGTAGCAGTGATATTAAAATAACTTGCAGTGGAAATTGCAATAAAATGCTAAATTGTGGAATACATACATGTGTAGAAAAATGTCATGCAGGACCTTGTTCCCCATGTAAAGCTTTTATAATACAAGAATGTTACTGTGGCAAAACTGGTAGAAAAGTACAGTGCAATGCAGATAACAGTAGTAAATTAGTATTTATATGTGAAAATgcatgtaataaattattaccatGTGGAAATCATAAATGTCAAAAGCAATGCCACCTTGGTCCATGCTCCCCATGTGAAAGAGACATCAATAtaattaaaagttgtttttgTGGCAAATCTCTATTGGAAACCAAAAGAACTTCTTGTCTTGATCCAATTCCATGTTGCAATAACAAATGTAATAAGACCCTAATTTGTGGACCTCCTAGTACACCCCATGTTTGTAAAGAAAAGTGCCACGAAGGTGCGTGTCCACCCTGTCCTTTAACAACTGTGATTCGCTGCCGGTGCGGACACATGGACAAAGAAATGGCTTGTCAAAAATTAACTACCAAGGCTGATGATGCAAGATGTGAAAAGAAGTGTACCAAGAAGCGCCTTTGTGGCAAACATTCCTGTAAACAACGCTGTTGTATTGAAATTGAACATATTTGCCCATTACCCTGTAATCACTTACTTTCCTGTGGTCTCCATAAGTGTGAACTTACCTGTCATTCTGGTCGATGCCCTGCTTGTATGGAAACCAGTTTTGAAGAACTATATTGTGAATGTGGTGGAAGTGTTTTATACCCACCAATTCCTTGTGGCACAAAACCTCCTGCTTGTAAAAAGCTTTGCTCGAAGACTCGTTCTTGTGGTCATCCACCAAATCATGAATGTCATCCTGGGCCCTGTCCACCATGTTTTGTATTAACTAAACAATGGTGTTATGGTCATCATGAGCAACGAGCCGCAATTCCATGTCATCAAAAAAGTTTTAGTTGTGGTTTACCATGTGGCCAGCCAATGCCTTGTGGCCGACACATGTGCATTAAGCCTTGCCATGAAGATAACTGTCCAACACCATGTATTCAGCCATGTGCTGTGCCTAGAGTGTTATGTGGTCATCCATGTAATAAGCCTTGTCACAATCCTCCTTGTCTTGAGAGTACTTGTAAGCAAATAGTACCAGTTACTTGTCCATGTGGCCTACAGAAAAGTAACAAAATTTGTATGGATTTAACAGAGGAGTTTAGAAACATTGAAATGGCTCAGCTTAAAGACAAACTAGGAAATTTTTCCGTTAATACAAGTATTGATGTCACTAATATGGTAGTTAAAAAGCCAtctgtattaaaaattttggactGCAATGAAGAATGCAGAGTTCTAGAAAGAAATAGAAGACTGGCTATTGGCTTACAAATCCAAAACCCTGATTTAAGTCAAAAATTAACTCCAAGATATTCTGATTTTTTAAAGCAATGGGCCAAGAAGGATTCAAAATTTTGTCAAAGAATTCATGACAAACTGACAGAACTGGTGCAGTTGGCAAAACAAAGTAAACAAAAAAGTCGCTCTTATTCATTTGAGTCTATGAATCGAGATAAGCGCACATTTATTCATGAATATTGTGAATATTTTGGAGTCGAAAGTGCAGCCTATGATGCAGAACCAAACAGAAACGTTGTTGCTACTGCTCTTAGAGATAAATCATGGTTACCTGGTATGAGTCTGCTAGAAGTACTTCAACGAGAAAATGGACAACGAAGAGTACCAGGACCAGTTTTGGGTTATAGTGTCACTGGTGAAGCTGAAAcagtttctttaaaattatcaCAAGTGCCAagaaaatttaagtaa
Protein Sequence
MSSSLNKDGILFHDVQPGSSREEQANNDNRIFSVQSSLQATAPEYFPLHTNQGSVKKKSLRLNQEFRGPNTYKPKFGKNGRPSDLRPTAQEFYPGNSSADVQRYQEKKSYGKDRNSRYTNWRSTNNEYNAGPESNSYGRPNTARYSMNYEEHGHRGFNSNTNYHNGRYKRNEETSGNSRAASGPSSYLQFERHSTENANNDYKDKSHSERKNGYVTKSFSRLKQNEKNYSDFDDNGYSDRFPGSSRHTDYYNNQDKRKNRFKGFSTNALKNGSTAGQRERLIEMINSRVLECMVCCEKIKNVDKVWSCLQCYHIIHISCISAWAESSKVEENWRCPACQNIYAEIPNQYTCYCGKITDPKLIPNIIAHGCDNMCLRKGKNCEHKCNILCHPGPCPECSVMVSKPCGCGLTQQVVKCSSDIKITCSGNCNKMLNCGIHTCVEKCHAGPCSPCKAFIIQECYCGKTGRKVQCNADNSSKLVFICENACNKLLPCGNHKCQKQCHLGPCSPCERDINIIKSCFCGKSLLETKRTSCLDPIPCCNNKCNKTLICGPPSTPHVCKEKCHEGACPPCPLTTVIRCRCGHMDKEMACQKLTTKADDARCEKKCTKKRLCGKHSCKQRCCIEIEHICPLPCNHLLSCGLHKCELTCHSGRCPACMETSFEELYCECGGSVLYPPIPCGTKPPACKKLCSKTRSCGHPPNHECHPGPCPPCFVLTKQWCYGHHEQRAAIPCHQKSFSCGLPCGQPMPCGRHMCIKPCHEDNCPTPCIQPCAVPRVLCGHPCNKPCHNPPCLESTCKQIVPVTCPCGLQKSNKICMDLTEEFRNIEMAQLKDKLGNFSVNTSIDVTNMVVKKPSVLKILDCNEECRVLERNRRLAIGLQIQNPDLSQKLTPRYSDFLKQWAKKDSKFCQRIHDKLTELVQLAKQSKQKSRSYSFESMNRDKRTFIHEYCEYFGVESAAYDAEPNRNVVATALRDKSWLPGMSLLEVLQRENGQRRVPGPVLGYSVTGEAETVSLKLSQVPRKFK

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-