Basic Information

Gene Symbol
stc
Assembly
GCA_030142595.1
Location
JARQTH010000626.1:417284-421646[-]

Transcription Factor Domain

TF Family
zf-NF-X1
Domain
zf-NF-X1 domain
PFAM
PF01422
TF Group
Zinc-Coordinating Group
Description
This domain is presumed to be a zinc binding domain. The following pattern describes the zinc finger. C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C Where X can be any amino acid, and numbers in brackets indicate the number of residues. Two position can be either his or cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 15 2 1.8e+04 -4.3 1.6 15 19 525 529 524 529 0.81
2 15 0.17 1.5e+03 -0.7 0.1 4 10 558 564 557 565 0.94
3 15 2.4e-06 0.021 14.8 11.3 4 18 575 589 573 590 0.94
4 15 7.3e-05 0.66 10.1 16.7 1 18 626 643 616 644 0.87
5 15 0.45 4e+03 -2.0 1.3 1 10 653 662 653 662 0.73
6 15 5.1e-10 4.6e-06 26.5 14.0 1 19 682 700 682 700 0.98
7 15 0.35 3.1e+03 -1.7 3.8 5 12 729 736 729 736 0.91
8 15 4.7e-06 0.043 13.9 16.2 4 18 747 761 740 762 0.86
9 15 2 1.8e+04 -4.6 1.6 5 10 791 796 791 796 0.90
10 15 0.0086 77 3.5 11.0 1 11 802 812 802 824 0.88
11 15 6.9e-11 6.2e-07 29.3 14.0 1 19 829 847 829 847 0.98
12 15 1.4 1.3e+04 -3.6 0.5 6 10 876 880 875 881 0.70
13 15 2 1.8e+04 -4.9 7.6 10 19 894 904 883 904 0.81
14 15 1.1e-06 0.01 15.8 14.3 1 16 940 955 940 966 0.85
15 15 7.5e-06 0.067 13.2 13.7 1 19 972 991 972 991 0.97

Sequence Information

Coding Sequence
ATGGCCACTTGGGATGGTTCGTACTATGAACCTGATGATCCCAACTATCATCCATACTCCAATCATGGTGATGCAGGCCCTGGAGATCAAACATGGGGTTACTATgctggaaataattattctcgtgCAAGAGGTAATCTTTATACTCCAAGCGGGTCTAGTGCTTTTGATCAGAATAGAGCTGTACCCTTTCAACAAACTAGTAGAACCAACGTTCCATCGGATCAACAAAATGTAGCACCATTGTCCATCTACCAAGAGCAACCAATACCCTCACTTCTGGAAAATGTTAATCAGTGTTTCTTTGAGAATAACAGAGTACAACCACAGCTTCCATCGTTCAATTACAAAAGTAGTAACTATGGAGGATGGAGTAAGAATGACAATACTGATTCCAAGTACCCAGGACAAGTGAAGAATTCATATCCAAATATAGCAGAGTATTCAAATCTGCATGTTACTGCTGGTGAATTTATACCAAACAGTGCAAAATCAAGTTATTCAGAAGCACCAGTTGCCAGTCTAAATAGTTCAGGGAATCAAACTAACATACAATGCGAAGCTACAGAACTGTATCATGGAAATGCTACTGCCACGATAGAAGCAGGGCCATCTAATGTCTTCTTCAACAAAAAGAATAGCAATTATAGGCAACGGGAACCACAAGATCAGCAACATAATCAGAATGGCTTCCATAGGAATTCTTACAAGAATCAAAATCCCAAGAACTATGGCAAGTTTCAAAATGATAGATATAACAATGGTAGACAATATTCCTCTGGATGGAGggacaatgaaagaaaaaggcagAACGTAGATCAGTCAGTTTCTAATATCACCAATGGAACCAGAAACAGTTGTGGAACAGCCTCAATGTCGaaggatgaagaaaaacaacACACTAATGGAAAGCTAAATAAGAACGGTGCTAAAGGTTTGCCTTATAATGATTCCCTATCTTATCAATATAAACAAGAGAGTTCAAGACAACCTGGCAAGTTACCCAAGTACAATTATAACACTACTAGTAACGAGCCCAGTTCGTCTAGTCATAATTCGGAGCAGCCTCAGTCCTGGataaagaatggaaaaaagtACTTCAGCGATGATCGTTCTGAAAGGTATCGTGAGAGAAAAGAGCGCTACAGTGATAGATACGATAGTGGtcagagagagaagaggaataaTTACCAGGATTATGATGgacacaaaaataattatgatgtGAGTAGCGAGAGGAGTGAAAGAAACAGAGATAAAGAGCGTAACAAGAATAGTGCCCGAGAGAACAAAGATAAAGACAATGAAAGTTGGAGGTCAAAGAATGAAACTAGCAGCAGGAGTGGTACTCCAAAACGTAGTGGCAACAAGAAATACGATACAGACGATGATGCCAGTCAAAGGGAAAGGTTAACAGAACAATTAAATAGAGGACAACTCGAATGCCTAGTATGTTGTGATCGTATAAAGCAAACAGATCATGTTTGGTCGTGTTCCAATTGTTATCATGTATTGCATTTAAAGTGCACTAAGAAATGGGCCAAATCATCGCAGAGTGaaaatgGTTGGCGTTGTCCAGCCTGTCAGAATGTAACATCTGTGATAccagaagaatatttttgcttCTGCGGAAAGGGTAAGACGCCAGAATGGAATCGTCGTGAGGTCGCTCATTCCTGTGGGGACGTCTGTGGACGTATGCGAGTAAATACCAACTGTGTTCACAAATGTACTTTACTTTGTCATCCTGGTCCTTGTCCTTTATGTATCGCTATGGTGACAAAGTACTGTGGTTGCGGAAGGACGTCGCAAACACTCAAGTGTAGCACCGCGACGCTCTTGCTCTGCGAGGCCACTTGcggtaaattattaaattgcgAAAAACACACCTGTGAGAGAAAGTGTCATCACGGCAGCTGTGAAAAATGCGATAAAACCATACACCAAGAATGTTTCTGTGGCAAACATAATCGTGAGGTTACTTGCGACGTCGATGTTCCTTCTACGTATACATGCGGAAACATTTGTGAGAAATTTTTAGACTGTGGCAATCACAAGTGCAAGGCTCTTTGCCATCCTGGACCTTGCGAATCTTGTTCCCTGAAACCGGAAGCCGTCACTCATTGCTGCTGCGGACAGACTCCTTTGACGGAGCAGAGAAAAAGCTGTTTAGATGAAATACCAACGTGCGAGAAGATCTGTTGCAAGCGTCTAAAGTGCGGCCAACCAAGTCATCCTCATACATGCAAGTCAAAGTGCCACGAAGGTGATTGTCCAGAGTGCGAATTAATCACAAAGGTGAAATGTCGCTGTGGCAACATGGACAAGGAAATTCCTTGCAAGGAACTAACGACAAAGGCCGATGATGCTCGTTGCGAGAAAAGGTGCACCAAGAAAAGATCCTGTGGTCGACATAAGTGTAATCAAATGTGCTGCATCGATATTGAGCATATTTGCCCATTACCTTGCTCCAAAACATTAAGCTGCGGAAGGCATAAATGTGAGCAGACTTGTCATAAAGGAAGGTGCCAGCCCTGCTGGAGAAGCAGTTTTGATGAACTGTTCTGCGAATGTGGCGCTGCTGTTTTATATCCTCCAGTTCCTTGCGGCACGAGACGTCCCGCCTGTGACAGACCTTGTTCAAGACAGCACGCGTGCTCGCACGAGGTGTTGCACAATTGTCACAGCGAAGCTACGTGTCCTCCGTGCACTGTGCTCACTCAAAATTGGTGTTACGGTAAGCACGAATTGCGCAAAGCCGTGCCGTGTCATGTGAATGAAATATCGTGCGGTTTACCATGCAACAAACCAATATCATGCGGACGACATAAGTGCATTACACTTTGTCACGCTGGACCTTGTGAAAAACCAGGACAAGTGTGTACGCAGCCATGTACCACACCTAGGGATTTATGTGGACACATTTGTGCTTCTCCGTGTCACGATGGAAAATGTCCTGATACTCCATGTAAAGAAATGGTCAAGgTTACCTGTCAGTGCGGTCATAGAAGTATGACTAGAGCTTGTGTTGAGAATTCGCGCGAATTCCAAAGAATAGCCAGTGGTATACTTGCCAGTAAAATGGCAGACATGCAACTTGGTCATTCGGTGGACTTGGAAGAAGTCTTTGGCCAGGGTGCGAAGAAGCAGAATCAGTTAAAAACTTTAGAGTGTAATGATGAGTGTAAGGTTATTGAAAGAAACAGGAAACTGGCTTTGAGCTTGCAAATTGTCAATCCTGATCTGAGTGGCAAGCTTATGCCGCGGTATAGTGACCTCATGAAACATTGGGCCAAGAAGGATCCTTTCTTTTGTCAAATGGTCCATGATAAATTGACGGAACTAGTTCAGCTGGCTAAAACGTCTAAGCAGAAGTCAAGGAGTTATTCCTTTGAATGTATGAATCGAGATAAGCGGCACTTTGTTCACGAATACTGCGAACAGTTTGGCTGCGAAAGTCAAGCTTACGATCAGGAACCGAAGAGGAATGTTGTTGCTACTGCTGTGAAGgataaaTGTTGGATGCCGAGTCTAAGTTTATTAGAATTAGTACAACGGGAAAGTGGTCAAAGGAAGGTACCAGGCCCTATGCTCAATACTCCAAAGGCTAACTGCTCTCTAAGAAATGTTGAAGTTCTCCCCTTGCCCGCTAAGAAAGGCCACAAACTCGTGTCGATGCCGTCGACTTCAAAGTCGAAGATTAGTCAAAGATGA
Protein Sequence
MATWDGSYYEPDDPNYHPYSNHGDAGPGDQTWGYYAGNNYSRARGNLYTPSGSSAFDQNRAVPFQQTSRTNVPSDQQNVAPLSIYQEQPIPSLLENVNQCFFENNRVQPQLPSFNYKSSNYGGWSKNDNTDSKYPGQVKNSYPNIAEYSNLHVTAGEFIPNSAKSSYSEAPVASLNSSGNQTNIQCEATELYHGNATATIEAGPSNVFFNKKNSNYRQREPQDQQHNQNGFHRNSYKNQNPKNYGKFQNDRYNNGRQYSSGWRDNERKRQNVDQSVSNITNGTRNSCGTASMSKDEEKQHTNGKLNKNGAKGLPYNDSLSYQYKQESSRQPGKLPKYNYNTTSNEPSSSSHNSEQPQSWIKNGKKYFSDDRSERYRERKERYSDRYDSGQREKRNNYQDYDGHKNNYDVSSERSERNRDKERNKNSARENKDKDNESWRSKNETSSRSGTPKRSGNKKYDTDDDASQRERLTEQLNRGQLECLVCCDRIKQTDHVWSCSNCYHVLHLKCTKKWAKSSQSENGWRCPACQNVTSVIPEEYFCFCGKGKTPEWNRREVAHSCGDVCGRMRVNTNCVHKCTLLCHPGPCPLCIAMVTKYCGCGRTSQTLKCSTATLLLCEATCGKLLNCEKHTCERKCHHGSCEKCDKTIHQECFCGKHNREVTCDVDVPSTYTCGNICEKFLDCGNHKCKALCHPGPCESCSLKPEAVTHCCCGQTPLTEQRKSCLDEIPTCEKICCKRLKCGQPSHPHTCKSKCHEGDCPECELITKVKCRCGNMDKEIPCKELTTKADDARCEKRCTKKRSCGRHKCNQMCCIDIEHICPLPCSKTLSCGRHKCEQTCHKGRCQPCWRSSFDELFCECGAAVLYPPVPCGTRRPACDRPCSRQHACSHEVLHNCHSEATCPPCTVLTQNWCYGKHELRKAVPCHVNEISCGLPCNKPISCGRHKCITLCHAGPCEKPGQVCTQPCTTPRDLCGHICASPCHDGKCPDTPCKEMVKVTCQCGHRSMTRACVENSREFQRIASGILASKMADMQLGHSVDLEEVFGQGAKKQNQLKTLECNDECKVIERNRKLALSLQIVNPDLSGKLMPRYSDLMKHWAKKDPFFCQMVHDKLTELVQLAKTSKQKSRSYSFECMNRDKRHFVHEYCEQFGCESQAYDQEPKRNVVATAVKDKCWMPSLSLLELVQRESGQRKVPGPMLNTPKANCSLRNVEVLPLPAKKGHKLVSMPSTSKSKISQR

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2