Basic Information

Gene Symbol
stc
Assembly
GCA_947859395.1
Location
OX401999.1:3199905-3222871[-]

Transcription Factor Domain

TF Family
zf-NF-X1
Domain
zf-NF-X1 domain
PFAM
PF01422
TF Group
Zinc-Coordinating Group
Description
This domain is presumed to be a zinc-binding domain. The zinc finger is described by the following pattern: C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C, where X can be any amino acid and the numbers in brackets indicate the number of residues. Two positions can be either His or Cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
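The pattern in the description can be approximated as a regular expression. The sketch below is illustrative only: it assumes that "C(H/C)" means a Cys followed by one residue that is either His or Cys, and that "X3" means exactly three residues. The names NFX1_PATTERN and find_motifs are hypothetical. The Pfam assignment itself comes from the profile HMM (PF01422), not from a regex, so matches here are only a rough cross-check.

```python
import re

# Assumed regex reading of the pattern quoted in the description:
#   C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C
# "." stands for X (any amino acid); "[HC]" is a position that is His or Cys.
NFX1_PATTERN = re.compile(r"C.{1,6}H.C.{3}C[HC].{3,4}[HC].{1,10}C")


def find_motifs(protein):
    """Return (0-based start, matched text) for each non-overlapping match."""
    return [(m.start(), m.group()) for m in NFX1_PATTERN.finditer(protein)]


# "CPHPCTLLCHPGPCPPC" is a short stretch of the protein sequence of this
# entry; the flanking alanines are padding added for the example.
example = "AAACPHPCTLLCHPGPCPPCAAA"
print(find_motifs(example))
```

Passing the full protein sequence of this entry to find_motifs would list candidate fingers, which could then be compared with the ali coordinates reported by hmmscan.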
Hmmscan Out
#   of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to  acc
1   14  2         2.4e+04   -4.4   1.6   15        19      571       575     570       575     0.81
2   14  0.13      1.6e+03   -0.3   0.6   4         10      604       610     603       610     0.93
3   14  4e-05     0.48      10.9   18.1  4         19      618       633     616       633     0.93
4   14  4e-06     0.048     14.1   8.5   1         18      669       686     669       687     0.91
5   14  2.5e-05   0.31      11.5   13.0  1         18      728       745     728       746     0.98
6   14  7.4e-07   0.0089    16.4   9.7   1         19      787       809     787       809     0.88
7   14  2         2.4e+04   -5.4   2.2   6         10      839       843     839       843     0.94
8   14  0.012     1.4e+02   3.0    7.2   1         12      849       860     849       860     0.93
9   14  0.27      3.2e+03   -1.3   0.9   4         10      864       870     863       870     0.89
10  14  7.9e-09   9.6e-05   22.7   15.8  1         19      876       894     876       894     0.99
11  14  1.4       1.7e+04   -3.6   1.4   11        15      923       927     922       930     0.67
12  14  0.0034    42        4.7    11.9  10        18      941       949     935       950     0.93
13  14  1.6e-07   0.0019    18.6   13.8  1         17      986       1004    986       1010    0.83
14  14  0.0022    27        5.3    4.8   1         12      1016      1026    1016      1027    0.96
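A table like this is often filtered by the independent E-value (i-Evalue) to separate well-supported domain hits from weak ones. The following is a minimal sketch of that filtering, not part of the database pipeline. The cutoff of 0.01 is an arbitrary assumption chosen for illustration, and the names ROWS and significant_hits are hypothetical. The rows are copied from the table above.

```python
# Whitespace-separated fields per row, in the order of the table header:
# 0 #, 1 of, 2 c-Evalue, 3 i-Evalue, 4 score, 5 bias, 6 hmm from, 7 hmm to,
# 8 ali from, 9 ali to, 10 env from, 11 env to, 12 acc
ROWS = """\
1 14 2 2.4e+04 -4.4 1.6 15 19 571 575 570 575 0.81
2 14 0.13 1.6e+03 -0.3 0.6 4 10 604 610 603 610 0.93
3 14 4e-05 0.48 10.9 18.1 4 19 618 633 616 633 0.93
4 14 4e-06 0.048 14.1 8.5 1 18 669 686 669 687 0.91
5 14 2.5e-05 0.31 11.5 13.0 1 18 728 745 728 746 0.98
6 14 7.4e-07 0.0089 16.4 9.7 1 19 787 809 787 809 0.88
7 14 2 2.4e+04 -5.4 2.2 6 10 839 843 839 843 0.94
8 14 0.012 1.4e+02 3.0 7.2 1 12 849 860 849 860 0.93
9 14 0.27 3.2e+03 -1.3 0.9 4 10 864 870 863 870 0.89
10 14 7.9e-09 9.6e-05 22.7 15.8 1 19 876 894 876 894 0.99
11 14 1.4 1.7e+04 -3.6 1.4 11 15 923 927 922 930 0.67
12 14 0.0034 42 4.7 11.9 10 18 941 949 935 950 0.93
13 14 1.6e-07 0.0019 18.6 13.8 1 17 986 1004 986 1010 0.83
14 14 0.0022 27 5.3 4.8 1 12 1016 1026 1016 1027 0.96
"""


def significant_hits(text, max_i_evalue=0.01):
    """Return (domain #, ali from, ali to, score) for rows below the cutoff.

    max_i_evalue is an illustrative threshold, not one stated in this entry.
    """
    hits = []
    for line in text.strip().splitlines():
        fields = line.split()
        if float(fields[3]) < max_i_evalue:
            hits.append(
                (int(fields[0]), int(fields[8]), int(fields[9]), float(fields[4]))
            )
    return hits


for number, ali_from, ali_to, score in significant_hits(ROWS):
    print(f"domain {number}: residues {ali_from}-{ali_to}, score {score}")
```

With this assumed cutoff, domains 6, 10 and 13 are retained. A looser or stricter threshold would change the selection.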

Sequence Information

Coding Sequence
ATGTCTCAGTGGAACAACTCATACGCTTACAACAACCAGTATCAACAAGGCTCTAATGGGTGGAACGGCGACCCTGGTAGCCAATACATGAATCAAGCCTATTACAATAGGCCAGATGCCAATGGTCAATATGTGAGTTTTAGTGAATTTCTAAATCAGATGCAAAACAGCAATACTGCTCCTCAGACTAACAATCAATTACCTAACTCCTTCAATGTTCCATATGATAACTATTCCTCTGGTCAATATAACTACATGCCTTCTTCATCCCAGATACAGCACAACAATATGAACTATGGGACTCCAGGCATGTCCAATGAAGCTGCCCCCTACCCACAGAACCAGTTAAGTGCTCCACCACCAGAAACAGCTTACTCTAACAATGCCACATTCAAATCTAACCTTACAGCCACTGCATTAGAATTTGTTCCTAAAGGTTCTCTTCGAAGACCTTCAAGTAGCCAGAGCATTCCAGAGTCTTCCAATAGTTCCAAGGATGTGAGTGATGGTGCTAATGAAGCCCAGTCTAATTATGGTAGCTCATCAGATCGAAACTGGAGGCAGAGGCCCCAAAAATCCAGGGATGCAAAAGACAAACCATTGTCTAATGGCTATCAAGAAAACTCTAAAGGTTATGGGCGCAACCGTGATACTTCACAACGAAACAATGAAAATAATGGCAGACACAATAGAAATGAAGATACCAGTACAGAAGTAAGCACTCATAGCAATGAATCTAATAGCAGAAACCAAGAAACAAATCAACAGAGCTATGAATACAATATGAGTCGAGAACCACGACAAAAAAACTATGATTCCAACAACAGAAATCAAGAACCCCGTTCAAATAGAAATCAAGAACCTAGGAATAGAAATTATAATGATAGAAACCAAGATACAAGAAAAAACAATGACTCCAACAGTAGAAACAGAGATTCTACAAATTATGATTCTAACGACAGGGGCCAAGAATCAAGAGACCTAGATTCAAACCGCTACGACTCTCGTAATGAACGTCGTAACCAATCAAAGGGTAACCCTAAAAATAAAGCTAAAGATGATAGAACTTTCTACAATAGTGGTATAAGTAAAGATGGTCAAGATGTGAGGAGTGGCCGTATAGAAAACTCTGGAAGGACTGATGATCGAAACCGAGACAGCGATGGCTGGCCAGATGGTGCACCTAGGGATAATTATAGACGAGGAGAGGCTTCCAGGGGTGATAACTATGGCTCTAGAGATAATTATGGAAGGGCAGAAGGTTCCGGACGGTTGCAAAGAAACTGGGCTGGTACCCAGCGACCACGAGGAGACCGAAAGGAAGACGACGAACAATATGCTAATAGCTACAGTGAGAAAGAGGATCGCGAAAGGCGTGAAAATGAGAGAACAGAAAGACAATATGATAGGGGAGAAAGAGACAAGGGATATGATAGAGGACTTGATAGACAAGATAGAGATAAGCAGAAGAATATGTACAGTCCTCCTAAATTTAAAGGCAAGCAACAGGTTGACTTTGCTAACAAAGAGATGACTCAACGTGAACGTCTCACAGACCAACTGGATAAAGGAACTCTAGAGTGCCTTGTGTGCTGCGATAGAGTGAGACAGACAGATCCTGTGTGGTCCTGCTCTAACTGTTTTCATGTGCTTCATCTTAAATGTATCAGAAAGTGGGCAATGAGCAGTGTAATTGAGGGCAAATGGCGTTGTCCCGCCTGCCAGAACATTAGCAAGAAAATCCCCACAGAATACCGCTGTATGTGCGGCGCGATGCGTTCGCCTGAATATCAGCGAGGCAGCGCCGCTCATACCTGCGGCAAGGCGTGTAAGCGAGAACGAACCTGCCCGCATCCTTGCACGCTGCTCTGCCATCCAGGACCCTGCCCGCCGTGCCAGGCCACTGTTAGCAAGCACTGCGGTTGCGGCGCAGAGACCCGTTCCATAATGTGTAGCAGTAAACTACCGCAAGTTTGTGGCCGCGCCTGCAACCGCACTCT
CCCTTGCAACGTGCACCAGTGCGCCGCACTCTGCCACGAGGGAGCCTGCGGTCTCTGCGACAAGACAGTCACGCAAGTATGCTACTGCCCAGCGGCCACGGAACGCACGGTCCCGTGCACTCGCGAAACCGGCTGCAAGACGCAGTGGGAGTGCGAGCGTGCGTGCGGACGCATACTGGCGTGCGGCGCGCACGTGTGCCGCGCCCCGTGCCACGCCCCGCCGTGCCAGCCGTGCGCGCTGCTGCCGCAAGCCGTGCTTACGTGCCCGTGCGGGAAGATGCAACTAGACACGAACGCACGCGAAACTTGCTCTGACCCGATCCCTCTCTGTGGGAACATCTGTGCCAAGCCGCTGCCTTGCGGGCCATCAAACGACAAACACTTCTGCAAGCTGGTCTGCCATGAAGGTGCCTGCCCTGTATGCCCGGACACCACTCTGGTACAATGTCGCTGTGGACACTCGAGCCGCGAGGTGCCCTGCGGTGAACTAGCTGAAATGATCAACAATGTGCTGTGTCAGAAGAAATGTAACAAGAAACTGTCGTGCGGCCGCCACCGCTGCCGTACCGCATGCTGCGATGCGACGTCGCACCGCTGCAGCGTATCGTGCGCGCGCTCGTTGCCCTGCGGCCTGCATCGCTGCGAGGAGTTCTGTCACACCGGGCACTGTCCGCCGTGTCCGCGCGTCAGTTTTGATGAGCTCCGCTGCGAGTGCGGCGCATCAGTGCTGATGCCGCCCGTGCCGTGCGGGGCCCGCGCGCCCCCCTGCGAGGGACCCTGCACCCGCCCCCGTGCCTGCAGCCACCCCCCACACCACTCGTGCCACTCCGGGGAGTGTCCCCCCTGTGTGGTGCTCACTACTAAGAAATGCTACGGCGGACACGAGGAGAGGAAGACCATACCGTGTTCGTTAGAAGAGTTCTCGTGCGGTCTGCCGTGCGGAAAGCCGCTGCCTTGTGGAAAGCACAACTGCATCAAGACGTGCCACAAAGGACCTTGCGACGCTGGCAAATGCACCCAGCCGTGCACAGAGAAGCGTCCCACTTGTGGTCACCCATGCAATGCAGCCTGCCACTCATCAGCGGCGGAGGGTAGCGCCGGCGCCGCGTGTCCGTCGAGCGCGCCGTGCCGCGCCGCCGTGCGCGCCGCCTGCCCGTGCGGCCGCCGCGCCGCGCCGCGCTCCTGCCACGACAACGCCAAGGACCTAgcgcgAATAATGAGTGCCCTAGCTGCGACCAAGATGCAAGAAGGCGGCGCCATTGAGATCACTGAGCAGCGTCCCGCAAATATGCTCAAAACTCTGGAATGCGACGACGAGTGCAGAGTCGAAGCCCGCACACGCCAGATGGCGTTAGCTCTACAGATCCGCAACCCGGACGTCTCCGCCAAACTCGCACCTCGCTACAGCGACCACCTCCGCACCACTGCGCAGAGAGAGCCGTCCTTCGCGCAGCAGATCCACGACAAGCTAACCGACCTCGTACAACTCGCTAAGAAGTCAAAGCAGAAGACGCGAGCGCATTCCTTCCCATCAATGAACTGGCAGAAGCGCCAGTTCATACACGAATTGTGCGAACACTTCGGCTGCGAGAGTGTTGCCTACGACGCCGAGCCTAATAGGAACGTGGTGGCTACGGCTGATAAAGAGAAATCATGGCTGCCTGCAATGAGCGTGTTGGAAGTCCTGGGTCGCGAGGCAGGCAAGCGCCGCGTGCCGGGGCCCGTGCTACGCGCGCCGCCGCAGACCGCCGCCGCCGCGCCCTCTGCCGCCAAATCCGCAAGTGGCTGGGCCACACTCACCTCATCAAAATCGACAAACGCCTGGGCCGCCCGCAGTCAAACTCAAGTCAAGCCCGAACCCAAACCGGAACCGAAACCAGAACCCAAGATTGACTACTTCGATAACCCACCGGATAACTGA
Protein Sequence
MSQWNNSYAYNNQYQQGSNGWNGDPGSQYMNQAYYNRPDANGQYVSFSEFLNQMQNSNTAPQTNNQLPNSFNVPYDNYSSGQYNYMPSSSQIQHNNMNYGTPGMSNEAAPYPQNQLSAPPPETAYSNNATFKSNLTATALEFVPKGSLRRPSSSQSIPESSNSSKDVSDGANEAQSNYGSSSDRNWRQRPQKSRDAKDKPLSNGYQENSKGYGRNRDTSQRNNENNGRHNRNEDTSTEVSTHSNESNSRNQETNQQSYEYNMSREPRQKNYDSNNRNQEPRSNRNQEPRNRNYNDRNQDTRKNNDSNSRNRDSTNYDSNDRGQESRDLDSNRYDSRNERRNQSKGNPKNKAKDDRTFYNSGISKDGQDVRSGRIENSGRTDDRNRDSDGWPDGAPRDNYRRGEASRGDNYGSRDNYGRAEGSGRLQRNWAGTQRPRGDRKEDDEQYANSYSEKEDRERRENERTERQYDRGERDKGYDRGLDRQDRDKQKNMYSPPKFKGKQQVDFANKEMTQRERLTDQLDKGTLECLVCCDRVRQTDPVWSCSNCFHVLHLKCIRKWAMSSVIEGKWRCPACQNISKKIPTEYRCMCGAMRSPEYQRGSAAHTCGKACKRERTCPHPCTLLCHPGPCPPCQATVSKHCGCGAETRSIMCSSKLPQVCGRACNRTLPCNVHQCAALCHEGACGLCDKTVTQVCYCPAATERTVPCTRETGCKTQWECERACGRILACGAHVCRAPCHAPPCQPCALLPQAVLTCPCGKMQLDTNARETCSDPIPLCGNICAKPLPCGPSNDKHFCKLVCHEGACPVCPDTTLVQCRCGHSSREVPCGELAEMINNVLCQKKCNKKLSCGRHRCRTACCDATSHRCSVSCARSLPCGLHRCEEFCHTGHCPPCPRVSFDELRCECGASVLMPPVPCGARAPPCEGPCTRPRACSHPPHHSCHSGECPPCVVLTTKKCYGGHEERKTIPCSLEEFSCGLPCGKPLPCGKHNCIKTCHKGPCDAGKCTQPCTEKRPTCGHPCNAACHSSAAEGSAGAACPSSAPCRAAVRAACPCGRRAAPRSCHDNAKDLARIMSALAATKMQEGGAIEITEQRPANMLKTLECDDECRVEARTRQMALALQIRNPDVSAKLAPRYSDHLRTTAQREPSFAQQIHDKLTDLVQLAKKSKQKTRAHSFPSMNWQKRQFIHELCEHFGCESVAYDAEPNRNVVATADKEKSWLPAMSVLEVLGREAGKRRVPGPVLRAPPQTAAAAPSAAKSASGWATLTSSKSTNAWAARSQTQVKPEPKPEPKPEPKIDYFDNPPDN

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2