Basic Information

Gene Symbol
stc
Assembly
GCA_963989405.1
Location
OZ022488.1:26820842-26825311[-]

Transcription Factor Domain

TF Family
zf-NF-X1
Domain
zf-NF-X1 domain
PFAM
PF01422
TF Group
Zinc-Coordinating Group
Description
This domain is presumed to be a zinc binding domain. The following pattern describes the zinc finger. C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C Where X can be any amino acid, and numbers in brackets indicate the number of residues. Two position can be either his or cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 14 2 2e+04 -4.3 1.6 15 19 506 510 505 510 0.81
2 14 0.066 6.4e+02 0.6 0.2 4 10 539 545 538 547 0.95
3 14 6e-07 0.0059 16.7 11.2 3 19 555 571 554 571 0.94
4 14 3.8e-08 0.00037 20.6 14.8 1 18 607 624 607 625 0.97
5 14 3.8e-08 0.00037 20.6 12.4 1 19 663 681 663 681 0.98
6 14 3.1e-06 0.03 14.5 12.6 4 18 728 742 721 743 0.86
7 14 2 2e+04 -4.6 1.6 5 10 772 777 772 777 0.90
8 14 0.0042 41 4.5 11.6 1 11 783 793 783 805 0.88
9 14 1e-09 9.8e-06 25.6 13.7 1 19 810 828 810 828 0.98
10 14 2 2e+04 -4.6 1.5 6 10 857 861 857 861 0.87
11 14 2 2e+04 -6.9 10.1 10 19 875 885 867 885 0.70
12 14 0.43 4.2e+03 -2.0 2.8 9 18 903 915 901 916 0.71
13 14 4.8e-06 0.047 13.8 14.4 1 16 921 936 921 947 0.84
14 14 3.7e-06 0.036 14.2 12.9 1 19 953 972 953 972 0.97

Sequence Information

Coding Sequence
ATGGCAACCTGGGACGGCTCGTATCCTGTCTCCGAGGATCCGAATTATTACGTCTACGCTGACCAAGAGATCACTGGTGGATCAGGAACGGCCAACTGGCAGTATTACAACGGTAACGAGAACTATGAAATTCTGCAAGGTGGTCACGAGTATTACGTTCAAAACAGTCCTGTCTACGAGCAAAATTATTCCACTACGTACTATCCTGTAAATGCTAATGCTTCGGTATCGATGCTGCCTGTCGCAAGTAACTATCAGGAGCATCATCGTACTAATCCTGCTGAGAATAATAGGCTGTTCGATAATGTTCTGTACTATGGGAGACCTGGCCAATCCACGGACCACAATCCCAAGTCGTATAAGAATAAATACGATCCTAAGGTAAAGCAGACTAGGAATAATGGAAGACGCAATGTTCCAAGACAGAAAAACTCTATGCAGAACGTAGAACAATCGATTGATTATGCTGCCAATGATACTGAACAAAGTAGCGATAGATCTAATAGAAATAGCAGTGACCCCAGCGCCAAAGACGACAGTGGATCACCCGTCGGTACTCAAAGTACCAGTAAAGAGTTTGGGGTAAGGAGCAAAAGTTCAGATAACTTTTATGATAGAGGCAGAAACAATAGAAGATACGACAATAAAAGCAGAGACAAGCAGTACGAGCAGGGCCAGAAGAGTGGAATGCAACGGAACTTTTACCGGCAAGAGAGCGGAAAGAATACTGGAAAATTTCAGGGCAATAGATATTTTAATGGTAAGCGTTATTCAAATCAACCTCAGGAAAATAGTCTCGAAGCCAGCAACAGGAGAATGGAGAAGAATACTGTTGTAAATAACATACACGTTCAAGGGAATAGTGATTCGAATACTGGTGTAAAACAAGAATCGCCCCTATCCAATGAAACTGTGGAAGGAAGTAGTCAAGGGACAGATATAGGGAGAGAAACGAGCAGTGAACAATCTTCGGTTAAAGACGGCACAGAGCAGTACAATAGACcaagaaaatataatagtCATAAGTTCTCCTCTCAGGAAACAAATTTTAGGCAAACCGAAAGTTATTCTAGAAACAATAGAAGATATCCTGCTAGTACAAAGTACGAAAGCTATGAAAACAATTACAAAGATCGAAGATACAATAATTCGGAATACAATGAAGGGAAAGAATATAGTAATAAGGAATGGAAAGCTGAGAGTAAGGATCGAAGTAAGAATACGATTGGTAACAGTGAGAAAGCTGTACACAATTGGAGAAGCAGAACGGAAACTGacgataaattgaatttattgaaaaagaggcaaatgaagaaaataattattgATGACGATTCGAGTCAGAGGGATAGACTGAGCGATCAGTTGAACAGAGGTGTATTAGAGTGTTTAGTATGTTGCGAGAATATAAAACAGAATGACTACGTATGGTCGTGTTCAAATTGTTACCACGTTCTTCATCTGAAATGTACAAAGAAATGGGCGAAATCATCGCAGGGTGACGATGGTTGGCGTTGTCCTGCGTGCCAAAACGCAAACTTATCGATTCCTAAAGAGTACTATTGCTTCTGCGGTAAGGCGAAACAACCAGAATGGAATCGCAGGGACGTGGCACATTCCTGCGGCGAAGTTTGCGGGCGTACTCTTACGAAGAACACTTGTATCCACAAGTGTACTCTCCTCTGTCATCCGGGTTCCTGTCCCGTGTGTACCGCTATGGTAACTAAGCACTGTGGCTGCGGGCGAACTTCGCAAACTCTTCAATGCAGCGCGCACACCATCCTGCAGTGCGATTCGATTTGCGACAGAGAATTGAATTGCGGTGAGCACAAGTGCGAGAGGAAGTGCCACGATGGCGAATGCGGTCGCTGCGAGAAGACTTTGGAACAAGTGTGCTACTGTGGCAAAGATAGGAGAGAAGTCACTTGCGAAAAAGATCTTTCGCTCACGTATGCCTGCGATAATGTTTGCGATAAATTACTGGATTGCGGTAATCATAATTGCTCAAAGCTCTGTCATCCGGAAGACTGTGATCCCTGTTCCTTAACTCCCGAGAAACATGCGACCTGTTGCTGCGGCCGAACGGTACTGACAGAGCCACGGAAAAGCTGTTTAGATCCCATTCCTACCTGTGACAACATTTGTTCGAAAAACTTGAAGTGTGGCCAACCCAGCAATCCTCATAAATGTACAGCGATCTGTCACCAAGGCGAGTGTCCCACTTGCGACCTCACGACGGATGTCAAATGTCGCTGTGGAAACATGGACAGAGAGATCGCTTGCAAGGATTTGAAATCTAAAGCCGACGATGCGCGATGTGAGAAACGATGCGTAAAGAAGAGATCGTGCGGCAAGCACAAGTGCAATCAATTGTGCTGTATCGACATCGAACATATTTGTCCCTTGCCCTGTTCCAAGACTTTGAGTTGCGGCAAACACAAGTGTGAACAGAGTTGTCATAAAGGCAGATGCGAGCCATGTTGGCGGAGCAGTTTCGAGGAGTTGTACTGCGAATGTGGAGCCTCTGTGATATATCCGCCGGTTCCATGCGGAGCTCGGCGACCCGTGTGTAATCAGCCGTGCTCGCGCCAGCACGACTGTGGGCACGAGGTTCTCCACAACTGTCACAGCGAACCCACGTGCCCACCTTGCAGCGTGCTTACTTCGAGATGGTGTCACGGCAAGCACGAGCTTCGCAAAGCGGTTCCATGCCACTTAGGAGATATTTCCTGTGGATTACCTTGTGGCAAGCCACTTTCCTGTGGCCGGCACAAATGTATTACTACCTGCCATTCGGGAAGTTGTGAACGACCCGGTCAACAATGTCAGCAACCTTGTACAACTCCTAGGGAATTGTGCGGTCACATTTGTGCTGCGCCTTGTCACGAAGGAGCATGTCCAGACACGCCCTGCAAAGAAATTGTTAAGGTAACTTGTCAATGTGGCAACCGAAGCACATCACGAGCTTGTGCGGATAACGCGAAGGAGTATCAGAGAATTGCGAGTAACATATTAGCGAGTAAGATGGCGGATGTTCAGCAAGGGCACACTATTGACTTGGAAGAAGTATTTGGACAAGGAGCGAAGAAAATCAATCAGTGGAAAACACTCGAGTGCAACGACGAGTGCAAATTAATCGAACGGAATCGAAGATTAGCGTTAGGTTTGCAAATCGTTAATCCGGATCTAAGTGGGAAATTGGTTCCGAAGTATAGCGAGTTCATGAAGCAATGGGCGAAGAAAGATCCTCCCTTCTGTCAGATGGTTCACGAGAAACTAACGGAGTTGGTAAAATTATCAAAGACCTCGAAGCAGAAATCACGGAGTTATTCGTTCGACACCATGAATAAGTATAAGCGACATTTTGTCCACGAGAGTTGCGAGCATTTTGGTTGCGAAAGCCAAGCGTATGATCGCGAGCCCAATAGGAACATCGTTGCCACTGCTGTTAGAGACAAGTGCTGGTTGCCAAGCTACAGCTTACTGGAGATCGTTCAGCGCGAGAATGGTCAGAGAAAGGTGCCAGGTCCCACGTTGAACAGTTCAGAGGGTAGCACttctgccaaGAAAATAGTATCACTTCCCGTGACGAAAGCTCAACAATCACCCACGTTAACGAACTTGAAACTAACCGAAAAAGAGCcggaaattgattattttgaTTATCAGGGTTAA
Protein Sequence
MATWDGSYPVSEDPNYYVYADQEITGGSGTANWQYYNGNENYEILQGGHEYYVQNSPVYEQNYSTTYYPVNANASVSMLPVASNYQEHHRTNPAENNRLFDNVLYYGRPGQSTDHNPKSYKNKYDPKVKQTRNNGRRNVPRQKNSMQNVEQSIDYAANDTEQSSDRSNRNSSDPSAKDDSGSPVGTQSTSKEFGVRSKSSDNFYDRGRNNRRYDNKSRDKQYEQGQKSGMQRNFYRQESGKNTGKFQGNRYFNGKRYSNQPQENSLEASNRRMEKNTVVNNIHVQGNSDSNTGVKQESPLSNETVEGSSQGTDIGRETSSEQSSVKDGTEQYNRPRKYNSHKFSSQETNFRQTESYSRNNRRYPASTKYESYENNYKDRRYNNSEYNEGKEYSNKEWKAESKDRSKNTIGNSEKAVHNWRSRTETDDKLNLLKKRQMKKIIIDDDSSQRDRLSDQLNRGVLECLVCCENIKQNDYVWSCSNCYHVLHLKCTKKWAKSSQGDDGWRCPACQNANLSIPKEYYCFCGKAKQPEWNRRDVAHSCGEVCGRTLTKNTCIHKCTLLCHPGSCPVCTAMVTKHCGCGRTSQTLQCSAHTILQCDSICDRELNCGEHKCERKCHDGECGRCEKTLEQVCYCGKDRREVTCEKDLSLTYACDNVCDKLLDCGNHNCSKLCHPEDCDPCSLTPEKHATCCCGRTVLTEPRKSCLDPIPTCDNICSKNLKCGQPSNPHKCTAICHQGECPTCDLTTDVKCRCGNMDREIACKDLKSKADDARCEKRCVKKRSCGKHKCNQLCCIDIEHICPLPCSKTLSCGKHKCEQSCHKGRCEPCWRSSFEELYCECGASVIYPPVPCGARRPVCNQPCSRQHDCGHEVLHNCHSEPTCPPCSVLTSRWCHGKHELRKAVPCHLGDISCGLPCGKPLSCGRHKCITTCHSGSCERPGQQCQQPCTTPRELCGHICAAPCHEGACPDTPCKEIVKVTCQCGNRSTSRACADNAKEYQRIASNILASKMADVQQGHTIDLEEVFGQGAKKINQWKTLECNDECKLIERNRRLALGLQIVNPDLSGKLVPKYSEFMKQWAKKDPPFCQMVHEKLTELVKLSKTSKQKSRSYSFDTMNKYKRHFVHESCEHFGCESQAYDREPNRNIVATAVRDKCWLPSYSLLEIVQRENGQRKVPGPTLNSSEGSTSAKKIVSLPVTKAQQSPTLTNLKLTEKEPEIDYFDYQG

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-