Basic Information

Gene Symbol
stc
Assembly
GCA_905475445.1
Location
FR997734.1:4710315-4732058[-]
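
The location string above can be split into its parts programmatically. The following is a minimal sketch, assuming the layout is `<sequence accession>:<start>-<end>[<strand>]` and that coordinates are 1-based and inclusive; neither is stated in this record. The names `Locus`, `parse_locus` and `span_length` are hypothetical.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


# Assumed layout: <seqid>:<start>-<end>[<strand>], strand is "+" or "-".
_LOCUS_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_locus(text: str) -> Locus:
    """Split a location string into accession, start, end and strand."""
    m = _LOCUS_RE.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(m["seqid"], int(m["start"]), int(m["end"]), m["strand"])


def span_length(locus: Locus) -> int:
    """Genomic span in bp, assuming 1-based inclusive coordinates."""
    return locus.end - locus.start + 1


if __name__ == "__main__":
    locus = parse_locus("FR997734.1:4710315-4732058[-]")
    print(locus, span_length(locus))
```

Under the inclusive-coordinate assumption, the span covers 21,744 bp on the minus strand; this is the genomic extent including any introns, not the length of the coding sequence.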

Transcription Factor Domain

TF Family
zf-NF-X1
Domain
zf-NF-X1 domain
PFAM
PF01422
TF Group
Zinc-Coordinating Group
Description
This domain is presumed to be a zinc-binding domain. The zinc finger is described by the pattern C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C, where X can be any amino acid and the numbers in brackets give the number of residues. Two positions can be either His or Cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind DNA [1].
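
The pattern in the description can be approximated with a regular expression. This is only a sketch of one possible reading: the token `C(H/C)` is ambiguous, and here it is assumed to denote a single position that may be Cys or His, as is `(H/C)`. The function name `find_nfx1_like` and the test string are hypothetical, and a regex match is far cruder than the profile HMM (PF01422) used for the actual annotation.

```python
import re

# One reading of C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C.
# ASSUMPTION: "C(H/C)" and "(H/C)" are each a single Cys-or-His position.
NFX1_LIKE = re.compile(r"C.{1,6}H.C.{3}[CH].{3,4}[HC].{1,10}C")


def find_nfx1_like(protein: str):
    """Return (start, end, text) for non-overlapping matches.

    start is 0-based and end is exclusive, as in Python slicing.
    """
    return [(m.start(), m.end(), m.group()) for m in NFX1_LIKE.finditer(protein)]


if __name__ == "__main__":
    # Synthetic peptide built to satisfy the pattern; not from this record.
    synthetic = "AACGGHGCGGGCGGGHGGCAA"
    print(find_nfx1_like(synthetic))
```

Because `finditer` reports non-overlapping matches only, closely spaced or overlapping candidate fingers would need a lookahead-based search instead.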
Hmmscan Out
#   of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to  acc
1   15  2         4.4e+04   -4.4   1.6   15        19      627       631     626       631     0.81
2   15  0.042     9.2e+02   1.3    1.2   2         10      659       667     659       667     0.93
3   15  3.9e-05   0.87      10.9   18.1  4         19      675       690     673       690     0.93
4   15  1.5e-07   0.0034    18.6   10.4  1         18      726       743     726       744     0.98
5   15  1.9       4.2e+04   -4.0   2.4   7         11      748       752     748       753     0.89
6   15  0.0011    24        6.3    16.0  1         19      785       803     785       803     0.98
7   15  8.3e-05   1.9       9.9    11.1  1         19      844       866     844       866     0.87
8   15  2         4.4e+04   -5.4   2.2   6         10      896       900     896       900     0.94
9   15  0.0039    88        4.5    8.0   1         12      906       917     906       917     0.94
10  15  0.045     1e+03     1.2    0.3   4         10      921       927     920       928     0.93
11  15  6.4e-06   0.14      13.4   15.0  3         19      935       951     933       951     0.92
12  15  2         4.4e+04   -5.4   1.5   6         10      980       984     980       985     0.49
13  15  0.00041   9.1       7.7    9.6   10        18      998       1006    995       1007    0.92
14  15  4.4e-08   0.00097   20.4   13.9  1         17      1043      1061    1043      1066    0.86
15  15  0.0012    26        6.2    10.0  1         16      1073      1090    1073      1096    0.73
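
The per-domain rows above are whitespace-separated with 13 fields each, so they can be parsed and filtered by independent E-value. The sketch below is illustrative only: the names `DomainHit`, `parse_hits` and `significant` are hypothetical, and the 0.01 i-Evalue cutoff is an arbitrary assumption, not a threshold stated in this record.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    index: int
    i_evalue: float
    score: float
    ali_from: int
    ali_to: int


# Rows copied from the Hmmscan Out table (header omitted).
ROWS = """
1 15 2 4.4e+04 -4.4 1.6 15 19 627 631 626 631 0.81
2 15 0.042 9.2e+02 1.3 1.2 2 10 659 667 659 667 0.93
3 15 3.9e-05 0.87 10.9 18.1 4 19 675 690 673 690 0.93
4 15 1.5e-07 0.0034 18.6 10.4 1 18 726 743 726 744 0.98
5 15 1.9 4.2e+04 -4.0 2.4 7 11 748 752 748 753 0.89
6 15 0.0011 24 6.3 16.0 1 19 785 803 785 803 0.98
7 15 8.3e-05 1.9 9.9 11.1 1 19 844 866 844 866 0.87
8 15 2 4.4e+04 -5.4 2.2 6 10 896 900 896 900 0.94
9 15 0.0039 88 4.5 8.0 1 12 906 917 906 917 0.94
10 15 0.045 1e+03 1.2 0.3 4 10 921 927 920 928 0.93
11 15 6.4e-06 0.14 13.4 15.0 3 19 935 951 933 951 0.92
12 15 2 4.4e+04 -5.4 1.5 6 10 980 984 980 985 0.49
13 15 0.00041 9.1 7.7 9.6 10 18 998 1006 995 1007 0.92
14 15 4.4e-08 0.00097 20.4 13.9 1 17 1043 1061 1043 1066 0.86
15 15 0.0012 26 6.2 10.0 1 16 1073 1090 1073 1096 0.73
"""


def parse_hits(text: str) -> List[DomainHit]:
    """Parse rows; fields are: # of c-Evalue i-Evalue score bias
    hmm_from hmm_to ali_from ali_to env_from env_to acc."""
    hits = []
    for line in text.strip().splitlines():
        f = line.split()
        if len(f) != 13:
            continue  # skip anything that is not a complete data row
        hits.append(
            DomainHit(int(f[0]), float(f[3]), float(f[4]), int(f[8]), int(f[9]))
        )
    return hits


def significant(hits: List[DomainHit], max_i_evalue: float = 0.01) -> List[DomainHit]:
    """Keep hits below an ASSUMED i-Evalue cutoff (0.01 by default)."""
    return [h for h in hits if h.i_evalue < max_i_evalue]


if __name__ == "__main__":
    for hit in significant(parse_hits(ROWS)):
        print(hit)
```

With this cutoff only hits 4 and 14 (alignment coordinates 726-743 and 1043-1061) pass on their own; the remaining hits are individually weak, which is common for short repeated domains.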

Sequence Information

Coding Sequence
ATGTCTCAGTGGAACAACACATACGCTTACAATAACCAGTACCAAGCTTGGAACGGCGACCCAAATGTCCAATATGTCAACCAAGCCTACTATCCCAACAGACCAGAGCAACCCAACCAGTATGTGAGCTTTAATGAGTTCCTTTCTCAAATGCAAAACAGTGGTGCGCCCAGTGCTAACAACTATAGCAATGTGCAATATGAAAACTATCCTGCAAGGCAGTATAATTATCAGAATGTACCTTCCAGTGCTCAAAACCCCCAGTTGGACAGTTATGGTTATGCAGGAAACTCATCCAATGTGTCTAGTGGCACTGAAGCATACCCAATGAATGCTCAAAGTCCATACAACCCAGTGGCACCCAATCCTAATGCTTATACTAATGCAATGATACTCAAATCTAATCTCACTCCAACAGCCACTGAATTTGTGCCAAAAGGTTCTATGATGACACCTTCGACTAGCACTCAGAACATTCCTGAATCTAGTTCTGTACAAGATGATAGTGAATCTAGAAATGTAAATGAACCTAAGAACCATTACAGTAGTATGAATGATTCACGAAACAACTATCCTAGCATAAATGAACCACAAAATGCTAGTGGTAGCTCATCAGACACCAACTGGAGAGAAAGATCACAGAGCTCACAACAAAATACTGAAACAAGCCAGAAAACTGAATCTTACCAACGTCCACAAGAGCAAACTAGGAACCAAGAAACAAATGGACGCCATCGTGATAAAAATTATCGTAATAACGAGTCCAATGGGCGTCATGACGACTCAAACAATCGCAATCAGGAATCAAGCAACCGTAATCATGAATCAAACAATCGTAATCAAGAGCCAGATAGTAGACAATATGAATCAAGTAGTCGCCAACAAGACACAAGTAGCCGTAACTATGAAACAAACAACCGTAACTATGATTCAAGTAACAAAAGAGGCCAAGGGAAAGGAAATTACAAGTCCAAAAGTAAAGATGATGCTCGTACCTTCTACAACAGTGCAATTAGTAAAGACAGTCAGGATGTGAGGAATGGGAGAGGGGAGGCTTCAGGTCGTGGAAAGAATTGGGTTGGGACTCCACGACTTAGAGCAATGGAACGCAATAGTACTGAAGATGAACAATATGCCAATACTTATTTACAAGGCAGAGAAGAGAGAGATAATAGAGACAGAGAAAGTGAAAACCGGGAGAGGGACAGAGATAGAGAAAGAGAAACCCGTGATAAGCATAGAGATAGCAGAGACACTCATGACAGGGATAGGGATTACCGAGATGTCCGTGAGACAGATAGAGACAGCCGAGATATTCGCGAGAGAGACAGAGGCAACCGAGACATCCGTGAGAAAGATAGAGATAACAGAGATTTCCGTGATAGAGGTAGAGATTATCGAGATATCCGTGAAAGAGAGAGGGACAGTAGAGACATACGTGAAAAGGACAGAGATTACCGAGACACTCGTGAGAGGGACAGGGACAACAGAGATACTCGTGAGCTGGATAGAGACTACCGCGATACCCGTGACATGGATAGAGATAATCGAGATATTCGCGAAAGGGATAGAGGTAACCGAGATATTCGCGAAAGGGATAAAGATACAAGAGATAGGGAGAGAGCAATAAAGTCTGAGAATGTGCCCAGTCCTGCTCGAAACAAGACAAAATATGGCACTGATCAAGTAAACAAAGAAATGACCCAACGTGAGCGTCTGACAGAACAACTAGACAAGGGCACACTGGAGTGTCTTGTCTGCTGTGACCGAGTCAAACAGTTTGACCAGGTGTGGGCATGCTGCAACTGTTACCATGTCCTACATCTGAGGTGCATCCGGAAGTGGGCTATGAGCAGTATGGTTGAGGGCAAATGGCGCTGCCCAGCTTGCCAGAACACGAACGAAGCCATCCCCACTGAGTACCGCTGCATGTGCGGAGCGGTGCGCGCGCCGGAGTACCAGCGCGGCTCCACGGGCGCGCACACGTGCGGCCGCGCGTG
CCGCCGCCCGCGCGCCTGCCCGCACCCCTGCACGCTGCTGTGCCACCCCGGGCCCTGCCCGCCCTGCCAGGCCACTGTTGTCAAACATTGTGGCTGCGGTGCGGAAACCCGCTCAGTGCTCTGCAGCAGCAAATTGCCCCAAATCTGCGGTCGCGTCTGTGGACGAACCCTTCTCTGTGGGGTGCATAACTGTGCCAAGGACTGTCATGAGGGACCCTGTGACATTTGTGCTGAAACTGTCGAACAAGTATGCCACTGCCCCGCTGAAAAGTCCCGCTCGGTGGCGTGCACGCTGGAGACGGGCACGTGCACGAGCTGGTCGTGCGGCGACACGTGCGGGCGCGTGCTGGCGTGCGGCGCGCACGTGTGCCGCGCGCACTGCCACGCGCCGCCCTGCCAGCCGTGCCAGCTGCTGCCGCAATACGTGCACACCTGTCCTTGCGGGAACACGCAGTTGGCGAAAGATTCTCGCAAGGCGTGCACGGACCCTATCCCGTTATGCGGCAACATCTGTGCCAAACCGCTGCAGTGCGGCCCTGCCGGTGACAAACATTTCTGCAAACTTAACTGTCATGAAGGCTTGTGTCCCGAATGTCCCGACAAGACAGTGCTGCAATGCCGCTGCGGGCACTCCAGCCGCGAAGTACCCTGCGTTGATCTTCCTGAAATGTATAACAATGTGCTGTGCCAAAAGAAGTGCAACAAGAAACTATCGTGCGGGCGTCACCGCTGCCGCACGGTGTGCTGTTCCGCGACGTCGCACCGCTGCGCCGTGGTGTGCGGCCGCACGCTGTCGTGCCAGACGCATCGCTGCGAGGAGTTCTGTCACACCGGCCACTGCGCACCCTGCCCGCGCGTCAGTTTCGACGAGCTAACCTGTGAATGTGGTGCGGAAGTAATCCTGCCCCCCGTCCGCTGCGGGGCGCGGCCCCCGGCCTGCAGCGCGCCGTGTCCGCGCTCGCGCTCGTGCCGCCACCCGCCGCACCACTCGTGCCACTCCGGGGACTGTCCGCCGTGTGTCGTGCTCACTACCAAGCGTTGTCATGGTGACCACGAGGAGAGGAAAACTATTCCGTGTTCTCAAGAGGAGTTCTCATGTGGCCTTCCATGCGGGAAGCCGCTGCCTTGCGGCAAGCATACCTGTATCAAGACCTGTCACAAGGGACCCTGTGACACTGGCAAATGCAGCCAGCCGTGCGCGGAGAAGCGCCTGTCGTGCGGGCACCCGTGCGCGGCGCCGTGCCACGTGGGCGCACAGTCCGCGTGCCCCAGCGCCGCGCCGTGCCGCCGCGCCGTGCGCGCCACGTGCCCGTGCGGCCGGCGCCACGCCGAGCGGCCCTGCTGCGATAATGCCAGGGACTACGCCAAGATGATGAGCGCTCTAGCCGCTACAAAGATGTCAGAAGGTGGTTCAGTAGACCTGTCAGACGTGCAACGCCCCGGCAGTATGCTTAAAACTCTCGAATGCGACGATGAATGCCGCGTCGAAGCCCGCACCCGTCAGCTTGCCCTAGCGCTTCAGATACGAAACCCCGACGTGTCGGCCAAGCTCGCGCCGCGCTACAGCGAGCACGTGCGCTCCACGGCCGCACGCGAGCCTGCCTTCGCGCACCAGATACACGACAAACTTACTGAGCTCGTGCAACTCGCTAAAAAGTCAAAACAGAAGACACGCGCTCACTCGTTCCCGTCGATGAACTGGCAGAAGCGCCAGTTCATACATGAGTTGTGCGAGCATTTCGGCTGCGAGAGCGTCGCCTACGACGCTGAGCCCAACAGGAACGTTGTCGCCACTGCTGACAAAGAGAAGGTACTCATTTTGGTTGCGGAACTATATAAAACCTTCTTTCAATTCTAG
Protein Sequence
MSQWNNTYAYNNQYQAWNGDPNVQYVNQAYYPNRPEQPNQYVSFNEFLSQMQNSGAPSANNYSNVQYENYPARQYNYQNVPSSAQNPQLDSYGYAGNSSNVSSGTEAYPMNAQSPYNPVAPNPNAYTNAMILKSNLTPTATEFVPKGSMMTPSTSTQNIPESSSVQDDSESRNVNEPKNHYSSMNDSRNNYPSINEPQNASGSSSDTNWRERSQSSQQNTETSQKTESYQRPQEQTRNQETNGRHRDKNYRNNESNGRHDDSNNRNQESSNRNHESNNRNQEPDSRQYESSSRQQDTSSRNYETNNRNYDSSNKRGQGKGNYKSKSKDDARTFYNSAISKDSQDVRNGRGEASGRGKNWVGTPRLRAMERNSTEDEQYANTYLQGREERDNRDRESENRERDRDRERETRDKHRDSRDTHDRDRDYRDVRETDRDSRDIRERDRGNRDIREKDRDNRDFRDRGRDYRDIRERERDSRDIREKDRDYRDTRERDRDNRDTRELDRDYRDTRDMDRDNRDIRERDRGNRDIRERDKDTRDRERAIKSENVPSPARNKTKYGTDQVNKEMTQRERLTEQLDKGTLECLVCCDRVKQFDQVWACCNCYHVLHLRCIRKWAMSSMVEGKWRCPACQNTNEAIPTEYRCMCGAVRAPEYQRGSTGAHTCGRACRRPRACPHPCTLLCHPGPCPPCQATVVKHCGCGAETRSVLCSSKLPQICGRVCGRTLLCGVHNCAKDCHEGPCDICAETVEQVCHCPAEKSRSVACTLETGTCTSWSCGDTCGRVLACGAHVCRAHCHAPPCQPCQLLPQYVHTCPCGNTQLAKDSRKACTDPIPLCGNICAKPLQCGPAGDKHFCKLNCHEGLCPECPDKTVLQCRCGHSSREVPCVDLPEMYNNVLCQKKCNKKLSCGRHRCRTVCCSATSHRCAVVCGRTLSCQTHRCEEFCHTGHCAPCPRVSFDELTCECGAEVILPPVRCGARPPACSAPCPRSRSCRHPPHHSCHSGDCPPCVVLTTKRCHGDHEERKTIPCSQEEFSCGLPCGKPLPCGKHTCIKTCHKGPCDTGKCSQPCAEKRLSCGHPCAAPCHVGAQSACPSAAPCRRAVRATCPCGRRHAERPCCDNARDYAKMMSALAATKMSEGGSVDLSDVQRPGSMLKTLECDDECRVEARTRQLALALQIRNPDVSAKLAPRYSEHVRSTAAREPAFAHQIHDKLTELVQLAKKSKQKTRAHSFPSMNWQKRQFIHELCEHFGCESVAYDAEPNRNVVATADKEKVLILVAELYKTFFQF*
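
The relationship between the coding sequence and the protein sequence can be checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1) and a CDS that starts in frame; the name `translate` is hypothetical. Whitespace is removed first, so a coding sequence wrapped over several lines is handled.

```python
from itertools import product

# Standard genetic code with codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). ASSUMPTION: translation table 1 applies.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS; stop codons are written as '*'.

    Raises ValueError if the length is not a multiple of three and
    KeyError on codons with non-ACGT characters (for example N).
    """
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


if __name__ == "__main__":
    # First six and last five codons of the Coding Sequence above.
    print(translate("ATGTCTCAGTGGAACAAC"))
    print(translate("TTCTTTCAATTCTAG"))
```

The two fragments translate to `MSQWNN` and `FFQF*`, which match the start and end of the protein sequence listed above.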

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2