Basic Information

Gene Symbol
stc
Assembly
GCA_947579605.1
Location
OX388307.1:35059071-35072479[+]

Transcription Factor Domain

TF Family
zf-NF-X1
Domain
zf-NF-X1 domain
PFAM
PF01422
TF Group
Zinc-Coordinating Group
Description
This domain is presumed to be a zinc binding domain. The following pattern describes the zinc finger. C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C Where X can be any amino acid, and numbers in brackets indicate the number of residues. Two position can be either his or cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 14 2 5e+04 -4.3 1.6 15 19 521 525 520 525 0.81
2 14 0.37 9.3e+03 -1.8 0.2 4 10 556 562 555 562 0.92
3 14 3.3e-07 0.0084 17.6 11.7 1 18 568 585 568 586 0.96
4 14 2.8e-05 0.71 11.4 13.0 3 19 624 640 622 640 0.92
5 14 2.6e-07 0.0067 17.9 23.2 1 18 681 698 681 699 0.97
6 14 2 5e+04 -4.8 1.8 5 10 728 733 728 733 0.91
7 14 0.00048 12 7.4 12.0 3 18 745 760 739 761 0.83
8 14 0.00019 4.9 8.7 7.5 1 11 801 811 801 813 0.95
9 14 0.093 2.4e+03 0.1 1.2 4 10 816 822 814 823 0.90
10 14 3.9e-09 9.8e-05 23.7 14.0 1 19 828 846 828 846 0.98
11 14 0.92 2.3e+04 -3.0 1.9 5 10 874 879 874 879 0.93
12 14 2 5e+04 -9.7 11.4 14 18 899 903 890 904 0.65
13 14 2.3e-07 0.0058 18.1 13.8 1 18 940 958 940 962 0.94
14 14 2 5e+04 -6.7 17.0 1 19 969 990 969 990 0.94

Sequence Information

Coding Sequence
ATGGCCAACTGGAACAGTAATGGAAACACTTACAATAACAACTGGTCACAGGCCCCTCCCTTCTACCCGTCAGCTAGCTCACAACAACAATATGTCGGTTTCAATGATTTTCTGAATCAATATCAAATGTCCTCTAATACTTACCAACCACCCATGCAAAACACATTTCAACATAACTCCATGAATCAATACAGAGGTGGAAATAATTACGGCTATAATGTTAATGGAACTGTCATGTATCCCCAAAACGTCAACACGACACAACACCAACAAAATTACCAGGGTATCCCCACATCATACACACAGGTGCCTTTGTCTCGAGACAATAACGCTTCAAACAGTGATTTTAATAATATAAGAAGTGAGCAGAATAAAGAAAATTCATATAGTATGTCGTTTGAAAACAGTAATAGACGTACTGAAGATAAACAAAAACCAAAAACTTCCAAGTTAACCCCGACAGCAAAAGAATTTGTACCTCAATGTGTCAAAAATCAAACAACTGCAGTAGAAGAAGTTAAAAATACTGAGACGGCTCCCTCTGCGGAACCGTCCACGAGCAGTAGTGTTGCCACAGTAAATAATCCTACAACTAACAATAGTAATAATTCATATGTTAATAGAAGAAATGTAGAAAGTAGTAATAAGTCACATAATAATTCTAATAGAAGAAATTATATACATAATCGAAACAAATATGAAAATACTAATTGGAGAAGCAGAGATGAAGATGTAGAAGAAACTGAACCGAGTACCAGCACCACCGATCAGAGCATAACTGTCTATAATAGTAACAGAAACTATGCTGATGATAATGTCTCCAATCAAAGACACAACAGGAATAGTGTAAAAGATAAACCAAAAGACAGAGCTCATACAAACAATAGCAGCGCTAACAATGAAGGTAGAAATAGAGATTATAATACATCTCGTGGAGAGAGATCTTCCAATTATAGCAACATCAGTCAAAATAATAAATATAACAAAGACGGGTATAGGAATAGAGATAAACGTTACGATAATAATACAGAGACTAACAAAGAAAGTGAAAGTAAGTCTAGGGAGCGAGGAAGTTATTACAACAACGATAATAATGATAGTAAATACAACAGCCATTCTTATCATCAACGTAATGAATCTAGTGCGGAAAATAGGAATAGAAATAATCATAGTCACAACGCTGACAGCGGAAATAACAGCAGATCTAAAGGATCTTATTCTGCAGACTCATTCAGATCACAGAAGAATGTGGAGTCTCATAGAGACGGGAATAGTAAAGAAAGGAATGAAGGATCAAGAAACTATAACAAGAGAAATAAGGAAGCGGACGATGATAGAAAGTATTTGGAGAGAAGTCCTCCTCTTAAAGAGAATGCGAACATTGGTCAAAGAGATAGGCTAACAGAGCAACTAAACCGTGGCAGTCTGGAGTGTCTGGTGTGCTGTGAGAGAGTGAGGCAGCAAGATGCTGTATGGTCCTGTGGTAGTTGTTTCCATGTGCTCCATCTCACTTGCATCAAGAAGTGGGCCAACTCTAGTTTACAAGATGGCTCATGGCGTTGTCCCGCTTGTCAAAATCAACTGAACGAAGTCCCCTCAACGTACCGCTGCTACTGTGGGGCACAGCGCAACCCAGAATGGCACCGAGGAGGCGTAGACACGGCCCATTCGTGTGGAGACATTTGCAATAAAACACGATCGTGTGCCATCCATCCATGCACACTGCTTTGTCATCCTGGACCGTGTCCAGTGTGTGAGTCCATTGTTCAAAAAAAATGTGAATGCGGAGCCACCCAACAGCTGTTGAAATGTTCTCACAAGGCGCCTTTGCTCTGCAACGGGGTCTGCAGTAAACCTCTGGAGTGTAAGGTCCACAGCTGTGAAGAGAAATGTCACGAGAATGCTTGTCCTCCATGTAAAGAGATCATACAGCAGAAATGTTACTGCTTGAAGAAAAAGGAGAGAACTGTAAATTGTGAGTCGTCTACTGTTGGTGTGAATGGGTATTCTTGTGGTGAGGTGTGCGGCCGCACTAGGACCTGCGGGCACCACAAATGTCCTGAGGTCTGTCATCCGGGACCATGTCCACAATGTCATCTCTCTCCTGAGCTGATTACACACTGTCCATGTGGACAGACCAGATTAGAAAAGAAACGGACATCTTGTATGGATCCCATTCCAACATGTAAAAAGTCATGCTCTCTTCCTCTGGGCTGTGGTCCCTCAGATAATCTGCACATGTGTCCCGCACCATGTCATGAGGCACCCTGTCCTCAGTGTGAGCTGCGTACACCTGTCTCATGTCGCTGTGGTCACATCTCTAAGGAAATACCATGTGTACAACTGCAGGAGATGAAGGATACCATACGATGCGGGAGAAAGTGCACTAAGAAGCAAAGTTGTGGTCGCCATAAGTGCAAAGAGACGTGTTGCGTTGATACTGAACACAAGTGTCCACTGCCGTGCACTCGCACTCTGTCCTGTGGGCTACATAGATGTGAGGATACCTGTCACAGAGGACACTGCCAGCCGTGCTGGAGAGTCAGTTTTGAAGAGTTGAGGTGCAGATGTGGTGCTTCAGTCCGATATCCTCCCGTAGAGTGTGGAGCTCGTCCCCCAACATGTACACAGCCATGTACGGTCGCCAGGACCTGTGAGCACCCCCCTCATCACACCTGCCATCCGGCGCATACGCACTGTCCCCCCTGTACAGTTCTCACACAGAGATACTGTCATGGAAGACATAAGCTTCGCAAGACAATCCCTTGTCATCAGGAAGACTTCTCTTGTGGCTTACCATGCGGCAAAGAGATGCCATGTGGCAAACACAAATGTGTGGAGGCCTGCCACAAGGGAGACTGCCCTGCCGTATGCACACAAGTCTGTAACACAGAGAGACACGGCTGTGGACATCCTTGTGGGGCCAAATGCGGTCATCCCGCTCCCTGTCCATCTGGAGCACCTTGTAAAGCTCCTGTTACTGCAAAGTGTCCCTGTGGCCGGCGCTCTGAAGTGAGGCCGTGTTACGAACAAACGGCTGCACAGAGGAAACTACAAGTTCTCCTGGGACCACTGGGTGAGCCTGTGGACATAGGGGAATTGATCAAACAGGGAAGCAATCATGTAACGCTGGAGTGTGATTTAGACTGCCAGACTGAAGACAGGAACAGGCGTCTTGCTATCGGCCTGCAGATCAGGAATCCAGATTTATCATCGAAACTGGCACCCAAGTATTCAGATTATCTGAAACAGATGTGTATGAGAGATGAAAATTTTGCAACTCGCATTCACGATCAACTGAGGCAGTTGGTTGTATTGGCTAAAGATTCTAAGCAACGAACCAGGAGCCACTCATTCCAACCTATGAGCCGACCCAAGAGACAGTTTGTCCACGAGCTGAGCCAACATTTTGGTTGCGAAAGTGCTTCGTACGATAGTGAGCCATACAGAAACATTGTGGCGACGGCTTATAAAGAGAGATCCTGGCTGCCTGCTATGAGTTTGATGGAGGTGGTCCGAAGGGAGAAGGGACAACGCAAAGTGCCTCCTCCTATTCTGAATTCTAGCAGTGGAAGTGGAGGCAGTGCGTGGGCAACTTTAGGATCGTCTTCTACTTCTGCATCTGCCTCTACAAACAGCACAGCAATCGCCCCAGCACCGGAACCGCCTAGGAAACCTCGAATAGACTACTTCGATATGCCACCTGAAGACTGA
Protein Sequence
MANWNSNGNTYNNNWSQAPPFYPSASSQQQYVGFNDFLNQYQMSSNTYQPPMQNTFQHNSMNQYRGGNNYGYNVNGTVMYPQNVNTTQHQQNYQGIPTSYTQVPLSRDNNASNSDFNNIRSEQNKENSYSMSFENSNRRTEDKQKPKTSKLTPTAKEFVPQCVKNQTTAVEEVKNTETAPSAEPSTSSSVATVNNPTTNNSNNSYVNRRNVESSNKSHNNSNRRNYIHNRNKYENTNWRSRDEDVEETEPSTSTTDQSITVYNSNRNYADDNVSNQRHNRNSVKDKPKDRAHTNNSSANNEGRNRDYNTSRGERSSNYSNISQNNKYNKDGYRNRDKRYDNNTETNKESESKSRERGSYYNNDNNDSKYNSHSYHQRNESSAENRNRNNHSHNADSGNNSRSKGSYSADSFRSQKNVESHRDGNSKERNEGSRNYNKRNKEADDDRKYLERSPPLKENANIGQRDRLTEQLNRGSLECLVCCERVRQQDAVWSCGSCFHVLHLTCIKKWANSSLQDGSWRCPACQNQLNEVPSTYRCYCGAQRNPEWHRGGVDTAHSCGDICNKTRSCAIHPCTLLCHPGPCPVCESIVQKKCECGATQQLLKCSHKAPLLCNGVCSKPLECKVHSCEEKCHENACPPCKEIIQQKCYCLKKKERTVNCESSTVGVNGYSCGEVCGRTRTCGHHKCPEVCHPGPCPQCHLSPELITHCPCGQTRLEKKRTSCMDPIPTCKKSCSLPLGCGPSDNLHMCPAPCHEAPCPQCELRTPVSCRCGHISKEIPCVQLQEMKDTIRCGRKCTKKQSCGRHKCKETCCVDTEHKCPLPCTRTLSCGLHRCEDTCHRGHCQPCWRVSFEELRCRCGASVRYPPVECGARPPTCTQPCTVARTCEHPPHHTCHPAHTHCPPCTVLTQRYCHGRHKLRKTIPCHQEDFSCGLPCGKEMPCGKHKCVEACHKGDCPAVCTQVCNTERHGCGHPCGAKCGHPAPCPSGAPCKAPVTAKCPCGRRSEVRPCYEQTAAQRKLQVLLGPLGEPVDIGELIKQGSNHVTLECDLDCQTEDRNRRLAIGLQIRNPDLSSKLAPKYSDYLKQMCMRDENFATRIHDQLRQLVVLAKDSKQRTRSHSFQPMSRPKRQFVHELSQHFGCESASYDSEPYRNIVATAYKERSWLPAMSLMEVVRREKGQRKVPPPILNSSSGSGGSAWATLGSSSTSASASTNSTAIAPAPEPPRKPRIDYFDMPPED

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-