Basic Information

Gene Symbol
OXSM_1
Assembly
GCA_013368085.1
Location
JABVZW010002663.1:2817661-2851502[+]

Transcription Factor Domain

TF Family
zf-C2HC
Domain
zf-C2HC domain
PFAM
PF01530
TF Group
Zinc-Coordinating Group
Description
This is a DNA binding zinc finger domain.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 6 2.4e-16 1.2e-12 47.8 8.8 1 28 476 503 476 504 0.97
2 6 3.7e-17 1.9e-13 50.4 5.7 1 29 520 548 520 548 0.98
3 6 0.31 1.6e+03 -0.6 0.0 23 29 1152 1158 1150 1158 0.87
4 6 1.7e-18 8.5e-15 54.7 8.3 1 29 1171 1199 1171 1199 0.98
5 6 9.1e-17 4.6e-13 49.1 5.2 1 29 1216 1244 1216 1244 0.97
6 6 6.9e-18 3.5e-14 52.7 5.4 1 29 1278 1306 1278 1306 0.97

Sequence Information

Coding Sequence
ATGTTTCTTTCCAGATCAAATCGCACCGTAGGATTTCCGCAATTGAAGAAACGAAATCTAGAGACTTCTAGTTCAATTATGACTAGTGTTGCTAAAACTACAAAATCGCATGATATAAGTGACTTAAAAGACGataaacgtaagaaaaaaattttcgacGAATCGGATTCCGGTAAACAAGTTTTGTTGCCGCAAAAAAAGCGCGTCCTACCGACTTCATCTTCTTCTACAAGTAGTccggttaaaaaaaatttaaaaacaggtCAAGCTGTGCCTTCACAGGACGAAAGCGCCGTGAGCGAAACGTTAATTAGAGAAACCGAAGCGGCTCTTAAAAATTTATCGGGCAGTTGGCCTGGACCAAGAGGTTCTTCGTACAGTAAACAGCAAGAAGAATCTCCAGCATTCGagaatttatttgaagaaaaaaagggCAACCCCAAGTTGTCCCCTTCGTCTTCATCAAACAGTAGCAGCGATAATACGTGTTCTTTAAAGGACGTTATTACGTTACGCGACCCTCACGAGGACTTagataaatctttaaaaacaaataaattaaataaagcgataaaaattaaacaggaAACGGAAATCGATAacttaatcaaaattgaaagtGAATGCATTTCTTTACAAACCCAAACAAAAACGAAAGGTAAACAGAATAAGGGAAACGAACCACCCTCGCAATATCAACCCCCCGATTTTAACGAATTGGTTGACGATTCCTCGAATGAATTAGAAATTGATATGTCTGAGGCAGCTGCTGACAAGGATGATGTTAATGATGATAAATTAGATTCGAAgggtaagaaaaaagaaaccgAAGACGTCAAACGCTCCAGATACTCTTCAGGAGAATCCTCTTCTCAAACGCAAACGGTTTATCATCCATTTCCGCGTCCTGTCACGTCTACGGTATCGCCTTTTTCATCCACTTCTGCTTTCCGGCCTCCGCAAACTGATTCTAACAAAACTGCGTCGAAACCGGCGGTTAGTTTAAACGCGTTAGGTCCATATCCCGAAGAAGCCACGTTCGTCGGCTATTCAGCAACGATTGCCACACCTACTTCGCCTATGAGCGAAGACAAATTGAAATCAACTGTTAATTTAGCGCAATTAAAACAAGAAGGTGCTTCTCAAATAGATTCTAATAATTCTTGTTCTAATTCGGATACAAAATCGTCAATCGCTTCTCCGGAGGCGATGAACAAACAGTACACGATACTTCAGCCCGCTACAGTTGGTTCACGAGCTGCTAGCGCGCTCCAAGAAGCTGTTCGAGAAGGTGTTCAAAGTGTATCGGCAGTTTCCAGTAGCACGTCTTCGTCGTCCAACGATTCTCTGAGTAAAATGGGGTCAAGCGGATTTGAAAGAATTACCGGAGCTCTTTCGCCAACCAGCATCGGACGAGagGGTAGTAAATGTCCAACTCCTGGATGTAACGGACAAGGCCACGTGACGGGATTATACTCTCATCACAGAAGCCTGTCTGGCTGTCCGCGTAAAGACAAAGTTACTCCGGAAATCCTGGCGATGCATGAGACTATCTTGAAATGCCCTACTCCGGGATGTAACGGACGGGGACACGTTAGCTCCAACCGAAACACGCACCGAAGTCTTTCCGGTTGCCCTATCGCGGCCGCAAATAAACAAGCAGCTCGAGAACAAAAGTATCAGTCGAGTCTCCAACACCGCATTAAATCTCCACATACGCCTTTTGTAACTTCGTGTCAGAGCGAGTATAGTCCGGGATTTGAAAGTAAACCACCTATGACGACTTCGGTCGACGTTAAACCTAGTTACTTAGTTGCTTACGGTTCCGGACCGACATCTAATGCGGAAGAAGCAAAATCAAACTATAACCCGTATTACtctaaatttcaagaaattaaaaatgacgcGATGAAGATACCCAAAAGTGAAGTAACAACGGCTAGTTGTTGCAGTGTAACGGGACGTAATGATTTACTTGTGCCAAAAACCGAAACTTCTTCGAATTCTTGTCGGTCGCCACCGTCGGGTATGCGTCCGGCTTATGACCCCTACATGAGTCAAGACTCCAACTCCTCATCGATTTCGAGCATGGAGGCTATGAGTACGCGTGGCCCCCATCATCAATTACAACCCCATCCAAACATACCTCATCAAAACCCTCATCAGGGAGCGTACAATATCGAAGATGCACGTCAACATCCACTCCATAGATCACCTTACCACCAGTCTCCAATGTCTTCAGAGGAAATGTATAGAGATCATCCTATGAGACCTTACGGTGATATGGGCGAATCGATGAGCGGTCCGATTGCAAGGCCGACCGTAACTTACCCTAGCGAAATGGGACCCAGACCATACGACACCTCCTTAACACGGCCTTATGATCCCGGTACTGCTACTGCCTTTGAAAGATACGATTCCTCAAATCAATGTGTTACGTTAGCCCAAACTGTGATGCCTCAACGACCGCCACAAAACATGTACGGTTATGGTCCTTTAGATGAACAACAAGAACAAAGATATCAGCAAGAGGCAGCGGCAGCTGCTCAACATCAAATGGCTGTTGCGAATGCTGCCGCCGCCGGCATGATGAAAACGGAAGCGGATCAACCATCAGGCCCGTTGTATCCGAGaccAATGTATCAATACGACACATCAACGGGGCCTATTCCAATGGGGTTTTCAGCAATCAACTTGTCTGTGAAATGTGTGACTACCGCACAAGCACAAATTAAAGGGACCGGCCCTCACACGTCTCCAGGGGGCACAGTCATCGACCTTTCTACCTCTAGTGTTACCACTACAAGTCCACAGGTGGCATACAGTTCCCCTCACTATGGAGGTCAGCGTGTAGGCGCGAGTCCGCAAGCTGCGGCTAGTCCTCATCTCTCGGCCAGTCCCCAAGTCCCGAGCCCGCAAGGTCAAACTTTAGACCTCAGCGTCAGTCGTTTATCCCACAGtagcGGTACCAGTCCCCAGTACCAAAACGGGCACACAGATGCAGTTCCAGTACCCGCTGGTTTTATTGGACCGCGAGACGAACAAACGGAACCCGTCGATTTTTCCACCGCTAACGAACCGGTTAATTTTAGCGGCGTCCGACCTGTGGCGACCTTTGCTGGTCCAGTTCTGGCACCTGGTTCCGGCTACAGCAGAGAATCGACTCCAGACAGCGGCGGCTCGCACTACATGGAAGCATATCGCGACGCGTCCGGATATGGACCAGTTAGTCCGCATCCCGGTTACGGTATGACACCTGTTCCTGCAGATTATCCGGCAAATCCGTATACGTATCCTGGAGGGGGTTACTCTTGTGCTGGAGGATACCCTGGTCCACCTACCACTGGATACCCGGCGCCACCCGGAGGGTATTCACCCAGCCCATGTTATTCAATGCCTCCACCACAACATTCCATGTCACAGCATGACAAGGGACCCAATAAAGACAATGGTCTTTCAGGTTGTCCCAGAGCTGATCGTTCCCAAATTCAAGCACATTCCCAAGAACTAAAGTGTCCAACTCCTGGATGCGATGGTTCCGGACACGTTACGGGCAATTATTCCTCCCACCGTAGTTTATCGGGATGTCCCAGAGCGAACAAACCGAAAAGTAAACCTCGCGACGGACAAGATTCCGAACCTTTAAGATGCCCTATTCCAGGTTGTGATGGATCAGGACATGCGACAGGAAAGTTCTTGTCGCACCGAAGtgcGTCCGGATGTCCGATAGCAAATCGCAACAAGATGCGCGGTGTGATAGACACCGGTGGAAATGTAGATCAACACAAAGCGGCGGTAGTTGCTGCTTCGGCAATGAAATTTGAAGGCGTTAATTGCCCAACTCCCGGCTGCGACGGCACAGGTCACATTAACGGATCGTTTCTAACGCACCGATCTTTAAGCGGATGCCCTGTCGCAGGTCAAACCGCCAAAAAGCACAAATACGAAGACATGCCGTCGTTTTATGCCAAAGGGTATAGCACAGGTATGGATACCTCTGTCGGAAACGGTGAAGACCTCATGACTTTAGAGGCTGAGATTTCAGAACTGCAAAGAGAAAATGCGCGTGTTGAATCGcagatgattaaattaaaaaatgatattaatgcCATGGAAAGCCATCTTAGTCATGGAGAAAAGGAAACGCAAGCCCTGACCCAACGGAACAGCAACCTCAACGAGTATTACGAGAGCTTGCGCAACAACGTGATTACATTATTGGAGCACGTTCGTCTCCCAGGTGGTGGTACGACTCCTGATAAAATTGGTCAAGAAAATTTCGACTCGTATTTAACAAAACTACAAACTTTATGCACACCAGATGGTTATTGTAGCGAGGAAAACAGACCTTTAAGAGTTGTTGTTACAGGCATAGGAGTTTTATCTCCTGTAGGAATTAGTTTGAAAGAATCTTGGGCAAATATAATTGATGGAAAAAGCGGGATAAGGCAACTCGAAGATCCTTCGTTTCATAAATTGCCAAGCAGAATAGCAGGCTTGTTATACGAAAATGGCGGCAAATTTGaatactcaaaatatttttcaaaaagtgaacTTCGCACAATGAGCCTTGCTACTGTTTACGGACTTCTTGCAGCTAAGGAAGCAATTAACGATGCTAAATTAACAGATTTAACCGAAGAACAAAAGTTATCGACTGGTGTAGCCGTAGGAATGGGAATGGTAGATTTGAATGATATATGCTCAACTCACGAAGCACTCTGCAAGAGTTACAATCAAGTTAGTCCCTTCTTTGTTCCCAGAATTTTATCGAACATGACTGCTGGTCAAATTAGCATTAAATACGGATTTCACGGTCCTAATCATTCAGTGTCGACCGCTTGCGCGACCGGCGCTCATGCAATAGGAGATTCGTTTCGTTTCATTCAGAACGGTGATGCAAACGTTATGGTTTGTGGAGGTACCGAAGCTAGCATTTCTCCATTAGCAGTTGCCGCTTTTTGTCGTTTAAGAGCATTGAGCACATCGTTCAACGACACACCCGAGAAAGCTTCTAGGCCTTTCGATAAGAAACGAGATGGTTTTGTTATAGGAGAAGGGAgtgcaattttagttttagaagaATTGGGGCACGCCATTTTACGTGGCGCTTCAATATATGCCGAAATTTTGGGTTACGGTTTATCAGGGGACGCCTCTCATTTAACGGCACCGCATGCAGATGGCAATGGTGCGATCCTGGCAATGACTAGGGCGATGAAAAATGCTAACGTATCACCAGACGAAGTTGGGTACGTAAATGCGCACGCCACGTCCACCCCATTAGGAGATTCAATTGAAATTAAGGCcattaaaactatatttaaagataatttgcATAATATTGCGGTTTCGTCTACGAAAGGCGCTCACGGACATTTATTGGGCGCTGCCGGCAATTTAGAAGCTGTGTTTGCGATTAAGGCAGTACAGGAAGGGTTGTTGTGCCCCACAATTAATTTCGAATCGACTGATTTAGATCAAGATTTCAATTTCGTACCAAATGTTCATCAAAAATGGACGAGTACCAAACGTAGGGTGGCTTTAAAAAATGCGTTCGGCTTTGGTGGTACGAACGCGTGTTTGTGTATaggagaatttaataaataa
Protein Sequence
MFLSRSNRTVGFPQLKKRNLETSSSIMTSVAKTTKSHDISDLKDDKRKKKIFDESDSGKQVLLPQKKRVLPTSSSSTSSPVKKNLKTGQAVPSQDESAVSETLIRETEAALKNLSGSWPGPRGSSYSKQQEESPAFENLFEEKKGNPKLSPSSSSNSSSDNTCSLKDVITLRDPHEDLDKSLKTNKLNKAIKIKQETEIDNLIKIESECISLQTQTKTKGKQNKGNEPPSQYQPPDFNELVDDSSNELEIDMSEAAADKDDVNDDKLDSKGKKKETEDVKRSRYSSGESSSQTQTVYHPFPRPVTSTVSPFSSTSAFRPPQTDSNKTASKPAVSLNALGPYPEEATFVGYSATIATPTSPMSEDKLKSTVNLAQLKQEGASQIDSNNSCSNSDTKSSIASPEAMNKQYTILQPATVGSRAASALQEAVREGVQSVSAVSSSTSSSSNDSLSKMGSSGFERITGALSPTSIGREGSKCPTPGCNGQGHVTGLYSHHRSLSGCPRKDKVTPEILAMHETILKCPTPGCNGRGHVSSNRNTHRSLSGCPIAAANKQAAREQKYQSSLQHRIKSPHTPFVTSCQSEYSPGFESKPPMTTSVDVKPSYLVAYGSGPTSNAEEAKSNYNPYYSKFQEIKNDAMKIPKSEVTTASCCSVTGRNDLLVPKTETSSNSCRSPPSGMRPAYDPYMSQDSNSSSISSMEAMSTRGPHHQLQPHPNIPHQNPHQGAYNIEDARQHPLHRSPYHQSPMSSEEMYRDHPMRPYGDMGESMSGPIARPTVTYPSEMGPRPYDTSLTRPYDPGTATAFERYDSSNQCVTLAQTVMPQRPPQNMYGYGPLDEQQEQRYQQEAAAAAQHQMAVANAAAAGMMKTEADQPSGPLYPRPMYQYDTSTGPIPMGFSAINLSVKCVTTAQAQIKGTGPHTSPGGTVIDLSTSSVTTTSPQVAYSSPHYGGQRVGASPQAAASPHLSASPQVPSPQGQTLDLSVSRLSHSSGTSPQYQNGHTDAVPVPAGFIGPRDEQTEPVDFSTANEPVNFSGVRPVATFAGPVLAPGSGYSRESTPDSGGSHYMEAYRDASGYGPVSPHPGYGMTPVPADYPANPYTYPGGGYSCAGGYPGPPTTGYPAPPGGYSPSPCYSMPPPQHSMSQHDKGPNKDNGLSGCPRADRSQIQAHSQELKCPTPGCDGSGHVTGNYSSHRSLSGCPRANKPKSKPRDGQDSEPLRCPIPGCDGSGHATGKFLSHRSASGCPIANRNKMRGVIDTGGNVDQHKAAVVAASAMKFEGVNCPTPGCDGTGHINGSFLTHRSLSGCPVAGQTAKKHKYEDMPSFYAKGYSTGMDTSVGNGEDLMTLEAEISELQRENARVESQMIKLKNDINAMESHLSHGEKETQALTQRNSNLNEYYESLRNNVITLLEHVRLPGGGTTPDKIGQENFDSYLTKLQTLCTPDGYCSEENRPLRVVVTGIGVLSPVGISLKESWANIIDGKSGIRQLEDPSFHKLPSRIAGLLYENGGKFEYSKYFSKSELRTMSLATVYGLLAAKEAINDAKLTDLTEEQKLSTGVAVGMGMVDLNDICSTHEALCKSYNQVSPFFVPRILSNMTAGQISIKYGFHGPNHSVSTACATGAHAIGDSFRFIQNGDANVMVCGGTEASISPLAVAAFCRLRALSTSFNDTPEKASRPFDKKRDGFVIGEGSAILVLEELGHAILRGASIYAEILGYGLSGDASHLTAPHADGNGAILAMTRAMKNANVSPDEVGYVNAHATSTPLGDSIEIKAIKTIFKDNLHNIAVSSTKGAHGHLLGAAGNLEAVFAIKAVQEGLLCPTINFESTDLDQDFNFVPNVHQKWTSTKRRVALKNAFGFGGTNACLCIGEFNK

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-