Aatr000978.1
Basic Information
- Insect
- Anopheles atroparvus
- Gene Symbol
- stc
- Assembly
- GCA_015501955.1
- Location
- CALTRM010000006.1:705568-708731[-]
Transcription Factor Domain
- TF Family
- zf-NF-X1
- Domain
- zf-NF-X1 domain
- PFAM
- PF01422
- TF Group
- Zinc-Coordinating Group
- Description
- This domain is presumed to be a zinc binding domain. The following pattern describes the zinc finger. C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C Where X can be any amino acid, and numbers in brackets indicate the number of residues. Two position can be either his or cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 14 0.93 6.3e+03 -4.0 1.6 15 19 309 313 308 313 0.81 2 14 0.0054 37 3.1 0.2 4 10 342 348 341 349 0.96 3 14 7.7e-07 0.0052 15.4 15.2 3 19 355 371 354 371 0.93 4 14 1.4e-06 0.0098 14.6 12.7 1 18 407 424 407 425 0.93 5 14 0.91 6.2e+03 -4.0 1.2 6 10 455 459 455 459 0.96 6 14 6.4e-11 4.4e-07 28.5 11.0 1 18 465 482 465 483 0.98 7 14 0.00019 1.3 7.8 18.8 1 19 524 546 524 546 0.86 8 14 0.0056 38 3.1 6.8 1 11 586 596 586 597 0.94 9 14 0.24 1.7e+03 -2.2 1.1 4 10 601 607 600 607 0.91 10 14 1.8e-07 0.0012 17.4 16.2 3 18 615 630 613 631 0.92 11 14 1 6.8e+03 -6.0 9.5 10 18 678 687 667 688 0.78 12 14 1.3e-06 0.0091 14.7 9.2 1 16 724 739 724 746 0.89 13 14 2.9e-07 0.002 16.8 12.5 1 19 756 775 756 775 0.97 14 14 0.26 1.8e+03 -2.2 0.7 7 12 790 795 784 795 0.68
Sequence Information
- Coding Sequence
- ATGGCATCCGGCAGTGGAAACGGCGCCACCAACGGCGGTAGCGCTGCTTTCAACGAATTCGTTTCCCAGTTCAACGCTCGTTTAGAGCTACAGTATTTTCCACCGAGCAACGGGCAGCAGCAAGGAGCCGCCGCTTCTTCTGCCGCCCCGTTTTCGCCTTATGTGATCGCTTCCAATCTTACACCGACGGCGGCCGAATTTGTGCCCCGTAAGTACCAGCCCGCTACACTACCACAGCCTGCTTCGATGGAAGCGTACGTTCTAGCTGCCCCGTCAGTGTTCAACAACGTTTGCGAAGCGCCTACTGCGTCCACCGATCGGGAGGATCGGGACGAGCGGGAGAGTGGTCGTGCGGGACCCTCCGGTAATCACCACCACGGTTCCGGGCGGGCCGGGTTCGGTGGGGGTAGTAGTCGCCCACGGCGTAACGAAAGTCTGAAGCCGCGCAATGGCTGGCACACCCGGGACGATCGTCCGGTGGGGCGTGGCAAGTGGAAGGCCAATCAAGCGACTGCAGGCAGTGGACGGCTCCAGCGGCTCCAGCATCCAGACGACGACGAGTACGACGACCGACCGTCGCTGAACGGAGAAGGCTCATCGTCGCGCTACGCCACCGGGAGACAGCGAGGAAATCAAAGGGCGATGCACCACCATGACCAGGATCGCAAGGCGAACTTTCACCTGAGCCAGTCGGACGAACCGTCGGTACCGGTGGAGAGCTCGGTCAAACCGCTCTCCAAGTGCTCCCAGCGGGAGAAGCTAATGCGCGAGATCGAATGCTACCGACTCGAGTGTTTGGTGTGCTGCGAGATAGTTAAGCCGGTGCAGTCGACCTGGTCGTGCGGTAACTGTTACCATATTTTGCATCTGGCCTGCGTGACCCGCTGGGCGACCAGTTCGCAGTCGGACGATGGCAGCTGGCGCTGCCCGGCCTGCCAGAACGTCCTGAAAGCAGTGCCGCGCGAATATTTCTGCTTCTGCGGCAAGCAGAAGAACCCCTCGTACAATAGGAACGACTTGGCGCACACGTGCGGGGAGGTCTGCGGTCGGCGGGACCTGTGCGAGCACGCGTGCACGCTCCTGTGTCACCCGGGCCCGTGTCCTCCGTGCCAGGCGTCGGTGCAGCGGCGGTGCGGCTGCGGCCGACTGGAGAAACCGCTGCAGTGCAGCCAGAAGGAGGAGCTGCTGTGCGAAGCGACGTGTGATAAGCCGCTGAACTGTGATCGGCATCGGTGCGCCAAACGCTGCCACGGTGGGGAGTGCGATCCGTGCGAGGAGCAGGTCCAGCACAACTGTTACTGCGGCAAAAGCGACCAGCTGGTGGCGTGCACCAAGGGCAATCTGGAGAAAACGCGCTACGGGTGTGAAGCGGTTTGCGATCATCCACTGTCCTGCGGGAACCATCGGTGCGCACGGTTGTGTCACGAGGGCGAGTGCGCCCCGTGTGCGGACAGTCCCAGCATGGTGCTCAGCTGCCCCTGCGGACGGCAAGCCATCGAAGCCGGAAGCCGTACGTCCTGTCTCGATCCCGTGCCCACGTGCGCGGCCAACTGCGGGAAGAAGCTGACGTGTGGGCCGGCCGGAGCGCATCATTGCTGTGATGCGCGCTGCCATCGGGGCGAGTGTCCACCGTGCAAGAAGTCCACCACCGTCAAATGCCGCTGTGGCAACATGGCCCAGCCGATGAAGTGCAAGGACCTGACGACGCGTGCGGATGACGCGCGTTGCAAAAAGCGTTGCGTGCGGAGGCGTAGCTGCGCGAAGCACAAGTGCAACCAGCTGTGCTGCATCGACATCGATCACGTCTGCCCCAAAACGTGCTCCCTGCAGCTGTCCTGCCTGCGCCACCGGTGCGACAAGCCGTGCCACAAGGGCAACTGCCAGCCGTGCCACCGGGTTTCGTTCGACGAGCTGACGTGCGACTGCGGGGCCAACATCATCTACCCGCCGGTGCCGTGCGGCACGAAAAAGCCGGCCTGCGATCGAACGTGCACTCGGCGGCACGCCTGCGATCATCCGGCGCTGCACAACTGTCACGCCGAGCCGGAGTGTCCTCCGTGCGTGGTACTGACCGCGCAGTACTGCTTCGGCAAGCACGAGCAGCGCAAGACGATTCCCTGCTACCAGCGTTCGTTCAGCTGCGGAATGCCCTGCGCTCGGCCGCTGTCCTGCGGGCGCCACAAGTGCATCCGGCCGTGTCACGACGACAACTGTGCGCAGGAGGGGGTGGTGTGCAAGCAAAACTGCACCACCGTCCGGGAGGCCTGCGGGCACCTGTGCAATGCGCCCTGCCACGAGGGCGACTGTCCGGATGTGCCCTGCCGCGTGACGGTCGAGGTGACGTGCGAGTGCGGCAATCGCAAGCAGCAGCGGTCGTGTCACGATTTTTCCAAAGAATATCGACGCATTGCCTCGACCCAGCTGGCTTCGTCGGTGCAGGAAATGCAGCGCGGTGGATCGGTCGAACTGAGCGACATTTTGGGACCAATAAAACCGAAAAATAACAAAACTCTTGACTGCAACGAAGAATGCCGACTGCTCGAGCGCAACCGGCGGCTGGCGATGGCCCTGCAAATTCGCAACCCCGATCTGGCCACGAAGCTGCAACCCAACTACTCCGAGTTCCTGCGCAGCTACGCCAAGAAAGATCTGCCCCTGGTACAGATGATACACGACAAGCTGACCGAGCTGGTGAAGCTGGCCAAGGAGAGCAAGCAAAAGTCCCGCTCGTACTCGTTTCCGGTGATGAACCGCGAGAAGCGCACGGTGGTGAAAGAGATGTGCAACATGTTCGGCGTCGAAGCGGTCGCCTACGATGCCGAACCGAATCGGAATGTTGTGGCAACGGCGGATCGTTTTACGTCGTGGCTACCGAGCATGAGTCTGATGGAGGTTTTACAGCGGGAGAATGGTCAACGGCGCATCGTCGTACCGAACCTGAACGCCTGGGGACGTACGTCGGGTGCGTCTGCGTCGAACAGCAAATGA
- Protein Sequence
- MASGSGNGATNGGSAAFNEFVSQFNARLELQYFPPSNGQQQGAAASSAAPFSPYVIASNLTPTAAEFVPRKYQPATLPQPASMEAYVLAAPSVFNNVCEAPTASTDREDRDERESGRAGPSGNHHHGSGRAGFGGGSSRPRRNESLKPRNGWHTRDDRPVGRGKWKANQATAGSGRLQRLQHPDDDEYDDRPSLNGEGSSSRYATGRQRGNQRAMHHHDQDRKANFHLSQSDEPSVPVESSVKPLSKCSQREKLMREIECYRLECLVCCEIVKPVQSTWSCGNCYHILHLACVTRWATSSQSDDGSWRCPACQNVLKAVPREYFCFCGKQKNPSYNRNDLAHTCGEVCGRRDLCEHACTLLCHPGPCPPCQASVQRRCGCGRLEKPLQCSQKEELLCEATCDKPLNCDRHRCAKRCHGGECDPCEEQVQHNCYCGKSDQLVACTKGNLEKTRYGCEAVCDHPLSCGNHRCARLCHEGECAPCADSPSMVLSCPCGRQAIEAGSRTSCLDPVPTCAANCGKKLTCGPAGAHHCCDARCHRGECPPCKKSTTVKCRCGNMAQPMKCKDLTTRADDARCKKRCVRRRSCAKHKCNQLCCIDIDHVCPKTCSLQLSCLRHRCDKPCHKGNCQPCHRVSFDELTCDCGANIIYPPVPCGTKKPACDRTCTRRHACDHPALHNCHAEPECPPCVVLTAQYCFGKHEQRKTIPCYQRSFSCGMPCARPLSCGRHKCIRPCHDDNCAQEGVVCKQNCTTVREACGHLCNAPCHEGDCPDVPCRVTVEVTCECGNRKQQRSCHDFSKEYRRIASTQLASSVQEMQRGGSVELSDILGPIKPKNNKTLDCNEECRLLERNRRLAMALQIRNPDLATKLQPNYSEFLRSYAKKDLPLVQMIHDKLTELVKLAKESKQKSRSYSFPVMNREKRTVVKEMCNMFGVEAVAYDAEPNRNVVATADRFTSWLPSMSLMEVLQRENGQRRIVVPNLNAWGRTSGASASNSK
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00106152;
- 90% Identity
- -
- 80% Identity
- -