Xxan014671.1
Basic Information
- Insect
- Xanthostigma xanthostigma
- Gene Symbol
- -
- Assembly
- GCA_963575645.1
- Location
- OY754471.1:32659089-32686832[+]
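The Location field packs the sequence accession, the start and end coordinates, and the strand into one string. A minimal sketch of splitting it apart is shown here. The function name `parse_location` and the dictionary keys are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re

# Layout assumed from the Location field: <seqid>:<start>-<end>[<strand>]
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text):
    """Split a location string into its components (hypothetical helper)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    return {
        "seqid": match["seqid"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Assumption: coordinates are 1-based and inclusive.
        "span": end - start + 1,
    }


loc = parse_location("OY754471.1:32659089-32686832[+]")
print(loc)
```

Under that assumption the locus spans 27744 bases on the plus strand of OY754471.1.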
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The zinc finger is described by the pattern #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
 #  of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to   acc
 1  33      0.73        15    6.8   4.1         1      14       266     279       266     281  0.92
 2  33      0.15         3    9.0   5.3         1      23       559     581       559     581  0.98
 3  33   0.00016    0.0033   18.3   0.9         1      23       587     609       587     609  0.99
 4  33   1.2e-05   0.00025   21.9   5.1         1      23       615     637       615     637  0.99
 5  33   3.6e-07   7.3e-06   26.7   1.8         1      23       643     665       643     665  0.99
 6  33   0.00033    0.0067   17.3   4.4         1      23       671     693       671     693  0.98
 7  33   0.00028    0.0058   17.5   6.9         1      23       699     721       699     721  0.99
 8  33   2.4e-07   4.8e-06   27.2   2.6         1      23       727     749       727     749  0.99
 9  33    0.0039      0.08   14.0   7.1         1      23       755     777       755     777  0.98
10  33    0.0001    0.0022   18.9   6.5         1      23       783     805       783     805  0.99
11  33   1.5e-06     3e-05   24.7   2.6         1      23       811     833       811     833  0.99
12  33   8.1e-07   1.7e-05   25.5   1.6         1      23       839     861       839     861  0.99
13  33   4.6e-05   0.00094   20.0   6.2         1      23       867     889       867     889  0.98
14  33   0.00013    0.0027   18.6   6.9         1      23       895     917       895     917  0.99
15  33     0.014       0.3   12.2   5.3         1      23      1205    1227      1205    1227  0.98
16  33   0.00022    0.0044   17.9   0.5         1      23      1233    1255      1233    1255  0.99
17  33   1.2e-05   0.00025   21.9   5.1         1      23      1261    1283      1261    1283  0.99
18  33   0.00011    0.0022   18.9   4.5         1      23      1289    1311      1289    1311  0.99
19  33     6e-05    0.0012   19.7   6.8         1      23      1317    1339      1317    1339  0.99
20  33    0.0047     0.096   13.7   5.7         1      23      1345    1367      1345    1367  0.98
21  33     2e-05    0.0004   21.2   6.0         1      23      1373    1395      1373    1395  0.99
22  33   0.00033    0.0068   17.3   6.4         1      23      1401    1423      1401    1423  0.98
23  33   3.4e-06     7e-05   23.6   5.5         1      23      1429    1451      1429    1451  0.99
24  33   1.2e-05   0.00025   21.9   5.1         1      23      1457    1479      1457    1479  0.99
25  33   0.00033    0.0068   17.3   6.4         1      23      1485    1507      1485    1507  0.98
26  33    0.0019     0.039   14.9   3.2         1      23      1513    1535      1513    1535  0.98
27  33     6e-05    0.0012   19.7   6.8         1      23      1541    1563      1541    1563  0.99
28  33    0.0047     0.096   13.7   5.7         1      23      1569    1591      1569    1591  0.98
29  33   9.7e-05     0.002   19.0   8.2         1      23      1597    1619      1597    1619  0.99
30  33   0.00033    0.0068   17.3   6.4         1      23      1625    1647      1625    1647  0.98
31  33   7.3e-06   0.00015   22.5   3.1         1      23      1653    1675      1653    1675  0.99
32  33     8e-05    0.0016   19.3   7.8         1      23      1681    1703      1681    1703  0.99
33  33   3.2e-07   6.6e-06   26.8   2.9         1      23      1709    1731      1709    1731  0.99
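Each domain row in the Hmmscan output carries thirteen whitespace-separated values: domain number, total domains, c-Evalue, i-Evalue, score, bias, the hmm, ali and env coordinate pairs, and acc. The following sketch shows one way such rows could be read and filtered. The `DomainHit` type, the `parse_row` helper, and the 0.01 i-Evalue cutoff are illustrative assumptions, not values taken from this record's pipeline. Only four of the 33 rows are embedded to keep the example short.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """One per-domain row (field names are hypothetical)."""
    number: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line: str) -> DomainHit:
    """Convert one whitespace-separated row into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]), float(fields[4]), float(fields[5]),
        *(int(x) for x in fields[6:12]),
        float(fields[12]),
    )


# Rows 1, 2, 3 and 5 copied from the table in this record.
SAMPLE_ROWS = """\
1 33 0.73 15 6.8 4.1 1 14 266 279 266 281 0.92
2 33 0.15 3 9.0 5.3 1 23 559 581 559 581 0.98
3 33 0.00016 0.0033 18.3 0.9 1 23 587 609 587 609 0.99
5 33 3.6e-07 7.3e-06 26.7 1.8 1 23 643 665 643 665 0.99
"""

MAX_I_EVALUE = 0.01  # assumed cutoff, chosen only for illustration

hits = [parse_row(line) for line in SAMPLE_ROWS.splitlines()]
confident = [h for h in hits if h.i_evalue <= MAX_I_EVALUE]

for h in confident:
    print(h.number, h.ali_from, h.ali_to, h.i_evalue)
```

With this cutoff, domains 1 and 2 are dropped and domains 3 and 5 are kept. A different threshold would change which of the weaker matches count as zinc fingers.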
Sequence Information
- Coding Sequence
- ATGAAGCGTGCTGCCGTTCAGATAACCACGGGCGATGAAAAGCCGAATTTTATCTGCCAATCATGTCACACTAAACTGGAAGAAACCCATCAGTTCTTGACACAAATAGAACGGTCTGACAGAACACTGCAACGTTATCTTCAGTTATTATGCGTTTCTGACAACTCAATCAAAGCTAAAAATTCTATCCAACCTTCTGAGGATAATTTAATCCTCAACACTAATAACGATATTGTGAAAAGTGAATCTCTAGAGCTCAACGACGAGTTTGACACGAAACTTGAAAATGTCATccatgaaaacaaaaataacggggaaatatgtttaaaatctgaaatcagAGTCGAACTGGTtgatgataatttatttttaccagACATTTCGTCTAATATAGATAACAACGAATTCATATGTGAAGACTCAAAGATCGACATAGGTGAGACCAAAGTGGGTGAAGATATGACAGTTAACTGGAAAGAGCCAGAATCGGAAAGTTATAATGTAACGGACAAATTAATGGACGAAATatctttaaatttcaaaatcaatgACGAAAATGTGGAcgcaaatttattaatttttacagaaTCAGTCGATCACTCGGAAAACGCTTCATCTGTCGGCGATGTGACTGAAATTAGCCGCGAAGATACAATGATTAATATTGGCGACACATCGCAGAATACGTGTTCACGTGCTGCAAATGTGATCGAAAATTATCGTAAATGTGATGAGAGAAATAAAAACCTTTCGATAAATCGTgagaaacacaaaaaaacaaataatttgaaatcGTATCAATGCGACATTTGCAATAAGTGTTACTCCACTCAAAAATACAAACTCAGCGATATGTGGTTATTCATGAAGGAAATTGAAAGTCAAGCACTGAAAGGAGGCATAACCACGGGCGATGAAAAGCCGAATTTTATCTGCCAATCATGTCATACTAAACTGGAAGAAACTCATCAGTTCTTGACACAAATAGAACGGTCTGACAGAACACTGCAAAGTTATCTTCAGTTATTATGCGTTTCTGGCAACTCAATCAAAGCTAAAAATTCTATCCAACCTTCTGAGAATAATTTAATCCTCAACACTAATAACGATATTGTGAAAAGTGAATCTCTAGAGCTCAACGACGAGTTTGACACGAAACCTGAAAATGTCatccatcaaaaaaaaaaaaaaggggaaatatgtttaaaatctgaaatcaaagtcGAACTGGTtgatgataatttatttttaccagACATTTCGTCTAATATAGATGACAACGAATTCATATGTGAAGACTCAAAGATCGACATAGGTGAGACCAAAGTGGGTGAAGATATGACAGTTAACTGGAAAGAGCCAGAATCGAAAAGTTATAATGTAACGGACAAATTAATGGACGAAATatctttaaatttcaaaatcaatgACGAAAAGGTGGACgcaaatttgttaatttttacagAATCAGTCGATCACTCGGAAAACGCTTCATCTGTCGGCGATGTGACAGAAGTTAGCCGCGAAGATCCAAAGATTAATAAAGGCGACACATCGCAGAATTCGTGTTCACGTGCCGTAAATGTGAATGAAAATTATCGTAAATGTGAtgagagaaataaaaatattgtgataAATCGTaagaaacacaaaaaaacaaataatttgaaatcGTATCAATGCGACATTTGCAAAAGGTGTTACTCCAGTGAGTCTGGTTTCAATTTTCATAAACGAaaacatactggtgaaaaaccatatAAATGCAACGTTTGTATTAAATCCTTTATTACATCCGAAAAATTGAAGATCCACCAACGGATACATACCggagagaaaccttatcagtgtgacatttgtaagaagagtTTTTCTCATTCATCCACTTTAACGTTACACAAGCGtaaacataccggtgagaaaccttatcagtgtgaaaTTTGTAAGAAGAGTTTCTCTCAGTCATCCAATTTAACGttacacaaacgtatacatacc
ggAGAGAGACCTTATCAATGTGAGCATTGTAAGAAGCGTTTCTCTAAGTTATCTAATTTAAATGGACACAAGATTAAACATACCGGCGAGAAATCTTATCAGTGTGACTTTTGTAAGAAGCGTTTTTCTCATTCATCCACTTTAACGTTACACAAGCGtaaacataccggtgagaaaccttatcagtgtgacatttgtaagaagagtTTCTCTCAGTCATCCAATTTAACGTTacacaaacgtacacataccgGAGAGAGACCTTATCAATGTGAGCATTGTAAGAAGCGTTTCTGTAAGTTATCTAATTTAAATGGACACAAGATTAAacataccggcgagaaaccttatcagtgtgactttTGTAAGAAGAGTTTTTCTCATTCATCCACTTTAACGTTACACAAGCGtaaacataccggtgagaaaccgtatcagtgtgacatttgtaagaagagtttctctcagtcatttaatttaatatcacacaaacgtacacataccggtgagaaaccgtatcagtgtgacatttgtaacaaGAGTTTCTCTCAGTCATCCACTTTAACGttacacaaacgtatacataccaGAGAGAGACCTTATCAATGTGAGCATTGTAAGAAGCGTTTCTCTAAGTTATCCAATTTAAAAAGACACAAGATTAAacataccggcgagaaaccttatcagtgtgacatttgtatgAAGTGTTTCTCTCACTCATCCAATTTAACGTCacacaaacgtacacataccggtgagaaccTTACCAGTGTGGACTTTGTGAGAAGAGTTTCTCTCAGTTATGCAATTTCAATACACACAAGCTtaaacataccgATAACCAGGGGCGATGAAAAGCCGAATTTTATCTGCCAATCATGTCACACTAAACTGGAAGAAACTCATCAGTTCTTAACACAAATAGAACGGTCTGACAGAACACTGCAACGTTATCTTCAGTTATTATGCGTTTCTGGTAATTCAATCAAAGCTGAAAATTCTATTCAACCTTCTGAGGATAATTTAATCCTCAACACTAATAACGATATTGCGAAAAGTGAATCTTTAGATCTCAACGACGTGTTTGACACGAAACCTGAAAATGTCAtccataaaaacaaaaataacggggaaatatgtttaaaatctgaaatcaaagtcGAACTGGTtgatgataatttatttttaccagACATTTCGTCTAATATAGATGACAACGAATTCATCAGTGAAGACTCAAAGATCGACATAGGTGAGACCAAAGTGGGCGAAGATACGACAGTCAATAGGAAAGAGCCAGAATCGGAAAATTATAATGTAACGGACAAATTAATGGACGACATATCTTTAAAGTTCAATATCAATGACGAAAAGGTGGAcgcaaatttattaatttttacagaaTCAGTCGATCATTCGGAAAACGCTTCATCTGTCAGCGATGTGACTGAAATTAGTCGCGAAGATACAAAGATTAATATTGGCGACACATCGCAGAATTCATGTTCACGTGCCGCAAATGTGATCGAAAATTATCGTAAATGTGATGAGAGAAATAAAAACCTTTCGATAAATCGTAAGAAACACAAAAAAGCAAATAATTTGAAATCGTATCAATGCGACATTTGCAATAAGTGTTACTCCAGTGAGTCTGATTTCAATTTTCATAAACGAaaacatactggtgaaaaaccatatAAATGCAACGTTTGTATTAAATCCTTTATTACATCCGAAAAATTGGAGATCCACCAACGGATACATACCggagagaaaccttatcagtgtgacatttgtaagaagagtTTTTCTCATTCATCCACTTTAACGTTACACAAGCGtaaacataccggtgagaaaccttatcagtgtgacatttgtaagaaatatTTCTCTCATGCATCCACTTTAACGTTACACAAGCGtaaacataccggtgagaaaccttatcagtgtgacatttgtaagaagtgtttctcTCATGCATCCACTTTAAC
GTTACACAAGCGtaaacataccggtgagaaaccttaccAGTGTGGACTTTGTAAGAAGAGTTTCTCTCAATTATGCAATTTCAATACACACAAGCTtaaacataccggtgagaaaccttatcaatgtgacgtatgtaaaaagtgtttctctCAATCATCCACTTTAAATACACACAAGCGtaaacataccggtgagaaaccttatcagtgtgacatttgtaagaagtgtttctcTCAGTCATCCAATTTAACGTTTCACAAGCGtaaacataccggtgagaaaccttatcagtgtgacatttgtaagaagtgtttctcTCAGTCATCCAATTTAACGTTACACAAGCGtaaacataccggtgagaaaccttatcagtgtgacatttgtaagaagagtTTTTCTCATTCATCCACTTTAACGTTACACAAGCGtaaacataccggtgagaaaccttatcagtgtgacatttgtaagaagtgtttctcTCAGTCATCCAATTTAACGTTTCACAAGCGtaaacataccggtgagaaaccttatcagtgtgacatttgtgagGAGAGTTTCTATCATGCATCCAATTTAACGTTTCACAAGCGtaaacataccggtgagaaaccttatcagtgtgacatttgtaagaaatgtttctCTCATGCATCCACTTTAACGTTACACAAGCGtaaacataccggtgagaaaccttaccAGTGTGGACTTTGTAAGAAGAGTTTCTCTCAATTATGCAATTTCAATACACACAAGCTtaaacataccggtgagaaaccttatcaatgtgacgtatgtaaaaagtgtttctctCATTCATCCACTTTAAATACACACAAGCGtaaacataccggtgagaaaccttatcagtgtgacatttgtaagaagtgtttctcTCAATCATCCAATTTAACGTTTCACAAGCGtaaacataccggtgagaaaccttatcagtgtgacatttgtgagAAGAGTTTCTCTCATGCATCCACTTTAACGTTACACAAGCGtaaacataccggtgagaaaccttatcagtgtgacatttgtaagaaatgtttctCTCAGTCATGCAATTTAACGTCACACAAACGTatgcataccggtgagaaaccttatcagtgtgacatttgtaagaagagtttctctcagtcatccaatttatttaaacataagcGTATACATACTCGTTAA
- Protein Sequence
- MKRAAVQITTGDEKPNFICQSCHTKLEETHQFLTQIERSDRTLQRYLQLLCVSDNSIKAKNSIQPSEDNLILNTNNDIVKSESLELNDEFDTKLENVIHENKNNGEICLKSEIRVELVDDNLFLPDISSNIDNNEFICEDSKIDIGETKVGEDMTVNWKEPESESYNVTDKLMDEISLNFKINDENVDANLLIFTESVDHSENASSVGDVTEISREDTMINIGDTSQNTCSRAANVIENYRKCDERNKNLSINREKHKKTNNLKSYQCDICNKCYSTQKYKLSDMWLFMKEIESQALKGGITTGDEKPNFICQSCHTKLEETHQFLTQIERSDRTLQSYLQLLCVSGNSIKAKNSIQPSENNLILNTNNDIVKSESLELNDEFDTKPENVIHQKKKKGEICLKSEIKVELVDDNLFLPDISSNIDDNEFICEDSKIDIGETKVGEDMTVNWKEPESKSYNVTDKLMDEISLNFKINDEKVDANLLIFTESVDHSENASSVGDVTEVSREDPKINKGDTSQNSCSRAVNVNENYRKCDERNKNIVINRKKHKKTNNLKSYQCDICKRCYSSESGFNFHKRKHTGEKPYKCNVCIKSFITSEKLKIHQRIHTGEKPYQCDICKKSFSHSSTLTLHKRKHTGEKPYQCEICKKSFSQSSNLTLHKRIHTGERPYQCEHCKKRFSKLSNLNGHKIKHTGEKSYQCDFCKKRFSHSSTLTLHKRKHTGEKPYQCDICKKSFSQSSNLTLHKRTHTGERPYQCEHCKKRFCKLSNLNGHKIKHTGEKPYQCDFCKKSFSHSSTLTLHKRKHTGEKPYQCDICKKSFSQSFNLISHKRTHTGEKPYQCDICNKSFSQSSTLTLHKRIHTRERPYQCEHCKKRFSKLSNLKRHKIKHTGEKPYQCDICMKCFSHSSNLTSHKRTHTGENLTSVDFVRRVSLSYAISIHTSLNIPITRGDEKPNFICQSCHTKLEETHQFLTQIERSDRTLQRYLQLLCVSGNSIKAENSIQPSEDNLILNTNNDIAKSESLDLNDVFDTKPENVIHKNKNNGEICLKSEIKVELVDDNLFLPDISSNIDDNEFISEDSKIDIGETKVGEDTTVNRKEPESENYNVTDKLMDDISLKFNINDEKVDANLLIFTESVDHSENASSVSDVTEISREDTKINIGDTSQNSCSRAANVIENYRKCDERNKNLSINRKKHKKANNLKSYQCDICNKCYSSESDFNFHKRKHTGEKPYKCNVCIKSFITSEKLEIHQRIHTGEKPYQCDICKKSFSHSSTLTLHKRKHTGEKPYQCDICKKYFSHASTLTLHKRKHTGEKPYQCDICKKCFSHASTLTLHKRKHTGEKPYQCGLCKKSFSQLCNFNTHKLKHTGEKPYQCDVCKKCFSQSSTLNTHKRKHTGEKPYQCDICKKCFSQSSNLTFHKRKHTGEKPYQCDICKKCFSQSSNLTLHKRKHTGEKPYQCDICKKSFSHSSTLTLHKRKHTGEKPYQCDICKKCFSQSSNLTFHKRKHTGEKPYQCDICEESFYHASNLTFHKRKHTGEKPYQCDICKKCFSHASTLTLHKRKHTGEKPYQCGLCKKSFSQLCNFNTHKLKHTGEKPYQCDVCKKCFSHSSTLNTHKRKHTGEKPYQCDICKKCFSQSSNLTFHKRKHTGEKPYQCDICEKSFSHASTLTLHKRKHTGEKPYQCDICKKCFSQSCNLTSHKRMHTGEKPYQCDICKKSFSQSSNLFKHKRIHTR
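The pattern quoted in the Description field, #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], can be approximated with a regular expression and run against the protein sequence. The sketch below is a rough illustration only. It treats the # positions as any residue, because the description does not say which residues are allowed there, and a regular expression is far cruder than the profile HMM behind PF00096. The names `C2H2_RE` and `find_c2h2` are hypothetical, and the fragment is a short stretch copied from the protein sequence above.

```python
import re

# Pattern:  #  X  C  X(1-5)  C  X3 # X5 # X2   H  X(3-6)  [H/C]
# Regex:    .  .  C  .{1,5}  C  .{12}          H  .{3,6}  [HC]
# Assumption: '#' positions accept any residue, so X3-#-X5-#-X2
# collapses to twelve unconstrained positions.
C2H2_RE = re.compile(r"..C.{1,5}C.{12}H.{3,6}[HC]")


def find_c2h2(protein):
    """Return (start, end, text) for non-overlapping matches, 1-based inclusive."""
    return [
        (m.start() + 1, m.end(), m.group())
        for m in C2H2_RE.finditer(protein)
    ]


# Fragment copied from the protein sequence in this record.
fragment = "YKCNVCIKSFITSEKLKIHQRIHTGEKPYQCDICKKSFSHSSTLTLHKRKHTGEKP"

fingers = find_c2h2(fragment)
for start, end, text in fingers:
    print(start, end, text)
```

On this fragment the expression reports two 23-residue matches, at fragment positions 1-23 and 29-51, separated by the TGEKP linker. Coordinates are relative to the fragment, not to the full protein.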
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -