Btab009041.1
Basic Information
- Insect
- Bemisia tabaci
- Gene Symbol
- -
- Assembly
- GCA_903994105.1
- Location
- NW:327788-356811[-]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 45 0.00011 0.005 16.9 2.9 1 23 241 263 241 263 0.98 2 45 0.3 13 6.1 0.7 9 23 277 291 269 291 0.86 3 45 3.8e-07 1.7e-05 24.6 1.1 1 23 297 319 297 319 0.98 4 45 4.6e-07 2.1e-05 24.3 3.3 1 23 325 347 325 347 0.98 5 45 0.00018 0.0079 16.2 1.3 1 23 353 375 353 375 0.98 6 45 9.3e-05 0.0042 17.1 0.8 1 23 381 403 381 403 0.97 7 45 8.3e-05 0.0037 17.2 5.4 1 23 409 431 409 431 0.99 8 45 2.6e-05 0.0012 18.8 5.6 1 23 437 459 437 459 0.98 9 45 3.5e-05 0.0016 18.4 5.4 1 23 465 487 465 487 0.98 10 45 0.0049 0.22 11.7 2.0 1 23 493 515 493 515 0.98 11 45 8.4e-05 0.0038 17.2 5.2 1 23 521 543 521 543 0.98 12 45 0.00024 0.011 15.8 3.7 3 23 550 571 549 571 0.90 13 45 3.1e-06 0.00014 21.7 4.3 1 23 577 599 577 599 0.98 14 45 2.2e-05 0.001 19.0 1.4 1 23 605 627 605 627 0.98 15 45 2.4e-06 0.00011 22.1 1.7 1 23 633 655 633 655 0.99 16 45 4.7e-06 0.00021 21.2 2.5 1 23 661 683 661 683 0.99 17 45 0.00086 0.039 14.1 3.4 1 23 689 711 689 711 0.99 18 45 0.00026 0.012 15.7 5.0 1 23 717 739 717 739 0.98 19 45 0.00074 0.033 14.3 5.7 1 23 745 767 745 767 0.98 20 45 6.5 2.9e+02 1.9 1.0 1 12 773 784 773 787 0.90 21 45 0.00011 0.005 16.9 2.9 1 23 1049 1071 1049 1071 0.98 22 45 9.4e-05 0.0042 17.1 4.4 1 23 1077 1099 1077 1099 0.98 23 45 0.00047 0.021 14.9 5.5 1 23 1105 1127 1105 1127 0.98 24 45 6.1e-06 0.00027 20.8 4.7 1 23 1133 1155 1133 1155 0.98 25 45 3.8e-07 1.7e-05 24.6 1.1 1 23 1161 1183 1161 1183 0.98 26 45 0.0022 0.098 12.8 3.0 1 23 1189 1211 1189 1211 0.98 27 45 0.00017 0.0076 16.3 4.1 1 23 1217 1239 1217 1239 0.98 28 45 4.2e-06 0.00019 21.3 2.6 1 23 1245 1267 1245 1267 0.97 29 45 3.8e-07 1.7e-05 24.6 1.1 1 23 1273 1295 1273 1295 0.98 30 45 3.7e-06 0.00017 21.5 3.8 1 23 1301 1323 1301 1323 0.98 31 45 5.5e-05 0.0025 17.8 2.4 1 23 1329 1351 1329 1351 0.98 32 45 4.5e-05 0.002 18.1 1.3 1 23 1357 1379 1357 1379 0.97 33 45 6e-05 0.0027 17.7 1.6 1 23 1385 1407 1385 1407 0.99 34 45 0.0011 0.048 13.8 0.5 1 23 1413 1435 1413 1435 0.94 35 45 2.4e-05 0.0011 19.0 3.3 1 23 1441 1463 1441 1463 0.98 36 45 9.7e-07 4.4e-05 23.3 4.2 1 23 1469 1491 1469 1491 0.99 37 45 2e-06 8.9e-05 22.4 4.3 1 23 1497 1519 1497 1519 0.98 38 45 3.1e-05 0.0014 18.6 2.6 1 23 1525 1547 1525 1547 0.98 39 45 6.9e-06 0.00031 20.6 2.2 1 23 1553 1575 1553 1575 0.99 40 45 0.00026 0.012 15.7 1.6 1 23 1581 1603 1581 1603 0.98 41 45 0.0022 0.098 12.8 3.0 1 23 1609 1631 1609 1631 0.98 42 45 0.00017 0.0076 16.3 4.1 1 23 1637 1659 1637 1659 0.98 43 45 1e-05 0.00046 20.1 3.3 1 23 1665 1687 1665 1687 0.97 44 45 0.0001 0.0045 17.0 2.8 1 23 1693 1715 1693 1715 0.98 45 45 2.2e-05 0.00096 19.1 6.0 1 23 1721 1744 1721 1744 0.97
Sequence Information
- Coding Sequence
- ATGAATCCGTTGGCGGATATTACCCTCCAAATTTCTTCAACAGATCACTCGCCGATCGAACGTAATCCACCAATCGAATCATCAGCTTCTGGAAGAGGATCCCCCTCTGTTGACTTTTTCTTTGTGAAATGTGAAGACGGGCTAAACTTTTATGACGAAAGTATCCCACCAAGGCAAGTTGAGACGATTCCTCCCAGTTCTTTAGCAAACCCTCTCTTTGTAAAATGTGAAGACGGACTCAATCGTTGTGAGGAAAATACAAAACCCAGGCAACTTGCGACGAGTCCATCCAGATCTTCAGCAGACCCTCGTCTTATAAAATGTGAAGACGAACTCACTTTATGTGTCAAAAATTGGCAACCCAGGACACTTGAGACGATTCCATCAAGATCTTCAGCAACCCTTTTGACCACATTCAAAAGGGAACCAGgcgaagaaaattatattcCGATCGATCCCTTATATACCAATAATGACATCAAAAGAGAATCGTCGTTGCAATCCTTCGAAGGTCAAAATGTCGTAAACGCTGAGAATGAAGCGCGAACACCTGGATATTATTTCTGTAATCCGTCAAACCATGATCTTTTAATCACGAATCAAAGGGAAGAATTTCTAGCTCCTCTGAACTGCAGTGATCAAAAAACTGTATCCTACACAACAACTCTACTTGAATCCTTTCAAGAAGTTACGGATATTCAAAGCGAAGGTCCCAAAACATTCAGTTGTAACCATTGCTCTgttattttctcccaaaaactTCAACTGCGAGAACATCTCCGAACGCACACCGAAGAGAAACCATACAGCTGCAGTCACGGTTCGGCCTCCTCCTCCCAAAAAACCGATTTAAGGCAGCATATGCGAATTCACACCGATGAGAAACCATTCAGCTGCAGTGAGTGTTCTGCTTCTTTTTCCCGAAAAGTAAATTTGATGAGGCATATAcgaacgcacactggtgagaaaccattcagctGCAGTCACTGTCCGTCTTCCTTCtcccaaaaatccgatttaaggCGGCATATGCGAATTCACACCGATGAGAAACCATTCAGCTGCAGTGAGTGTTCGTCTTCTTTTATCCGAAAACAACAGTTGATGAGCCATGTACGAAtgcacactggtgagaaaccattccgTTGCAGTCAGTGTTCTGCTTCTTTTTCGGATACGAGCTATTTATCAAAGCACATGGGTGTGCACACTGATGAAAAACCATTCACCTGCAGTCAGTGTTCGGCGTGTTTTCGGAGAAAACTCAGTTTAATGCGTCATATGCGAAggcacactggtgagaaacaATTTAGTTGCAGTCACTGTTCGGCTTCTTTCTCCCAAAAACCGCATTTAAGGAGCCACATGAAAACGCACatcggtgagaaaccattctgTTGCAGTCACTGTTCGGCTTCTTTCTCCCGAAAAGCCTATTTAAAGGTCCATATTcgaacgcacactggtgagaGACCATACAGTTGCAGTCACTGTTTGGCTTCTTTCCCCGAAAAACAAAGTTTAAAGTACCATTTGCTAACGCACACtcgtgagaaaccattcagttgcagtcagtGCTCGGCTTCATTCTCCCATAAACCGCATTTAAGGAGCCACATGCAAACGCACACTGGCGAGGAACCATTCTGCTGTATTCAGTGTTCGGCTTCTTTTTCccgaaaacaaaatttgatgagGCATATACGAACGCACAGTGGTGAGAAACTATTCAACTGCAGTCACTGTTCGTCTTCCTTCtcccaaaaatccgatttaaggCGGCATATGCGTATTCACACCGATGAGAAACCATTCAGCTGCAGTGAGTGTTCGTCTTCTTTTGCCCGAAAAGAACAGTTGAAGAGCCATGTAcgaacgcacactggtgagaaaccattccgTTGCAGTCAGTGTTCTGCTTCTTTTTCGGATCGGAGCTATTTATCAAAGCACATGCGTGTGCACACTGATGAAAAACCATTCACCTGCAGTCAGTGTTCggtttctttttctagaaaacTCAGTTTAATGCGTCAtatgcgaacgcacactggtgagaGACCATACAGTTGCAGTCACTGTTCGGCTTCTTTccccgaaaaacaaaaattaaagtaccatttgcgaacgcacactggtgagaaaccattcagttgcagtcagtGTTCGGCTTCTTTCTCTCATAAGCACACCTTACAAGGGCACATAAgaacgcacactggtgagaaaccattcagttgctgTCAGTGTTTGACTTCTTTCAGACTTAAATCAACTTTAACTAGGCACATGCGTTCGCACACTGgggagaaaccattcagttgcagtcaatGTTCGGCTTCTTTTTCCGATGAATGCAGTGATACGTGCGACCAGAAATCCTCTCGACACAAAGCAGACTGCCGTAGAAGCTCACTATCCGGTATGAATCCGTTGGCGGATACTACCCTCCAAATTTCTCAAACCGATCACTCGCCGATCGAACGTAATCCACCAATCGAATCATCAGCTTCTGGAAGAGGATCCCCCACTGTTGACTTTCTCTTTGTGAAATGTGAAGACGGGCTAAATTTTTATGACGAAAGTATCCCGCCAAGGCAAGTTGAGACGATTCCTTCCAGTTCTTTAGCAAACCCTCTCTTTGTAAAATGTGAAGACGGACTCAATCGTTGTGACGAAAAAACAAAGCCCAGGCAACTTGCGACGAGTCCATTCAGATCTTCAGCAGACCCTCTTCTTATAAAATGTGAAGACGAACTCAATTTATGTGTCAAAAATAGGCAACCCAGGGCACTTGAGACGATTCCATCAAAATCTTCAGCAACCCTTTTGACCACATTCAAAAACGAACCaggcgaagaaaattttattccgATCGACCCCTTACATACCAATAATGACATCAAAAGTGAATCGTCGTTGCAATCCTTCGAAGGTCAATATGTCATAAACGCCGAGAATGAAGCGCGAACACCTGAAGATTATTTCTGTAATCTGTCAAACCATGATCTTTCAATCACGAATCAAAGGGAAGAATTTCTAGCTCCTCTGAACTGCAGTGATCAAAATACTGTATCCTACGCAACAACTCTACTTGATTCCTTACATGAAGTTATGGATATTCAAAGCGAAGGTCCCAAAACATTCAGTTGTAACCATTGCTCTgttattttctcccaaaaactTCAACTGCGAGAACATCTCcgaacgcacactggtgagaaaccatttagTTGCAGTGAGTGTTCGGCTTCTTTCTCTCATAAGCACACCTTACAAGGGCACATAAgaacgcacactggtgagaaaccattcagttgctgTCAGTGTTTGACTTCTTTCAGACTAAAATCAAGTTTAACTAGGCACATGCGTTCGCACACTGgggagaaaccattcagttgcagtcaatGTTCGGCTTCTTTTTCCCGAAAATACCATTTGAAGAGCCATATAcgaacgcacactggtgagaaaccattcagctGCAGTGAGTGTTCGGCTTCTTTTTCCCGAAAAGTAAATTTGATGAGGCATATAcgaacgcacactggtgagaGACCATACAGTTGCAGTCACTGTTCGGCTTCTTTCCCCGAAAAACAAAGTTTAAAGTACCACTTGCAAAagcacactggtgagaaaccattcagttgcagtcagtGTTCGGCTTCTTTCTCCCATGAACCGCATTTAAGGAGCCACATGCAAACGCAcaccggtgagaaaccattcttTTGCAGTCACTGTTCGGCTTCTTTCTCCCGAAAAGCCGATTTAAAGGTCCATATTcgaacgcacactggtgagaaaccattcagctGCAGTGAGTGTTCGGCTTCTTTTTCCCGAAAAGTAAATTTGATGAGGCATATAcgaacgcacactggtgagaaaaCATTCAGCTGCAGTCACTGTCCGTCTTCCTTCtcccaaaaatccgatttaaggCGGCACAGGCGAATTCACACCGATGAGAAACCATTCAGCTGCAGTGAGTGTTCGTCTTCTTTTATCCGAAAAAAACAGTTGAAGAGCCATGTAcgaacgcacactggtgagaaaccattccgTTGCAGTCAGTGTTTTGCTTCTTTTTCGGATAAGAGCTATTTATCAAAGCACATGGGTGTGCACACTGATGAAAAACCATTCACCTGCAGTCAGTGTTCGGCGTCTTTTCCTAGAAAACTCAGTTTAAAGTCCCATTtgcgaacgcacactggtgagaaaccattcagctGCATTCAGTGTTCGGCTTCTTATTCcgaaaaaggaaatttgatgAGGCATATAcgaacgcacactggtgagaaaccattccgTTGCAGTCAGTGTTCTGCTTCGTTTTCGGATAGGAGCTCTTTATCAAAGCACATGTGTGTGCACACTGATGAAAAACCATTCACCTGCAGTCAGTGTTCGGCTTCTTTTTCCCGAAAAGAACATTTGATGAGGCAtatgcgaacgcacactggtgagaaaccattcagctGCAGTCACTGTTCGTCTTCCTTCtcccaaaaatccgatttaaggCGGCATATGCGAATTCACACCGATGAGAAACCATTCAGCTGCAGTGAGTGTTCGTCTTCTTTTTCCCGAAAACAACAGTTGAAGAGCCATGTAcgaacgcacactggtgagaaaccattccgTTGCAGTCAGTGTTTTGCTTCTTTTTCGGATAGGAGCTATTTATCAAAGCACATGCGTGTGCACACTGATGAAAAACCATTCACCTGCAGTCCGTGTTCGGCTTCTTTTTCTAGAAAACTCAGTTTAATGCGTCATATGCGAAGGCACACTGGTGAGAGACCATACAGTTGCAGTCACTGTTCGGCTTCTTTCCCCGAAAAACAAAGTTTAAAGTACCACTTGCAAAagcacactggtgagaaaccattcagttgcagtcagtGTTCGGCTTCTTTCTCCCATGAACCGCATTTAAGGAGCCACATGCAAACGCAcaccggtgagaaaccattcttTTGCAGTCACTGTTCGGCTTCTTTCTCCCGAAAAGCCTATTTAAAGGTCCATATTcgaacgcacactggtgagaaaccattcagctGCAGTCATTGTTCGGCttctttttcccaaaaaaatagtttaaatAGCCATGTGTTAACGCAtgctggtgagaaaccattcagttgcagtgaGTGTTCGGCCTGTTTCTGCCAAAAATCCAATTTAAGGAGGCATATTAAGCGAAttcatactggtgagaaaccattcactTGTAGTCAGTAA
- Protein Sequence
- MNPLADITLQISSTDHSPIERNPPIESSASGRGSPSVDFFFVKCEDGLNFYDESIPPRQVETIPPSSLANPLFVKCEDGLNRCEENTKPRQLATSPSRSSADPRLIKCEDELTLCVKNWQPRTLETIPSRSSATLLTTFKREPGEENYIPIDPLYTNNDIKRESSLQSFEGQNVVNAENEARTPGYYFCNPSNHDLLITNQREEFLAPLNCSDQKTVSYTTTLLESFQEVTDIQSEGPKTFSCNHCSVIFSQKLQLREHLRTHTEEKPYSCSHGSASSSQKTDLRQHMRIHTDEKPFSCSECSASFSRKVNLMRHIRTHTGEKPFSCSHCPSSFSQKSDLRRHMRIHTDEKPFSCSECSSSFIRKQQLMSHVRMHTGEKPFRCSQCSASFSDTSYLSKHMGVHTDEKPFTCSQCSACFRRKLSLMRHMRRHTGEKQFSCSHCSASFSQKPHLRSHMKTHIGEKPFCCSHCSASFSRKAYLKVHIRTHTGERPYSCSHCLASFPEKQSLKYHLLTHTREKPFSCSQCSASFSHKPHLRSHMQTHTGEEPFCCIQCSASFSRKQNLMRHIRTHSGEKLFNCSHCSSSFSQKSDLRRHMRIHTDEKPFSCSECSSSFARKEQLKSHVRTHTGEKPFRCSQCSASFSDRSYLSKHMRVHTDEKPFTCSQCSVSFSRKLSLMRHMRTHTGERPYSCSHCSASFPEKQKLKYHLRTHTGEKPFSCSQCSASFSHKHTLQGHIRTHTGEKPFSCCQCLTSFRLKSTLTRHMRSHTGEKPFSCSQCSASFSDECSDTCDQKSSRHKADCRRSSLSGMNPLADTTLQISQTDHSPIERNPPIESSASGRGSPTVDFLFVKCEDGLNFYDESIPPRQVETIPSSSLANPLFVKCEDGLNRCDEKTKPRQLATSPFRSSADPLLIKCEDELNLCVKNRQPRALETIPSKSSATLLTTFKNEPGEENFIPIDPLHTNNDIKSESSLQSFEGQYVINAENEARTPEDYFCNLSNHDLSITNQREEFLAPLNCSDQNTVSYATTLLDSLHEVMDIQSEGPKTFSCNHCSVIFSQKLQLREHLRTHTGEKPFSCSECSASFSHKHTLQGHIRTHTGEKPFSCCQCLTSFRLKSSLTRHMRSHTGEKPFSCSQCSASFSRKYHLKSHIRTHTGEKPFSCSECSASFSRKVNLMRHIRTHTGERPYSCSHCSASFPEKQSLKYHLQKHTGEKPFSCSQCSASFSHEPHLRSHMQTHTGEKPFFCSHCSASFSRKADLKVHIRTHTGEKPFSCSECSASFSRKVNLMRHIRTHTGEKTFSCSHCPSSFSQKSDLRRHRRIHTDEKPFSCSECSSSFIRKKQLKSHVRTHTGEKPFRCSQCFASFSDKSYLSKHMGVHTDEKPFTCSQCSASFPRKLSLKSHLRTHTGEKPFSCIQCSASYSEKGNLMRHIRTHTGEKPFRCSQCSASFSDRSSLSKHMCVHTDEKPFTCSQCSASFSRKEHLMRHMRTHTGEKPFSCSHCSSSFSQKSDLRRHMRIHTDEKPFSCSECSSSFSRKQQLKSHVRTHTGEKPFRCSQCFASFSDRSYLSKHMRVHTDEKPFTCSPCSASFSRKLSLMRHMRRHTGERPYSCSHCSASFPEKQSLKYHLQKHTGEKPFSCSQCSASFSHEPHLRSHMQTHTGEKPFFCSHCSASFSRKAYLKVHIRTHTGEKPFSCSHCSASFSQKNSLNSHVLTHAGEKPFSCSECSACFCQKSNLRRHIKRIHTGEKPFTCSQ
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -