Npan072494.1
Basic Information
- Insect
- Nemotelus pantherinus
- Gene Symbol
- ZFX
- Assembly
- GCA_963978885.1
- Location
- OZ022266.1:65571302-65583814[+]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 50 0.0034 0.91 12.2 5.6 1 23 116 138 116 138 0.97 2 50 0.00018 0.047 16.3 0.9 1 23 169 191 169 191 0.98 3 50 0.0003 0.079 15.6 1.5 1 23 197 219 197 219 0.99 4 50 0.16 42 7.0 3.1 3 23 227 247 225 247 0.98 5 50 0.02 5.4 9.8 1.2 1 20 254 273 254 274 0.96 6 50 0.72 1.9e+02 4.9 0.1 1 20 287 307 287 311 0.87 7 50 8.7e-05 0.023 17.3 0.7 1 23 318 340 318 340 0.96 8 50 0.0033 0.88 12.3 0.7 1 23 346 368 346 368 0.98 9 50 1.1e-05 0.0029 20.1 2.9 1 23 373 395 373 395 0.99 10 50 6.5 1.7e+03 1.9 2.4 1 21 402 422 402 423 0.89 11 50 1.3e-05 0.0034 19.9 2.2 1 23 561 583 561 583 0.98 12 50 5.3e-05 0.014 17.9 0.3 1 23 589 611 589 611 0.98 13 50 7.1e-05 0.019 17.5 4.5 1 23 615 637 615 638 0.96 14 50 1.3e-05 0.0034 19.9 4.7 1 23 643 665 643 665 0.99 15 50 0.016 4.2 10.2 1.3 1 23 671 693 671 693 0.98 16 50 0.0024 0.64 12.7 0.7 3 21 703 721 701 722 0.95 17 50 2 5.4e+02 3.5 0.2 1 21 734 755 734 758 0.88 18 50 0.0007 0.19 14.4 0.2 1 23 765 787 765 787 0.96 19 50 1.7e-05 0.0044 19.5 0.5 1 23 793 815 793 815 0.98 20 50 0.0026 0.7 12.6 3.0 1 23 825 847 825 847 0.99 21 50 0.00011 0.029 16.9 1.2 1 23 853 875 853 875 0.98 22 50 1.1 2.9e+02 4.4 1.6 1 11 908 918 908 922 0.87 23 50 1.7 4.6e+02 3.7 0.2 1 21 941 962 941 965 0.88 24 50 0.00025 0.068 15.8 4.6 1 23 972 994 972 994 0.96 25 50 0.0022 0.58 12.9 1.7 1 23 1000 1022 1000 1022 0.96 26 50 0.00042 0.11 15.1 5.5 1 23 1035 1057 1035 1057 0.97 27 50 5.3e-05 0.014 17.9 0.2 1 23 1063 1085 1063 1085 0.98 28 50 0.39 1e+02 5.8 2.1 1 23 1091 1113 1091 1113 0.98 29 50 4.3 1.1e+03 2.5 0.7 1 15 1120 1134 1120 1140 0.79 30 50 0.092 25 7.7 0.2 1 23 1153 1177 1153 1177 0.92 31 50 0.00035 0.093 15.4 0.4 1 23 1184 1206 1184 1206 0.96 32 50 9.6e-05 0.026 17.1 0.7 1 23 1212 1234 1212 1234 0.96 33 50 0.00046 0.12 15.0 2.0 1 23 1244 1266 1244 1266 0.99 34 50 2e-05 0.0052 19.3 2.5 1 23 1272 1294 1272 1294 0.98 35 50 0.0003 0.08 15.6 2.7 2 23 1301 1322 1300 1322 0.97 36 50 0.75 2e+02 4.9 3.4 1 20 1329 1348 1329 1349 0.91 37 50 2 5.4e+02 3.5 0.2 1 21 1362 1383 1362 1386 0.88 38 50 5.1e-06 0.0014 21.1 1.7 1 23 1393 1415 1393 1415 0.96 39 50 0.15 39 7.1 3.2 1 23 1421 1443 1421 1443 0.98 40 50 0.027 7.2 9.4 2.2 1 23 1456 1478 1456 1478 0.96 41 50 1.3e-06 0.00036 23.0 1.9 1 23 1484 1506 1484 1506 0.99 42 50 0.054 14 8.5 1.5 1 23 1512 1534 1512 1534 0.97 43 50 0.00058 0.15 14.7 0.9 1 21 1542 1562 1542 1563 0.97 44 50 0.13 35 7.3 0.5 1 23 1575 1599 1575 1599 0.92 45 50 0.0067 1.8 11.3 2.5 1 23 1606 1628 1606 1628 0.96 46 50 0.00014 0.038 16.6 0.9 1 23 1634 1656 1634 1656 0.97 47 50 0.011 2.9 10.7 3.8 1 23 1665 1687 1665 1687 0.99 48 50 0.0039 1 12.1 1.0 1 23 1693 1715 1693 1715 0.98 49 50 0.00096 0.25 14.0 2.8 1 23 1721 1743 1721 1743 0.98 50 50 0.068 18 8.1 0.5 1 20 1750 1769 1750 1770 0.95
Sequence Information
- Coding Sequence
- ATGAGTGTTCTGGGTGACCTGTTCTCTGAAGCCGGCAATATTTTATCCGTAAAGGAAGAAGTTGAAATCACCGAAACACAGCTGCAAGagcaaatttgtgatgaagtAGAAGTGATACAAAATGATATTGGGCAAAGAGGAAAAGTCGAAAAACGGAATGAAATCCGAACTGCACTATGCCACGAAACAGAAGATAATCAGCCTAAATTAAGTGGTGGTGAAACGAATGCAGAACCGAAAAGTGAGACTATTAAGAAACGATCCAATGAGTGCAATCTATTTCCGCAGGATGCCGCCAGCTCAGTTCGGGCACCTGTTGAAACTTGTTTGCCCCTGAAAAACAACCATACATGCTCTGTTTGCGAGAAGTCATTCAGATGTAGATCCAAGCTTGAAAGGCATATTAAATGGCATAAGTGCAATCGGCCGTATGCATGCATAAAATGTCCTAAAAGGTACCTCACTCTCAGGAAATTACGCAGGCATAAGTTGCTTTGCACTCGTTATAAATGTACACTATGTGTAAAAACTTTTGGCACGGAAGCCACACTGAATAGTCACATGAATTTTCATCGTGGCGAACGTCCCTTCAAATGCGAAGTCTGTCCACTGTCGTTTATGAGCAAATATACACTTTCGCAACACAGCAAAACGCATTCGGATGACAAGAAATTTGGGTGTTCATTGTGCAatagaaaattcaatttcaatcggGAACGACGTAATCATGAAAAGACGCACTCAAATAACAGCTACAATTACCAATGTGACACTTGTAAGAAATATTTTAGATTGGAGGATCAATATCGTAGCCATTTGGATGATAATAAGTGCCAAGGATATGCTAGCGAAAACCCATTTCAATGTCCCAAGTGTGACAATGAAACATTCGCAAGCATTGAACCTCTCAACCAGCACAAAGTCGAAGTCCATCCACCCCCGAAAAAAGACCATGCATGTTCAATTTGTGCGAAAGCCTACAGGCGCAAATCTGAACTAGACATCCACATGAATTCTCATCGTGACAAACGTCCTTTCAAATGTGAAGTCTGTTCCAAGCCGTTTTGGAGCAACGAGCTacttttaaaacacaataaaacGCATTTGGGCAGGAAGTATGAATGCACAATGTGCAATAAAACATTCACTGACAATAGCAATCGGCGTCAGCACGAGAAAACGCACTCAAACCGCCGTGAGCGTTATCATTGCATCACCTGTAAGAAGTATTTTAAGCTAGAAGATCAATTTCGTAGCCATTACCAAGATAATAAGTGCGAAGGAAATTCACAAGAGCTCCATGATCATGAAAAACGGACTCAAGCTCAAAATGAAAGAGAAAATATTGGAACTGCTGGAAAATCCCAGAACGATTATATTTTCGAGAAGAAATACAAATGTGACGAATTTATATCTGTAAAAGAGGAAGTTGAAATCTGCGAAACAAAGTTACTAGAACAGATTTTCGATGGATTCGAAGTGCCACATGTTGACCCTTGTgttaaaatcgaaaaagaaattggAGTTGTAAACGAAATATTCCACACAGAAGATAATCAAACCGAATTAAACGATTGTGAAGGGATTAGGGAGCAACCATACGAGTGCTACTTACGACCGGATAATTCCACCAGTTCAGCTCGAAAACATGACAATATTAGGCCGTCCTTGAAAAATAATCATACATGCTCAATTTGTGCGAAATCATTCAGGTCGAGTTCAAGGCTTAAAGTGCATATGAATGTTCATACTGGCAGTCAGCCGTATGCATGCACAATATGCCCTAGAAAGTATGCCAGTCATGATGGCCTACGCGCGCACATTTTAACTCACGGTTCCGGATACAAATGTACGTTTTGTGAAAAGTCTTACAAATCCAGATCCGGAATGAATAAGCACATTAAAATTCATCATGGCGAACGTAATTTCCAGTGTGAAATCTGTTCCAAATCATTCACGAGCAAGTATCTACTCTCGTCACACCACAAAACGCATTcggataaaaagaaatatgagtGCACGTTGTGCAGTGAAAAATTCGCAGAGAATAGCAAGCGGCTTCGGCACGAAAAGACGCACACAAATAACCATAGCAATTATTATGGatgtgaaaattgtaaaaagtatTTTAGATCAGAGGACCAGTTTCGTAGCCATATGAAAGGTAATAAGTGCAAAGGATATGCTTCAGAAAAACCATTTGAATGTTCCAAGTGTGAAAATGAATCATTCGTAAGCATTGAGCTCCTAAACCAGCACAAAGTCGAAGTTCACCCGCCCCCGAAAAAAAACCATGCATGCTCAGTTTGTGCGAAAGCCTACAGGTATGAGGCTGAGCTTCGCGTGCATATGAACGTTCATACTGGCAAGCGGCCGTACGCATGCACAATATGTTCAAAGGCGTTCATCTGGCGCAGTAACTTACGCAAGCACATGTTAACTCACAGCGCAGAGGCGAAGAATCGCTCCAAGTTCAAATGTACACTCTGCGAAAAGCTTTGTAGTTCAAGTGGCCGACTGAATAGTCACATGAATATGCATCAAGGCGAACGTCCTTTCAAGTGCGACACCTGTCCCAAAACGTTTATGAGCCCAGATACACTCTCGTCACACAAAAAACTGCATGCAGATAATAGGGAAAATGAGTTATGCAATCAAAAACTTACAGACAATAACAAGCGGCGTAAGCTCGAGAAGACGCACAAAAATAACAATGTGAAATATTATTGTCCAAAATGTAAAAAGTATTTTACAATGGAGGATCAGCACCGCTGCcatttggaaaataataaatgcAAAGGATATGCTTCAGAACAGCTATTTGAATGTACCAAGTGTGAGAATGAATCATTCGTAAGCATTGAGCTCTTAAACCAGCACAAAGTCGAAATCCACCCGCCTCCGAAAGAAAATCACATATGCTTGATTTGCACGAAAACATACAAGCATAAGTACTTGCTTAACATACATATGAACTCTCACACTGGTAGTCGGCCTTATACATGCCCAATTTGTCGAAAACGGTTCGCCTGCCCCGGAAGCTTACGCCGGCACAAGTTAGATCACAACTTCTTAAGCGCAGAAAGGAAGATTCGTCACAGATTCAAATGTACGCTCTGTGAAAAGTCTTACTACTCAAAAACCAATCTGAATAGTCACATGCATTTTCATCGTGGCGAACGTCCTTTCAAATGCGAAGTATGTCCCAAATCATATACGAGCGCAACATCACTCGGATCACACATTATTAAGCATTTTGATGAGAAGAAATTTGAGTGCACGTTATGCAATTTAAAATGCAGAGGCAATGGTGAGCGGCGTATTCACGAGAAAACGCATACAAATCGCCATATGAATTATCAATGTGAAAAGTgtaaattgtattttagattggAAGACCAGCACCGCAGCCATGTGGAAGATAATAAGTGCAAAGGATATGCTTCAGAAAAGCCATTTCAATGTTCCAAATGTGGAAATGAATCATTCGTAAGCATTGAGCTCCTAAACCAGCACAAAATCGAAGTTCATCCGCCCCCGAAAAAAAACCATGCATGCTCAGTTTGTGCGAAAACCTACAGGTATGAGGCTGAGCTTCGCGTGCATATGAACGTTCATACTGGCAAGCGGCCGCACGCATGCACAATATGTTCAAAGGCGTTCATCTGGCCCAGTAGCTTACGCAAGCACATGTTAACTCACAGCGCAGAGGCGAAGAATCGCTCCAAGTTCAAATGTACACTCTGTGAAAAGCTTTTTAGTTTAAGGAGCCGACTGAATAGTCACATGAATATGCATCGAGGCGAACGTCCTTTCAAGTGCGAAACCTGTCCTAAGACGTTTACGAGCCCAGATGCACTCTGTTCACACAAAAAACTGCATGCAGATGATGGGGTAAGTGAGTGCACGTTATGCAATCAAAGATTTACAGACAACAGCAATCGGCGTAGGCACGAAAAGACGCACACAAATAGCCAATTAAAATATCAGTGTGCAAAATGTAAAAAGTATTTTAGATTCAAGGATCAGCACCGCAGCCATTTGGAAGATAATAAATGCAAAGGATATACTTCAGAAAAGCCATTTGAATGTTCTAAGTGTGAAAATGAATCATTCGTAAGTATTGAGCTCCTAAACCAGCACAAAGTCGAAGTTCATCCACCcccgaaaaaaaatcatgcatgCTCGATTTGTGAGAAATCATTCAGGTCGAGTTCAAGGCTTAAAGTGCATATGAACGTTCATACCGGCAATCGACCGTATAAGTGCACAGTGTGTCCTAAAAAGTACGGCTGCCCCCAGAGCTTGCGCTATCACAAGTTAACTCACAGCTTATTAAGCGCAGATAGGAAGATTCGTGGCAGATTCAAATGTACGCTCTGTGAAAAGTCTTACTTATCAAAAATCAGTCTGAATAGTCACATGTATTTTCATCGTGGTGAACGTCCTTTCAAGTGCGAAATATGTCCCAAGTCGTATATGAACCCACACACACTCAAATGGCACATGAAAACGCATTTTGATGAAAGAAAATATGAGTGCACGTTGTGCAGTAAAAAATTCGCAAAAGATACCGAGCGGCTTAGGCACGAGAAGATGCACACAAATAACTATAGAAAGTATTATGAATGTGGAAATTGTAAAAAGTATTTTAGATCAGAGGACCAGTATCGCAGCCATATGAAGGGAAATAAGTGCAAATGCAATGCTTCAGAAAAGCTATTCAAATGTTCCAAATGTGAAAATAAATCATTCGTAAGCATCGAGCTCCTAAACCAGCACAAAATCGAAGTTCATCCACCCCCGAAGAAAAATCATGCATGCTCAGTTTGTGCGAAAGCATTCAGGTGTAAATATGAGCTTCGCGTCCATATGAACATCCATACTGGTAGCCGGCCGTATGCGTGCACGGTATGTCCTAAAAAGTACAGTTCTCCCTTCACCTTACGCAACCACAAGTTGTCTCACGCAGGGGCGAAGAATCGTCCCAAGTACAAATGTACGCTGTGTGAAAAATTATGCAGTTCAAGGGGCCGACTGAGTAGTCACATGAATAAGCATCGAGGAGAACGTCCTTTCAAGTGCGAAACCTGTCTCCAGAAGTTCATTAGCCCAGAGTCTCTCTAtgcacataaaaaaaatcattcgaaTGACAAGAAATATGAATGCACCCTGTGCAATAAAAGATTTATTTATAACATCCATTGGCGTAAGCACGAGATGAGCCACTTAAATCACCGTTTGCCTTACCAATGCGACGTCtgtaaaaagtattttcaattGGAAGATCAGTATCGCAGCCATTTGGCAGATAATAAATGTAACGGGAACTCGCAAGAGTCGCATGATCGAGAAAATGACAAATCTGAAATCTGA
- Protein Sequence
- MSVLGDLFSEAGNILSVKEEVEITETQLQEQICDEVEVIQNDIGQRGKVEKRNEIRTALCHETEDNQPKLSGGETNAEPKSETIKKRSNECNLFPQDAASSVRAPVETCLPLKNNHTCSVCEKSFRCRSKLERHIKWHKCNRPYACIKCPKRYLTLRKLRRHKLLCTRYKCTLCVKTFGTEATLNSHMNFHRGERPFKCEVCPLSFMSKYTLSQHSKTHSDDKKFGCSLCNRKFNFNRERRNHEKTHSNNSYNYQCDTCKKYFRLEDQYRSHLDDNKCQGYASENPFQCPKCDNETFASIEPLNQHKVEVHPPPKKDHACSICAKAYRRKSELDIHMNSHRDKRPFKCEVCSKPFWSNELLLKHNKTHLGRKYECTMCNKTFTDNSNRRQHEKTHSNRRERYHCITCKKYFKLEDQFRSHYQDNKCEGNSQELHDHEKRTQAQNERENIGTAGKSQNDYIFEKKYKCDEFISVKEEVEICETKLLEQIFDGFEVPHVDPCVKIEKEIGVVNEIFHTEDNQTELNDCEGIREQPYECYLRPDNSTSSARKHDNIRPSLKNNHTCSICAKSFRSSSRLKVHMNVHTGSQPYACTICPRKYASHDGLRAHILTHGSGYKCTFCEKSYKSRSGMNKHIKIHHGERNFQCEICSKSFTSKYLLSSHHKTHSDKKKYECTLCSEKFAENSKRLRHEKTHTNNHSNYYGCENCKKYFRSEDQFRSHMKGNKCKGYASEKPFECSKCENESFVSIELLNQHKVEVHPPPKKNHACSVCAKAYRYEAELRVHMNVHTGKRPYACTICSKAFIWRSNLRKHMLTHSAEAKNRSKFKCTLCEKLCSSSGRLNSHMNMHQGERPFKCDTCPKTFMSPDTLSSHKKLHADNRENELCNQKLTDNNKRRKLEKTHKNNNVKYYCPKCKKYFTMEDQHRCHLENNKCKGYASEQLFECTKCENESFVSIELLNQHKVEIHPPPKENHICLICTKTYKHKYLLNIHMNSHTGSRPYTCPICRKRFACPGSLRRHKLDHNFLSAERKIRHRFKCTLCEKSYYSKTNLNSHMHFHRGERPFKCEVCPKSYTSATSLGSHIIKHFDEKKFECTLCNLKCRGNGERRIHEKTHTNRHMNYQCEKCKLYFRLEDQHRSHVEDNKCKGYASEKPFQCSKCGNESFVSIELLNQHKIEVHPPPKKNHACSVCAKTYRYEAELRVHMNVHTGKRPHACTICSKAFIWPSSLRKHMLTHSAEAKNRSKFKCTLCEKLFSLRSRLNSHMNMHRGERPFKCETCPKTFTSPDALCSHKKLHADDGVSECTLCNQRFTDNSNRRRHEKTHTNSQLKYQCAKCKKYFRFKDQHRSHLEDNKCKGYTSEKPFECSKCENESFVSIELLNQHKVEVHPPPKKNHACSICEKSFRSSSRLKVHMNVHTGNRPYKCTVCPKKYGCPQSLRYHKLTHSLLSADRKIRGRFKCTLCEKSYLSKISLNSHMYFHRGERPFKCEICPKSYMNPHTLKWHMKTHFDERKYECTLCSKKFAKDTERLRHEKMHTNNYRKYYECGNCKKYFRSEDQYRSHMKGNKCKCNASEKLFKCSKCENKSFVSIELLNQHKIEVHPPPKKNHACSVCAKAFRCKYELRVHMNIHTGSRPYACTVCPKKYSSPFTLRNHKLSHAGAKNRPKYKCTLCEKLCSSRGRLSSHMNKHRGERPFKCETCLQKFISPESLYAHKKNHSNDKKYECTLCNKRFIYNIHWRKHEMSHLNHRLPYQCDVCKKYFQLEDQYRSHLADNKCNGNSQESHDRENDKSEI
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -