Ecoc020511.1
Basic Information
- Insect
- Endomychus coccineus
- Gene Symbol
- nfxl1
- Assembly
- GCA_958510875.1
- Location
- OY294067.1:305339913-305355926[+]
Transcription Factor Domain
- TF Family
- zf-NF-X1
- Domain
- zf-NF-X1 domain
- PFAM
- PF01422
- TF Group
- Zinc-Coordinating Group
- Description
- This domain is presumed to be a zinc binding domain. The following pattern describes the zinc finger. C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C Where X can be any amino acid, and numbers in brackets indicate the number of residues. Two position can be either his or cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 20 0.97 3e+04 -3.1 0.6 15 18 119 122 118 123 0.78 2 20 2 6.2e+04 -5.0 1.9 15 19 161 165 161 165 0.85 3 20 0.0044 1.3e+02 4.4 1.2 4 12 196 204 195 204 0.94 4 20 3e-07 0.0092 17.7 16.0 1 19 210 227 210 227 0.99 5 20 2 6.2e+04 -4.2 0.6 5 6 256 257 253 260 0.45 6 20 5.5e-07 0.017 16.9 13.5 1 18 263 280 263 281 0.92 7 20 1.4 4.2e+04 -3.6 1.0 6 10 306 310 305 310 0.88 8 20 7.4e-08 0.0023 19.6 7.5 1 18 316 333 316 334 0.99 9 20 1.3 3.9e+04 -3.5 1.0 5 10 358 363 358 363 0.86 10 20 9.7e-07 0.03 16.1 7.4 1 18 369 386 369 387 0.96 11 20 4.8e-05 1.5 10.7 15.2 3 18 423 438 421 439 0.91 12 20 2.7e-10 8.4e-06 27.4 14.1 1 18 448 465 448 466 0.97 13 20 0.19 5.8e+03 -0.8 4.1 5 11 494 500 494 506 0.86 14 20 0.00036 11 7.9 6.3 10 19 515 524 512 524 0.90 15 20 0.047 1.5e+03 1.1 5.5 3 11 535 543 534 544 0.91 16 20 2 6.2e+04 -4.7 3.5 14 19 573 578 573 578 0.86 17 20 1.6 4.9e+04 -3.8 0.5 6 10 607 611 607 612 0.88 18 20 2.7e-05 0.85 11.4 6.7 1 12 617 628 617 628 0.97 19 20 8.1e-06 0.25 13.1 18.4 4 19 658 673 656 673 0.93 20 20 4.1e-05 1.3 10.9 11.1 1 18 716 735 716 736 0.89
Sequence Information
- Coding Sequence
- atgaacaATCGTCCTAAACCTGTTAATCCGTGGAATCGGAATGTACAACAGCCAAAACAATCGAAACCAAAATCTAACCTaaataaaaattccaattctgaaattaaatttaaagaacAACAACAAAAGTTGCAAACAGCTGTCCAAAAATTTGCTTATGAATCTTCATCGGAGGAAGATGAAATCGAAACGGAAGATCTAATTAGTCGAGTTCTAGAAAATTATACGAATACTGGAGGAGAACACGAAAAGCTGATCCGCACTACAGCATATTTAAAAGATTCTTTTCTATCAGGTGCAGCAACATGTCTAATTTGTATCTCAAGGATAAAACGTGATGATGAGATATGGAGTTGTCAagaatgttattgttttttccACTTGATGTGTATTAACAAATGGTCCAAAGATAACATAAGTCAGAAAAAAAATGCTCTGGAGGGACAAATAGTAATGCGTGAGATAGTATTATGTTGGGGATGTCCAAAATGCAGACACAATTATGAACCTAATAATATCCCTTCCAGATATGAATGTTTTTGTAAGAAGACAATTAATCCCAAGTATCAATCTTTATTAGTTCCTCATTCTTGTGGTGATATTTGCCACAAAAATCTGAAACCCGAGTGTGGTCACCAGTGCCTCTTACTTTGTCATCCAGGATCTTGTCCACCATGTCCAGTGACAGTGAATTGCACCTGCTTCTGTGGTTCGAAGCTTCCACGATTACAAAGATGTAGTGATAAAAATTGGTCTTGTGGCGAGACATGCAAGAAACTATTATCATGTGATAAACACAATTGTCCTGATATTTGTCATCCAGGCGATTGTAAACCTTGTGAGAAGAAGAGCATTCAGAAATGTATTTGCAAAAGCCAGATGAAGTTGAGAGAATGTGCGTCACCCATTTGGCAATGTGATAAAgTATGCAACAAATTATTGGATTGTGGTAAACATAATTGTTTGGAGGTATGTCATATTGGTGTATGTGATGTATGCATATTTTCCAAACCAAGAACGTGTCCTTGTGGtaaaacaaattatgttttACCTTGTACAGAAGACACACCAACATGTCCAGATACTTGTGATAAGATTTTGGAATGTGGCATGCACACTTGCAGTTTGAAATGCCACAAAGATAAATGTGATCTTTGTTTAGAAATCGTGGAGAAGCGTTGTAGGTGCAGCTTGCATCTGAAGGAAGTTCAGTGCAGCAAACCATATCTTTGTGAGACAAAATGCAAGAACCTGCGAGACTGTTACAGGCATCCCTGCAATAGAAAATGTTGTAACGGTGATTGTCCGCCTTGTGAAAAACAATGTGGCCGGACGTTGAAATGTGGTAATCACAAGTGTCCTTCGGTATGTCACAGAGGCCCGTGCTATCCCTGTAATTTAACTGAAACGGTTTCTTGTCGTTGTGGAGGAACGATGATAACCGTGCCCTGTGGACGAAAAAATAAAGTGAAACCTccaaaatgtttaaaattatgTCATGCACCTACCGATTGTCATCACAAAGAAAGAACATCTCATCGTTGTCATTTCGGCGAGTGTCCTCCGTGCAAACAAATTTGCGAGCAACGCAGAGAGAAATGCGAGCATCCATGCACTAGTCCTTGTCACTATTACGTTTACGTAGATGTGGAAATTGCACAGAAACCCTCCATGCCCTGGGAACAGATCAAACCACAAAAAGAAAAACGCCGAATGCCCTGTCCAGATTGTAAGGTACCGGTAGCAGTCACTTGTTTGGGTGGACATGAGACTACGTCGTGGCCTTGTTACGAGACAACATCGTCCAGAGTGGCTTGTTGTCAGAGGGCGTGCGGACGTCTTTTGGCTTGTGGTAATCATACATGTTCTATCACTTGCCATTCTGTGGATTATGCTGATGGTTCGACGGTTGGTACTAATTGCGAAGTGTGCGACAGTGATTGTACCAGAGCGAGACCGGAAGGTTGTGTTCATGTCTGTCCGAAACCGTGCCATCCAGGACCTTGTCCTCCCTGCAAACAGATGGTGCGCATCAAATGTCATTGCGGTGTGACGAATCCCTACGTTTCGTGCGCAGAATGGTTGATTGTCAATAAACAGGAGGAATTACAAAGTTGCGGCAATCAATGTCCCAAGAATTTCGCTTGCGGTCACCGATGTCGCACCAATTGTCACTCTGGTACATGCCCCAACGAGGCCTCGTGTAATAAGAAGGTGAAACTATCTTGTAAATGTAAGAGAATCAAGAAAGAATTCGGTTGTCAAGCAGTGCGTGTGAATGACGCCAAAGTGGAGTGTGACGATGTGTGTAAACAAATAAAACTGAAAAGAATAACGCTTATCCAGGAAGAAAACGAACAGAAATGTGCGTTAGATGCGATACGCAATAAGGAGGAGCTAGAGAAGTTCAAGAGGAAGTTTGAAGGAAAGAAGAAATCAAGAGTgaagtttgaagaagatcatAAAGATGAAAGTAAATTGGAGAGATACGTCCTGGTGGGGGCCATAGTTGTAGTTGTGTCTTTAGCTTGTGGTTTTTTTAGTAGTTTCTACTAG
- Protein Sequence
- MNNRPKPVNPWNRNVQQPKQSKPKSNLNKNSNSEIKFKEQQQKLQTAVQKFAYESSSEEDEIETEDLISRVLENYTNTGGEHEKLIRTTAYLKDSFLSGAATCLICISRIKRDDEIWSCQECYCFFHLMCINKWSKDNISQKKNALEGQIVMREIVLCWGCPKCRHNYEPNNIPSRYECFCKKTINPKYQSLLVPHSCGDICHKNLKPECGHQCLLLCHPGSCPPCPVTVNCTCFCGSKLPRLQRCSDKNWSCGETCKKLLSCDKHNCPDICHPGDCKPCEKKSIQKCICKSQMKLRECASPIWQCDKVCNKLLDCGKHNCLEVCHIGVCDVCIFSKPRTCPCGKTNYVLPCTEDTPTCPDTCDKILECGMHTCSLKCHKDKCDLCLEIVEKRCRCSLHLKEVQCSKPYLCETKCKNLRDCYRHPCNRKCCNGDCPPCEKQCGRTLKCGNHKCPSVCHRGPCYPCNLTETVSCRCGGTMITVPCGRKNKVKPPKCLKLCHAPTDCHHKERTSHRCHFGECPPCKQICEQRREKCEHPCTSPCHYYVYVDVEIAQKPSMPWEQIKPQKEKRRMPCPDCKVPVAVTCLGGHETTSWPCYETTSSRVACCQRACGRLLACGNHTCSITCHSVDYADGSTVGTNCEVCDSDCTRARPEGCVHVCPKPCHPGPCPPCKQMVRIKCHCGVTNPYVSCAEWLIVNKQEELQSCGNQCPKNFACGHRCRTNCHSGTCPNEASCNKKVKLSCKCKRIKKEFGCQAVRVNDAKVECDDVCKQIKLKRITLIQEENEQKCALDAIRNKEELEKFKRKFEGKKKSRVKFEEDHKDESKLERYVLVGAIVVVVSLACGFFSSFY
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -