Dsim009750.1
Basic Information
- Insect
- Drosophila simulans
- Gene Symbol
- nfxl1
- Assembly
- GCA_016746395.1
- Location
- NC:4204433-4207664[-]
Transcription Factor Domain
- TF Family
- zf-NF-X1
- Domain
- zf-NF-X1 domain
- PFAM
- PF01422
- TF Group
- Zinc-Coordinating Group
- Description
- This domain is presumed to be a zinc binding domain. The following pattern describes the zinc finger. C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C Where X can be any amino acid, and numbers in brackets indicate the number of residues. Two position can be either his or cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 19 2 1.4e+04 -8.1 6.4 15 19 147 151 146 151 0.82 2 19 2 1.4e+04 -5.9 4.2 4 10 182 188 182 188 0.56 3 19 6.5e-08 0.00044 19.8 15.5 1 18 196 212 196 213 0.98 4 19 0.43 2.9e+03 -2.0 2.3 5 10 238 243 238 243 0.93 5 19 2.8e-08 0.00019 21.0 16.8 1 19 249 268 249 268 0.97 6 19 0.28 1.9e+03 -1.4 1.3 5 10 292 297 292 297 0.94 7 19 4.7e-08 0.00032 20.3 10.6 1 19 303 323 303 330 0.94 8 19 0.88 6.1e+03 -3.0 0.5 1 7 347 352 346 354 0.49 9 19 2.1e-08 0.00014 21.4 15.3 1 19 357 375 357 375 0.99 10 19 0.7 4.8e+03 -2.6 0.4 1 5 384 388 384 393 0.73 11 19 0.00036 2.4 7.9 17.2 1 18 409 426 409 427 0.97 12 19 3.3e-08 0.00023 20.7 14.7 1 19 436 454 430 454 0.90 13 19 0.00016 1.1 9.0 8.6 9 18 499 508 497 509 0.91 14 19 0.014 94 2.8 6.6 1 12 521 531 521 531 0.96 15 19 0.97 6.7e+03 -3.1 0.6 6 10 585 589 584 589 0.86 16 19 3.8e-06 0.026 14.2 5.4 1 12 595 606 595 606 0.97 17 19 0.028 1.9e+02 1.8 15.8 3 18 635 650 634 651 0.94 18 19 2 1.4e+04 -5.5 2.1 9 13 657 661 656 661 0.72 19 19 1.1e-06 0.0076 15.9 11.0 1 16 702 716 702 722 0.92
Sequence Information
- Coding Sequence
- atggaaaagtttaaTAAAGCGCAAGCTAAAAATTTGGCTGCCGCCCAAAAACTGGTGGACACCTACGCTTCCAGCTCCGAGGATGAAGGCGAACTGGATGAGAATCACATTTTGGAACTCCTATACAAGAACTACATACCGACGGACAACGCAGGAAGCTCCAAGGATGCCGCACGCACCACTACGTTCCTGGAAAACACTCTCCACTCGGGAGCGGCAACCTGTCTCATTTGCATCGGAAGCATCCGGCGGGTGGAGGCCATTTGGTCCTGCGAGAGCTGTTACTGCTTCTTTCACTTGAAGTGCATCCAACGGTGGGCCAACGACAGCATGATGCAGATGAAGGTGAAGGCGGCGGAGCAGCAGAACGGCCAGGGCCACTACAATCATCTGGGCGAATTTGTGCCGCCCAAGCGACAGAAATCCCTGCACTGGTGCTGTCCCCAGTGCCGCAGGGACTACCAACCGGCCGATAAGCCCACGCAGTACAACTGCTTCTGCGGCAAGGAGGTGAACCCGGAGAACCAGCCCTTCCTTGTGCCCCACTCATGCGGAGAGCACTGCGGGAAGCTGTTGCAACCAAAGTGTGGGCACGACTGCAAGCTGCTATGTCATCCGGGGCCATGTCCTCCTTGTGCGCAGCAGGCGCAGGTCTCGTGTCTGTGTGGTAAATCCAGTCCTAGGTCCGTGAGGTGCATTGACAAGCAGTGGAGGTGTCAGCAGACTTGCAAGGAACTTCTGGCCTGCGGCAAGCACAAGTGTAATCAGGTGTGTCATCAGCCAGGAAAGTGTCCTCCCTGCACCAGCAAAAGCTTACAACCTTGCGAGTGCCAGCGGGAGTCGAAGATGGTCAACTGCTCCGATCGCAAATGGAAGTGTCAAAACgTTTGTGGAGCTCCGTTTGCTTGTGGTTTGCACATTTGCGAGAAGGTGTGTCACGCAGGACCCTGCGGCGATGGAGAGTGTCCTTTGCAAGTCAGGAGTTGCCCATGTGGCAAGAATACCCAGGTAAGACCCTGTAATGAAGCAGAGGAAACCTGTGGCGATAcgtgccaaaaacttttgtcCTGTGGCCAGCACACTTGCACCCAGCGTTGTCATCGTGGACCTTGCATTTCCTGCCCAATAAGAACCAAAAAGAAGTGTCGTTGTGGTCTGCACGAAAAGGAGTTGCCCTGCTCCAAAGAGTTTGCATGCGAGACCAAATGCAAGCAAATGCGTGATTGCGGCAAGCATGCCTGCAATAGAAAGTGCTGCGGGGACCAGTGCCCGCCGTGCGAAAAGATTTGTGGCAAGCAGCTGAGCTGCAACAAGCACAAATGTCAGTCCGTGTGCCACAACGGACCTTGCTATCCTTGTAAACTGGAATCCCAGATTAACTGTCGCTgtgggaaaaccaaaaaaagtgTTCCTTGTGGCAGGGAAAGAAGTGCACGCATTGTCTGCTTGGAACTCTGTCGCATAACACCCAAATGTCACCATGCCATTAAGCATCGTTGTCACAAGGGTGAGTGTCCTCCATGTGGCCAAGTGTGCGGTCTGCCCAATGACACCAGCAAGTGTGGACACATCTGTAAAGCACGATGCCACGAGGCTGTTAGAGTTAATAAACCCAAAGAGGCTAGGCCACAGGCCAAAAAGTATGAGTATAAGGCATTGCCCCATCCACGATGTGAGGAAGGCGTCGTTGTCACCTGTATCGGAGGTCATGAGGTTGCCACTTGGCCATGCTGGAACTCCAAACCCACTTCCTGCCAGCGAATGTGTGCACGTCAGCTTAAGTGCGGCAATCACAAGTGTCCATTGGTTTGCCATTCCGTACCGCTTCCCCAGGATATGGCTGCGCAAACTGGCTGTGCCAACTGCGAGGAGGGTTGCATCGTTCCCCGACCCAGCGGATGTATTCACGCCTGTCCCAAAGGATGTCATCCGCCGCCCTGTGCGCCCTGCAACTTTGTGATTAAGACCAAGTGTCACTGTGGACTCAACCAGTTGGTTTACAAGTGCAACGAGTATTATGATGAAACGGGATCGGTCCAGGAGATCATCGAGCGAAGAGAAAAGCTACGGAGCTGCGGTAACCGATGCTTGAAGAATtATCCTTGCGGACATCGCTGCACAGCCATCTGCCACACAGGCAAGTGCCCAAATCCCGAGTTGTGCCGCAAGAAGGTTCGCATTTTCTGCGCATGCAAGCGACTCAAGCAGGAGATTGCCTGTGACAAACACCGTGCTGGTCAGACATTCTTGGATTGCGACTCCAACTGCAAGGCGGAGCAAACTCGCGTCCAGGCGGcagagcaactgcagctggaaCAAAAGCGTCGCGACGAGGAAGAAAGAAACCGCGTGGAACTCGAGAAGTTCGAGGCCAAGTTTGGGAAGCGCAAGCACAAGGAGCGCAAAACTGTGGGTTCTGGACCGGCCAAGACCAAGATCGATTGGCAACGAAGAGCAATCTATGCAGTATCCATTCTAACAGTTGTGGGTGCAATTGTGGTGGCTTTCTACGCGGACAGTTAA
- Protein Sequence
- MEKFNKAQAKNLAAAQKLVDTYASSSEDEGELDENHILELLYKNYIPTDNAGSSKDAARTTTFLENTLHSGAATCLICIGSIRRVEAIWSCESCYCFFHLKCIQRWANDSMMQMKVKAAEQQNGQGHYNHLGEFVPPKRQKSLHWCCPQCRRDYQPADKPTQYNCFCGKEVNPENQPFLVPHSCGEHCGKLLQPKCGHDCKLLCHPGPCPPCAQQAQVSCLCGKSSPRSVRCIDKQWRCQQTCKELLACGKHKCNQVCHQPGKCPPCTSKSLQPCECQRESKMVNCSDRKWKCQNVCGAPFACGLHICEKVCHAGPCGDGECPLQVRSCPCGKNTQVRPCNEAEETCGDTCQKLLSCGQHTCTQRCHRGPCISCPIRTKKKCRCGLHEKELPCSKEFACETKCKQMRDCGKHACNRKCCGDQCPPCEKICGKQLSCNKHKCQSVCHNGPCYPCKLESQINCRCGKTKKSVPCGRERSARIVCLELCRITPKCHHAIKHRCHKGECPPCGQVCGLPNDTSKCGHICKARCHEAVRVNKPKEARPQAKKYEYKALPHPRCEEGVVVTCIGGHEVATWPCWNSKPTSCQRMCARQLKCGNHKCPLVCHSVPLPQDMAAQTGCANCEEGCIVPRPSGCIHACPKGCHPPPCAPCNFVIKTKCHCGLNQLVYKCNEYYDETGSVQEIIERREKLRSCGNRCLKNYPCGHRCTAICHTGKCPNPELCRKKVRIFCACKRLKQEIACDKHRAGQTFLDCDSNCKAEQTRVQAAEQLQLEQKRRDEEERNRVELEKFEAKFGKRKHKERKTVGSGPAKTKIDWQRRAIYAVSILTVVGAIVVAFYADS
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00603756;
- 90% Identity
- iTF_00594930;
- 80% Identity
- -