Dere004049.1
Basic Information
- Insect
- Drosophila erecta
- Gene Symbol
- nfxl1
- Assembly
- GCA_000005135.1
- Location
- CH954178.1:6969118-6972177[-]
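The location field above packs the contig, coordinates and strand into one string. A minimal Python sketch for unpacking it follows. The helper name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that this record does not state.

```python
import re

def parse_location(loc):
    """Split 'contig:start-end[strand]' into its parts (hypothetical helper)."""
    m = re.fullmatch(r"(.+):(\d+)-(\d+)\[([+-])\]", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "contig": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }

info = parse_location("CH954178.1:6969118-6972177[-]")
print(info["contig"], info["strand"], info["length"])
```

Under the inclusive-coordinate assumption the genomic span is 3060 bp, on the minus strand of contig CH954178.1.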
Transcription Factor Domain
- TF Family
- zf-NF-X1
- Domain
- zf-NF-X1 domain
- PFAM
- PF01422
- TF Group
- Zinc-Coordinating Group
- Description
- This domain is presumed to be a zinc-binding domain. The following pattern describes the zinc finger: C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C, where X can be any amino acid and numbers in brackets indicate the number of residues. Two positions can be either His or Cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
- Hmmscan Out
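| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 20 | 2 | 1.8e+04 | -8.1 | 6.4 | 15 | 19 | 147 | 151 | 146 | 151 | 0.82 |
| 2 | 20 | 0.13 | 1.2e+03 | -0.3 | 0.4 | 4 | 10 | 182 | 188 | 181 | 188 | 0.95 |
| 3 | 20 | 6.5e-08 | 0.00059 | 19.8 | 15.5 | 1 | 18 | 196 | 212 | 196 | 213 | 0.98 |
| 4 | 20 | 0.69 | 6.3e+03 | -2.6 | 1.5 | 5 | 10 | 238 | 243 | 238 | 243 | 0.91 |
| 5 | 20 | 2.8e-08 | 0.00026 | 21.0 | 16.8 | 1 | 19 | 249 | 268 | 249 | 268 | 0.97 |
| 6 | 20 | 0.28 | 2.5e+03 | -1.4 | 1.3 | 5 | 10 | 292 | 297 | 292 | 297 | 0.94 |
| 7 | 20 | 4.7e-08 | 0.00043 | 20.3 | 10.6 | 1 | 19 | 303 | 323 | 303 | 330 | 0.94 |
| 8 | 20 | 0.89 | 8.1e+03 | -3.0 | 0.5 | 1 | 7 | 347 | 352 | 346 | 354 | 0.49 |
| 9 | 20 | 2.1e-08 | 0.00019 | 21.4 | 15.3 | 1 | 19 | 357 | 375 | 357 | 375 | 0.99 |
| 10 | 20 | 0.7 | 6.4e+03 | -2.7 | 0.4 | 1 | 5 | 384 | 388 | 384 | 393 | 0.73 |
| 11 | 20 | 2 | 1.8e+04 | -5.0 | 2.3 | 5 | 10 | 398 | 403 | 398 | 403 | 0.91 |
| 12 | 20 | 0.00045 | 4.1 | 7.5 | 15.5 | 1 | 18 | 409 | 426 | 409 | 427 | 0.97 |
| 13 | 20 | 3.3e-08 | 0.00031 | 20.7 | 14.7 | 1 | 19 | 436 | 454 | 430 | 454 | 0.90 |
| 14 | 20 | 0.00016 | 1.4 | 9.0 | 8.6 | 9 | 18 | 499 | 508 | 497 | 509 | 0.91 |
| 15 | 20 | 0.014 | 1.3e+02 | 2.8 | 6.6 | 1 | 12 | 521 | 531 | 521 | 531 | 0.96 |
| 16 | 20 | 0.55 | 5e+03 | -2.3 | 1.3 | 6 | 10 | 585 | 589 | 584 | 589 | 0.88 |
| 17 | 20 | 3.8e-06 | 0.035 | 14.2 | 5.4 | 1 | 12 | 595 | 606 | 595 | 606 | 0.97 |
| 18 | 20 | 0.028 | 2.5e+02 | 1.8 | 15.8 | 3 | 18 | 635 | 650 | 634 | 651 | 0.94 |
| 19 | 20 | 2 | 1.8e+04 | -5.5 | 2.1 | 9 | 13 | 657 | 661 | 656 | 661 | 0.72 |
| 20 | 20 | 1.1e-06 | 0.01 | 15.9 | 11.0 | 1 | 16 | 702 | 716 | 702 | 722 | 0.92 |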
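Many of the 20 domain hits listed above have weak independent E-values, so a reader may want to filter them. The Python sketch below parses whitespace-separated rows in the column order shown and keeps hits under an i-Evalue cutoff. The 0.01 cutoff and the names `parse_rows` and `significant` are illustrative assumptions rather than part of this record, and only four rows are copied in to keep the example short.

```python
# Column order as given in the hmmscan output header.
COLUMNS = ["index", "of", "c_evalue", "i_evalue", "score", "bias",
           "hmm_from", "hmm_to", "ali_from", "ali_to",
           "env_from", "env_to", "acc"]
FLOAT_COLUMNS = {"c_evalue", "i_evalue", "score", "bias", "acc"}

# Four rows copied from the hmmscan output (not the full set of 20).
ROWS = """\
1 20 2 1.8e+04 -8.1 6.4 15 19 147 151 146 151 0.82
3 20 6.5e-08 0.00059 19.8 15.5 1 18 196 212 196 213 0.98
9 20 2.1e-08 0.00019 21.4 15.3 1 19 357 375 357 375 0.99
17 20 3.8e-06 0.035 14.2 5.4 1 12 595 606 595 606 0.97
"""

def parse_rows(text):
    """Turn whitespace-separated rows into dicts keyed by column name."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != len(COLUMNS):
            continue  # skip blank or malformed lines
        hit = {}
        for name, value in zip(COLUMNS, fields):
            hit[name] = float(value) if name in FLOAT_COLUMNS else int(value)
        hits.append(hit)
    return hits

def significant(hits, max_i_evalue=0.01):
    """Keep hits whose independent E-value is below the (assumed) cutoff."""
    return [h for h in hits if h["i_evalue"] < max_i_evalue]

hits = parse_rows(ROWS)
kept = significant(hits)
for h in kept:
    print(h["index"], h["ali_from"], h["ali_to"], h["i_evalue"])
```

Of the four sample rows, only hits 3 and 9 pass the assumed cutoff. A different threshold, or filtering on score or accuracy instead, may suit other uses.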
Sequence Information
- Coding Sequence
- atggaaaagtttAATAAAGCGCAAGCTAAAAATTTGGCTGCCGCCCAAAAACTGGTGGACACCTACGCCTCCAGCTCCGAGGATGAAGGCGAACTCGATGAGAAGCACATTTTGGAACTCCTATACAAGAACTACAAACCGTCGGATAGCGTAGGAAGCTCAAAGGATGCAGCCCGTACCAGTACATTCCTGGAGAACACTCTCCACTCGGGAGCCGCTACCTGTCTTATTTGTATCGGAAGCATCCGACGGGTGGAGGCCATTTGGTCGTGCGAGAGCTGTTACTGCTTCTTCCACTTGAACTGCATCCAGCGGTGGGCCAACGACAGCATGATGCAGATGAAGGTGAAGGCGGCGGAGCAGCAGAACGGTCAGGGCCACTACAATCACCTGGGCGAATTTGTGCCGCCCAAGCGACAGAAATCCCTGCACTGGTGCTGTCCTCAGTGCCGCAGGGACTACCAACCGGGCGATAAGCCCACGCAGTATAACTGCTTCTGTGGCAAGGAGGTGAATCCAGAAAACCAGCCCTTCCTTGTGCCCCACTCCTGCGGAGAGATCTGCGGGAAGCTGCTGCAACCAAAGTGTGGGCACGATTGCAAGCTGCTATGTCATCCGGGGCCATGTCCTCCTTGTGCGCAGCAGGCGCAGGTCTCCTGCCTGTGTGGTAAATCCAGTCCTAGGTCCGTGAGATGCATTGACAAGCAGTGGAGGTGTCAGCAGGCTTGCAAGGATCTTCTGGCGTGTGGCAAGCACAAGTGCAATCAGGTGTGTCATCAGCCAGGAAAGTGTCCTCCGTGCACCAGCAAAAGCTTACAACCTTGCGAGTGCCAGAGGGAGTCGAAGATGGTCAATTGCTCTGATCGGAAATGGAAGTGTCAGAATGTGTGTGGAGCTCCGTTCAATTGTGGCTTGCACATTTGCGAGAAGGTCTGTCATGCAGGACCCTGCGGCGATGGAGAATGTCCTTTGCAAGTCAGGAGTTGCCCATGTGGCAAGAATACTCAGGTAAGACCATGTAATGAGGCAGAGGAAACCTGTGGCGATAcgtgccaaaaacttttgtcTTGTGGTCAGCACACTTGCACCCAGCGTTGTCATCGTGGACCTTGCATTTCCTGCCCAATAAGAACCAAAAAGAAATGTCGTTGTGGTCTGCACGAAAAGGAGCTGCCCTGCTCCAAAGAGTTTACATGCGAAACTAAATGCAAGCAAATGCGTGATTGTGGCAAGCATGCCTGCAATAGAAAGtgcTGTGGAGACCTGTGTCCACCGTGCGAAAAGATTTGTGGCAAGCAGTTGAGCTGCAACAAGCACAAATGTCAATCCGTGTGCCACAACGGACCCTGCTACCCGTGTAAACTGGAATCGCAGATCAACTGCCGCTgtgggaaaacaaaaaaaagtgttCCCTGTGGCAGGGAAAGGAGTGCACGCATTGTCTGTTTGGAGCTTTGTCGCATAACTCCCAAGTGCCACCATGCCATTAAACATCGTTGTCACAAGGGTGAGTGTCCTCCATGTGGCCAAGTGTGCGGTCTACCCAATGACACCAGCAAGTGTGGCCACATCTGTAAAGCAAGATGCCACGAGGCAGTTAGAGTTAATAAACCCAAAGAGGCTAGGCCACAGGCCAAAAAGTATGAGTATAAGGCATTACCTCATCCCCGGTGCGAGGAAGGTGTCATTGTCACATGTATTGGGGGTCACGAGGTTGCCACTTGGCCATGCTGGAACTCCAAACCCACTTCCTGTCAGCGAACATGTGCACGTCAGCTTAAGTGCGGCAACCACAAGTGTCCATTGGTTTGTCATTCAGTACCACTTCCCCAGGATATGTCTGTGCAAACTGGCTGTGCCAACTGCGAGGAGGGTTGCTCCGTTCCCCGACCCAACGGCTGCATTCACGCCTGTCCCAAAGGATGTCATCCACCGCCCTGTGCGCCCTGCAACTTTGTGATAAAGACCAAGTGCCACTGTGGACTCAGCCAGTTGGTA
TACAAGTGCAATGAGTATTTCGATGAAACGGGAACGGTCCAGGAAATCATCGAGCGAAGAGAGAAGCTACGGAGCTGTGGTAATCGATGCTTGAAGAATTATCCTTGCGGACATCGCTGCACAGCCATTTGCCACACAGGCAAGTGCCCAAATCCCGAGTTGTGCCGCAAGAAGGTTCGCATTTTCTGCGCCTGCAAGCGACTAAAGCAGGAGATTGCCTGTGACAAGCACCGTGGTGGTCAGACATCCTTGGAGTGCGATTCCAACTGCAAGGCGGAGCAAACTCGCGCCCAGGCAGCGGAACAACTGCAGCTGGAACAAAAACGCCGCGACGAGGAAGAAAGAAACCGCCTAGAACTGGAGAAGTTCGAGGCCAAGTTCGGTAAACGCAAGCACAAAGAGCGCAAAACCGTGGGTGCTGGACCGGCCAAGACCAAGATTGATTGGCAACGAAGAGCAATCTATGCAGTATCCATTCTAACAGTTGTGGGTGCGATTGTGGTAGCTTTCTACGCGGACAGTTAA
- Protein Sequence
- MEKFNKAQAKNLAAAQKLVDTYASSSEDEGELDEKHILELLYKNYKPSDSVGSSKDAARTSTFLENTLHSGAATCLICIGSIRRVEAIWSCESCYCFFHLNCIQRWANDSMMQMKVKAAEQQNGQGHYNHLGEFVPPKRQKSLHWCCPQCRRDYQPGDKPTQYNCFCGKEVNPENQPFLVPHSCGEICGKLLQPKCGHDCKLLCHPGPCPPCAQQAQVSCLCGKSSPRSVRCIDKQWRCQQACKDLLACGKHKCNQVCHQPGKCPPCTSKSLQPCECQRESKMVNCSDRKWKCQNVCGAPFNCGLHICEKVCHAGPCGDGECPLQVRSCPCGKNTQVRPCNEAEETCGDTCQKLLSCGQHTCTQRCHRGPCISCPIRTKKKCRCGLHEKELPCSKEFTCETKCKQMRDCGKHACNRKCCGDLCPPCEKICGKQLSCNKHKCQSVCHNGPCYPCKLESQINCRCGKTKKSVPCGRERSARIVCLELCRITPKCHHAIKHRCHKGECPPCGQVCGLPNDTSKCGHICKARCHEAVRVNKPKEARPQAKKYEYKALPHPRCEEGVIVTCIGGHEVATWPCWNSKPTSCQRTCARQLKCGNHKCPLVCHSVPLPQDMSVQTGCANCEEGCSVPRPNGCIHACPKGCHPPPCAPCNFVIKTKCHCGLSQLVYKCNEYFDETGTVQEIIERREKLRSCGNRCLKNYPCGHRCTAICHTGKCPNPELCRKKVRIFCACKRLKQEIACDKHRGGQTSLECDSNCKAEQTRAQAAEQLQLEQKRRDEEERNRLELEKFEAKFGKRKHKERKTVGAGPAKTKIDWQRRAIYAVSILTVVGAIVVAFYADS*
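The coding and protein sequences above can be cross-checked against each other, and the protein can be scanned for the zinc finger pattern quoted in the domain description. The Python sketch below assumes the standard genetic code and a literal reading of the pattern in which `C(H/C)` means a cysteine followed by a His-or-Cys position. Both are assumptions, as are the function names `translate` and `find_motifs`. This regular-expression scan is not the HMM-based search that produced the hmmscan output.

```python
import re
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence; unknown codons become 'X'."""
    seq = "".join(cds.split()).upper()  # drop line breaks, ignore case
    usable = len(seq) - len(seq) % 3    # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X")
                   for i in range(0, usable, 3))

# Literal reading of C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C.
# The lookahead allows overlapping candidates to be reported.
NFX1_PATTERN = re.compile(
    r"(?=(C.{1,6}H.C.{3}C[HC].{3,4}[HC].{1,10}C))")

def find_motifs(protein):
    """Return (1-based start, matched text) for each candidate motif."""
    return [(m.start() + 1, m.group(1))
            for m in NFX1_PATTERN.finditer(protein)]

# First nine codons of the coding sequence in this record.
print(translate("atggaaaagtttAATAAAGCGCAAGCT"))  # MEKFNKAQA
```

Passing the full coding sequence to `translate` and comparing the result with the listed protein is one way to check the record for consistency. Passing the protein to `find_motifs` gives candidate positions to compare with the alignment coordinates reported by hmmscan.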
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00603756
- 90% Identity
- iTF_00594930
- 80% Identity
- -