Dtak011787.1
Basic Information
- Insect
- Drosophila takahashii
- Gene Symbol
- nfxl1
- Assembly
- GCA_018152695.1
- Location
- JAECXN010000121.1:5682166-5685690[+]
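The Location string above packs a scaffold accession, a coordinate range, and a strand into one token. A minimal Python sketch of splitting it apart follows. The names `Location` and `parse_location` are hypothetical helpers, not part of any database API, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


# Matches strings shaped like "<seqid>:<start>-<end>[<strand>]".
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into its four fields."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match.group("seqid"),
        int(match.group("start")),
        int(match.group("end")),
        match.group("strand"),
    )


loc = parse_location("JAECXN010000121.1:5682166-5685690[+]")
# Assumption: coordinates are 1-based and inclusive.
span = loc.end - loc.start + 1
print(loc.seqid, loc.strand, span)
```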
Transcription Factor Domain
- TF Family
- zf-NF-X1
- Domain
- zf-NF-X1 domain
- PFAM
- PF01422
- TF Group
- Zinc-Coordinating Group
- Description
- This domain is presumed to be a zinc-binding domain. The following pattern describes the zinc finger: C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C, where X can be any amino acid and numbers in brackets indicate the number of residues. Two positions can be either His or Cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
- Hmmscan Out
-
 #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
 1  20         2   1.5e+04   -8.1   6.4        15      19       150     154       149     154  0.82
 2  20      0.13     1e+03   -0.3   0.4         4      10       185     191       184     191  0.95
 3  20   6.7e-08   0.00051   19.8  15.5         1      18       199     215       199     216  0.98
 4  20   3.9e-06      0.03   14.1  10.8         5      19       276     291       276     291  0.96
 5  20      0.29   2.2e+03   -1.4   1.3         5      10       315     320       315     320  0.94
 6  20   6.2e-08   0.00047   19.9   8.6         1      19       326     346       326     353  0.94
 7  20      0.91     7e+03   -3.0   0.5         1       7       370     375       369     377  0.49
 8  20   2.8e-08   0.00022   21.0  13.4         1      19       380     398       380     398  0.99
 9  20      0.72   5.5e+03   -2.7   0.4         1       5       407     411       407     416  0.73
10  20         2   1.5e+04   -5.0   2.3         5      10       421     426       421     426  0.91
11  20   0.00037       2.8    7.8  17.2         1      18       432     449       432     450  0.97
12  20   3.4e-08   0.00026   20.7  14.7         1      19       459     477       453     477  0.90
13  20      0.36   2.8e+03   -1.7   0.5         8      13       513     518       512     519  0.82
14  20   0.00016       1.2    9.0   8.6         9      18       522     531       520     532  0.91
15  20     0.014   1.1e+02    2.8   6.6         1      12       544     554       544     554  0.96
16  20      0.88   6.8e+03   -3.0   1.4         6      10       608     612       607     612  0.88
17  20   3.9e-06      0.03   14.1   5.7         1      12       618     629       618     629  0.97
18  20     0.041   3.2e+02    1.3  16.3         4      18       659     673       657     674  0.93
19  20       1.7   1.3e+04   -3.9   1.6         8      13       679     684       678     684  0.74
20  20   5.1e-07    0.0039   17.0  10.6         1      16       725     739       725     745  0.92
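The zinc-finger pattern given in the Description can be approximated as a regular expression. The Python sketch below is one possible reading of that pattern, not the method used to produce the hmmscan hits, which come from a profile HMM. It assumes that "C(H/C)" means a Cys followed by one position that is His or Cys, and the name `find_fingers` is a hypothetical helper. The test fragment is copied from the protein sequence of this record.

```python
import re

# One reading of C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C.
# Assumption: "C(H/C)" is a Cys followed by a single His-or-Cys position.
NFX1_FINGER = re.compile(r"C.{1,6}H.C.{3}C[HC].{3,4}[HC].{1,10}C")


def find_fingers(protein: str) -> list[tuple[int, str]]:
    """Return (1-based start, matched text) for non-overlapping matches."""
    return [(m.start() + 1, m.group()) for m in NFX1_FINGER.finditer(protein)]


# Short fragment taken from the protein sequence of this entry.
fragment = "LQPKCGHDCKLLCHPGPCPPCAQQ"
print(find_fingers(fragment))  # [(5, 'CGHDCKLLCHPGPCPPC')]
```

Because the quantifiers are greedy and the scan is non-overlapping, closely spaced or overlapping fingers in a full-length protein may be merged or missed. A regular expression of this kind is only a quick sanity check alongside the profile-based hits in the table.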
Sequence Information
- Coding Sequence
- ATGGAAAAGTTCACTAAAGCACAGGCAAAAAACTTGGCTGCTGCCCAGAAACTGGTGGACACCTACGCCTCCAGCTCCGAGGATGAGGGCGAACTGGATGAGAAGCATATTTTGGAACTCCTCTATAAGAACTACCGACCGACGGAGGGCGCAGGGAGCTCCAAGGATGCAGCCAAGACGAGTACGTTCCTGGAGAACACCTTGCACTCCGGAGCAGCCACCTGTCTCATCTGCATCGGCAGCATCCGGCGGGTGGAGGCCATTTGGTCGTGCGAGAGTTGCTACTGCTTCTTCCACTTGAACTGCATCCAGCGGTGGGCCAACGACAGCATGATGCAGATGAAGGTGAAGGCGGCGGAGCAGCAGAATGGCCAGGCCAGCCAGGGTCACTACAATCACCTGGGCGAGTTTGTGCCGCCCAAGAGGCAAAAATCGCTCCATTGGTGCTGCCCCCAGTGCCGCAGGGATTACCAGCCGACGGAGAAGCCCACGCAGTACAACTGCTTCTGCGGCAAGGAGGAGAATCCCCAGAATCAGCCCTTCCTGGTGCCCCACTCCTGTGGAGAGATTTGCGGCAAGCTGCTGCAGCCCAAGTGTGGACACGATTGCAAGCTTCTATGTCATCCGGGGCCATGTCCTCCGTGTGCCCAGCAGGCGCAGGTCTCCTGTTTGTGCGGCAAGTCCAGTCCAAGGTCCATGCGATGTATTGATAAACAGTGGAGGTGCCAGCAGGCGGTAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNACAAGTGCCAGCAGTTGTGCCATCGACCAGGACAGTGTCCTCCCTGCAGCAGCAAAAGCCTGCAGCCCTGCGAATGCCAGCGGGAATCCAAGATGGTCAACTGCTCCGATCGCAAATGGAAGTGTCAGAATGTTTGTGGCGCTCCCTTTGCTTGTGGCCTGCATATTTGCGAAAAGGTCTGCCATGCCGGTGCCTGTGGCGATGGCGAGTGTCCTTTGCAAGTCAGGAGTTGTCCTTGTGGCAAGAATACCCAAGTTAGACCCTGTAATGAGGCGGAGGAAACGTGTGGCGATACTTGCCAGAAGCTGTTGTCTTGTGGCCAGCATACTTGCACTCAGCGTTGTCATCGTGGAGCCTGTATTTCTTGCCCTATAAGAACCAAAAAGAAGTGTCGTTGTGGATTGCATGAAAAGGAGCTGCCCTGCTCCAAGGAGTTTACTTGCGAGACTAAATGCAAGCAGATGCGTGACTGCGGCAAGCATGCTTGCAATAGAAAGTGCTGTGGCGATCAGTGTCCGCCCTGCGAGAAGATTTGTGGCAAGCAGCTGAGCTGCAACAAGCACAAATGCCAGTCGGTGTGCCACAATGGACCCTGCTACCCGTGCAAACTGGAGTCGCAAATCAACTGCCGCTGTGGCAAGACCAAAAGAAGTGTTCCTTGTGGCAGAGAGCGGAGTGCTCGCATTGTGTGCTTGGAACTCTGTCGGATAACTGCCAAATGCCATCATGGTATCAAACATCGCTGCCACAAGGGCGAGTGTCCTCCTTGCGGCCAAGTGTGTGGGCTGCCCAATGAGACCAGCAAATGTGGACACATCTGCAAGGCTCGCTGCCATGAGGCGGTTAGAGTTAATAAACCCAAAGAGGCTAGGCCACAGGCCAAAAAGTATGAATATAAGTCATTACCCCATCCGCGTTGCGAAGAAGGCGTCGTGGTGACCTGCATTGGGGGTCACGAGGTGGCCACCTGGCCGTGCTGGAACTCCAAACCCACTTCCTGCCAGCGAAAGTGTGCCCGGCAGCTGAAATGCGGCAATCACAAATGCTCCTTGGTCTGCCACTCGGTGGCCCAACCACAGGACATGGCTCAGCAGCCGGGTTGCGCCAACTGCGAGGAGGGTTGCTCCGTTCCCCGACCCACCGGCTGTGTTCACTCCTGTCCCCGCGGCTGTCAT
CCGCCGCCCTGTGCCCCTTGCAACTTTGTGATCAAGAGCAAGTGCCACTGCGGACTCAATCAGTTGGTGTACAAGTGCAGCGAGTATTTCGATGAAACGGGCACCGTTCAGGAGATTATCGAGCGTAGGGAGAAGCTTCGCAGCTGCGGCAATCGTTGCTTGAAGAATTATCCCTGTGGCCATCGCTGCTCAGCCATTTGCCACTCAGGCAAGTGTCCAAATCCCGAGCTGTGCCGCAAGAAGGTTCGCATTTTCTGTGCCTGCAAGCGTCTCAAGCAGGAGATTGCCTGCGACAAGCATCGGGCGGGTCAGGTTTCCCTAGACTGCGATTCCAACTGCAAGGCGGAGCAATCTCGAGCCCAAGCGGCGGAGCAACAGCAGCTGGAGCAGAAGCGTCGCCACGAGGAGGAGCGAAACCGCCTGGAACTCGAGAAGTTCGAGGCCAAGTTCGGCAAACGCAAGCACAAGGAGCGCAAAACTGTAGATGTTGGACCGGCAAAAACCAAGATCGACTGGCAGCGATGGGCCATCTATGTGGTATCCATCCTCACAGTTGTGGCTGCCATTGCGGTGGCTTTCTACGCGGACAGTTAA
- Protein Sequence
- MEKFTKAQAKNLAAAQKLVDTYASSSEDEGELDEKHILELLYKNYRPTEGAGSSKDAAKTSTFLENTLHSGAATCLICIGSIRRVEAIWSCESCYCFFHLNCIQRWANDSMMQMKVKAAEQQNGQASQGHYNHLGEFVPPKRQKSLHWCCPQCRRDYQPTEKPTQYNCFCGKEENPQNQPFLVPHSCGEICGKLLQPKCGHDCKLLCHPGPCPPCAQQAQVSCLCGKSSPRSMRCIDKQWRCQQAVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKCQQLCHRPGQCPPCSSKSLQPCECQRESKMVNCSDRKWKCQNVCGAPFACGLHICEKVCHAGACGDGECPLQVRSCPCGKNTQVRPCNEAEETCGDTCQKLLSCGQHTCTQRCHRGACISCPIRTKKKCRCGLHEKELPCSKEFTCETKCKQMRDCGKHACNRKCCGDQCPPCEKICGKQLSCNKHKCQSVCHNGPCYPCKLESQINCRCGKTKRSVPCGRERSARIVCLELCRITAKCHHGIKHRCHKGECPPCGQVCGLPNETSKCGHICKARCHEAVRVNKPKEARPQAKKYEYKSLPHPRCEEGVVVTCIGGHEVATWPCWNSKPTSCQRKCARQLKCGNHKCSLVCHSVAQPQDMAQQPGCANCEEGCSVPRPTGCVHSCPRGCHPPPCAPCNFVIKSKCHCGLNQLVYKCSEYFDETGTVQEIIERREKLRSCGNRCLKNYPCGHRCSAICHSGKCPNPELCRKKVRIFCACKRLKQEIACDKHRAGQVSLDCDSNCKAEQSRAQAAEQQQLEQKRRHEEERNRLELEKFEAKFGKRKHKERKTVDVGPAKTKIDWQRWAIYVVSILTVVAAIAVAFYADS
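The coding sequence contains a run of N bases, and the protein sequence shows X residues at the corresponding positions. The relationship between the two can be sketched with a small translator. This is a minimal illustration assuming the standard genetic code (NCBI translation table 1) and assuming that any codon containing a non-ACGT base is reported as X. The name `translate` is a hypothetical helper, not the tool used to build this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code, ordered by codon with
# bases iterated in T, C, A, G order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; ambiguous codons become X."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        # Codons with N or any other non-TCAG base are not in the table.
        protein.append(CODON_TABLE.get(cds[i:i + 3], "X"))
    return "".join(protein)


# First six codons and last three codons of the coding sequence above.
print(translate("ATGGAAAAGTTCACTAAA"))  # MEKFTK
print(translate("GACAGTTAA"))           # DS*
```

The trailing `*` marks the stop codon, which is not written in the protein sequence listed in this record.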
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00603756;
- 90% Identity
- iTF_00581954; iTF_00537188; iTF_00505793; iTF_00548482; iTF_00588548; iTF_00529957; iTF_00494791; iTF_00486991; iTF_00512403; iTF_00578965;
- 80% Identity
- -