Ecor001473.1
Basic Information
- Insect
- Eupeodes corollae
- Gene Symbol
- nfxl1
- Assembly
- GCA_945859775.1
- Location
- CAMAOS010000018.1:791811-797048[+]
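The Location string above appears to pack contig, start, end, and strand into one field. The following is a minimal sketch for splitting it apart. The `<contig>:<start>-<end>[<strand>]` layout and the 1-based inclusive coordinate convention are assumptions inferred from this one record, and the names `Locus`, `parse_locus`, and `span_length` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Assumed layout: <contig>:<start>-<end>[<strand>]
_LOCUS_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_locus(text: str) -> Locus:
    """Split a location string into contig, start, end, and strand."""
    m = _LOCUS_RE.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(m["contig"], int(m["start"]), int(m["end"]), m["strand"])


def span_length(locus: Locus) -> int:
    """Genomic span in bp, assuming 1-based inclusive coordinates."""
    return locus.end - locus.start + 1


locus = parse_locus("CAMAOS010000018.1:791811-797048[+]")
print(locus.contig, locus.strand, span_length(locus))
```

Under those assumptions the gene spans 5238 bp on the plus strand of the contig.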
Transcription Factor Domain
- TF Family
- zf-NF-X1
- Domain
- zf-NF-X1 domain
- PFAM
- PF01422
- TF Group
- Zinc-Coordinating Group
- Description
- This domain is presumed to be a zinc-binding domain. The following pattern describes the zinc finger: C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C, where X can be any amino acid and the numbers in brackets indicate the number of residues. Two positions can be either His or Cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
- Hmmscan Out
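| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 20 | 2 | 3.2e+04 | -8.1 | 6.4 | 15 | 19 | 169 | 173 | 168 | 173 | 0.82 |
| 2 | 20 | 0.15 | 2.4e+03 | -0.5 | 0.8 | 4 | 10 | 204 | 210 | 203 | 210 | 0.95 |
| 3 | 20 | 4.5e-08 | 0.00072 | 20.3 | 12.8 | 1 | 18 | 218 | 234 | 218 | 235 | 0.98 |
| 4 | 20 | 1.1 | 1.8e+04 | -3.3 | 1.1 | 5 | 10 | 260 | 265 | 260 | 265 | 0.93 |
| 5 | 20 | 1.2e-06 | 0.02 | 15.8 | 13.2 | 1 | 19 | 271 | 290 | 271 | 290 | 0.93 |
| 6 | 20 | 1.4 | 2.2e+04 | -3.6 | 1.8 | 6 | 10 | 315 | 319 | 314 | 319 | 0.89 |
| 7 | 20 | 1.6e-09 | 2.6e-05 | 24.9 | 14.4 | 1 | 19 | 325 | 343 | 325 | 343 | 0.99 |
| 8 | 20 | 0.51 | 8.4e+03 | -2.2 | 1.4 | 5 | 10 | 367 | 372 | 367 | 372 | 0.95 |
| 9 | 20 | 3.6e-08 | 0.00058 | 20.6 | 10.0 | 1 | 18 | 378 | 395 | 378 | 396 | 0.96 |
| 10 | 20 | 0.31 | 5e+03 | -1.5 | 0.4 | 1 | 5 | 405 | 409 | 405 | 414 | 0.75 |
| 11 | 20 | 2.3e-05 | 0.37 | 11.7 | 15.3 | 1 | 18 | 430 | 447 | 430 | 448 | 0.98 |
| 12 | 20 | 2.7e-08 | 0.00043 | 21.1 | 13.0 | 3 | 19 | 459 | 475 | 451 | 475 | 0.85 |
| 13 | 20 | 1.1 | 1.7e+04 | -3.2 | 0.6 | 5 | 10 | 502 | 507 | 502 | 507 | 0.78 |
| 14 | 20 | 0.00067 | 11 | 7.0 | 11.3 | 8 | 19 | 519 | 530 | 511 | 530 | 0.87 |
| 15 | 20 | 0.23 | 3.8e+03 | -1.1 | 4.9 | 3 | 11 | 543 | 551 | 533 | 552 | 0.84 |
| 16 | 20 | 0.33 | 5.3e+03 | -1.6 | 0.9 | 6 | 10 | 610 | 614 | 609 | 614 | 0.90 |
| 17 | 20 | 8.2e-06 | 0.13 | 13.1 | 6.1 | 1 | 11 | 620 | 630 | 620 | 631 | 0.96 |
| 18 | 20 | 2 | 3.2e+04 | -6.9 | 3.8 | 1 | 7 | 644 | 648 | 644 | 651 | 0.51 |
| 19 | 20 | 0.0003 | 4.9 | 8.1 | 14.5 | 4 | 18 | 661 | 675 | 659 | 676 | 0.93 |
| 20 | 20 | 1.5e-06 | 0.025 | 15.4 | 13.7 | 1 | 18 | 724 | 743 | 724 | 744 | 0.89 |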
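The zinc finger pattern quoted in the Description can be read as a regular expression, which gives a quick way to spot candidate fingers by eye. The sketch below is a literal translation under one assumption: "C(H/C)" is taken to mean a Cys followed by a His-or-Cys position. The names `ZF_NFX1` and `find_zf_candidates` are hypothetical. A regex hit is only a rough motif scan and is not equivalent to the profile HMM domain hits reported by hmmscan.

```python
import re

# Literal reading of C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C.
# Assumption: "C(H/C)" means Cys followed by one His-or-Cys position.
ZF_NFX1 = re.compile(r"C.{1,6}H.C.{3}C[HC].{3,4}[HC].{1,10}C")


def find_zf_candidates(protein):
    """Return (start, end, text) for non-overlapping matches.

    Coordinates are 0-based and half-open, as in Python slicing.
    """
    return [(m.start(), m.end(), m.group()) for m in ZF_NFX1.finditer(protein)]


# Short excerpt copied from the protein sequence of this record.
excerpt = "YSCGIHSCKQVCHDGPCGDCPLSLPRSC"
for start, end, text in find_zf_candidates(excerpt):
    print(start, end, text)
```

Because `finditer` reports non-overlapping matches and the trailing `X(1-10)` is matched greedily, closely spaced fingers can be merged or skipped. Running the scan on the full protein sequence should therefore be treated as a first look only.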
Sequence Information
- Coding Sequence
- ATGGCTTCTAACATTTCCAAAGCAAACAAACCAAAAAATGTCCCTTCCAGCACAAGTAATAGAAAACCAGAAAACCCCAATAAGTCGCAACAACAATCAAAAAAATTCAAAGATATTCAAAATAAAAATATGGAAATGGCTAAGAAAATTACAGAAAACTATGAGTCCAGTTCTGATGAAGAAGAACTCGATGGTGGAAAAATCTTGGAAAAATTATACAAAAACTATACTGGTGGGAATGATCAACTGCAAAAGACTGCAGCTTTTTTGGAAAATGTTTTACAATCAGGATCTGCAATATGCCTAATTTGCATTGGTTCGGTGAAAAGAATGGATGCAATTTGGTCTTGCAAGTTTTGCTATTGTGTGTTCCATTTGAATTGCGTAAAGCGATGGGCAAACGATAGTATTGCATTACTTAAAGACAAGGGCACCGAAGAGCAAGGATATTATAATAATTTGGGTGAATTCATTCCGAAAAAGGTAAAAGCTGTGAAGTGGTGTTGTCCACAGTGTCGACGGGACTATCAACCCCAAGATAGACCAACTCAATATGAGTGCTTTTGTGGAAAAGAAGTTAATCCTGTCAATCAGGAATGGATAGTTCCACATTCTTGTGGGGAGACTTGTGGCAAACCATTACAACCTGAATGCGGTCACATGTGTATGCTTCTGTGTCATCCTGGTCCATGTCCTCCCTGTGCTCAAAGCATATTTTCAAGTTGCAAATGTGGAAAATCCCCACCGAAAACTCTTCGTTGCTTCCAAAAAACCTGGACATGCGATCGGAAATGTTTGCAGGTTCTTCCTTGTGGCAAACACAAATGTGATCAAGTTTGTCATTCGGCTAAACAATGTCCCCCTTGCAGTAAGTCTAGTCGTCAGAAATGCGTTTGTGGCAATGAAGAGGCCATGAAAAACTGCTCCCAGAGAATCTGGCAATGTAAGAAGGTGTGCAACAAAAAATATAGTTGTGGTATTCACAGTTGTAAACAAGTTTGTCATGATGGACCATGCGGCGATTGTCCACTTAGCCTTCCAAGATCTTGTCCATGTGGAAAAACTAGGAAAACAGCACCTTGCAACGAACCCATCGATACTTGTGAAGACACTTGCTACAAGTTGCTCTCGTGTGGAAAACACTATTGTACCCAACGTTGTCATAAAGGCGATTGCAGCTTGTGCTTAATTGTGACGAAAAAAACATGTCGCTGTGGTATGCACGAAAAAGAGTTACCTTGTTGGAAAGCATTCAGCTGTGAGACAAAATGCAAGAAGATCCGAGACTGTGGAAAACATGCTTGTAATAAAAAGTGCTGTGATGATCAATGTCCACCGTGTGATAAGGTATGCGGAAAATTGTTGTCTTGTAAAAAGCACAAATGCAGTTCAGTTTGTCACGATGGTCCGTGTTATCCTTGTAAGTTACAATCTCAAGTTAAATGTCGATGTGGAAGTACTTGCATTACGGTGCCCTGTGGTAGAGAGCGTAGAGCTCGTCCTAATTGTATGGAACCTTGCAGAGTTCCTTCTAAATGTCATCACCAAAATAAACATAAATGTCACAAAGGCGATTGTCCATCATGCAGTCAAGTGTGTGGATTGAAAAATGACACCACCAATTGTGAACACTTGTGTTCAGCTCGTTGTCATGCTGCTGTTAAGATTCCTCTCAAAAATAACCCTAACAATATATTTGAGTATAAATTCGATAATTATGAAATCAAACAAATGCCACATCCAAAATGTGAGAAGAAGGTTATGGTTGCCTGTATTGGTGGCCACGAGATTGCCGAATGGCCATGTTGGAATTCTAAGCCAACATCATGTCAACGTCTTTGTGGTCGCAATTTGAAATGTGGCAACCATAAGTGCTCACTCGTTTGTCACAATGTAACTGATCCCGATGATAAAAATGAGCAAGAAAGCTGTGGATCATGCATAGAAGGATGTGAAGTTAAACGACCTCCAGGATGTGTTCATCGTTGCAAGAAATCT
TGTCATCCACCCCCATGCGACCCATGTCTTGCACAAATCAAAGCAAACTGTCATTGTGGCCTAACGCAGGTTTATTATAAGTGTTCTGAATATTATTCAACTACCATGGACATTGAAGAACGTCAAGAAAAATTGAAGAGTTGTGGAAATCGTTGCATTAAAAATTACCCATGTGGTCATCGTTGTTCAGCTGAATGTCATTCAGGGCCCTGTCCAAATCCACAAAGTTGTCGTAAGAAGGTGAAAATCTATTGCGAATGCAGGCGCATCAAAATGGAAGTCACATGTGAGAAATCCCGATCGAAAGACACTTTCATACCGTGCGATGATGTGTGTCAAGTCATCAAAGCCGAATCGGAAAAAATCAAACGCGAAGAAGCAGAAAAACAAAGACTTGCCGAAGAGGAACGCAATCGCATTGAATTGGAACAATTTGAAAAGAGATTTGGTAAACGAAAACCAAGAGAACGTAAAGTTATTGCACAAGAAGTAAAAACAAACAACAATCAAATGAAATTGCTGATTGGCAGTGCAGCTGCTATTGTGGTAGCTGCAGTGGTAATCTTTTATTTTTATTTATAA
- Protein Sequence
- MASNISKANKPKNVPSSTSNRKPENPNKSQQQSKKFKDIQNKNMEMAKKITENYESSSDEEELDGGKILEKLYKNYTGGNDQLQKTAAFLENVLQSGSAICLICIGSVKRMDAIWSCKFCYCVFHLNCVKRWANDSIALLKDKGTEEQGYYNNLGEFIPKKVKAVKWCCPQCRRDYQPQDRPTQYECFCGKEVNPVNQEWIVPHSCGETCGKPLQPECGHMCMLLCHPGPCPPCAQSIFSSCKCGKSPPKTLRCFQKTWTCDRKCLQVLPCGKHKCDQVCHSAKQCPPCSKSSRQKCVCGNEEAMKNCSQRIWQCKKVCNKKYSCGIHSCKQVCHDGPCGDCPLSLPRSCPCGKTRKTAPCNEPIDTCEDTCYKLLSCGKHYCTQRCHKGDCSLCLIVTKKTCRCGMHEKELPCWKAFSCETKCKKIRDCGKHACNKKCCDDQCPPCDKVCGKLLSCKKHKCSSVCHDGPCYPCKLQSQVKCRCGSTCITVPCGRERRARPNCMEPCRVPSKCHHQNKHKCHKGDCPSCSQVCGLKNDTTNCEHLCSARCHAAVKIPLKNNPNNIFEYKFDNYEIKQMPHPKCEKKVMVACIGGHEIAEWPCWNSKPTSCQRLCGRNLKCGNHKCSLVCHNVTDPDDKNEQESCGSCIEGCEVKRPPGCVHRCKKSCHPPPCDPCLAQIKANCHCGLTQVYYKCSEYYSTTMDIEERQEKLKSCGNRCIKNYPCGHRCSAECHSGPCPNPQSCRKKVKIYCECRRIKMEVTCEKSRSKDTFIPCDDVCQVIKAESEKIKREEAEKQRLAEEERNRIELEQFEKRFGKRKPRERKVIAQEVKTNNNQMKLLIGSAAAIVVAAVVIFYFYL
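The coding and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code and a sequence that starts in frame. `translate` is a hypothetical helper written for this example. It removes whitespace first, so a sequence wrapped over several lines is handled.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Codons with ambiguous bases (e.g. N) raise KeyError in this sketch.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First six and last three codons of the coding sequence in this record.
print(translate("ATGGCTTCTAACATTTCC"))  # MASNIS
print(translate("TATTTATAA"))           # YL*
```

Translating the full coding sequence and removing the trailing `*` should reproduce the protein sequence listed above if the two fields are consistent.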
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01223893;
- 90% Identity
- iTF_01318704;
- 80% Identity
- -