Bole009932.1
Basic Information
- Insect
- Bactrocera oleae
- Gene Symbol
- nfxl1
- Assembly
- GCA_001188975.4
- Location
- LGAM02016867.1:7072898-7076563[+]
Transcription Factor Domain
- TF Family
- zf-NF-X1
- Domain
- zf-NF-X1 domain
- PFAM
- PF01422
- TF Group
- Zinc-Coordinating Group
- Description
- This domain is presumed to be a zinc binding domain. The following pattern describes the zinc finger. C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C Where X can be any amino acid, and numbers in brackets indicate the number of residues. Two position can be either his or cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 20 1.5 1.6e+04 -3.8 2.1 15 19 174 178 173 178 0.82 2 20 0.39 4.1e+03 -1.8 0.4 4 10 209 215 209 215 0.93 3 20 5.5e-09 5.7e-05 23.2 15.1 1 19 223 240 223 240 0.98 4 20 1.5 1.6e+04 -3.7 1.1 5 10 265 270 265 270 0.91 5 20 5.1e-05 0.53 10.6 13.8 1 19 276 295 276 295 0.96 6 20 0.91 9.4e+03 -3.0 0.8 6 10 320 324 319 324 0.91 7 20 4.4e-08 0.00045 20.4 13.7 1 19 330 348 330 348 0.99 8 20 1.3 1.4e+04 -3.5 0.4 1 8 373 379 373 380 0.59 9 20 7.6e-08 0.00078 19.6 10.6 1 18 383 400 383 401 0.96 10 20 0.22 2.3e+03 -1.0 0.5 1 5 410 414 410 419 0.75 11 20 2 2.1e+04 -5.0 2.3 5 10 424 429 424 429 0.91 12 20 0.00014 1.4 9.2 15.1 1 18 435 453 435 454 0.92 13 20 2.7e-08 0.00028 21.0 15.5 1 19 463 481 457 481 0.87 14 20 0.006 62 4.0 6.1 9 18 526 535 524 536 0.89 15 20 0.037 3.9e+02 1.4 4.7 3 11 549 557 539 558 0.86 16 20 0.89 9.2e+03 -3.0 1.4 6 10 618 622 617 622 0.88 17 20 5.9e-05 0.61 10.4 6.6 1 12 628 639 628 639 0.97 18 20 2 2.1e+04 -4.7 2.2 15 19 652 656 652 656 0.84 19 20 0.052 5.4e+02 1.0 18.6 3 18 668 683 667 684 0.93 20 20 0.0001 1.1 9.6 16.4 1 16 735 749 735 755 0.92
Sequence Information
- Coding Sequence
- atgtctgtTTTTTCTGCTAAGAACGAAAAAATGAAAGGAGGCTCCAAGTCAAATCAACAAAAACCGAATGGACCAACTCGTTTTGAAGAAGTACATGCCAGAAATATAGCTGCCGCAAAAAAGATAGTAGAAAAATATTCTTCTAGCTCGGATGAAGAGGAAGAGGAGCTTAATGAATCAAAAATATTAGATTCCCTATTTAAGCATTACAAGTCTGACGACAGCCGTTTAGGTATACAACAAGCGCTACAACAAAAAACTGCTGCCTTCTTTGAAAACGCTTTGCACTCAGGATCAGCGACTTGTCTAATTTGTATTGGTAGCGTGCGCCGAGCTGACTCGATATGGACTTGTAAACATTGCTATTGTTTCTTCCATTTGAACTGTATACGACGATGGGCAAATGATAGTATCGCGCAGCTGAAAGCGTCCGCTGAACAAACAAGCAATGAACAGGGTTACTACAATAATGTTGGACAATTCGTACCGCCAAAGCGGAAGCGACCATTGCACTGGAGTTGTCCGCAGTGTCGCAAAGATTACAGTTTGGAAGAGAAACCGGCCACTTATAAGTGCTTTtgtgaaaaagaagaaaatccaCGTCCTGGAGCTTTTATATTGCCGCATTCTTGTGGAGAAATGTGCGGTAAAAATTTGCAACCAACATGTGGTCATACATGTATGTTACTTTGTCATCCGGGTCCATGTCCGCCCTGTTCTCAATATGCCTCTACCAGTTGTCTATGTGGTCAATCTCCCAAAAAATCTGTGCGTTGTATAGATAAACAGTGGAAATGTGACAGAAAGTGCAAAGAATTACTGCCCTGTGGCGAACATCTATGTAAGGAAATATGTCATAAGCCCAATCAGTGTCCACCATGTACCAGTACCAGTTTACAACCGTGCGAATGTGGTGCTGAGACAAAGAAACGTAATTGTTCTGAACTTAAATGGCATTGCAAAAAGGTTTGCGGTTCTAAATATTCATGTGGGGCACACGTGTGTAAACGCGTATGCCACTCGGGACCTTGCGGAGATTGCCCTTTAAGTTTACCACGTTCATGTCCATGTGGTAAAACCCAAAAAATCGTACCCTGTATCGAAACCATTGATCCTTGCGGTGATACTTGTCAAAAACTATTATCGTGTGGGCTGCATACTTGCACACAACGTTGCCATCGTGGCGAGTGTAATTTGtgtttaattattacaaaaaagaaaTGTCGCTGTGGCATACATGAAAAGGAATTGCCGTGTTGGAAAATCTTTACATGTGAAACCAAATGCAAGCAAATGCGAGATTGTGGAAAGCACACTTGTAACAAAAagTGTTGTGATGGCCGCAGTTGTCAACAATGTGATAAAGTATGTGGAAAACCACTCACTTGTCAAAAACACAAGTGCCAGTCGGTTTGTCATGAAGGACCCTGTTATCCTTGTAGTCAACAATCACAAGTAAATTGTCGTTGTGGAAAAACATCAAAACGTGTGCCTTGTGGACGTGAACGCACAGCACGCGTTATGTGCATGGAATTATGCCGTATTCCTTCAAAATGTCACCATCCAATCAAACATCGCTGTCATAAAAACGAATGTCCTCCTTGTAATCAAAAATGTGGACAGGTAAATGATACCACTGGGTGTCTGCATATATGCGAAGCTAAATGCCATGCCGCTGttaaagttttcaaacaaaatcCGCTTAACGGCGCTGCAAATGTGTGGCATCAAGGCAAAcagTTTGAATTCAAAAAGCAACCGCATCCACCATGTGAACAATTGGTGAAAGTTACATGTATTGGTGGCCATGAAATAGCTGATTGGCCTTGTTGGAATTCAAAACCTACATCCTGTCAACGCAAATGTAATCGTGCATTGCGTTGTGGTAACCACAAATGTGGGCTTATATGCCATTCCGTGCCCGACTTGAAAGATATGAAGgAGCAACTCGGCTGCGCACCATGCCAAGATGGCTGCAATATACCCCGTCCAGCGGGTTGTGAGCATGCTTGTCCTCGACCTTGTCACGCTCCGCCATGTCATCCATGTGACAAaatgatcaaaaacaaatgttaTTGTGGTTTAACACAATTAATCTACAAGTGTTCAGAATATTTTAGAGTCGAAGGAACCAAAGAGGAAATTGCTTTAATACAGGAACGCCTGAACAGCTGCGGAAATCGTTGCCTCAAAACGTTTCCATGCGGTCATCGCTGTCATACACCTTGTCATCCGGGAAAGTGTCCAAACCCGGAGTTGTGTCGCAAAAAAGTACGTATATTTTGCGAATGTAAACGTTTAAAAGCTGAAATCGCCTGTGATAAGCATCGTGCTGGTCAAACATCAATTCCTTGTGATGAGTTTTGCATTGAGACACGCATCAAGTTGGCAGAACAGTTAAAACGGGAACAGGAAAAATTACGCCAACAAGAAGAGGCGAAAAATCGTGCAGAAGTTGAACAATTTGAGAAGAGATTTAGCAAACGTAAATACAAAGAACGTAAGGTCGTAGTAGAAAAAACAGATAGGCAGATAAATTGGAAATTATTAAGCATTTATGCTGGTATTATACTAGCAATCGTTTTAGCTATTGCTGTAGCTTTCTATGCCGATAGTTAA
- Protein Sequence
- MSVFSAKNEKMKGGSKSNQQKPNGPTRFEEVHARNIAAAKKIVEKYSSSSDEEEEELNESKILDSLFKHYKSDDSRLGIQQALQQKTAAFFENALHSGSATCLICIGSVRRADSIWTCKHCYCFFHLNCIRRWANDSIAQLKASAEQTSNEQGYYNNVGQFVPPKRKRPLHWSCPQCRKDYSLEEKPATYKCFCEKEENPRPGAFILPHSCGEMCGKNLQPTCGHTCMLLCHPGPCPPCSQYASTSCLCGQSPKKSVRCIDKQWKCDRKCKELLPCGEHLCKEICHKPNQCPPCTSTSLQPCECGAETKKRNCSELKWHCKKVCGSKYSCGAHVCKRVCHSGPCGDCPLSLPRSCPCGKTQKIVPCIETIDPCGDTCQKLLSCGLHTCTQRCHRGECNLCLIITKKKCRCGIHEKELPCWKIFTCETKCKQMRDCGKHTCNKKCCDGRSCQQCDKVCGKPLTCQKHKCQSVCHEGPCYPCSQQSQVNCRCGKTSKRVPCGRERTARVMCMELCRIPSKCHHPIKHRCHKNECPPCNQKCGQVNDTTGCLHICEAKCHAAVKVFKQNPLNGAANVWHQGKQFEFKKQPHPPCEQLVKVTCIGGHEIADWPCWNSKPTSCQRKCNRALRCGNHKCGLICHSVPDLKDMKEQLGCAPCQDGCNIPRPAGCEHACPRPCHAPPCHPCDKMIKNKCYCGLTQLIYKCSEYFRVEGTKEEIALIQERLNSCGNRCLKTFPCGHRCHTPCHPGKCPNPELCRKKVRIFCECKRLKAEIACDKHRAGQTSIPCDEFCIETRIKLAEQLKREQEKLRQQEEAKNRAEVEQFEKRFSKRKYKERKVVVEKTDRQINWKLLSIYAGIILAIVLAIAVAFYADS
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00191050;
- 90% Identity
- iTF_01564223;
- 80% Identity
- -