Aluc018593.1
Basic Information
- Insect
- Apolygus lucorum
- Gene Symbol
- nfxl1
- Assembly
- GCA_009739505.2
- Location
- CM019170.2:38803468-38833961[+]
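The Location value packs the sequence accession, coordinates and strand into a single string. A minimal sketch of how it might be split apart is shown below. The function name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re

# Assumed layout: SEQID:START-END[STRAND], e.g. CM019170.2:38803468-38833961[+]
LOCATION_RE = re.compile(
    r"(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]"
)

def parse_location(location: str) -> dict:
    """Split a location string into sequence ID, coordinates and strand."""
    match = LOCATION_RE.fullmatch(location.strip())
    if match is None:
        raise ValueError(f"Unrecognized location format: {location!r}")
    start = int(match["start"])
    end = int(match["end"])
    return {
        "seqid": match["seqid"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }

loc = parse_location("CM019170.2:38803468-38833961[+]")
print(loc)
```

Under the inclusive-coordinate assumption, the gene locus spans 30,494 bp on the plus strand of CM019170.2.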
Transcription Factor Domain
- TF Family
- zf-NF-X1
- Domain
- zf-NF-X1 domain
- PFAM
- PF01422
- TF Group
- Zinc-Coordinating Group
- Description
- This domain is presumed to be a zinc-binding domain. The following pattern describes the zinc finger: C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C, where X can be any amino acid and the numbers in brackets indicate the number of residues. Two positions can be either His or Cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
- Hmmscan Out
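| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 20 | 2.5 | 1.7e+04 | -3.9 | 3.2 | 14 | 19 | 189 | 194 | 189 | 194 | 0.87 |
| 2 | 20 | 0.098 | 6.4e+02 | 0.6 | 0.5 | 4 | 10 | 225 | 231 | 224 | 231 | 0.96 |
| 3 | 20 | 2.8e-08 | 0.00018 | 21.6 | 16.9 | 1 | 19 | 239 | 256 | 239 | 256 | 0.99 |
| 4 | 20 | 3 | 2e+04 | -9.2 | 6.9 | 11 | 11 | 282 | 282 | 275 | 289 | 0.53 |
| 5 | 20 | 6.3e-07 | 0.0041 | 17.2 | 15.3 | 1 | 19 | 292 | 310 | 292 | 310 | 0.93 |
| 6 | 20 | 0.88 | 5.8e+03 | -2.4 | 0.9 | 5 | 10 | 334 | 339 | 334 | 339 | 0.94 |
| 7 | 20 | 1.5e-06 | 0.0099 | 16.0 | 13.3 | 3 | 19 | 347 | 363 | 345 | 363 | 0.93 |
| 8 | 20 | 6.9e-08 | 0.00045 | 20.3 | 12.7 | 1 | 18 | 398 | 415 | 398 | 416 | 0.98 |
| 9 | 20 | 0.78 | 5.1e+03 | -2.2 | 0.3 | 1 | 5 | 425 | 429 | 425 | 429 | 0.93 |
| 10 | 20 | 9.9e-07 | 0.0064 | 16.6 | 16.9 | 1 | 18 | 450 | 467 | 450 | 468 | 0.97 |
| 11 | 20 | 3.8e-10 | 2.5e-06 | 27.5 | 13.0 | 1 | 18 | 477 | 494 | 477 | 495 | 0.97 |
| 12 | 20 | 0.88 | 5.7e+03 | -2.4 | 2.4 | 5 | 10 | 523 | 528 | 513 | 528 | 0.48 |
| 13 | 20 | 0.0002 | 1.3 | 9.2 | 6.8 | 8 | 19 | 542 | 553 | 541 | 553 | 0.88 |
| 14 | 20 | 0.2 | 1.3e+03 | -0.3 | 8.8 | 3 | 12 | 563 | 572 | 556 | 572 | 0.77 |
| 15 | 20 | 1.4 | 8.9e+03 | -3.0 | 5.8 | 14 | 18 | 601 | 605 | 601 | 606 | 0.89 |
| 16 | 20 | 2.1 | 1.4e+04 | -3.6 | 1.4 | 9 | 12 | 623 | 626 | 622 | 636 | 0.53 |
| 17 | 20 | 0.00035 | 2.3 | 8.5 | 3.4 | 1 | 11 | 642 | 652 | 642 | 653 | 0.97 |
| 18 | 20 | 0.0014 | 9.4 | 6.5 | 17.3 | 1 | 19 | 680 | 697 | 680 | 697 | 0.93 |
| 19 | 20 | 2.7e-05 | 0.18 | 12.0 | 10.7 | 1 | 16 | 744 | 758 | 744 | 764 | 0.89 |
| 20 | 20 | 2.3 | 1.5e+04 | -3.7 | 0.2 | 9 | 11 | 781 | 783 | 781 | 786 | 0.78 |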
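Most of the 20 reported zf-NF-X1 domain hits are weak. The sketch below shows one way the rows above could be parsed and filtered by independent E-value. The 0.01 cutoff, the `DomainHit` type and the function names are illustrative assumptions, not thresholds or code used by this database.

```python
from typing import List, NamedTuple

class DomainHit(NamedTuple):
    domain: int      # value of the '#' column
    i_evalue: float  # independent E-value
    score: float     # bit score
    ali_from: int    # alignment start on the protein
    ali_to: int      # alignment end on the protein

# Rows copied from the Hmmscan Out record; columns are:
# #, of, c-Evalue, i-Evalue, score, bias, hmm from, hmm to,
# ali from, ali to, env from, env to, acc
RAW_ROWS = """
1 20 2.5 1.7e+04 -3.9 3.2 14 19 189 194 189 194 0.87
2 20 0.098 6.4e+02 0.6 0.5 4 10 225 231 224 231 0.96
3 20 2.8e-08 0.00018 21.6 16.9 1 19 239 256 239 256 0.99
4 20 3 2e+04 -9.2 6.9 11 11 282 282 275 289 0.53
5 20 6.3e-07 0.0041 17.2 15.3 1 19 292 310 292 310 0.93
6 20 0.88 5.8e+03 -2.4 0.9 5 10 334 339 334 339 0.94
7 20 1.5e-06 0.0099 16.0 13.3 3 19 347 363 345 363 0.93
8 20 6.9e-08 0.00045 20.3 12.7 1 18 398 415 398 416 0.98
9 20 0.78 5.1e+03 -2.2 0.3 1 5 425 429 425 429 0.93
10 20 9.9e-07 0.0064 16.6 16.9 1 18 450 467 450 468 0.97
11 20 3.8e-10 2.5e-06 27.5 13.0 1 18 477 494 477 495 0.97
12 20 0.88 5.7e+03 -2.4 2.4 5 10 523 528 513 528 0.48
13 20 0.0002 1.3 9.2 6.8 8 19 542 553 541 553 0.88
14 20 0.2 1.3e+03 -0.3 8.8 3 12 563 572 556 572 0.77
15 20 1.4 8.9e+03 -3.0 5.8 14 18 601 605 601 606 0.89
16 20 2.1 1.4e+04 -3.6 1.4 9 12 623 626 622 636 0.53
17 20 0.00035 2.3 8.5 3.4 1 11 642 652 642 653 0.97
18 20 0.0014 9.4 6.5 17.3 1 19 680 697 680 697 0.93
19 20 2.7e-05 0.18 12.0 10.7 1 16 744 758 744 764 0.89
20 20 2.3 1.5e+04 -3.7 0.2 9 11 781 783 781 786 0.78
"""

def parse_hits(raw: str) -> List[DomainHit]:
    """Turn whitespace-separated domain rows into DomainHit records."""
    hits = []
    for line in raw.strip().splitlines():
        f = line.split()
        hits.append(
            DomainHit(int(f[0]), float(f[3]), float(f[4]), int(f[8]), int(f[9]))
        )
    return hits

def significant(hits: List[DomainHit], max_i_evalue: float = 0.01) -> List[DomainHit]:
    """Keep hits whose i-Evalue is below the (assumed) cutoff."""
    return [h for h in hits if h.i_evalue < max_i_evalue]

strong = significant(parse_hits(RAW_ROWS))
best = max(strong, key=lambda h: h.score)
for h in strong:
    print(h.domain, h.ali_from, h.ali_to, h.i_evalue, h.score)
```

With this cutoff, six hits remain (domains 3, 5, 7, 8, 10 and 11), and domain 11 at alignment positions 477-494 has the highest score, 27.5. A different cutoff would change which hits are kept.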
Sequence Information
- Coding Sequence
- atgataACACGTCGATTATCGTGGAGCCGCGGAAGTTGGCGTTTACTCGTTAATCAAACGATCGAAATGTCTCAACGGAACCGCGGCAACCCTTGGGGACAGGGTCAGTCCAGCGCGAAAACCCCGAAGAAGAAGACGAACCCCCCGGCTGCTCCAGTTCCCGCTAAGAAGTCAGCCGAAGACAGATTCCGTCAAGCTCAAGCCAAACTGCAGGAGTCCGTTCAGAAACACCTCGACCAAGAGTTGGACCTTTCTTCGGAAGAAGAGGATTTGGAAACTGACAATATTTATGACGCTGTCATAAAGAACTACTCTCAATTGGGCGGAAAAAACGAAGATCTcgggaaaactcaaaattttctcgaaaatatcTTCCGGTCAGACGTTGCCATTTGTTTGATATGCATTGGAACCATTAAAAGAGCTGACCAGatttgGAGCTGTTGCTCGTGCTATGGTTTTTTCCACCTGCTGTGCATACAGAGATGGGGGAAAGACAGCATCGCCCATCAGAAGCTGGCTGAAGAAGAGAGATTGGCGACTTCTAAGAAGGAATATTTCTGGCCCTGCCCGAAGTGCAGGAAAAACTATTCTCCTGACAGCGTTCCTCAACGATATGAGTGTTTCTGCAGTCAAGTCCTGGATCCACCTTACCAACCTTGGCTCGTTCCACATTCATGCGGTCAAGTATGCGGGAAGGCCTTAGTTCCCAGTTGCGGGCATACATGTCTGTTGCTGTGTCACCCAGGTCCATGTCCGCCATGCCCCAAAATGATCAGAGCCAAATGCTACTGCTCGAACGGAGGAGACAAGATGGTGCGGTGCTGCCACAAAGAATGGTGTTGCGGGCAGAAGTGCAGCAAACTCCTCATTTGTCAAAAGCATAAATGCGAAACGGTGTGCCACCCTGGTGAGTGTCCAGCTTGCCCGAAAACGTCAACGATGCCTTGTGATTGTAAAAGTCAGTCTCGTCTCACTCCTTGCGCTGAGCTCTCATGGAAATGCGATAAGGTTTGCAGTAAATTATTAGACTGTAAGAAACATAGTTGCGATCTCGTCTGCCACCGAGGAGAATGCGGCCCTTGTCCGCGCGCCCTCCCCAGAACGTGTCCCTGCGGTGCTGAAGAGTCCATCATTCCTTGTACTCAGGAAGTTGCCACGTGCGGCAACACTTGCGGCAAATTGTTGGACTGCGGTCTTCACTCGTGCAACCAGCGGTGCCACCGAGGCCCCTGTGGTGCCTGCATGGAGTTCCTGGAGAAGTCTTGTCGGTGCGGTCTTCACAAGAAAGAAATCGCCTGCATAAAGGAGTACCTGTGTGACACCAAGTGCAAAGGAACTCGAGATTGTGGCATCCATCCTTGCAATAGAAAATGTTGCGATGGCCGCTGTCCCCCGTGTGAGAAGCTGTGCGGGAAAACACTCCGGTGTGGTCAACACAAGTGCGCCTCAGTCTGCCATCGCGGTCCCTGCTACCCCTGCGAGCAGACCAAGCAAATTTCTTGTCGGTGCGCCTCGACCGTCATCACAGTCCCCTGCGGAACCCATCGGAAGAAAAAGAACTTGAAGTGCAACAAGTTGTGCATGATTCCTCCCGAATGTCACCATCCTAAAAGGGAAGAACACAAATGTCACATGGGCCCTTGCCCTTCGTGTCGACAGGTTTGCGGCAAACCGCGGGACTGCCATCACTCCTGCCCTCTGCCCTGTCACTCGTCTGTCCTCGTCAATTCAGTTCCTAACTACAAGCCAGCCACGCCATGGGAGcgaGTAACTGCTCCACTTGAAATCAAAGAACTCCCGTGCCCACCGTGCGAGGTTCCGGTGGGAGTGTGGTGTCCAGGCGGCCACGAAGAGATCTCCTTGCCTTGTCACCGAGCTGTTTCTCAGCAGTGCGGCAGAAGCTGCGGACGAGTCCTCGCCTGCTCCAATCACGAGTGTGCTCTGCCTTGTCACCTAGTCACTGAAGCTGCTGATAATGAGAGCGCGGGAAAGGAG
TGTATGGAATGCAATGAAGAATGTGAAGCTCCTCGCCTGTGCTCTCACGAATGCCCTCAGTTGTGTCATCCCCCACCGTGTCCAGAATGTACCGTCACGCTCAAACTCCGATGTCACTGTGGCATCACATTCACTTACGTCAAATGCTGCGAATGGACCTCTCCAACTTTGGAAGACGAGGCTCGAAAGGCGATGCAATCTTGCAACAATCAGTGCCCGAAAAATTACCCTTGCGGACATCGGTGTGTCGACAACTGTCACGAAGGAGAATGCCCTGGAGCAGCCATGTGCAGAAAGAGGATCAAGTTAACCTGCAACTGCAAACGGATCAAGAAAGAAATCGTCTGCCACGTTTCCGGAGAGTCCGCGCCAGTTTGTGACGACATTTGCAAATCAACCAAAGCTCAAATTAATGAGAAAGAAGCGAAGGAAAAGGAGAAACAGCGTCTCCTCGAAGAGAAGCGCAATAAAGAGGAATTGGAAAAGTACGAGAAGAAACTCCAAGGAGCCAAAAGACACCGGAAGAAGCGGatcttggaagaagaagaaaccaaAAGTCTCCTCTCCACTAAAATGATGATCATCGCTGGAGTCGTTTTAATTGTTTCGATAGGCGCGTTTTTTATCCTCAAATAA
- Protein Sequence
- MITRRLSWSRGSWRLLVNQTIEMSQRNRGNPWGQGQSSAKTPKKKTNPPAAPVPAKKSAEDRFRQAQAKLQESVQKHLDQELDLSSEEEDLETDNIYDAVIKNYSQLGGKNEDLGKTQNFLENIFRSDVAICLICIGTIKRADQIWSCCSCYGFFHLLCIQRWGKDSIAHQKLAEEERLATSKKEYFWPCPKCRKNYSPDSVPQRYECFCSQVLDPPYQPWLVPHSCGQVCGKALVPSCGHTCLLLCHPGPCPPCPKMIRAKCYCSNGGDKMVRCCHKEWCCGQKCSKLLICQKHKCETVCHPGECPACPKTSTMPCDCKSQSRLTPCAELSWKCDKVCSKLLDCKKHSCDLVCHRGECGPCPRALPRTCPCGAEESIIPCTQEVATCGNTCGKLLDCGLHSCNQRCHRGPCGACMEFLEKSCRCGLHKKEIACIKEYLCDTKCKGTRDCGIHPCNRKCCDGRCPPCEKLCGKTLRCGQHKCASVCHRGPCYPCEQTKQISCRCASTVITVPCGTHRKKKNLKCNKLCMIPPECHHPKREEHKCHMGPCPSCRQVCGKPRDCHHSCPLPCHSSVLVNSVPNYKPATPWERVTAPLEIKELPCPPCEVPVGVWCPGGHEEISLPCHRAVSQQCGRSCGRVLACSNHECALPCHLVTEAADNESAGKECMECNEECEAPRLCSHECPQLCHPPPCPECTVTLKLRCHCGITFTYVKCCEWTSPTLEDEARKAMQSCNNQCPKNYPCGHRCVDNCHEGECPGAAMCRKRIKLTCNCKRIKKEIVCHVSGESAPVCDDICKSTKAQINEKEAKEKEKQRLLEEKRNKEELEKYEKKLQGAKRHRKKRILEEEETKSLLSTKMMIIAGVVLIVSIGAFFILK
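The zinc finger pattern given in the Description can be approximated with a regular expression and scanned against the protein sequence. The sketch below is one possible translation, not the Pfam HMM itself. It assumes that "C(H/C)" means a Cys followed by a His-or-Cys position and that X matches any single residue. The name `find_fingers` is hypothetical, and the test fragment is a short excerpt copied from the protein sequence above.

```python
import re
from typing import List, Tuple

# Assumed reading of C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C:
#   C, 1-6 residues, H, 1 residue, C, 3 residues, C, [H or C],
#   3-4 residues, [H or C], 1-10 residues, C
ZF_NFX1_RE = re.compile(r"C.{1,6}H.C.{3}C[HC].{3,4}[HC].{1,10}C")

def find_fingers(protein: str) -> List[Tuple[int, int, str]]:
    """Return (start, end, motif) for non-overlapping matches.

    Coordinates are 1-based and inclusive, relative to the input string.
    """
    return [
        (m.start() + 1, m.end(), m.group())
        for m in ZF_NFX1_RE.finditer(protein.upper())
    ]

# Short excerpt of the protein sequence in this record.
fragment = "LVPSCGHTCLLLCHPGPCPPCPKMIRAK"
hits = find_fingers(fragment)
print(hits)
```

Because `finditer` reports only non-overlapping matches and the quantifiers are greedy, a regular expression like this can merge or skip adjacent fingers. Its results should be treated as a rough cross-check against the hmmscan domain hits, not as a replacement for them.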
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00930915; iTF_00386870; iTF_00884734; iTF_01366023;
- 90% Identity
- iTF_00930915;
- 80% Identity
- -