Llin027726.1
Basic Information
- Insect
- Lygus lineolaris
- Gene Symbol
- nfxl1
- Assembly
- GCA_030264115.1
- Location
- JAEMON010000015.1:21668074-21675856[+]
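The location string above packs contig, start, end, and strand into one field. A minimal sketch of splitting it apart follows, assuming the layout is `contig:start-end[strand]` and that coordinates are 1-based and inclusive. The record does not state the coordinate convention, and `Location` and `parse_location` are hypothetical names used only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Assumed layout: <contig>:<start>-<end>[<strand>], strand being + or -.
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("JAEMON010000015.1:21668074-21675856[+]")
# Genomic span in bp, under the assumption of 1-based inclusive coordinates.
span = loc.end - loc.start + 1
print(loc, span)
```

The span computed this way covers the whole locus on the contig, so it is expected to be longer than the coding sequence if the gene contains introns.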
Transcription Factor Domain
- TF Family
- zf-NF-X1
- Domain
- zf-NF-X1 domain
- PFAM
- PF01422
- TF Group
- Zinc-Coordinating Group
- Description
- This domain is presumed to be a zinc-binding domain. The following pattern describes the zinc finger: C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C, where X can be any amino acid and numbers in brackets indicate the number of residues. Two positions can be either His or Cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
- Hmmscan Out
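| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|----|----------|----------|-------|------|----------------|--------------|----------------|--------------|----------------|--------------|-----|
| 1 | 20 | 1.7 | 2.6e+04 | -3.8 | 3.2 | 14 | 19 | 166 | 171 | 166 | 171 | 0.87 |
| 2 | 20 | 0.067 | 1.1e+03 | 0.6 | 0.4 | 4 | 10 | 202 | 208 | 201 | 208 | 0.96 |
| 3 | 20 | 1.8e-08 | 0.00028 | 21.6 | 16.9 | 1 | 19 | 216 | 233 | 216 | 233 | 0.99 |
| 4 | 20 | 3.4e-07 | 0.0053 | 17.5 | 14.9 | 1 | 19 | 269 | 287 | 269 | 287 | 0.93 |
| 5 | 20 | 0.57 | 9e+03 | -2.4 | 0.9 | 5 | 10 | 311 | 316 | 311 | 316 | 0.94 |
| 6 | 20 | 5.6e-05 | 0.88 | 10.4 | 12.6 | 1 | 19 | 322 | 340 | 322 | 340 | 0.95 |
| 7 | 20 | 1.1e-07 | 0.0017 | 19.1 | 11.5 | 1 | 18 | 375 | 392 | 375 | 393 | 0.98 |
| 8 | 20 | 0.5 | 7.9e+03 | -2.2 | 0.3 | 1 | 5 | 402 | 406 | 402 | 406 | 0.93 |
| 9 | 20 | 6.4e-07 | 0.01 | 16.6 | 16.9 | 1 | 18 | 427 | 444 | 427 | 445 | 0.97 |
| 10 | 20 | 2.5e-10 | 3.9e-06 | 27.5 | 13.0 | 1 | 18 | 454 | 471 | 454 | 472 | 0.97 |
| 11 | 20 | 0.63 | 1e+04 | -2.5 | 2.6 | 5 | 10 | 500 | 505 | 490 | 505 | 0.48 |
| 12 | 20 | 0.00013 | 2 | 9.3 | 6.8 | 8 | 19 | 519 | 530 | 518 | 530 | 0.88 |
| 13 | 20 | 0.00046 | 7.3 | 7.5 | 6.0 | 1 | 12 | 539 | 549 | 539 | 549 | 0.96 |
| 14 | 20 | 0.62 | 9.7e+03 | -2.5 | 4.9 | 14 | 18 | 578 | 582 | 578 | 583 | 0.92 |
| 15 | 20 | 1.4 | 2.2e+04 | -3.6 | 1.1 | 9 | 12 | 600 | 603 | 599 | 613 | 0.62 |
| 16 | 20 | 0.00023 | 3.6 | 8.5 | 3.4 | 1 | 11 | 619 | 629 | 619 | 630 | 0.97 |
| 17 | 20 | 0.0077 | 1.2e+02 | 3.6 | 18.2 | 1 | 19 | 657 | 674 | 657 | 674 | 0.93 |
| 18 | 20 | 2 | 3.1e+04 | -4.3 | 1.7 | 8 | 13 | 679 | 684 | 678 | 684 | 0.73 |
| 19 | 20 | 4.5e-06 | 0.07 | 13.9 | 11.3 | 1 | 16 | 721 | 735 | 721 | 741 | 0.89 |
| 20 | 20 | 0.41 | 6.4e+03 | -1.9 | 0.1 | 8 | 11 | 757 | 760 | 756 | 764 | 0.71 |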
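The hmmscan domain rows above each carry 13 whitespace-separated values, so they are straightforward to parse and filter. The sketch below reads a few of those rows (copied from the table) and keeps the hits whose independent E-value falls below a cutoff. The cutoff of 0.01 is an illustrative assumption, not a threshold stated in this record, and `DomainHit` and `parse_row` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line: str) -> DomainHit:
    """Parse one whitespace-separated hmmscan domain row (13 fields)."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


# A subset of rows copied from the table in this record.
ROWS = """
1 20 1.7 2.6e+04 -3.8 3.2 14 19 166 171 166 171 0.87
3 20 1.8e-08 0.00028 21.6 16.9 1 19 216 233 216 233 0.99
4 20 3.4e-07 0.0053 17.5 14.9 1 19 269 287 269 287 0.93
9 20 6.4e-07 0.01 16.6 16.9 1 18 427 444 427 445 0.97
10 20 2.5e-10 3.9e-06 27.5 13.0 1 18 454 471 454 472 0.97
"""

I_EVALUE_CUTOFF = 0.01  # assumed cutoff, chosen only for illustration

hits = [parse_row(line) for line in ROWS.strip().splitlines()]
confident = [h for h in hits if h.i_evalue < I_EVALUE_CUTOFF]

for h in confident:
    print(f"domain {h.index}: residues {h.ali_from}-{h.ali_to}, score {h.score}")
```

The comparison is strict, so a row whose i-Evalue equals the cutoff is excluded. A different cutoff, or filtering on score instead, may suit other uses better.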
Sequence Information
- Coding Sequence
- atGTCTCAGAGAAACCGTGGAAACCCTTGGGGGCAGGGTCCAACCAACGCAAAAGCCCCCAAGAAGAAAACGAACCCCTCAGCTGCCGCTCCTGCCAAGAGATCAGCTGAAGACAGATTCCGTCAAGCTCAAGCTAAGTTGCAGGAGTCGGTTCAAAAACATCTCGACCAAGAATTGGACCTTTCTTCGGAAGAAGAAGATTTGGAAACTGACAACATTTACGatgcTGTCATAAAGAACTACTCTCAACTTGGCGGAAGAAATGAAGATCTCGGGAAAACgcaaaattttctcgaaaatatcTTCCGGTCAGATGTCGCCATCTGTTTGATATGTATTGGGACGATTAAAAGAGCTGACCCTATTTGGAGCTGCTGCTCGTGCTATGGGTTTTTTCACCTACTTTGCATTCAAAGATGGGGAAAGGACAGCATTGCCCATCAGAAGCTGGCTGAAGAAGAAAGATTGGCTACTTCCAAGAAGGAATATTTCTGGCCATGcccaaaatgtaGGAAAAACTATACTCCAGATAGTGTTCCTCAACGGTACGAGTGTTTCTGCAGCCGGGTCTTGGATCCTCCCTATCAGCCTTGGCTCGTTCCACATTCCTGCGGTGAAGTATGTGGCAAGGACTTGGTACCCAGCTGCGGGCATACGTGTCTGTTGCTGTGCCACCCAGGTCCTTGTCCGCCGTGCCCCAAAATGATCAGAGCCAAATGCTACTGCTCTAATGGAGGAGACAAAATGGTGAGGTGCTGCTACAAAGAATGGTCGTGCGGTCAGAAATGCAGTAAGCTCCTCAGTTGTGAAAAGCACAAGTGCGAAACGGTTTGCCATCCTGGAGAGTGTCCAGCTTGTCCAAAAACATCAATGATGACCTGCGATTGTAAAAGCCAATCTCGCCTCACTCCCTGCGCTGAGCTTTCATGGAAATGCGACAAGgtttgcaataaattattggactGTAACAAACACAGTTGTGATCTAGTCTGCCACAAAGAAGAGTGTGGACCTTGTCCACGCACCCTGCCCAGGACTTGTCCCTGCGGGTCTGAGGAAACGGTCATCCCCTGCACTCAGGAAGTTGCCACGTGTGGCAACACTTGCGGCAAATTGTTGGATTGTGGCCTTCATTCGTGCAACCAGCGGTGCCACCGAGGGTCCTGCGGCGCCTGCATGGAGTTCCTGGAGAAGTCGTGCCGTTGCGGTCTCCACAAGAAAGAAATTGCCTGTATCAAAGAATACCTGTGCGACACCAAGTGTAAAGGAACTCGAGATTGTGGCATCCATCCTTGCAATAGAAAATGTTGCGATGGCCGGTGTCCTCCATGTGAGAAACTTTGCGGCAAAACACTCCGGTGTGGTCAACACAAGTGTGCTTCAGTCTGCCATCGTGGACCCTGCTACCCCTGTGAACAGACCAAGCAAATTTCCTGTCGGTGTGGCGCTACTGTCGTCACAGTCCCCTGTGGAACTCATCGAAAAAAGAAGAACCTGAAATGTAACAAGTTGTGCATGATCCCTCCTGAATGTCACCATCCTAAAAGGGAAGAGCATAAATGTCATATGGGCCCCTGCCCTTCGTGCCGACAAATCTGTGACAAACCTCAGGAGTGCGGTCACCGTTGCCCTCTGCCCTGTCACTCTTCTGTCCTGGTCAATTCAGTTCCAAACTACAAGCCAGCGACACCCTGGGAAAGAGTGACTGCTCCAGTTGAAATCAAAGAACTCCCTTGTCCTCCATGTGTGGTCCCTGTCGGAGTATGGTGTCCCGGTGGCCACGAAGAGATCTCCTTACCCTGTCATCGGGCTGTTTCGCAGCAGTGTGGCAGAAACTGTGGCCGAATCCTTGCCTGCTCTAACCACGAGTGTGCTTTACCTTGCCATCTAGTCTCTGAAGCTCCGGACAAAGAGAGCGCAGGAAAGGAGTGTATGGAATGTGGAGAAGAATGTGAAGCACCCCGGCTATGCTCCCACGAATGTCCTCAGTCGTGTCAT
CCTCCACCATGCCCAGAATGTACAGTGACTCTCAAGCAGCGATGTCACTGTGGCATCACATTCACTTATGTCAAATGCTGCGAATGGACCTCTCCAACTTTGGATGACGAAACTCGAAAGGCGATGCAATCCTGCAACAATCAGTGTCCAAAAACCTACCTTTGTGGACATCGGTGCACTGACAACTGTCACGAAGGAGAATGTCCAGGGGCGGCCATGTGCAGGAAGAGGATGAAGTTGACCTGCAACTGCAAACGGATCAAGAAAGAAGTCGTGTGCCACGTCTCTGGAGAGTCTGCTCCAGTTTGCGATGATGTTTGCAAATCAACCAAAGCTCAAATTAAtgagAAAgaagaaaaggagaaggaaaaacagcGCCTCCTCGAAGAGAAGCGCAATAAAGAGGAATTGGAAAAGTACGAGAAGAAACTTCAAGGGGCCAAAAGACATCGGAAGAAGCGGATCCTGGAAGAAGAAGACTCCAAGAGTTttttgtcctcgaaaatcaTACTCATCACTGGAGTCGTTCTGGTTGTATCAATCGGCACGTTTTTTATTCTCAAATAG
- Protein Sequence
- MSQRNRGNPWGQGPTNAKAPKKKTNPSAAAPAKRSAEDRFRQAQAKLQESVQKHLDQELDLSSEEEDLETDNIYDAVIKNYSQLGGRNEDLGKTQNFLENIFRSDVAICLICIGTIKRADPIWSCCSCYGFFHLLCIQRWGKDSIAHQKLAEEERLATSKKEYFWPCPKCRKNYTPDSVPQRYECFCSRVLDPPYQPWLVPHSCGEVCGKDLVPSCGHTCLLLCHPGPCPPCPKMIRAKCYCSNGGDKMVRCCYKEWSCGQKCSKLLSCEKHKCETVCHPGECPACPKTSMMTCDCKSQSRLTPCAELSWKCDKVCNKLLDCNKHSCDLVCHKEECGPCPRTLPRTCPCGSEETVIPCTQEVATCGNTCGKLLDCGLHSCNQRCHRGSCGACMEFLEKSCRCGLHKKEIACIKEYLCDTKCKGTRDCGIHPCNRKCCDGRCPPCEKLCGKTLRCGQHKCASVCHRGPCYPCEQTKQISCRCGATVVTVPCGTHRKKKNLKCNKLCMIPPECHHPKREEHKCHMGPCPSCRQICDKPQECGHRCPLPCHSSVLVNSVPNYKPATPWERVTAPVEIKELPCPPCVVPVGVWCPGGHEEISLPCHRAVSQQCGRNCGRILACSNHECALPCHLVSEAPDKESAGKECMECGEECEAPRLCSHECPQSCHPPPCPECTVTLKQRCHCGITFTYVKCCEWTSPTLDDETRKAMQSCNNQCPKTYLCGHRCTDNCHEGECPGAAMCRKRMKLTCNCKRIKKEVVCHVSGESAPVCDDVCKSTKAQINEKEEKEKEKQRLLEEKRNKEELEKYEKKLQGAKRHRKKRILEEEDSKSFLSSKIILITGVVLVVSIGTFFILK
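The zinc finger pattern quoted in the domain description can be approximated as a regular expression and run against a protein sequence such as the one above. This is only a sketch under one reading of the pattern: `C(H/C)` is taken to mean a Cys followed by a His-or-Cys position, and `X3` to mean exactly three residues. The names `ZF_REGEX` and `find_candidates` are hypothetical, and the demo sequence is synthetic, not taken from this record.

```python
import re

# Assumed reading of: C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C
# X is any residue, written here as any uppercase letter.
ZF_REGEX = re.compile(
    r"C[A-Z]{1,6}H[A-Z]C[A-Z]{3}C[HC][A-Z]{3,4}[HC][A-Z]{1,10}C"
)


def find_candidates(protein: str) -> list[tuple[int, int, str]]:
    """Return (start, end, motif) for non-overlapping matches.

    Coordinates are 1-based and inclusive.
    """
    seq = protein.strip().upper()
    return [
        (m.start() + 1, m.end(), m.group())
        for m in ZF_REGEX.finditer(seq)
    ]


# Synthetic demo sequence built to satisfy the pattern once.
demo = "MKCAHACAAACHAAACACGG"
print(find_candidates(demo))
```

A regular expression is a much coarser model than the profile HMM behind the hmmscan hits, so matches found this way need not coincide with the domain coordinates reported in the table.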
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00146088;
- 90% Identity
- iTF_00146088;
- 80% Identity
- -