Xseg012030.1
Basic Information
- Insect
- Xylota segnis
- Gene Symbol
- nfxl1
- Assembly
- GCA_963583995.1
- Location
- OY757238.1:41750597-41754856[-]
Transcription Factor Domain
- TF Family
- zf-NF-X1
- Domain
- zf-NF-X1 domain
- PFAM
- PF01422
- TF Group
- Zinc-Coordinating Group
- Description
- This domain is presumed to be a zinc binding domain. The following pattern describes the zinc finger. C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C Where X can be any amino acid, and numbers in brackets indicate the number of residues. Two position can be either his or cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 20 2 1.8e+04 -5.2 2.6 15 19 161 165 161 165 0.87 2 20 0.15 1.3e+03 -0.5 0.8 4 10 196 202 195 202 0.95 3 20 2.4e-08 0.00022 21.2 14.8 1 18 210 226 210 227 0.98 4 20 1.1 1e+04 -3.3 1.7 5 10 252 257 252 257 0.93 5 20 3.9e-09 3.5e-05 23.7 14.9 1 19 263 282 263 282 0.98 6 20 2 1.8e+04 -4.1 2.5 1 10 300 311 300 311 0.67 7 20 1.1e-07 0.00095 19.1 21.0 1 19 317 335 317 335 0.99 8 20 0.92 8.3e+03 -3.0 0.9 5 10 359 364 359 364 0.93 9 20 5.2e-08 0.00047 20.1 10.8 1 18 370 387 370 388 0.96 10 20 0.31 2.8e+03 -1.5 0.4 1 5 397 401 397 406 0.75 11 20 2.7e-07 0.0024 17.9 16.9 1 18 422 439 422 440 0.98 12 20 2.7e-08 0.00024 21.1 13.0 3 19 451 467 443 467 0.85 13 20 0.81 7.3e+03 -2.9 1.3 6 10 495 499 494 499 0.86 14 20 0.0007 6.3 6.9 12.2 9 18 512 521 503 522 0.89 15 20 0.034 3.1e+02 1.5 5.4 3 11 535 543 525 544 0.85 16 20 0.33 2.9e+03 -1.6 0.9 6 10 603 607 602 607 0.90 17 20 2.2e-06 0.02 14.9 5.6 1 12 613 624 613 624 0.97 18 20 0.017 1.5e+02 2.5 19.4 4 18 654 668 652 669 0.93 19 20 0.58 5.2e+03 -2.4 0.7 8 13 674 679 673 680 0.78 20 20 1.6e-06 0.014 15.4 12.8 1 16 720 734 720 740 0.92
Sequence Information
- Coding Sequence
- atgtCAAACAAATCACTAAAACATGCTCAAAATGCGTCACAAGGGAGGGAAAAGTCCAACAAAATACCACAAAAGAAGAGCTTCACAGCAATTCAAGCTGAGAACATGGAGATGGCCAAGAAAATCACTGAAAACTATGAGTCCAGTTCGGATGAAGAACAAATCGATGGAGGAAAAATTTTAGaaacGTTGTACAAGAACTACAGTGGGGGAGATGATCAGCTTCAGAAGACGACTGCGTTTTTGGAGAATGTCCTTCAATCTGGATCAGCTATATGCCTCATATGCATTGGTTCGGTGAAAAGAGCCGATGCAGTATGGTCTTGCAAATACTGCTATTGTGTGTTTCACCTAACGTGCGTGAAACGATGGGCGAACGACAGTATAGCATTATTGAAAGACAAAGGAACCGAAGAGCAAGGCTATTACAACAATCTAGGTGAATTCGTGCCGAAGAAGACCCGTGTAGTGAAGTGGTTTTGTCCGCAGTGTCGTCGTGACTACCGACCAGAGGAAAGACCAACACACTACGAGTGCTTTTGCGGCAAGGAAATCAATCCTCCGAATCAAGAGTGGTTGGTACCACATTCGTGTGGTGAAACTTGTGGAAAGCCACTGCAGCCAGAATGTGGACACAAGTGTTTGCTACTTTGCCATCCAGGCCCATGTCCACCATGTGCTCAGAATATAGTGGTAAGCTGTGAATGTGGAAAATCCCCCCCGAAGACGCTGCGATGCTTTCAGAAGAGTTGGACTTGCGAGAGGAGATGTTCGCAACTACTACCATGTGGGAAGCACACATGTGATCAGCTGTGTCATGCGCCAGGACAATGTCCTCCTTGCAGTAAGACAAGTCTACAAAGATGTATTTGCGGCAACGAAGAAGCACCACGAAGCTGTTCACAACACGTGTGGCAATGCAAGAAGATCTGTAACAAGAGGTATAGTTGCGGCATTCATTGCTGTAAAAAAGTATGTCATACTGGACCGTGCGGACCATGCCCTCTTAGTCTTCCAAGATTTTGTCCCTGTGGTAAAACAAAAAAAATCGCACCGTGCAACGAACCAATTGAAACCTGCAACGATACGTGTCAAAAATTGCTGTCATGTGGACAACACTACTGTAATCAGCGATGTCATAAGGGTGAATGTAGCTTGtgtTTGATTGTCACAAAGAAAAGATGCCGCTGTGGTATGCACGAGAAGGAACTGCCTTGCTGGAAGCCGTTTTACTGCGACACTAAATGCAAAAGAATTCGAGACTGTGGCAAACATGCGTGTAACAAAAAGTGTTGTGATGGTCAGTGTCCACCGTGCGATAAGATTTGTGGAAAATTACTGTCGTGTAAGAAGCACAAGTGTAGTTCTGTTTGTCACGATGGTCCGTGCTATCCGTGTAAGTTGAAATCCCAAGTTAAATGTCGTTGTGGAGCAACATGTGTTACAGTGCCTTGCGGTAGAGAGCGCAGAGCGCGGCTCAGTTGCAAGGAACCATGCAGaattcCATCCAAATGTCATCATCCGAACAAGCATAAATGCCACAAGGGTGAGTGTCCGCCGTGTAATCAAATTTGCGGGCTGAAAAACGACACCACAAATTGTGAGCATCCCTGCCAGGCACGTTGCCATGCGGCTGTGCGAATTCCTCTGAAAAATTCCACCGCCACTAGTATATTTGACTACAAACATGAAAATTATGAAGTTAAAACCTTACCGCATCCGAAGTGCGAGAAGAAAGTAATGGTGCGTTGCATAGGAGGTCATGAGGTCGCCGAATGGCCTTGCTGGAATTCTAAGCCTACTTCGTGTCAGCGGTTGTGTGGACGTCAATTAAAATGTGGCAATCATACATGTTCTTTAGTTTGTCACAGTGTACCTAATCTAGACGAAGATAGTGAACAAGAAGGCTGTGCATCATGCGGCGAAAGATGTAAGATCGATCGTCCACCAGGTTGTGTTCATCACTGCAAACGACCTTGTCACCCACCACCATGTGAACCATGTGATGTCACAATAAAATCAATATGTCACTGTGGACTCTCTCAGGTCTATTACAAATGCTCAGAATACTATTCCTTAGAGGGAACCCCAAGCGAGGCTCTAATGCGACAAGAAAGGCTAAAATCTTGTGGAAATCGTTGCATAAAAAATTATCCCTGTGGTCATCGTTGtaccgctatctgccattcgggTCAATGTCCCAATCCTGAGGGCTGTCGTAAAAAGGTGAAAATCTACTGCCCCTGTAGGCGCATCAAAATGGAAGTATCTTGCGAGAAATCTCGCTCCTCGGATACCTTCATAAGATGCGATCGGAATTGCGAAGTGGTAAAAGCAGCTCTGGAAAAAATGAAACAAgtcgaagaagaaaaacaaagactCATCGAAGAGGAACGCAATCGCATTGAAATGGAACAGTACCAGAAAAGATTTGGCAAACGAAAACCTCGCGAACGTCGGGTCGTCCCACAAGAAcccgaaaataaaagaaaccatAAGCAAATAGTAGTCGCTACTCTAATTGTAGTGATTGCAGCTGCTGGCTTAGCATTTTATTGTTATCCGTAA
- Protein Sequence
- MSNKSLKHAQNASQGREKSNKIPQKKSFTAIQAENMEMAKKITENYESSSDEEQIDGGKILETLYKNYSGGDDQLQKTTAFLENVLQSGSAICLICIGSVKRADAVWSCKYCYCVFHLTCVKRWANDSIALLKDKGTEEQGYYNNLGEFVPKKTRVVKWFCPQCRRDYRPEERPTHYECFCGKEINPPNQEWLVPHSCGETCGKPLQPECGHKCLLLCHPGPCPPCAQNIVVSCECGKSPPKTLRCFQKSWTCERRCSQLLPCGKHTCDQLCHAPGQCPPCSKTSLQRCICGNEEAPRSCSQHVWQCKKICNKRYSCGIHCCKKVCHTGPCGPCPLSLPRFCPCGKTKKIAPCNEPIETCNDTCQKLLSCGQHYCNQRCHKGECSLCLIVTKKRCRCGMHEKELPCWKPFYCDTKCKRIRDCGKHACNKKCCDGQCPPCDKICGKLLSCKKHKCSSVCHDGPCYPCKLKSQVKCRCGATCVTVPCGRERRARLSCKEPCRIPSKCHHPNKHKCHKGECPPCNQICGLKNDTTNCEHPCQARCHAAVRIPLKNSTATSIFDYKHENYEVKTLPHPKCEKKVMVRCIGGHEVAEWPCWNSKPTSCQRLCGRQLKCGNHTCSLVCHSVPNLDEDSEQEGCASCGERCKIDRPPGCVHHCKRPCHPPPCEPCDVTIKSICHCGLSQVYYKCSEYYSLEGTPSEALMRQERLKSCGNRCIKNYPCGHRCTAICHSGQCPNPEGCRKKVKIYCPCRRIKMEVSCEKSRSSDTFIRCDRNCEVVKAALEKMKQVEEEKQRLIEEERNRIEMEQYQKRFGKRKPRERRVVPQEPENKRNHKQIVVATLIVVIAAAGLAFYCYP
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01223893;
- 90% Identity
- -
- 80% Identity
- -