Ehay003417.1
Basic Information
- Insect
- Eretmocerus hayati
- Gene Symbol
- NFXL1
- Assembly
- GCA_029851415.1
- Location
- CM056741.1:51650432-51653029[+]
Transcription Factor Domain
- TF Family
- zf-NF-X1
- Domain
- zf-NF-X1 domain
- PFAM
- PF01422
- TF Group
- Zinc-Coordinating Group
- Description
- This domain is presumed to be a zinc binding domain. The following pattern describes the zinc finger. C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C Where X can be any amino acid, and numbers in brackets indicate the number of residues. Two position can be either his or cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 22 4 6.1e+04 -4.9 1.9 15 19 123 127 123 127 0.85 2 22 0.84 1.3e+04 -1.9 0.9 4 10 158 164 158 164 0.91 3 22 1.7e-07 0.0027 19.4 15.4 1 19 172 189 172 189 0.99 4 22 3.5 5.4e+04 -3.9 1.2 14 16 203 205 201 208 0.54 5 22 4.2e-06 0.064 15.0 15.8 3 19 227 243 226 243 0.93 6 22 1.5 2.3e+04 -2.7 0.9 5 10 267 272 267 272 0.93 7 22 0.0014 22 6.9 7.7 1 12 278 289 278 296 0.90 8 22 1.2 1.8e+04 -2.4 0.4 5 10 322 327 322 329 0.88 9 22 0.56 8.6e+03 -1.4 11.2 3 19 335 351 327 351 0.86 10 22 2.2 3.4e+04 -3.3 0.3 9 12 368 371 367 371 0.81 11 22 0.0048 73 5.2 9.0 1 12 385 396 385 396 0.94 12 22 3.5 5.4e+04 -3.9 1.3 6 10 406 410 406 410 0.96 13 22 2.4e-09 3.6e-05 25.4 14.7 1 19 416 434 416 434 0.99 14 22 1.7 2.7e+04 -2.9 1.2 7 11 470 474 470 475 0.85 15 22 0.02 3.1e+02 3.2 7.0 9 19 482 492 480 492 0.91 16 22 0.0012 18 7.2 5.9 1 12 502 512 502 512 0.95 17 22 1.2 1.8e+04 -2.4 5.8 14 19 543 548 543 548 0.90 18 22 0.0001 1.6 10.6 7.1 1 12 584 595 578 595 0.90 19 22 4 6.1e+04 -4.6 3.1 5 10 617 622 615 622 0.85 20 22 0.00042 6.4 8.6 15.8 3 19 631 647 630 647 0.92 21 22 3e-07 0.0046 18.6 8.1 1 16 692 706 692 712 0.92 22 22 1.8 2.7e+04 -3.0 0.9 6 10 742 746 741 746 0.89
Sequence Information
- Coding Sequence
- ATGAGGAAATTCAAGCGAGCTCAAGCAGAAAATCAAGCTACTATTCAAAAGTATCTGCAGGACAATAATGGATCATCTAGTAGTGACGGAGACAATATTGAAGATGCCTTATTATCTGCTGTGAAGAATGTTTTATCGAATTACCAATGTGCAGGTGGTGATGTTGAAAAAACGCTATCTTATTTAATAGATACTTTCCAACCTGGTGCGTCAGTGTGTCTTATATGCATTTCTTCAGTGAAGAAAACTGATGAGATTTGGAGTTGTCTGAAATGTTACACACTCCTTCATTTACACTGTATACAACTTTGGGCGAGAGACAGTTTGAGTCACAAAACCGAAAAAGAAATCTTACCAACATGGGGATGTCCCAAATGTCGGTCAGAATATGGAGAAGATCAAGCTCCTATAAAGTACTCATGtttttgtagaaaaattgaGGATCCAACATATCAGTCTTTGGCTATTCCTCATTCATGTGGAAACACATGTGAAAAGTTTCTGCAACcagaatgtggccacttttgcACTTTATTGTGCCATCCAGGTCCATGTCCGCCTTGTCCCAAAATGGTATTGGTCACTTGCTATTGTGGCAAAGAACAACCTTGTCCTCGCAGATGTAATTCTAAAGAGTGGAGCTGCATGAACCCTTGTAATAAAAAGTACCAGACTTGCGAACATATCTGTTCAGAGCCCTGTCATCCAGGGAATTGCCCTCCCTGCTCCAAAGAAGTAGTCACGCCTTGCAATTGTAAAAGTCAATCAAAACTGAGAAAATGTAATGAAGCTGTTTGGAAATGCAATAAGATATGTGGCCAAGCTTTGTCATGCAGCATCCATGTTTGTGAGGAAATTTGTCACAAACCAGGTGATAGTCACATTTGTTCTCTAGAAAAAAGCAGAACTTGCCCATgtggtaaaaaaaagtatcCCATATCCTGCAAAGAGTTACAGGCTCCAACTTGTAGAGAAACATGTGGAAAGTTGTTTGATTGTAAAGCTCATTACTGCAGTATGAGATGCCACACTGAGGGATGTGGTCAATGCCGGGAAGTTGTTACCAAAAGCTGTCGCTGTGGGAGTTACAGTAAGGAGATTCCGTGTCATAAGGAATTCCATTGCTCCAAAAAATGCACTCAAATGCGCTTGTGTGGGAGACACCAATGCAATAAGAAATGCTGTGATTGCAGACTCACAAATAATTTTTATAtgtgtgaaaaaatttgtgacAATATGCTGAAGTGTGGAAAACATAAGTGCTTAGCCCCTTGTCACAGTGGGCCATGTTATCCTTGTCCTCGCACAATCTCCATACGTTGTCGTTGTGGTCATAGTAGAGCTATTATTCCCTGTGGTTTTACTAGAAAAATCAAACCTCCTCACTGCAACAGACAGTGTGAAATCCAATCACTGTGTCACCACCCTGAAAGGGATGCTCACACATGTCACCAAGATCAATGCCCACCATGTAAAAAGGTTTGTGGATTAGTTTACAAGAAATGTGGCCACAGTTGCTCAGCAATATGCCATACAAATGTGTGGGCAAAAGTCTCTGCAAATGGTTTTCCAAAGGCCACAGGACCTTGGGAAATTCCAAAGGATAGGAGTGATTTCAAATCTTTACCTTGCCCTCCATGTAAATTTCCTGTCACAGTGACTTGTCTAGGAGGGCATGAAACACAGACAAAGCCTTGCTATAAAAGTGCACCAACATTTTGCTTTGGATTATGTGGTCAATTGTTACCATGTACAAATCATACTTGTGAAAAGCTCTGCCACACACTTGCGAAaccaaaaaattgtgaaaaagcAAACACAAAAGGTAATAGAGAAAATGAGTGCATGACATGTGACAAAACCTGTACGATTCCGCGTCCCGAAGGATGTACTCATAGTTGTGACAAACCGTGTCATCCAGCACCATGTGATCCATGCCAACATCTTATAAAAATTCCATGTCATTGTACCATCACTACCCTATTCATACAGTGTGCCGAGCTCACTTCTGCTGATAGCGAGAAACGTGATTATCTTCTCAAATGTAAAAATCAGTGCACAAAAATTTTTGCGTGCGGTCATCGATGTGATGATATTTGCCATGCCGGACCATGCAAGAAATTGATGATATGCGATAAGAAAGTCAAGATAACATGTAAGTGCAAACGAATTGAAAAGGATACTACTTGCTTATCAGAGAGAAATGGGGAAGGAAGTGTTGAGTGCGATGAAGTCTGTcagaaaaagaaagaagattTGGATCGTCTCAAGCGAATTCAATTGGAAGAGAAACGCAAGGAGGatgatatgaaaaatccaaTAGAAGTTGAGATGTTTGAAAAGAAATTCAAACCCAGGCGAAGAGGGTGA
- Protein Sequence
- MRKFKRAQAENQATIQKYLQDNNGSSSSDGDNIEDALLSAVKNVLSNYQCAGGDVEKTLSYLIDTFQPGASVCLICISSVKKTDEIWSCLKCYTLLHLHCIQLWARDSLSHKTEKEILPTWGCPKCRSEYGEDQAPIKYSCFCRKIEDPTYQSLAIPHSCGNTCEKFLQPECGHFCTLLCHPGPCPPCPKMVLVTCYCGKEQPCPRRCNSKEWSCMNPCNKKYQTCEHICSEPCHPGNCPPCSKEVVTPCNCKSQSKLRKCNEAVWKCNKICGQALSCSIHVCEEICHKPGDSHICSLEKSRTCPCGKKKYPISCKELQAPTCRETCGKLFDCKAHYCSMRCHTEGCGQCREVVTKSCRCGSYSKEIPCHKEFHCSKKCTQMRLCGRHQCNKKCCDCRLTNNFYMCEKICDNMLKCGKHKCLAPCHSGPCYPCPRTISIRCRCGHSRAIIPCGFTRKIKPPHCNRQCEIQSLCHHPERDAHTCHQDQCPPCKKVCGLVYKKCGHSCSAICHTNVWAKVSANGFPKATGPWEIPKDRSDFKSLPCPPCKFPVTVTCLGGHETQTKPCYKSAPTFCFGLCGQLLPCTNHTCEKLCHTLAKPKNCEKANTKGNRENECMTCDKTCTIPRPEGCTHSCDKPCHPAPCDPCQHLIKIPCHCTITTLFIQCAELTSADSEKRDYLLKCKNQCTKIFACGHRCDDICHAGPCKKLMICDKKVKITCKCKRIEKDTTCLSERNGEGSVECDEVCQKKKEDLDRLKRIQLEEKRKEDDMKNPIEVEMFEKKFKPRRRG
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -