Tuni040115.1
Basic Information
- Insect
- Thereva unica
- Gene Symbol
- nfxl1
- Assembly
- GCA_949987705.1
- Location
- OX465282.1:117317980-117322590[+]
Transcription Factor Domain
- TF Family
- zf-NF-X1
- Domain
- zf-NF-X1 domain
- PFAM
- PF01422
- TF Group
- Zinc-Coordinating Group
- Description
- This domain is presumed to be a zinc binding domain. The following pattern describes the zinc finger. C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C Where X can be any amino acid, and numbers in brackets indicate the number of residues. Two position can be either his or cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 21 2 7.6e+04 -7.9 5.6 15 19 174 178 173 178 0.81 2 21 1.5 5.8e+04 -3.7 1.3 4 10 209 215 209 215 0.86 3 21 2e-08 0.00077 21.4 15.6 1 19 223 240 223 240 0.99 4 21 1.1 4.2e+04 -3.3 1.8 6 10 266 270 265 270 0.89 5 21 2.7e-09 0.0001 24.2 12.5 1 19 276 295 276 295 0.96 6 21 0.91 3.4e+04 -3.0 0.8 6 10 320 324 319 324 0.91 7 21 1.5e-07 0.0059 18.6 11.2 1 19 330 348 330 348 0.98 8 21 0.031 1.2e+03 1.7 1.2 5 12 372 379 372 380 0.93 9 21 1.4e-06 0.054 15.5 9.6 3 18 385 400 383 401 0.93 10 21 0.19 7e+03 -0.8 0.5 1 10 410 419 410 419 0.75 11 21 2 7.6e+04 -5.0 2.3 5 10 424 429 424 429 0.91 12 21 9.3e-07 0.035 16.1 16.6 1 18 435 452 435 453 0.98 13 21 1.9e-07 0.0072 18.3 14.2 3 18 464 479 456 480 0.86 14 21 0.52 2e+04 -2.2 1.4 5 10 508 513 508 513 0.91 15 21 1.6 6.1e+04 -3.8 0.5 9 12 518 521 517 523 0.72 16 21 0.12 4.5e+03 -0.2 6.9 10 19 527 536 524 536 0.91 17 21 0.1 4e+03 -0.0 6.7 3 12 549 558 548 558 0.89 18 21 2e-06 0.076 15.1 4.4 1 12 629 640 629 640 0.97 19 21 2 7.6e+04 -5.3 2.9 6 10 656 660 653 660 0.56 20 21 0.00057 21 7.2 15.1 1 19 668 685 668 685 0.94 21 21 8.9e-08 0.0034 19.4 11.4 1 18 736 755 736 756 0.89
Sequence Information
- Coding Sequence
- ATGGCTTCACAAAATCCATGGCACGTAAAATCACAGAACCGCGATGGCAAGCGGAAGAAGAAACCAAATAAAACTAAGCAAAATCCAGCACCAAAAACTGACGTTACCGGATTGAAAAAATTCGAAGAAGTCCACAATAAAAATGTAGAGAAAGCCAAAGTCTATACAGAACGGTATGAGTCTAGCTCGGAGGAGGATGAACTCGATAGCGGAGAAATAATTAATAAGGTATTTGGAGGTTATGTCGCTGAACGAAATCAACTATCAAAGACCCGGGactttgttgagaattttctcCAGTCCGGTTCGGTTGTGTGCCTTATATGCATTGGGACCATTAAAAGAAGTGATGCGATTTGGTCCTGTAAGAACTGCTACTGCTTCTTTCATCTGCAATGTATAAAAAGGTGGGGAAATGATAGTATTGCACAACAAAAAATCCATTCAGACCAAGAGCAAGGTTACTACAATAATCTTGGGGAATatattcctaaaaaaataaaatcaataaagtgGTGCTGCCCAAAATGTAGGAAGGACTACTTACCGGAGGAAGTGCCCACTAAGTATATGTGCTTTTGTGAGAAGGAAATAAATCCTCCAAATCATCCTTGGTTAATTCCCCATTCCTGCGGTGAGCAGTGCGAAAATTACTTAGTACCAAATTGTGGTCATAGATGCTCTTTACTTTGTCACCCTGGAGCGTGTCCACCATGTCCACAGATTATTATTACATCATGCAAATGTGGCAAATCTGATCCTAAGTCTATAAGATGCTTCCAAAAGTCATGGAGTTGTGAGCAGAAGTGCGGCAAGTTACTTTCATGCGGCATTCATAAATGCGAAGCAATATGCCATGAACAAGGCGATTGTCCGCCTTGTACGAAGAAGAGTAAACAAGCTTGTGTTTGCGGAAATGAAACAGCAGAGCGCAATTGCTCTCAACTGAAATGGCATTGTAAAAagGTGTGCAAGAAACCGTTGACTTGTGGCTTGCATAAATGTAAAAGAGTCTGCCACTCGGACGACTGTGGAGATTGTCCGCTTGGTCTTCCGCGATCCTGTCCTTGCGGGAAAACGcaAACAGTTGCACCATGCAGTGAAGTAGTAGATACTTGCGGGGATACCTGTCATAAAGAACTGCCTTGCAAGCTACATTACTGTGCAGAGCGTTGTCACAAAGGAGAATGCAGTCCGtGTATAATAAATATTGAGAAGAAATGTCGCTGCGGGCTGCATACAAAAGAGTTACCATGTTCTAAATCATTCACATGTGAAACAAAATGCAAGcaaatgcgggaatgcggaaaGCATATTTGTAACAAGAAATGCTGTGATGGACAGTGCCCGCCATGTGATAAAGTATGTGGTAAAACGCTATCATGTAAGAAGCATAAATGCAAGTCTATTTGTCATGATGGGCCTTGCTACCCATGCGGCCTTAAATCACAAGTAAAGTGCCGCTGTGGATATACAGCTATTACAGTTCCATGCGGTAGAGAAAAGAAAACTAGACCACCAAAGTGTTCACAACCTTGTAGGATTCCATCTAAATGCCACCATCAAAATCCCCATAATTGCCACATGAATGAATGTCCACCATGCACACAGAAATGTAATTTAAGGAATGATGTTACGAATTGCGAGCACCCTTGCGAAGCTAGATGTCATTCGGCAGTCAAAGTTCCTATTAGCGATAAAGACCAAAAGGATGCTAATATTTGGGAATTCAATTACGACGGTTTCGAAATAAAGACGTTACCCCATCCTGAATGTAACGTAAAGGTGAAAGTTACTTGCATTGGCGGGCATGAAACTGCCTTATGGCCATGCTGGAACTCAAAGCCGACATCTTGTCAGCGGGAATGTGGTCGGGTGTTAAAATGCGGTAATCATACTTGTAATTTTATATGCCATTCAGTGCCTGATATAAACGATAGCAACGAACAAGAAGGATGTGATTCATGTCAAGAACATTGTAAGAGACTTAGACCAGAAGGGTGTAGTCATCCTTGCAAGAAACCATGTCATGTTTCACCTTGTGCTCCTTGTTCagctattattaaaataaactgCCATTGCGGGCTGTCGCAAGTATTCTATAAATGTGTGGACTTCAATACAACAGATTTAAGCAAGGAGGATCTTGAAGAGAAAAGGGAGAAATTAAAAAGTTGCGGAAATagatgcattaaaaatTATCCATGTGGACATAGATGTGCTGCTATATGCCATTCTGGGCCATGTCCGAATCCCGATAGCTGCCGAAAGAAAGTTAAAATCTATTGCGAATGTAAAAATCTTAAAACGGAAATATCCTGTGAGAAATATCGTAGTGGAACTACATCGATCCCGTGTGATAATGTTTGTTTAGCCAAAAAGGAGCAAGCTGAACGGATCAAGAAGGAACAAGAAGAGAAACAACGCAAAATTGAggaagaaaaaaatcgaatagaaTTAGAGCAGTTTGAAAAGAAATTTGGTAGGAAGAAGTATCGTGAGAGGAAAGTTTACGAAGAAAAACCAAAGCAAGATAATCGGAAAATGATATGGATTGCCTGCGGCTGCTCGGTTGCAGTGATTTCTGCTGTTTTAGCTTTTAATTACTTGAagtaa
- Protein Sequence
- MASQNPWHVKSQNRDGKRKKKPNKTKQNPAPKTDVTGLKKFEEVHNKNVEKAKVYTERYESSSEEDELDSGEIINKVFGGYVAERNQLSKTRDFVENFLQSGSVVCLICIGTIKRSDAIWSCKNCYCFFHLQCIKRWGNDSIAQQKIHSDQEQGYYNNLGEYIPKKIKSIKWCCPKCRKDYLPEEVPTKYMCFCEKEINPPNHPWLIPHSCGEQCENYLVPNCGHRCSLLCHPGACPPCPQIIITSCKCGKSDPKSIRCFQKSWSCEQKCGKLLSCGIHKCEAICHEQGDCPPCTKKSKQACVCGNETAERNCSQLKWHCKKVCKKPLTCGLHKCKRVCHSDDCGDCPLGLPRSCPCGKTQTVAPCSEVVDTCGDTCHKELPCKLHYCAERCHKGECSPCIINIEKKCRCGLHTKELPCSKSFTCETKCKQMRECGKHICNKKCCDGQCPPCDKVCGKTLSCKKHKCKSICHDGPCYPCGLKSQVKCRCGYTAITVPCGREKKTRPPKCSQPCRIPSKCHHQNPHNCHMNECPPCTQKCNLRNDVTNCEHPCEARCHSAVKVPISDKDQKDANIWEFNYDGFEIKTLPHPECNVKVKVTCIGGHETALWPCWNSKPTSCQRECGRVLKCGNHTCNFICHSVPDINDSNEQEGCDSCQEHCKRLRPEGCSHPCKKPCHVSPCAPCSAIIKINCHCGLSQVFYKCVDFNTTDLSKEDLEEKREKLKSCGNRCIKNYPCGHRCAAICHSGPCPNPDSCRKKVKIYCECKNLKTEISCEKYRSGTTSIPCDNVCLAKKEQAERIKKEQEEKQRKIEEEKNRIELEQFEKKFGRKKYRERKVYEEKPKQDNRKMIWIACGCSVAVISAVLAFNYLK
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01431531;
- 90% Identity
- iTF_01431531;
- 80% Identity
- -