Basic Information

Insect
Thereva unica
Gene Symbol
nfxl1
Assembly
GCA_949987705.1
Location
OX465282.1:117317980-117322590[+]

Transcription Factor Domain

TF Family
zf-NF-X1
Domain
zf-NF-X1 domain
PFAM
PF01422
TF Group
Zinc-Coordinating Group
Description
This domain is presumed to be a zinc binding domain. The following pattern describes the zinc finger. C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C Where X can be any amino acid, and numbers in brackets indicate the number of residues. Two position can be either his or cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 21 2 7.6e+04 -7.9 5.6 15 19 174 178 173 178 0.81
2 21 1.5 5.8e+04 -3.7 1.3 4 10 209 215 209 215 0.86
3 21 2e-08 0.00077 21.4 15.6 1 19 223 240 223 240 0.99
4 21 1.1 4.2e+04 -3.3 1.8 6 10 266 270 265 270 0.89
5 21 2.7e-09 0.0001 24.2 12.5 1 19 276 295 276 295 0.96
6 21 0.91 3.4e+04 -3.0 0.8 6 10 320 324 319 324 0.91
7 21 1.5e-07 0.0059 18.6 11.2 1 19 330 348 330 348 0.98
8 21 0.031 1.2e+03 1.7 1.2 5 12 372 379 372 380 0.93
9 21 1.4e-06 0.054 15.5 9.6 3 18 385 400 383 401 0.93
10 21 0.19 7e+03 -0.8 0.5 1 10 410 419 410 419 0.75
11 21 2 7.6e+04 -5.0 2.3 5 10 424 429 424 429 0.91
12 21 9.3e-07 0.035 16.1 16.6 1 18 435 452 435 453 0.98
13 21 1.9e-07 0.0072 18.3 14.2 3 18 464 479 456 480 0.86
14 21 0.52 2e+04 -2.2 1.4 5 10 508 513 508 513 0.91
15 21 1.6 6.1e+04 -3.8 0.5 9 12 518 521 517 523 0.72
16 21 0.12 4.5e+03 -0.2 6.9 10 19 527 536 524 536 0.91
17 21 0.1 4e+03 -0.0 6.7 3 12 549 558 548 558 0.89
18 21 2e-06 0.076 15.1 4.4 1 12 629 640 629 640 0.97
19 21 2 7.6e+04 -5.3 2.9 6 10 656 660 653 660 0.56
20 21 0.00057 21 7.2 15.1 1 19 668 685 668 685 0.94
21 21 8.9e-08 0.0034 19.4 11.4 1 18 736 755 736 756 0.89

Sequence Information

Coding Sequence
ATGGCTTCACAAAATCCATGGCACGTAAAATCACAGAACCGCGATGGCAAGCGGAAGAAGAAACCAAATAAAACTAAGCAAAATCCAGCACCAAAAACTGACGTTACCGGATTGAAAAAATTCGAAGAAGTCCACAATAAAAATGTAGAGAAAGCCAAAGTCTATACAGAACGGTATGAGTCTAGCTCGGAGGAGGATGAACTCGATAGCGGAGAAATAATTAATAAGGTATTTGGAGGTTATGTCGCTGAACGAAATCAACTATCAAAGACCCGGGactttgttgagaattttctcCAGTCCGGTTCGGTTGTGTGCCTTATATGCATTGGGACCATTAAAAGAAGTGATGCGATTTGGTCCTGTAAGAACTGCTACTGCTTCTTTCATCTGCAATGTATAAAAAGGTGGGGAAATGATAGTATTGCACAACAAAAAATCCATTCAGACCAAGAGCAAGGTTACTACAATAATCTTGGGGAATatattcctaaaaaaataaaatcaataaagtgGTGCTGCCCAAAATGTAGGAAGGACTACTTACCGGAGGAAGTGCCCACTAAGTATATGTGCTTTTGTGAGAAGGAAATAAATCCTCCAAATCATCCTTGGTTAATTCCCCATTCCTGCGGTGAGCAGTGCGAAAATTACTTAGTACCAAATTGTGGTCATAGATGCTCTTTACTTTGTCACCCTGGAGCGTGTCCACCATGTCCACAGATTATTATTACATCATGCAAATGTGGCAAATCTGATCCTAAGTCTATAAGATGCTTCCAAAAGTCATGGAGTTGTGAGCAGAAGTGCGGCAAGTTACTTTCATGCGGCATTCATAAATGCGAAGCAATATGCCATGAACAAGGCGATTGTCCGCCTTGTACGAAGAAGAGTAAACAAGCTTGTGTTTGCGGAAATGAAACAGCAGAGCGCAATTGCTCTCAACTGAAATGGCATTGTAAAAagGTGTGCAAGAAACCGTTGACTTGTGGCTTGCATAAATGTAAAAGAGTCTGCCACTCGGACGACTGTGGAGATTGTCCGCTTGGTCTTCCGCGATCCTGTCCTTGCGGGAAAACGcaAACAGTTGCACCATGCAGTGAAGTAGTAGATACTTGCGGGGATACCTGTCATAAAGAACTGCCTTGCAAGCTACATTACTGTGCAGAGCGTTGTCACAAAGGAGAATGCAGTCCGtGTATAATAAATATTGAGAAGAAATGTCGCTGCGGGCTGCATACAAAAGAGTTACCATGTTCTAAATCATTCACATGTGAAACAAAATGCAAGcaaatgcgggaatgcggaaaGCATATTTGTAACAAGAAATGCTGTGATGGACAGTGCCCGCCATGTGATAAAGTATGTGGTAAAACGCTATCATGTAAGAAGCATAAATGCAAGTCTATTTGTCATGATGGGCCTTGCTACCCATGCGGCCTTAAATCACAAGTAAAGTGCCGCTGTGGATATACAGCTATTACAGTTCCATGCGGTAGAGAAAAGAAAACTAGACCACCAAAGTGTTCACAACCTTGTAGGATTCCATCTAAATGCCACCATCAAAATCCCCATAATTGCCACATGAATGAATGTCCACCATGCACACAGAAATGTAATTTAAGGAATGATGTTACGAATTGCGAGCACCCTTGCGAAGCTAGATGTCATTCGGCAGTCAAAGTTCCTATTAGCGATAAAGACCAAAAGGATGCTAATATTTGGGAATTCAATTACGACGGTTTCGAAATAAAGACGTTACCCCATCCTGAATGTAACGTAAAGGTGAAAGTTACTTGCATTGGCGGGCATGAAACTGCCTTATGGCCATGCTGGAACTCAAAGCCGACATCTTGTCAGCGGGAATGTGGTCGGGTGTTAAAATGCGGTAATCATACTTGTAATTTTATATGCCATTCAGTGCCTGATATAAACGATAGCAACGAACAAGAAGGATGTGATTCATGTCAAGAACATTGTAAGAGACTTAGACCAGAAGGGTGTAGTCATCCTTGCAAGAAACCATGTCATGTTTCACCTTGTGCTCCTTGTTCagctattattaaaataaactgCCATTGCGGGCTGTCGCAAGTATTCTATAAATGTGTGGACTTCAATACAACAGATTTAAGCAAGGAGGATCTTGAAGAGAAAAGGGAGAAATTAAAAAGTTGCGGAAATagatgcattaaaaatTATCCATGTGGACATAGATGTGCTGCTATATGCCATTCTGGGCCATGTCCGAATCCCGATAGCTGCCGAAAGAAAGTTAAAATCTATTGCGAATGTAAAAATCTTAAAACGGAAATATCCTGTGAGAAATATCGTAGTGGAACTACATCGATCCCGTGTGATAATGTTTGTTTAGCCAAAAAGGAGCAAGCTGAACGGATCAAGAAGGAACAAGAAGAGAAACAACGCAAAATTGAggaagaaaaaaatcgaatagaaTTAGAGCAGTTTGAAAAGAAATTTGGTAGGAAGAAGTATCGTGAGAGGAAAGTTTACGAAGAAAAACCAAAGCAAGATAATCGGAAAATGATATGGATTGCCTGCGGCTGCTCGGTTGCAGTGATTTCTGCTGTTTTAGCTTTTAATTACTTGAagtaa
Protein Sequence
MASQNPWHVKSQNRDGKRKKKPNKTKQNPAPKTDVTGLKKFEEVHNKNVEKAKVYTERYESSSEEDELDSGEIINKVFGGYVAERNQLSKTRDFVENFLQSGSVVCLICIGTIKRSDAIWSCKNCYCFFHLQCIKRWGNDSIAQQKIHSDQEQGYYNNLGEYIPKKIKSIKWCCPKCRKDYLPEEVPTKYMCFCEKEINPPNHPWLIPHSCGEQCENYLVPNCGHRCSLLCHPGACPPCPQIIITSCKCGKSDPKSIRCFQKSWSCEQKCGKLLSCGIHKCEAICHEQGDCPPCTKKSKQACVCGNETAERNCSQLKWHCKKVCKKPLTCGLHKCKRVCHSDDCGDCPLGLPRSCPCGKTQTVAPCSEVVDTCGDTCHKELPCKLHYCAERCHKGECSPCIINIEKKCRCGLHTKELPCSKSFTCETKCKQMRECGKHICNKKCCDGQCPPCDKVCGKTLSCKKHKCKSICHDGPCYPCGLKSQVKCRCGYTAITVPCGREKKTRPPKCSQPCRIPSKCHHQNPHNCHMNECPPCTQKCNLRNDVTNCEHPCEARCHSAVKVPISDKDQKDANIWEFNYDGFEIKTLPHPECNVKVKVTCIGGHETALWPCWNSKPTSCQRECGRVLKCGNHTCNFICHSVPDINDSNEQEGCDSCQEHCKRLRPEGCSHPCKKPCHVSPCAPCSAIIKINCHCGLSQVFYKCVDFNTTDLSKEDLEEKREKLKSCGNRCIKNYPCGHRCAAICHSGPCPNPDSCRKKVKIYCECKNLKTEISCEKYRSGTTSIPCDNVCLAKKEQAERIKKEQEEKQRKIEEEKNRIELEQFEKKFGRKKYRERKVYEEKPKQDNRKMIWIACGCSVAVISAVLAFNYLK

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01431531;
90% Identity
iTF_01431531;
80% Identity
-