Ocom029134.1
Basic Information
- Insect
- Ophraella communa
- Gene Symbol
- nfxl1
- Assembly
- GCA_035357415.1
- Location
- CM068982.1:9756241-9764317[+]
Transcription Factor Domain
- TF Family
- zf-NF-X1
- Domain
- zf-NF-X1 domain
- PFAM
- PF01422
- TF Group
- Zinc-Coordinating Group
- Description
- This domain is presumed to be a zinc binding domain. The following pattern describes the zinc finger. C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C Where X can be any amino acid, and numbers in brackets indicate the number of residues. Two position can be either his or cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 19 3 1e+05 -7.8 5.6 15 19 161 165 160 165 0.81 2 19 0.18 6.1e+03 -0.2 0.4 4 10 196 202 195 202 0.95 3 19 6.1e-08 0.002 20.5 15.1 1 19 210 227 210 227 0.99 4 19 1.1e-06 0.035 16.5 17.2 1 19 263 281 253 281 0.92 5 19 0.85 2.8e+04 -2.4 0.9 5 10 305 310 305 310 0.94 6 19 2.1e-09 6.9e-05 25.2 10.8 1 19 316 334 316 334 0.99 7 19 6.7e-05 2.2 10.8 12.2 1 18 369 386 369 387 0.97 8 19 0.15 4.9e+03 0.1 0.9 1 5 396 400 396 405 0.90 9 19 5.7e-05 1.9 11.0 15.6 3 18 423 438 421 439 0.91 10 19 7.4e-10 2.5e-05 26.6 13.0 1 18 448 465 448 466 0.97 11 19 0.00027 9.1 8.8 6.5 9 19 514 524 512 524 0.87 12 19 0.074 2.5e+03 1.0 5.9 1 12 534 544 534 544 0.89 13 19 3 1e+05 -6.9 4.5 14 18 572 576 572 576 0.86 14 19 3 1e+05 -4.4 1.4 6 10 603 607 602 607 0.58 15 19 0.002 68 6.0 3.9 1 11 613 623 613 627 0.94 16 19 3 1e+05 -7.0 3.5 6 10 641 645 638 645 0.63 17 19 5.5e-06 0.18 14.2 17.6 3 19 654 670 653 670 0.92 18 19 9.4e-07 0.032 16.7 13.0 1 16 713 727 713 733 0.89 19 19 1.7 5.6e+04 -3.3 0.8 6 10 763 767 762 767 0.87
Sequence Information
- Coding Sequence
- ATGCACAGCCAACCAAAGCAAAAAAATCCGTGgaataaaaatgttcaaaatccgacgaaaaaaaatcaaaataaaaatttaccaaCACCTGGAGAGGTTAAATTTAAAGAAGCCCATGCTAAATTACAAGCCGCCGTAAAAAAACACATTGCAGATTATGAATCTTCATCGGACGAAGAAGAATTGGAATCGGCAACTTTAATAGattCTATATTGAAAAACTACAGAGTAAATAATGGAGAGAATGATAGCATAGATAAAACACAAACTTTTTTACAAGAAACATTTTTATCGGGAGCATCAACTTGTCTAATTTGCATTTCAAGAGTTAGAAAAGAAGATCAGATATGGAACTGTGTAAACTGTTATGGATTTTTTCACTTAATGTGTGTTCAGAGGTGGTCAAAAGACACAATTGCACAATTAAAAAATGCCAATCAAGAACAAAGTGTCTTCAagcaatataatatatattggTGTTGCCCAAAATGCCGCCATGAATATGAAGGTATGGACATTCCCACAAAGTATACATGTTTTTGTAACAAAACTGAAAATCCAAAATATGATCCTTACATTGCTCCCCATTCTTGTGGTCAGATTTGCAAAAAACCTCTTAAACCAGAATGTGGGCATTACTGTTTACTACTTTGTCACCCAGGTCCCTGCCCTCCTTGTCCTGTAACAGTGAATGTATCATGTTACTGTGGGAGTCAGTTACCAAGAAGTCAGAGATGCAGCAAAAAAGAATGGTCTTGTAACAACATTTGTGGAAAACAATTATCCTGCAACAAACATACTTGTCCAAATGTCTGCCATCCTGGTGAATGTTTACCATGTCCAAAGAAAAGTTTGCAAAAGTGTATTTGCAAAGCACAAATGAAACTTTGCGAATGCTCCAGTCCAGTATGGAAGTGTGATAAGgTTTGTAATAAACCTTTAGAATGTGGAAACCATAACTGCCAAGAAGTATGTCATGCAGGAGTTTGTGATATGTGCCCACTAACAAAAGCTAGACATTGTCCTTGTGGGAAAACATCTTACCAGCTTCCTTGCACACAAGAGACTCCCACATGCTTAGACACTTGTGAAAAACTTTTAGGGTGTGGTATACACCACTGTAATTTAAAATGTCACAGGGAAGAATGTGGAGTGTGTCTTGAAACTGTGGAAAAATACTGTAGATGTGGACAGCACACCAAAGAAGTTCAATGCTACAAACCATACTTGTGTGACCTTAAGTGTAAACAATTAAAGGACTGTTATAAACATCCCTGCAATAGAAAAtgttgcGATGGAAACTGTCCGCCATGCGAAAAATCGTGCGGGAAAACTTTAAACTGCGGTAAACACAAATGCAACTCGATATGCCACAGAGGTCCATGTTATCCTTGCAATCGAACCGACGAACTATCTTGCAGATGTGGAAACACAAAAATCGCCGTACCGTGTGGTCGGAAAAGTAAAACCAAACCTCCGAAATGTCTTAAAATGTGTTTGATCCCTCCAGACTGTCATCACGAAAAACGTGATCAACATCGATGCCATTTCGGCGATTGCCCTCCATGCAAGCAAATTTGTAACAAGATCCACGATAATTGTTCTCATCCGTGTCCTGTCACTTGTCATTCGGCTGTGCTTGTCAAAATTGAAGGACAGAAGGCTTCGATGCCTTGGGAGCAAGTAAAACCACAAGTGGTAAAAAAAGAGCTACCATGTCCTGACTGTATCGTCAAAGTTCCTGTTACTTGTTTAgGTGAACATGAAACTGCTGATTGGGCTTGTTATTTAGCAAAACCTTCCAGTTGTCATCGCCCTTGTGGTAGAATGCTGAGCTGCAGCAACCACAGTTGCAGCTTACAATGTCATGTGGTCGAAGGGGCTCCCGATAAACTTCAGgCCGGAAGTAACTGCGAGCAATGTGAAGACAGTTGCAGTAAAGATCGCCCCGACGGTTGCACCCACAAATGTCCCAAACCTTGTCATCCTGGCGATTGTTCTCCGTGCAAAGTAATGATTAGAATCAAATGTTATTGCGGTCTAAACCAACCGTACGTGGCTTGCTCCGATTGGTTAGATGAAGATAAGAAAATAGAGCTGCAAACATGTGGTAACCAATGCCCCAAAAACTACGAATGTGGGCACAGATGTAAGGAGAACTGCCACCCGGGGCCTTGTCCTAACGCGGATCGGTGCAAAAAGAAAGTGAAGGTATCCTGCAAATGCAAAAGGATCAAGAAGGAGTTCTCCTGCGAAACGGTTCGGAAGAACGAAGCCGTCGTTCCTTGCGACGAGGTTTGCGCCATTAAAAAAGAGGAGAGTAAAAAATTGAGGGAAGCAGCCGAAGAACAAAGAAGATTGGAAGAAGAAGCGAGAAATAAGAAAGAGTTGGAGAAGTACCAAAAGATGTTCGAGGGTAAAAAGAAGAATAGGGAAAGGAGGCGATACGAAGATGAAGTGGAGGaaggttttttgaaaaaatatcgaTACATTCTTATagctgttttgtttatttttatttctgtgttggtatattttgtgtttatgtga
- Protein Sequence
- MHSQPKQKNPWNKNVQNPTKKNQNKNLPTPGEVKFKEAHAKLQAAVKKHIADYESSSDEEELESATLIDSILKNYRVNNGENDSIDKTQTFLQETFLSGASTCLICISRVRKEDQIWNCVNCYGFFHLMCVQRWSKDTIAQLKNANQEQSVFKQYNIYWCCPKCRHEYEGMDIPTKYTCFCNKTENPKYDPYIAPHSCGQICKKPLKPECGHYCLLLCHPGPCPPCPVTVNVSCYCGSQLPRSQRCSKKEWSCNNICGKQLSCNKHTCPNVCHPGECLPCPKKSLQKCICKAQMKLCECSSPVWKCDKVCNKPLECGNHNCQEVCHAGVCDMCPLTKARHCPCGKTSYQLPCTQETPTCLDTCEKLLGCGIHHCNLKCHREECGVCLETVEKYCRCGQHTKEVQCYKPYLCDLKCKQLKDCYKHPCNRKCCDGNCPPCEKSCGKTLNCGKHKCNSICHRGPCYPCNRTDELSCRCGNTKIAVPCGRKSKTKPPKCLKMCLIPPDCHHEKRDQHRCHFGDCPPCKQICNKIHDNCSHPCPVTCHSAVLVKIEGQKASMPWEQVKPQVVKKELPCPDCIVKVPVTCLGEHETADWACYLAKPSSCHRPCGRMLSCSNHSCSLQCHVVEGAPDKLQAGSNCEQCEDSCSKDRPDGCTHKCPKPCHPGDCSPCKVMIRIKCYCGLNQPYVACSDWLDEDKKIELQTCGNQCPKNYECGHRCKENCHPGPCPNADRCKKKVKVSCKCKRIKKEFSCETVRKNEAVVPCDEVCAIKKEESKKLREAAEEQRRLEEEARNKKELEKYQKMFEGKKKNRERRRYEDEVEEGFLKKYRYILIAVLFIFISVLVYFVFM
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00912653; iTF_00736295; iTF_00911573; iTF_00026518;
- 90% Identity
- -
- 80% Identity
- -