Csem006757.1
Basic Information
- Insect
- Cecidostiba semifascia
- Gene Symbol
- nfxl1
- Assembly
- GCA_900474235.1
- Location
- UCQR01053415.1:40-2617[+]
Transcription Factor Domain
- TF Family
- zf-NF-X1
- Domain
- zf-NF-X1 domain
- PFAM
- PF01422
- TF Group
- Zinc-Coordinating Group
- Description
- This domain is presumed to be a zinc binding domain. The following pattern describes the zinc finger. C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C Where X can be any amino acid, and numbers in brackets indicate the number of residues. Two position can be either his or cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 19 4 2.6e+04 -4.8 1.9 15 19 28 32 28 32 0.85 2 19 0.21 1.4e+03 -0.0 0.3 4 10 63 69 62 69 0.96 3 19 2e-08 0.00013 22.4 16.7 1 19 77 94 77 94 0.99 4 19 2.5e-06 0.016 15.7 14.5 3 19 132 148 129 148 0.93 5 19 0.84 5.5e+03 -1.9 0.3 6 10 173 177 172 179 0.88 6 19 0.00021 1.3 9.6 8.5 1 19 183 202 177 202 0.89 7 19 0.00023 1.5 9.4 9.1 1 18 238 255 238 256 0.97 8 19 0.012 78 4.0 7.6 1 12 290 301 290 301 0.94 9 19 0.81 5.3e+03 -1.9 1.0 6 10 311 315 310 315 0.92 10 19 1.5e-07 0.00099 19.6 14.6 1 19 321 339 321 339 0.93 11 19 0.00019 1.2 9.7 7.6 9 19 387 397 385 397 0.90 12 19 8.8e-05 0.58 10.8 7.7 1 12 407 417 407 417 0.95 13 19 1.5 9.7e+03 -2.7 5.8 14 18 448 452 448 453 0.89 14 19 2.8 1.8e+04 -3.6 0.6 9 12 470 473 470 473 0.77 15 19 0.00026 1.7 9.2 6.1 1 12 489 500 483 500 0.91 16 19 1.4 8.9e+03 -2.6 1.7 6 10 517 521 516 521 0.88 17 19 0.002 13 6.5 14.3 3 19 530 546 529 546 0.92 18 19 2.3e-06 0.015 15.8 10.9 1 16 591 605 591 611 0.89 19 19 2.7 1.8e+04 -3.6 0.8 6 10 641 645 640 645 0.88
Sequence Information
- Coding Sequence
- atgCATTTACCTTGTATTCAACATTGGGTCAGGGATAGTTTGAGTTCTAAACATGAGAAAGGAATTGTACCCTTATGGGGATGCCCTAAATGCAGAAAAGAATATAGAGAAGATCAAATACCAAAACGATATGTGTGCTTTTGTGGCAAAACTGAAGATCCACCATATCAACCATGGTCAATTCCTCATTCTTGTGCAGACATATGTGGAAAGTTTCTTCTACCAAAATGTGGACACAAATGCATGCTTCTGTGTCACCCTGGTCCATGTCCTCCGTGTCCAAAAATGGTGTCAGTTACCTGTTTCTGTGGAAAACAATCACCTTGCCCACGGAGATGTAATGCAAAGGAGTGGTCTTGTGGAgtgatttgcaataaaaaatacaaaacatgTGAACATGCATGTATTGAAAATTGTCATCCTGGACCTTGTCCACCTTGTACAGAAAAAGTGCTCACTGCCTGCAATTGCAAGAATAAGTCTGaattaagaaaatgtaatGAATCATTATGGCAGTGCAACAAAGTTTGTGGAAGACGTTTCTCATGCAATGTTCATGTTTGTGAAGATTTTTGTCACAAACCTGGTGATTGCAATGCTTGTCCATTAGAACGTAATAGAACTTGTCCTTGTGGtaagaaaaaatatgctaTATCATGCAAACAGCAGCAGGTACCTACATGCGGAGATACTTGTGGAAAATTATTAGATTGTGGAGCTCATTATTGCAATATGAGATGCCATACAGAACGGTGTGGTCAATGCTTAGAAGTTGTAACAAAAACTTGTCGTTGTGGTAGTTTTAGCAAAGAACTTTCATGTACTAAGGAATTTcactgcaataaaaaatgtactcaAATGAGACTTTGTGGTAGACATCTTTGCAATAAAAAGTGCTGTGATTGCTTGattgaaaatacttttaaagcCTGTGAAAAAGTGTGTGATAACACTCTAAACTGTCGTAAACATAAATGTTCTGCTCCTTGCCATAGTGGACCTTGTTACCCATGTCCTCGTACTATTGTTATTCAATGTCGTTGTGGTAATAGCAAAATTACTGTACCATGCGGAACGGTTAAGAAGATCAAACCTCCGAAttgcaataaaatgtgtaaaatCCCTCCAATCTGTCATCATCCAAAAAGAGAGTCACACAAGTGTCATCAAGGTGCCTGTCCACCATGtcgtaaaatatgtaaattaatttataaaagatgTGGTCACACTTGTCAAGCAGTGTGTCATACTAAAGTGTGGACTAAAGTAAAACCTAATGGCGTTACTAAACCAACTGGACCATGGGAAATTCATAAAGAAAGATGGGAATATCAGTCCTTCCCATGTCCTCCTTGTGAAGTCTCTGTTATGGTCACATGCCTTGGAGGACATGAAACTCTTCCTTGGCCTTGTCACAAAGCTGTTTCAACTTCTTGCCTTAGAGAGTGTGGTCAATTATTGGCGTGTACAAATCACTATTGTGAAAAACTTTGTCATAAGCTAAACTCATtggatgatgataatgatgatgataataagtGTATGGAATGTGAGAAACCATGTAGTTTTCCAAGACCTAAGGGATGTACGCATGCTTGCCCTAAATTGTGTCATCCTATTCCATGCGAACCATGTAAGCAGctggtaaaaattttttgtcacTGTGGAATCAATACTCTGTACATACGTTGTGTTCAACTCACTTCTGCTAACACTGAGAAACGCAATGAACTTCTACAATGTGGCAATCAGTGTCCTCGAAATTATCCCTGTGGACATCGGTGTATTGATAATTGCCACCCTGGACCATGTCAGAAAGCTGAAATTTGCAGTAAGAAAGTTAAAATCATGTGCAATTGTAAACGATTGAAAAAAGATTTCTCATGTAATATAGTACGAGCTGGAAAAGCTTCCGTGGAATGTGATAAGTTAtgccaaaaaagaaaagaggaattagataaaataagagaaatcgagttggaaaaaaagcgaaaagaggaagaagtgaaaaatcaaaaggaaatcgaaatgtttgaaaagaaatttaaacCCAAACGAAGAGGAAAAGAGCGTCATCTTAATAAAGAGCtcttaaacaataaaagtGGCTCTTATAAGTTGATATGGATGTTTGCGGGACTTGCAATTGTAATATCNNNNNNNNNNNNNNNNNNNNGTTAA
- Protein Sequence
- MHLPCIQHWVRDSLSSKHEKGIVPLWGCPKCRKEYREDQIPKRYVCFCGKTEDPPYQPWSIPHSCADICGKFLLPKCGHKCMLLCHPGPCPPCPKMVSVTCFCGKQSPCPRRCNAKEWSCGVICNKKYKTCEHACIENCHPGPCPPCTEKVLTACNCKNKSELRKCNESLWQCNKVCGRRFSCNVHVCEDFCHKPGDCNACPLERNRTCPCGKKKYAISCKQQQVPTCGDTCGKLLDCGAHYCNMRCHTERCGQCLEVVTKTCRCGSFSKELSCTKEFHCNKKCTQMRLCGRHLCNKKCCDCLIENTFKACEKVCDNTLNCRKHKCSAPCHSGPCYPCPRTIVIQCRCGNSKITVPCGTVKKIKPPNCNKMCKIPPICHHPKRESHKCHQGACPPCRKICKLIYKRCGHTCQAVCHTKVWTKVKPNGVTKPTGPWEIHKERWEYQSFPCPPCEVSVMVTCLGGHETLPWPCHKAVSTSCLRECGQLLACTNHYCEKLCHKLNSLDDDNDDDNKCMECEKPCSFPRPKGCTHACPKLCHPIPCEPCKQLVKIFCHCGINTLYIRCVQLTSANTEKRNELLQCGNQCPRNYPCGHRCIDNCHPGPCQKAEICSKKVKIMCNCKRLKKDFSCNIVRAGKASVECDKLCQKRKEELDKIREIELEKKRKEEEVKNQKEIEMFEKKFKPKRRGKERHLNKELLNNKSGSYKLIWMFAGLAIVISXXXXXXX
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01470178;
- 90% Identity
- iTF_00286886;
- 80% Identity
- -