Cmar001684.1
Basic Information
- Insect
- Clunio marinus
- Gene Symbol
- nfxl1
- Assembly
- GCA_900005825.1
- Location
- CVRI01000001.1:234401-238123[+]
Transcription Factor Domain
- TF Family
- zf-NF-X1
- Domain
- zf-NF-X1 domain
- PFAM
- PF01422
- TF Group
- Zinc-Coordinating Group
- Description
- This domain is presumed to be a zinc binding domain. The following pattern describes the zinc finger. C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C Where X can be any amino acid, and numbers in brackets indicate the number of residues. Two position can be either his or cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 17 0.43 9.1e+03 -2.9 2.0 14 19 132 137 132 137 0.86 2 17 0.032 6.9e+02 0.7 0.4 4 10 166 172 165 172 0.96 3 17 0.00061 13 6.2 12.4 1 19 181 198 181 198 0.98 4 17 0.035 7.5e+02 0.5 15.3 1 18 234 249 234 250 0.96 5 17 0.17 3.6e+03 -1.6 1.3 6 10 275 279 274 279 0.90 6 17 6e-07 0.013 15.8 10.1 3 18 287 302 285 303 0.92 7 17 5.5e-05 1.2 9.5 10.1 1 19 340 360 340 360 0.91 8 17 1 2.1e+04 -4.8 14.0 3 18 396 410 394 411 0.87 9 17 8.5e-10 1.8e-05 24.9 12.5 1 19 420 438 420 438 0.98 10 17 1 2.1e+04 -4.6 1.9 5 10 464 469 464 469 0.89 11 17 0.00037 7.9 6.9 7.9 9 19 483 493 481 493 0.90 12 17 0.012 2.5e+02 2.1 4.8 1 12 502 512 502 512 0.90 13 17 9.3e-05 2 8.8 4.5 1 12 583 594 583 594 0.97 14 17 0.73 1.6e+04 -3.7 0.7 6 10 610 614 609 614 0.73 15 17 0.026 5.5e+02 1.0 16.0 4 19 622 637 620 637 0.93 16 17 3.7e-06 0.078 13.3 10.1 1 16 688 702 688 708 0.93 17 17 1 2.1e+04 -4.6 0.9 6 10 739 743 739 743 0.90
Sequence Information
- Coding Sequence
- ATGTCATCgaaaatatcatcaacaaGGGAAATCAGTCTGAAGAAAGTTTTACTAGAAGAATCGGAATCAGAAGATGACGAAGAAAGCTATGATAAAGTTGTAATTACagagaaacttttccaaaatTATGATGGGAATAAGAATGACGTTTCCAGAATCGTGGAACTTGTGGAGAATGGAGAAGTAGACTGTTTAATATGTATTCGAAGAGTGAGAAGTGACAGTAAAATATGGAATTGTAGTAACTGTTACTGCACATTCCATCTTTTGTGTATTCAAAAGTGGGCAAATGACTCAATAAGCCAAAAGCGATTGTACTTTGAGAATCAACCTTTGGGATATTACAATAATGCGGGTGTGTATATACCTAGGAAAGAGTCTTCTATTTATTGGGATTGTCCTCAGTGTCGTCTTCAGTATACTGACATTCCGAGGGTTTACGAATGTTACTGCGGAAAGGAACGCGATCCATTACCACAACCATTTCTAATTCCACACTCTTGTGGTGAAGTCTGCGGTAGACCATTAAATAATCCTCAATGTGGACATGTTTGTTACTTATTATGTCACCCAGGACCCCATCCAACCTGCCCACAAATTCTTCAAAAGTCTTGTGAATGTGGGAAATCTCCAATAAAATCGATTCGTTGTTCTCAACAATATTGGAGTTGTAACAACATTTGCTTAAAGAAACTAAGTTGTGGTCACAATTGTGGAGGGATTTGTCATAAAACTTGTCCACCATGCAATAAATCTAGTGTAAGATTTTGCTTATGTCGCGGTagttcaaaagaaatcaaatgtaAGCAGGAATTTTGGAGCTGTCAGAAAGTGTGCAAGAAAATTCTATCTTGCGATGTTCATCTTTGTGAAAAAAAGTGTCATGAAGGTGACTGCGGCTCTTGCATTTATGGATTGAAACGTTCATGCTTTTGTGGTAAACAATCATTCATATCGGAAAGTTGCGGGAATTTCACAATAGAATCATGTGGCGATACTTGTTTAAAGCCTCTTGCTTGTGGTAATCCGAAACATTTGTGTTATATGAGATGTCATAAAGGAGAATGCGACACTTGCAAGgAATTAATCGAGAAAAATTGTCGTTGTGGTGCAATAACGAAGAAATATCCGTGCTCGAAAGAACTTTTATGCGAGACTAAATGTAAGAACATCAAAAGCTGTAAAAAGCATAATTGCAATCGTAAATGTTGCGTTGAGTGTTTGCCATGCGATAGAATATGCTCAAAAACACTTTCTTGCGGTAAACATAAATGCAATGCTTTATGTCATGGTGGGAATTGCTATCCatgttcaattaaaaaacaGATAAAATGTCGATGTGGAGCCACAAGTGTTTCTGTAGTCTGTGGACGAAAGAATCGAATTCCAAAATGTCGAGAAACCTGTAAGTTACCTTCAAAGTGTCATCACGATCCTTCACCACATAAATGTCATTTTGGCGAGTGTCCACCATGTACACAAACTTGCGATGAACTTCTACCGTGCTCGCATAAATGTTTAGCAAACTGTCATGACTTTGTTAAAGTCgttaaaaaagataaaagttttatccCAAAACTTCCTGGTGAAATTGCTGAGCAGATAGtggagatgaagaaaattgatcaTCCGCAATGTTTTACAAAAGTCCAACTTGAGTGCCTTGGAGGTCACGAAATTGTACATATGAAGTGTCACGAAGCAAAAGTCTTTTCTTGTGGCAGATCCTGCAACCGAAAATTGGAATGTGGAAATCATTTTTGTCAGTTAAATTGCCATTCAGTAAACGAACCAAAGAGTGAAAATCAAGATGCAAATTGTGAAGATTGTGACAAGCCATGCGAGTTCGAGAGATTTTGCCCGCACCCATGTCAAAATCCATGTCATCCAAACGCTTGCAAGAAATGCCGCATTCAAgtcaaaacaaaatgtttctgtGGATTGAATGATGCTTATTTTCGTTGCTgtgatgtttttaaaaaggGTTTGAATAACGATGAAATCGATAATTTGAAAGCTAAATATTTATCGTGTGGCTTGAGATGTATTAAAAATtATTCTTGTGGTCATCAATGTGTTGCTGTTTGTCATAATGGAGAATGTCCCGATGAAGAATTTtgcaagaaaaaagttaaactaATATGTGAATGcaaaacaaggaaaattgaaacaacatGTGACAAAGTAAGAGCCGAAAAGCTTCACATTCTTCCATGCGACGAAACTTGTgaggagaaaagaaagaaagctgaagaattgaaagatATTGAGCGAAAAAAACAGGAAGAGTTGGAAGCTGAGAGAAACAGACAGGAACTTGAAgcatatgaaaagaaaattggcGGCAGAAAACACCGAGAAAGAAAGCCGAGatttattgaagaagaaaagtcaaaTAAGAATGTTGTTTATATCTTGATTGTAATTTCGGCAGTTGTTGCTGTTGTCATAGGATATTTAATGACAAGTATGTAG
- Protein Sequence
- MSSKISSTREISLKKVLLEESESEDDEESYDKVVITEKLFQNYDGNKNDVSRIVELVENGEVDCLICIRRVRSDSKIWNCSNCYCTFHLLCIQKWANDSISQKRLYFENQPLGYYNNAGVYIPRKESSIYWDCPQCRLQYTDIPRVYECYCGKERDPLPQPFLIPHSCGEVCGRPLNNPQCGHVCYLLCHPGPHPTCPQILQKSCECGKSPIKSIRCSQQYWSCNNICLKKLSCGHNCGGICHKTCPPCNKSSVRFCLCRGSSKEIKCKQEFWSCQKVCKKILSCDVHLCEKKCHEGDCGSCIYGLKRSCFCGKQSFISESCGNFTIESCGDTCLKPLACGNPKHLCYMRCHKGECDTCKELIEKNCRCGAITKKYPCSKELLCETKCKNIKSCKKHNCNRKCCVECLPCDRICSKTLSCGKHKCNALCHGGNCYPCSIKKQIKCRCGATSVSVVCGRKNRIPKCRETCKLPSKCHHDPSPHKCHFGECPPCTQTCDELLPCSHKCLANCHDFVKVVKKDKSFIPKLPGEIAEQIVEMKKIDHPQCFTKVQLECLGGHEIVHMKCHEAKVFSCGRSCNRKLECGNHFCQLNCHSVNEPKSENQDANCEDCDKPCEFERFCPHPCQNPCHPNACKKCRIQVKTKCFCGLNDAYFRCCDVFKKGLNNDEIDNLKAKYLSCGLRCIKNYSCGHQCVAVCHNGECPDEEFCKKKVKLICECKTRKIETTCDKVRAEKLHILPCDETCEEKRKKAEELKDIERKKQEELEAERNRQELEAYEKKIGGRKHRERKPRFIEEEKSNKNVVYILIVISAVVAVVIGYLMTSM
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -