Carc005231.1
Basic Information
- Insect
- Coenonympha arcania
- Gene Symbol
- nfxl1
- Assembly
- GCA_036785405.1
- Location
- CM072059.1:9712593-9715073[+]
Transcription Factor Domain
- TF Family
- zf-NF-X1
- Domain
- zf-NF-X1 domain
- PFAM
- PF01422
- TF Group
- Zinc-Coordinating Group
- Description
- This domain is presumed to be a zinc-binding domain. The zinc finger follows the pattern C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C, where X can be any amino acid and the numbers in brackets give the number of residues. Two positions can be either His or Cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
- Hmmscan Out
-
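| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 17 | 3 | 3.3e+04 | -7.8 | 5.6 | 15 | 19 | 135 | 139 | 134 | 139 | 0.81 |
| 2 | 17 | 0.016 | 1.7e+02 | 3.2 | 0.3 | 4 | 10 | 170 | 176 | 169 | 177 | 0.97 |
| 3 | 17 | 4.7e-07 | 0.0051 | 17.6 | 15.9 | 3 | 19 | 187 | 203 | 186 | 203 | 0.95 |
| 4 | 17 | 3 | 3.3e+04 | -5.0 | 2.1 | 6 | 10 | 228 | 232 | 227 | 232 | 0.81 |
| 5 | 17 | 2.4e-06 | 0.026 | 15.4 | 12.2 | 3 | 19 | 240 | 256 | 238 | 256 | 0.91 |
| 6 | 17 | 2.7e-09 | 3e-05 | 24.8 | 13.3 | 1 | 19 | 291 | 309 | 291 | 309 | 0.99 |
| 7 | 17 | 1.6e-05 | 0.18 | 12.7 | 19.3 | 3 | 19 | 346 | 362 | 338 | 362 | 0.86 |
| 8 | 17 | 0.0001 | 1.1 | 10.2 | 20.3 | 1 | 18 | 396 | 413 | 396 | 414 | 0.97 |
| 9 | 17 | 2.2e-08 | 0.00024 | 21.9 | 17.3 | 1 | 19 | 423 | 441 | 423 | 441 | 0.99 |
| 10 | 17 | 0.00011 | 1.2 | 10.1 | 6.1 | 9 | 19 | 489 | 499 | 487 | 499 | 0.90 |
| 11 | 17 | 0.0052 | 57 | 4.7 | 6.8 | 1 | 12 | 508 | 518 | 508 | 518 | 0.96 |
| 12 | 17 | 0.47 | 5.1e+03 | -1.5 | 3.5 | 14 | 18 | 547 | 551 | 547 | 552 | 0.87 |
| 13 | 17 | 0.83 | 9.1e+03 | -2.3 | 0.3 | 8 | 11 | 568 | 571 | 568 | 572 | 0.75 |
| 14 | 17 | 1.3e-05 | 0.15 | 13.0 | 7.9 | 1 | 11 | 588 | 598 | 588 | 602 | 0.94 |
| 15 | 17 | 1.2 | 1.3e+04 | -2.8 | 1.3 | 15 | 18 | 613 | 616 | 612 | 617 | 0.84 |
| 16 | 17 | 0.028 | 3e+02 | 2.4 | 21.5 | 3 | 18 | 629 | 645 | 628 | 646 | 0.87 |
| 17 | 17 | 0.00033 | 3.6 | 8.5 | 12.5 | 1 | 16 | 687 | 701 | 687 | 707 | 0.85 |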
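The zinc finger pattern quoted in the description can be tried out as a regular expression. The sketch below is illustrative only and is not how the hits in the Hmmscan table were produced (those come from a profile HMM, which tolerates deviations a fixed pattern does not). It assumes a literal reading of the pattern in which `C(H/C)` means a Cys followed by one position that is His or Cys; the written pattern is ambiguous on this point. The names `NFX1_PATTERN` and `find_fingers` are hypothetical, and the test excerpt is a short stretch copied from the protein sequence of this entry.

```python
import re

# Assumed literal reading of the quoted pattern:
# C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C
# "C(H/C)" is taken here as Cys followed by one His-or-Cys position.
NFX1_PATTERN = re.compile(r"C.{1,6}H.C.{3}C[HC].{3,4}[HC].{1,10}C")


def find_fingers(protein):
    """Return (start, end, matched_text) for non-overlapping pattern matches.

    Coordinates are 1-based and inclusive, like the ali coord columns
    in the hmmscan table.
    """
    hits = []
    for m in NFX1_PATTERN.finditer(protein.upper()):
        hits.append((m.start() + 1, m.end(), m.group()))
    return hits


if __name__ == "__main__":
    # Short excerpt taken from the protein sequence of this entry.
    excerpt = "KPLQCGRHKCTTVCHHGPCYPCPLESK"
    for start, end, text in find_fingers(excerpt):
        print(start, end, text)
```

Because `finditer` reports non-overlapping matches and the variable-length gaps are matched greedily, a scan of the full protein may merge or skip closely spaced fingers, so the result should not be expected to reproduce the 17 domain rows of the table.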
Sequence Information
- Coding Sequence
- ATGGCTAGGAGGTACCGTGATGCTGCCGCCAAGTTGCAAGAAAACGTTAAAAAACACTTGcaaataataaaacatgaaGTGTCTTCTTCGGAAGACGAAGAGCCGTTTGAACAGAATGTCCTGGATGGCGTGTTCCAAAGCTATTGGCGGGGCGGTGGAGACACGCAGATGTTGGAGAGAACCAAAAATCTTCTCGAAGAGGCGATCAGTGGTAGGTCCGTAACTTGCTTAATATGCATCGGCTCCATAAAGCGGGCGGACGCGATATGGACTTGTGACCACTGCTACTGTTATTTCCACCTCTCTTGTATTCAAAAGTGGTCAAATGACAGCATCAGCTTACGTAGTGAAGAGCACCATGGCCCGATAGCAGTTTTCCGACCCAAGAAAATAGAATGGTGCTGTCCTAAATGCAGACATTCCTATTCTAAAGACGAAATCCCCAGAAGGTACCGCTGCTTCTGTGGCAAAACTGACGATCCAGAACATCATCCTTGGTTGATTCCCCATACATGTGGAGAAGTTTGTGGGAAGAGGTTATCTTTAGGAGAGAGTTGCAAGCACAAGTGCTTGCTGCTGTGCCACCCAGGCCCCTGCCCTCCATGTCCACAGACAGTGAATGCGGCATGCTATTGTGGCAAAGAACGGAAGAGAGTTCGATGCAGTGCCTCTCTCTGGTCATGCCAGCAACAATGCAACAAGATATTGCCCTGTAAGTCACATAAATGTGCAATCGAATGCCACGATGGAGACTGTCCTCCTTGCACCTACACAAGCGTGCAACCCTGTCAGTGTGGAGCTGAGAAAACAAAGAGACCCTGCAATGACCTCATATGGCAGTGTACAAAGCAATGCAACAAACCATTAGCTTGTGGATATCACAAATGTGAGAAAATATGTCACACTGGAAGTTGTGGTTCATGCCCACACTCTGGTGTACAGTCTTGTCCGTGTGGCTCCAATGAGCATTTCATACAGTGTCCTGATGTTATGGAAACTTGTGTCAGTACTTGTGGGAAGAAACATGAAGATTGTGAACATAACTGCCCGGCAAGATGTCACAAGGGCCCTTGTCCGCCTTGCCAAGTTTTGATAGAAAAACAATGTCAATGTGCAACTCACACTAGATCACTTCCGTGTAGCAAAGAATTCAGGTGTGAAACAAAATGTAGAGGCATCAGACCATGCGGAAAACATGGCTGCGGACGCAAATGCTGCAATGGAAACTGTCCTCCTTGTGAAAAGATATGTGATAAGCCTCTCCAATGTGGCAGACATAAATGCACCACCGTTTGCCACCACGGTCCTTGCTACCCGTGCCCATTGGAATCTAAAGTAACTTGTCGCTGCAAGGAGACTTATGTAACTGTTCCTTGTGGTAGAGAAAAGCACACACGCCCACCAAAATGTAATCTTCCTTGTAAGATTAAGTATAAATGTGGACATGTTGAAGAAAATAAACATACTTGTCATTTTGGTGACTGTCCTCCATGTAAGgctgtttgtaacaaaaaatatgaGTGTGGACACAACTGTGTGGCAACATGTCATGAATATGTGCTGGTTGTTTTTAAACAAGTGGAGAAGCCTGCCACGCCGTGGGAGATTCAGCCTTCTAAAACTAAAATTGTGAATCTCGATTGCCCACCTTGTGAGGCCCCTGTCTTGGTCACATGCTTTGGGGAACACGAGACAGACTACCAGCCATGTCACACGGCAACCCGCAGACCCTGTGGCCGTGAATGTGGCAGGCCACTCTCGTGTGGAAATCACACATGTGAACTTCTCTGTCATCTGCATGGACAAGATGCTGACTATCCAAATGTGCCTTATACTTGCAAGCCATGTAACAGGGAATGTCTGAAAATCCGTCCTGAGAAATGTACACATAGGTGCTCAAAATGTGGCTGTCATCCCGGACCGTGTCCACCTTGTGAAATCTTGGAAAGAATACCATGCCACTGCAAAGTGACAGAAATGTATATGCGATGTAGAGAG
CTGGCAGCAACAACAGAAGAAAGTCTGAGCTGCAAACAACAGTGCCCCAAGAATTTAGAGTGTGGTCATCGCTGCAAGAAGATTTGTCACTCGGAACCATGTCATGATAACCAGACATGtttaaaaaagacaaaaataaattgccCTTGTGGCCATTTGAAAAAAGAGGCACCATGCACCTCTGTCAGGAACAGCGAAATACTGGTGAAGTGTGATGAAAGCTGCGAAGCTAAGAAAGCTGCTGCTAAGTTGGAaagagaaaaagaagaaaagcgTTTGAAGGAAGTGGAACTAGAGAAAAATCGTAGAGAATTAGAAGAGTATGAATGGAAATTGAGTGGTAAGAAGAAGAAGTATAAAGAGAAGAAAGTTGTTTCAAACAGAGATGACAGAAATTTTCTGCAAAAATATTGGGTTCCTATTTTGTCTATACCAGTGCTGATTGTAGCTGCtatttattacatatttaaTCCTAATATCTGA
- Protein Sequence
- MARRYRDAAAKLQENVKKHLQIIKHEVSSSEDEEPFEQNVLDGVFQSYWRGGGDTQMLERTKNLLEEAISGRSVTCLICIGSIKRADAIWTCDHCYCYFHLSCIQKWSNDSISLRSEEHHGPIAVFRPKKIEWCCPKCRHSYSKDEIPRRYRCFCGKTDDPEHHPWLIPHTCGEVCGKRLSLGESCKHKCLLLCHPGPCPPCPQTVNAACYCGKERKRVRCSASLWSCQQQCNKILPCKSHKCAIECHDGDCPPCTYTSVQPCQCGAEKTKRPCNDLIWQCTKQCNKPLACGYHKCEKICHTGSCGSCPHSGVQSCPCGSNEHFIQCPDVMETCVSTCGKKHEDCEHNCPARCHKGPCPPCQVLIEKQCQCATHTRSLPCSKEFRCETKCRGIRPCGKHGCGRKCCNGNCPPCEKICDKPLQCGRHKCTTVCHHGPCYPCPLESKVTCRCKETYVTVPCGREKHTRPPKCNLPCKIKYKCGHVEENKHTCHFGDCPPCKAVCNKKYECGHNCVATCHEYVLVVFKQVEKPATPWEIQPSKTKIVNLDCPPCEAPVLVTCFGEHETDYQPCHTATRRPCGRECGRPLSCGNHTCELLCHLHGQDADYPNVPYTCKPCNRECLKIRPEKCTHRCSKCGCHPGPCPPCEILERIPCHCKVTEMYMRCRELAATTEESLSCKQQCPKNLECGHRCKKICHSEPCHDNQTCLKKTKINCPCGHLKKEAPCTSVRNSEILVKCDESCEAKKAAAKLEREKEEKRLKEVELEKNRRELEEYEWKLSGKKKKYKEKKVVSNRDDRNFLQKYWVPILSIPVLIVAAIYYIFNPNI
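The coding and protein sequences above can be cross-checked by translating the coding sequence. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly; `translate` and `CODON_TABLE` are hypothetical names. Lowercase bases in the coding sequence are treated the same as uppercase, and whitespace such as a line break inside the sequence is ignored.

```python
from itertools import product

# Standard genetic code (assumed), laid out in TCAG order:
# the third base varies fastest, then the second, then the first.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a coding sequence up to (not including) the first stop codon.

    Whitespace is removed and case is ignored; codons containing
    characters other than A, C, G, T become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 27 bases of the coding sequence of this entry.
    print(translate("ATGGCTAGGAGGTACCGTGATGCTGCC"))
```

Passing the full coding sequence (both lines joined) to `translate` and comparing the result with the protein sequence is a quick way to confirm that the two fields belong together.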
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00802512;
- 90% Identity
- iTF_00354770;
- 80% Identity
- -