Basic Information

Insect
Ceramica pisi
Gene Symbol
Plc21C
Assembly
GCA_963859965.1
Location
OY982536.1:20260090-20275890[+]
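The Location field packs a sequence accession, a coordinate range and a strand into one string. A minimal sketch of splitting it apart follows; the `Location` type and `parse_location` helper are hypothetical names introduced here, and the span calculation assumes 1-based inclusive coordinates, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


# Expected shape: <accession>:<start>-<end>[<strand>]
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into accession, start, end and strand."""
    m = LOCATION_RE.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(m["seqid"], int(m["start"]), int(m["end"]), m["strand"])


loc = parse_location("OY982536.1:20260090-20275890[+]")
# Assumption: coordinates are 1-based and inclusive.
span = loc.end - loc.start + 1
print(loc, span)
```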

Transcription Factor Domain

TF Family
zf-NF-X1
Domain
zf-NF-X1 domain
PFAM
PF01422
TF Group
Zinc-Coordinating Group
Description
This domain is presumed to be a zinc-binding domain. The zinc finger is described by the pattern C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C, where X can be any amino acid and the numbers in parentheses give the number of residues. Two positions can be either His or Cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
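The residue pattern in the description can be approximated as a regular expression. The sketch below assumes a literal reading of the pattern, in which "C(H/C)" means a Cys followed by one His-or-Cys position and "X3" means exactly three residues; that reading is an interpretation, not something the record confirms. `ZF_NFX1_RE` and `find_motifs` are hypothetical names, and the test peptide is synthetic rather than taken from this entry.

```python
import re

# Assumed literal reading of the pattern; "X" becomes "." (any residue).
ZF_NFX1_RE = re.compile(r"C.{1,6}H.C.{3}C[HC].{3,4}[HC].{1,10}C")


def find_motifs(protein: str) -> list[tuple[int, int, str]]:
    """Return (start, end, matched_text) using 1-based inclusive positions.

    Matches are non-overlapping, as reported by re.finditer.
    """
    return [
        (m.start() + 1, m.end(), m.group())
        for m in ZF_NFX1_RE.finditer(protein.upper())
    ]


# Synthetic peptide built to satisfy the pattern (not from the record).
synthetic = "MK" + "CAHACAAACHAAACAC" + "GG"
hits = find_motifs(synthetic)
print(hits)
```

A strict regular expression of this kind is much less tolerant than a profile HMM search, so it may find nothing in a sequence where hmmscan still reports weak partial hits.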
Hmmscan Out
#   of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to  acc
1   11  0.00047   7         8.1    1.9   9         17      397       405     395       405     0.86
2   11  0.00047   7         8.1    1.9   9         17      441       449     439       449     0.86
3   11  0.00047   7         8.1    1.9   9         17      485       493     483       493     0.86
4   11  0.00047   7         8.1    1.9   9         17      542       550     540       550     0.86
5   11  0.00047   7         8.1    1.9   9         17      586       594     584       594     0.86
6   11  0.00047   7         8.1    1.9   9         17      630       638     628       638     0.86
7   11  0.00047   7         8.1    1.9   9         17      674       682     672       682     0.86
8   11  0.00047   7         8.1    1.9   9         17      718       726     716       726     0.86
9   11  0.00047   7         8.1    1.9   9         17      762       770     760       770     0.86
10  11  0.00047   7         8.1    1.9   9         17      806       814     804       814     0.86
11  11  0.00047   7         8.1    1.9   9         17      850       858     848       858     0.86
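Each hit row carries thirteen whitespace-separated fields, so the rows can be loaded into typed records for further checks. The sketch below uses the first four rows of this entry; the `DomainHit` field names are hypothetical labels for the columns, not names defined by HMMER or by this database.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    index: int
    of: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line: str) -> DomainHit:
    """Parse one whitespace-separated hit row with 13 fields."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


# First four hit rows copied from the record.
ROWS = """
1 11 0.00047 7 8.1 1.9 9 17 397 405 395 405 0.86
2 11 0.00047 7 8.1 1.9 9 17 441 449 439 449 0.86
3 11 0.00047 7 8.1 1.9 9 17 485 493 483 493 0.86
4 11 0.00047 7 8.1 1.9 9 17 542 550 540 550 0.86
"""

hits = [parse_hit(line) for line in ROWS.strip().splitlines()]
# Distance between consecutive alignment starts, in residues.
spacing = [b.ali_from - a.ali_from for a, b in zip(hits, hits[1:])]
print(spacing)  # [44, 44, 57]
```

Splitting on whitespace keeps the parser independent of how the columns are aligned in the page.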

Sequence Information

Coding Sequence
atgatggagtgtatttatttttgtgagcAGTTAACGCGGCAAGGCTCGTCGGAGTCGTCGGACTCTGAGAGTTCGTCGGGCGAGGAGGAGGCGGGGCTGGCCGACAGCGCGCCCGACGACGCGCGCGAGACGCACGCGGGCGCGGAGATCTCCGCGCTCGTCAACTACGTGCAGCCCGTGCACTTCAGCTCCTTTGAGAACTCCGAGAAAAAGAACCGCTTCTACGAGATGTCTTCATTCGACGAGAAGCAGGCCACGACGCTGCTGAAGGAGCGGCCCATAGAGTTCGTGAACTACAACAAGCACCAGCTGTCGCGCGTGTACCCCGCGGGGACCAGGTTCGACTCCTCCAACTTCATGCCGCAGGTTTTCTGGAACGCCGGCTGTCAGTTAGTGGCTCTCAACTACCAGACGTTGGACTTGGCCATGCAGCTGAACTTGGGCACCTACGAGTACAACCGACGATGTGGGTACCTCCTCAAGCCGGAGTTTATGCGGAGAAAgGACCGCCGCCTGGACCCGTTCGCGGAGAGCACGGTGGACGGCATCATCGCGGGCACGCTGTCGGTCACGGTGCTGTCGGGCCAGCTGCTGACGGACAAGCGCTGCGGCACGTACGTGGAGGTGGACATGTTCGGCCTGCCCGCCGACACCGTGCGCAAGAAGTTCCGCACGCGCGTCGCGCCCAACAACGGCATTAACCCCGTCTATGGAGAGGAGccctttgtttttaaaaagGTAGTGCTGCCAGAACTAGCCATGCTTCGCATCGCGGCGCACGAAGAAAGTGGCCGTTTGTTGGGGCACCGCGTACTCCCCGTGCTGGGGCTGTGTCCCGGCTACCGCTCCGTGAACCTGCGCACGGAACTCGGCCTGCCACTACCCGCGAGTTTGTTGCTACTCGTCGTCGTCAAGGATTACGTGCCAGACCGGCTGTCGGAACTAGCAGAGGCTTTGGCGAACCCAATCAAGTATCAGAGCGAGCTCGACAAGCGTGAACACCAGCTAGCTAAGCTAACAGAAGACACCGACGTGCCCATCGACGCGCTGCCACTCCGGCCCCTGTGCCCCGCTGACCAGCCCACCAAGGACGACGCGCGCCGCCCAGTCACACACCCTTCACACACCCTGTACGTACCCACACATGACACACAGAGCGAGCTCGACAAGCGCGTTCTGCCCCAGACGCGCTGCCACTCCGGCCCCTGTGCCCCGCCGACCAGCCCACCAAGGACGACGCGCGCCGCCCAGTCACACACCCTGTACGTACCCACACATGACACACAGAGCGAGCTCGACAAGCGCGTTCTGCCCCAGACGCGCTGCCACTCCGGCCCCTGTGCCCCGCCGACCAGCCCACCAAGGACGACGCGCGCCGCCCAGTCACACACCCTGTACGTACCCACACATGACACACAGAGCGAGCTCGACAAGCGCGTTCTGCCCCAGACGCGCTGCCACTCCGGCCCCTGTGCCCCGCCGACCAGCCCACCAAGGACGACGCGCGCCGCCCAGTCACACACCCTCCCACCAAGGACGACGCGCGCCGCCCAGTCACACACCCTGTACGTACCCACACATGACACACAGAGCGAGCTCGACAAGCGCGTTCTGCCCCAGACGCGCTGCCACTCCGGCCCCTGTGCCCCGCCGACCAGCCCACCAAGGACGACGCGCGCCGCCCAGTCACACACCCTGTACGTACCCACACATGACACACAGAGCGAGCTCGACAAGCGCGTTCTGCCCCAGACGCGCTGCCACTCCGGCCCCTGTGCCCCGCCGACCAGCCCACCAAGGACGACGCGCGCCGCCCAGTCACACACCCTGTACGTACCCACACATGACACACAGAGCGAGCTCGACAAGCGCGTTCTGCCCCAGACGCGCTGCCACTCCGGCCCCTGTGCCCCGCCGACCAGCCCACCAAGGACGACGCGCGCCGCCCAGTCACACACCCTGTACGTACCCACACATGACACACAGAGCGAGCTCGACAA
GCGCGTTCTGCCCCAGACGCGCTGCCACTCCGGCCCCTGTGCCCCGCCGACCAGCCCACCAAGGACGACGCGCGCCGCCCAGTCACACACCCTGTACGTACCCACACATGACACACAGAGCGAGCTCGACAAGCGCGTTCTGCCCCAGACGCGCTGCCACTCCGGCCCCTGTGCCCCGCCGACCAGCCCACCAAGGACGACGCGCGCCGCCCAGTCACACACCCTGTACGTACCCACACATGACACACAGAGCGAGCTCGACAAGCGCGTTCTGCCCCAGACGCGCTGCCACTCCGGCCCCTGTGCCCCGCCGACCAGCCCACCAAGGACGACGCGCGCCGCCCAGTCACACACCCTGTACGTACCCACACATGACACACAGAGCGAGCTCGACAAGCGCGTTCTGCCCCAGACGCGCTGCCACTCCGGCCCCTGTGCCCCGCCGACCAGCCCACCAAGGACGACGCGCGCCGCCCAGTCACACACCCTGTACGTACCCACACATGACACACAGAGCGAGCTCGACAAGCGCGTTCTGCCCCAGACGCGCTGCCACTCCGGCCCCTGTGCCCCGCCGACCAGCCCACCAAGGACGACGCGCGCCGCCCAGTCACACACCCTGTACGTACCCACACATGACACACAGAGCGAGCTCGACAAGCGCATTAAACTAGACGAGCAGCCAGACCGCAGTCCGTCGAAAGAGACGACGAACGAGACCAGAAAATTTATCGAGACCGAGAGCAGCGTACCTTCACCACTACCTCCCAATACCAATGAAGCCGACGTAAAAACGAACGGTACATCGGACGAGTCTTTAGCGGAAACACTCGAGACTCTCCTATCAATGAAGATCGTTCGAGAAAAACAGGCGGAGCTCGCGCGGAAACTTGACGCGTTACGACGCAAGTTCGACAAGGAGAAGAAACCGACCTCCAAGTTTTACGTCAAAAATAGCCTCGTGAAAAGACTGTCCTCgaaaaatatgtaa
Protein Sequence
MMECIYFCEQLTRQGSSESSDSESSSGEEEAGLADSAPDDARETHAGAEISALVNYVQPVHFSSFENSEKKNRFYEMSSFDEKQATTLLKERPIEFVNYNKHQLSRVYPAGTRFDSSNFMPQVFWNAGCQLVALNYQTLDLAMQLNLGTYEYNRRCGYLLKPEFMRRKDRRLDPFAESTVDGIIAGTLSVTVLSGQLLTDKRCGTYVEVDMFGLPADTVRKKFRTRVAPNNGINPVYGEEPFVFKKVVLPELAMLRIAAHEESGRLLGHRVLPVLGLCPGYRSVNLRTELGLPLPASLLLLVVVKDYVPDRLSELAEALANPIKYQSELDKREHQLAKLTEDTDVPIDALPLRPLCPADQPTKDDARRPVTHPSHTLYVPTHDTQSELDKRVLPQTRCHSGPCAPPTSPPRTTRAAQSHTLYVPTHDTQSELDKRVLPQTRCHSGPCAPPTSPPRTTRAAQSHTLYVPTHDTQSELDKRVLPQTRCHSGPCAPPTSPPRTTRAAQSHTLPPRTTRAAQSHTLYVPTHDTQSELDKRVLPQTRCHSGPCAPPTSPPRTTRAAQSHTLYVPTHDTQSELDKRVLPQTRCHSGPCAPPTSPPRTTRAAQSHTLYVPTHDTQSELDKRVLPQTRCHSGPCAPPTSPPRTTRAAQSHTLYVPTHDTQSELDKRVLPQTRCHSGPCAPPTSPPRTTRAAQSHTLYVPTHDTQSELDKRVLPQTRCHSGPCAPPTSPPRTTRAAQSHTLYVPTHDTQSELDKRVLPQTRCHSGPCAPPTSPPRTTRAAQSHTLYVPTHDTQSELDKRVLPQTRCHSGPCAPPTSPPRTTRAAQSHTLYVPTHDTQSELDKRVLPQTRCHSGPCAPPTSPPRTTRAAQSHTLYVPTHDTQSELDKRIKLDEQPDRSPSKETTNETRKFIETESSVPSPLPPNTNEADVKTNGTSDESLAETLETLLSMKIVREKQAELARKLDALRRKFDKEKKPTSKFYVKNSLVKRLSSKNM
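The two sequence fields can be spot-checked against each other by translating the coding sequence. The sketch below assumes the standard genetic code (NCBI translation table 1) and translates only the first 45 bases of the coding sequence above; `CODON_TABLE` and `translate` are hypothetical helper names, and the mixed-case input is upper-cased before lookup.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = cds.upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 45 bases of the coding sequence in this record.
cds_prefix = "atgatggagtgtatttatttttgtgagcAGTTAACGCGGCAAGGC"
print(translate(cds_prefix))  # MMECIYFCEQLTRQG
```

The result agrees with the first 15 residues of the protein sequence in this entry. Codons containing ambiguity codes such as N are not handled and would raise a `KeyError`.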

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-