Basic Information

Gene Symbol
Mical
Assembly
GCA_944738805.1
Location
CALYJE010000048.1:4747776-4786602[-]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 6 0.0079 0.86 10.3 1.4 1 21 5158 5178 5158 5179 0.95
2 6 0.016 1.8 9.3 0.2 3 23 5230 5251 5228 5251 0.87
3 6 0.48 52 4.7 0.2 1 23 5257 5279 5257 5279 0.96
4 6 2.4e-05 0.0027 18.2 2.2 1 21 5286 5306 5286 5307 0.94
5 6 2.5e-05 0.0028 18.1 1.2 2 23 5319 5340 5318 5340 0.97
6 6 0.027 3 8.6 3.9 1 17 5351 5367 5351 5368 0.94

Sequence Information

Coding Sequence
ATGAGTCGATCTTCACATCAACGCCAACAGCAGCAGCTGCTGGCTCAACAACAGCAACAAATGGCGGACAATGATGCCGCTGCGGCCGCTGCCGAAATGTTCGACATGTTCTGCCTGGCCACAACAATGCGGCAAATACTCGGCCTTCATCGGAACATGTGCGACACAGTCGGTCttcggccagcaccgttaaatgaattctacccaaagctaaaagcaaaaatccgatcttggaaagctcaagcactgtggaagaagtttgacgcacgtgcatcgcaccgggcgtactccaagggaaatgcatgcacaggcacaagagtTCTAGTCATAGGGGCTGGTCCTTGCGGCTTGCGAACGGCAATCGAAGCTCAACTACTCGGCGCCAAGGTTGTAGTGCTTGAGAAACGTGATCGGATTTCACGTAACAATGTCCTGCATCTGTGGCCATTTGTCATCACAGATTTGCGGAATCTGGGCGCGAAGAAATTCTATGGGAAGTTCTGTGCCGGTTCTATCGATCACATCTCGATTCGGCAGCTACAGTGTATTCTGCTGAAGGTAGCTCTGCTGCTGGGAGTCGAAGTCCACGAGGGTGTGTCATTCGAAAGCACAATCGAGCCGAACGAAGGCTGCGGCTGGCGGGCGGCCATCACaccagaggaccatgcggtttcccattatgaattcgacgtacttatcggagctgatggcaaacggaataccctggaggcattcaaacgcaaggaatttcgcggcaagctggccattgctataacagcgaatttcataaataagaagaccgaggccgaggccaaggcggaggagatcagcggcgtggccttcatattcaatcagacgttcttcaaagagctgtaccacacgactggtatagatcttgagaacatagtgtactacaaagatgagactcattactttgtgatgaccgcaaagaaacacagtcttatcgacaaaggtgtgatattgcagGATTTTGCCGATCCAGCTGAACTTCTTGCTTCACAGAATGTGAACACAGAAAAGCTTCTGGACTATGCGCGGGAGGCAGCTGAGTTTTCAACCAAATACCAAATGCCAAATCTGGAGTTTGCCGTCAATCACTACGGCAAGCCGGACGTTGCCATGTTTGACTTTACTTCGATGTTTGCGGCAGAGTCCTCCTGTCGAGTGATCGTCCGAAAGAATTATCGCCTCATGCAGTGCCTGGTTGGTGATAGTCTGTTGGAACCGTTTTGGCCAACAGGGTCTGGCTGTGCGAGAGGTTTTCTCTCAAGCATGGATGCAGCTTACGCAATCAAATTATGGAGCAATCCTAGAAATAGTACACTTGGTGTCTTGGCTCAAAGGGAGAGTATTTACAGACTTCTGGCTCAAACGACTCCTGAGAACCTACAGCGTGATATAGGTTCCTATACAGTTGATCCAGCAACACGTTATCCAAACCTCAATCGCACTTCAGTGAACGTTTACCAGGTGAAGCATCTAATCGATACCGATGACAAGTCAATTCTGGAGCAAACGTTTATGGACACAAACGCTATCCAAGCCGCTCAAGCTGAAACGCCAGTACGCCGAAAAAGACGTACCGGCGATACGCTGCCTTTAAGTGCTGTGTTACTCAGGTGGATTAAGGCCCAATTGCATTCATACGAATTTACGCAGAACCTTAATGAAGTGTCCGATTGCTTCACAAATGGGAAGGTTCTTTGTGCCTTAATAAATAGGTACCGCCCTGATCTAGTCGATTTTAATTCAGTTAAGGATCTGCCTGCAGCAGAAAGCAATGAATTGGCCTTCAAAATCCTTGATAAAGAACTGAAGATATCAAGGGTTATGTCGGGTAAAGACTCGATAGGGCTGGATAAGGTAGACTCGAAAATCTGGTTAAATTACTTGGAGCAAATCTGTGAAGTGTTCCGTGGCGAGATCCCACATGTCAAACATCCAAAACTCGATGTATCCGAGTTGAGGGAGAAGAACCGAAACAATGCTCCAGATTTTTCCAAACTTCTGCAGATGTCATCGAAGAAGGCAAAGTCCCCCATGCAGGATGTTGCTGATGTGGCGCATGTGGTGCAAAGGCGTTCGGTCTTGGATGAAGAAAAGGTTAAACGTCAAAGAAAGCATGAAACTGGACAGGGTAAGACAGCTTTCATTGGAATTCTGCAAAAACATTTACCGAGTTATTCCCTAGCTCAAGCGGAAACGCCAAGAAGAGCGAAGAAAAGAAGAAGCGCTGAGAAAGCAACAAATATCGAGGAACGCCAGCAACGTCTGAAAGAAATTGAAGCGAACCGACAGGATAGACAAAGTAGACGCCGAATGCAACGCTTCCAACAGACGCAGAACTTCTACAAAAGTCTTCATATGTTGCAAGTGAACACCCTTCTACGAGAAAGCGATGAATCGAGTCCTTTTGAGGACTATTCGATATATATGTACCGCCAGCAAGCGCCGGTATTCAACGATCGTGTCAAAGAGCTTGAGCGAAAGCTTCTATATCCTGACCGGGAGAGAGGCATTCCATCGGCGATGCCACGCGGTGCTGACGAACAATTTAGCGATCGCATCAAATCGATGGAGCAACGAATTACATCACGTAACGCAGTTGGTACTGATAAGAAACCAAAAGATTTACTACGTGCCATTGGAAAGATTGATTCAAATGACTGGAATGTAAGGGAGATTGAGAAGAAAATTGAACAATCAAAGAAAACAGAAATCGGTGGTAAAGGACGCGAAAAAGTACCAAAGTGGAGTAAGGAGCAATTTGTGGCGAGACAATCGAAAATGTCCAAACCCACCCGACAGGATTCGGGCGAAGAGAAATTCAAAGAAATCGATCAGACTCTGAAAAATTTGGATAAGCAATTGAAGGAAGGTTCTGTGTTGGAAGTGGGTGAACGCGGTAAGAATAAGGTCGCTTCAATTGCGGGACAATTTGTGAAGAAAGAAGATACATCCGAAGAGAAAAATCCACCACCGATCGCAAAATCTAACTCAAAAGTAGCCTTAGCGTTTAAGAAGCAAGCTGCTTCAGAAAAATGTCACTTCTGCAAGCAAACGGTTTATCTTATGGAGAAGATTTCTACCGAGGGCCTTATACTTCATCGATCGTGCTTGAAATGTCATCATTGTCACACAAACCTGCGACTGGGTGGATATGCGTTTGATCGTGATGATCCCGAAGGTCGTTTCTACTGCACGCAGCACTTCAGACTACCAGCTAAAGCTATCCGTCCAGTTGTACGGAAGCCTGGACAAAGGAAATCAACTGCTCATGGTTCCCATGCGCCTGGTGGTGCTGATCCGAAAACTCCAGAGAAACCAAGAGGAATCGCTGAGAATGTGGTTCAGCTGGATCTCCTCGATCGCGGACAAACACCTGAACGCATTGAATTCGAAAACACTGATGCCATGTCAGACGGTGAACCATCCGAGGAGCACATCATTGACGAGAACGAGTGGTCAGGACGTAACTTCTTGCAACCAAGTGCCGATTCAGAATCGGAGCTGTCGTCAAGTTCTGATGAGTCAGACGATGATTCCGACTCCGATATGTTCGAGGAGGCGAATGGATCACCGATTGCTCAGCAAACATTGCAATTGGCCAATGAATGGATCGGCAAGCAACACTATTCGAATGTCAACAGCGATGACAGTGATGACGACTTTTATGATTCAAGTGAGGGTTTGATAGATGATGGAAAGGATGACACCGAAGGGGAGGAGATCAAGAAAGCACGTGAAATGCGTCGCGAAGAAGTACGTCTGCTGCCACTGCCGACGAATCTACCAACAGACACCGAAACTGAGGTGTTAGGGCCGCAGGATGGCGTTACTGCGCAAAAGATTGACGATGTAAGTAGTGAGAGAATTTCGTTGAAATCGCGCGCATCCGATCAAACTCCTACACCGTCAAAGACGGACATTGAGAAAATAGAACAAAGTTCAGACCGTAGGTTTAGCGCGGACGTTGATGCAATCAGTGAGAAACTGTACAGGTTGAATAATTTGATGAAAATGAACAAGGACATAGAGAATATGGCCAAGGAAAACCTAGTTAAGAGTGATATACTTAAAAAATTGTCGTTAAAGGAGAAGTGGTTGTTGGACAATAGTGTGGGTGGGCCGGTAAAGGGGCCAAATGCGGCATTTCTTACAGAGAAAGTTCAGGAAAAGCAGAACGAAAATGTGACTGAGCAAAAAACTGAAACAATAAATCCACCCAGTGAGGCATTGATGGAGATAACGGAAAATAAATCCGGAGTCTCAAATGGGGACGATGATCGGGGTGATGGTTTACAGTTGCAGCCAGTGCAACCAGCCGAAGCTAGCGAAGAGTATTCAAAGAAAGAGGAGCAGTCACTACCAAAAGACGATACGGTTTCAAAACTGAAATCAGACAAAATGGAAATCGAGGGCATTTTAGATATCGTCAAAATGATGGACACTTCGTACGTAGAGCCGTCGAAGGAGCCGTTGCCTTCACCAAAACCCAGTCCCAAAGATGTTGGCAAGTTGCTGTATGAAGCAGAGCACAAGAACGACAATCGTTTGGGAGATGCTTTGCATCAAATTGAAGTGAAGAATTTCGCTGGCACAATGGACACGATTAAGTCTCAAATTGTAACCCCAACAATTAGTCACCAAGGTTCGCTGGACATATCGAAATATTTTCCCAATACGAAGCAGGAAAAACAGTCGGCTGCGTCTTTGAATAAAACCCAGAAAACACTGAAGGATGTGGATTTGACCAAATACTTCCCGCAAAGTCCTGCACCTCAGAGGAAGGTGACTGTGACAACAGTTGCCGACAGGCTAAAAAAGTCACAGTCAGTAGACGAGCCACCCAAAGCCCCACCTAGGCGGCAAAAATCAGTGGCTTCTACCACTGACAAATCGTTTAGCTTACACGAGCATCTCTTGGACGGAGCCATAGATATTCAAAAGGCCCCTGTAGTGAAACCAGTAACGAAAAAGGGTGGCATCAAAATAGTCAAGAAGATTGTACCCAAAGGAACGAAAACGAAAAAAGTAATTGCCGTGGCAAAGAAAACTCAAGATGAAGCTGACAAAATTCTAGACGAGATCTTGCAGAGTGGACAAGAGGTGCGCTCGCCCAGTCTTGAATATCAGAAACTATTCACGGAGGAAAAGTCTCCAAGTGAAGACATTTCTGAAAAAATCGAAAAAATACTCGAGGAAACTGGCATCGATCTCGGCCTGCCACCAAAGCCGAAAAAGAAACTCGTGAAAACGAAGAGTTTGGGTGAAGAGGAGTTCAAACGAACACAGGACCGTGTTATGTCGAAGGAAGATGGTGACCGTCCAATGGGAGTGCAGAAAATTCTTCAGCGCTTCGAATCAATGAGTTCCATCAAATCGGATGAATCGTTCCGCTTAAAGAAATCAAATCTGAGCAGCACATGTAGTAGTCTTAACAAGTCAAAAGAGTCATTTCAATCAAAAGAGTCTTTGAAGTCGAATTCCGATTCGTTGAGTGATCTAGAAAAGACCATGGAATACCTTAAGTCTGAGTGGAAGACTGAGGCAACAAACTTCTTGCAAAAGAAACGAGACTCATTCTACACGAGCCTTAAGGAGAAGCAAACTGAAGCTGAGGCAGAAACTAAAAAGCAGACTCGCCCAGAACCGCAATGGAAGAACACAAAGTACTCAAAATTCTTTGGGGTCAAAGAGAAGAGTCCCGAAAAAAGAAAATCACCACCGAAGTTCAGTGCGAAGAGAAAATCACCGGCAAAGATTCTGAAAAAGTCCCCACGTGAGGAGATCATGAAGCATAGGAAAAGGGTAGAGTCGGTGAGCAGCAGGAGGGCATCGCTTGAAGAGAAAATCAAAGCCGAAGAAGCCAGGTTGAGGGCTGAGGATGTAACTAAACCAGGAAGTAGGAAAGCGTCTCTAGAGGATGGTGAGCAAGTGTCCCAAAGTGGACTCGCCAATGGTGTGACCAGCAGGAAGAGTTCAGTTGACTATCAAGGACAGGCAACTGAAGAAGTTAAGACAAAAGAGTGTCCAAGCAACGGACATTCCAGGATTCCTATTAGTCAGAATCGCAAAGAGGCCCTGAGCTCTGCGGAAGCCACCAGAAATGTCATAGATACGGCACAGAAGGTTTTGACAAGAAGAGGATCTGGTTCTGGTTCAAGTGGTCTTGCCGACATTTTGAAGCCACCAGTTGTGAAGGGCATAAGGGGTCCCCAACGAGACAAAACAAAAACCGATGAAAACGAATTAAGTGAAGTGATCACTAATGGATTGCAAACTGATGACATCTCAAAAACAGACAAGGCACCTAAAGCCAGTGAGACTGAGGCCATCGAAAAAAGTGGGAAAATACTGCAAATCACTGAGACTCAGAGTATTACCGAAAAAGATGATACACATGTTGAAGGTGTAGATAAAATCGATGATATCACAGAATTAAAAGGCTCCAAAGAAATAGCCGAACAACAAACTGCAGACGGGCACCACACTATTGTGTCACCAGAAAAGTCATCGTCCAGAAGGGAATCCCCACAACAGGCTAAGCAAAATGTGAGAACTTCACCAGTTAAAAATCTTATTGAATCTCAAAAAACGTCAAGTCGTAGATCATCTCAAGTTGAAGTTAAAGACACTAAACAAACAACTCCACAGTCACAAAGTCGAAGAGGATCTGCTGCGGCTTTGGAAAGTAGGCGGTCATCTATCATGTCTGAAACCGCAGACCGAACAATTACAATAGACGACGTTACCGATGTCGATGACATGTTCAATGAAATTCTAAATGAACTCGAAATCGATGACAAAATTCCGGCAAACGTTCTCCCACTTTCTGCGCCACAGGCAACCACAGAAACTGTTAAACCCATCGAGAAAGAGTATGAAACCGAACAAGTTGATGAAGAGTTTGAAAATATCGCGAAAAATTCCCGAAGCCCAAGTGCAACAAAACGCAGTCGGGCATCGTCAACAGACAGCATTGAACAGTTGTTTAGTCAATTCACCGATGATATGCTGGTGGACGTTGAATTCGATTCCAATGATGAGGTGGTGGCCATAACACCAAGAGCAATCATTGTGGATGAGAATGGGGAGGAAGAGGCGAAAAACATTTCTGTGGAGGAGGCCAAACTAAAGTTTGAACCAATGTTCCCTCGTCTAGCCTCGAAACGATTGGCAAGCAGTACATCTTCTTCGGTTAGCCTCAGTCCGGCAAGGGGTGCAAGTAAAGCGAAGAAGATTGTCCAGAAACTTGATCCGACGACTATGGCTCCATCTGTGCAGGATATGTTGCAGAAAATTATTCCCCGCGAAGAAAGTGAACCATTGGACAGGAAGCCAAGGACTTTCCCACGGCACTATTATGAATCCGATGAGGACATGTTGACTTCTTCAAGTTCTAGGATGCAAACTCCTGAATCTGTGACTAATTCTGAGAAAATTAAGGATAGGACGCAAACTCCTGAATATGCGAGTCAAGTTGGTAAATCTAAGTCTAGGACGCAAACACCGGAATCTGCAACTTCCTCTGATAAAACCGAGGGTAGGATGCAAATTTCTGATTCTGTGACTAATTCTGGCAAACCTATGGAAACGGTGGAAACCCCCGAATCTATGCTTAGTGCTGGTGAAACTGAAGAATCGAAAGGCAAACACAACCGCTTGGAGATTGTGCATGAGAAATCTAGTCCAGTTGAGAGCCCTGGACTTGAAAACAACAAAATAAGTAAAAGTGTGTCACAATCACCACGCGGTGTTTCAAATGATTCAATGGAGACGAAATCTGGCACAAGTACATCATTAAATTCTAGAACAGAAACCTTAAACACTGATTTTGAATACAAACAGCCAAGTGTTCAAAATGAAAGGAACTCCTTAATAGTCGAACCCGAGAAGGTAGAAACAAATATTCAAAGTAGAAACGACTCCCCGGTAAGCGGGCCTCCAGCATCCGCAAGGCAATCTATTCCAGTTGACAATGCAGACATCAATGAGCCTGTGTCATCACCAAGAAGAAGTTCTCTAGTTTCCATTGAGGAGGGAACACAGCTTCAGCAGGTAGGATCTCGAAGAAGTTCGAAAACGTCCCTTCAAGAAACTGGAAGTCCACGAAAGGAAGGAATTTATCATAAAGCATCGCCAGCCTTGCTACCCACACCACGAAGGGGTTCACGCAGTTCAGTGGAGTCATTATCGATTAAGACTGAAGTTACTGGTATTCACAGTACCTCTCCGCGGCGGAATTCAAGAACGTCTTCAACGCCTGACGAAGTTCCATCACCTGTAAGAATTTCCGAAATTCTATCCTCCCCCATTGCTTCAAAAGAAGTGTCGCCTGTGGCTTCAAAAAGAACTTCCTCTGTGTCGAAAGACTCATCACTTCTTGGTTCAAAGCAGAATACACCTCTTGTTTCAAAAGACTCATCACCCGTTGATGTAAAAGATTCTTCACCACACAACAAGGCAATTGAGAACCATTTGGAAGTCCTGTCGGGTCTCTCAAATAACTTAGAGAAACGTGAACAGAGTAGCAAATCTTCAACTCCATCCGATCGGGTGTTGAATAAGAGTACACCTGCAAAGCTCGAAGATGTTCCAAAATCAGCATCAATTGACAGAGATTCTAAAACTTCGCCTAAAGAAGTCAAAAAGGATAGTTCTCCCAAAATCGCGCCATCAAGAGTTTCACCAACTCTCGCCTCTCCAGCCAAACCTGCGTCACCTCGTGAAAGGTTCTTTGCTCAAAACCAGGTCAAGGAGAAGTCTCCCGTTACAAACGCAACACCTAGTCCAAATTCGACACGAAGTTCCAAAGAAAATACTGTTGAAATACAGGAACAGAATCGTGAACCAGCGATGGTATATAAGGACATTCTTCGAGCTAAGCTGGAATCACCGTCAGTCGACAGGAGGAGTGCAAGTCGATCAAAGGAAAATTCTCTGGAGTGGGATATGGAACAGCTTCCCAGAAGTCCGATGCCAAGACGAAAGTATGCAGACGTCCTCATACACCCTAGGGACAAAGAGGAGCCTTCAAAGCCATCGCATATTCCGTCGCCTTCTCCAGTTCCCAACAAAAGCGTACCAGAAAAGCAATTGCCTATCAAACGCCTCGAGGAGTGGGAGATTATTGAACAGCTCGAAAGAGAAAGGAGACTAGCCCTGATAAAGCGCGACGAGAACTACTCAAATCAGATTCAGGAAGCATCAGCGAATGCTTCTCCAGCCAAGAACTTTTCGCAACCAATTCCCCCGCTCAATAGACTGGAATCCTTAGACGAGCCACCAAGCTCAAGACTCATTGAATATATCAAACAGCAAACATTGGGAGAATCCCACAGGCATCAAGATGAAGTTCCAAAGCCGCCCCCACGGCCACGAACTCCAGGTAGTCCCACAAGAGCGCCAAGCAAATCATCGCACGAAGACTTCGGCAAACAGGTTGAAGCGATTCTCAAAAGTGAAGAGAAGAAATCTTCGCCTGTTACAAGTTCTAGACAGGCACAACTCCATAAAACTGAATCTACAGAAGCAACCTATGAAGACTTTGCACGCTTCACGCGAAGATCGGTCGACACCGAGTCGAAACAGGAGGAACCAGACCTGAATGAACCAGTGTCATCGGGCTCCCAGCAACGCTTGTCTAGTTACAGACAGAGATCGCCGGAGTCCGCCACGGGCACATACAGTCCAACCAGCAGTCTCTCAAGATCAACCGAGCATTTGGCTCCAACCCGTCCACAACGTAGCAAACCCTTATCCTCCGTGGCAGTGCCTGAAAACCGAGTGGACAGGGAAACGAGATTCCTGATAGATCGCAGTAAGCACCTGCGCAACATGAAGCGTGACTTCATGGAGGAGAAAATCGCAGGCAACAACCCCTATCTCAAAAAGGTGATCGAAGCCCAAAAGTTTTCTCGAGACTCAGAGGACGAAGACGAGGTTGACTATGCCCTAGAAAGCTATCGCCCGAAGAACCACACCACGCCTCGATTCCCCACGACAAGTTCAATTAGCGGCAGAAGACGTCGTGAAACACCTCGATATCCAGACTATACTTCTACGATATCAGACTACAGTCGGCGACCGTACAGCCCGCACTATTCAACCACCTCCGCCTCCAGTTCAGCAGGATCTCGAAACATTGTCGACTACTTCAAGCGCAGTCCACCGCACTACAAACCTCGCGAATCTAGCAAAGACTCGTGCGTTCAAAGTGATTCGGAGTCCTCATCGGCCGATGGTGTTGAACTCAATTCCGCCACAGAAATTTCAACCGATTCAGAATTTGATCACGACGAAATCATCCGCGAAGCACCAACAATTTTCATCGACGAAACCTACTTAAGGAAACCGACAAAGGTTCAGATCAAACAAACTGTTATTGCGCCAACCACTGGACTGCCAAAGTATCACATTAGCAATCGCATGACCCACAATCAGCAGCCAAGTAAGCGTGAGCCACAGTTCAAACCTCTAGTCCAAGTTGATCCATCACTGCTCAGTTCAAATCGTATGCCTCTTCAGAATCCCCGCGCAGGCGATTATTTGCTCAATAAAACTGCCAGCACGGAGGGAATTGCTTCAAAGAAGAGTCTGGAACTCAAGAAACGGTATCTGCTCGGCGAGCAAGGAAACGGGAATAAGATACAAAAGTCTGGCTCCACATCGATTTTGGATTCGAAGATCAGAAGTTTCCACTCCAATATATCGGAGTGTCAGAAGCTTTTGAATCCCAGTAGCGAAATAAGTCCGAGCATGAAAACGTTCCTAGATCGCACCAAACTGGGTGAAGGTCAAATCGACGGCTTGGCCAAGACTGTTGGTGACAAGAAGGTCGAGATCACAATCAAGAAAAGCACAACGGACGAGAAAGAGAATGTGTTTGTGAATAGCAAGAACGAGCTGAACAAGGGTATGGAGTACTCTGAGACGGTGAATACGACTTTGGTCGAGACAATGAGCAAGAAGAAACCTTCGCCCATCATTGAAACGATTGATTTGATTTCACCCGAGAAAAAAGTTCCTATTATAGATCTTACCGAAGTGGATATTCCACCGAAGATGCTAGAGTCCAATGTAGAATTTATCAAGAACCTGAACTTCTGCACAGAAGAGGATAACACGAAAGAAAGTAGTTTATTGCCTGACAATAAGATAATTGACCTGACATTGGACTCGCCTGTGAAGAGTGATAAGACCGTGAAGGAGGAGATCGTAGAGAAACCGGACGTTTCCAAGGACGCTAAGGAAGCAATCCCAGACATAATAACGCACATTGAAGATGTAAAGTTGGAACGGCAACCGTTGCTTGACAACGAGTCGGAGAAAAAAGACTCACCAGAGAAAACGGAGTATGACATTCAGGAAACGAGTATTCAGGTGCCTAACATCCCGTGGGTCGCGAAAAAATGCGTCACAGAGTCAGACAGCCTTTCCAGCTCCTCATCGTCGAGTGTGGAGGATATTCAGCATTTTATTTTAGACTCAACCACTAGTCCTGATACACAAACCGGTCAAATAGTGCCTCGATTGGAAGTGCACGACAGTTCGGGTACTCTTATGCAGATCGATAGCCTTATGATAATTGATGGGAAGTATATTGGCGATCCAGAAGATCTCAAGAACATGGAAATTCCTCCAGGAGTAATTATTCCTGAAGTCCCAACACAATCCCCATCGCCTGAAGACCCCATAACTGAGAAGCGGAAGGAGGCGACTCCACCTGCCTCACCTCAACAGAAACCGGACCTAAAATTCGACACTAAGAATGAGAATAAAATCGATACCTTAAAGAATATTCCGTTGATTCTTGAAAAGTCCGACGACCACTCTAAGGTTGTAAAGCCTATCAGCTTGAATATTGCAGACACTGAGACGAAACAAGAAGTGGATAACGACAAGACTCCTACAGCTGAGGTTATTACAAGAAGTTCGGACTCAGATGCAGAGATCACAAACCAAGTCCTCACAGAAACGGAACTGTCCGACTGGACTGCAGATGATGCGATTTCAGAAAATTTCATTGACTTAGAGTTTGTTCTAAACTCAAACAAAGGTACGATAAAGCGCAATAAGAAAACCAAGAAGAAGGCTCCGCTTCAATCGGCACTATCTATTACCCAAGAGCCGGAGCCGGAACCTGAACAAGAAGCGCAACCTGAACCTTGTGGAATACTGAAAAATCTTGATATTGAAGAAATCGAATTCATGGACACTGGTTCCGAGGGAAGTTGTGCCGAGGCGTATTCAGCAACCAACACGGCACTTTTGAGAAATCGCGGATATGTCGATTACACAGGATCTCAAAGAAATGGTCACAATCAGCAACGAATGCTAGCCGAACAGGAGCTCAAGACTCCAGTTAATGAGAGTCCTCCACTGCCAAACACAGTATTGCTATCACAAAAGTCACAATCACAATCCAGTCCACCGAAGCACGGTTCTCCGTCCCAAACGCAGCATGTGCAGAGCCTTTCACAATCACTTACAGACAGCACAAACGATATTGACGAAGACAGTCTTTGTATGCTAACAAACTCTCAGGGTGCTACGGCAAACAATACTGCAACAATAACTACAACCACCACCACAACAACTGAAGAAAGCGAAGCTCTCACTGTCGTCACGAGTCCGTTAGACTCGTCATCACCCAAAACTCAAGAGAACTACCAAACAACTGGCAGCAGTTCGGAACAAAATGGGAAAACAACATCGCCGAGCAGCAATACAACAACTGCCACTCCTGCCACAAGCTCCTCCAGCAAGGGCCCAACCAGGAAGTCATCGCATGAAGATCTGCTCTCAAACGGCAACACAAAACGCAAAGAATCCGAAGAGTTGAGCTACGAGGAATATGTGCGAAAACTGCAGCAAAAAATTACGCAGATCAGCAATGCACGTGACTCGATCGATGTGCGAAAGCAGAACCGACGAAAGAGTTCGAAAGGTGAAGCAAGTGAAAATTCTGTGATAGAGGGAAGCGCTAAGGGTCTTAGCGTGTTCGAAAGTGGCTCCACAGTCAGTGTTCAGCCAAAAGTAACGTTGTCAAGCATTAATGTGGCGGCGAATCCATTACAAGAAGTTCATAAGGTACCTGAAATACCAACATTAACGCGTAAGTTGGAGGAGTTAACCAAAGAGCGAACCAAGCAGAAGGATCTGATTCACGATCTGGTAATGGACAAGTTGCAGAGCAAGAAGCAGCTCAACGCCGAAAAGCGACTAAATCGTAGTCGCAATCGAAGCATGTTTGGTAGCAGTTTAGCCGGCAGTAGTCTCAGTCCAACACCGAAGGTGATGTCAGCAAACTCGTCTCCCTACCATGTGACATCGTCTTCGGGAGGTGATCATAGCGGAAGCAGTTGTAAAGAGAACAATCCGTTGGTAGGAGCGAAAACCAAAAACTCTCCAAAGGACCCAAAGGATGTGGTTCAACTATTCACCGAGGCACCGCCGCGTTCCGCCAAGCAAAGACCTTTCAGCGAGAACTTTGATCCCCAGAACATCGATGAGCAAATGTGCAAACTGAGCAAGACACAATCGTTTTCATACTCAAGCCAACGCAATTTTCCCTTCAAACATGACGCAAACATCAATCAACCTCTAACTACGGCGTATGCTACACCCTTGGCGCCGCAAAGAGTAAGCCGCCGAGATGAGCCCATTCTTAGCACAGTCACTGCGGACAAACTGCGCACGGAAGCAAGAGCCAGAGCTCGCTTGAAGTCAAATCACGATCTCGGGCTTAGTCCAGAGGAAAAATTGCAACTGCTGCGCAAGCGCTATCACCTTGATATGCACGAAGCCACCGCAGAAGCCGGTCAAAAGTCTGAGGAAATACGTGCACGCGACCGCAAAATGGTTTCTTCCAAGAGTGTCAATGATATTGCAACAGTTCAACTAATGACTCTGCAGCCAGATGAAAATTCAAACACAACAGCCGTCCGTAGTGATTTGGTCGCTGATTTCACATCAGATCCCAACCTCTCGCAGGTCAATACGCCGTCAATTGCACTGAGCAAAGTCAATCGTCGCCACAAAGATCCAGAGCGACGCAAGAGTATTATCCAAACGTTTTCGAGCTTCTTCCAAAAGGGTAAAAAGGAAAGGGATGTTGGACCGGTCTCCGGAACGGAGAGATCAGGAACCAGCAGCAGTAATGTTGCCGCCACCAATAATGGCGGCGTTGGCGATGGCATGTTCAGTCGCTTCCGAATTTCGCCGAAATCCAAAGAGAAGTCTAAGTCATGCTTTGATGTGAGGAATTTCGGGTTTGGTGACAAGGACAATGGATCATCGCCCAGCACTCCTGCTGCTACCACACCGCAACATCAACACCGACACCAACACGATCAGCGAAGCAATTCACAAGACTGCATTGCCAAGCAAAAGTATCAAATGTCATCGTACGCATCCAGTCCGCAGCTTTTTAAGAACACGGAGGAGCAAATTCCGCCTCCAATTCCACCATTACCATTAAATTATCAGAGATCCGATGACGAGAGCTATGCCACCGAATCTAGAGAACAGAAGAAGCAGCGTGCAATATCAAAGGCTACACGACAAGCTGAGCTCAAGCGATTAAGGATTGCACAGGAGATACAGCGGGAGCAAGAGGAAATCGAAGTTCAATTAAAGGAATTAGAAGCGAGGGGTGTACTCATTGAGAAAGCATTAAGGGGAGAGGGGCAAACTATCGATAATTTAGAGGCATCTAACATGGGTGCAACTGATGAAAAACTGTTAAAGGAGCTCTTAGAGATTTGGCGAAACATAACACAACTTAAAAAACGGGATGAAGAACTTGGCATAAGACAACAAGAACTGCAATTAGAACACAGACATGCACAACTCAAGGAGGAACTGAATATTCGACTGGCCGGAAGTAAATTAGAAAAGAGTTCGGCTGACGTAGCTGCTGAAGGTGCAATACTCAACGAAATGCTTGAAATTGTTGCTAAACGAGCCGCACTACGACCCACAACATCAGTTACATCACCAATTGCGACTGGAGCCCTGAACCCTCTGGGAGACAGTTCAAATGATGGCATTACATTCAGTGGCCGGCGATCAAGTCAAATAAATGAATCAGACTTGACGTCATTTAAAACAACCAGTATGAAACGAGCATCAAAATCACCCGCCAACAGTGGCGCCTTATTGCTCAACAAAAGCAAATCTCCAGCTACACCAAAAGTCGCAAAAAAGGACACAATTGACCTTGAAGTCGCTGAAACTCCTTCAAAGAAAACAAATCATTATTTACTAACTGAAATCAAGGGCAACGAAAATGAGAACGAGATCTCATTTGTCCTCACTGGCAGCACAGAGGGTGAACACAAACCACCAGCCGAATGCCGGGAGGAGATACAAATTGAAGAGCCTTGTGAGGAGATGGAAGAAGACGACGATGATGATAAGATGAATACCGAAGAAATCGTTGAGCACGTATGTGGCAAATGTTACAAATCGTTTCGGACAATCACGGTGCTTAAGCGACACATCAAATATTGCCTCTTCAACAGTGAAATGGAGTCGCGGAAAGAAAAAATGCTGAAAAACATCAATAAGATCGAGAAGGAGGCGGTTATTATGGAGAAAAAAGATTTATGCTTCTGCTGTGGAGAAAGCTACGACACATGTCATCTAGGTCATGTTAATTGCCCCGAATGTCCCAAATCATTCAAGTCACAGCTTATACTGGAACGCCACACGTTCTTAATTCATTCGGAAAACCAGGAATTCCCTTGCACCATTTGCAATGGTTTTATTCGAACGGAGCAGTTGCTGAAACTGCATATAGAACAGCACAAAAACCGGGGCAAACCGTTTGCGTGCAAACGCTGCGGGAAAGACTTTACACGGCAATATCACTTGAGGAGGCATCTGCTCTACAGTAGCTGTGGTGATGGACAAGCTGAAACGATGAAATGCAAAGTATGTTCCAAAGAATTCTATCGTTTGGATAACTTGCGTGCACATCTAAAGTCGCATTTAGGTCATGGGACGACGAAAAGGTCTGAGTACACATGTCCATACTGCAAAAAGTGCTTTTATAGTTTGTCAACACTTAATCAAAATTCTGTCGTTGAAGAGGAAGAGGAAATAGAAGAAAAACAGGAAACCCCCGAAGAGATCAAAACGGAAGAAGACGAATGCGTTGTTATTGGAGAGTTCATCATTGAATCGTTGCCGGCCGACACTGCTACTCTGTTGGAGAAATCTGGGCAAAATTTCGAAGTGATTGCAGAAGATAAAGAAGAGATTCCAACGATTAAAATTTCCAAAAGTCCAACGACATCAACTGTGTCTTCGAATTTGAAGGTTGAACTAGCAGATCTCAAGAAGCCGATAACGCAGCAGCAGGAGGTCACATCTAAGCCAGTTGTGGACAAGCCCATGAGTCCCGAAGACATTGAAATTACTGAAAATGTGAACAAACTACTTGACATTTTGGTTGACCTGGAGACACTAGTGAAATTCGGCTGGTTAAACGATAATGTTGAAAATGTCCTTTGCAAGGTGATCGAGAGCTGCGGGTATAACCTCTACAAAACCAAGTTTACCGAGGATTATGGTACTCGTATGAGAGAGTACGTTAAGCTCTTGTTCACTGTGGTAATTCAAAATGATTCCATCAAGGAGCTACTGAACAATTACTCGATTGATGAAGTTATTGACTTTGTCCTCTCCAATGATGAGGCAGACTAA
Protein Sequence
MSRSSHQRQQQQLLAQQQQQMADNDAAAAAAEMFDMFCLATTMRQILGLHRNMCDTVGLRPAPLNEFYPKLKAKIRSWKAQALWKKFDARASHRAYSKGNACTGTRVLVIGAGPCGLRTAIEAQLLGAKVVVLEKRDRISRNNVLHLWPFVITDLRNLGAKKFYGKFCAGSIDHISIRQLQCILLKVALLLGVEVHEGVSFESTIEPNEGCGWRAAITPEDHAVSHYEFDVLIGADGKRNTLEAFKRKEFRGKLAIAITANFINKKTEAEAKAEEISGVAFIFNQTFFKELYHTTGIDLENIVYYKDETHYFVMTAKKHSLIDKGVILQDFADPAELLASQNVNTEKLLDYAREAAEFSTKYQMPNLEFAVNHYGKPDVAMFDFTSMFAAESSCRVIVRKNYRLMQCLVGDSLLEPFWPTGSGCARGFLSSMDAAYAIKLWSNPRNSTLGVLAQRESIYRLLAQTTPENLQRDIGSYTVDPATRYPNLNRTSVNVYQVKHLIDTDDKSILEQTFMDTNAIQAAQAETPVRRKRRTGDTLPLSAVLLRWIKAQLHSYEFTQNLNEVSDCFTNGKVLCALINRYRPDLVDFNSVKDLPAAESNELAFKILDKELKISRVMSGKDSIGLDKVDSKIWLNYLEQICEVFRGEIPHVKHPKLDVSELREKNRNNAPDFSKLLQMSSKKAKSPMQDVADVAHVVQRRSVLDEEKVKRQRKHETGQGKTAFIGILQKHLPSYSLAQAETPRRAKKRRSAEKATNIEERQQRLKEIEANRQDRQSRRRMQRFQQTQNFYKSLHMLQVNTLLRESDESSPFEDYSIYMYRQQAPVFNDRVKELERKLLYPDRERGIPSAMPRGADEQFSDRIKSMEQRITSRNAVGTDKKPKDLLRAIGKIDSNDWNVREIEKKIEQSKKTEIGGKGREKVPKWSKEQFVARQSKMSKPTRQDSGEEKFKEIDQTLKNLDKQLKEGSVLEVGERGKNKVASIAGQFVKKEDTSEEKNPPPIAKSNSKVALAFKKQAASEKCHFCKQTVYLMEKISTEGLILHRSCLKCHHCHTNLRLGGYAFDRDDPEGRFYCTQHFRLPAKAIRPVVRKPGQRKSTAHGSHAPGGADPKTPEKPRGIAENVVQLDLLDRGQTPERIEFENTDAMSDGEPSEEHIIDENEWSGRNFLQPSADSESELSSSSDESDDDSDSDMFEEANGSPIAQQTLQLANEWIGKQHYSNVNSDDSDDDFYDSSEGLIDDGKDDTEGEEIKKAREMRREEVRLLPLPTNLPTDTETEVLGPQDGVTAQKIDDVSSERISLKSRASDQTPTPSKTDIEKIEQSSDRRFSADVDAISEKLYRLNNLMKMNKDIENMAKENLVKSDILKKLSLKEKWLLDNSVGGPVKGPNAAFLTEKVQEKQNENVTEQKTETINPPSEALMEITENKSGVSNGDDDRGDGLQLQPVQPAEASEEYSKKEEQSLPKDDTVSKLKSDKMEIEGILDIVKMMDTSYVEPSKEPLPSPKPSPKDVGKLLYEAEHKNDNRLGDALHQIEVKNFAGTMDTIKSQIVTPTISHQGSLDISKYFPNTKQEKQSAASLNKTQKTLKDVDLTKYFPQSPAPQRKVTVTTVADRLKKSQSVDEPPKAPPRRQKSVASTTDKSFSLHEHLLDGAIDIQKAPVVKPVTKKGGIKIVKKIVPKGTKTKKVIAVAKKTQDEADKILDEILQSGQEVRSPSLEYQKLFTEEKSPSEDISEKIEKILEETGIDLGLPPKPKKKLVKTKSLGEEEFKRTQDRVMSKEDGDRPMGVQKILQRFESMSSIKSDESFRLKKSNLSSTCSSLNKSKESFQSKESLKSNSDSLSDLEKTMEYLKSEWKTEATNFLQKKRDSFYTSLKEKQTEAEAETKKQTRPEPQWKNTKYSKFFGVKEKSPEKRKSPPKFSAKRKSPAKILKKSPREEIMKHRKRVESVSSRRASLEEKIKAEEARLRAEDVTKPGSRKASLEDGEQVSQSGLANGVTSRKSSVDYQGQATEEVKTKECPSNGHSRIPISQNRKEALSSAEATRNVIDTAQKVLTRRGSGSGSSGLADILKPPVVKGIRGPQRDKTKTDENELSEVITNGLQTDDISKTDKAPKASETEAIEKSGKILQITETQSITEKDDTHVEGVDKIDDITELKGSKEIAEQQTADGHHTIVSPEKSSSRRESPQQAKQNVRTSPVKNLIESQKTSSRRSSQVEVKDTKQTTPQSQSRRGSAAALESRRSSIMSETADRTITIDDVTDVDDMFNEILNELEIDDKIPANVLPLSAPQATTETVKPIEKEYETEQVDEEFENIAKNSRSPSATKRSRASSTDSIEQLFSQFTDDMLVDVEFDSNDEVVAITPRAIIVDENGEEEAKNISVEEAKLKFEPMFPRLASKRLASSTSSSVSLSPARGASKAKKIVQKLDPTTMAPSVQDMLQKIIPREESEPLDRKPRTFPRHYYESDEDMLTSSSSRMQTPESVTNSEKIKDRTQTPEYASQVGKSKSRTQTPESATSSDKTEGRMQISDSVTNSGKPMETVETPESMLSAGETEESKGKHNRLEIVHEKSSPVESPGLENNKISKSVSQSPRGVSNDSMETKSGTSTSLNSRTETLNTDFEYKQPSVQNERNSLIVEPEKVETNIQSRNDSPVSGPPASARQSIPVDNADINEPVSSPRRSSLVSIEEGTQLQQVGSRRSSKTSLQETGSPRKEGIYHKASPALLPTPRRGSRSSVESLSIKTEVTGIHSTSPRRNSRTSSTPDEVPSPVRISEILSSPIASKEVSPVASKRTSSVSKDSSLLGSKQNTPLVSKDSSPVDVKDSSPHNKAIENHLEVLSGLSNNLEKREQSSKSSTPSDRVLNKSTPAKLEDVPKSASIDRDSKTSPKEVKKDSSPKIAPSRVSPTLASPAKPASPRERFFAQNQVKEKSPVTNATPSPNSTRSSKENTVEIQEQNREPAMVYKDILRAKLESPSVDRRSASRSKENSLEWDMEQLPRSPMPRRKYADVLIHPRDKEEPSKPSHIPSPSPVPNKSVPEKQLPIKRLEEWEIIEQLERERRLALIKRDENYSNQIQEASANASPAKNFSQPIPPLNRLESLDEPPSSRLIEYIKQQTLGESHRHQDEVPKPPPRPRTPGSPTRAPSKSSHEDFGKQVEAILKSEEKKSSPVTSSRQAQLHKTESTEATYEDFARFTRRSVDTESKQEEPDLNEPVSSGSQQRLSSYRQRSPESATGTYSPTSSLSRSTEHLAPTRPQRSKPLSSVAVPENRVDRETRFLIDRSKHLRNMKRDFMEEKIAGNNPYLKKVIEAQKFSRDSEDEDEVDYALESYRPKNHTTPRFPTTSSISGRRRRETPRYPDYTSTISDYSRRPYSPHYSTTSASSSAGSRNIVDYFKRSPPHYKPRESSKDSCVQSDSESSSADGVELNSATEISTDSEFDHDEIIREAPTIFIDETYLRKPTKVQIKQTVIAPTTGLPKYHISNRMTHNQQPSKREPQFKPLVQVDPSLLSSNRMPLQNPRAGDYLLNKTASTEGIASKKSLELKKRYLLGEQGNGNKIQKSGSTSILDSKIRSFHSNISECQKLLNPSSEISPSMKTFLDRTKLGEGQIDGLAKTVGDKKVEITIKKSTTDEKENVFVNSKNELNKGMEYSETVNTTLVETMSKKKPSPIIETIDLISPEKKVPIIDLTEVDIPPKMLESNVEFIKNLNFCTEEDNTKESSLLPDNKIIDLTLDSPVKSDKTVKEEIVEKPDVSKDAKEAIPDIITHIEDVKLERQPLLDNESEKKDSPEKTEYDIQETSIQVPNIPWVAKKCVTESDSLSSSSSSSVEDIQHFILDSTTSPDTQTGQIVPRLEVHDSSGTLMQIDSLMIIDGKYIGDPEDLKNMEIPPGVIIPEVPTQSPSPEDPITEKRKEATPPASPQQKPDLKFDTKNENKIDTLKNIPLILEKSDDHSKVVKPISLNIADTETKQEVDNDKTPTAEVITRSSDSDAEITNQVLTETELSDWTADDAISENFIDLEFVLNSNKGTIKRNKKTKKKAPLQSALSITQEPEPEPEQEAQPEPCGILKNLDIEEIEFMDTGSEGSCAEAYSATNTALLRNRGYVDYTGSQRNGHNQQRMLAEQELKTPVNESPPLPNTVLLSQKSQSQSSPPKHGSPSQTQHVQSLSQSLTDSTNDIDEDSLCMLTNSQGATANNTATITTTTTTTTEESEALTVVTSPLDSSSPKTQENYQTTGSSSEQNGKTTSPSSNTTTATPATSSSSKGPTRKSSHEDLLSNGNTKRKESEELSYEEYVRKLQQKITQISNARDSIDVRKQNRRKSSKGEASENSVIEGSAKGLSVFESGSTVSVQPKVTLSSINVAANPLQEVHKVPEIPTLTRKLEELTKERTKQKDLIHDLVMDKLQSKKQLNAEKRLNRSRNRSMFGSSLAGSSLSPTPKVMSANSSPYHVTSSSGGDHSGSSCKENNPLVGAKTKNSPKDPKDVVQLFTEAPPRSAKQRPFSENFDPQNIDEQMCKLSKTQSFSYSSQRNFPFKHDANINQPLTTAYATPLAPQRVSRRDEPILSTVTADKLRTEARARARLKSNHDLGLSPEEKLQLLRKRYHLDMHEATAEAGQKSEEIRARDRKMVSSKSVNDIATVQLMTLQPDENSNTTAVRSDLVADFTSDPNLSQVNTPSIALSKVNRRHKDPERRKSIIQTFSSFFQKGKKERDVGPVSGTERSGTSSSNVAATNNGGVGDGMFSRFRISPKSKEKSKSCFDVRNFGFGDKDNGSSPSTPAATTPQHQHRHQHDQRSNSQDCIAKQKYQMSSYASSPQLFKNTEEQIPPPIPPLPLNYQRSDDESYATESREQKKQRAISKATRQAELKRLRIAQEIQREQEEIEVQLKELEARGVLIEKALRGEGQTIDNLEASNMGATDEKLLKELLEIWRNITQLKKRDEELGIRQQELQLEHRHAQLKEELNIRLAGSKLEKSSADVAAEGAILNEMLEIVAKRAALRPTTSVTSPIATGALNPLGDSSNDGITFSGRRSSQINESDLTSFKTTSMKRASKSPANSGALLLNKSKSPATPKVAKKDTIDLEVAETPSKKTNHYLLTEIKGNENENEISFVLTGSTEGEHKPPAECREEIQIEEPCEEMEEDDDDDKMNTEEIVEHVCGKCYKSFRTITVLKRHIKYCLFNSEMESRKEKMLKNINKIEKEAVIMEKKDLCFCCGESYDTCHLGHVNCPECPKSFKSQLILERHTFLIHSENQEFPCTICNGFIRTEQLLKLHIEQHKNRGKPFACKRCGKDFTRQYHLRRHLLYSSCGDGQAETMKCKVCSKEFYRLDNLRAHLKSHLGHGTTKRSEYTCPYCKKCFYSLSTLNQNSVVEEEEEIEEKQETPEEIKTEEDECVVIGEFIIESLPADTATLLEKSGQNFEVIAEDKEEIPTIKISKSPTTSTVSSNLKVELADLKKPITQQQEVTSKPVVDKPMSPEDIEITENVNKLLDILVDLETLVKFGWLNDNVENVLCKVIESCGYNLYKTKFTEDYGTRMREYVKLLFTVVIQNDSIKELLNNYSIDEVIDFVLSNDEAD

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-