Basic Information

Gene Symbol
hoxb3
Assembly
GCA_949699805.1
Location
OX453048.1:8612264-8637782[-]

Transcription Factor Domain

TF Family
Homeobox
Domain
Homeobox
PFAM
PF00046
TF Group
Helix-turn-helix
Description
This entry represents the homeodomain (HD), a protein domain of approximately 60 residues that usually binds DNA. It is encoded by the homeobox sequence [7, 6, 8], which was first identified in a number of Drosophila homeotic and segmentation proteins, but is now known to be well-conserved in many other animals, including vertebrates [1, 2], as well as plants [4], fungi [5] and some species of lower eukaryotes. Many members of this group are transcriptional regulators, some of which operate differential genetic programs along the anterior-posterior axis of animal bodies [3]. This domain folds into a globular structure with three α-helices connected by two short loops that harbour a hydrophobic core. The second and third form a helix-turn-helix (HTH) motif, which make intimate contacts with the DNA: while the first helix of this motif helps to stabilise the structure, the second helix binds to DNA via a number of hydrogen bonds and hydrophobic interactions, which occur between specific side chains and the exposed bases and thymine methyl groups within the major groove of the DNA. One particularity of the HTH motif in some of these proteins arises from the stereo-chemical requirement for glycine in the turn which is needed to avoid steric interference of the β-carbon with the main chain: for cro and repressor proteins the glycine appears to be mandatory, while for many of the homeotic and other DNA-binding proteins the requirement is relaxed.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 24 1.2e-17 3.4e-15 56.4 4.7 1 57 28 84 28 84 0.99
2 24 1.6e-09 4.5e-07 30.4 0.4 22 57 127 162 124 162 0.94
3 24 6.7e-10 1.9e-07 31.6 0.7 1 45 255 299 255 300 0.97
4 24 2.7e-10 7.7e-08 32.8 0.6 1 45 355 399 355 400 0.97
5 24 3.9e-10 1.1e-07 32.3 0.5 1 45 489 533 489 534 0.97
6 24 6.7e-10 1.9e-07 31.6 0.7 1 45 623 667 623 668 0.97
7 24 2.7e-10 7.7e-08 32.8 0.6 1 45 723 767 723 768 0.97
8 24 1e-10 2.9e-08 34.2 0.7 1 45 823 867 823 868 0.97
9 24 3.9e-10 1.1e-07 32.3 0.5 1 45 957 1001 957 1002 0.97
10 24 6.7e-10 1.9e-07 31.6 0.7 1 45 1091 1135 1091 1136 0.97
11 24 2.7e-10 7.7e-08 32.8 0.6 1 45 1191 1235 1191 1236 0.97
12 24 1e-10 2.9e-08 34.2 0.7 1 45 1291 1335 1291 1336 0.97
13 24 2.7e-10 7.7e-08 32.8 0.6 1 45 1391 1435 1391 1436 0.97
14 24 1e-10 2.9e-08 34.2 0.7 1 45 1491 1535 1491 1536 0.97
15 24 1.2e-17 3.4e-15 56.4 4.7 1 57 1591 1647 1591 1647 0.99
16 24 1.6e-09 4.5e-07 30.4 0.4 22 57 1690 1725 1687 1725 0.94
17 24 7e-10 2e-07 31.5 0.3 1 45 1818 1862 1818 1863 0.97
18 24 6.7e-10 1.9e-07 31.6 0.7 1 45 1952 1996 1952 1997 0.97
19 24 2.7e-10 7.7e-08 32.8 0.6 1 45 2052 2096 2052 2097 0.97
20 24 1e-10 2.9e-08 34.2 0.7 1 45 2152 2196 2152 2197 0.97
21 24 4.5e-10 1.3e-07 32.1 0.5 1 45 2252 2296 2252 2297 0.97
22 24 9.7e-11 2.7e-08 34.3 0.9 1 45 2352 2396 2352 2397 0.97
23 24 1.2e-18 3.3e-16 59.6 1.7 2 57 2426 2481 2425 2481 0.98
24 24 1.3e-09 3.5e-07 30.7 0.9 23 57 2525 2559 2520 2559 0.92

Sequence Information

Coding Sequence
ATGATGTGTCCgtcgatatacgtgcatggcgtgTACCAGCCCGTCTCTGAAATTCCGAGTACTTCCGAGCAATCGAATGGTAAGCGGGCTAGGACCACGTATTCCAGTGCTCAGCTCGTCGAGCTTGAAAAGGAATTCGAACGAGGACGATATATGAGTCATCCTCGGAGGATGGAAAGGGCTGCGACGCTGAACCTGCCGGAACGCCAGATTAAAGTGCGGTTCCAAAACCGTCGGAGTAAGTTCAAAAAACAACGAAATCAGAATGATGCGAGGAGTCCGACATTGATGACCGTACCGGAAACTCCATCGGTTTCGACTGAGCAGAATATGCCGGCCTCTTGGTCGGCTCAACCTGTGAAAGTCTCTCAAACATTCAGACAATATCTTAGCGGTCCTCGAAAGAATAAAATTGCTGCGGCGCTGAATGTTACCGAGCGACAGATAAATATCTGGTTCCAACACCATCGTATGGTGTACAAAAAACCACGGAGACAAAGCACTGCGAGGAGTTTGATGTCAATGAATTCGGCAGCAACTTTATTGGTTCCATCTGAGAGGAATACTCAAGCTTCTTGGTCAGTTCAGCCCGttgatatatttgtttataatttcaagatGGATATGAAAAATGCCGATTCCTTCAATGCCAGGTTCCAGCAAGGGGAAGTCGGCTACTCCTGGGGCCAGCCCAATGTTCATGCTCGAAAATTCACCGTCTCGAACGCTAAAATTCCGAGTACTTCCGAGCAATCGAATGGTAAGCGGGCTAGGACCACGTATTCCAGTGCTCAGCTCGTCGAGCTTGAAGAGGAATTCGAACGAGGACGATATATGAGTCATCCTCGAAAGATGGAAAGGGCTGCGACGCTGAACCTGCCGGAACGCCAATTAAAGTGCGGTTCCAAAACCGTCGGAatGGATATGAAAAATGCCGATTCCTTCAATGCCAGGTTCCAGCAAGGGGAAGTCGGCTACTCCCGGGGCCAGCCCAATGTTCATGCTCGAAAATTCACCGTCTCGAACGCTAAAATTCCGAGTACTTCCGAGCAATCGAATGGTAAGCGGGCTAGGACCACGTATTCCAGTACTCAGCTCGTCGAGCTTGAAAAGGAATTCGAACGAGGACGATATATGAGTCATCCTCGAATGATGGAAAGGGCTGCGACGCTGAACCTGCCGGAACGCCAATTAAAGTGCGGTTCGAAAACCGTCGGAatGGATATGAAAAATGCCGATTCCTTCAATGCCAGGATCCAGCAAGGGGAAGTCAGCTACTCCCGGGGCCAGCCCAATGTTCATGCTCGAAAATTCACCGTCTCGAACGAGAAACACAACTTTCTTCTCTATCCCGAACTTCTGCCCCGAAGGGTAGGCCTGAAACTGAGCGTAACACCTATTCAATTATTTGGCTTAGCCTGGATCAAGTCTAAAATTCCGAGTACTTCCGAGCAATCGAATGGTAAGCGGGCTAGGACCACGTATTCCAGTGCTCAGCTCGTCGAGCTTGAAGAGGAATTCGAACGAGGACGATATATGAGTCATCCTCGAATGATGGAAAGGGCTGCGACGCTGAACCTGCCGGAACGCCAATTAAAGTGCGGTTCCAAAACCGTCGGAatGGATATGAAAAATGCCGATTCCTTCAATGCCAGGTTCCAGCAAGGGGAAGTCGGCTACTCCCGGGGCCAGCCCAATGTTCATGCTCGAAAATTCACCGTCTCGAACGAGAAACACAACTTTCTTCTCTATCCCGAACTTCTGCCCCGAAGGGTAGGCCTGAAACTGAGCGTAACACCTATTCAATTATTTGGCTTAGCCTGGATCAAGTCTAAAATTCCGAGTACTTCCGAGCAATCGAATGGTAAGCGGGCTAGGACCACGTATTCCAGTGCTCAGCTCGTCGAGCTTGAAGAGGAATTCGAACGAGGACGATATATGAGTCATCCTCGAAAGATGGAAAGGGCTGCGACGCTGAACCTGCCGGAACGCCAATTAAAGTGCGGTTCCAAAACCGTCGGAatGGATATGAAAAATGCCGATTCCTTCAATGCCAGGATCCAGCAAGGGGAAGTCAGCTACTCCCGGGGCCAGCCCAATGTTCATGCTCGAAAATTCACCGTCTCGAACGCTAAAATTCCGAGTACTTCCGAGCAATCGAATGGTAAGCGGGCTAGGACCACGTATTCCAGTACTCAGCTCGTCGAGCTTGAAAAGGAATTCGAACGAGGACGATATATGAGTCATCCTCGAATGATGGAAAGGGCTGCGACGCTGAACCTGCCGGAACGCCAATTAAAGTGCGGTTCGAAAACCGTCGGAatGGATATGAAAAATGCCGATTCCTTCAATGCCAGGATCCAGCAAGGGGAAGTCAGCTACTCCCGGGGCCAGCCCAATGTTCATGCTCGAAAATTCACCGTCTCGAACGCTAAAATTCCGAGTACTTCCGAGCAATCGAATGGTAAGCGGGCTAGGACCACGTATTCCAGTGCTCAGCTCGTCGAGCTTGAAGAGGAATTCGAACGAGGACGATATATGAGTCATCCTCGAAGGATGGAAAGGGCTGCGACGCTGAACCTGCCGGAACGCCAATTAAAGTGCGGTTCCAAAACCGTCGGAatGGATATGAAAAATGCCGATTCCTTCAATGCCAGGTTCCAGCAAGGGGAAGTCGGCTACTCCCGGGGCCAGCCCAATGTTCATGCTCGAAAATTCACCGTCTCGAACGAGAAACACAACTTTCTTCTCTATCCCGAACTTCTGCCCCGAAGGGTAGGCCTGAAACTGAGCGTAACACCTATTCAATTATTTGGCTTAGCCTGGATCAAGTCTAAAATTCCGAGTACTTCCGAGCAATCGAATGGTAAGCGGGCTAGGACCACGTATTCCAGTGCTCAGCTCGTCGAGCTTGAAGAGGAATTCGAACGAGGACGATATATGAGTCATCCTCGAATGATGGAAAGGGCTGCGACGCTGAACCTGCCGGAACGCCAATTAAAGTGCGGTTCCAAAACCGTCGGAaaGGATATGAAAAATGCCGATTCCTTCAATGCCAGGTTCCAGCAAGGGGAAGTCGGCTACTCCCGGGGCCAGCCCAATGTTCATGCTCGAAAATTCACCGTCTCGAACGAGAAACACAACTTTCTTCTCTATCCCGAACTTCTGCCCCGAAGGGTAGGCCTGAAACTGAGCGTAACACCTATTCAATTATTTGGCTTAGCCTGGATCAAGTCTAAAATTCCGAGTACTTCCGAGCAATCGAATGGTAAGCGGGCTAGGACCACGTATTCCAGTGCTCAGCTCGTCGAGCTTGAAGAGGAATTCGAACGAGGACGATATATGAGTCATCCTCGAAAGATGGAAAGGGCTGCGACGCTGAACCTGCCGGAACGCCAATTAAAGTGCGGTTCCAAAACCGTCGGAatGGATATGAAAAATGCCGATTCCTTCAATGCCAGGTTCCAGCAAGGGGAAGTCGGCTACTCCCGGGGCCAGCCCAATGTTCATGCTCGAAAATTCACCGTCTCGAACGCTAAAATTCCGAGTACTTCCGAGCAATCGAATGGTAAGCGGGCTAGGACCACGTATTCCAGTACTCAGCTCGTCGAGCTTGAAAAGGAATTCGAACGAGGACGATATATGAGTCATCCTCGAATGATGGAAAGGGCTGCGACGCTGAACCTGCCGGAACGCCAATTAAAGTGCGGTTCGAAAACCGTCGGAatGGATATGAAAAATGCCGATTCCTTCAATGCCAGGATCCAGCAAGGGGAAGTCAGCTACTCCCGGGGCCAGCCCAATGTTCATGCTCGAAAATTCACCGTCTCGAACGCTAAAATTCCGAGTACTTCCGAGCAATCGAATGGTAAGCGGGCTAGGACCACGTATTCCAGTGCTCAGCTCGTCGAGCTTGAAGAGGAATTCGAACGAGGACGATATATGAGTCATCCTCGAAGGATGGAAAGGGCTGCGACGCTGAACCTGCCGGAACGCCAATTAAAGTGCGGTTCCAAAACCGTCGGAatGGATATGAAAAATGCCGATTCCTTCAATGCCAGGTTCCAGCAAGGGGAAGTCGGCTACTCCCGGGGCCAGCCCAATGTTCATGCTCGAAAATTCACCGTCTCGAACGCTAAAATTCCGAGTACTTCCGAGCAATCGAATGGTAAGCGGGCTAGGACCACGTATTCCAGTACTCAGCTCGTCGAGCTTGAAAAGGAATTCGAACGAGGACGATATATGAGTCATCCTCGAATGATGGAAAGGGCTGCGACGCTGAACCTGCCGGAACGCCAATTAAAGTGCGGTTCGAAAACCGTCGGAatGGATATGAAAAATGCCGATTCCTTCAATGCCAGGATCCAGCAAGGGGAAGTCAGCTACTCCCGGGGCCAGCCCAATGTTCATGCTCGAAAATTCACCGTCTCGAACGCTAAAATTCCGAGTACTTCCGAGCAATCGAATGGTAAGCGGGCTAGGACCACGTATTCCAGTGCTCAGCTCGTCGAGCTTGAAGAGGAATTCGAACGAGGACGATATATGAGTCATCCTCGAAGGATGGAAAGGGCTGCGACGCTGAACCTGCCGGAACGCCAATTAAAGTGCGGTTCCAAAACCGTCGGAatGGATATGAAAAATGCCGATTCCTTCAATGCCAGGTTCCAGCAAGGGGAAGTCGGCTACTCCCGGGGCCAGCCCAATGTTCATGCTCGAAAATTCACCGTCTCGAACGCTAAAATTCCGAGTACTTCCGAGCAATCGAATGGTAAGCGGGCTAGGACCACGTATTCCAGTGCTCAGCTCGTCGAGCTTGAAAAGGAATTCGAACGAGGACGATATATGAGTCATCCTCGGAGGATGGAAAGGGCTGCGACGCTGAACCTGCCGGAACGCCAGATTAAAGTGCGGTTCCAAAACCGTCGGAGTAAGTTCAAAAAACAACGAAATCAGAATGATGCGAGGAGTCCGACATTGATGACCGTACCGGAAACTCCATCGGTTTCGACTGAGCAGAATATGCCGGCCTCTTGGTCGGCTCAACCTGTGAAAGTCTCTCAAACATTCAGACAATATCTTAGCGGTCCTCGAAAGAATAAAATTGCTGCGGCGCTGAATGTTACCGAGCGACAGATAAATATCTGGTTCCAACACCATCGTATGGTGTACAAAAAACCACGGAGACAAAGCACTGCGAGGAGTTTGATGTCAATGAATTCGGCAGCAACTTTATTGGTTCCATCTGAGAGGAATACTCAAGCTTCTTGGTCAGTTCAGCCCGttgatatatttgtttataatttcaagatGGATATGAAAAATGCCGATTCCTTCAATGCCAGGTTCCAGCAAGGGGAAGTCGGCTACTCCTGGGGCCAGCCCAATGTTCATGCTCGAAAATTCACCGTCTCGAACGCTAAAATTCCGAGTACTTCCGAGCAATCGAATGGTAAGCGGGCTAGGACCACGTATTCCAGTGCTCAGCTCGTCGAGCTTGAAGAGGAATTCGAACGAGGACGATATATGAGTCATCCTCGAATGATGGACAGGGCTGCGACGCTGAACCTGCCGGAACGCCAATTAAAGTGCGGTTCCAAAACCGTCGGAatGGATATGAAAAATGCCGATTCCTTCAATGCCAGGTTCCAGCAAGGGGAAGTCGGCTACTCCCGGGGCCAGCCCAATGTTCATGCTCGAAAATTCACCGTCTCGAACGAGAAACACAACTTTCTTCTCTATCCCGAACTTCTGCCCCGAAGGGTAGGCCTGAAACTGAGCGTAACACCTATTCAATTATTTGGCTTAGCCTGGATCAAGTCTAAAATTCCGAGTACTTCCGAGCAATCGAATGGTAAGCGGGCTAGGACCACGTATTCCAGTGCTCAGCTCGTCGAGCTTGAAGAGGAATTCGAACGAGGACGATATATGAGTCATCCTCGAAAGATGGAAAGGGCTGCGACGCTGAACCTGCCGGAACGCCAATTAAAGTGCGGTTCCAAAACCGTCGGAatGGATATGAAAAATGCCGATTCCTTCAATGCCAGGTTCCAGCAAGGGGAAGTCGGCTACTCCCGGGGCCAGCCCAATGTTCATGCTCGAAAATTCACCGTCTCGAACGCTAAAATTCCGAGTACTTCCGAGCAATCGAATGGTAAGCGGGCTAGGACCACGTATTCCAGTACTCAGCTCGTCGAGCTTGAAAAGGAATTCGAACGAGGACGATATATGAGTCATCCTCGAATGATGGAAAGGGCTGCGACGCTGAACCTGCCGGAACGCCAATTAAAGTGCGGTTCGAAAACCGTCGGAatGGATATGAAAAATGCCGATTCCTTCAATGCCAGGATCCAGCAAGGGGAAGTCAGCTACTCCCGGGGCCAGCCCAATGTTCATGCTCGAAAATTCACCGTCTCGAACGCTAAAATTCCGAGTACTTCCGAGCAATCGAATGGTAAGCGGGCTAGGACCACGTATTCCAGTGCTCAGCTCGTCGAGCTTGAAGAGGAATTCGAACGAGGACGATATATGAGTCATCCTCGAAGGATGGAAAGGGCTGCGACGCTGAACCTGCCGGAACGCCAATTAAAGTGCGGTTCCAAAACCGTCGGAatGGATATGAAAAATGCCGATTCCTTCAATGCCAGGTTCCAGCAAGGGGAAGTCGGCTACTCCCGGGGCCAGCCCAATGTTCATGCTCGAAAATTCACCGTCTCGAACGCTAAAATTCCGAGTACTTCCGAGCAATCGAATGGTAAGCGGGCTAGGACCACGTATTCCAGTGCTCAGCTCGTCGAGCTTGAAGAGGAATTCAAACGGGGACGATATATGAGTCATCCTCGAATGATGGAAAGGGCTGCGACGCTGAACCTGCCGGAACGCCAATTAAAGTGCGGTTCCAAAACCGTTGGAatGGATATGAAAAATGCCGATTCCTTCAATGCCAGGTTCCAGCAAGGGGAAGTCGGCTACTCCCGGGGCCAGCCCAATGTTCATGCTCGAAAATTCACCGTCTCGAACGCTAAAATTCCGAGTACTTCCGAGCAATCGAATGGTAAGCGGGCTAGGACCACGTATTCCAGTGCTCAGCTCGTCGAGCTTGAAGAGGAATTCGAACGAGGACGATATATGAGTCATTCTCGAAGGATGGAAAGGGCTGCGACGCTGAACCTGCCGGAACGCCAATTAAAGTGCGGTTCCAAAACCGTCGGAGAAGCAGCTTACTCTCCTTCGCGTACAGCTGAAATTCCGAGTACTTTCGAGCAATCGAATGGTATGCGGGCCAGGACCACGTATTCCAGTGCTCAGCTCGTCGAGCTTGAAGAGGAATTCGAACGAGGACGATATATGAGTCATCCTCGAAGGATGGAAAGGGCTGCGACGCTGAACCTGCCGGAACGCCAGATTAAAGTGTGGTTCCAATGCCGTCGGGTTAAGGTAAAAAAACAACGGAATCAGAATGATGTGAGGATTCCGTCATTGATGACCGTACCGGAAACTCCATCGGTTTCGACTGAGAAGAATATGCCAGCCTCTTGGTCGGCTCAACCTGTGGAAGTCTCTCAAACATTCAGACAACATATTAGCCGTCCTCGAAAGATTAAACTGGCCGCGGCCCTGAATGTTACCGAGCGACAGATAAATATCTGGTTTCAAAACCATCGTATAGTGTACAAGAAACCACGGAATAAAAGCACTGCGAGGAGTTCGATGTCAATGAATTCACCCGAAACTTTATTGGTTCTATCTGAGAGGAATACTCAAGCCTCTTGA
Protein Sequence
MMCPSIYVHGVYQPVSEIPSTSEQSNGKRARTTYSSAQLVELEKEFERGRYMSHPRRMERAATLNLPERQIKVRFQNRRSKFKKQRNQNDARSPTLMTVPETPSVSTEQNMPASWSAQPVKVSQTFRQYLSGPRKNKIAAALNVTERQINIWFQHHRMVYKKPRRQSTARSLMSMNSAATLLVPSERNTQASWSVQPVDIFVYNFKMDMKNADSFNARFQQGEVGYSWGQPNVHARKFTVSNAKIPSTSEQSNGKRARTTYSSAQLVELEEEFERGRYMSHPRKMERAATLNLPERQLKCGSKTVGMDMKNADSFNARFQQGEVGYSRGQPNVHARKFTVSNAKIPSTSEQSNGKRARTTYSSTQLVELEKEFERGRYMSHPRMMERAATLNLPERQLKCGSKTVGMDMKNADSFNARIQQGEVSYSRGQPNVHARKFTVSNEKHNFLLYPELLPRRVGLKLSVTPIQLFGLAWIKSKIPSTSEQSNGKRARTTYSSAQLVELEEEFERGRYMSHPRMMERAATLNLPERQLKCGSKTVGMDMKNADSFNARFQQGEVGYSRGQPNVHARKFTVSNEKHNFLLYPELLPRRVGLKLSVTPIQLFGLAWIKSKIPSTSEQSNGKRARTTYSSAQLVELEEEFERGRYMSHPRKMERAATLNLPERQLKCGSKTVGMDMKNADSFNARIQQGEVSYSRGQPNVHARKFTVSNAKIPSTSEQSNGKRARTTYSSTQLVELEKEFERGRYMSHPRMMERAATLNLPERQLKCGSKTVGMDMKNADSFNARIQQGEVSYSRGQPNVHARKFTVSNAKIPSTSEQSNGKRARTTYSSAQLVELEEEFERGRYMSHPRRMERAATLNLPERQLKCGSKTVGMDMKNADSFNARFQQGEVGYSRGQPNVHARKFTVSNEKHNFLLYPELLPRRVGLKLSVTPIQLFGLAWIKSKIPSTSEQSNGKRARTTYSSAQLVELEEEFERGRYMSHPRMMERAATLNLPERQLKCGSKTVGKDMKNADSFNARFQQGEVGYSRGQPNVHARKFTVSNEKHNFLLYPELLPRRVGLKLSVTPIQLFGLAWIKSKIPSTSEQSNGKRARTTYSSAQLVELEEEFERGRYMSHPRKMERAATLNLPERQLKCGSKTVGMDMKNADSFNARFQQGEVGYSRGQPNVHARKFTVSNAKIPSTSEQSNGKRARTTYSSTQLVELEKEFERGRYMSHPRMMERAATLNLPERQLKCGSKTVGMDMKNADSFNARIQQGEVSYSRGQPNVHARKFTVSNAKIPSTSEQSNGKRARTTYSSAQLVELEEEFERGRYMSHPRRMERAATLNLPERQLKCGSKTVGMDMKNADSFNARFQQGEVGYSRGQPNVHARKFTVSNAKIPSTSEQSNGKRARTTYSSTQLVELEKEFERGRYMSHPRMMERAATLNLPERQLKCGSKTVGMDMKNADSFNARIQQGEVSYSRGQPNVHARKFTVSNAKIPSTSEQSNGKRARTTYSSAQLVELEEEFERGRYMSHPRRMERAATLNLPERQLKCGSKTVGMDMKNADSFNARFQQGEVGYSRGQPNVHARKFTVSNAKIPSTSEQSNGKRARTTYSSAQLVELEKEFERGRYMSHPRRMERAATLNLPERQIKVRFQNRRSKFKKQRNQNDARSPTLMTVPETPSVSTEQNMPASWSAQPVKVSQTFRQYLSGPRKNKIAAALNVTERQINIWFQHHRMVYKKPRRQSTARSLMSMNSAATLLVPSERNTQASWSVQPVDIFVYNFKMDMKNADSFNARFQQGEVGYSWGQPNVHARKFTVSNAKIPSTSEQSNGKRARTTYSSAQLVELEEEFERGRYMSHPRMMDRAATLNLPERQLKCGSKTVGMDMKNADSFNARFQQGEVGYSRGQPNVHARKFTVSNEKHNFLLYPELLPRRVGLKLSVTPIQLFGLAWIKSKIPSTSEQSNGKRARTTYSSAQLVELEEEFERGRYMSHPRKMERAATLNLPERQLKCGSKTVGMDMKNADSFNARFQQGEVGYSRGQPNVHARKFTVSNAKIPSTSEQSNGKRARTTYSSTQLVELEKEFERGRYMSHPRMMERAATLNLPERQLKCGSKTVGMDMKNADSFNARIQQGEVSYSRGQPNVHARKFTVSNAKIPSTSEQSNGKRARTTYSSAQLVELEEEFERGRYMSHPRRMERAATLNLPERQLKCGSKTVGMDMKNADSFNARFQQGEVGYSRGQPNVHARKFTVSNAKIPSTSEQSNGKRARTTYSSAQLVELEEEFKRGRYMSHPRMMERAATLNLPERQLKCGSKTVGMDMKNADSFNARFQQGEVGYSRGQPNVHARKFTVSNAKIPSTSEQSNGKRARTTYSSAQLVELEEEFERGRYMSHSRRMERAATLNLPERQLKCGSKTVGEAAYSPSRTAEIPSTFEQSNGMRARTTYSSAQLVELEEEFERGRYMSHPRRMERAATLNLPERQIKVWFQCRRVKVKKQRNQNDVRIPSLMTVPETPSVSTEKNMPASWSAQPVEVSQTFRQHISRPRKIKLAAALNVTERQINIWFQNHRIVYKKPRNKSTARSSMSMNSPETLLVLSERNTQAS

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-