Basic Information

Gene Symbol
PLB1
Assembly
GCA_950371165.1
Location
OX493387.1:2266043-2271987[-]
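
The location string above appears to follow a seqid:start-end[strand] layout. The Python sketch below splits it into fields. It assumes 1-based inclusive coordinates, which this record does not state, and the names Location, parse_location and span_length are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


# Layout assumed from the record: seqid:start-end[strand]
LOC_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a 'seqid:start-end[strand]' string into typed fields."""
    m = LOC_RE.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(m["seqid"], int(m["start"]), int(m["end"]), m["strand"])


def span_length(loc: Location) -> int:
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("OX493387.1:2266043-2271987[-]")
print(loc.seqid, loc.strand, span_length(loc))
```

The span is the genomic extent of the locus, not the coding sequence length, so the two need not match.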

Transcription Factor Domain

TF Family
AP-2
Domain
TF_AP-2 domain
PFAM
PF03299
TF Group
Basic Domains group
Description
Activator protein-2 (AP-2) transcription factors constitute a family of closely related and evolutionarily conserved proteins that bind to the DNA consensus sequence GCCNNNGGC and stimulate target gene transcription [PMID: 2010091, PMID: 1998122]. Four different isoforms of AP-2 have been identified in mammals, termed AP-2 alpha, beta, gamma and delta. Each family member shares a common structure, possessing a proline/glutamine-rich domain in the N-terminal region, which is responsible for transcriptional activation [PMID: 2010091], and a helix-span-helix domain in the C-terminal region, which mediates dimerisation and site-specific DNA binding [PMID: 1998122]. http://www.ebi.ac.uk/interpro/entry/IPR013854
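
As an illustration of the consensus GCCNNNGGC mentioned in the description, the Python sketch below scans a DNA string for exact matches, treating N as any of A, C, G or T. It shows plain pattern matching only and is not a model of AP-2 binding. The helper names consensus_to_regex and find_sites and the example sequence are made up for this sketch.

```python
import re

CONSENSUS = "GCCNNNGGC"  # consensus quoted in the description


def consensus_to_regex(consensus: str) -> "re.Pattern[str]":
    """Turn a consensus made of A/C/G/T/N into a regex (N = any base).

    Other IUPAC ambiguity codes are not handled in this sketch.
    """
    parts = ["[ACGT]" if base == "N" else re.escape(base)
             for base in consensus.upper()]
    # A lookahead group is used so that overlapping sites are all reported.
    return re.compile("(?=(" + "".join(parts) + "))")


def find_sites(seq: str, consensus: str = CONSENSUS):
    """Return (1-based start, matched site) for every hit on the given strand."""
    pattern = consensus_to_regex(consensus)
    return [(m.start() + 1, m.group(1)) for m in pattern.finditer(seq.upper())]


# Hypothetical example sequence, not taken from this record.
print(find_sites("TTGCCAAAGGCTT"))
```

Because GCCNNNGGC is its own reverse complement, scanning one strand is enough to find exact matches on both.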
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 28 2.8e-05 0.72 11.2 0.0 127 195 123 190 106 192 0.89
2 28 0.00056 14 6.9 0.0 139 195 184 239 180 241 0.88
3 28 0.00055 14 6.9 0.0 139 195 233 288 229 290 0.88
4 28 0.00072 18 6.6 0.0 140 195 283 337 280 340 0.88
5 28 0.00051 13 7.1 0.0 138 195 330 386 326 388 0.88
6 28 0.0006 15 6.8 0.0 139 195 380 435 377 437 0.88
7 28 0.00051 13 7.1 0.0 138 195 428 484 424 486 0.88
8 28 0.00051 13 7.1 0.0 138 195 477 533 473 535 0.88
9 28 0.0006 15 6.8 0.0 139 195 527 582 524 584 0.88
10 28 0.0006 15 6.8 0.1 139 195 576 631 573 634 0.88
11 28 0.0006 15 6.8 0.0 139 195 625 680 622 682 0.88
12 28 0.0006 15 6.8 0.0 139 195 674 729 671 731 0.88
13 28 0.00056 14 6.9 0.0 139 195 723 778 719 780 0.88
14 28 0.00091 23 6.2 0.0 140 195 773 827 771 829 0.87
15 28 0.0005 13 7.1 0.0 138 195 820 876 816 878 0.88
16 28 0.00051 13 7.1 0.0 138 195 869 925 865 927 0.88
17 28 0.0012 30 5.9 0.0 139 195 919 974 916 975 0.88
18 28 0.00055 14 6.9 0.0 139 195 968 1023 964 1025 0.88
19 28 0.0006 15 6.8 0.0 139 195 1017 1072 1014 1074 0.88
20 28 0.0006 15 6.8 0.0 139 195 1066 1121 1063 1123 0.88
21 28 0.00072 18 6.6 0.0 140 195 1116 1170 1113 1173 0.88
22 28 0.00042 11 7.3 0.0 137 195 1162 1219 1156 1221 0.89
23 28 0.00048 12 7.1 0.0 138 195 1212 1268 1206 1269 0.88
24 28 0.0006 15 6.8 0.0 139 195 1262 1317 1259 1319 0.88
25 28 0.0006 15 6.8 0.0 139 195 1311 1366 1308 1368 0.88
26 28 0.00051 13 7.1 0.0 138 195 1359 1415 1355 1417 0.88
27 28 0.0006 15 6.8 0.0 139 195 1409 1464 1406 1466 0.88
28 28 0.00051 13 7.1 0.0 138 195 1457 1513 1453 1515 0.88
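
The rows above can be read as 13 whitespace-separated fields each. The Python sketch below parses them into typed records. The field names in DomainHit are chosen for this sketch from the column header, the 1.0 cutoff is arbitrary and not a threshold used by this record, and only the first two rows are included as sample input.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    index: int       # domain number
    total: int       # number of domains reported
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line: str) -> DomainHit:
    """Parse one whitespace-separated row of the domain table."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        int(f[6]), int(f[7]), int(f[8]), int(f[9]), int(f[10]), int(f[11]),
        float(f[12]),
    )


# First two rows of the table above, copied verbatim.
rows = """1 28 2.8e-05 0.72 11.2 0.0 127 195 123 190 106 192 0.89
2 28 0.00056 14 6.9 0.0 139 195 184 239 180 241 0.88"""

hits = [parse_row(r) for r in rows.splitlines()]

# Example filter: keep hits whose independent E-value is below an
# arbitrary illustrative cutoff of 1.0.
strong = [h.index for h in hits if h.i_evalue < 1.0]
print(strong)
```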

Sequence Information

Coding Sequence
ATGCGCTCCTCCCCGGACATAGACATCGCGAACGACTGGAAGATGGTGACGGTATTCATCGGCGCCAACGACCTGTGCTCGGCGTCGTGCCTCAACCCCGTGTCGTGGTCGCCGGCCGCGCACGCCAAGAAGCTCAGCATCGCGCTCGACTACCTGCAGAAGCATCTGCCCAGGACTATCGTTAATTTGGTACCAGTTCTAGACGTATCAGTATCTATCCGCGTGCTTCGTCCGATGATGTGTCGTCTGATGCACTCGCTGTTCTGCACCTGCTTCCACCACCAGGGCGGCAACGAGCTGTACGACCTCGTGCGGATGGCGCGGCTGTACCAGAAGGCTGAGGTTGCTCTGGTGGAAAGCGGCCGCTACGACACGAGCGAGGACTTCACGGTGATCGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACG
TGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACAAACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCT
ACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGGTACGTTACGTGTACGTGTTACTGTGCAGCCCTTCATGCGGCTGTTCAACGCGCCCATGCCGCCCACCCGGCCGCTGCCGCTCGTCATCCACCAGTCCTACATCACGCACGACTGCTTCCACTTCTCGCAGAAGGGCCACGCGCTAGCCGCCAACCTCCTGTGGAACAACCTGCTAGAGCCGGTGGGCAACAAGTCTGACAACGTGCCGCCCGTACTGCTGCGCTCCTTCCGCTGTCCCTCGCGCCGCGCGCCCTACCTGTTCACGTACAAGAACTCGCGCAGCTACGTGATCAGCGGCAAGCAGGAGGGGGCTGATGATGTATTCCTATGA
Protein Sequence
MRSSPDIDIANDWKMVTVFIGANDLCSASCLNPVSWSPAAHAKKLSIALDYLQKHLPRTIVNLVPVLDVSVSIRVLRPMMCRLMHSLFCTCFHHQGGNELYDLVRMARLYQKAEVALVESGRYDTSEDFTVIVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITNDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALGTLRVRVTVQPFMRLFNAPMPPTRPLPLVIHQSYITHDCFHFSQKGHALAANLLWNNLLEPVGNKSDNVPPVLLRSFRCPSRRAPYLFTYKNSRSYVISGKQEGADDVFL
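
The coding and protein sequences above can be cross-checked by translating the coding sequence. The Python sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly. The function name translate is a hypothetical helper, and only short fragments from the start and end of the coding sequence are used as input.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated with base
# order T, C, A, G at each position. ASSUMPTION: this table applies here.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 1.

    Codons containing characters other than A/C/G/T become 'X'.
    A trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 27 bases and last 12 bases of the coding sequence above.
print(translate("ATGCGCTCCTCCCCGGACATAGACATC"))
print(translate("GTATTCCTATGA"))
```

Both fragments agree with the start (MRSSPDIDI) and end (VFL) of the protein sequence listed above.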

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-