Basic Information

Gene Symbol
-
Assembly
GCA_905147815.1
Location
LR990630.1:3974858-3984553[+]
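
The Location field packs the sequence accession, coordinates and strand into one string. A minimal sketch of splitting it, assuming the layout is accession:start-end[strand] and that the coordinates are 1-based and inclusive (the record does not state the convention); parse_location is a hypothetical helper name:

```python
import re

# Location string copied from this record.
LOCATION = "LR990630.1:3974858-3984553[+]"

# Assumed layout: <accession>:<start>-<end>[<strand>]
_LOC_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> dict:
    """Split a location string into its parts and compute the genomic span."""
    match = _LOC_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    return {
        "seqid": match["seqid"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Assumption: coordinates are 1-based and inclusive.
        "span": end - start + 1,
    }


loc = parse_location(LOCATION)
print(loc["seqid"], loc["strand"], loc["span"])
```

Under the inclusive-coordinate assumption the locus spans 9696 bp on the plus strand.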

Transcription Factor Domain

TF Family
CTF_NFI
Domain
CTF/NFI and MH1 domain
PFAM
PF00859
TF Group
Unclassified Structure
Description
Nuclear factor I (NF-I) proteins, also called CCAAT box-binding transcription factors (CTF) or TGGCA-binding proteins [2, 1, 5], are a family of vertebrate nuclear proteins that recognise and bind, as dimers, the palindromic DNA sequence 5'-TGGCANNNTGCCA-3'. The family was first described for its role in stimulating the initiation of adenovirus DNA replication [6]. Vertebrates have four members (NFIA, NFIB, NFIC and NFIX), and an orthologue from Caenorhabditis elegans, called Nuclear factor I family protein (NFI-I), has been described [4]. CTF/NF-I proteins are individually capable of activating transcription and DNA replication, and thereby regulate cell proliferation and differentiation. They are involved in normal development and have been associated with developmental abnormalities and cancer in humans [5]. A given species has a large number of different CTF/NF-I proteins, generated both by alternative splicing and by the presence of four different genes. CTF/NF-I proteins contain 400 to 600 amino acids. The N-terminal 200 amino acids, almost perfectly conserved in all species and genes sequenced, mediate site-specific DNA recognition, protein dimerisation and adenovirus DNA replication. The C-terminal 100 amino acids contain the transcriptional activation domain, which is the target of gene expression regulatory pathways elicited by growth factors and interacts with basal transcription factors and with histone H3 [3].
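
The palindromic binding site quoted in the description can be checked and searched for with a few lines of code. This is an illustrative sketch only, not the method used by the database: the helper names are hypothetical, N is treated as any of A, C, G or T, and only exact matches to the consensus are reported.

```python
import re

# Consensus quoted in the description; N stands for any base.
CONSENSUS = "TGGCANNNTGCCA"

_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (A, C, G, T, N only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def consensus_to_regex(consensus: str) -> re.Pattern:
    """Turn the consensus into a regex; the lookahead allows overlapping hits."""
    body = consensus.upper().replace("N", "[ACGT]")
    return re.compile(f"(?=({body}))")


def find_sites(seq: str) -> list:
    """Return (0-based start, matched site) for every exact consensus match."""
    pattern = consensus_to_regex(CONSENSUS)
    return [(m.start(), m.group(1)) for m in pattern.finditer(seq.upper())]


# The site is palindromic in the DNA sense: it equals its own reverse complement.
is_palindrome = reverse_complement(CONSENSUS) == CONSENSUS

# Made-up test sequence, not taken from this record.
sites = find_sites("AATGGCAACGTGCCATT")
print(is_palindrome, sites)
```

Because the consensus equals its own reverse complement, scanning one strand is enough to find sites on both.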
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 22 0.0018 32 5.8 0.7 239 281 28 71 10 75 0.77
2 22 0.00092 16 6.7 0.3 230 281 168 220 140 224 0.76
3 22 0.00092 16 6.7 0.3 230 281 317 369 289 373 0.76
4 22 0.00092 16 6.7 0.3 230 281 466 518 438 522 0.76
5 22 0.00092 16 6.7 0.3 230 281 615 667 587 671 0.76
6 22 0.00092 16 6.7 0.3 230 281 764 816 736 820 0.76
7 22 0.00092 16 6.7 0.3 230 281 913 965 885 969 0.76
8 22 0.00092 16 6.7 0.3 230 281 1062 1114 1034 1118 0.76
9 22 0.00092 16 6.7 0.3 230 281 1211 1263 1183 1267 0.76
10 22 0.00092 16 6.7 0.3 230 281 1360 1412 1332 1416 0.76
11 22 0.00046 8 7.7 0.2 230 282 1509 1562 1481 1566 0.76
12 22 0.00093 16 6.7 0.3 231 281 1607 1658 1579 1662 0.75
13 22 0.00042 7.3 7.8 0.5 230 281 1755 1807 1727 1811 0.76
14 22 0.00092 16 6.7 0.3 230 281 1904 1956 1876 1960 0.76
15 22 0.0012 21 6.3 0.2 231 281 2054 2105 2025 2109 0.76
16 22 0.00092 16 6.7 0.3 230 281 2202 2254 2174 2258 0.76
17 22 0.00092 16 6.7 0.3 230 281 2351 2403 2323 2407 0.76
18 22 0.00092 16 6.7 0.3 230 281 2500 2552 2472 2556 0.76
19 22 0.00028 4.8 8.4 0.5 229 281 2648 2701 2620 2705 0.78
20 22 0.00092 16 6.7 0.3 230 281 2798 2850 2770 2854 0.76
21 22 0.00099 17 6.6 0.3 232 281 2949 2999 2919 3003 0.75
22 22 0.00092 16 6.7 0.3 230 281 3096 3148 3068 3152 0.76
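
Each row of the domain table above carries 13 whitespace-separated values: hit index, total hits, c-Evalue, i-Evalue, score, bias, the hmm, ali and env coordinate pairs, and acc. A minimal sketch of reading such rows into typed records, using the first three rows of the table as input; the DomainHit type and parse_row helper are hypothetical names introduced here for illustration:

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int
    of: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line: str) -> DomainHit:
    """Parse one whitespace-separated row of the domain table."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


# First three rows copied from the table in this record.
ROWS = """\
1 22 0.0018 32 5.8 0.7 239 281 28 71 10 75 0.77
2 22 0.00092 16 6.7 0.3 230 281 168 220 140 224 0.76
3 22 0.00092 16 6.7 0.3 230 281 317 369 289 373 0.76
"""

hits = [parse_row(line) for line in ROWS.splitlines()]

# Aligned length of each hit and the distance between consecutive hit starts.
ali_lengths = [h.ali_to - h.ali_from + 1 for h in hits]
spacing = [b.ali_from - a.ali_from for a, b in zip(hits, hits[1:])]
print(ali_lengths, spacing)
```

Applied to the full table, the same spacing calculation shows that most consecutive hits start 149 residues apart and that every hit aligns only to roughly positions 230 to 281 of the profile, which is consistent with the repeated blocks visible in the protein sequence.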

Sequence Information

Coding Sequence
ATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAG
GGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCC
CGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGGGCTGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCAGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGT
GCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCAAAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTTTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCAATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGC
CACATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTGGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGTCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACCAGCCCATGGTGCTGCCCAACGGGCAGGTCTACGGGGAGAAGGTACGTGGGGTGCTGCCCAACGGACAGGGTGCACGCTGCCGCACGCGCACTGCTCGCACTCGCGCCTCGTGTGCCGCATCTCCAACCGCCCGCTCAACGAGCACAACATACCGGACTCAATTCCAGATTGA
Protein Sequence
MVLPNGQVYGEKVRGVLPNGQGARCRTRTARTRASCAASPTARSTSTTSPWCCPTGRSTGRRYVGCCPTDRVHAAARALLALAPRVPHLQPPAQRAQPAHGAAQRAGLRGEGTWGAAQRTGCTLPHAHCSHSRLVCRISNRPLNEHNQPMVLPNGQVYGEKVRGVLPNGQGARCRTRTARTRASCAASPTARSTSTTSPWCCPTGRSTGRRYVGCCPTDRVHAAARALLALAPRVPHLQPPAQRAQPAHGAAQRAGLRGEGTWGAAQRTGCTLPHAHCSHSRLVCRISNRPLNEHNQPMVLPNGQVYGEKVRGVLPNGQGARCRTRTARTRASCAASPTARSTSTTSPWCCPTGRSTGRRYVGCCPTDRVHAAARALLALAPRVPHLQPPAQRAQPAHGAAQRAGLRGEGTWGAAQRTGCTLPHAHCSHSRLVCRISNRPLNEHNQPMVLPNGQVYGEKVRGVLPNGQGARCRTRTARTRASCAASPTARSTSTTSPWCCPTGRSTGRRYVGCCPTDRVHAAARALLALAPRVPHLQPPAQRAQPAHGAAQRAGLRGEGTWGAAQRTGCTLPHAHCSHSRLVCRISNRPLNEHNQPMVLPNGQVYGEKVRGVLPNGQGARCRTRTARTRASCAASPTARSTSTTSPWCCPTGRSTGRRYVGCCPTDRVHAAARALLALAPRVPHLQPPAQRAQPAHGAAQRAGLRGEGTWGAAQRTGCTLPHAHCSHSRLVCRISNRPLNEHNQPMVLPNGQVYGEKVRGVLPNGQGARCRTRTARTRASCAASPTARSTSTTSPWCCPTGRSTGRRYVGCCPTDRVHAAARALLALAPRVPHLQPPAQRAQPAHGAAQRAGLRGEGTWGAAQRTGCTLPHAHCSHSRLVCRISNRPLNEHNQPMVLPNGQVYGEKVRGVLPNGQGARCRTRTARTRASCAASPTARSTSTTSPWCCPTGRSTGRRYVGCCPTDRVHAAARALLALAPRVPHLQPPAQRAQPAHGAAQRAGLRGEGTWGAAQRTGCTLPHAHCSHSRLVCRISNRPLNEHNQPMVLPNGQVYGEKVRGVLPNGQGARCRTRTARTRASCAASPTARSTSTTSPWCCPTGRSTGRRYVGCCPTDRVHAAARALLALAPRVPHLQPPAQRAQPAHGAAQRAGLRGEGTWGAAQRTGCTLPHAHCSHSRLVCRISNRPLNEHNQPMVLPNGQVYGEKVRGVLPNGQGARCRTRTARTRASCAASPTARSTSTTSPWCCPTGRSTGRRYVGCCPTDRVHAAARALLALAPRVPHLQPPAQRAQPAHGAAQRAGLRGEGTWGAAQRTGCTLPHAHCSHSRLVCRISNRPLNEHNQPMVLPNGQVYGEKVRGVLPNGQGARCRTRTARTRASCAASPTARSTSTTSPWCCPTGRSTGRRYVGCCPTDRVHAAARALLALAPRVPHLQPPAQRAQPAHGAAQRAGLRGEGTWGAAQRTGCTLPHAHCSHSRLVCRISNRPLNEHNQPMVLPNGQVYGEKVRGVLPNGQGARCRTRTARTRASCAASPTARSTSTTSPWCCPTGRSTGRRYVGCCPTDRGLHAHCSHSRLVCRISNRPLNEHNQPMVLPNGQVYGEKVRGVLPNGQGARCRTRTARTRASCAASPTARSTSTTSPWCCPTGRSTGRRYVGCCPTDRVHAAARALLALAPRVPHLQPPAQRAQPAHGAAQRAGLRGEGTWGAAQRTGCTLPHAHCSHSRLVCRISNRPLNEHNQPMVLPNGQVYGEKVRGVLPNGQGARCRTRTARTRASCAASPTARSTSTTSPWCCPTGRSTGRRYVGCSPTDRVHAAARALLALAPRVPHLQPPAQRAQPAHGAAQRAGLRGEGTWGAAQRTGCTLPHAHCSHSRLVCRISNRPLNEHNQPMVLPNGQVYGEKVRGVLPNGQGARCRTRTARTRASCAASPTARSTSTTSPWCCPTGRSTGRRYVGCCPTDRVHAAARALLALAPRVPHLQPPAQRAQPAHGAAQRAGLRGEGTWG
AAQRTGCTLPHAHCSHSRLVCRISNRPLNEHNQPMVLPNGQVYGEKVRGVLPNGQGARCRTRTARTRASCAASPTARSTSKTSPWCCPTGRSTGRRYVGCCPTDRVHAAARALLALAPRVPHLQPPAQRAQPAHGAAQRAGLRGEGTWGAAQRTGCTLPHAHCSHSRLVCRISNRPLNEHNQPMVLPNGQVYGEKVRGVLPNGQGARCRTRTARTRASCAASPTARSTSTTSPWCCPTGRSTGRRYVGCCPTDRVHAAARALLALAPRVPHLQPPAQRAQPAHGAAQRAGLRGEGTWGAAQRTGCTLPHAHCSHSRLVCRISNRPLNEHNQPMVLPNGQVYGEKVRGVLPNGQGARCRTRTARTRASCAASPTARSTSTTSPWCCPTGRSTGRRYVGCCPTDRVHAAARALLALAPRVPHLQPPAQRAQPAHGAAQRAGLRGEGTWGAAQRTGCTLPHAHCSHSRLVCRISNRPLNEHNQPMVLPNGQVYGEKVRGVLPNGQGARCRTRTARTRASCAASPTARSTSTTSPWCCPTGRSTGRRYVGCCPTDRVHAAARALLALAPRVPHLQPPAQRAQPAHGAAQRAGLRGEGTWGAAQRTGCTLPHAHCSHSRLVCRISNRPLNEHNQPMVLPNGQVYGEKVRGVLPNGQGARCRTRTARTRASCATSPTARSTSTTSPWCCPTGRSTGRRYVGCCPTDRVHAAARALLALAPRVPHLQPPAQRAQPAHGAAQRAGLRGEGTWGAAQRTGCTLPHAHCSHSRLVCRISNRPLNEHNQPMVLPNGQVYGEKVRGVLPNGQGARCRTRTARTRASCAASPTARSTSTTSPWCCPTGRSTGRRYVGCCPTDRVHAAARALLALAPRVPHLQPPAQRAQPAHGAAQRAGLRGEGTWGAAQRTGCTLPHAHCSHSRLVCRISNRPLNEHNQPMVLPNGQVYGEKVRGVLPNGQGARCRTRTAGTRASCAASPTARSTSTTSPWCCPTGRSTGRRYVGCCPTDRVHAVARALLALAPRVPHLQPPAQRAQPAHGAAQRAGLRGEGTWGAAQRTGCTLPHAHCSHSRLVCRISNRPLNEHNQPMVLPNGQVYGEKVRGVLPNGQGARCRTRTARTRASCAASPTARSTSTTSPWCCPTGRSTGRRYVGCCPTDRVHAAARALLALAPRVPHLQPPAQRAQPAHGAAQRAGLRGEGTWGAAQRTGCTLPHAHCSHSRLVCRISNRPLNEHNIPDSIPD*

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-