Basic Information

Gene Symbol
ILP1
Assembly
GCA_905163435.1
Location
LR990990.1:17132702-17159118[+]

Transcription Factor Domain

TF Family
GCFC
Domain
GCFC domain
PFAM
PF07842
TF Group
Unclassified Structure
Description
This entry describes a domain found in a number of GC-rich sequence DNA-binding factor proteins and homologues [4, 5], as well as in a number of other proteins including Tuftelin-interacting protein 11 [1]. While the function of the domain is unknown, some of the proteins it is found in are reported to be involved in pre-mRNA splicing [1, 2]. This domain is also found in Sip1, a septin interacting protein [3].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 30 0.00019 2.9 8.1 0.0 6 40 204 238 199 242 0.90
2 30 0.00019 2.9 8.1 0.0 6 40 261 295 256 299 0.90
3 30 0.00019 2.9 8.1 0.0 6 40 318 352 313 356 0.90
4 30 0.00019 2.9 8.1 0.0 6 40 375 409 370 413 0.90
5 30 0.00019 2.9 8.1 0.0 6 40 432 466 427 470 0.90
6 30 0.00019 2.9 8.1 0.0 6 40 489 523 484 527 0.90
7 30 0.00019 2.9 8.1 0.0 6 40 546 580 541 584 0.90
8 30 0.00016 2.4 8.4 0.0 6 41 603 638 598 663 0.88
9 30 0.00019 2.9 8.1 0.0 6 40 669 703 664 707 0.90
10 30 0.00019 2.9 8.1 0.0 6 40 726 760 721 764 0.90
11 30 0.00019 2.9 8.1 0.0 6 40 783 817 778 821 0.90
12 30 0.00019 2.9 8.1 0.0 6 40 840 874 835 878 0.90
13 30 0.00019 2.9 8.1 0.0 6 40 897 931 892 935 0.90
14 30 0.00019 2.9 8.1 0.0 6 40 954 988 949 992 0.90
15 30 0.00019 2.9 8.1 0.0 6 40 1011 1045 1006 1049 0.90
16 30 0.00019 2.9 8.1 0.0 6 40 1068 1102 1063 1106 0.90
17 30 0.00019 2.9 8.1 0.0 6 40 1125 1159 1120 1163 0.90
18 30 0.00019 2.9 8.1 0.0 6 40 1182 1216 1177 1220 0.90
19 30 0.00019 2.9 8.1 0.0 6 40 1239 1273 1234 1277 0.90
20 30 0.00019 2.9 8.1 0.0 6 40 1296 1330 1291 1334 0.90
21 30 0.00019 2.9 8.1 0.0 6 40 1353 1387 1348 1391 0.90
22 30 0.00019 2.9 8.1 0.0 6 40 1410 1444 1405 1448 0.90
23 30 0.00019 2.9 8.1 0.0 6 40 1467 1501 1462 1505 0.90
24 30 0.00019 2.9 8.1 0.0 6 40 1524 1558 1519 1562 0.90
25 30 0.00019 2.9 8.1 0.0 6 40 1581 1615 1576 1619 0.90
26 30 0.00019 2.9 8.1 0.0 6 40 1638 1672 1633 1676 0.90
27 30 0.00019 2.9 8.1 0.0 6 40 1695 1729 1690 1733 0.90
28 30 0.00019 2.9 8.1 0.0 6 40 1752 1786 1747 1790 0.90
29 30 0.00019 2.9 8.1 0.0 6 40 1809 1843 1804 1847 0.90
30 30 1.5e-13 2.2e-09 38.1 0.0 6 190 1866 2030 1861 2039 0.70

Sequence Information

Coding Sequence
ATGTTCAAGGGTGCAAGTCGGCTCAGGTTAGGAGAGCTCCAAGTAGAGCGCTCGGCGACCTCAGACACGCAGAAGGAGCTCCGCGAGAGACTGCTAACCTCAGCCAGGATACGAGAGAGCCGCGCGGCCCGCTGCGGAGAGCTAGACGCGGCGTATAGGAGAGCGCAGGCTATACGCGGCTACCTCACCGACCTTATCGAGTGCCTCGATGAGAAGATGCCACAActggaggcgttggaagctCGCGCGCTGGCGCTGCACAAGCGTCGCTGCGAGTTCCTCGTAGAGCGACGACGTGCCGACCTGCGGGACCAGGCGCAGGACGTGCTCGCTCCACCTGGTCGAGCCTCAAAGCAAGTCGACAGCGAAGAGAAGACGCGGCGCGCCGCCGAGCGGGAGGGGCGCCGGCGTGCGCGCCGCCTCAAGCGCgaggccgccgccgccgccgccggcgccgcgctGCCTCACCGCGACGGAGACTCCTCGGATGACGAGCTGCCTCCGCACGAGATGCATCACTATACGCAGGAGAGAGACGCCATACGTCAACAATCAGCATCACTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGCTCATATTGTGGAACCCACTGGCAGACGAAGACAACGAAGACTACGAGAGAATGGATTGGTACAAATGCCTGATGATGTACGGCGTGCGCACGGAGCGCTCTGCAGACGACTCCTCCTCCTCAGGGTCGGAAGGAGAGCCCGAGCCGCCGCCCGTCACCAACACTACTGTGCGGGACGACCCCGATCTGCTGCTGGTGCCCAGCATCATCAGCAGGGTGGTGCTGCCCTGTCTCACAGAGCTGGTGTCAGTGGCGTGGGACCCGCTGAGCGTGCGCTCGTGCATCCGCCTGCGCTCGCTGCTGGTGCGCGCGGCGGGCCTGCCCACGTGCTGCGCGGCGCTGCAGCGCCTGTCCGCTGCGCTGCGGGCCAGGCTGGCCACCGCGCTCGGCGCTGACGTGTTCCTGCCGGCCTTGCCGCCACAAGTAATGGAAGGCCCGGGTGGCGCTTTCTGGCGTCGCTGCTTAGGAGCAGGAGTCAGGCTGCTGCGCGCCACGCTGGCGCTCACCGGCCCGCCTGACTTGCTGTATGCTGACCCGCTCGTACTGTCGCTTATTGAGACACTAtgctgcggcgcgggcgcagcgggcGGGCCGTACATAGCTAGCGCGGCGTCCGCGCTCACGGACACGCTGCCGCGCTCGGGCGCGCTGCGCAAGCGAGCGCTGGCGCGGCTGGCGGCGCTCGCTACACTGGCGCTGTCGAGGCTGGACAGTGACAACCCGCTGCACTTGTGA
Protein Sequence
MFKGASRLRLGELQVERSATSDTQKELRERLLTSARIRESRAARCGELDAAYRRAQAIRGYLTDLIECLDEKMPQLEALEARALALHKRRCEFLVERRRADLRDQAQDVLAPPGRASKQVDSEEKTRRAAEREGRRRARRLKREAAAAAAGAALPHRDGDSSDDELPPHEMHHYTQERDAIRQQSASLFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHELILWNPLADEDNEDYERMDWYKCLMMYGVRTERSADDSSSSGSEGEPEPPPVTNTTVRDDPDLLLVPSIISRVVLPCLTELVSVAWDPLSVRSCIRLRSLLVRAAGLPTCCAALQRLSAALRARLATALGADVFLPALPPQVMEGPGGAFWRRCLGAGVRLLRATLALTGPPDLLYADPLVLSLIETLCCGAGAAGGPYIASAASALTDTLPRSGALRKRALARLAALATLALSRLDSDNPLHL*

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-