Mbra005495.1
Basic Information
- Insect
- Mamestra brassicae
- Gene Symbol
- ILP1
- Assembly
- GCA_905163435.1
- Location
- LR990990.1:17132702-17159118[+]
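The Location value above packs the sequence accession, start, end, and strand into one string. A minimal sketch of splitting it into fields follows. The `Location` type and the `parse_location` and `span_length` helpers are hypothetical names introduced for illustration, and the span length assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "LR990990.1:17132702-17159118[+]".
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a 'seqid:start-end[strand]' string into typed fields."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["seqid"], int(match["start"]), int(match["end"]), match["strand"]
    )


def span_length(loc: Location) -> int:
    """Genomic span in bases, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("LR990990.1:17132702-17159118[+]")
print(loc.seqid, loc.start, loc.end, loc.strand, span_length(loc))
```

The span covers the whole locus on the assembly, introns included, so it is not expected to match the coding sequence length.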
Transcription Factor Domain
- TF Family
- GCFC
- Domain
- GCFC domain
- PFAM
- PF07842
- TF Group
- Unclassified Structure
- Description
- This entry describes a domain found in a number of GC-rich sequence DNA-binding factor proteins and homologues [4, 5], as well as in a number of other proteins including Tuftelin-interacting protein 11 [1]. While the function of the domain is unknown, some of the proteins it is found in are reported to be involved in pre-mRNA splicing [1, 2]. This domain is also found in Sip1, a septin interacting protein [3].
- Hmmscan Out
-
 #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
 1  30   0.00019       2.9    8.1   0.0         6      40       204     238       199     242  0.90
 2  30   0.00019       2.9    8.1   0.0         6      40       261     295       256     299  0.90
 3  30   0.00019       2.9    8.1   0.0         6      40       318     352       313     356  0.90
 4  30   0.00019       2.9    8.1   0.0         6      40       375     409       370     413  0.90
 5  30   0.00019       2.9    8.1   0.0         6      40       432     466       427     470  0.90
 6  30   0.00019       2.9    8.1   0.0         6      40       489     523       484     527  0.90
 7  30   0.00019       2.9    8.1   0.0         6      40       546     580       541     584  0.90
 8  30   0.00016       2.4    8.4   0.0         6      41       603     638       598     663  0.88
 9  30   0.00019       2.9    8.1   0.0         6      40       669     703       664     707  0.90
10  30   0.00019       2.9    8.1   0.0         6      40       726     760       721     764  0.90
11  30   0.00019       2.9    8.1   0.0         6      40       783     817       778     821  0.90
12  30   0.00019       2.9    8.1   0.0         6      40       840     874       835     878  0.90
13  30   0.00019       2.9    8.1   0.0         6      40       897     931       892     935  0.90
14  30   0.00019       2.9    8.1   0.0         6      40       954     988       949     992  0.90
15  30   0.00019       2.9    8.1   0.0         6      40      1011    1045      1006    1049  0.90
16  30   0.00019       2.9    8.1   0.0         6      40      1068    1102      1063    1106  0.90
17  30   0.00019       2.9    8.1   0.0         6      40      1125    1159      1120    1163  0.90
18  30   0.00019       2.9    8.1   0.0         6      40      1182    1216      1177    1220  0.90
19  30   0.00019       2.9    8.1   0.0         6      40      1239    1273      1234    1277  0.90
20  30   0.00019       2.9    8.1   0.0         6      40      1296    1330      1291    1334  0.90
21  30   0.00019       2.9    8.1   0.0         6      40      1353    1387      1348    1391  0.90
22  30   0.00019       2.9    8.1   0.0         6      40      1410    1444      1405    1448  0.90
23  30   0.00019       2.9    8.1   0.0         6      40      1467    1501      1462    1505  0.90
24  30   0.00019       2.9    8.1   0.0         6      40      1524    1558      1519    1562  0.90
25  30   0.00019       2.9    8.1   0.0         6      40      1581    1615      1576    1619  0.90
26  30   0.00019       2.9    8.1   0.0         6      40      1638    1672      1633    1676  0.90
27  30   0.00019       2.9    8.1   0.0         6      40      1695    1729      1690    1733  0.90
28  30   0.00019       2.9    8.1   0.0         6      40      1752    1786      1747    1790  0.90
29  30   0.00019       2.9    8.1   0.0         6      40      1809    1843      1804    1847  0.90
30  30   1.5e-13   2.2e-09   38.1   0.0         6     190      1866    2030      1861    2039  0.70
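Each domain hit in the Hmmscan Out listing above carries 13 whitespace-separated fields. The sketch below parses a few of those rows and separates the one strong hit from the weak repeated ones. The `DomainHit` type, the `parse_hit` helper, and the `I_EVALUE_CUTOFF` of 1e-3 are assumptions made for illustration; the record itself does not state which threshold was used.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(row: str) -> DomainHit:
    """Parse one 13-field domain row as listed in this record."""
    f = row.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {row!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *map(int, f[6:12]),
        float(f[12]),
    )


# Three rows copied from the listing (hits 1, 8 and 30).
ROWS = [
    "1 30 0.00019 2.9 8.1 0.0 6 40 204 238 199 242 0.90",
    "8 30 0.00016 2.4 8.4 0.0 6 41 603 638 598 663 0.88",
    "30 30 1.5e-13 2.2e-09 38.1 0.0 6 190 1866 2030 1861 2039 0.70",
]

I_EVALUE_CUTOFF = 1e-3  # hypothetical threshold, not taken from the record

hits: List[DomainHit] = [parse_hit(r) for r in ROWS]
significant = [h for h in hits if h.i_evalue < I_EVALUE_CUTOFF]

for h in significant:
    hmm_span = h.hmm_to - h.hmm_from + 1  # model positions covered
    ali_span = h.ali_to - h.ali_from + 1  # protein residues aligned
    print(h.index, h.i_evalue, hmm_span, ali_span)
```

Under this assumed cutoff only hit 30 passes; the other sampled hits have independent E-values above 1 and cover only model positions 6 to about 40.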
Sequence Information
- Coding Sequence
- ATGTTCAAGGGTGCAAGTCGGCTCAGGTTAGGAGAGCTCCAAGTAGAGCGCTCGGCGACCTCAGACACGCAGAAGGAGCTCCGCGAGAGACTGCTAACCTCAGCCAGGATACGAGAGAGCCGCGCGGCCCGCTGCGGAGAGCTAGACGCGGCGTATAGGAGAGCGCAGGCTATACGCGGCTACCTCACCGACCTTATCGAGTGCCTCGATGAGAAGATGCCACAActggaggcgttggaagctCGCGCGCTGGCGCTGCACAAGCGTCGCTGCGAGTTCCTCGTAGAGCGACGACGTGCCGACCTGCGGGACCAGGCGCAGGACGTGCTCGCTCCACCTGGTCGAGCCTCAAAGCAAGTCGACAGCGAAGAGAAGACGCGGCGCGCCGCCGAGCGGGAGGGGCGCCGGCGTGCGCGCCGCCTCAAGCGCgaggccgccgccgccgccgccggcgccgcgctGCCTCACCGCGACGGAGACTCCTCGGATGACGAGCTGCCTCCGCACGAGATGCATCACTATACGCAGGAGAGAGACGCCATACGTCAACAATCAGCATCACTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGA
GTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAA
ACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGGTAACTATCAAACACATAATATTGTTCAGCGACGCGCTGCCCGCGTGGCGCAGCGTGTCAGGAGTATGCGGGCGCCTCGCACGCTGGCGAGCGCGCGCCACGTCTCTCTATACGGACGCCTACGTGGCCGACTGCCTGCCCAAGCTGCTGGCGCCCTATGTCAGGCACGAGCTCATATTGTGGAACCCACTGGCAGACGAAGACAACGAAGACTACGAGAGAATGGATTGGTACAAATGCCTGATGATGTACGGCGTGCGCACGGAGCGCTCTGCAGACGACTCCTCCTCCTCAGGGTCGGAAGGAGAGCCCGAGCCGCCGCCCGTCACCAACACTACTGTGCGGGACGACCCCGATCTGCTGCTGGTGCCCAGCATCATCAGCAGGGTGGTGCTGCCCTGTCTCACAGAGCTGGTGTCAGTGGCGTGGGACCCGCTGAGCGTGCGCTCGTGCATCCGCCTGCGCTCGCTGC
TGGTGCGCGCGGCGGGCCTGCCCACGTGCTGCGCGGCGCTGCAGCGCCTGTCCGCTGCGCTGCGGGCCAGGCTGGCCACCGCGCTCGGCGCTGACGTGTTCCTGCCGGCCTTGCCGCCACAAGTAATGGAAGGCCCGGGTGGCGCTTTCTGGCGTCGCTGCTTAGGAGCAGGAGTCAGGCTGCTGCGCGCCACGCTGGCGCTCACCGGCCCGCCTGACTTGCTGTATGCTGACCCGCTCGTACTGTCGCTTATTGAGACACTAtgctgcggcgcgggcgcagcgggcGGGCCGTACATAGCTAGCGCGGCGTCCGCGCTCACGGACACGCTGCCGCGCTCGGGCGCGCTGCGCAAGCGAGCGCTGGCGCGGCTGGCGGCGCTCGCTACACTGGCGCTGTCGAGGCTGGACAGTGACAACCCGCTGCACTTGTGA
- Protein Sequence
- MFKGASRLRLGELQVERSATSDTQKELRERLLTSARIRESRAARCGELDAAYRRAQAIRGYLTDLIECLDEKMPQLEALEARALALHKRRCEFLVERRRADLRDQAQDVLAPPGRASKQVDSEEKTRRAAEREGRRRARRLKREAAAAAAGAALPHRDGDSSDDELPPHEMHHYTQERDAIRQQSASLFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHEVTIKHIILFSDALPAWRSVSGVCGRLARWRARATSLYTDAYVADCLPKLLAPYVRHELILWNPLADEDNEDYERMDWYKCLMMYGVRTERSADDSSSSGSEGEPEPPPVTNTTVRDDPDLLLVPSIISRVVLPCLTELVSVAWDPLSVRSCIRLRS
LLVRAAGLPTCCAALQRLSAALRARLATALGADVFLPALPPQVMEGPGGAFWRRCLGAGVRLLRATLALTGPPDLLYADPLVLSLIETLCCGAGAAGGPYIASAASALTDTLPRSGALRKRALARLAALATLALSRLDSDNPLHL*
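The protein sequence above can be checked against the coding sequence by translating codon by codon. The sketch below assumes the standard genetic code and treats the lowercase stretches in the coding sequence as ordinary bases. The `translate` helper is a hypothetical name, and the two inputs are only the first 18 and last 18 bases of the coding sequence, not the full record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks, ignore case
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Codons with ambiguous bases (e.g. N) are reported as 'X'.
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


# First and last 18 bases of the coding sequence in this record.
print(translate("ATGTTCAAGGGTGCAAGT"))  # start of the protein
print(translate("AACCCGCTGCACTTGTGA"))  # end of the protein, with stop
```

The two fragments translate to `MFKGAS` and `NPLHL*`, matching the start and end of the listed protein sequence.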
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -