Basic Information

Insect
Lerema accius
Gene Symbol
-
Assembly
None
Location
scaffold929:258548-334161[+]
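The Location value combines a scaffold name, a coordinate range, and a strand in one string. A minimal sketch of splitting it into fields is given below. The function name `parse_location` and the returned keys are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re

# Pattern for strings shaped like "scaffold929:258548-334161[+]".
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)

def parse_location(text):
    """Split a location string into scaffold, start, end, strand, and span."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start = int(match["start"])
    end = int(match["end"])
    return {
        "seqid": match["seqid"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Assumption: coordinates are 1-based and inclusive.
        "span": end - start + 1,
    }

location = parse_location("scaffold929:258548-334161[+]")
```

Under the inclusive-coordinate assumption, `location["span"]` is the length of the genomic interval, introns included, so it is expected to be much larger than the coding sequence length.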

Transcription Factor Domain

TF Family
AF-4
Domain
AF-4 domain
PFAM
PF05110
TF Group
Unclassified Structure
Description
This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental disabilities [1]. The family also contains a Drosophila AF4 protein homologue, Lilliputian, which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila [2].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to  acc
1  5   6.6e-05   1.1       8.3    0.0   43        111     62        129     58        151     0.67
2  5   3.2e-11   5.4e-07   29.2   14.5  343       513     166       323     145       324     0.70
3  5   1         1.7e+04   -9.1   11.7  87        215     340       463     325       567     0.50
4  5   0.066     1.1e+03   -1.6   1.7   44        69      562       587     444       609     0.44
5  5   0.3       5.2e+03   -3.8   0.3   434       455     848       868     835       885     0.40
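Each row above describes one candidate domain with 13 whitespace-separated values. The sketch below shows one way the rows could be read into records and filtered by independent E-value. The class `DomainHit`, the function `parse_domain_row`, and the 0.01 cutoff are illustrative assumptions, not part of this record or of HMMER itself.

```python
from dataclasses import dataclass

@dataclass
class DomainHit:
    index: int       # "#": domain number
    total: int       # "of": number of domains reported
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float

def parse_domain_row(line):
    """Parse one whitespace-separated domain row into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]),
        float(fields[4]), float(fields[5]),
        *(int(value) for value in fields[6:12]),
        float(fields[12]),
    )

ROWS = """\
1 5 6.6e-05 1.1 8.3 0.0 43 111 62 129 58 151 0.67
2 5 3.2e-11 5.4e-07 29.2 14.5 343 513 166 323 145 324 0.70
3 5 1 1.7e+04 -9.1 11.7 87 215 340 463 325 567 0.50
4 5 0.066 1.1e+03 -1.6 1.7 44 69 562 587 444 609 0.44
5 5 0.3 5.2e+03 -3.8 0.3 434 455 848 868 835 885 0.40
"""

hits = [parse_domain_row(row) for row in ROWS.splitlines()]

# Hypothetical cutoff: keep domains whose independent E-value is below 0.01.
I_EVALUE_CUTOFF = 0.01
confident = [hit for hit in hits if hit.i_evalue < I_EVALUE_CUTOFF]
```

With this cutoff only domain 2 is retained, covering protein positions 166 to 323. A different threshold would change the selection.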

Sequence Information

Coding Sequence
ATGTTCAAAACAGAACTAAAAAACCTTGGTTTATTAGCCACGTGTGAGGAGTCTCTATCTGTTTTTTTTAATGTCCCAGGAAAAGAGGCGCGCGACCGCCGCGACGGAAATAGACGCCCCCCGCCCCGCCGCCCGCGCCGCGACGCCGCAAACGATGACGATCGACGATTTGTGAAGGGCAACAATGATCGAATGAGCCAGCAGATCCAGACGAAGCTGGGGGACTACCACCTCGCCCAAACCCTCCTCGACGACCCCAGCAAGTCGATTGGAATCTGCGCCGAGCCAGCCAGCCCAGCACCCTGCCACCCTTCCCGTCGAGAGGGCGAGTTTAAGAAACCTGCACACGCGCCGCACGGGCCTACCAATGGACGCGTGCATCACCGCCCTTACATTTACAATAAGAGCGAAGGTCTCAGTTCGACCGGCGCGGCCGCGCACCGACCGCCCCCCTTGAGGATACCCAACGGCTTTCAAGGACAGCATAACATCGACAGCAGCAGTCAACCGCCTATTGAAAGTATTCTTAAGGAAATGAAAGAGCTGCCCACTCCTCTGTCGGTGATAGCGGCCACGCCTCGCAAGGAACTCGAGAATAAGTTCATCTTTAACCCTTACACCAATAAGGTACAAGAAAACCCCCAGCTGGTCAACAAAGATCTCAAAACACCCATCACAAAACCCGGTTTAGACAACCGGATATCACATTCCCGGAGTTCTGCTTCGCCTGATGTTGTGAACAAGGACTTGGGTCTCTCGGAGAGCGATGATGAGACGCCCGCCACCACCACCAGACTTGAACCTATCCTATCACCGATCGGTTCGGGTGGTAGTCCGAGTTCTGGCAGTGAGTCATCACAAACTGACTCCGAGTCGGAGTCGTCGTCGGCCGAGTCGACAGAGGCGCCGCCTACCGTCGCCGTGCCGCAACCTCCCGAGCGCACGTCGTGGGGCCTCTCCAACTTCGTGCCGCCGTCCACGCAGCTACCCGACCACACGGAGTCGTATACCGCGGGAAAGGAGCTCAGGCATGCGTTGGACGATGTTAAGAGGAAACCGAGTCCAATTTCGGAGTTGTCGGATTCGGAGACGTCGTCCCCGCTAGCGGGCGCTACGCGGCGCCGGCGCTCGGTGGCTGCGCGGGTGTCCATCGCCTCTAGCGACGATGAAACACCTAGGCCGCATCATCAGCGGTCCAATGAACGGCGAAACGAGGAAGCCATCGCGCGGGTGTCGCCGCCGCGGGAGCCCGAGACACTAACGAAAAGACGAGAGGCCGAAAGAGTCGCGCACGAGCGGATGCAAGAACGAGTCGCCAACGAACGGCCCGATCGCGTATCCGAAAGATCCGACCGCGTCGATCGTATCTCCGACCGGATCTCCGATCGCATCTCCGACCGACTCGCGGATCGAACGACCAACGACAGGCTACCGCCGGATAGGAACGAGCGACACGCGGAGAGAATACCCAACGAACGCCTGCAACCGGACAGACTGCAACCCGTCCAACCGGACAGGCCGCAACAGGACCGCAGATCGATCGAGAGGTTCCAAGAACGGTTTCAGTTGCAGCACGAACGGATCGCCTACGAACGGTTGCAGGCCGAACGGTTGTCCAACGAGAAAATGGCGTTGGAGAAAGCGAACGAACGGTTACAACCGACTGTCGAGAGGATGTCGACGGATCGACTGTCCAATCAGATCGGGGATAAGTTGGATCGGTTGGCGGAGAAGCGGGAGAGACTTTCGGATAAATCGCACGATCGATTGTCGGATAAGATGTCTGTGGATTGTTTGCCGGGTGATAAGATATCACTCGACAGGGTCAATATATCGGATCGGATCGTGCAGGAGCGGTACGTTAGGGCGGAGGCCGATGTCGGATCCGGATTGAAGTCCGGTCGGGGATCGGAAAGACTGGAGGATGCCTCTTCCAATGTCCGCGGTCGTCCGCGCGTGCTAGTCCGCATCCCGTTAAGTCGACTGCGGTCGGACCGCTT
GCGGACACTGCAGCGGCCCGCGCGTCGCCCGCCCGCGCCACGTGTGGCCACGCCGCCGCCCGACCAGGGCGTGGAAGCGATGGAGGTCCTGCGGAGTGGGGGGAGGGACACCGGCCGCTCGCCTGTCTACTACTCATACTTCGAACAACTACCCCCGGACACGCTGTCCGATGATGACCGGGATCACACATACTACATGGACGAAGCGAAACGAATGCGCTCAGCGGCTGAAAGCGAACAAGAGCCACTCGCAAGAGTAATGTTGTACCACGAATCGGTACTTTGTTTCGTACTAACCGGTCGACTGCTAGAAATGGAAATGGAGACCAAGAAAGCCTTCACGATATATCGAGAGACTATCGAGTACATAAAGTCGATTCACTCGATGCCGCAGCGGTTTAGGACGTCACCCAAATCGCCGCATTCTACCTTCAGCAAGTTGGATATACTCAGCCTGAGGGTGCAAGCGTTACTGTACCTGCGGATGTTCAGAATGTACAACAGAGAAGTCAAAGAGTACAACAAAGTGGTTCAGGAGTACCAACAGAAGCCGGCGTGCGCAGAGCCAGTGTCTCCGTTGTCACCAACGCCATCTAGCGGGTCGCTGCTGTCGGCGGGCTCGGGCTGCGGCTCGGGCTCCGGCTCGGGCTCCAACGGCTACTGCTCGCTGGTGCACTCGGTGCCGGCGCAGGCGCACCTCGCGCTGCTGCAGCTCACCAAGTACTACTCGTTCCTGTATGTCGCGCACGAGCTGTGGGAGCAGGCGGACATGCTGTGCAGGCTGAGGCCTAACCAAGATCTGTTCATAGCAGTGGATCGCAAGTGCGGGCCTCTGACGTTGTTCTCGACGTTTAGGCACCTGGTGCAGTATGTGCGCTACGCGATGTCGCTACTGAAGAGCGCTCCCCGTGAGTGA
Protein Sequence
MFKTELKNLGLLATCEESLSVFFNVPGKEARDRRDGNRRPPPRRPRRDAANDDDRRFVKGNNDRMSQQIQTKLGDYHLAQTLLDDPSKSIGICAEPASPAPCHPSRREGEFKKPAHAPHGPTNGRVHHRPYIYNKSEGLSSTGAAAHRPPPLRIPNGFQGQHNIDSSSQPPIESILKEMKELPTPLSVIAATPRKELENKFIFNPYTNKVQENPQLVNKDLKTPITKPGLDNRISHSRSSASPDVVNKDLGLSESDDETPATTTRLEPILSPIGSGGSPSSGSESSQTDSESESSSAESTEAPPTVAVPQPPERTSWGLSNFVPPSTQLPDHTESYTAGKELRHALDDVKRKPSPISELSDSETSSPLAGATRRRRSVAARVSIASSDDETPRPHHQRSNERRNEEAIARVSPPREPETLTKRREAERVAHERMQERVANERPDRVSERSDRVDRISDRISDRISDRLADRTTNDRLPPDRNERHAERIPNERLQPDRLQPVQPDRPQQDRRSIERFQERFQLQHERIAYERLQAERLSNEKMALEKANERLQPTVERMSTDRLSNQIGDKLDRLAEKRERLSDKSHDRLSDKMSVDCLPGDKISLDRVNISDRIVQERYVRAEADVGSGLKSGRGSERLEDASSNVRGRPRVLVRIPLSRLRSDRLRTLQRPARRPPAPRVATPPPDQGVEAMEVLRSGGRDTGRSPVYYSYFEQLPPDTLSDDDRDHTYYMDEAKRMRSAAESEQEPLARVMLYHESVLCFVLTGRLLEMEMETKKAFTIYRETIEYIKSIHSMPQRFRTSPKSPHSTFSKLDILSLRVQALLYLRMFRMYNREVKEYNKVVQEYQQKPACAEPVSPLSPTPSSGSLLSAGSGCGSGSGSGSNGYCSLVHSVPAQAHLALLQLTKYYSFLYVAHELWEQADMLCRLRPNQDLFIAVDRKCGPLTLFSTFRHLVQYVRYAMSLLKSAPRE
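The protein sequence should correspond to a translation of the coding sequence. A minimal sketch for checking this is shown below, assuming the standard genetic code (NCBI translation table 1), which this record does not state explicitly. The function name `translate` is hypothetical, and whitespace is removed first so that a coding sequence wrapped over several lines is read as one string.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence up to, and excluding, the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # X for unknown codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First ten codons of the coding sequence in this record.
prefix = translate("ATGTTCAAAACAGAACTAAAAAACCTTGGT")
```

Here `prefix` matches the first ten residues of the protein sequence above. Passing the full coding sequence and comparing the result with the full protein sequence would extend the check to the whole entry.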

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-