Aper007169.1
Basic Information
- Insect
- Antheraea pernyi
- Gene Symbol
- Smad4
- Assembly
- None
- Location
- GWHABGR00000029:3932477-3953276[-]
Transcription Factor Domain
- TF Family
- MH1
- Domain
- MH1 domain
- PFAM
- PF03165
- TF Group
- Unclassified Structure
- Description
- The MH1 (MAD homology 1) domain is found at the amino terminus of MAD related proteins such as Smads. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain shows that a highly conserved 11 residue beta hairpin is used to bind the DNA consensus sequence GNCN in the major groove, shown to be vital for the transcriptional activation of target genes. Not all examples of MH1 can bind to DNA however. Smad2 cannot bind DNA and has a large insertion within the hairpin that presumably abolishes DNA binding. A basic helix (H2) in MH1 with the nuclear localisation signal KKLKK has been shown to be essential for Smad3 nuclear import. Smads also use the MH1 domain to interact with transcription factors such as Jun, TFE3, Sp1, and Runx [2, 1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 1 6.8e-31 2.8e-27 95.1 0.2 2 83 33 113 32 122 0.94
Sequence Information
- Coding Sequence
- ATGAATACTACAGCACCAACATCAGCAGATGCTTGCCTCAGTATTGTCCACTCACTGATGTGCCACAGGCAGGGTGGAGAGAGTGAGGGATTTTCAAAAAGAGCCATTGAATCACTTGTAAAAAAATTGAAAGAGAAGAGAGATGAGCTGGACTCATTGATCACAGCTATTACTACGAATGGTGCACATCCAAGCAAATGTGTCACTATACAAAGAACTCTTGATGGTAGATTGCAGGTCGCTGGTCGCAAAGGGTTTCCTCATGTAATTTACGCGCGTATTTGGCGGTGGCCAGATCTACATAAGAATGAGCTGAAACATGTTAAGTTTTGCTATAGGACGCTGGCGGGTTCGGCGTGCACTTGCACTGCCGTCGGGCGTGTTGGTAGGGCATTTGCCTTGCCTCCTACCGTCGATCTCGTAATCCCAGTTGGAGTGTCGTGGTCATGTCTCGCGGCTCCAATGGAAACGAGGGTCTTGGCTTAATCGCCCGGACCGCGAGGAAGACAACCTCTATAAAAAATTACCCCAATTCTAAGCGGCGGCGCACCGCGAGAGGATGCATGGCTGATGGGAGATCAGTCTCTAGTGCGACCCCCGTCAGAGGACCGGCCGACTCTCTTGACGGGTTACGCTAGGCTTACCTAGGTACAGGGGGCACTGCCTGGGTGGACCGTCTTTTCCCCCATACTCGTGGGAATAAAAATGGACCATAATTCTTTTATTTATTTAAAATCAGGATGTAAGAATGGCACGGGAGTTACCACCCGTGCGGCGGGTAGAGCAGAAGGAATGGAGGTGGCCAAGGCGGATTCTGTCACCAAAAAGAAAGCGGCCAAGATAGGCGAGAAGCGCAATAGTGTGTCCCGAGTGTTTGACTCGGATACCGAAGTGGAAGAACGGCAGGGGCCGAAGGTGTCATACCGCAAGAGGGGTTCGAACAGGGGACGCCCCCGGTCACGTGGTGGAGGACGGGGAGTCATGGGGGAGGCCGATGGCTCCTATGACAGAGCGTTATCCGGAGCAATGTCTGGACACTTGAGCGTGAGCCTGCCAAGCTCCGCGGTTGGCTCCATTGCTGGCTCTTCAGAGGGATCCAGAGCCTGCTCTGTTGTAGGATCACGTGCTGGCTCTGCGGCTCACTCGGCCGTAGGTTCCGGGGCAGAGAGTAAGCCCCCTGCACCTCCAATAGATATTGGCGCCCTCGACGCCGACTCGCTGAGTGCCCTTGCAGGGGGCAAGATCGCGGAGATATTTGACGTCGTGAAGAAGTCCAAGAACCTGCAGGGACCTTGCCGCGGTAAGCTGAAAGGGGCGGCGAGGATCCTGCAGGAGGTGGTGGAGTCCTTGGCTGAGAGAACAGAGGCGGAGGAAACCCTCAAACTAAAGAGGGAAACCAAAAACCTCCGCCGCCAAGTGGACGACCTCAAGGCGGAAATCATTGCCTGGAAGAGAACGGTGGAGGAGGCGAAGAAGATGCCTCCACCACCCCCCGGCAGCCACTGGTGGTCGAGGCTCCAAGCCTAAACTGGGAGGAGGCCCTCGAAGGGGCAGTTTCGCGGCTACAGAGGACCCTCCAGGCGTCCTTTAGCAACATGCTAAACGCGCGCCTGGAGAACCTGGAGAGGCGCTTGCCCCCTGAAGCTCCCGTTCGTCCGCAATTAGCAGCGGATAAAAAGAAGGGGGTTCAAGCGGCCCCACAGCCTAAACCGAAGCCTCTGGCGGTCCCTCAGCCTAAGCCGAAGAAGGCTCCGGTGGCTTCTAGGGGGACCTCTTCTTCGAGGGGGACTGGGAAGGTAGTAGCGGGGCCTTGCACTGCCCCTCTGGGTGTGGAGTGGGCAGACCTTCCTGCCCCAGCTGCGACTGTGCCTGGAGCAGACTGGGCCACAGTCATCAGCCGGAAGAAGGCAAGGGCAGCGAAGGTAGCTGCGCCTAACACTACAGCCTCGGCGGTGGCTGGGAGGCCTATAGCCAAATTGCGGGCGAAGAAGCTCCCGTCCGCTCCTCGCTCTGCTGCTGTGGTAATAACCCTGCAGCAGGGGGCGGCGGAGAGAGGAGTGGACTATAAGGCCGCCCTCCTGAAGGCACGGGAGCAGATTCCCCTCGCAGAGATGGGAATCACTGCTCCCAACGTACGGAAGACGGCCACGGGAGCTAGGCTTATAGAAGTGTCAGAGGGTGCCCAGAAAGCCGAAGAGCTTGCCCGGAGGCTGCGTGAGGCCCTGGCTGATATCGCCACGGTCGCCAGACCGGTGAAAAAAGCCGACATCCGGATCGTTGGCCTTGACGATTCGGTGTCGCGCGAAGAGGTTCTGACGGTTGTGGCGGCTAAAGCCGGGTGTGAAGAGACCTTAATGAAGGCTGGCGAAATTAGGCGCAGTGGGGGAGGCCAGGGAGCTCTGCACATTACCCTCCCGGCGACGGCGGCTAAAGAGCTCTTCCAAGTCGGCAAGATCTTGGTCGGCTGGAGTGTGGCTCAGGTAGTCCTGTTGAGCGAGAAGCCCATGCGCTGTTACAAATGCATGGGCTTGGGGCACACCCGCCCCATGTGCCCTTCGTCGTCAAACAGAGGGGCGGACTGTTACCGATGTGGTGCGTCGGGGCACAAGGTGGCCACCTGCACATCGGCTGCTGAGAAGTGCACCGTATGTGCCGATGCAGGTAGGCCCTCAGGACATCAGATGGGGGGCCGCAATTGCAAGCCCCCCAATGTAAAGGGAAGGAACGCCCCCGCCCCCGCAACGATAACAGTCCCAAGAGTTGAAGCCGCCAGGCAGGAGGAAGCTGAAATGGAACAGTAATGGACCGCACCCATTACAGTTTCCTCCAGACTAACCTAAACCACTGCGCCGCTGCACAGGACCTGTTGGTCCAGTCAGTGGCGGAGTGGGATATTGGGCTGGCGGTTGCCGCGGAGCCCTATTATATCCCAACCCAGTCACACTGGGCTGGGGATAGGAACGGCTCCGTGGTGATCTTGTCTGGTTCGGGTTCGTCTCTGGCGGTCCGAGAGTCGGGGGCCGGGTTCGTGGCAGCCGTCTGGGGAGAGTACGCCATTGTGGGGGTGTACTTCTCCCCTAACAGACCTCTGTGGGAGTTCGAAGACTTCCTTGACTCCCTGCAGGGGGTTGTAAGGCGGCTCTTGCCGCTCCCTCTAGTAATCCTAGGTGACTTAAATGCTAAGTCCAGGACCTGGGGAGCGGCGGCCACGGACTTGCGAGGGATGGCCGTTGAGGAGTGGGCAGCCGAACTGGGGCTGTCTCTCCTCAACCAGGGGAACGTCCATACCTGCGTGCGGCAGAACGGGGGATCAATAGTGGATGTCTCTTTCGCTACCCCGTATGTCGCGCGCCGTGTCCGGGGGTGGCGTGTGCTGGTGGGGGTGGAGACCCTCTCCGACCATCGGTACATAAGGTTTGAGGTCTCCACCTCCCAAGCCAGTACTGGTACACGCCCCCGGAGGGGCGCGAAGTTCCCCTGTTGGGCGATGAGCCGCTTCGACCGCGAACTGGCGGAAGAGGCGGCCATCGTCCAGGCCTGGGCCATGACACCCCCGACTGTGGAGGATGTCGACGCGTCTGCAGCCCGCCTCCGTGGTGTCCTCACGGAGGTTTGCGACCAGTCGATGCCGCGGAGTACGCACCGCCCAAACAGACGGGCGGTGTACTGGTGGACGTCGGAGATTGCCGATCTTCGTGCTGCCTGCTCCGCGGCCCGACGGCTATTCGGGCGCAGCCGTCGTAGGCGACACCGAGACCCTGAGGAGGAAGCCCGGCTGTATGGCAGCTACAGGGAGCGCAAGAAAATCTTGCAGGCTGCCATCAGCCGGGCAAAAGAAGAGGCATGGGAGGAATTGCTCCAGACGGTAGAGGGGGATCCGTGGGGGCGTCCGTATAGAATGGTTAGGAAGAAGTTACTGTCCCGGGCGCCCCCAGTGACGGAAACCCTCCGGCCCAAGGTGGTGGTGAGGTTGGTGGGAGAGTTGTTCCCTAACCCTCCGGTGCTGCCACCTCCTCCCATGCCTCCGCCTCCACGGGTAGTAGTTGCGGAGGTGCTTGATGGCCCTCCCCCCCTTATAACGGAGGAAGAGATGGGGAGAGCGCTACAGCGCCTCCGCAATCGAAGGAGAGCGCCCGGTCCGGATGGTGTCCCGGGTCGGGTGCTCGCGGTCTCGTTGGACCATATAGGGGGGTGGTTTCGGGGACTACTCGACCGTTGTCTGGAGGTTGGGCGGTTCCCGAAACCCTAGAAGCAGGGGCGGCTATGCCTGCTGGCAAAGGTGGGCCGCCCCCAGGACAGCGCGTCCGCGTATCGGCCCTTGGTGCTGTTGGACGACTCCGGGAAGGTCCTGGAGAGAGTTGTTGCTTTCCGGGTCGTCCAGCACCTGAAGGAAGTTGGGCCGAACGTGGCGGACGCGCAGTTTGGCTTCCGGGCAGGGCGTTCCACAGTGGACGCCCTGACCCTTCTTAAGTCGCTGTCTATGGAGGCGAATTCCCAAGGGGAGAGACTTCTGGCGGTGTCCTTGGACATCGCCAACGCCTTCAACAGCCTACCGCACAGCGTGATACGGGAGGCACTCCGGTATTTCGGTGTGCCTCCGTATCTGCGGCGGCTGTTGGAGGCGTATTTGGTCGATAGGGAGATCGCCTACATAGATGGCGACGGTCATATGGTCCGGCGAGTCACGCGGCGGGGAGTTCCACAGGGGTCGGCCTTGGGCCCACTCTTGTGGAACATGGGCTACGACTGGGTGCTCCGCGGCGTGCTCCCGTTCGGAGCGCGCGTAATTTGTTACGCGGATGACACCCTAGTCGTAGCCCGGGGGGAAAGTTGGCAGGAGGTGGAGCGCCTGGCCACGATGGCCACAACCCTCGTGGTCGGGCGCATTCGGCGTTTGGGTCTCAGAGTGGCCCTCGAGAAGACCGAGGCCCTACTATTTCATGGGCCTCGGCGAGGGCCACCACCTGGGGCCCACATAATGGTGGAGGGTGCCCGAATCAGGGTGGGGACCCAGATGAAGTATCTGGGCCTCACCCTGGACGGACGCTGGCGGTTTGAGCCCCACTTTGAGGGGCTCAGACCGCGAATTGTGGGGGCGGCGGCGGCACTCGGCCGACTGCTCCCCAACATAGGGGGCCCGAATGCTGGGTGCCGCCGACTATATATGGGGGTCGTACGGGCGATGGCCGTCTATGGTGCCCCGGTGTGGGCCGACGCTCTTGGTGCCTCTAATGTGAGAGCGCTGCGCGCTGCGCAGCGCATAGTGACTCTGAGGGCGATCAGGGGATACCGCACCGTGGCCTCCGCGGCGGCTTTTGCGCTCGCCGGCTCCCTGCCCTGGGAGCTGGAGGCGGGAGTTCTTGCCGCCGTGTACGAATTTACGGCGGCAAGAAAGAGTATCGGGGAGCCTCCGGCTCCCAGCGAGATTGAGGCCAAGAGAAGGGAGGGCCGGGAGGAGGCGAAGGAACGATGGAGGCGGGACCTGGCGGACGAGCGGTATGGACGTTACACTATTGACGCCGTCCGCCCGGTCCTGCCGGATTGGATAGGGAGACGTTGGGGCTCCCTGTTGTTCAGACTGGTGCAGGTACTCACCGGGCATGGTTGCTTCGGGTGGTACTTGCATCAGATCGCTCGTAGGGAGCCAACCGCCGCATGCCATGAATGCGGCGCTGCGGAGGATACCGCGCAGCACACCCTGTCTGAGTGTACTGCGTGGGAGGGACAGCGCCGCATTCTGACTGCGGCGGTAGGGCTGGATTTGTCGCTGCCGAGTGTGGTTTCGGCCATGCTCGGCAGCGAAGAGGCATGGATTGCGATGGCCACCTTCTGCGAGGACGTCATCTCGCAGAAGGAGGCGGCGGAGAGGGTCCGCGAAGATGACCCTCTGAGTGGCCGTCTCCGCCAACGGAGGACCGGAGGGAGGAGGCGGCGATTTGCCCTTCTCCTCTCACCACCATAGCCGGACCTGAGGGGGTGGGCCGGGGGGCCCATCTCCTCATTGCACGGGTCCGGTCGGCGAGGAGAGGCAGCTGGTGTGTACCCAGTCTGTCTCCCTATTAGGATGGAGCCCGGCGGCCACCTTCCGCCGGGCGGACTGCTGGGGCGGCTATCTAAGGGGAATAGCTTAGTTCGATCCCTTGCCGCCCCAACAATACGGTCGGACGGTACACCGTGGGGTTTTAGTCTGTGCGAGTCGGACATAACCCATCCTCTCCCCCGAGAGGGTGGCGTATCCATAAGGGATTTCCCCACGTAAATAAAAAAAAAAAAAAAAAAGGCTCGCTTACTTAATTGACATGATGCTAAATTGCAAATTTATATCTTGCTAAAAAATACCGCTCAACGCGCCTCAAAAAGTTTTCACTTCAAAAATAAATAAGATAAATTAAAACCAATTTCTTTTGCCTAATCGCTATCTCTCATCCTACTGATAATTTTAAAAAAAATCGTATGTGTGGTTCGCACGTAGTAGAAGTGAAACTCTAAAAAACTGAGAAGAGTATATCCCTCCAGTGTCCTCCAATACATACAGATGGGCGAGGCTTTGATCACTCTATGTGACACAATGTTTACCCCTACTTTTCGTCACATAAAAAGATTACGGATCAGAATTTCAAAGTCCTTTCATAAGAAGGTTTACTCCGAAAACAGCGTAACAGAAAAAAGTATAATTATCATCAGAAACAATGATGCTATAATTTTTAGTTATCTCTGTTATAGGATATTAATATCACCTGGTACAATACTTAAAATGTTCATTTTCCTTATGAAACAAATGTTGATTCGAAGACGTAAAGTAGAAATCTAGAAGAAATTAGAAAAATCTGTAAATGTACATTTAAAATTGTACTGTTAGTTAACCTGCAGTAATGATAGATGAGGCAATTAAGGCCATACAGACCTCTATCGTGTTTATGTATTAAGTTTTAAATTGTCCCTGGTTGGAGTACGCTGATTTAATTGTAAATTATTTTCTGGTTTTATTTTATATAATTAGAAAAAATGTAATCTCGATTTTATAGCGTCAAAATAGATTCATTGTTTAATATCGATATTTGACATATTCGATTTCAAATCGACATCCCTACGCTGAATTCTTTATTAGCCATTTTGAAATGCTTTTTTTTCGTTTGCTTTGGAGCTACCGTGTGATTGGTTTGATTATTGTTATTTAGTTTTTGTTGTCTAAATTTTAATTAAGTCTTGTATTACTTTGTTTTCGTAAAATTCATATTGTTAAAATGGCTCAGGAACACAAAGAGGAGCCGAAAATCTTGTATATTTGGATGCGATAAACTTGGTAAGTATTACTGATCTTACAATCATAATATTTGTTATTCAATGGCCATGGCCATAGATAAGATACTATATACTATATATGGCCATGGCAATGCCATTTTGTGCATAAAATATTACATAAAACCCTTTTCAGATACACCCATCGTTTTCCTAATCCAATAAATCACCATGAGTGGTATGCAATTTGAGTCGGGATCAATTATTATTAGGATGGGATTAGGGATGATCGATATCCATCGATGTTGACTAAACATCGATGTTCAGCAGGAAAACATCGATGTTTTTACATCGATGATTTGATTTCGATAACATCGACACATCGATGTATTTGGAAAAAAATATCGAAATGTCAATTTATCCTTTAGGGGCATTTAGCGTTGGATGTACTAGATAGAGACTAGATAACTATCAAGTTCAACTATTTTACACTAGATGACACTTATCAAAGTAATCAAAGTAAGCCACACGAATAGTTATTTCTTTGCATTTTCTCACTGTTGTCATTTTAGTGAGGTGGTTAGTCTCTAGATCTAGAGGATTTTGGATATTTTTAACGGCGTTTTTTTTAGTAATCTAGTAGCTTAAAAAAAATTATCGAATTCTAGCGACTTTTTGACTAATGAAAATACATAAATATTTTGACATTATTTTCAATATAATTACTGGCAACAGCATGAATTGCTTTTGTTTTTTGGGAATGGAAGGGGGAGGACTTAAGGGAAAACGATGTTAATTTTTTTGCAGTGTGTCCAAAACGTACTGGGAATGTTAAATATTGTATTTCCAATAACATAAACATTTAATATGTACAATAGTTATATTTTCTACCGAAAACATCGATGTAATTAGACATCGATGTTTTGCTGTCGATGTTTATCGTACATCGATACTTTCTATTTGATGTTAACATCGATGTCAACATCGATGTTTTTCTAATATCGATCATCCCTAGATGGGATGATCGCTATGAAAATTTATAATCACAAGAAGATTTGCAATTACCATTTCAAAAGTGAAGAAATAACAGCCGGTCGGTGTATAAGCATATGCATATGGTGCTATACCTTCTCTAAATTTGACTGGTGAGCTTCTTTGCAAATCTAATTTCTAAGCATGCAGAAAATTCTTTTAATAAAGGCAGGCACGCTCAACACATGTATAGTCCGCGCCGAACTTTTTGGCTGTTCTGAATATAGCCGTGCGTATGAGCTTGCGCATAATATTATTAAGATTTGAATACAAAAGAAAGTAAAAGTAGTATTATAGGAATCTAAATTGGCATTTATTTATCCTTAAATTGTCTCGATGTTAGGCATATAATAAAATACCCATTATTACGTAATTACGTAAATTATGATCTACGCAGCTATATTCGGGGCGGCCATAATATTTGACGCGGACTATATATCCGCAGTTTTGTATGCTATTAAGCTTACAGATAATATTGGGAACATATTTAACAATCCTGTGTAAACAGATAGGCACAGTTAGTGTAATATAAGAACTGATTAAGATTTAATTGAATGATTAAGACTATCTTATTTTTTAGGTGCGTTGGAATCAAAGTCAAAGTATTGTGCGTTGGAATCAAAGTCAAAGTATTGTGTACAAAGTGAACACAATTATGCTTCTTGTAGTATTGTATGTATTATTATTGTAGTATTGAAGTAGTCAATAATAAATTACTAATTTGCTCAAGGAAATACAACTGTTCATATTCAATCTTTATTGGAAGAAGATGTTATTCAAGGTACATAATGAATTTTTTAA
- Protein Sequence
- MNTTAPTSADACLSIVHSLMCHRQGGESEGFSKRAIESLVKKLKEKRDELDSLITAITTNGAHPSKCVTIQRTLDGRLQVAGRKGFPHVIYARIWRWPDLHKNELKHVKFCYRTLAGSACTCTAVGRVGRAFALPPTVDLVIPVGVSWSCLAAPMETRVLA*SPGPRGRQPL*KITPILSGGAPREDAWLMGDQSLVRPPSEDRPTLLTGYARLT*VQGALPGWTVFSPILVGIKMDHNSFIYLKSGCKNGTGVTTRAAGRAEGMEVAKADSVTKKKAAKIGEKRNSVSRVFDSDTEVEERQGPKVSYRKRGSNRGRPRSRGGGRGVMGEADGSYDRALSGAMSGHLSVSLPSSAVGSIAGSSEGSRACSVVGSRAGSAAHSAVGSGAESKPPAPPIDIGALDADSLSALAGGKIAEIFDVVKKSKNLQGPCRGKLKGAARILQEVVESLAERTEAEETLKLKRETKNLRRQVDDLKAEIIAWKRTVEEAKKMPPPPPGSHWWSRLQA*TGRRPSKGQFRGYRGPSRRPLATC*TRAWRTWRGACPLKLPFVRN*QRIKRRGFKRPHSLNRSLWRSLSLSRRRLRWLLGGPLLRGGLGR**RGLALPLWVWSGQTFLPQLRLCLEQTGPQSSAGRRQGQRR*LRLTLQPRRWLGGL*PNCGRRSSRPLLALLLW**PCSRGRRREEWTIRPPS*RHGSRFPSQRWESLLPTYGRRPRELGL*KCQRVPRKPKSLPGGCVRPWLISPRSPDR*KKPTSGSLALTIRCRAKRF*RLWRLKPGVKRP**RLAKLGAVGEARELCTLPSRRRRLKSSSKSARSWSAGVWLR*SC*ARSPCAVTNAWAWGTPAPCALRRQTEGRTVTDVVRRGTRWPPAHRLLRSAPYVPMQVGPQDIRWGAAIASPPM*REGTPPPPQR*QSQELKPPGRRKLKWNSNGPHPLQFPPD*PKPLRRCTGPVGPVSGGVGYWAGGCRGALLYPNPVTLGWG*ERLRGDLVWFGFVSGGPRVGGRVRGSRLGRVRHCGGVLLP*QTSVGVRRLP*LPAGGCKAALAAPSSNPR*LKC*VQDLGSGGHGLARDGR*GVGSRTGAVSPQPGERPYLRAAERGINSGCLFRYPVCRAPCPGVACAGGGGDPLRPSVHKV*GLHLPSQYWYTPPEGREVPLLGDEPLRPRTGGRGGHRPGLGHDTPDCGGCRRVCSPPPWCPHGGLRPVDAAEYAPPKQTGGVLVDVGDCRSSCCLLRGPTAIRAQPS*ATPRP*GGSPAVWQLQGAQENLAGCHQPGKRRGMGGIAPDGRGGSVGASV*NG*EEVTVPGAPSDGNPPAQGGGEVGGRVVP*PSGAATSSHASASTGSSCGGA*WPSPPYNGGRDGESATAPPQSKESARSGWCPGSGARGLVGPYRGVVSGTTRPLSGGWAVPETLEAGAAMPAGKGGPPPGQRVRVSALGAVGRLREGPGESCCFPGRPAPEGSWAERGGRAVWLPGRAFHSGRPDPS*VAVYGGEFPRGETSGGVLGHRQRLQQPTAQRDTGGTPVFRCASVSAAAVGGVFGR*GDRLHRWRRSYGPASHAAGSSTGVGLGPTLVEHGLRLGAPRRAPVRSARNLLRG*HPSRSPGGKLAGGGAPGHDGHNPRGRAHSAFGSQSGPREDRGPTISWASARATTWGPHNGGGCPNQGGDPDEVSGPHPGRTLAV*APL*GAQTANCGGGGGTRPTAPQHRGPECWVPPTIYGGRTGDGRLWCPGVGRRSWCL*CESAARCAAHSDSEGDQGIPHRGLRGGFCARRLPALGAGGGSSCRRVRIYGGKKEYRGASGSQRD*GQEKGGPGGGEGTMEAGPGGRAVWTLHY*RRPPGPAGLDRETLGLPVVQTGAGTHRAWLLRVVLASDRS*GANRRMP*MRRCGGYRAAHPV*VYCVGGTAPHSDCGGRAGFVAAECGFGHARQRRGMDCDGHLLRGRHLAEGGGGEGPRR*PSEWPSPPTEDRREEAAICPSPLTTIAGPEGVGRGAHLLIARVRSARRGSWCVPSLSPY*DGARRPPSAGRTAGAAI*GE*LSSIPCRPNNTVGRYTVGF*SVRVGHNPSSPPRGWRIHKGFPHVNKKKKKKGSLT*LT*C*IANLYLAKKYRSTRLKKFSLQK*IR*IKTNFFCLIAISHPTDNFKKNRMCGSHVVEVKL*KTEKSISLQCPPIHTDGRGFDHSM*HNVYPYFSSHKKITDQNFKVLS*EGLLRKQRNRKKYNYHQKQ*CYNF*LSLL*DINITWYNT*NVHFPYETNVDSKT*SRNLEEIRKICKCTFKIVLLVNLQ***MRQLRPYRPLSCLCIKF*IVPGWSTLI*L*IIFWFYFI*LEKM*SRFYSVKIDSLFNIDI*HIRFQIDIPTLNSLLAILKCFFFVCFGATV*LV*LLLFSFCCLNFN*VLYYFVFVKFILLKWLRNTKRSRKSCIFGCDKLGKYY*SYNHNICYSMAMAIDKILYTIYGHGNAILCIKYYIKPFSDTPIVFLIQ*ITMSGMQFESGSIIIRMGLGMIDIHRC*LNIDVQQENIDVFTSMI*FR*HRHIDVFGKKYRNVNLSFRGI*RWMY*IETR*LSSSTILH*MTLIKVIKVSHTNSYFFAFSHCCHFSEVVSL*I*RILDIFNGVFFSNLVA*KKLSNSSDFLTNENT*IF*HYFQYNYWQQHELLLFFGNGRGRT*GKTMLIFLQCVQNVLGMLNIVFPIT*TFNMYNSYIFYRKHRCN*TSMFCCRCLSYIDTFYLMLTSMSTSMFF*YRSSLDGMIAMKIYNHKKICNYHFKSEEITAGRCISICIWCYTFSKFDW*ASLQI*FLSMQKILLIKAGTLNTCIVRAELFGCSEYSRAYELAHNIIKI*IQKKVKVVL*ESKLAFIYP*IVSMLGI**NTHYYVIT*IMIYAAIFGAAIIFDADYISAVLYAIKLTDNIGNIFNNPV*TDRHS*CNIRTD*DLIE*LRLSYFLGALESKSKYCALESKSKYCVQSEHNYASCSIVCIIIVVLK*SIINY*FAQGNTTVHIQSLLEEDVIQGT**IF
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -