Mnit003514.1
Basic Information
- Insect
- Musotima nitidalis
- Gene Symbol
- TBX20
- Assembly
- GCA_949126915.1
- Location
- OX421415.1:14873903-14889306[-]
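The location field above packs the contig accession, the coordinates, and the strand into a single string. A minimal sketch of splitting it apart, assuming the `contig:start-end[strand]` layout seen in this record and 1-based inclusive coordinates (the record does not state the coordinate convention). `Location` and `parse_location` are illustrative names, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Split a 'contig:start-end[strand]' string into its parts."""
    match = re.fullmatch(r"([^:\s]+):(\d+)-(\d+)\[([+-])\]", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return Location(contig, int(start), int(end), strand)


loc = parse_location("OX421415.1:14873903-14889306[-]")

# Genomic span of the locus; the +1 assumes 1-based inclusive coordinates.
span = loc.end - loc.start + 1
print(loc, span)
```

Under that coordinate assumption the locus covers 15,404 bp on the minus strand of OX421415.1.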
Transcription Factor Domain
- TF Family
- T-box
- Domain
- T-box domain
- PFAM
- PF00907
- TF Group
- Unclassified Structure
- Description
- The T-box encodes a DNA-binding domain of about 180 amino acids. Genes encoding T-box proteins are found in a wide range of animals, but not in other kingdoms such as plants. Family members are all thought to bind the DNA consensus sequence TCACACCT. They are found exclusively in the nucleus, where they bind DNA and act as transcriptional activators or repressors. They are generally required for development of the specific tissues in which they are expressed, and mutations in T-box genes are implicated in human conditions that feature malformations, such as DiGeorge syndrome and X-linked cleft palate [1].
- Hmmscan Out
-
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
1   2    0.0015       4.1    8.1   0.2        84     134      1332    1387      1323    1446  0.79
2   2   6.2e-28   1.7e-24   87.5   0.0       113     182      2509    2581      2500    2581  0.92
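The two domain rows reported above can be read into typed records and filtered by significance. This is a minimal sketch: the 13-column order is taken from the header in this record, while the i-Evalue cutoff of 1e-5 is an assumption chosen for illustration and is not stated by the source. `DomainHit` and `parse_row` are hypothetical names.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(row: str) -> DomainHit:
    """Convert one whitespace-separated domain row into a DomainHit."""
    f = row.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *map(int, f[6:12]),
        float(f[12]),
    )


rows = [
    "1 2 0.0015 4.1 8.1 0.2 84 134 1332 1387 1323 1446 0.79",
    "2 2 6.2e-28 1.7e-24 87.5 0.0 113 182 2509 2581 2500 2581 0.92",
]
hits = [parse_row(r) for r in rows]

# Assumed cutoff for illustration only; pick a threshold suited to the analysis.
I_EVALUE_CUTOFF = 1e-5
significant = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]

for h in significant:
    aligned_len = h.ali_to - h.ali_from + 1  # residues covered on the protein
    print(h.index, h.i_evalue, h.score, aligned_len)
```

With this cutoff only the second domain (protein residues 2509 to 2581, score 87.5) is retained; the first hit has an i-Evalue of 4.1 and is dropped.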
Sequence Information
- Coding Sequence
- ATGGCTTCCTACGATGACCTTCTTACCAATCAATCTACAATTCTTGATGCAATTAAAAAGATCAAggataattttatgaaagatgGTCCCTCGCGTAAAACGTCTCTTGAATATCACGATGATAGACTTAACAAGCTAGAAACTTTATGGAGCGAATTTAAGAAAAACGACGTTAAGTTATCATTATTTGATTGCGAAAAGTCAGGGCCTTACTACGAGAATCGTGTGTATGATCGATTAGAACAACTCTACGAAGAAACCAAGTCTATGATAAACAAGTCAAAACAAAGATTGTTAGTTAAGATCGAAGATGAAAATCTTGGCTTCGTTACTCAACACCAAAAACACGGAGAAAGTAGCGCTACATGTACTCAAGAGAAGGATTTAGGACAACATGACGACGAGGAGACCCCACATAATAGTAGGAGAATGTCAGACGAGAGAAAAGACGTAGACCAGCGTACAAGGGACATAATGAATGAACAGCGGTCGAACATTAGAGCCTTTGAGAGGGCGGTAAGAAACATGGATCTGGAGAACGTGAAAGAAAAGTGGCAACTTGAAGATAAGTTGAAGACAGTTACACAACGCTGGGAAAAGGTCGATAGTACCTACTGGAAACTCGACAACATCTTGGACGACACGAGAGCGTTTGAGACAGATTACAACAGATTAGAAAGGCTGTATGAATTCAGCAAGGAACAAATAAACAAGAGAATTTGGGATAGCAAACATCAGGAAAAATCTACGCCTAAAATTGAACTTCCTACGTTCAGTGGCAGCTACGCCCAGTGGAcaacatttaaagatttattcataGAAATCGTCCATCGCAACACTTTCATCTCTAACGCTCAGAAAATGCAACATTTGAAGTCAAAATTGAAGGGAGAACCTGAGCGATTGATTCAACACTTAAACATTAGTTCGGACAATTACGAAACCAGTTGGCAGATTCTTCAGCAGAGATACGATAATGAGAGACTATTATTCACCTACCAGGCTAACATTCTGCTGAACCAACCCAACATACAAACCCCAACGGCGTCTCAGCTGAAAAGAATGCATGATACTACTATCGAGTGCATCGAGgctatcaaaaatttaaattttgatacctCGTCGTGGGACCCCTTggtggtttatattttaatgcaaaagCTGGATCCCGAAACAAACCAGTTTTTTACAGCAACATTGGACAACCCGAGAAGCATGCCAGCTTTAAAGGAGTTCCTGAAATTCATTGAGACCAAATTTATGTCACTGGAATCAAGTCCGTATCGCCCACGTAACACAAACAGCCAACATCAAAATAATCAAACGAAAACAGTTCATAATGCAACATACAGTAAGCCTAAATTGCAACATCAAAATACTCAGGCAGAAActaaaaatttagtaaaaacatttcacaacaaaaagacatttcataattacattataacttGCCCTAGATGTAATAAACAGCATGGTTTGTATAATTGCAAAGAATTCTTAAGCATGTCACCAGAAGAGCGTTTGAAAACTGCTAACGAGTTACGTGTTTGCTTAAATTGCCTTTTTAGCCATAGAGGCGAAAAATGTATGTCGGATAAAAAGTGCAAGCACTGCGAAGAACCGCATAGTAGCATCTTGCATGACGCAATTGAGATTGCAAAGTCGCGcggaaataaatctaatgtaggtCAGCGGTTAAACACGCACCTAGCGCATGACAACTCTAACGAAGTGCTGCTAGCCACAGCTCAAGTTAAGGTGACAGCTATTGATGGTTCTACAGTTACCTTACGCGCCTTGCTAGACCAAGGCTCTCAAGTGAGTTTTATTACAGAAAACGCTGCACAAAGATTACAGTTAAAGCGGAACAAGATGAGCGCGGTGGTATCAGGCATTGGTGCTTCAGCCACCACCAAGTGCAAAGGCGTGCTTCATCTTAAATGTCAAGCGTTATACACCAACttcaacattaacattaatgcaCTTATTATGAAGTCT
ATAACAAACAAACTGCCGAGTAAAACAGTTGATATGGATCATTTTGAGCACATTCAACATTTACAACTTGCTGATCCTGAATTTAATGTGTCTCAGGATATAGATTTATTACTGGGAGCCGATGTTTATTCGGAAATTCTGTTAGAAGGGCTCATCAGATCAaatacatcatccatcataGCTCAACAAACTCAGCTAGGATGGGTAGTGTGCGGCAACTACACACACAAAACATTTAACTGTCATGTAGTTGTCAACAACATTGAAGATCTTGCTCAATTCTGGGAACAGGAGTCATTGCAGGACGACAAGGCAAATACGGACCCAAACGATAAGTGCGAACAATTGTACAAAGAAACAACAACACGACTGCCTGACGGGAGATATCAAGTCCATCTCCCTCTGAAGGACAAGTTCGATGAACACCTGGGAACATCCAAGCCGCAGGCCATCGCCCAGTTTCAACAATTAGAGCGAAAGATGGCAAGAAACCATGCTTTTGCAGAGGAATATGCAGAATTCATGAAAGAATACGAAGAAATGGGTCACATGGAACAAATCAACACTACACACAACAAGATTGAGTACTTCCTACCACACCACGGAGTAGTGCGACCAGGcgctataacaacaaaattgagaGTCGTTTTCAACGCGTCAGCCAAAACAAGTACAAAACACAGTCTCAACGATTTAATGGAGGCAGGACCTAACCTACAGAAAGATATTCTTAGCATCCTGCTGAAGTGGAGACAGTACAAATATGTCTTTACATCCGACATTGAAAAAATGTATCGCTTTATAGCGGTACATCCTGAACATCAACCCTTACAGAAAATAGTGTGGAGAAATTGCCCACAACAACTGCTAAAGGAATATCAACTGTGTACAGTTACGTATGGCACTAAGGCCGCTCCATTCTTAGCGCTCCGAACACTTCAACAGCTGGCGTTGGACGAGGGTGACAAGTACCCGGaagcaaaaaacattttactcaaTCAGTTTTTCGTCGATGACGCGTTATTTGGTAAGGATACCATAGACGAGGCTCGAGTTACGAGAGAccaactaataaatttattaaaagaaggaGGGTTCAACCTTAGGAAGTGGTCAAGCAATGAACATCAATTGTTAGAAGACCTGCCAGAACATATGATTAGCCCTACAAGTTTCAGCTTTACAGAGGCATCAACATCAAAGGCTTTAGGGTTGGCATGGAATCCGAAACaagataaattcatatttacacccacgatcaaaaacaaaaacaacactgAAACtaacaaacactcaacaaagcgTCAAATCTTAAGCGACATATCTACCATTTTTGATCCGCTGGGATTGCTCTCCCCAATAACAATCAAAGCAAAAATCCTGTTTCAAAAAGTATGGAAGCATAATATGGAATGGGATGATGCAGTTCCCGACGACGTAGAACAAGAATGGAAAACACTTAAAAACGAGTTAATAAACATGCAACCTTTCAAAATAGACCGTTGGCTTGGCTTGAAAACGAGCGAAGAATGCGAACTACACGCGTTCTGTGACGCATCGGAAAAAGCGTTCGCCTGTGTTGTGTACAGCAGGGTCCCAGCGGGAATACttaaacatcaaatttcattagTTGCAGCGAAAACAAAGGTCGctccaattaaaaaacaacaaacacttCCCAGACTTGAGTTATGCGGCGCTCAGTTGCTGGCTCAGctggttaagaaaataaaacactgccttgATAAACCGAACATTAAAGTAACTGCATGGACGGACTCCATGATCGTGCTTGGTTGGCTGAAAGGAGATATCTCACGATGGAAGGCCTTCGTTGGACACAGAGTACAGGAGATAACATCGGTCGTGCCCGCGGAGAGTTGGAAACACGTAAGATCGGAGCAGAACCCGGCCGACTGCGCATCACGAGGACTATTACCTTCTCAGTTACATAACTTTCCTCTGTGGTTTAAGGGACCCGAATGGCTTAGCTGTGAAGA
CATTCCTGAAACATGTGACACGTATATGACCCCAGAAATAGAAGTCAAAAAGGTGCATATAGTTACGTCTCAAACTACagacaacaaaaacattaacattataacACAACTCCTTATCAAGTACAGCTGTATTATGACTATAATACGCATCGTGGCTTGGGTAATGCGTTTTATTACAAACTCATCACGAATAAATTGTCACGGGGCAGCCCTGCCCATGAATCACCCGtcaaaacaatgtaatttaacaCTCGCCGATATTCGACGcgctacttacataataattaaacaagtgCAGAATGAACACTTTGAACAAGAAATCACGGCCATAAAAACAAAagggaaaattaataacaacattggacttttaaaacttaaccctTATTTAGATGATAACAACATTTTACGAGTTGGAGGGAGATTACAAAATTCCAACTTACCCGAAGACACAAAACACCCTATTATCATACCAAAAAACAGCAAACTGACGCAACTTCTCATTGACCGGGCGCATGCCACAACGCTACACGGTGGAGCGAGATTAACATTAGCTACATTGAGACAAAAGTATTGGATAATAGGAGGAAATAATACTGTCAAAAAACACCTACAAGGCTGTGTGAAGTGCTGTCGTTTCAGACCATCAAAACCTTACCAATTGATGGCTGACTTACCCAGTGAGAGGATAACGCAATCACCACCTTTTTATCACACAGGGGTAGACTTTACAGGATTTGTTGACGTCAAACTCAACAAGGGCCGAGGTGTAAAAACAAGTAAAGGTTACGTCGtggtctttgtatgcatgacaaCAAAAGCGGTACATTTAGAACTCGCGTCGGACTTATCATCCCAAACATTTATACTTGCGCTTAATCGCATGATTGCGCGACGAGGCTGCCCGAAACACATGTACTCCGACAATGGAACAAACTTTGTTGGAGCCGAAAAAGAATTAGCTAAACAACTACACATTGCTCAAACATTCAACAACGACACTTTGTCGAGAAAGAATAGTGAAATGGGAATAGAATGGCACTTTAACGCACCGAGCTGGCCGACTGCAGGCGGACTCTGGGAGGCTGCAGTTAAGTCTATGAAATACCATCTAAAACGAGTTTTAGGAGAACAAAAGTTAACATACGAGGAGTTCCTAACACTGCTCACCCAAATCGAAGCGTGTATGAACTCTAGACCACTTTGCCCTTTAACAGAAGACGTTGAGGATATCGACTACCTTACACCATCACACTTTTTGATCGGCCGACCAGCAACTTTCCTACCACATGAAAACATCGAAGAGCAAAACATTAGCATTCGACAAAGATGGCAGTTGCTTAACAAAATGCAGCAAGGGTTCTGGGCCAGATGGTCTTCTGAATACCTGCAACAACTTCAAGTTCGAAGTAAATGGCTACATAAAACGATAAATTTAACAATCGGGGATATTGTCTTAATCAAGGAGGACAACCTGCCTCCAGCAAAATGGGCTCTTGGCAGAATACAAGAGGTTCACCAGGGAAAAAATAATCTGGTTAGGGTTGtaacattgaaaacaaaaacaggCTTACTAACAagaccaataacaaaattagttaAACTCCCGATACCAGAGTCTCAAACAGAAAAACCTGATACCGACACCTCTAGTATAACAAACCaggataataaaaaaccaacatcaaacaataacaatcaaaaaccaccatttaaatcaaagaaaacaagTACCAGCTTGCTCTGCAtgcttataacattattaacattGTTTCAACCTTCTACACAAAACAGTAACGTCATAAACATCACGCCGTTTGAACCAAACAGAGCTATCTACTTTGACAAAATCGGCAATTTACAAACCGTGCATGACGACTGGAAGATTGTTACCTACTATAACATGACGTCATATTGGCAAggcataaataacatacatagagTTGTGAAACACCTTGGGGAAAGGTGTGAGCAATTCGATTATAAAACGATGTGCAAAACGA
TTATGACGGAATTCGAACAGGAACTAAACGAACTTGATCATAACAATTACATGCTGGAAACACACCATGGTCAAACAAGAGGACGCATCACCCGAGGTCTTATTAATGGAATAGGCTACCTTGCAAACAGTCTATTCGGTGTGCTGGACGAAAGATTTGCAGCGCAATATAAACAAGACATAGAAACATTGCATAACAACAGAGACCATTTACTGAAACTGATTAAAGGTCAGACATCAATTATTGAGGcgcaaaacaacattttaaagagAAACGAAGAGTCTATGATTACCCAGTTCAATCTTTTATCTCAACACCTCAACACTACTGACGACTACTTACGAAAATTGGGTAAAGCTGTTCAGGAGGAGGAAGAATTGAACTACTTCAACATCATGGCCACCACCACCTCAATAATGTTGTCTAAGTTGCGACACATCCAACACATGCTACTAGACACCTCAACTAATATTCACAATGGTCACGTAGATACCAGACTTCTACCCCAAAGTCAGCTCTTCCAGGAACTGAACATTATTTCTGGACAGTTACCTCAGCAGTTATCTATACCTGTGGACAACATTCAAACCCAACTAGCAGATATATACAAGCTACTTGAGGTCAAATCAAGAGTATTAGAAAATTACTTCATCATAGAAATTACATTGCCTCTAACAGGTGACACACCGTTTACTGTATATAAAACAGTTCCGATACCGATGAAAGCTAACAATAAGACGTCGCTTTCCATGAAAACAAGCTCCGAATACGTTGCAGTTAACATgagaaaagaaacatttatgcaACTCACAGGCAGCGACCTTGACCGTTGCATAAAAATAGAACCAAGCACATATGTTTGCTCAGTCAACAAACCAACACGGAACATGAGGAGTCATCAAATTCCATGTGAAATTGGagccataacaaatattactatAGCAAAATGTGTATACACATCGGAAACTTGTAATAGTAACTGGACACCATTACGTAAACCAAACACATGGCTGTATGCCTGTTGCACGGAGTGTCAACTACAGATCATATGTGAAGGTCGCATGACTTCTAAGGCTATACATTCAACCGGCATCTTAGAAGCCAAGGAAGGATGCATAATAAGAAACAATGAGGAAACATACCACACACAGGTCtattacacaaataacatttacgtcAACCCAGAGATACCTTTATTAGACCTGGAAGGTATCAATAATCTCACATCTTTGCAACCTATTGATTTATCACTTCACCAACCAAACACACAATTCATAAAACACGATTACGACAAACTTCATAATTCATTGAAGGAATTAGAAGAACAGACAAAACAGACTGAAATGTCTCAACACGATGTACACCAATACATCGTGTCATACGTCACGATGGGGCTACTGATCCTGTCAGCATTCGTCGCGGGAGGACTCTACGTGTACTGCAAACGCCGCGGCACCCTCGCCGTCAACAATGCCAACACGACCACCACAGTACCACCACCAAAGCCAAGAAGGGCAGAAAATATAGAAAGGACCATCGTGCTAAACTCCATGCATCGGTACCAGCCCCGGATCCATCTGGTGAAGGTTCGGGAGGGTGGTGGACCCATCGTCGACCTAACCAGAGAACAGCATCGCACCTTCGTGTTCCCGGAGACGGTGTTCACCGCAGTCACCGCCTACCAAAACCAGCTTATTACCAAACTGAAGATTGACTCCAATCCTTTTGCTAAAGGGTTTCGAGATTCGTCGAGACTTACTGATTTTGATAGAGACCCGATGGAGTTAATGCTAATGGAGCAACAGTTGCTCCGGTCGCCGTTGCGTTTATACGCGAACAGTAGCGAAGAGGAGGCGGAGAAGCGCGCGGCGTGGGCCCGGACTCCGCCGGCACTGCAGTTGCTGGCCCTTGGGGGCCGCGCCTGGCCCTCAGCCTGGCCCCAGCCGCTGGCCCTGCCGCCAGACCTCTGGATGCAGAAGCCCCCTACGCCACTGCGATAC
TGCCCCTACCCGCCGCCCCCTCGACCGGACCACGCGCCATCACCACGACATCCTCTGTGA
- Protein Sequence
- MASYDDLLTNQSTILDAIKKIKDNFMKDGPSRKTSLEYHDDRLNKLETLWSEFKKNDVKLSLFDCEKSGPYYENRVYDRLEQLYEETKSMINKSKQRLLVKIEDENLGFVTQHQKHGESSATCTQEKDLGQHDDEETPHNSRRMSDERKDVDQRTRDIMNEQRSNIRAFERAVRNMDLENVKEKWQLEDKLKTVTQRWEKVDSTYWKLDNILDDTRAFETDYNRLERLYEFSKEQINKRIWDSKHQEKSTPKIELPTFSGSYAQWTTFKDLFIEIVHRNTFISNAQKMQHLKSKLKGEPERLIQHLNISSDNYETSWQILQQRYDNERLLFTYQANILLNQPNIQTPTASQLKRMHDTTIECIEAIKNLNFDTSSWDPLVVYILMQKLDPETNQFFTATLDNPRSMPALKEFLKFIETKFMSLESSPYRPRNTNSQHQNNQTKTVHNATYSKPKLQHQNTQAETKNLVKTFHNKKTFHNYIITCPRCNKQHGLYNCKEFLSMSPEERLKTANELRVCLNCLFSHRGEKCMSDKKCKHCEEPHSSILHDAIEIAKSRGNKSNVGQRLNTHLAHDNSNEVLLATAQVKVTAIDGSTVTLRALLDQGSQVSFITENAAQRLQLKRNKMSAVVSGIGASATTKCKGVLHLKCQALYTNFNININALIMKSITNKLPSKTVDMDHFEHIQHLQLADPEFNVSQDIDLLLGADVYSEILLEGLIRSNTSSIIAQQTQLGWVVCGNYTHKTFNCHVVVNNIEDLAQFWEQESLQDDKANTDPNDKCEQLYKETTTRLPDGRYQVHLPLKDKFDEHLGTSKPQAIAQFQQLERKMARNHAFAEEYAEFMKEYEEMGHMEQINTTHNKIEYFLPHHGVVRPGAITTKLRVVFNASAKTSTKHSLNDLMEAGPNLQKDILSILLKWRQYKYVFTSDIEKMYRFIAVHPEHQPLQKIVWRNCPQQLLKEYQLCTVTYGTKAAPFLALRTLQQLALDEGDKYPEAKNILLNQFFVDDALFGKDTIDEARVTRDQLINLLKEGGFNLRKWSSNEHQLLEDLPEHMISPTSFSFTEASTSKALGLAWNPKQDKFIFTPTIKNKNNTETNKHSTKRQILSDISTIFDPLGLLSPITIKAKILFQKVWKHNMEWDDAVPDDVEQEWKTLKNELINMQPFKIDRWLGLKTSEECELHAFCDASEKAFACVVYSRVPAGILKHQISLVAAKTKVAPIKKQQTLPRLELCGAQLLAQLVKKIKHCLDKPNIKVTAWTDSMIVLGWLKGDISRWKAFVGHRVQEITSVVPAESWKHVRSEQNPADCASRGLLPSQLHNFPLWFKGPEWLSCEDIPETCDTYMTPEIEVKKVHIVTSQTTDNKNINIITQLLIKYSCIMTIIRIVAWVMRFITNSSRINCHGAALPMNHPSKQCNLTLADIRRATYIIIKQVQNEHFEQEITAIKTKGKINNNIGLLKLNPYLDDNNILRVGGRLQNSNLPEDTKHPIIIPKNSKLTQLLIDRAHATTLHGGARLTLATLRQKYWIIGGNNTVKKHLQGCVKCCRFRPSKPYQLMADLPSERITQSPPFYHTGVDFTGFVDVKLNKGRGVKTSKGYVVVFVCMTTKAVHLELASDLSSQTFILALNRMIARRGCPKHMYSDNGTNFVGAEKELAKQLHIAQTFNNDTLSRKNSEMGIEWHFNAPSWPTAGGLWEAAVKSMKYHLKRVLGEQKLTYEEFLTLLTQIEACMNSRPLCPLTEDVEDIDYLTPSHFLIGRPATFLPHENIEEQNISIRQRWQLLNKMQQGFWARWSSEYLQQLQVRSKWLHKTINLTIGDIVLIKEDNLPPAKWALGRIQEVHQGKNNLVRVVTLKTKTGLLTRPITKLVKLPIPESQTEKPDTDTSSITNQDNKKPTSNNNNQKPPFKSKKTSTSLLCMLITLLTLFQPSTQNSNVINITPFEPNRAIYFDKIGNLQTVHDDWKIVTYYNMTSYWQGINNIHRVVKHLGERCEQFDYKTMCK
TIMTEFEQELNELDHNNYMLETHHGQTRGRITRGLINGIGYLANSLFGVLDERFAAQYKQDIETLHNNRDHLLKLIKGQTSIIEAQNNILKRNEESMITQFNLLSQHLNTTDDYLRKLGKAVQEEEELNYFNIMATTTSIMLSKLRHIQHMLLDTSTNIHNGHVDTRLLPQSQLFQELNIISGQLPQQLSIPVDNIQTQLADIYKLLEVKSRVLENYFIIEITLPLTGDTPFTVYKTVPIPMKANNKTSLSMKTSSEYVAVNMRKETFMQLTGSDLDRCIKIEPSTYVCSVNKPTRNMRSHQIPCEIGAITNITIAKCVYTSETCNSNWTPLRKPNTWLYACCTECQLQIICEGRMTSKAIHSTGILEAKEGCIIRNNEETYHTQVYYTNNIYVNPEIPLLDLEGINNLTSLQPIDLSLHQPNTQFIKHDYDKLHNSLKELEEQTKQTEMSQHDVHQYIVSYVTMGLLILSAFVAGGLYVYCKRRGTLAVNNANTTTTVPPPKPRRAENIERTIVLNSMHRYQPRIHLVKVREGGGPIVDLTREQHRTFVFPETVFTAVTAYQNQLITKLKIDSNPFAKGFRDSSRLTDFDRDPMELMLMEQQLLRSPLRLYANSSEEEAEKRAAWARTPPALQLLALGGRAWPSAWPQPLALPPDLWMQKPPTPLRYCPYPPPPRPDHAPSPRHPL
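The coding sequence and protein sequence above can be cross-checked by translating the CDS codon by codon. A minimal sketch, assuming the standard genetic code (NCBI translation table 1), which the record does not state explicitly. Only the first 30 nt and the final 60 nt of the coding sequence are used here to keep the example short; `translate` is an illustrative helper. Lowercase bases in the record are upper-cased before lookup.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks, ignore case
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 nt and final 60 nt of the coding sequence in this record.
cds_head = "ATGGCTTCCTACGATGACCTTCTTACCAAT"
cds_tail = "TGCCCCTACCCGCCGCCCCCTCGACCGGACCACGCGCCATCACCACGACATCCTCTGTGA"

head = translate(cds_head)
tail = translate(cds_tail)
print(head)
print(tail)
```

The translated fragments agree with the start (MASYDDLLTN) and end (CPYPPPPRPDHAPSPRHPL, followed by the stop codon) of the protein sequence listed above.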
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -