Tsim023576.1
Basic Information
- Insect
- Tetramorium simillimum
- Gene Symbol
- JARID2
- Assembly
- GCA_011636635.1
- Location
- VBVQ01002926.1:9965-20796[+]
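The Location value above packs a contig accession, a coordinate range, and a strand into one string. A minimal sketch of how it could be split into fields follows. The `Location` type and `parse_location` helper are hypothetical names introduced here for illustration, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'contig:start-end[strand]' string."""
    contig: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Split a location string such as 'VBVQ01002926.1:9965-20796[+]'."""
    match = re.fullmatch(r"(\S+):(\d+)-(\d+)\[([+-])\]", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return Location(contig, int(start), int(end), strand)


loc = parse_location("VBVQ01002926.1:9965-20796[+]")
# Assumption: coordinates are 1-based and inclusive.
span = loc.end - loc.start + 1
print(loc.contig, loc.start, loc.end, loc.strand, span)
```

Under that assumption the genomic span is longer than the coding sequence listed further down, which would be consistent with introns inside the locus.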
Transcription Factor Domain
- TF Family
- ARID
- Domain
- ARID domain
- PFAM
- PF01388
- TF Group
- Helix-turn-helix
- Description
- This domain is known as ARID, for AT-Rich Interaction Domain [2], and is also known as the BRIGHT domain [1].
- Hmmscan Out
-
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  2   1.9       7.1e+03   -2.0   0.2   56        83      149       178     134       179     0.65
2  2   2.1e-23   7.8e-20   71.6   0.1   6         89      1358      1439    1354      1439    0.94
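The hmmscan output above reports two candidate domain hits for the ARID model, only one of which has a small independent E-value. A minimal sketch of how the two rows could be parsed and the stronger hit selected follows. The `DomainHit` type and `parse_row` helper are hypothetical names, the 13-column order is taken from the header in this record, and picking the hit with the lowest i-Evalue is an illustrative choice rather than the database's documented rule.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """Hypothetical record for one per-domain row, in the column order shown above."""
    index: int
    of: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line: str) -> DomainHit:
    """Parse one whitespace-separated domain row into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 columns, got {len(fields)}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]),
        float(fields[4]), float(fields[5]),
        *(int(value) for value in fields[6:12]),
        float(fields[12]),
    )


# The two rows exactly as given in this record.
rows = [
    "1 2 1.9 7.1e+03 -2.0 0.2 56 83 149 178 134 179 0.65",
    "2 2 2.1e-23 7.8e-20 71.6 0.1 6 89 1358 1439 1354 1439 0.94",
]
hits = [parse_row(row) for row in rows]

# Illustrative choice: keep the hit with the smallest independent E-value.
best = min(hits, key=lambda hit: hit.i_evalue)
# Assumption: alignment coordinates are 1-based and inclusive.
ali_length = best.ali_to - best.ali_from + 1
print(best.index, best.i_evalue, best.ali_from, best.ali_to, ali_length)
```

On these two rows the selected hit is the second one, whose alignment covers protein positions 1358 to 1439.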
Sequence Information
- Coding Sequence
- ATGGTTTTGAGTCGCAACgataagagaaaaaggaaggagGGAGACCTGGTGGACCTCATGGAGCCACTTTCAGAATCTCCGAAAAGGACCAAAGTTCATGCACAAAGAAAATTTGCGCAGGGTGCAACCACAGTATTTAATACATCACCTACGCCTGTCAAGGACAAGGAGAAAGCTAAACCTACTATGCTCACTGAATTAATAACTCACAAACGTCCGAATACGGAggattttttaacatttttatgcttCAGAGgAACATCTATTTTGCCACCCagtttaaatttcttcaatattggcaacaaaaaagagaaacaggGACTTAAGCCAGTTCGAACTTCTATAGAAAAAGCATCTTCTACTAATACTTTCACAGAGATTCAGAAACAAGAAACTACATgccaaaaaaatgttacaaaagtAAAGCCTATTGTctcgaataagaaaaaagtagcTAGTACATTACATAAAAACATTACCAAGCTTAAGACAGCTACATCTACAGTCCAAGcgcttaaaaagaaatatcaagaGCAACGCCTTGCTAAACAAagaatcaaaaataaattcaaaaccACTTGTGTTATGAGGACAAGATCGTGTACAGAAAGATCTATGTTGAAGGCTGGTCTTCAGTTGGCGCCACGAAAATTCTTACCTAGAATCAAGGCGAAGAGACATGGCCTAAGAAGTGCTGTATTGCCAGATAAAACTAATAAACCTTTGTCAAAAACAAAAGTGTTATCCAAACCAAAGAAGAAAGTCAATTCTAAAACGAAAAATAGCGATACATCGAATACAGATGCATCATCCGAAGAGAATGATGTCGAGGACAATGAAGAAGAAGAGTTACATGTGGAGAAATCGACATCGCAGATAAAGCGGAACATACAAAGAGGCATCCAAAAAAAGATCTGTACCAGAAGTAAATCAGAGATGAAGAGAGTTACAAGATCATCCAGTAGCACGATCCCCTCGCAGAATCTCTCCAGGAGGCCTACCAGAAAAACGAAGGAAGCAGCCGCTGTATATATGGAAATTCTTGGAAGAAAATTAGTGAGCCCAGATTTAGAAAATGACGATAACATTTCAGTTGAGAGCTTTCCTGAACTACCAAACGCTCGTAGAATAGCGCAAACGGAAAATGAGCTTAAAGCCAAAGTGAAACAGAGCAGTACCAAGACTATCACTAAAACTAAGGACTCGAAAGTTAGCTCCTTCAGTAGTAACGCTAACAAGCGATTGGATAAGAGCAAAAACAACAAGTTGATTTTAAAGAGCAAACGTCTGATGAGAGTGCAGAGATATTGCGAAGAAGATAGCGACGAAGAATCTGAGAGCTCAGTAGAAACGAACAATACAAGACCAATTACCAGACAGAGTCTTGGAAAGATTGATAAACCGATTAGCAGGAGTCTTAGATCATCTTCCAAAAGCACAACTATACAaacagaaatacaaaataacgTAAATAGTAAACCGAAGACTCAGGTTCAAAGTTATGTGAAAACagctaaagaaaaattcgtgATAAAGCGTAAACGTGagcaaaatatatctaaaacatCGTCGGTTAAGATTAAGCAAAAAAGTCTTAAAAGTAATCAAAATGAGAAACCGGAATCCGAAGAGGAAACGCTGGGAATGTTGCTcagtaaaataaagaagaagaaagaaaatggcGAGGAGGAAGTGGAAGGACAAATTCCTGACAATGTCAATCATATAAAAGACGGAAAAGTAACTCGCAGCAACGCGCAAAATGTGAAAGCAGATCTGTTGAGGGTATCGGACGACGAGGAATCTTTTCGTGGGTTTACTAAGAAGGCGATATCAAAAGTATTGAATTCGTGTCAGACACATGCCAATGTTAATTTGCTAGTTACAGAGTCTAATGAAAAGTTGAATTTGTCTAAAACATTGGACGTATCGCAAACAATGGAGCAAAGCGCTACTTTACAAGAAAATAAGGTAGAAACATTATCGATCAAAGCCGGT
TCCCCGGTCGCAAATTCGTTCAACGGCGAGTGCAGCGGATCTGGGCAGGATCAAAAATCGGTATTGAATTTGTCCGTAAAAACGCAAAGCTCGTTGTTGCCATCTATGGAACAGACGAAATGTAATCACGAAAACGAGATTTCTTCCAATCTCATGCCACTATCTAACCTGCACACAAGAAAAGAACGAGTGAATATGTCTACCGAACAGATTGAAAAATGGTTGAACGAAAGCTCGTTTGCCAAAGAAGAGAGTAAGTTAGAAATGGAAAACGTTTCCAGCTTTAGGTATGATTcggcgaaagaaaaattaaaggcGGACGTTTCGCATCTGTCTATCTCGACGAAAATCCAACATTTAGTACGCCCTGTCAACGTCACTCTTTCGAAATTATCGGACAAGACGAATTTGAAAGACCGTGCAGGAATACACAGCCGAAATGTAATATCAGCATCATCGGATATGCATAATACGGGAGTAAAACAACAGAATGCGATTGTCGACAAGAATAAGATTACTATTGAGACGAAAGAGattgtcaaaaataaaattgagaaaatgaaCAAAGATACTTCGCTTGCTGAAGATACTAGTGATACTGTGTCCAAATCGGACCAAAATTCGGTGGATGGTTCCATAGAGAAGAAATCGCCaacggaaaagaaaatatttcaaccgCGAAAACCCTTTTTACCCAAGGTCAAGGAACGTAAGACTGTAATACGGAATGCAAACGCATTTTCGCCGGAAAACGAGAGCAGCGTTTACGCTTTTGAAAGTGACACTGAGGTTCCTGTTAGCACACCGTTCCGGAGAAAGGTCAGAGATGATGCGAAACGTTCGGCTATAATAATGGAGTCCACAGCGACTGGTAGAGAGGTATCCAAAACCAAAGTGATTGCCGAAACCAAGAAATCGAAAGATTCCGAATCGTCAGACAGCAAGGAGAAACCGTCGAACGATCCTATGACGCCCATGAGGAAAGAAGACGCAAAAGAGACGCAAGGACCGATAAATGTGGTGAACACGGCTCCAAGCGTGAACAAGTTTGAACTGCCCAAGAATTTTGCAAGCTTGACCAGTGTCCAAGTGCTACCGCTCGATAAACTCACGACGAGTTGGAGCAATGTAAATTGTAGCGCTTCCATAGCGGTGCAAGTGAATCTCGATGAGACCATGCAGGAGCAAGAAACCGATGCGAGTCAGCAGAAAAGTACTGAGATATCCACTCAGACTGAAATGAATAacgagaacgacgacgacaacgatggACAGCTGTTTTACATTCCGTTGCAGGCTGTTACGAGGAGCGGCCCAAATCTGGTGCAAGGTCAACAGCTAATACAGGGTGTCGCCGTTAAGCTTGGCACGGAAGGACCGACTGGTCCTAATCAAAGAGTACTCCTTCGAGCTAAGTTGGTCACTAAGCCACCGTTATCCGTGGCACGTTGTCCGCCGATAGGAACGGTGCAACCGACAACACGTACTCCGCCCAATCCTACGGCAATTACAGTGGGGCAGCAGGTACCATCGACGTCAACGACCGAGACTGTATCGACTACGACAGCAAGTATGCCGAGTGCGGAGACGCAACCAACGCAGTCATtagcttttaaaaatgaagTGTCGGTGCCAAACCTAAATCGTCAAAATGTCAGTACAAGCGCAAATAGCAGTTTGGAGAAACTCACAAAATCACCGAAGTCGTCTAGGGAAAGGAAAACTTCTATAGATTCGACAAAAAGCGGGAAGAGgacACAAATGAAATGCAAACAAAGAGGGTTGGACTTATGCTCTTCGAGCAACAGTACGACGTTTCCAGGTACGAAGATGGGGAATAATGAAGCTCGCGTAGTCGAGGCGCCAACATTTCATCCGACAGAGAAAGATTTTCAGGATCCATTGGAGTATATAGACAAGATAAGGCCGATTGCTGAAAAATTTGGGATATGTAGAGTAGTTCCTCCACCAAATTTTAAGccgGAATGCAAGGTGTCTGA
CGATATGAGATTCACTGCTTACAATCAATATGTACATCGCATGCTCCACAGATGGGGTcctaatgtaaaagaaatgatggctatcaaaaaatatttagctaCTCAGAATATTACTTTGACTCAACCTCCATGgATTGGCGGAATGGAAGTAGATTTACCTCACTTATATCAAACAGTTCAAAGTTTGGGTGGATTGAAAgaagttattgaaaaaaagaaatggcaGAAAGTAGCAGATGGCATGAAAATACCGAAATCAGCACAGGATCGTGTTACTAAACTAGATgacatttattgtaaatatttactgCCATATGATACATTATCGCCAgaggAACGTGGGAAATTATTTGATGAGGTGGAAGCTGAATGGATGAGAAAAGAAAGCAAAGCTTTGCAAAGGCAAGAATCCtctaatgataatgatgaagAAGAGGATGAGGACAGTTCTGATGAAATTGAAGAATGCATTGTGAAAgGAAGAAATATGCCTTTGAACGCTTTTTACCGAATTGCCCGTAATACGCAACGTATGTGGTTCGGTGAAAATCAACGAGGAAATGAAGCCGAAGGTGCTTCCGCCGATGAAGTAGAACATGCATTTTGGAAACACGTTGCCGAGAGAAAACGTCATGTTTGCGTACATGCGGCAAGCATTGATTCCAGTGGTCGTGGATTTGGCTTTTCCGTTGCAAAAAACAGCCCATTCGCTAGACATCCTTGGAATCTTAAAGTTCTCACTAACAACGCTGGATCAGTATTGAGAGCTTTAGGTCCACTAATGGGTGTGACCGTACCAACATTACACGTGGGTATGTTATTTAGTGCTTGTTGCTGGTATCGCGATCCGCATGGTCTACCATGGATAGAATATCTACATACAGGTGCCAAAAAGATTTGGTACGGAATACCTGATGAGCATAACAACAATTTCCGAGAAGCGCTTTCGAAAATGGTACCGCGATATTGCAAAAACAAGACTATATGGCTACCTTCTGACACTGCCATGGTTCCGCCGGAATTATTAGTAAGCAATGGCGTACCATTATGTCAGATAGTGCAAGAACCAGgacaatttataatagtatttcCAAAAGCGTTCACATCCAGTATATGTACCGGTTATGTAGTATCCGAAAGCGTATATTTCGCACAACCATCCTGGTTAGAAACTGCGGAACAAGTATTTAAgGACATACAAGATAGTTGTGAACCTTCTATCTTTTCATTCGAgagattgttatttaatattattaatgacacAAGATCTCACATAGATGTATTAAAACagATATTGCCGAGTGTGATAAAGATTCGCGAAAAGGAATTAGATTATCGTAAACAACTGGAAAATGTGGGTCTTATCAATAGAGAAAGATTACCTTTGCCAGATAGtggaaaagggaaaaaaggaaaaaaggtgAAAGAAGATGATGGCGATTTTGAATGTGAGATATGCAGAGCTAACTTATTTGTCTCTTTGATAAACAATTCGCAAGACGACAGCGTTTATTGCTTGCCGCATGCATTGCATTTAATCAGTTATAAGAAGCAAGTTTTGAAACATTGTACTCTGATGTACACATATGACGAAgaCGAATTAGACGAATTGATTCACAAACTGGAAAACAGGATTGAAGCCAAATCTAAAAAAACtaatcaaataaaacaaaataagtaa
- Protein Sequence
- MVLSRNDKRKRKEGDLVDLMEPLSESPKRTKVHAQRKFAQGATTVFNTSPTPVKDKEKAKPTMLTELITHKRPNTEDFLTFLCFRGTSILPPSLNFFNIGNKKEKQGLKPVRTSIEKASSTNTFTEIQKQETTCQKNVTKVKPIVSNKKKVASTLHKNITKLKTATSTVQALKKKYQEQRLAKQRIKNKFKTTCVMRTRSCTERSMLKAGLQLAPRKFLPRIKAKRHGLRSAVLPDKTNKPLSKTKVLSKPKKKVNSKTKNSDTSNTDASSEENDVEDNEEEELHVEKSTSQIKRNIQRGIQKKICTRSKSEMKRVTRSSSSTIPSQNLSRRPTRKTKEAAAVYMEILGRKLVSPDLENDDNISVESFPELPNARRIAQTENELKAKVKQSSTKTITKTKDSKVSSFSSNANKRLDKSKNNKLILKSKRLMRVQRYCEEDSDEESESSVETNNTRPITRQSLGKIDKPISRSLRSSSKSTTIQTEIQNNVNSKPKTQVQSYVKTAKEKFVIKRKREQNISKTSSVKIKQKSLKSNQNEKPESEEETLGMLLSKIKKKKENGEEEVEGQIPDNVNHIKDGKVTRSNAQNVKADLLRVSDDEESFRGFTKKAISKVLNSCQTHANVNLLVTESNEKLNLSKTLDVSQTMEQSATLQENKVETLSIKAGSPVANSFNGECSGSGQDQKSVLNLSVKTQSSLLPSMEQTKCNHENEISSNLMPLSNLHTRKERVNMSTEQIEKWLNESSFAKEESKLEMENVSSFRYDSAKEKLKADVSHLSISTKIQHLVRPVNVTLSKLSDKTNLKDRAGIHSRNVISASSDMHNTGVKQQNAIVDKNKITIETKEIVKNKIEKMNKDTSLAEDTSDTVSKSDQNSVDGSIEKKSPTEKKIFQPRKPFLPKVKERKTVIRNANAFSPENESSVYAFESDTEVPVSTPFRRKVRDDAKRSAIIMESTATGREVSKTKVIAETKKSKDSESSDSKEKPSNDPMTPMRKEDAKETQGPINVVNTAPSVNKFELPKNFASLTSVQVLPLDKLTTSWSNVNCSASIAVQVNLDETMQEQETDASQQKSTEISTQTEMNNENDDDNDGQLFYIPLQAVTRSGPNLVQGQQLIQGVAVKLGTEGPTGPNQRVLLRAKLVTKPPLSVARCPPIGTVQPTTRTPPNPTAITVGQQVPSTSTTETVSTTTASMPSAETQPTQSLAFKNEVSVPNLNRQNVSTSANSSLEKLTKSPKSSRERKTSIDSTKSGKRTQMKCKQRGLDLCSSSNSTTFPGTKMGNNEARVVEAPTFHPTEKDFQDPLEYIDKIRPIAEKFGICRVVPPPNFKPECKVSDDMRFTAYNQYVHRMLHRWGPNVKEMMAIKKYLATQNITLTQPPWIGGMEVDLPHLYQTVQSLGGLKEVIEKKKWQKVADGMKIPKSAQDRVTKLDDIYCKYLLPYDTLSPEERGKLFDEVEAEWMRKESKALQRQESSNDNDEEEDEDSSDEIEECIVKGRNMPLNAFYRIARNTQRMWFGENQRGNEAEGASADEVEHAFWKHVAERKRHVCVHAASIDSSGRGFGFSVAKNSPFARHPWNLKVLTNNAGSVLRALGPLMGVTVPTLHVGMLFSACCWYRDPHGLPWIEYLHTGAKKIWYGIPDEHNNNFREALSKMVPRYCKNKTIWLPSDTAMVPPELLVSNGVPLCQIVQEPGQFIIVFPKAFTSSICTGYVVSESVYFAQPSWLETAEQVFKDIQDSCEPSIFSFERLLFNIINDTRSHIDVLKQILPSVIKIREKELDYRKQLENVGLINRERLPLPDSGKGKKGKKVKEDDGDFECEICRANLFVSLINNSQDDSVYCLPHALHLISYKKQVLKHCTLMYTYDEDELDELIHKLENRIEAKSKKTNQIKQNK
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01422021; iTF_01421345; iTF_00015841; iTF_00014517; iTF_01407013; iTF_01406288; iTF_01228807; iTF_01228806; iTF_01098984; iTF_01077258; iTF_01407950; iTF_01405473; iTF_00127970; iTF_00129512; iTF_00128749; iTF_00126487; iTF_00125694; iTF_00127232; iTF_00181764; iTF_00016485; iTF_00417349; iTF_00385064; iTF_01475976; iTF_01408842; iTF_01015595; iTF_00015181; iTF_00109614; iTF_01245007; iTF_01476720; iTF_01520018; iTF_01477365; iTF_00181110; iTF_00898551; iTF_01254584; iTF_00264058; iTF_00280442; iTF_01355100; iTF_00264799; iTF_01523156; iTF_00730522; iTF_00868555; iTF_00867181; iTF_00867929; iTF_00729078; iTF_00729851;
- 90% Identity
- iTF_01422021;
- 80% Identity
- -