Pter002454.1
Basic Information
- Insect
- Protophormia terraenovae
- Gene Symbol
- ARID2
- Assembly
- GCA_951394005.1
- Location
- OX596073.1:60191544-60204581[-]
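The location string above packs the sequence accession, start, end, and strand into one field. A minimal sketch of splitting it apart is shown here; the `Location` type and `parse_location` helper are hypothetical names introduced for illustration, and the length calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a location field such as 'ACC:START-END[STRAND]'."""
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for the 'accession:start-end[strand]' layout used in this record.
_LOCATION_RE = re.compile(
    r"(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]"
)


def parse_location(text: str) -> Location:
    """Split a location string into its four components."""
    match = _LOCATION_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        seqid=match["seqid"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("OX596073.1:60191544-60204581[-]")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = loc.end - loc.start + 1
print(loc, span_bp)
```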
Transcription Factor Domain
- TF Family
- ARID
- Domain
- ARID domain
- PFAM
- PF01388
- TF Group
- Helix-turn-helix
- Description
- This domain is known as ARID, for AT-Rich Interaction Domain [2], and is also known as the BRIGHT domain [1].
- Hmmscan Out
- # 1 of 1: c-Evalue 3.2e-27; i-Evalue 1.8e-23; score 83.8; bias 0.0; hmm coord 3 to 89; ali coord 89 to 173; env coord 87 to 173; acc 0.94
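The hmmscan hit above has thirteen values that pair one-to-one with the thirteen column labels. As a rough sketch, the row can be turned into a typed record and the aligned spans computed from it; the snake_case keys and the `parse_domain_row` helper are illustrative names rather than anything defined by the database or by HMMER, and the span lengths assume inclusive coordinates.

```python
# Column order follows the header shown with the hit:
# '#', 'of', c-Evalue, i-Evalue, score, bias, hmm from/to, ali from/to, env from/to, acc.
# The key names below are illustrative choices.
FIELDS = [
    ("index", int), ("of", int),
    ("c_evalue", float), ("i_evalue", float),
    ("score", float), ("bias", float),
    ("hmm_from", int), ("hmm_to", int),
    ("ali_from", int), ("ali_to", int),
    ("env_from", int), ("env_to", int),
    ("acc", float),
]


def parse_domain_row(row: str) -> dict:
    """Convert one whitespace-separated domain row into a dict of typed values."""
    values = row.split()
    if len(values) != len(FIELDS):
        raise ValueError(f"expected {len(FIELDS)} columns, got {len(values)}")
    return {name: cast(value) for (name, cast), value in zip(FIELDS, values)}


hit = parse_domain_row("1 1 3.2e-27 1.8e-23 83.8 0.0 3 89 89 173 87 173 0.94")

# Assumption: coordinates are inclusive, so a span is to - from + 1.
hmm_span = hit["hmm_to"] - hit["hmm_from"] + 1  # model positions covered
ali_span = hit["ali_to"] - hit["ali_from"] + 1  # protein residues aligned
print(hit["i_evalue"], hmm_span, ali_span)
```

Under the same assumption, the aligned residues could be pulled from the protein sequence with a slice such as `protein[hit["ali_from"] - 1 : hit["ali_to"]]`, where `protein` is a hypothetical variable holding the sequence as a string.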
Sequence Information
- Coding Sequence
- ATGTTGTTAAACGAGTCCATAAAATCATCAAATATATCACAAATTCAACAAAATGAAGGCACAACACAAAACAGCAGTACAGTGAACACAATTGGCGTCATCACAGCCAGCTCCACCACCACTTCTGCTAGTTCATCCTCCACTACCACCGCAACAACAGCCGGAACACCATTAAGATCTCGAAATACAAATAGGGATCGCCAGCAGTCCGATGATAAGCAGGACAATTCAAATAATAAGGGTAAAAATCCTCCAAAACCCATGGaagaattttacaaagatcTACAACAGTTCCATGAGAAAAGAGGCACACCTATTATGCATATGCCGAAAATAACAGGACGTGAGGTTGATCTTCATCGCTTATATTGTGAGGTCACTGAGAGAGGTGGATTTCAAAAAGTCAATATGCGTGACGAATGGGATGAAATTTTACCCGAACTGGGTTTAAAAGAGAAAATAGTCAATGGTTCGGCtgcattgaaatatatttatcgtCGTGTATTGGAAAAATATGaacgtttaaattttttcggtGAAGATCCCGATAAAGTAGATGCTTCAGAGGCAGCCGAAATGGCTGAATTTAATATGGGCGGTGGTCGAGGTAGAGGTTCTAGGTATCCTTCCTCTTACAGTAGTTCCAATCAAGCCATTATACATAATGTTGCTATGACCTATAACTACAGACAACATCATGTGAATATGGACAGACGAAgacaatttaaattatccacAGATCTACATAAACCCTCAGCGTATGAAAAACTGATGCTATCACTACTCTCACCACTACCAAACGAACAAGACTTTGCCATAAATGCCTGTACGTTAATGGCTAATGAAAGTAGACATACGCTGAAACTTAACGAATATCCAAAGTTATTGGATGCTTTATTGGCTCATACTGGTGTATTTTCTGATTATACCATGCGCAAGTTGTTCCAACACATCTATAGTGGTGTAAGGCATCATTCTTTGTATGGATTTTGGTATGATTTGTTAACGGATAAACCTCAAATAATGGATTTGTATACAGATGAACAGGCTTTACGTGAAGCAGCTCTAATAGATGATTATGATGATGGCAAACCAGAAATGGAGGGCATTTGGCAAGATTGTAATGAAATggactttttaaatttacgcACTGGTTTAGGTACTCAAGACTATGTTGGCCAGCGGGTATTACAAGTAGTGTCTATACTACGTAATCTGAGTTTTCAAGAAGAAAACTTGggagttttagtaaaaaataaaacttttctgAGATTCCTGGTCATGAGCTCGAATATACGCTGGAGCAATGTTCATATACAGGCTCTTGAAATAGCTGGAAATATTGCTGGGGAATTGGAAATCTTAGATCCTACAATTGATGATTTGTCACGTTGTTTAATGGCAACTTTATGTGATGGCATCGAAGGTCCCGACCGTGGTGTCATAATAAATTGTCTAGAGATTTTATACAAACTCTGTCAAAAAGATACCAATGAAGACTATGTTCTTAAATGTTTAAATGCCAACTTTTACAATACAATAGCATTGTTTTTATCACTAAATGACATAATGCTATTATTATTCACTTTGGAAGCGATTTATGCTCTCACATGTTTGGGTCCCAAGAGTTGTCATTCTATGATGCAGGTTAAGGGCATTGTTGATCAATTGATATCCCTAATTACAGTTGAGGCTCAGGCCTATGGTCCAGATGGTTGCATATTGATGCGTGTGGTAGAGACGGTGCCGGGAAATATGCTACCTATGGTAGCACAAAATATTGCCAACTTACAAAATGTTACCcacatacaaaaacaaactaCTACCATAAGCAGCCACAATCAAATTAAACCACAAATACTAACACCAAATCCAATGCCAGCCCCTGAAGTAGCCGAAACTTCAGCTGAACATATACAACCACAGCAGCTGCATCAACAAGtccaacagcaacaacaattacAAATGCCTCAAAACTTTACCCATGACGATGAACAA
TACGCTTTAGCATGGCTGGGAGCCACCTTTGAACGAACCTCTAGTGTGGAGAGTCATATTGAAACTCAAGAATTGTATCGCATGTATTTATCACACTGTCAAAAGTCGGGTAAACACTCAGTTGTTAATCATTTGCATTTTCCCAAATTGGTACGTCTTATTTTCACCGCCTCTGTTGGTCCGGCTGCAGTAAGAAAGACAGATGGGACAGATTTACCCGGCTTACACTATGTGTCGATTAGATTGCGAGGTCAACCATTGCCTTTACAAAAAGCTATACTTTCTGCAGGTGGTGGTCAAATTTCGCCCTCGGGCAAGCAATCTGAAAGTCCAGGATCAGctagaaaaaataagaaaaaattaaaacagcaaTTGGAATCGTGTCTGGCGCCCGCAACTTCAAATACATTAATACCATCAGTTCCAGTTACTTCCGTTGCTAATAATAATACGGCTCCTGCTCAAACGACTACAACAGATAGTTCTGCAGCTGCAAATGTCACTGAAACAGTTAAACAAGAAATGAATCCTGTTAAACAAGAGACGAATCCTgttaaacaagaaataaatcCTGTGATGACGCAACCATCAATACCAACAATGACTGAAAATTCTTCTCCAACACAACCACAGCAAGGTGAAGTTAAAAACGAATGCGAAAAAATGGAGATAGATGAAAAAGATAATATTCCAACTAAAACGGAGAAACAAACACAATTAACAGCCCAACAAAATGAATCTGAATCTATGCAAGTTGCAGACCAGACAGTAAACCAAGTAACCCAACCAATAACAGCAATCGCAACACAAAATCCATCAAgctctttaataaaaagtctaCTGGCGAACAAAGTTAATCAAAGACAACTGAAACAAAAGGACAGTTCTGTAAATTCTAATcctacaacagcaacaacaacaacaactgcTGCAGTCACTACCACAACACAACCCATTAAAGTGGCAAGTACAGCTATCACCGCTTTGGTCAACAATCCCCTCATGCAACACACCCCGGTAAAAGTAGGGCAAActacaataaaacctttaaatcCACATCCTACACCACTAGACAAAAAACCTCAAATATCAGAATCAACACCTCCACCCTTAGCTCCCCTAAGCGGAGCTAATGTTGCCAAAGATGCCTCTGGTAGACCCATCATTCTAGCCAATCAAATGTTGGTTGATATTCTAGATAAAAAAATGGTTGATCCACCTATACCAGGAAGCATTATTCAAAAACGTAAAATAGAAGATCATTTGGGAAACAATAAACGTTTGGCTATAGAATCAAACTCATCTCTCTCGGCTGCAAAAGAAGAAGTCCAAGTAACACCCTCTAAAAATGCAGCAAATTTATATGCTGAAATGGCTGCCTCCATATTGGAGGATGAAGATCTTGAAGAATTTTCAACTCAACCACAACAGCAAGCTCCTCCGCCTCCATTGGAACAACCACAACAAATCCAACAAACTTCTTTAATTATACCAGCTAAAGTGCAGCAACAAACAACCGGACCTCTTCCGGGTGTACAAAGACAATTGGTGTTTCAATCGAACCAACCACAACTAAAGCTAACTCAACCCGTAACATCAACCAACCAACAAATGCCCAATGCTGTGGCCACTATTAAAACTGACCAGGGTTTGCAAACAGTGCCGGTGATATTACAACAAAAGCCCTTAGAAATGCAACAACAGGGCCAAGCCCAAATCATACAACAAATGATACAACCAGCCCCGCAACAACAGCAAACATACGTGTTAGCCACGAATCAACAAGGACAAACTTATTTAGTAGCCCAACAATCACAGCAACAACCTCCTCCTCCACCACCTCAACCCACACAAACAGTTTTGGTAACCCAGACTCCTCAACAACAATCAACAGGACAAAAAACCATAATTATATTGCAACAACAAGCAATGCCGGGACAACAGCAACAGATAATTACCACAGGTCCACCTGGCCAGGGACAAAAAATGATTATGACCAC
TTCGCAAGGTCAACAAGTGCTGGTTACACAAAGACCTCAAACACCTCAGCAAATTTTCATAAATCCTCAAACAGGGGCTGCCACACACATACAACATGTACCCGTACAAACGCAAACCAGACAAATTATACAAACCTCGAATCAACAACAGCCCCAAACCCAGCAAACACAACAAATTCAAGCAGGACAAATATCTCCATCCTTGCTCAATCAACTAAATCAAATACCAGCAACCATAAAACTCCACCAGCCCATAGCTCAGCAATCGACACCAGTACAACAACCGCAACCAGCCACTATCACACGTATGACTAAAACCGTTTCCTTAGTTCAACCCTCCACGAATACACCTACAACACCTGTGGCCATACCTCAGCTACAGCAACATCAATCCATCATTCAACAACATATAATTTCAGGACCATCAGAAAAACGGCATGTTATCTTAAGCGGTGGCAGTGCAACAGGTGGTATGGGCATGCGTGctatagaaataaaagaaactgTTATTACCCAAGCCCAACAACCTCCACAACAATGCCAATTGAACTCGCAGACAATTATAACAACTCAGCAACCAGCAGCATCGCAGGGAACAACAACACCACCACTTTTACAACAACAGGTACAACAACAAATAAGAGCTGTTATTGAACAACAGCAACATCCGTCtataatacaacaaaaaaattcctCATTGCCTGTTGTACAGCAACAAAACCAGCCATCTCAGTCTCAGCCACAGACGTCGTCGTCTTCGGCGCcccagcaacaacaacaaactaAAACACAAAGTCAAGTAATGGTACAGACAAATATAGTTGCATCAAAACACAAAACATCATTACCacagcaacaacagcaaacGCCAACACTCATAGCAGCAAAGctacCGGATCCTACGCCATCTCTGATACAATCAAAGAATCAATCTCAAAATGTGGTACAAACTGTTGTGCCATCAATTCCAGCAACAACTCAACAATCTTCTAGCTTGGGAAATGTCAAAAAACCTAttacattaataaaaccatCGCTACCAACAGTTAAATTGCCACCTATTGTTAACACATCAGGACCACCACCGCTGGCAGtagtaaatacaaataataatgtaaatgtAACTGCAACGGCTGCCAATCCGTCGATTAATGCTCAAGCTCCATTAACTACAGAGGCAAACAAAAAAGATGAAATGGTCAAATCACAACAAGCAACTACAGGTCCCAATCAAGCAACTTCTACATCCAAAGCACAAATGTCAACAACATCCGCATCATCTACAACTTCAACAACATCATCTAGTAATGTGACTGCTACACCAGCTGCGCCACAACAACAGCAAAACAGTCAAAACCCACCACAAGTGCCGTCACAATCACTACCCCAACAAACATCTACACCTACTCCTCAACCACCACctccacaacaacaacaaactcAACAACTTTCAGCCATCGATGCCCAGTGGTTATATATTTGTGATTGGCGGAATTGTCCacgtaaaaaatttaaatcgatgGGTGATTTACAACATCATGTTTGTACATCCCATGCACCAGATCATTTAGATCCAGCGGCAGATATATTTTGCCAATGGGGCATCGGTCCTGGCCTCTGTGATGGTGTACCCCGTAAACGCTTCTCGCTTATGACTCATTTAATCGACCATCATTTAACCATGGACAGTTTAAGAAACGCGGTTCAACGTAGAATTGCTACCGGCATGTACAACATAGCACCGGCAATACCACCCGTAACCATAGTTCGTAATCTTGAATTATCACAAAGAACTAATAATGCATCGCCCTCCCCTTCGACGTCATCATCGTCCAGTTCTCAGACTGCGGCTACCGGATTGTCTGCTTTACAGGCCATCAAACGTCACACAACCGACTTAATGAATTCTAAAGAATTAATGGATGAGAATGAAGGCCCAGTAACCAAGAGTATTCGTTTGACTGCTTCATTGATTTTACGTAATTTAGTAACGTATACCT
CTACCGCAAAGCGTAGCGTACGTCGTTATGAACCACATTTAGCTAATATTGCTCTAAGTAATGTTGAATCAAGTGGCACTATATCTCACATACTCTATGAAATGAATGGTTGA
- Protein Sequence
- MLLNESIKSSNISQIQQNEGTTQNSSTVNTIGVITASSTTTSASSSSTTTATTAGTPLRSRNTNRDRQQSDDKQDNSNNKGKNPPKPMEEFYKDLQQFHEKRGTPIMHMPKITGREVDLHRLYCEVTERGGFQKVNMRDEWDEILPELGLKEKIVNGSAALKYIYRRVLEKYERLNFFGEDPDKVDASEAAEMAEFNMGGGRGRGSRYPSSYSSSNQAIIHNVAMTYNYRQHHVNMDRRRQFKLSTDLHKPSAYEKLMLSLLSPLPNEQDFAINACTLMANESRHTLKLNEYPKLLDALLAHTGVFSDYTMRKLFQHIYSGVRHHSLYGFWYDLLTDKPQIMDLYTDEQALREAALIDDYDDGKPEMEGIWQDCNEMDFLNLRTGLGTQDYVGQRVLQVVSILRNLSFQEENLGVLVKNKTFLRFLVMSSNIRWSNVHIQALEIAGNIAGELEILDPTIDDLSRCLMATLCDGIEGPDRGVIINCLEILYKLCQKDTNEDYVLKCLNANFYNTIALFLSLNDIMLLLFTLEAIYALTCLGPKSCHSMMQVKGIVDQLISLITVEAQAYGPDGCILMRVVETVPGNMLPMVAQNIANLQNVTHIQKQTTTISSHNQIKPQILTPNPMPAPEVAETSAEHIQPQQLHQQVQQQQQLQMPQNFTHDDEQYALAWLGATFERTSSVESHIETQELYRMYLSHCQKSGKHSVVNHLHFPKLVRLIFTASVGPAAVRKTDGTDLPGLHYVSIRLRGQPLPLQKAILSAGGGQISPSGKQSESPGSARKNKKKLKQQLESCLAPATSNTLIPSVPVTSVANNNTAPAQTTTTDSSAAANVTETVKQEMNPVKQETNPVKQEINPVMTQPSIPTMTENSSPTQPQQGEVKNECEKMEIDEKDNIPTKTEKQTQLTAQQNESESMQVADQTVNQVTQPITAIATQNPSSSLIKSLLANKVNQRQLKQKDSSVNSNPTTATTTTTAAVTTTTQPIKVASTAITALVNNPLMQHTPVKVGQTTIKPLNPHPTPLDKKPQISESTPPPLAPLSGANVAKDASGRPIILANQMLVDILDKKMVDPPIPGSIIQKRKIEDHLGNNKRLAIESNSSLSAAKEEVQVTPSKNAANLYAEMAASILEDEDLEEFSTQPQQQAPPPPLEQPQQIQQTSLIIPAKVQQQTTGPLPGVQRQLVFQSNQPQLKLTQPVTSTNQQMPNAVATIKTDQGLQTVPVILQQKPLEMQQQGQAQIIQQMIQPAPQQQQTYVLATNQQGQTYLVAQQSQQQPPPPPPQPTQTVLVTQTPQQQSTGQKTIIILQQQAMPGQQQQIITTGPPGQGQKMIMTTSQGQQVLVTQRPQTPQQIFINPQTGAATHIQHVPVQTQTRQIIQTSNQQQPQTQQTQQIQAGQISPSLLNQLNQIPATIKLHQPIAQQSTPVQQPQPATITRMTKTVSLVQPSTNTPTTPVAIPQLQQHQSIIQQHIISGPSEKRHVILSGGSATGGMGMRAIEIKETVITQAQQPPQQCQLNSQTIITTQQPAASQGTTTPPLLQQQVQQQIRAVIEQQQHPSIIQQKNSSLPVVQQQNQPSQSQPQTSSSSAPQQQQQTKTQSQVMVQTNIVASKHKTSLPQQQQQTPTLIAAKLPDPTPSLIQSKNQSQNVVQTVVPSIPATTQQSSSLGNVKKPITLIKPSLPTVKLPPIVNTSGPPPLAVVNTNNNVNVTATAANPSINAQAPLTTEANKKDEMVKSQQATTGPNQATSTSKAQMSTTSASSTTSTTSSSNVTATPAAPQQQQNSQNPPQVPSQSLPQQTSTPTPQPPPPQQQQTQQLSAIDAQWLYICDWRNCPRKKFKSMGDLQHHVCTSHAPDHLDPAADIFCQWGIGPGLCDGVPRKRFSLMTHLIDHHLTMDSLRNAVQRRIATGMYNIAPAIPPVTIVRNLELSQRTNNASPSPSTSSSSSSQTAATGLSALQAIKRHTTDLMNSKELMDENEGPVTKSIRLTASLILRNLVTY
TSTAKRSVRRYEPHLANIALSNVESSGTISHILYEMNG
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00045155; iTF_00259556; iTF_01194154; iTF_00258647; iTF_00331324; iTF_00349956; iTF_00921227; iTF_00921228; iTF_00199643; iTF_00921883; iTF_01259006;
- 90% Identity
- iTF_01259006;
- 80% Identity
- -