Basic Information

Gene Symbol
pol
Assembly
GCA_035045965.1
Location
JAWNOM010000534.1:42035-62390[-]

Transcription Factor Domain

TF Family
zf-MIZ
Domain
zf-MIZ domain
PFAM
PF02891
TF Group
Zinc-Coordinating Group
Description
This domain has SUMO (small ubiquitin-like modifier) ligase activity and is involved in DNA repair and chromosome organisation [1][2].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 1 1.2e-23 4.3e-20 71.3 2.6 1 46 307 352 307 356 0.96

Sequence Information

Coding Sequence
ATGCGAAAGACCCGCTCGCAAACTCAGACGGAAAATGCCGCTTCTACCACAACGCAGCAATCAACGCCAACTGCAGTAAACACATTTGATGCGGCTAAATTCAAGGAATGTGAGCAAATGGTGCAGATGCTACGCGTTGTAGAGCTGCAAAAAATTTTATCGTTTCTAAACATTTCCTTCGCTGGACGAAAAACCGATCTACAAGGTCGCATCCTATCGTTCCTGCGCACTAACTTGGAGCTGCTCGCACCCAAGGTACAAGAAGTCTACGCTCCATCCGTGCAAGAACAAACACCCACGCTGCAGTACATAGATCCAACCAGAATGTACTCGCACATGCAGCTGCCGGCTGTGCAGCCAAATAATGCGAGTCTCGTTGGTGGTGCACCGACACAGGTACCGCAGGTGGGCACCGGCAATCCCGCCCAGATGCCAGTCGGCGCTGGCGGTGCACCCGCCAATATGCTACCCTTCCTGCACACGCACGGTATGAACAGCCAGATGCCCATACATCCAGATGTGCAGCTGCGCTTCTATCTCGTGGAGACCTCGTGCGACCAAGAGGATTGTTTTCCACCAAACGTCAATGTCAAAGTGAACAACAAATTATGTCAGCTGCCTAATGTCATTCCTACAAATCGTCCAAATGTGGAACCAAAACGTCCACCACGGCCAGTGAATGTGACCTCAAATGTGAAATTGTCGCCGACAgtaacaaatacaataatggTACAGTGGTGCCCAGAGTACACGCGTAGCTACTGCCTGGCTGTCTATTTGGTAAAGAAGCTGACATCTGCACAGCTCTTACATCGTATGAAGACAAAGGGCGTCAAGCCTGCAGACTACACGCGTGCTCTGATTAAGGAGAAGCTTACAGAGGATGCGGACTGCGAGATAGCCACAACCATGCTAAAGGTGTCGCTTAATTGCCCGCTgggtaaaatgaaaatgtcgcTGCCGTGTCGCGCATCAACCTGCTCGCACTTGCAGTGCTTTGACGCCAGCCTCTACCTGCAAATGAACGAGAGAAAGCCCACCTGGAATTGTCCTGTGTACGATAGACCAGCCATCTATGATAATTTAGTAATTGATGGCTATTTCCAAGAGGTATTAGGGTCCTCGTTGCTAAAGAGCGACGATACCGAGATTCAGTTGCATCAGGATGGTTCGTGGAGTACACCAGGCTTGCGTAGCGAGACCCCCTACGCCTCgtcgcaacagcagccacaaaaaaCTGACGTGGTAAAGTCAATGCCCGACTTCGACGGGGCTCAAGAAGACTATGTTGCCTGGCGGCAGTCAGCAACAGACGCTTACGAGCTATTTAGGTCCCACTCTACTAGCAGCGTTCATTACCAAGCTGTGACCATAATTAGAAATAAGATTAGGGGAAACGCAAGAGCCCTGCTTACGTCCCACAACACGGTTCTGAATTTTGACGCGATCATTGCCAGGCTGGATTGCACGTACGCGGACAAGACCTCACTAAGGCTCCTCCGCCAAAACCTGGAAATGGTCAGACAGGGAGATTCTTCTCTCATGCAGTACTACGACGACGTAGAACGCAAGCTAACGTTGGTAACCAACAAAATCATCATGACACACGAAGCCGCCATGGCGACATTATTAAACAACGAGGTCAGGGCAGATGCCCTTCACGCATTCATTTCGGGTTTAAAAAGATCCCTCAAGGCCATCGTCTTCCCAGCCCAACCCGGCGACTTGCCCACCGCCTTGGCCCTAGCCCGAGAAGCCGAAGCTAGCATTGAGCGCAGCAACTTCGCTGTGTCATACGCGAGGGTGATGGAAGAAAGAGCTCATTCGAATGAGCACAACAAAAGTCAAACCAGACAGGACAAACACGGTAGGAATAATAACCCTAACCAAGAGAAGAACCCACACTTCGTAAAAAAACAAGGGAAACAACCTCAAAGCGGCGATCACGCGCAGCAACAGCCGAAACCGCCCCAGCAGTCGGAGCCGATGGATGTAGACCCGTCAATGTCCAAGCCGTTCAACGCCATAATAGGCTTCGACCTGCTAACGCAAGTAGGAGCAACATTGGACATTCAGAAAGGTACGATTAATTATGGGCAAACTAACGAAAAGTTACGGCATCACTTCTGTGACAACGTTAATTTCACTAACGTCAACGATATCGTGGTACCCGAGATGGTAAAAGAggattttaaaaaaatgattcTGAAGAACATCAAGGCATTTTCGACCTCTAACGAGGCACTGCCGTTCAACACTTCGGTCATAGCGACGATACGCACAGAAGACGACCAGCCCATCTACTCCAAATTGTACCCACATCTAATGGGAGTGGCCGACTTCGTCACTAAAGAGATCAGGGCTCTCGTAGACAACGGCATCATAAGGCCATCAAAATCCCCATATAACAGCCCGACATGGGTCGTTGATAAAAAGGGATACGATGAAGAGGGTATCAGGAAAAAGCGACTCGTCATCGATTTCAGGAAACTAAACGAAAAGACCATCGCAGACAAATACCCGATGCCGAACATCTCAATGATCTTGGCCAATTTAGGCAAAGCTAAATACTTCACGACTTTGGATCTAAAATCCGGGTACCATCAGATATATTTAGCAGAGCAGGACAGAGAAAAAACATCGTTCTCCGTAATTGGAGGAAAGTATGAATTTTGTAGGTTGCCCTTTGGCCTGAAGAATGCGGGCAGTATCTTTCAGCGGGCAATCGACGACGTCCAACGAGAACAAATCGGAAAGTCGTGCTATGTGTACGTAGACGACGTCATAATTTTTTCCGAAAACGAAAAGGACCATGTCAGGCACATAGACTGGGTTCTCAAAAGCTTATGCCGCGCTAACATGAAAGTCTCAAAGGAAAAAACGCAGTTCTTTAAACAAAGCGTTGAGTACTTGGGATTCATAGTTACCATGGGAGGAGCAAAATCCGACCCCGAAAAGGTTAAAGCAATAAAAGAATTCCCAGAACCAAAGAACCTCTACGAGCTGAGGTCGTTTCTGGGACTGGCCAGTTATTACAAATGCTTCATAAAAGATTTTGCTGCCATAGCGAGACCGCTGACGGCAATAATGAAAGGCGAAAACGGATCCGTCAGCAAGCATCTCTCACGGAAAACCATCGTGAATTTCGACGACGCGCAGAGGCACGCGTTCCAGAAACTAAGGGATATCTTGGCATCGGAGGATGTCATCCTGAGGTACCCGGACTTTAAAAAACCATTCGACCTGACGACGGATGCCAGCTCATACGGCATAGGGGCAGTATTATCCCAAGAAGGAAAGCCCATTACAATGATCTCGAGGACTCTAAAAGATTGCGAGACCAAATACGCTACAAACGAAAGAGAGCTGTTGTCCATCGTGTGGGAGTTAGGTAAACTACAGCACTACTTGTACGGAACTCGAGACATCAACATCTATACTGATCACCAGCCTCTTACCTTCGCTGTTTCAGATAGGAACCCCAACGCAAAAATTAAGAGATGGAAAGCATACATAGACGAACATAACGCTAAGATACACTACAAGCCAGGCAAGGAAAACTTCGTTGCAGACGCCCTTTCAAGACAGAACATCAATGCTCTTCAGGAAGAGGCCATGTCCGATGCAGCCACAATACATAGTGAGCTGTCCCTGACCTACACGGTCGAAACGACGGAAAAACCCTTAAATTGCTTTAGAAACCAGATAGTGCTAGAACAGGCCCAGTTCCCAAGATATCGTAAGTTCATACTTTTCGGCGAGAAAAACCGCCACATCATCCACTTCACAGAAAAAGACGGAGTCATAAATGACATCAAAGACGTCGTGAACCCGAGCGTGGTTAACGCCCTTCACTGCGATTTGCCAACGTTGGCGGCAGTCCAACATGAGTTGGTCAAGTCATTGCCAAACACAAAATTCTTATACTGCAAGAATATTGTCTCAGACATTACAGACAAGTCGGAACAAAGAGAAATAGTTGAGGCCGAGCACAATCGAGCTCACAGGGCAGTGCAGGAAAACATCAAACACCTTCTTAAGGATTACTTCTTCcctaaaatgacaaaaatagcTAATGAGGTGGTAGCGAATTGTAGAGTTTGTGCCAAGGCAAAGTATGACCGACACCCGAAAAAACAGGAGTTAGGTTTAACCCCGATCCCCTCATATGTCGGCGAAATGTTGCACATCGACATCTTCTCCACCGacaagaaaatgtttttaactTGCATAGACAAATTCTCGAAAATGGCCGTGGTATATCCAATAACGTCCCGGACGATAGTAGACGTCAAGACGCCCATTTTGCTGCTAGTAAACTTGTTCCCCAGAATCAAAACAATCTACTGCGACAACGAGGCGGCATTCAATTCCGAGACAATCACGTCTCTTTTAAAAAATGGCTTCAACATAGACGTCGTAAACACGCCCCCTCTGCACAGCGTTTCTAACGGACAGGTCGAACGCTTCCACAGCACCTTGGCGGAAATAGCCAGATGCATGAAGCTAGACAAAAAAATTGACGATACCGTTGAGCTCATCCTAAGAGCGACAACAGAATACAACCGGACAATACATTCAACAGCTGACATGAGCTCAATCGAGGTCATGTACTCAGTCTCAGAGGAAATCGCGGAAAATATAAAGCGAAAACTCGAACACGCACAAAAGAACAACATGAACAGCATTTTCCCGATCTCGCACGAACACATCGTGCTACAGCTGGATGACGACACCGTGGCGGGGTGTGACAGTGAGGCACTGGTGGTCACTAGctgcgcagcaacaacatatgCGCCCTTCTGCAAGCTGGCGGTACATGACACTTGCGCCCGTGGACTACATTCGGGATCCGTGGCACTCTGCCGCACACAACCCAGCTACTTGGAGACTGTCACCCAAGTAGACGACGGCGTTCTGATCATAAACGAAAGCCCGGCCCAGGTGACCACGGACGACGGCCCAACAATATCAGTAAGCGGCACCTTCCTCGTCACTTTCGAAACGTCGGCCATAATCAACGGGACCAAGTACGTGAACCAGAGCGAAGCCCTTAGCAAGAAACCAGGCATCGCCGTGTCACCTCTTCTCAACGTGATCAGCCACGACCCAGTCCTCAGCATGCCACACCTTCAGAGGATGAATCACCACAACCTACGCGTCATCCAAGAGCTCCAAGAAGAGGTCCAATCTGCCGGGTCACCCAAAATCTGGCTTATCATTGTCACCGGGATTACCGTCATCATCAGTGGCTCCACACTCATCTACTTAGTCTGA
Protein Sequence
MRKTRSQTQTENAASTTTQQSTPTAVNTFDAAKFKECEQMVQMLRVVELQKILSFLNISFAGRKTDLQGRILSFLRTNLELLAPKVQEVYAPSVQEQTPTLQYIDPTRMYSHMQLPAVQPNNASLVGGAPTQVPQVGTGNPAQMPVGAGGAPANMLPFLHTHGMNSQMPIHPDVQLRFYLVETSCDQEDCFPPNVNVKVNNKLCQLPNVIPTNRPNVEPKRPPRPVNVTSNVKLSPTVTNTIMVQWCPEYTRSYCLAVYLVKKLTSAQLLHRMKTKGVKPADYTRALIKEKLTEDADCEIATTMLKVSLNCPLGKMKMSLPCRASTCSHLQCFDASLYLQMNERKPTWNCPVYDRPAIYDNLVIDGYFQEVLGSSLLKSDDTEIQLHQDGSWSTPGLRSETPYASSQQQPQKTDVVKSMPDFDGAQEDYVAWRQSATDAYELFRSHSTSSVHYQAVTIIRNKIRGNARALLTSHNTVLNFDAIIARLDCTYADKTSLRLLRQNLEMVRQGDSSLMQYYDDVERKLTLVTNKIIMTHEAAMATLLNNEVRADALHAFISGLKRSLKAIVFPAQPGDLPTALALAREAEASIERSNFAVSYARVMEERAHSNEHNKSQTRQDKHGRNNNPNQEKNPHFVKKQGKQPQSGDHAQQQPKPPQQSEPMDVDPSMSKPFNAIIGFDLLTQVGATLDIQKGTINYGQTNEKLRHHFCDNVNFTNVNDIVVPEMVKEDFKKMILKNIKAFSTSNEALPFNTSVIATIRTEDDQPIYSKLYPHLMGVADFVTKEIRALVDNGIIRPSKSPYNSPTWVVDKKGYDEEGIRKKRLVIDFRKLNEKTIADKYPMPNISMILANLGKAKYFTTLDLKSGYHQIYLAEQDREKTSFSVIGGKYEFCRLPFGLKNAGSIFQRAIDDVQREQIGKSCYVYVDDVIIFSENEKDHVRHIDWVLKSLCRANMKVSKEKTQFFKQSVEYLGFIVTMGGAKSDPEKVKAIKEFPEPKNLYELRSFLGLASYYKCFIKDFAAIARPLTAIMKGENGSVSKHLSRKTIVNFDDAQRHAFQKLRDILASEDVILRYPDFKKPFDLTTDASSYGIGAVLSQEGKPITMISRTLKDCETKYATNERELLSIVWELGKLQHYLYGTRDINIYTDHQPLTFAVSDRNPNAKIKRWKAYIDEHNAKIHYKPGKENFVADALSRQNINALQEEAMSDAATIHSELSLTYTVETTEKPLNCFRNQIVLEQAQFPRYRKFILFGEKNRHIIHFTEKDGVINDIKDVVNPSVVNALHCDLPTLAAVQHELVKSLPNTKFLYCKNIVSDITDKSEQREIVEAEHNRAHRAVQENIKHLLKDYFFPKMTKIANEVVANCRVCAKAKYDRHPKKQELGLTPIPSYVGEMLHIDIFSTDKKMFLTCIDKFSKMAVVYPITSRTIVDVKTPILLLVNLFPRIKTIYCDNEAAFNSETITSLLKNGFNIDVVNTPPLHSVSNGQVERFHSTLAEIARCMKLDKKIDDTVELILRATTEYNRTIHSTADMSSIEVMYSVSEEIAENIKRKLEHAQKNNMNSIFPISHEHIVLQLDDDTVAGCDSEALVVTSCAATTYAPFCKLAVHDTCARGLHSGSVALCRTQPSYLETVTQVDDGVLIINESPAQVTTDDGPTISVSGTFLVTFETSAIINGTKYVNQSEALSKKPGIAVSPLLNVISHDPVLSMPHLQRMNHHNLRVIQELQEEVQSAGSPKIWLIIVTGITVIISGSTLIYLV

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-