Cvar022074.1
Basic Information
- Insect
- Cheilosia variabilis
- Gene Symbol
- -
- Assembly
- GCA_951230905.1
- Location
- OX579678.1:61620006-61625737[-]
Transcription Factor Domain
- TF Family
- BTB
- Domain
- zf-C2H2|ZBTB
- PFAM
- PF00651
- TF Group
- Zinc-Coordinating Group
- Description
- The BTB (for BR-C, ttk and bab) [6] or POZ (for Pox virus and Zinc finger) [1] domain is present near the N-terminus of a fraction of zinc finger (Pfam:PF00096) proteins and in proteins that contain the Pfam:PF01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation [1]. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule [2]. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT [5, 3, 4]. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 27 3 9.1e+02 0.9 0.0 66 103 99 136 94 140 0.92 2 27 3.1 9.5e+02 0.8 0.0 66 102 157 193 152 197 0.92 3 27 3.1 9.5e+02 0.8 0.0 66 102 215 251 210 255 0.92 4 27 3.1 9.5e+02 0.8 0.0 66 102 273 309 268 313 0.92 5 27 3.1 9.5e+02 0.8 0.0 66 102 331 367 326 371 0.92 6 27 1.4 4.2e+02 2.0 0.0 65 104 388 427 381 430 0.92 7 27 2.3 6.9e+02 1.3 0.0 66 103 445 482 438 485 0.91 8 27 5.8 1.8e+03 -0.0 0.0 66 102 501 537 497 540 0.91 9 27 2.2 6.6e+02 1.3 0.0 65 103 558 596 551 599 0.91 10 27 2.6 8e+02 1.1 0.0 66 103 615 652 610 655 0.92 11 27 2.6 7.8e+02 1.1 0.0 66 103 671 708 666 711 0.92 12 27 4 1.2e+03 0.5 0.0 66 103 727 764 720 767 0.91 13 27 2.6 7.8e+02 1.1 0.0 66 103 783 820 778 823 0.92 14 27 2.7 8.2e+02 1.0 0.0 66 103 839 876 833 880 0.91 15 27 2.2 6.6e+02 1.3 0.0 65 103 929 967 922 970 0.91 16 27 3.1 9.5e+02 0.8 0.0 66 102 986 1022 981 1026 0.92 17 27 2.5 7.7e+02 1.1 0.0 65 103 1043 1081 1036 1085 0.91 18 27 2.4 7.4e+02 1.2 0.0 66 103 1102 1139 1097 1143 0.92 19 27 3 9.1e+02 0.9 0.0 66 103 1158 1195 1153 1199 0.92 20 27 3.5 1e+03 0.7 0.0 66 103 1216 1253 1210 1257 0.91 21 27 3.1 9.5e+02 0.8 0.0 66 102 1272 1308 1267 1312 0.92 22 27 2.5 7.7e+02 1.1 0.0 65 103 1329 1367 1322 1371 0.91 23 27 2.4 7.4e+02 1.2 0.0 66 103 1388 1425 1383 1429 0.92 24 27 1.7 5.3e+02 1.6 0.0 66 103 1444 1481 1439 1486 0.92 25 27 2.6 7.8e+02 1.1 0.0 66 103 1502 1539 1497 1542 0.92 26 27 3 9.1e+02 0.9 0.0 66 103 1558 1595 1553 1599 0.92 27 27 0.48 1.5e+02 3.4 0.0 66 108 1616 1658 1610 1660 0.92
Sequence Information
- Coding Sequence
- ATGAGAGCTACGATTTTGGCAGTTGTCCTGGCAGCAGGGATATGTGCTGCGGAAGTCATCGGGAGTCAACGGAGCTTGTGCACCGGAATAACGGACGGATCTTTTGTTGGAGATCCGGATTCATGTTCCAATTTCTTCATCTGTTACAACTCCGAGGCTGTCAGTCAAAGTTGTGAGCAGGGTAGTTACTTTGACAGCACGAATCTGTACTGCATACCGGATACGGATAATGTTTGCCCGCAACCGCCGGAGTGCAGCGAAAATGgagcttacagtccacaccccACCGAGTGTGCCCTCTTTTACATTTGCTACAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTACTTTAACAGCACGAATCTGTACTGCATCCCGGATACGGATAATGTTTGCCCCAGCCCGCAACCGTCGGAGTGCAGCGAAAATGgagcttacagtccacaccccACCGAGTGTGCCCTCTTTTACATTTGCTACAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTACTTTAACAGCACGAATCTGTACTGCATCCCGGATACGGATAATGTTTGCCCCAGCCCGCAACCGCCGGAGTGCAGCGAAAATGgagcttacagtccacaccccACCGAGTGTGCCCTCTTTTACATTTGCTACAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTACTTTAACAGCACGAATCTGTACTGCATCCCGGATACGGATAATGTTTGCCCCAGCCCGCAACCGCCGGAGTGCAGCGAAAATGgagcttacagtccacaccccACCGAGTGTGCCCTCTTTTACATTTGCTACAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTACTTTAACAGCACGAATCTGTACTGCATCCCGGATACGGATAATGTTTGCCCCAGCCCGCAACCGCCGGAGTGCAGCGAAAATGgagcttacagtccacaccccACCGAGTGTGCCCTCTTTTACATTTGCTACAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTACTTTAACAGCACGAATCTGTACTGCATCCCGGATACGGATAATGTTTGCCCCAGCCCGCAACCGCCGGAGTGCAGCGAAAATGGAGCTTACATTCCACACCCCACCGAGTGTGCCCTCTTTTACATTTGCTACAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTACTTTAACAGCACGAATCTGTACTGCATACCGGATACGGATAATGTTTGCCGGCAACCGTCGGAGTGCGGCGAAAATGGAGCTTACATTCCACACCCCACCGAGTGTGCCCTCTTTTACATTTGCTACAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTACTTTAACAGCACGAATCTGTACTGCATACCGGATACGGATAATGTTTGCCCGCAACCGTCGGAGTGCGGCGAAAATGgagcttacagtccacaccccACCGAGTGTACCCTCTTTTACATTTGCTACAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTACTTTAACAGCACGAATCTGTACTGCATACCGGATACGGATAATGTTTGCCCCAGCCCGCAACCGCCGGAGTGCAGCGAAAATGGAGCTTACATTCCACACCCCACCGAGTGTGCCCTCTTTTACATTTGCTACAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTACTTTAACAGCACGAATCTGTACTGCATACCGGATACGGATAATGTTTGCCCGCAACCGTCGGAGTGCGGCGAAAATGgagcttacagtccacaccccACCGAGTGTGCCCTCTTTTACATTTGCTACAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTACTTTAACAGCACGAATCTGTACTGCATACCGGATACGGATAATGTTTGCCCGCAACCGCCGGAGTGCAGCGAAAATGgagcttacagtccacaccccACCGAGTGTGCTCTCTTTTACATTTGCTACAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTACTTTAACAGCACGAATCTGTACTGCATACCGGATACGGATAATGTTTGCCCGCAACCGTCGGAGTGCGGCGAAAATGGAGCTTACATTCCACACCCCACCGAGTGTGCCCTCTTTTACATTTGCTACAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTACTTTAACAACACGAATCTGTACTGCATACCGGATACGGATAATGTTTGCCCGCAACCGTCGGAGTGCGGCGAAAATGgagcttacagtccacaccccACCGAGTGTGCCCTCTTTTACATTTGCTACAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTACTTTAACAGCACGAATCTGTACTGCATACCGGATACGGATAATGTTTGCCCGCAACCGTCGGAGTGCGGCGAAAATGGAGCTTACAGTTCACACCCCACCGAGTGTGCCCTCTTTTACATTTGCTACAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTACTTTAACAGCACGAATCTGTACTGCATCCCGGATACGGATAATGTTTGCCCCAGCCCGCAACCATCGGAGTGCAGCGAAAATGgagcttacagtccacaccccACCGACACGAATCTGTACTGCATACCGGATACGGATAATGTTTGCCCCAGCCCGCAACCGCCGGAGTGCAGCGAAAATGGAGCTTACATTCCACACCCCACCGAGTGTGCCCTCTTTTACATTTGCTACAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTACTTTAACAGCACGAATCTGTACTGCATACCGGATACGGATAATGTTTGCCCGCAACCGTCGGAGTGCGGCGAAAATGGAGCGTACAGTCCACACCCCACCGAGTGTGCCCTCTTTTACATTTGCTACAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTACTTTAACAGCACGAATCTGTACTGCATACCGGATACGGATAATGTTTGCCCCAGCCCGCAACCGCCGGAGTGCAGCGAAAATGGAGCTTACATTCCACACCCCACCGAGTGTGCCCTCTTTTACATTTGCTACAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTATTTTAACAGCACGAATCTGTACTGCATCCCGGATACGGATAATGTTTGCCCCAGCCCGCAACCGTCGGAGTGCAGCGAAAATGgagcttacagtccacaccccACCGAGTGTGCACTCTTTTACATTTGCTACAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTACTTTAACAGCACGAATCTGTACTGCATACCGGATACGGATAATGTTTGCCCGCAACCGTCGGAGTGCAGCGAAAATGgagcttacagtccacaccccACCGAGTGTGCCCTCTTTTACATTTGCTACAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTACTTTAACAGCACGAATCTGTACTGCATCCCGGATACGGATAATGTTTGCCCCAGCCCGCAACCGTCGGAGTGCAGCGAAAATGgagcttacagtccacaccccACCAAGTGTGCCCTCTTTTACATTTGCTACAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTACTTTAACAGCCCGAATCTGTACTGCATACCGGATACGGATAATGTTTGCCCGCAACCGTCGGAGTGCAGCGAAAATGGAGCGTACAGTCCACACCCCACCGAGTGTGCCCTCTTTTACATTTGCTACAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTACTTTAACAGCACGAATCTGTACTGCATACCGGATACGGATAATGTTTGCCCCAGCCCGCAACCGCCGGAGTGCAGCGAAAATGGAGCTTACATTCCACACCCCACCGAGTGTGCCCTCTTTTACATTTGCTACAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTATTTTAACAGCACGAATCTGTACTGCATCCCGGATACGGATAATGTTTGCCCCAGCCCGCAACCGTCGGAGTGCAGCGAAAATGGAGCTTACAGTCCTCACCCCACCGAGTGTGCACTCTTTTACATTTGCTACAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTACTTTAACAGCACGAATCTGTACTGCATACCGGATACGGATAATGTTTGCCCGCAACCGTCGGAGTGCAGCGAAAATGgagcttacagtccacaccccACCGAGTGTGCCCTCTTTTACATTTGCTACAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTACTTTAACAGCACGAATCTGTACTGCATCCCGGATACGGATAATGTTTGCTCCAGCCCGCAACCGTCGGAGTGCAGCGAAAATGgagcttacagtccacaccccACCGAGTGTGCCCTCTTTTACATTTGCTATAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTACTTTAACAGCACGAATCTGTACTGCATACCGGATACGGATAATGTTTGCCCGCAACCGTCGGAGTGCGGCGAAAATGGAGCGTACAGTCCACACCCCACCGAGTGTGCCCTCTTTTACATTTGCTACAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTACTTTAACAGCACGAATCTGTACTGCATCCCGGATACGGATAATGTTTGCCCCAGCCCGCAACCGTCGGAGTGCAGCGAAAATGGATCTTACAGTCCACACCCCACCGAGTGTGCCCTCTTTTACATTTGCTACAATGGGACGCTGGTGGAGCAGAGCTGTGACGTTGGCAGCTACTTTGACAAAACAGAACTCTATTGCGTACCGGACACAAACAACATCTGTCAACAGCCACAGTGCACCACAAATGGGGCAATAAGTCCCTATCCCGGGCAGTGTGCTTTGTTCTACATCTGCTACGATGGCTTACTTGTGGAACAAACCTGCGGCGTCGGTAGTTACTTTGACGCAGAGAATTTGTACTGCTTACCCGACTCTGACAACGTCTGTTGGCCATCGACGACAACCGAAAAGTCGTCATGCCCATGCGTTGGAGGCCACACGAATGGCGAGTTTGTCGCCAGCCCGGACTGCTGCTATTCGTACTATGTGTGCAGCGATGGCGAACTTGTGGAACAGAACTGTGGATATGGAAATTATTTTGACAACAGCATTAAGCGGTGTGAAAAAGATGTCACCGGTGAGTGTTGGCCGTCGTCCAGTTTTACAACAACTACTACCCAGGCGCCTTGTACTTCATGCAATGAGGGTGACTTCGTTGCCAACGATGACAGTTGCTACGCATTTTCTCTGTGTGCCCAAGGTGCGCTTGTGGAACTAAGCTGTGGCTATGGAAACTATTTCGACAGGGTCACTAAGGTGTGCGAAAGGGACATCGAAGGGAAGTGCTGGACAACAACAGAGCCCTCCGATTGTTAG
- Protein Sequence
- MRATILAVVLAAGICAAEVIGSQRSLCTGITDGSFVGDPDSCSNFFICYNSEAVSQSCEQGSYFDSTNLYCIPDTDNVCPQPPECSENGAYSPHPTECALFYICYNGTLVEQSCDVGSYFNSTNLYCIPDTDNVCPSPQPSECSENGAYSPHPTECALFYICYNGTLVEQSCDVGSYFNSTNLYCIPDTDNVCPSPQPPECSENGAYSPHPTECALFYICYNGTLVEQSCDVGSYFNSTNLYCIPDTDNVCPSPQPPECSENGAYSPHPTECALFYICYNGTLVEQSCDVGSYFNSTNLYCIPDTDNVCPSPQPPECSENGAYSPHPTECALFYICYNGTLVEQSCDVGSYFNSTNLYCIPDTDNVCPSPQPPECSENGAYIPHPTECALFYICYNGTLVEQSCDVGSYFNSTNLYCIPDTDNVCRQPSECGENGAYIPHPTECALFYICYNGTLVEQSCDVGSYFNSTNLYCIPDTDNVCPQPSECGENGAYSPHPTECTLFYICYNGTLVEQSCDVGSYFNSTNLYCIPDTDNVCPSPQPPECSENGAYIPHPTECALFYICYNGTLVEQSCDVGSYFNSTNLYCIPDTDNVCPQPSECGENGAYSPHPTECALFYICYNGTLVEQSCDVGSYFNSTNLYCIPDTDNVCPQPPECSENGAYSPHPTECALFYICYNGTLVEQSCDVGSYFNSTNLYCIPDTDNVCPQPSECGENGAYIPHPTECALFYICYNGTLVEQSCDVGSYFNNTNLYCIPDTDNVCPQPSECGENGAYSPHPTECALFYICYNGTLVEQSCDVGSYFNSTNLYCIPDTDNVCPQPSECGENGAYSSHPTECALFYICYNGTLVEQSCDVGSYFNSTNLYCIPDTDNVCPSPQPSECSENGAYSPHPTDTNLYCIPDTDNVCPSPQPPECSENGAYIPHPTECALFYICYNGTLVEQSCDVGSYFNSTNLYCIPDTDNVCPQPSECGENGAYSPHPTECALFYICYNGTLVEQSCDVGSYFNSTNLYCIPDTDNVCPSPQPPECSENGAYIPHPTECALFYICYNGTLVEQSCDVGSYFNSTNLYCIPDTDNVCPSPQPSECSENGAYSPHPTECALFYICYNGTLVEQSCDVGSYFNSTNLYCIPDTDNVCPQPSECSENGAYSPHPTECALFYICYNGTLVEQSCDVGSYFNSTNLYCIPDTDNVCPSPQPSECSENGAYSPHPTKCALFYICYNGTLVEQSCDVGSYFNSPNLYCIPDTDNVCPQPSECSENGAYSPHPTECALFYICYNGTLVEQSCDVGSYFNSTNLYCIPDTDNVCPSPQPPECSENGAYIPHPTECALFYICYNGTLVEQSCDVGSYFNSTNLYCIPDTDNVCPSPQPSECSENGAYSPHPTECALFYICYNGTLVEQSCDVGSYFNSTNLYCIPDTDNVCPQPSECSENGAYSPHPTECALFYICYNGTLVEQSCDVGSYFNSTNLYCIPDTDNVCSSPQPSECSENGAYSPHPTECALFYICYNGTLVEQSCDVGSYFNSTNLYCIPDTDNVCPQPSECGENGAYSPHPTECALFYICYNGTLVEQSCDVGSYFNSTNLYCIPDTDNVCPSPQPSECSENGSYSPHPTECALFYICYNGTLVEQSCDVGSYFDKTELYCVPDTNNICQQPQCTTNGAISPYPGQCALFYICYDGLLVEQTCGVGSYFDAENLYCLPDSDNVCWPSTTTEKSSCPCVGGHTNGEFVASPDCCYSYYVCSDGELVEQNCGYGNYFDNSIKRCEKDVTGECWPSSSFTTTTTQAPCTSCNEGDFVANDDSCYAFSLCAQGALVELSCGYGNYFDRVTKVCERDIEGKCWTTTEPSDC
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -