Bdim011970.1
Basic Information
- Insect
- Balanococcus diminutus
- Gene Symbol
- vdrb
- Assembly
- GCA_959613365.1
- Location
- OY390716.1:59329291-59375728[-]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 40 8.8 6.8e+02 1.2 0.2 1 15 14 29 14 32 0.72 2 40 0.007 0.54 11.0 0.2 2 21 49 68 48 69 0.94 3 40 3.6 2.8e+02 2.4 10.0 1 23 77 100 77 100 0.94 4 40 0.1 8 7.3 0.1 2 21 127 146 126 147 0.94 5 40 2.6 2e+02 2.9 0.1 1 16 155 170 155 172 0.86 6 40 2.1 1.6e+02 3.2 1.1 3 20 244 261 242 263 0.92 7 40 0.8 62 4.5 0.3 3 21 311 329 310 330 0.95 8 40 0.013 0.99 10.2 0.5 2 21 377 396 377 397 0.94 9 40 0.071 5.5 7.8 2.0 2 21 448 467 447 468 0.95 10 40 0.17 13 6.6 1.4 2 21 519 538 518 539 0.95 11 40 1.1 81 4.1 2.0 1 13 615 627 615 631 0.89 12 40 7.3 5.6e+02 1.5 3.0 1 13 681 693 681 695 0.88 13 40 0.047 3.6 8.4 0.6 1 20 772 791 772 793 0.94 14 40 0.25 20 6.1 0.8 2 21 887 906 886 907 0.94 15 40 0.43 33 5.4 1.0 2 21 952 971 951 972 0.95 16 40 1.6 1.3e+02 3.5 1.0 1 13 980 992 980 994 0.89 17 40 0.2 15 6.4 1.4 2 21 1021 1040 1020 1041 0.95 18 40 0.001 0.079 13.6 4.5 1 21 1096 1116 1096 1117 0.96 19 40 8.4 6.5e+02 1.3 0.2 1 12 1125 1136 1125 1140 0.86 20 40 2.3 1.8e+02 3.1 1.5 2 21 1166 1185 1165 1186 0.91 21 40 6.6 5.1e+02 1.6 1.4 1 12 1194 1205 1194 1208 0.84 22 40 1.5 1.1e+02 3.7 0.5 1 13 1261 1273 1261 1275 0.87 23 40 0.15 11 6.8 0.4 2 21 1300 1319 1299 1320 0.94 24 40 2.8 2.1e+02 2.8 0.3 3 13 1330 1340 1328 1342 0.91 25 40 0.004 0.31 11.7 0.3 2 21 1367 1386 1366 1387 0.94 26 40 0.00022 0.017 15.7 1.2 1 21 1395 1415 1395 1416 0.94 27 40 0.00041 0.032 14.9 2.2 1 23 1475 1498 1475 1498 0.94 28 40 8.9 6.9e+02 1.2 1.5 1 13 1623 1635 1623 1636 0.91 29 40 0.00011 0.0088 16.6 3.8 1 21 1678 1698 1678 1699 0.96 30 40 0.00059 0.046 14.4 0.4 1 21 1717 1737 1717 1738 0.95 31 40 0.00014 0.011 16.3 0.9 1 23 1746 1769 1746 1769 0.93 32 40 0.0082 0.64 10.8 2.9 1 21 1784 1804 1784 1805 0.94 33 40 0.0019 0.15 12.8 1.7 1 16 1814 1829 1814 1831 0.93 34 40 0.00081 0.063 13.9 1.0 1 21 1891 1911 1891 1912 0.95 35 40 0.01 0.8 10.5 1.0 5 21 2003 2019 2001 2020 0.95 36 40 0.00012 0.0092 16.6 0.7 1 20 2028 2047 2028 2049 0.95 37 40 0.0019 0.15 12.8 0.1 1 21 2053 2074 2053 2075 0.95 38 40 0.0011 0.086 13.5 3.8 2 23 2084 2106 2083 2106 0.96 39 40 0.00017 0.013 16.1 3.0 1 21 2115 2135 2115 2136 0.96 40 40 0.0019 0.15 12.8 0.8 1 20 2144 2163 2144 2164 0.96
Sequence Information
- Coding Sequence
- ATGAGGCATTTACGTTACGAATGTGGCGTCAAACCTCAATTTCCTTGTCCTCATTGCCCTCATAGGAatctAAGTAGAAAAAGCAACGTGAGCAGAGTTTTCCATACTGGTGATTCGATTGTCAATGAAGAAGGCGAATCTGTGACTTGTGCTCGGTGTCTGAAAACGTATAGTTCTGTGGGTAATTTACAAAGGCATTTGCGTTACGAATGTGGCGTCGAACCTCAGTTTTCTTGTTGCTTTTGTTCGTACAAATGCAAACATAAGACCAACTTGAAAGTACATGTTGTTAGAAAACATAatccAATTAGAAAAAGCAGTGTTAGGAGACTTTCCCATAATGATGGTTCAGTTGACAGTGGTGAAAGGGAACCTGTGACGTGTGCTCGGTGTCTGAAAACTTACACTGCTGTGGGTAATTTACAAAGGCATTTACGTTACGAATGTGGCGTCGAACCTCAGTTTCCTTGCCCCTATTGCTCTCATAGGAGTAAACGGAGAGatccACTTGCTGTTACTAGTATTCTCCAAGTCAACAACATAGACGGACCTCGACGAAGCAAACGAATCACTCGTAAATTGTCTGTCAGCGATCCAGGAATAGTTGATAAAGTGACATGTGATCGTTGTTTAAGAAATTTCCACGGATTTGTCGGAGGCAGCAGTTTTCAACTAGAACCAGTCACTTGTAAATCATCTGTCAAACATGAGAGGGAATCCAGTGTGCACTGTGATCGATGTCTGAGAAGTTATGGTAGTATTCGTAGTTTGCAGAGACATCTGCATTACGAGTGTGGTGTCGAACCAAAGTTCTCTTGTCTTCATTGCCCGTACAAATATTTCCAAGGAAAAATTAGTAGCAGCATTTGTAAAGTTGAACAAACCTCTTGTAAATCTACTGCCAAATATGAGAGGAAATCGCGTGTGTTGTGTGCTCAGTGTCTGAGGAATTATAGTAGTATGAATAATTTAGATAGACATTTACGTTACGAATGTGGCGTCGATCCTAAGTTCTCTTGTCTTCATTGGCCATACAAATATTTCCAAGGAGAAATTAGTAGCAGCAGTTGTAAAGTTAAACAAGCCATTTGTAAATCTTCTGCCAAATATAAGAGGAAATGGAGTGCAACGTGTGCTCGGTGTCATAAAACTTATAGTAGTGTGAGCGCTTTGGGAAGACATCTGCGTTACGAGTGTGGCGTCGAACCACAGTTTTCTTGTCCTTATTGTCCATGTAAATGTAAACAGAAGAATTTCCGAAGAGAAATTAGTAGCATCTGTTGTAAAGTTGAACAAGCCACTAGTGAATCTCCTGCCAAATATGAAAGGGAATCGCGTGTGACGTGTGCTCAGTGTCTGAAAAGTTACAAGTATTTACGCTGTTTACAAAGACATTTACGTTACGAATGTGGCGTCAAACCAAAGTTTTCTTGTCCTTATTGCCCGTGTAAATGTAAACAGAAGAATTTCCAAGGAGAAATTAGTAGCAGTAGTTGTAAAGTTGAACAAGACATTTGCAAATCTTCTGCCAAATATAAGACTGAATCACGTGTGACGTGTGCTCAGTGTCTGAGAAGTTATCGTTTTATGCGTAATTTACATAGACATTTGCGTTATAAATGTGGCGTCGAACCACAGTTTTCTTGTCCCCATTGCTCTCATAGGAATTTCCATGGTGAAATTAGCAGCAgttgtaaaattaaacaagCCATTTGTGAATCTTCTGTCAAATATGAGAGTGAATCAGGTGCGACGCCAACGTGTGCTCGGTGTCTGAGAAGTTATAGTTGCATGCATAATTTGCAAAGGCATTTACGTTACGAATGTGGCGTCGAACCACAGTTTTCTTGTCCTTATTGCCCTCGTAAATattatCGCCAATTTAGCAGAAGTAAGCTCAAAGTTGAACAAGACACTCGTAAATCATCTGCCAAACACGATAAGGGATCAGGCGTGACGTGTGCTCGGTGTCTGAGAAGTTACAGTTGCATTGCTACTTTACAAAGACATTTGCGTTACGAGTGTGGCGTAAAACCCCAGTTTTCTTGTCCTCATTGCCAGTATAAATattatCGCAGATTTAGCAGCAGTAAGCTTAGAGTCGAACAAGTCACTCGTAAATCATCTGCCAAACATGATATGGGACCGGaatctcaACAGATCTGTGGTTCACAAGCTAAACGAGCCACTTGTGAATCGTCTGCCAAATATAAGAGGAAATCGGATGTGACGTGTGCTCGGTGTCTTAGAGCTTATAGTTGTATTGGTGCTTTACAAAGACACTTGCGTTACGAGTGTGGCGTCGAACCCAAGTTTTCTTGTCCTCATTGCCCATACAAAAGTAAACTTAAAGAAACCATGAAAAGACATGTTCTTATCTGTAGCTCACAAGCTAAACAAGCCACTTGCAAACCATCTGCCAAATATAAGAGGAAATCGGGTGTGACGTGTGCTCGCTGCCTGAGAGCTTATAGTTGTATGGGCGCTTTACAAAGGCACTTGCGTTACGAATGTGGCGTCGAACCCAAGTTTTCTTGTCCTCATTGCTCCTACAAATATTGTCATCGAATCGGCACTATCTGCAGTTCTCACGTTGAACAATGTAGTTGCAGTAGTTGTAAATCGTCTACCAGAGCGAATGAAGAGGTGCCTGGTTTGACGTGTGCTCGGTGTCTGAGAAGTTTCAATTACATGTTTAGTTTGAAATCTCATTTACGTTACGAGTGTGGCGTTGAGCCTCGGTTTCCTTGTCCTCATTGTTCCCATAGAAGTTCGTTCAATGTCAATAGCAGTTCCCAAAGCAAAAGACGTCATGGTCAATTTTCTGCCAAAAATGAAGCTGTATCTAGTGTGACGTGTAATAGGTGTCTGAGGAGTTACCGTAGTACAAAGACATTGTACTACCATTTACGTTACGAATGTGCTGTAGAACCTCAGTTTATGTGTCCTCATTGCTCGTATAAGTattCACGTGAACAAACTCATGTGATGATGGGAAGTTCGGCAGCTAAACAAAGTACCCGTAAATCGTGTGCCAAAAATGAAGACGTGCCTGGTGTAAGGTGTAGTCGATGTCTGAGAACTTACCACAGTACGGGTACTTTACAAAGGCATTTACGTTACGAATGTGGCGTCGAACCGCAGTTTCCTTGTCCTCTCTGTTCGCATAAAATTGGAATTCTTCTAGATTCATTCGGTGTTAGCTCCCAAATCAACCATACCAGACTTTTCCAAGCCAATTCGAGTATTCTAAGATCGTCTGCCGGAGATAAAGAGGTTAACGGTTACACGTGTTCTAGATGTCATAGAACCTGTACCAGTAAGAGCAACTTAGGAAAGCATTTACGTCTCCAGTGTGGCGTCGAACCTAAGTTTGCTTGCCCCTACTGTTCGCATAAATatcCCCACCGGATCGTTCTTAGCAGCAATTCTCGAGTTAAACAAGccaattgcaattttaaatCGTCTGCCAAGTATGAGAGAGCACCCGGTGTCACGTGTGCTCAGTGTCATAAAAGTTATAGTTGTATCGGCGTTTTACGAAGACATCTACGTTACGAATGTGGCGTAGAACCTCAGTTTTCTTGTCCTTATTGTCCGCATAAATattttcgcCGAATTACTTGTAATAACAGTTATCAATTTGAACAAGTCACATGCAAATCGTCTGACAAGGGTAAAGGCAAATCTGGTGTCACGTGTTCTCAGTGCTTGAAAACTTTGAGTTGTATGTTATCATTGCAAAGACATTTGCGTTACGAGTGTGGCGTCGAACCTCAGTTTCTTTGTCCTCATTGCCCGTACAAATattCTCGCCGAATTACTTGTGATAGAAATTCTCAAGTTAATCAAGTCATTTGTAAATCGTCTTCCAAGTATGAAAGCAATCCCGATACGACTTGTGTTCGGTGCCTGAGAACTTACAAAAGTATGGGCGGTTTACAAAGACATTTGCGTTACGAGTGTGGCGTCGAACCTCTGTTTGGTTGTCCTCATTGCCCATACAAGTattccCGCCGAATTTTGCGTAATAGCAATTCTCAAGTTATTCAAGCCACATGTGAATCGTCTGCCAAGTATAAAAGCGAACCCAGTTTAACGTGTGCTCAATGCCTGAGAACTTATAGTCGTCGTGGAGGTTTGCAAAGACATTTGCGTTATGAATGTGGCGTCAAACCTCAGTTTTCTTGTCCTTATTGTCCGCATAAATCTAAACAGAAATCTAACCTGAGAGCCCATGTCCAAAAAGCTCAAAACATCGTTGTAACAGTTGCCACAAACAGTACATTCATCGAAGCCATCTTCTGCGTCATGAAAGATACGAATGTGGAAAAGAAAGACAATTCTCGTGCCCTTATTGCCCCCACAGATAAGGCCAATCTGGACAGGCATAAGAAATACGAATGCGAAGTggatcgtaatttttcatgcCCCGTGTGTACTTATAAAGCTAAAAGAAAGGAGCATTTGAAAAGACATATGATTTTTATACATAATGTTATACTCCAACCTTCTGATAATCCGCAGGAGTTGGTTATGCATCATATGAaagttATCGACAGTGTATCGGAAACGCACGTTTTGTATTTGGATGAAAGTGGAATCATAGGTGATCCGGAATATGAAGACCAAGAATTCGATTTCCAGCCAACTTCCAAAAGGAAAAGTACTCCTGGGTATCCGAAAATCGATTTAGGAGTCGTTGTACCCAGTAGCATTGATTTACCTAGTAATCCGAATACTAGTGCTTTTACAATTGAACGTGTTGAATCGAttatgACTCGACCATTAGCTTACGAAGTGAATAAAGATTTCCCCAGATGTTCAATTTGTGGAGATAAGTCAATTGGCAAGCAGTATGGAGCTTATACTTGCGACGGATGTAAGAGCTTTTTTAGAAGATCTGTTCGCAGCAAAAGCGTATACACTTGCATATTTGGAGGAGGTTGTAGTATAACTACTGTCAATCGTAACCATTGTCGATATTGTCGTATGATGAAATGTCTAGAAGTTGGCATGCGTGTCGAAGGCTTCCAATGTGACATATGCCATAAATCGTACCAATATAAACGTAACCTGCAAAGGCACCAGAAGTACGAATGCGAAAAAGAAGCTCAGTTCCATTGCCCCtattGGTATACAAATTCTCACACGTGCCCAAAGTGTGGTAAAGCCTACACCTACAGGCCTGGATTACTGCAGCATTTGAAATACGAATGTGGCAAAGAACCCCAATTCCAGTGTCCGTTTTGCCCTCAAAAAAGCAAACGGAGATCCAACTTGAGAGTTCATATTGTCAACAGACATCCCGAATGTTATTCGAGAAGACGTAAGCATCAAGATGCCTTGTTTCATTGCCCGAAATGCTTTAAAGCTTATAATCGTCGAGATTCCATGACAAGGCATAAGAATTACGAATGCGGAATGATCTTGCCTAAATTCAAGTGTACCGTTTGTCATAAATCGTTCAAAAGAAGTGATCATTatttgcTGGCAGATGGCACAGAATGGCTCAAAGAATACGTCTTCCCTTCTACTTCCAATAAACCCAAGCAACGTTCTAGAGTTGTCGAAAATACCAAAGAATACAAACCTTATAAATGTTTGACTTGtgaTCAAAAACTAACCACGACTGAAATCATTAGAGAAATATTGGTAGTGGATGGTAATTTCGTGTGCCCAAAATGCGGTAAATCGTACCAAAGGAAATATACGGCGATTAGGCACTATAAATTCGAATGCGGCTCTTCGCCGCAGTTTCAATATCCACTATACATCAGTGGAATTGGCGCATTGGCAGAGGAATTGAAGCCCGCCCTTAGTGACGAAGAGTTCAGGCAGTTGACTACAAAACGAGCTTTTGAGTGCCCGAATTGCCATAAGTACTATTTGTATAAAGGTGGTGGAATGTTTCGAGTATTGATGAAGAACGTTGTCAAAATACCGATACCAGCTATTCGAGGCAATGCTACCGCAGCAAATCCATCTCAGAAAGCATTTGTGTGTCACTGTGGCAGAGGATACAAATACAAGGGCAATTTGAAGCAGCATCAGAGGTTCGAATGCGGTGTAGCTCCTCAGTTCGAATGCAGCGTTTGTGCTAGACCTTTTACTCATAAGAGTACGTTGAAAACGCATATGGatTCCGAATTCCCTTACGAATGCCCGAACGCATGTGGACGCAAGTACAAGTATAAAGGAAGCCTTACTGCTCATTTGAAGATCGAATGCGGAAAAGATCCTGCCAATCAATGTCATTTGTGCGATCGCAAGTTCAAAAGGTTCAATAGTTATAAGAGGCATTTGATCATTTTACACGATGGATCATCGGGAGTACGAAGTTTTACCTGCGAAGATTGCGGCAAATGTTATAAGTACAAAAGAGGCTTAACCGCCCATAAGCGTTTCTATTGCGGAAAAGAGCCCCAGTTTCAATGTGACATTTGTTCGAAACGATTTGTCGAACGAAAGTCATTGCATAATCATTTAGTCTGCATACACAAACTACTCATCAAAAGGTCGCAAGCCGAATTGTTCCAAAGGTCTTTGAAATCCTACCGACAttctattttgaattaa
- Protein Sequence
- MRHLRYECGVKPQFPCPHCPHRNLSRKSNVSRVFHTGDSIVNEEGESVTCARCLKTYSSVGNLQRHLRYECGVEPQFSCCFCSYKCKHKTNLKVHVVRKHNPIRKSSVRRLSHNDGSVDSGEREPVTCARCLKTYTAVGNLQRHLRYECGVEPQFPCPYCSHRSKRRDPLAVTSILQVNNIDGPRRSKRITRKLSVSDPGIVDKVTCDRCLRNFHGFVGGSSFQLEPVTCKSSVKHERESSVHCDRCLRSYGSIRSLQRHLHYECGVEPKFSCLHCPYKYFQGKISSSICKVEQTSCKSTAKYERKSRVLCAQCLRNYSSMNNLDRHLRYECGVDPKFSCLHWPYKYFQGEISSSSCKVKQAICKSSAKYKRKWSATCARCHKTYSSVSALGRHLRYECGVEPQFSCPYCPCKCKQKNFRREISSICCKVEQATSESPAKYERESRVTCAQCLKSYKYLRCLQRHLRYECGVKPKFSCPYCPCKCKQKNFQGEISSSSCKVEQDICKSSAKYKTESRVTCAQCLRSYRFMRNLHRHLRYKCGVEPQFSCPHCSHRNFHGEISSSCKIKQAICESSVKYESESGATPTCARCLRSYSCMHNLQRHLRYECGVEPQFSCPYCPRKYYRQFSRSKLKVEQDTRKSSAKHDKGSGVTCARCLRSYSCIATLQRHLRYECGVKPQFSCPHCQYKYYRRFSSSKLRVEQVTRKSSAKHDMGPESQQICGSQAKRATCESSAKYKRKSDVTCARCLRAYSCIGALQRHLRYECGVEPKFSCPHCPYKSKLKETMKRHVLICSSQAKQATCKPSAKYKRKSGVTCARCLRAYSCMGALQRHLRYECGVEPKFSCPHCSYKYCHRIGTICSSHVEQCSCSSCKSSTRANEEVPGLTCARCLRSFNYMFSLKSHLRYECGVEPRFPCPHCSHRSSFNVNSSSQSKRRHGQFSAKNEAVSSVTCNRCLRSYRSTKTLYYHLRYECAVEPQFMCPHCSYKYSREQTHVMMGSSAAKQSTRKSCAKNEDVPGVRCSRCLRTYHSTGTLQRHLRYECGVEPQFPCPLCSHKIGILLDSFGVSSQINHTRLFQANSSILRSSAGDKEVNGYTCSRCHRTCTSKSNLGKHLRLQCGVEPKFACPYCSHKYPHRIVLSSNSRVKQANCNFKSSAKYERAPGVTCAQCHKSYSCIGVLRRHLRYECGVEPQFSCPYCPHKYFRRITCNNSYQFEQVTCKSSDKGKGKSGVTCSQCLKTLSCMLSLQRHLRYECGVEPQFLCPHCPYKYSRRITCDRNSQVNQVICKSSSKYESNPDTTCVRCLRTYKSMGGLQRHLRYECGVEPLFGCPHCPYKYSRRILRNSNSQVIQATCESSAKYKSEPSLTCAQCLRTYSRRGGLQRHLRYECGVKPQFSCPYCPHKSKQKSNLRAHVQKAQNIVVTVATNSTFIEAIFCVMKDTNVEKKDNSRALIAPTDKANLDRHKKYECEVDRNFSCPVCTYKAKRKEHLKRHMIFIHNVILQPSDNPQELVMHHMKVIDSVSETHVLYLDESGIIGDPEYEDQEFDFQPTSKRKSTPGYPKIDLGVVVPSSIDLPSNPNTSAFTIERVESIMTRPLAYEVNKDFPRCSICGDKSIGKQYGAYTCDGCKSFFRRSVRSKSVYTCIFGGGCSITTVNRNHCRYCRMMKCLEVGMRVEGFQCDICHKSYQYKRNLQRHQKYECEKEAQFHCPYWYTNSHTCPKCGKAYTYRPGLLQHLKYECGKEPQFQCPFCPQKSKRRSNLRVHIVNRHPECYSRRRKHQDALFHCPKCFKAYNRRDSMTRHKNYECGMILPKFKCTVCHKSFKRSDHYLLADGTEWLKEYVFPSTSNKPKQRSRVVENTKEYKPYKCLTCDQKLTTTEIIREILVVDGNFVCPKCGKSYQRKYTAIRHYKFECGSSPQFQYPLYISGIGALAEELKPALSDEEFRQLTTKRAFECPNCHKYYLYKGGGMFRVLMKNVVKIPIPAIRGNATAANPSQKAFVCHCGRGYKYKGNLKQHQRFECGVAPQFECSVCARPFTHKSTLKTHMDSEFPYECPNACGRKYKYKGSLTAHLKIECGKDPANQCHLCDRKFKRFNSYKRHLIILHDGSSGVRSFTCEDCGKCYKYKRGLTAHKRFYCGKEPQFQCDICSKRFVERKSLHNHLVCIHKLLIKRSQAELFQRSLKSYRHSILN
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -