Dlut009883.1
Basic Information
- Insect
- Dyspetes luteomarginatus
- Gene Symbol
- -
- Assembly
- GCA_963669185.1
- Location
- OY769907.1:20439720-20446809[-]
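The Location field packs contig, coordinates, and strand into one string. A minimal sketch of how it could be split into parts is shown below. The `Location` type and `parse_location` helper are hypothetical names, not part of this database, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'contig:start-end[strand]' string."""
    contig: str
    start: int
    end: int
    strand: str


# contig, then ':', then start-end, then the strand in square brackets
LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end, and strand."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


loc = parse_location("OY769907.1:20439720-20446809[-]")
# Assumption: coordinates are 1-based and inclusive.
span = loc.end - loc.start + 1
print(loc, span)
```

Under that assumption the locus spans 7090 bp on the minus strand of OY769907.1.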
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The domain follows the pattern #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X is any amino acid, the numbers give the count of residues, and the positions marked # are important for the stable fold of the zinc finger. The final position can be either His or Cys. Structurally, the C2H2 zinc finger consists of two short beta strands followed by an alpha helix; in DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this core does not cover every site: among others, it excludes the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
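| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 20 | 0.62 | 50 | 4.7 | 2.2 | 2 | 23 | 231 | 253 | 231 | 253 | 0.96 |
| 2 | 20 | 0.057 | 4.6 | 8.0 | 2.9 | 1 | 23 | 320 | 342 | 320 | 342 | 0.97 |
| 3 | 20 | 0.021 | 1.7 | 9.3 | 0.2 | 1 | 23 | 391 | 413 | 391 | 413 | 0.98 |
| 4 | 20 | 0.7 | 57 | 4.5 | 5.1 | 1 | 20 | 480 | 499 | 480 | 501 | 0.93 |
| 5 | 20 | 0.83 | 67 | 4.3 | 0.6 | 1 | 10 | 513 | 522 | 513 | 536 | 0.91 |
| 6 | 20 | 0.084 | 6.8 | 7.4 | 0.2 | 3 | 21 | 543 | 561 | 541 | 562 | 0.94 |
| 7 | 20 | 0.25 | 21 | 5.9 | 0.5 | 2 | 23 | 608 | 630 | 607 | 630 | 0.94 |
| 8 | 20 | 2.1e-07 | 1.7e-05 | 25.0 | 0.4 | 2 | 23 | 687 | 709 | 686 | 709 | 0.96 |
| 9 | 20 | 1.9 | 1.5e+02 | 3.2 | 0.1 | 1 | 23 | 715 | 738 | 715 | 738 | 0.92 |
| 10 | 20 | 0.00048 | 0.039 | 14.5 | 0.1 | 1 | 23 | 745 | 768 | 745 | 768 | 0.96 |
| 11 | 20 | 0.00083 | 0.068 | 13.8 | 0.5 | 3 | 23 | 778 | 798 | 776 | 798 | 0.92 |
| 12 | 20 | 0.034 | 2.8 | 8.7 | 0.1 | 2 | 20 | 854 | 872 | 853 | 872 | 0.95 |
| 13 | 20 | 0.0029 | 0.23 | 12.1 | 0.1 | 1 | 23 | 891 | 913 | 891 | 913 | 0.98 |
| 14 | 20 | 0.00023 | 0.018 | 15.5 | 1.6 | 2 | 23 | 931 | 952 | 931 | 952 | 0.97 |
| 15 | 20 | 0.00022 | 0.018 | 15.5 | 0.1 | 1 | 23 | 974 | 997 | 974 | 997 | 0.95 |
| 16 | 20 | 0.59 | 48 | 4.8 | 0.1 | 2 | 23 | 1051 | 1073 | 1050 | 1073 | 0.95 |
| 17 | 20 | 9.5e-05 | 0.0078 | 16.7 | 1.6 | 2 | 23 | 1099 | 1121 | 1098 | 1121 | 0.93 |
| 18 | 20 | 1 | 83 | 4.0 | 1.4 | 1 | 23 | 1130 | 1154 | 1130 | 1154 | 0.93 |
| 19 | 20 | 0.0015 | 0.12 | 13.0 | 0.4 | 1 | 23 | 1159 | 1182 | 1159 | 1182 | 0.98 |
| 20 | 20 | 0.0023 | 0.18 | 12.4 | 0.3 | 2 | 23 | 1213 | 1235 | 1213 | 1235 | 0.96 |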
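The hmmscan domain rows above have 13 whitespace-separated fields each, so they can be read back into records and filtered by significance. The sketch below is one possible way to do that. `DomainHit` and `parse_flat_rows` are hypothetical names, `SAMPLE` holds four rows copied from this record (domains 7, 8, 10, and 17), and the i-Evalue cutoff of 0.01 is an illustrative assumption rather than a threshold used by the database.

```python
from typing import List, NamedTuple

FIELDS_PER_ROW = 13  # number of columns in the domain table above


class DomainHit(NamedTuple):
    """Hypothetical record for one row of the hmmscan domain table."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_flat_rows(text: str) -> List[DomainHit]:
    """Group whitespace-separated tokens into 13-field domain rows."""
    tokens = text.split()
    if len(tokens) % FIELDS_PER_ROW != 0:
        raise ValueError("token count is not a multiple of 13")
    hits = []
    for i in range(0, len(tokens), FIELDS_PER_ROW):
        row = tokens[i:i + FIELDS_PER_ROW]
        hits.append(DomainHit(
            int(row[0]), int(row[1]),
            float(row[2]), float(row[3]), float(row[4]), float(row[5]),
            *(int(value) for value in row[6:12]),
            float(row[12]),
        ))
    return hits


# Four rows copied from the table above (domains 7, 8, 10, and 17).
SAMPLE = (
    "7 20 0.25 21 5.9 0.5 2 23 608 630 607 630 0.94 "
    "8 20 2.1e-07 1.7e-05 25.0 0.4 2 23 687 709 686 709 0.96 "
    "10 20 0.00048 0.039 14.5 0.1 1 23 745 768 745 768 0.96 "
    "17 20 9.5e-05 0.0078 16.7 1.6 2 23 1099 1121 1098 1121 0.93"
)

hits = parse_flat_rows(SAMPLE)
# Assumed cutoff for illustration only: keep domains with i-Evalue below 0.01.
strong = [hit.index for hit in hits if hit.i_evalue < 0.01]
print(strong)
```

With this cutoff, only domains 8 and 17 pass among the sampled rows. A looser or stricter threshold would change the selection, so the cutoff should be chosen to suit the analysis.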
Sequence Information
- Coding Sequence
- ATGGTCGACGTTGTGGATGTGCAAAATATGGAAAACGTGTGTAGATTGTGTCTCTCCATGGAGGAACCAAAATCCTCGGTGTTCGCCGATCAAGCATCCTCCGTTTCGTTAGCTGCGAAAATACAAGCTTACTTGTCAATCGAGATTTTCGCGACCGACAAAGTATCGACGTTGATATGCGGTGACTGTCTGACGAGGGTCAACAAATGGCACGTTTACAAGGAATATTGTCTGCGCTCGCAACAAAAGTTGCAACAATGCCTTGGAGATCAATCGGAAAATCCGCCAAATATAAAGATCGAGCCAATGGACATCTGCGAGCCTCCAAGTCCGCTTGAAACGGTGCCATTGACATCGGACGACGAGTTGGAAAGTTTATCGTCCCATCGGAACGACACAATGCCCACAATATCTCTATCAGCCAGCAATATTCAAGTGATACAACAACCAATGACAACGATAAAGCGCGAACCCCCAGACGAGATTCGTCCAGTCATTGAGATCGAGCAAATGGCAAATCCAGAATTATTGCTAAATCCAATGGCAATCGCATGTCCTAAGGACGATCGTTTAATGGAAGCAGACTCGACGCAAAAACTCGTGAAAAAGACgagagttggaaaaataaagaaaaaacctCGGCGTGGACCTCACACGCATTATCGAGGTCCAGGTCGTCAGTATAAACGACGCTGTCCATATTGCCAGATTTATTTACACTCCAAGACCTCTTATGGCAAGCACATGGAGCGTTTTCACAGTGGCAAGATAATTGGTCAGAAAGCAATGAAGCATGGTTCAAAGGGAATTATCGAAAATGTGGAGACAGTAAAATATTCCGAGATTAATGAAAATCTCGATGAGATGATCGAGGATGTCGAGGATGAGCTCGCGAGCCTTGAAAAAGATTCACCTCTGACCCCAGTACAACAAAATATCATAAGTCAgctaaaaacattttcatgcTATTCATGTCGACAATCGTTTAACGATCGTCGCAGTACTTTGAATCACATAAGACAGCACATGCCGGATCTTCGTCCTTACACGTGTATCGCGTGTTTAACAGAATTTGCTGATCGTTCAATTTACAAGCTTCATTGTGGCGCCTCGTTCCAATGCGCCATGAAAATTGCCCTAGTTGTACCCGAGCATggaacagaaaaatatttcacctgCAATATGTGCCTCAAGTCTCTACCAGGCAGAAGAGAACTCTTGAGTCACCTGGCGCGACACTCGGACAAGCAATACGAGCAAATGGTCGCTCAACCCTTTAATTCTCCACCCAGGCTCAAGccaatgacaaaaataatttcgctgaaaaaagaaatgacaATGATCGTTCCAAAAACTACCATCAGCTCAAACTCAACTTCCCCTCAGCCCAAAAAGCCCATTCCTGGACCGTACAAAAATGGCGATCCAGCTTTCAATCATAAATGCCAATGTTGCGGTATGATATACAAATACAAGCAAAATTTGCACAAACATCTATCATTGTGCAAAAAATTGCCGGCTGATGTGCGCACTTGTTATCAGTGTGCACATTGCGGCTTGACTTTTCTCGCTTTTAACAAATTTGTCGGTCATCATAATAACGAGCACAAGAGACGTAATATCATTTGTTCAAATTGTAGAGCAAAATTCAAAGattcaaatgattatttgGTACATTACAGAGATATATGCGCCCCTGTGGTACGTCGCGAGCGGAAAAGACCGGAAGTAACGAAATcgcatgataaaaatttgaatacgaAGATTCttgatgataagaaaaatattgagattggaggaaaaatgaataattatagatGGCGGTGTACTGTTTGTCCTGACAGTACATTTCCAACGAGAAGCAAATTGTACgagcacaaaaaaattcattccaaACAGAAACTCGAGACTCAGATGGCTGATCATGAGAttggaaattttccatttgcgCCACCAGAAAATACGAGCCTTCCTCTCTTGGATAACGAGCGTATAAAC
GACGAAACAGCAAATAccagtaatttttcaattcaatcagATCCTGGTGTCAATACGACAGAGTGCCAAGAATGCGGCAAAGTATTTGCGAGTAATACGAATTTACGACGTCACGTTAGAAGCGTTCACAATACAATTGGACGTTTCAGCTGTTTCACGTGTGCGATGACATTCATTCTTGAGGAACAATGGCGCGAGCACGTTGAGAACGAGCACAGTAACACGAGAATGACCTTTAATTGTCCTTTATGCCCGGAGATATTTGATACcgaggaaaaattggaaaatcatcgtcaaataattcatggaAGAAACGATGAGGATCATCTTGCCTGTTACATATGTggaaaaatttttatcaacgaaaCTTCATTAAAGATTCATCGTGGTCATCATTTTCGTGTAAATTCACGTTTGAGCATTGGAAAGCCAAATGGAGATAAAATTGAGGCAAAGATCGAGGCAAAAAGTATCGGGGAAGTTAAAAAATCACCAACAGCAAGAAAATCATTTCCAAACTCGAGTCCACAGAGGCAatcgtcatcgtcattgtCACTTTTACAATGTCAAGTTTGTGACGATCGTTTTAGCGATGTCGCTGAGCTAAGAAAGCATCTCTGGGATGTTCATTGTGCACGTAATaagcctgaaaaaaattacaaaactcACGAGTATCAGTGCGAATTATGCACAAATGTTTTACCCGACGAGGAGAGTCTGACAAGCCACATGAAATGGCACACGAATAATCCAATATTGagcaatgttgaaaaattcgagataCCCGAGGCACAATGTGACTTGTGCGGTAAATTTTACAGCAGCATGGCGAGTCTTCgtaagcacaaaaaaaatcacaaaatggCTGACAATTCATTAATGGGAAATTTATCAGATACAGGAAGAAGACCAAGAATTGGTTATACTTGCGGTATTTGCGATAAAGTATTTGCCGTAAAAATAACACTCGACAGACACAAGGAAGTTGCACATCCCACCTCATGCTCACCGGATACATCAAAAAGACTCTCTGGCCAGGTCAAGAGTCAAACGGCCATTAAGCCAATTAGaccaaaatttgattttgacGCGATTGTGACAAATCATCGTGCTCTCGGTCAATCCtcaaaaattgatgcaaaaaagCCAATCACGTGTAATCTTTGCCGAAGAGTTGTGCCTGGAATTAGAGCACTTTACAAGCATCGTCAGCGTGTACATAAAATAACGATTGACGATGACGCAACAATGCTAAGGGATGATTATTACAATGAGGGAAATTGTGACGAATCGGTATCTTGTACAATTTGTACAAAAACATTCtccaatttatcaaatatgaAACAACATTTTACAAAAGTTCATGGCAGTGGACGTACCGGCTCTCAATTCAAATGCTCCTTTGATGGCTGTAATctcgaatttaaaaatcgtctcGCCAaggaaaatcatgaaaaatttcattccaacGGGCTGTACAGATGTGCCCTTTGCCCTCGTCACATGACAAATCGTGGAGCTTTAAATCAGCACATGATAACCGCGCACAAAACAAATTACACGGAGGAACGTAGAAAATCAGTCTCGTGGAAAAACATTGACCTTGACAATTATGTTGTCAAGGGTGCCAATGGGAGACAATGTcctatttgcaaaattatttatccaaatCACAAAGCTCTCAAGGTGCATTATCTGAAGATTCATGAGGGCGCATAA
- Protein Sequence
- MVDVVDVQNMENVCRLCLSMEEPKSSVFADQASSVSLAAKIQAYLSIEIFATDKVSTLICGDCLTRVNKWHVYKEYCLRSQQKLQQCLGDQSENPPNIKIEPMDICEPPSPLETVPLTSDDELESLSSHRNDTMPTISLSASNIQVIQQPMTTIKREPPDEIRPVIEIEQMANPELLLNPMAIACPKDDRLMEADSTQKLVKKTRVGKIKKKPRRGPHTHYRGPGRQYKRRCPYCQIYLHSKTSYGKHMERFHSGKIIGQKAMKHGSKGIIENVETVKYSEINENLDEMIEDVEDELASLEKDSPLTPVQQNIISQLKTFSCYSCRQSFNDRRSTLNHIRQHMPDLRPYTCIACLTEFADRSIYKLHCGASFQCAMKIALVVPEHGTEKYFTCNMCLKSLPGRRELLSHLARHSDKQYEQMVAQPFNSPPRLKPMTKIISLKKEMTMIVPKTTISSNSTSPQPKKPIPGPYKNGDPAFNHKCQCCGMIYKYKQNLHKHLSLCKKLPADVRTCYQCAHCGLTFLAFNKFVGHHNNEHKRRNIICSNCRAKFKDSNDYLVHYRDICAPVVRRERKRPEVTKSHDKNLNTKILDDKKNIEIGGKMNNYRWRCTVCPDSTFPTRSKLYEHKKIHSKQKLETQMADHEIGNFPFAPPENTSLPLLDNERINDETANTSNFSIQSDPGVNTTECQECGKVFASNTNLRRHVRSVHNTIGRFSCFTCAMTFILEEQWREHVENEHSNTRMTFNCPLCPEIFDTEEKLENHRQIIHGRNDEDHLACYICGKIFINETSLKIHRGHHFRVNSRLSIGKPNGDKIEAKIEAKSIGEVKKSPTARKSFPNSSPQRQSSSSLSLLQCQVCDDRFSDVAELRKHLWDVHCARNKPEKNYKTHEYQCELCTNVLPDEESLTSHMKWHTNNPILSNVEKFEIPEAQCDLCGKFYSSMASLRKHKKNHKMADNSLMGNLSDTGRRPRIGYTCGICDKVFAVKITLDRHKEVAHPTSCSPDTSKRLSGQVKSQTAIKPIRPKFDFDAIVTNHRALGQSSKIDAKKPITCNLCRRVVPGIRALYKHRQRVHKITIDDDATMLRDDYYNEGNCDESVSCTICTKTFSNLSNMKQHFTKVHGSGRTGSQFKCSFDGCNLEFKNRLAKENHEKFHSNGLYRCALCPRHMTNRGALNQHMITAHKTNYTEERRKSVSWKNIDLDNYVVKGANGRQCPICKIIYPNHKALKVHYLKIHEGA
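The C2H2 pattern quoted in the Description, #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], can be approximated by a regular expression and run over a protein sequence. The sketch below is a rough illustration and not a substitute for the HMM search. The Description does not say which residues are allowed at the # positions, so they are treated as any residue here, and `find_c2h2` is a hypothetical helper name. `FRAGMENT` is a short stretch copied from the protein sequence above.

```python
import re
from typing import List, Tuple

# #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
# Assumption: '#' positions are treated as any residue, so the 12 residues
# between the second Cys and the first His (X3 + # + X5 + # + X2) collapse
# into '.{12}'. The lookahead lets candidate matches overlap.
C2H2_RE = re.compile(r"(?=(.{2}C.{1,5}C.{12}H.{3,6}[HC]))")


def find_c2h2(protein: str) -> List[Tuple[int, str]]:
    """Return (1-based start, matched text) for each candidate C2H2 finger."""
    return [(m.start() + 1, m.group(1)) for m in C2H2_RE.finditer(protein)]


# Short fragment copied from the protein sequence of this record.
FRAGMENT = "YRCALCPRHMTNRGALNQHMITAHK"

candidates = find_c2h2(FRAGMENT)
print(candidates)
```

On this fragment the expression finds a single candidate starting at position 1 of the fragment. Because the # positions are unconstrained, this regular expression is more permissive than the profile HMM and may report candidates that hmmscan would not.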
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -