Cgly024223.1
Basic Information
- Insect
- Coenonympha glycerion
- Gene Symbol
- -
- Assembly
- GCA_963855885.1
- Location
- OY979654.1:1745122-1756950[-]
Transcription Factor Domain
- TF Family
- zf-GATA
- Domain
- zf-GATA domain
- PFAM
- PF00320
- TF Group
- Zinc-Coordinating Group
- Description
- This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contain a single copy of the domain.
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 24 0.54 1.7e+03 -0.7 0.0 7 29 268 289 266 293 0.79 2 24 0.54 1.7e+03 -0.7 0.0 7 29 312 333 310 337 0.79 3 24 1.4 4.5e+03 -2.0 0.0 7 28 356 376 354 379 0.68 4 24 0.89 2.8e+03 -1.4 0.0 7 29 400 421 398 424 0.76 5 24 3.6 1.1e+04 -3.3 0.0 9 28 489 507 488 509 0.63 6 24 0.54 1.7e+03 -0.7 0.0 7 29 531 552 529 556 0.79 7 24 0.54 1.7e+03 -0.7 0.0 7 29 575 596 573 600 0.79 8 24 0.0049 15 5.9 0.0 20 32 622 633 618 636 0.85 9 24 0.062 1.9e+02 2.4 0.2 21 33 674 686 670 688 0.82 10 24 0.048 1.5e+02 2.7 0.0 19 31 711 723 706 727 0.82 11 24 0.048 1.5e+02 2.7 0.0 19 31 758 770 753 774 0.82 12 24 0.048 1.5e+02 2.7 0.0 19 31 805 817 800 821 0.82 13 24 0.023 73 3.7 0.0 19 31 851 863 847 867 0.79 14 24 0.048 1.5e+02 2.7 0.0 19 31 898 910 893 914 0.82 15 24 0.048 1.5e+02 2.7 0.0 19 31 945 957 940 961 0.82 16 24 0.048 1.5e+02 2.7 0.0 19 31 992 1004 987 1008 0.82 17 24 0.023 73 3.7 0.0 19 31 1038 1050 1034 1054 0.79 18 24 0.048 1.5e+02 2.7 0.0 19 31 1085 1097 1080 1101 0.82 19 24 0.048 1.5e+02 2.7 0.0 19 31 1132 1144 1127 1148 0.82 20 24 0.048 1.5e+02 2.7 0.0 19 31 1179 1191 1174 1195 0.82 21 24 0.048 1.5e+02 2.7 0.0 19 31 1226 1238 1221 1242 0.82 22 24 0.048 1.5e+02 2.7 0.0 19 31 1273 1285 1268 1289 0.82 23 24 0.048 1.5e+02 2.7 0.0 19 31 1320 1332 1315 1336 0.82 24 24 0.045 1.4e+02 2.8 0.0 19 31 1367 1379 1362 1381 0.83
Sequence Information
- Coding Sequence
- ATGATTTGCCTAGACACAGAGAGTAAACTGTATCCGCTCAACAAATACAATTTGGACACAAAGTTTGAATATCTGACAGGATTTTCtCTTCACGATGTAGAGAATTTTCTGCCACAGTTTTGCATTGAGTGCGCTCAGAGGCTGACCACTTGTAGTAGCTTCAGAGAAAAGGCCCTCAGAGCGTATCACTTGTTGCTAGAAGTAGCTGAGAACAGTCAAGAGGTGGTTAAAAAAGAGAAGGTTTCTGCTAAAAAGGAGGAGAATGGTACAAAAGGGGGGGATGCATTGGCGCAGTTTAAAGTGACATTGCTGAGTTTTGAAGAACAGTTGGCTGAAATAGAGAAGAGACGGGAGAGCGCTAACTTCAAATATTCTAGATATAAATGCAACAAGTGCTTCAAAGGATTTAGTAGTGTTCCTACATATGAAAGTCATATGGAGAAGCATACTAATAAATTCGGTGAATTCGAGTGCGAAGTGTGCAAGGTCCACGTGAAGAGCGACTACCTGCTGCGGCACCACGTGCGCCACACGCACAGCGTGCGCTACACGTGCGGCAGCTGTCCCTTTGTCACCAACCAGAAATTGTCAGCGATACGTCATGAGGGTTGGCATGCGGGCAAAACATTCAAATGCCCGCACTGCGATGAAGAGTATAGTAAAAGAACGTCCTACTTGTCCCACCTAAGGGTCTCACATCCCACCGATGTCGTGTGCACGttgtgcgggttctccttcatcaaCGAGAGGGGTCTCAACATGCACATGAACCTCAAACACCGCTTCGACGACGCGCAGGTGAGCACTACACTCCTCCTGCGGGTCTCACAGCGCACTGACGCGGTGTGCACGttgtgcgggttctccttcatcaaCGAGAGGGGTCTCAACATGCACGTGAACCTCAAACACCGCTTCGACGACGCGCAGGTGAGCACTACACTCCTCCTGCGGGTCTCACAGCGCACTGACGCGGTGTGCACGttgtgcgggttctccttcatcaaCGAGAGGGGTCTCAACATGCACATGAACCTCAAACACCGCTTCGACGACGCGCAGGTGAGCACTACACTCCTCCTGAGGGTCTCACATCCCACCGATGTCGTGTGCACGttgtgcgggttctccttcatcaaCGAGAGGGGTCTCAACATGCACATGAACCTCAAACACCGCTTCGACGACGCGCAGGTGAGCACTACACTCCTCCTGCGGGTCTCACAGCCCACTGAAGCGGTGTGCACGttgtgcgggttctccttcatcaaCGAGAGGGGTCTCAACATGCACATGCACCTCAAACACCGCTTCGACGACGCGCAGGTGAGCACTACACTCCTCCTGCGGGTCTCACAGCGCACTGACGCGGTGTGGttgtgcgggttctccttcatcaaCGAGAGGGGTCTCAACATGCACATGAACCTCAAACACCGCTTTGACGACGCGCAGGTGTGCACTACACTCCTCCTGCGGGTCTCACAGCGCACGGACGCGGTGTGCGCGttgtgcgggttctccttcatcaaCGAGTGGGGTCTCAACATGCACATGAACCTCAAACACCGCTTCGACGACGCGCAGGTGAGCACTACACTCCTCCTGCGGGTCTCACAGCGCACTGACGCGGTGTGCACGttgtgcgggttctccttcatcaaCGAGAGGGGTCTCAACATGCACATGACCCTCAAACACCGCTTCGACGACGCGCAGGTGAGCACTACACTCCTCCTGCGGGTCTCACAGCGCACTGACGCGGTGTGCACGttgtgcgggttctccttcatcaaCGAGAGGGGTCTCAACATGCACATGACCCTCAAACACCGCTTCGACGACGCGCAGAGCGCGGCGGGTCCGCTGTGCGCGCCGTGCGGCATCCGCTTCGCGTCGCAGACCGCCTACGCGCAGCACCTCGAGGTGTCGCCCAAACACACGTCAGCCGATAAATTGAAAGTGAACGCTCCGAAGAGACCTCGCAAAAACCGCCTCAAACCTTTGGACTGTGAGACGCTGGAGTGTGAACAATGCGGGGTTCAAGTGAGGAGTTACAAGATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCTTCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAGATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCTTCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAGATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCTTCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAGATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAGATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCTTCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAGATGTACAGCACACACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCTTCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAGATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCTTCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAGATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAGATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCTTCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAAATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCTTCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAAATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCTTCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAGATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCTTCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAGATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCTTCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAGATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCTTCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAGATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCTTCCtgtgcgagcagtgtggacgagTCTTTAAGaatcaatgGGGGCTGAGAGACCACGTGCTGGTGAAACATTCCGGCGTAAAGGAATTCGTCTGCGACACTTGCAACAAGTCGTTCTGCCTCAAGGCGAGCCTCACAGCGCACATGAAGACGCACAGCGACTCGCAGCCGACGCACGCGTGCCCGATATGCGGCAAGCACTTCACCAGCAAGGCCAACACCAACAGGCATGTACTGGTGAGTGAACGTAACAGAGTTCGTCTGCGACACGCGCAACAAGTCGTTCTGCCTCAAGGCGAGCCTCACAGCGCACATGAAGACGCACAGCGACTCGCAGCCGACGCACGCGTGCCCGATATGCGGCAAGCACTTCACCAGCAAGGCCAACACCAACAGGCATGTACTGCGCACATGAAGACGCACAGCGACTCGCAGCCGACGCACGCGTGCCCGATATGCGGCAAGCACTTCACCAGCAAGGCCAACACCAACAGGCATGTACTGGTGAGTGAACGTAACAGAGTTCGTCTGCGACACGCGCAACAAGTCGTTCTGCCTCAAGGCGAGCCTCACAGCGCACATGAAGACGCACAGCGACTCGCAGCCGACGCACGCGTGCCCGATATGCGGCAAGCACTTCACCAGCAAGGCCAACACCAACAGGCATGTACTGAGTTCGTCTGCGACACGCGCAACAAGTCGTTCTGCCTCAAGGCGAGCCTCACAGCGCACATGAAGACGCACAGCGACTCGCAGCCGACGCACGCGTGCCCGATATGCGGCAAGCACTTCACCAGCAAGGCCAACACCAACAGGCATCTACTATCAACAACATCTTTCGAAGACAGCTACAGTGTTACGCACAGAGAGTCGCGCCCCTTCAAGTGCCACGCGTGCGAGAAGACATTCGTGAACGGCTCGTCGCGGCGCTACCACGAGCTGCACGCGCACCTCAAGCAGCCGTGGCCCAAGAAGAACCGCGGCCCGCGCCAGAGGGCCAGCCGCGCGCGCCACACCAAGGAGGCCGTGTACACCATGTGGCCCAAGGTGAGAGTCGAGAAGACATTCGTGAACGGCTCGTCGCGGCGCTACCACGAGCTGCACGCGCACCTCAAGCAGCCGTGGCCCAAGAAGAACCGCGGCCCGCGCCAGAGGGCCAGCCGCGCGCGCCACACCAAGGAGGCCGTGTAG
- Protein Sequence
- MICLDTESKLYPLNKYNLDTKFEYLTGFSLHDVENFLPQFCIECAQRLTTCSSFREKALRAYHLLLEVAENSQEVVKKEKVSAKKEENGTKGGDALAQFKVTLLSFEEQLAEIEKRRESANFKYSRYKCNKCFKGFSSVPTYESHMEKHTNKFGEFECEVCKVHVKSDYLLRHHVRHTHSVRYTCGSCPFVTNQKLSAIRHEGWHAGKTFKCPHCDEEYSKRTSYLSHLRVSHPTDVVCTLCGFSFINERGLNMHMNLKHRFDDAQVSTTLLLRVSQRTDAVCTLCGFSFINERGLNMHVNLKHRFDDAQVSTTLLLRVSQRTDAVCTLCGFSFINERGLNMHMNLKHRFDDAQVSTTLLLRVSHPTDVVCTLCGFSFINERGLNMHMNLKHRFDDAQVSTTLLLRVSQPTEAVCTLCGFSFINERGLNMHMHLKHRFDDAQVSTTLLLRVSQRTDAVWLCGFSFINERGLNMHMNLKHRFDDAQVCTTLLLRVSQRTDAVCALCGFSFINEWGLNMHMNLKHRFDDAQVSTTLLLRVSQRTDAVCTLCGFSFINERGLNMHMTLKHRFDDAQVSTTLLLRVSQRTDAVCTLCGFSFINERGLNMHMTLKHRFDDAQSAAGPLCAPCGIRFASQTAYAQHLEVSPKHTSADKLKVNAPKRPRKNRLKPLDCETLECEQCGVQVRSYKMYSTHFNRFHPDKTRTQYPAHSPQRFLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRFLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRFLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRFLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRFLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRFLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRFLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRFLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRFLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRFLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRFLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRFLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRFLCEQCGRVFKNQWGLRDHVLVKHSGVKEFVCDTCNKSFCLKASLTAHMKTHSDSQPTHACPICGKHFTSKANTNRHVLVSERNRVRLRHAQQVVLPQGEPHSAHEDAQRLAADARVPDMRQALHQQGQHQQACTAHMKTHSDSQPTHACPICGKHFTSKANTNRHVLVSERNRVRLRHAQQVVLPQGEPHSAHEDAQRLAADARVPDMRQALHQQGQHQQACTEFVCDTRNKSFCLKASLTAHMKTHSDSQPTHACPICGKHFTSKANTNRHLLSTTSFEDSYSVTHRESRPFKCHACEKTFVNGSSRRYHELHAHLKQPWPKKNRGPRQRASRARHTKEAVYTMWPKVRVEKTFVNGSSRRYHELHAHLKQPWPKKNRGPRQRASRARHTKEAV
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00354354;
- 90% Identity
- iTF_00354354;
- 80% Identity
- iTF_00354354;