Agra007880.1
Basic Information
- Insect
- Aldrichina grahami
- Gene Symbol
- -
- Assembly
- None
- Location
- Contig11:1498259-1500919[+]
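The location string above can be split into its parts programmatically. The following is a minimal sketch, assuming the `contig:start-end[strand]` layout seen in this record and assuming 1-based inclusive coordinates (the record does not state the convention); `Location` and `parse_location` are hypothetical helper names, not part of any database API.

```python
import re
from dataclasses import dataclass

# Assumed layout: <contig>:<start>-<end>[<strand>], e.g. Contig11:1498259-1500919[+]
LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


@dataclass(frozen=True)
class Location:
    contig: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location string; raise ValueError if it does not fit the assumed layout."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("Contig11:1498259-1500919[+]")
print(loc.contig, loc.start, loc.end, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, the span covers the whole gene region on the contig, which can be longer than the coding sequence if introns are present.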
Transcription Factor Domain
- TF Family
- zf-GAGA
- Domain
- zf-GAGA domain
- PFAM
- PF09237
- TF Group
- Zinc-Coordinating Group
- Description
- Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
- Hmmscan Out
-
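| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|----|----------|----------|-------|------|----------------|--------------|----------------|--------------|----------------|--------------|------|
| 1 | 6 | 0.00023 | 0.58 | 9.8 | 0.1 | 16 | 50 | 41 | 75 | 32 | 79 | 0.85 |
| 2 | 6 | 0.00028 | 0.72 | 9.5 | 0.2 | 23 | 46 | 76 | 99 | 72 | 105 | 0.88 |
| 3 | 6 | 0.0034 | 8.5 | 6.1 | 0.2 | 20 | 44 | 101 | 125 | 97 | 133 | 0.85 |
| 4 | 6 | 0.039 | 98 | 2.7 | 0.1 | 21 | 44 | 130 | 153 | 127 | 161 | 0.86 |
| 5 | 6 | 0.0011 | 2.9 | 7.6 | 0.0 | 23 | 45 | 160 | 182 | 154 | 188 | 0.91 |
| 6 | 6 | 2.6 | 6.6e+03 | -3.2 | 0.0 | 22 | 32 | 213 | 225 | 210 | 229 | 0.65 |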
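The per-domain hmmscan rows above can be loaded and filtered with a few lines of code. This is a rough sketch, assuming the 13-column order given in the header of this record; the `DomainHit` class, the helper names, and the independent E-value cutoff of 1.0 are illustrative choices, not values taken from the database.

```python
from dataclasses import dataclass

# Rows copied from the Hmmscan Out field of this record (13 whitespace-separated columns).
ROWS = """\
1 6 0.00023 0.58 9.8 0.1 16 50 41 75 32 79 0.85
2 6 0.00028 0.72 9.5 0.2 23 46 76 99 72 105 0.88
3 6 0.0034 8.5 6.1 0.2 20 44 101 125 97 133 0.85
4 6 0.039 98 2.7 0.1 21 44 130 153 127 161 0.86
5 6 0.0011 2.9 7.6 0.0 23 45 160 182 154 188 0.91
6 6 2.6 6.6e+03 -3.2 0.0 22 32 213 225 210 229 0.65
"""


@dataclass(frozen=True)
class DomainHit:
    number: int      # domain number ("#")
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float     # bit score
    ali_from: int    # alignment start on the protein
    ali_to: int      # alignment end on the protein
    acc: float       # mean posterior accuracy


def parse_rows(text: str) -> list[DomainHit]:
    """Turn each 13-column row into a DomainHit, skipping malformed lines."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 13:
            continue
        hits.append(
            DomainHit(
                number=int(fields[0]),
                c_evalue=float(fields[2]),
                i_evalue=float(fields[3]),
                score=float(fields[4]),
                ali_from=int(fields[8]),
                ali_to=int(fields[9]),
                acc=float(fields[12]),
            )
        )
    return hits


def filter_hits(hits: list[DomainHit], max_i_evalue: float = 1.0) -> list[DomainHit]:
    """Keep hits at or below the cutoff (1.0 is an arbitrary illustrative value)."""
    return [h for h in hits if h.i_evalue <= max_i_evalue]


hits = parse_rows(ROWS)
passing = filter_hits(hits)
for h in passing:
    print(f"domain {h.number}: residues {h.ali_from}-{h.ali_to}, i-Evalue {h.i_evalue}")
```

With this illustrative cutoff only the first two domains are retained; a stricter or looser threshold would change which of the six reported domains pass.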
Sequence Information
- Coding Sequence
- ATGGCCTCGAGAGAGGAATTTGAGGAAACGATGCCACTCAATGGTGTCGTAGCCATGTATACATGCAAAAGGTGCAACAAGATATTCCCATGTAAACGTGACCAACTTTTGCACAAAAAAGAAGTGCATTCCCAAACAAAATCCTCCTATGAGTGTAAAATATGTTCAAAGTACTTCTGCAATAGCGGCAATTTGGAACGGCATATGAAGGTACACAATGATGTACGTCCCTTTGTGTGTCGCATCTGTGGTAAGGCTTTCGCCCAGTCTGTTAATTTAAATCGTCACTACTCCGTACACAATGGTGAACGCCCGTATCCGTGTACATTTTGTACCAAAACATTTACCCAACAATCGAACATGCAACGCCATCAGCTGACACATACAGGAGAGAAACCCTTTCGCTGCAAACGTTGTGGCCGTTACTTCTCCCAGCGTGTTAACCTTAAAAAGCATATTATGGGTCATTTAAATACCAAACCATACACGTGCAAAATATGTCAAAAATCATTTATACAATTGGGCAATTTCAAGAAGCATCTACAATCTCACCTCAAAGATGGCATAGAGATCGATATGAAGGCGTTAGTGGCTGAGGCGCAGGCAGTGGCAAAGCAAAGTTTGGAAATGGCTGATGAACAACCAGTTGGCTTTGAGTGTGCCGTCTGCCGTTCGATTTTCAATAATTTCACCGATTTTGAAATGCATGAAAGTGATTGCAACGACAATGCACAGGCTGCCATGGAACAAAAATATAATCCAGATGAACAACTACATGAAGTTGTTGTCGAAGAGGTTGAAGACCACGACCACGTAGAAGAACAAATAATAGCCCATGATGGTGGTGTCGGAGGTGGACACTTTGGCAGTTATGCACCATTAAAATTTAGTGTCTCACAGTTGGATACACATGAAATAATAATTGAAACCAGTCGATAA
- Protein Sequence
- MASREEFEETMPLNGVVAMYTCKRCNKIFPCKRDQLLHKKEVHSQTKSSYECKICSKYFCNSGNLERHMKVHNDVRPFVCRICGKAFAQSVNLNRHYSVHNGERPYPCTFCTKTFTQQSNMQRHQLTHTGEKPFRCKRCGRYFSQRVNLKKHIMGHLNTKPYTCKICQKSFIQLGNFKKHLQSHLKDGIEIDMKALVAEAQAVAKQSLEMADEQPVGFECAVCRSIFNNFTDFEMHESDCNDNAQAAMEQKYNPDEQLHEVVVEEVEDHDHVEEQIIAHDGGVGGGHFGSYAPLKFSVSQLDTHEIIIETSR
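The relationship between the coding sequence and the protein sequence above can be spot-checked by translating the CDS. The sketch below assumes the standard genetic code and, to stay short, translates only the first 30 and last 15 nucleotides of the coding sequence from this record; `translate` is a hypothetical helper, not a function from any specific library.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    residues = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 nt and last 15 nt of the Coding Sequence field in this record.
CDS_START = "ATGGCCTCGAGAGAGGAATTTGAGGAAACG"
CDS_END = "GAAACCAGTCGATAA"

print(translate(CDS_START))  # compare with the start of the Protein Sequence field
print(translate(CDS_END))    # compare with the end of the Protein Sequence field
```

Passing the full coding sequence to the same function allows a complete comparison against the protein sequence, assuming the record uses the standard code and the CDS is in frame from its first base.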
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01375146;
- 90% Identity
- iTF_00045541; iTF_00259146; iTF_00921761; iTF_01074753; iTF_01236321; iTF_00331714; iTF_01237318; iTF_00350663; iTF_01162471; iTF_01237031; iTF_01236013; iTF_01194926; iTF_01238222; iTF_00350383; iTF_01427951; iTF_01261553; iTF_00921587; iTF_00200118; iTF_00742166; iTF_01397782; iTF_01201879; iTF_00331994; iTF_01194618; iTF_00435836; iTF_01237948; iTF_01075037; iTF_01427678; iTF_01261247; iTF_00200416; iTF_00760359; iTF_01314888; iTF_00998210; iTF_01313415; iTF_01315149; iTF_01314062; iTF_01315992; iTF_01314329; iTF_01315739; iTF_01313182; iTF_00997952; iTF_01174476; iTF_01374325; iTF_01374628; iTF_01398845; iTF_01398094; iTF_00899966; iTF_01109540; iTF_01138297; iTF_00900894; iTF_01177183;
- 80% Identity
- iTF_00045541;