Acre016322.1
Basic Information
- Insect
- Apamea crenata
- Gene Symbol
- -
- Assembly
- GCA_949629185.1
- Location
- OX451362.1:14338452-14356395[+]
Transcription Factor Domain
- TF Family
- zf-GATA
- Domain
- zf-GATA domain
- PFAM
- PF00320
- TF Group
- Zinc-Coordinating Group
- Description
- This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contain a single copy of the domain.
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 15 0.0092 37 5.0 0.1 2 14 655 667 655 667 0.89 2 15 0.0094 38 5.0 0.1 2 14 666 678 666 678 0.89 3 15 0.0094 38 5.0 0.1 2 14 677 689 677 689 0.89 4 15 0.0093 38 5.0 0.1 2 14 688 700 688 700 0.89 5 15 0.0094 38 5.0 0.1 2 14 699 711 699 711 0.89 6 15 0.0094 38 5.0 0.1 2 14 710 722 710 722 0.89 7 15 0.0093 38 5.0 0.1 2 14 721 733 721 733 0.89 8 15 0.0094 38 5.0 0.1 2 14 732 744 732 744 0.89 9 15 0.0093 38 5.0 0.1 2 14 743 755 743 755 0.89 10 15 0.0095 39 5.0 0.1 2 14 754 766 754 766 0.89 11 15 0.0094 38 5.0 0.1 2 14 765 777 765 777 0.89 12 15 0.0094 38 5.0 0.1 2 14 776 788 776 788 0.89 13 15 0.0093 38 5.0 0.1 2 14 787 799 787 799 0.89 14 15 0.0094 38 5.0 0.1 2 14 798 810 798 810 0.89 15 15 0.0084 34 5.1 0.1 2 14 809 821 809 822 0.90
Sequence Information
- Coding Sequence
- atggaaAGTGGCATCACGGACGAACCCAAGCCTGCTGTTGCAGAAACTGAGAATGTGTTGAAAACCGATTCTGCGGACTCTTTTCAATCATTCATCTACGAATTGTTTGAAAAGAATGGTGTACTGACCGATCTGAGAGCGTACTTACGTGGCCATATCGTAAATGTATTAAAAAGTGCTCAAACTGGTGATCCTCTCCCCTGTCAGAAAAAATTCACACAACGCCTGGACCTCACCTATCAAgcactaaatatattaatagCTGAATATCTGATACGCTTAgAGTTCAGCTACACTTTCTCAGTCTTCGTGTCTGAAATACCACTGGCTAACATGGTGTTTGGGTTTGCTAAGTCTTTGATGGCAAAAACTGATGAAAATAAGACGGACATGAGGTTCAAGGACACGGATGTTTGGTctatactaaattatttagGAGTGCAGTGTGATTCGGAACATGCATCGAATATTGTACAAATGTATAATAATGAGAAACAACATCCGTTACTGCTGTGCATATTGAAGGGTATGCCTATGTATAAGGAGTTTGCAGAGCCTCCACACAgtATATCAGAAGATTCTGTGGCAAGTTCGAAGTCTTTGGACAGCGCAGAAAATGATGCTCCACATAAAAAGAATTTGTCTGTAATGCCAGAGTGCAGCCACTGGGCTTATTgtaagaactgtcaaaacaaGATATATGAGCTCAAAGCCAAATATAAGAGGAAGAAGAAATATTTGGCCAAGgatattaaaaataaagagagTACTAACATAAGTCCTATGAAATGGGAGAGCCTCATTAAAAATATCAGCATTATGGAGAAAGGCCTCATCGAAGAGATGTTCCAACAACTGAAGTCGGTGTACGAGACGGAAGTGGAAATGGTAAAGGTAGAGGAAAAGAAGAAGGTGCAGCGATCGATGGCGACACACGCGATTCAGCTGCAGCAGAAACGGAACGAGTTGGAGGAGACGTTCAAAGCTCGCGAGGTTGAGATGGAACGCAGCGTGCGACATAAGAAGAAGTTCCTCTGGGGCCTGGCGCGCTCGCTGCGCGACCAGCACCAGCACATGACGCGCGCGATGCACGACGTGCGCGCCGAGACAGAACGCTTGGAGCTCAAGGAAGACAGTCTAAAAACTCAACTGGCTGAAGCTGAACAAATATTAAAGAAACGCGGAGAGGAGATGCGTTTACAAATAACGAACGAACTAACTGTCTTAGAGGGTCACCTCGAGTCTAtgaaaagagagagagaaaacaTCACACGACAGCGCTCCGAACTAGAAGACATGAAAACTGTTCACAATTCAACCAgcaaagtaattaaattacCCAATACAGAAAGCGAGGAAGTTCATTCACATTACGATTTATTACACAACGAACTAGCAATACTCAGGTCGTACCTAGAATCAACTCAGATGCAAGCGAAGTGTGTTATCGAACGGGGAACGATCACCGAGGCAAGTGAACCGGTGTCGCAGATCAATCTGACCTTAAATAACAGTCAGGGGAACGAAAAGAGTTCGAAAACCGATGATTTTGACGGAAGACTCAAAATGAATCAAGTTGTCAAtgattttagaaagaaaaatgttaatTTCAGTCAGTCGAATTTAGACGAAATCTACCGCGAGAGGAGTAGAGATCGCAGCCGGTCGTCGAACTCAAGCGAGGCCGGTGACACGGTGTCTCCCGACTTCGATCTCGAGAGGGGCCGCGACCGTGACCTCATACAGCGTCTGAGGGACGAAAATGATAGACTCAAGGCGTTTGCCAGACAGAGAAAATCCATGGTAAATTGGTTAATCCGGAATTCTGTTACTTTTATAAAACTATATACAATGATTGTTCCCAGCGCCGGCCGCGCGCGTCGGCGGCCCCGCGCCCGCGCAccgcgcctgcgcccgcgcacgtgCACGTGTTCCCGTGAGTACACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACAGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACACCACTGCTACAccactgctacaaggatcttgtGGCCTTGAACACCAGTGTGAGTGCCAGTACCCTGAACGTTGGTTGGCGGAAGGGCGCCGGCGAAGAGCTGAGCATTTTCAGTAACGCGCAGCCTCGGATCCTGGTGCCAGGAGACACGCTGCCCTTCATCGGAGTACTAAGGGACAGGCATGGGAATAATAGGAGGCCGGTgccgggccgcgcgcgcgcactGGCCGGCGTGAGCAAGCGGGCGGCGTCGCCCTTCCGCGAGCGACACGCGCCCGGTACAGGTAGCAGCATGCAGGCCGTGCGCGCGCACGTGCCCGTCGCGTCGTGCTCGCTGCAGCCTGACCACGACACGTACCAGGCCATACGggaTCGCCTACAACAGCGTCCGCGGCCTTCATTGGAAGACCGGGTTAGGGACAAGAGCCCCAAGTCCTTGCTCAAAGAGGCAAAGGAGAAACTGCGAAGGAAGGACAGCAACAAGGACTGTATCGACTTGCCGCGCGACAAGAGCCCCTCCGCGGTACTGCGCGAGGCCAAGCAGCGCCTGCGCAAGCTGGAGATAGAAGCCGAAGCCGTCGAGAAGTCCTACCTCGACTTCCGCCGCCGCCGGTCCAAGCGCGACGACCTCGCCAGGGAGCTCGACGTGACACGAAGCCAGTCGACGCAGGAACTCGATGCAACCACAAAAAATGATATAGAAGCGCAATTCGCAGACGTACAGAAATCTATGCGCGGGGACTTTGACAAGTACATCCGCGACTACAAGACCAAGTTTGACATCGGGGAGACGCACTTTAGGAACAAACCCACAGCCGCGGAGAGAGCGAAGCCCATTCCAGACACTTACATCGCACATTGTACGGACCACGTCGATGTCCACAGCAACTATCTCGAGACTCCACTCACAGAGTTCAGGAAACTGTACACGTGCGCACGACACACGCTGCGCGACGAATCCGGGCAGCAGTCGCCAAGCAAGAGCGTACGCTCGGGCAACGACAGCAGCAAGACCGAGTTCGACTACGAGAACTCGGCCCGAAACGACGCGCTACGGCAGGTCAAATACAACAAACAAGCTGAActacaaatattaaaacaaaatatcaataaaatgtataatttacCAAATTCCAATTTCGCTGCGACGGCAGAGCAACAAGTCAATCAAAACTATCCGTCGAGCGTTGAAGACGAAAACAAATTGAGAGTGGAGGTGGAAAATATCAGCGAAATACACGACATCAAATCACAAACGCAAGACATGTTGCTCGTGCTGCAGAGTTCAGTGAACTCGAGAGAAGTGACGATGTCGGGTATTGAGGTGCCTAGCGAGTTGTTTTCCACGCAGATGACAATCATAGTGAGCCCCAAGCGGCCCGGCTCCGAGATTGTCGAGGACGGAGGCCTGTGTGTGGCCTCCGAGCGGACCCGCCGCAGCCAGTCGCCCGAGCAAGCGGCACATCTCACCAGGAACGACGTCCTCGACGCCATATTCCACGCCGACCCCAACAACGAAGTCTCGAGCGTGCAGATGCAACTAGAATTGTCCAAAGACGAAGACGATTCCATCAGCGAGTACGACAAGGCAGACTACTCGGGCGAATTTGCCGTTGACTTGGACAACTACAACAGTCGCTTGGATGGCGACCACTCGCCAATATCTGTGGCCGGCATGACGGAGGACAACAACTTTTGGGAGGTCTAG
- Protein Sequence
- MESGITDEPKPAVAETENVLKTDSADSFQSFIYELFEKNGVLTDLRAYLRGHIVNVLKSAQTGDPLPCQKKFTQRLDLTYQALNILIAEYLIRLEFSYTFSVFVSEIPLANMVFGFAKSLMAKTDENKTDMRFKDTDVWSILNYLGVQCDSEHASNIVQMYNNEKQHPLLLCILKGMPMYKEFAEPPHSISEDSVASSKSLDSAENDAPHKKNLSVMPECSHWAYCKNCQNKIYELKAKYKRKKKYLAKDIKNKESTNISPMKWESLIKNISIMEKGLIEEMFQQLKSVYETEVEMVKVEEKKKVQRSMATHAIQLQQKRNELEETFKAREVEMERSVRHKKKFLWGLARSLRDQHQHMTRAMHDVRAETERLELKEDSLKTQLAEAEQILKKRGEEMRLQITNELTVLEGHLESMKRERENITRQRSELEDMKTVHNSTSKVIKLPNTESEEVHSHYDLLHNELAILRSYLESTQMQAKCVIERGTITEASEPVSQINLTLNNSQGNEKSSKTDDFDGRLKMNQVVNDFRKKNVNFSQSNLDEIYRERSRDRSRSSNSSEAGDTVSPDFDLERGRDRDLIQRLRDENDRLKAFARQRKSMVNWLIRNSVTFIKLYTMIVPSAGRARRRPRARAPRLRPRTCTCSREYTATPLLHHCYTTATPLLHHCYTTATPLLHHCYTTATPLLHHCYTTATPLLHHCYTTATPLLHHCYTTATPLLHHCYTTATPLLHHCYTTATPLLHHCYTTATPLLHHCYTTATPLLHHCYTTATPLLHHCYTTATPLLHHCYTTATPLLHHCYTTATPLLHHCYTTATPLLHHCYKDLVALNTSVSASTLNVGWRKGAGEELSIFSNAQPRILVPGDTLPFIGVLRDRHGNNRRPVPGRARALAGVSKRAASPFRERHAPGTGSSMQAVRAHVPVASCSLQPDHDTYQAIRDRLQQRPRPSLEDRVRDKSPKSLLKEAKEKLRRKDSNKDCIDLPRDKSPSAVLREAKQRLRKLEIEAEAVEKSYLDFRRRRSKRDDLARELDVTRSQSTQELDATTKNDIEAQFADVQKSMRGDFDKYIRDYKTKFDIGETHFRNKPTAAERAKPIPDTYIAHCTDHVDVHSNYLETPLTEFRKLYTCARHTLRDESGQQSPSKSVRSGNDSSKTEFDYENSARNDALRQVKYNKQAELQILKQNINKMYNLPNSNFAATAEQQVNQNYPSSVEDENKLRVEVENISEIHDIKSQTQDMLLVLQSSVNSREVTMSGIEVPSELFSTQMTIIVSPKRPGSEIVEDGGLCVASERTRRSQSPEQAAHLTRNDVLDAIFHADPNNEVSSVQMQLELSKDEDDSISEYDKADYSGEFAVDLDNYNSRLDGDHSPISVAGMTEDNNFWEV
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -