Cinc020996.1
Basic Information
- Insect
- Clistopyga incitator
- Gene Symbol
- Mta1
- Assembly
- GCA_947507545.1
- Location
- OX382187.1:3962550-3974898[+]
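The Location field packs the sequence accession, start, end, and strand into one string. A minimal sketch of how it could be split into parts follows; the `Location` type and the helper names are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed Location field."""
    seqid: str
    start: int
    end: int
    strand: str


# Matches strings shaped like "OX382187.1:3962550-3974898[+]".
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a 'seqid:start-end[strand]' string into typed fields."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(
        seqid=match["seqid"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


def span_length(loc: Location) -> int:
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("OX382187.1:3962550-3974898[+]")
print(loc.seqid, loc.start, loc.end, loc.strand, span_length(loc))
```

The genomic span covers introns as well as exons, so it is expected to be longer than the coding sequence listed under Sequence Information.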
Transcription Factor Domain
- TF Family
- zf-GATA
- Domain
- zf-GATA domain
- PFAM
- PF00320
- TF Group
- Zinc-Coordinating Group
- Description
- This domain uses four cysteine residues to coordinate a zinc ion and binds to DNA. GATA transcription factors contain two GATA zinc fingers; however, several proteins contain only a single copy of the domain.
- Hmmscan Out
-
  #  of  c-Evalue  i-Evalue  score  bias  hmm coord from  hmm coord to  ali coord from  ali coord to  env coord from  env coord to  acc
  1  2   3e-11     6.8e-08   32.3   8.5   1               35            494             530           494             531           0.97
  2  2   2.4       5.5e+03   -2.6   0.1   25              35            571             581           571             582           0.88
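The Description states that the domain coordinates zinc through four cysteine residues. GATA-type fingers are commonly written as the spacing pattern C-X2-C-X17-C-X2-C, and the sketch below scans a protein string for that pattern. The regular expression, which also allows an 18-residue loop, and the function name are illustrative assumptions, not the profile HMM that hmmscan applies. The test string is a short excerpt of this record's protein sequence.

```python
import re

# Four cysteines with spacing C-X2-C-X17(or 18)-C-X2-C.
# This is a simplified pattern for illustration only; the real
# PF00320 model is a profile HMM, not a regular expression.
GATA_LIKE_RE = re.compile(r"C.{2}C.{17,18}C.{2}C")


def find_gata_like_motifs(protein: str) -> list[tuple[int, int, str]]:
    """Return (start, end, motif) tuples with 1-based inclusive positions
    relative to the string that was passed in."""
    hits = []
    for match in GATA_LIKE_RE.finditer(protein.upper()):
        hits.append((match.start() + 1, match.end(), match.group()))
    return hits


# Excerpt copied from the Protein Sequence field of this record.
excerpt = "GKPCESCQNSQSPQWYAWGPANMQCRLCQPCWMYW"

for start, end, motif in find_gata_like_motifs(excerpt):
    print(start, end, motif)
```

Positions are reported relative to the input string, so running the function on the full protein sequence instead of the excerpt is needed to compare a hit with the alignment coordinates in the hmmscan output.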
Sequence Information
- Coding Sequence
- ATGCCGATGCCGGCCGAATACCACGAGAGCCACGGGAACCACGAGAGCACAAATTTCGGGCTCGCGGCCGCACAGGACGCCAGCAACATGACGGCCAACATGTACCGGGTGGGAGATTACGTGTATTTCGAGACGTCATCTACATCTCCCTATCAAATACGAAGGATAGAAGAGTTAAATAAGACGGCCAGTGGAAATGTGGAAGCAAAGGTGATGTGCTTCTTCAGGCGGAGGGATTTGCCTTCCACTTTGATTATGCTTGCGGACAAGCATCAATTGGCAAGCGTGGAGCAGCAGAGGTCAGATTCCCCTGCAAGTACGCAGAGTCAGAAACTCGAGACTCCAGACTCGACTAAGGAAATGACTAGTAAAGATGGGATTGGGCCCAAAGTGATGAGCAAAGGGGGAGGTAAAGGTGGCTGGCTGAAAGCCCCACTGTCTGAGGCCCACGAGCCCCACGGGATGGAAGAGGGAATTGTCGGCGCTGGTGGTGTGGCCGACTTGAGTTCGAAGCAACGTCACCAAATGAAGCACAGAGAACTGTTCCTGTCACGGCAGGTCGAAACTATGCCGGCGACCCACATAAGAGGCAAATGTTGTGTAACCCTACTCAACGAGACCGAGTCCTTGTTGAGTTACCTCAACAAGGAAGACTCCTTCTTCTACTGTTTGGTATTCGACCCTACCCAGCGTACATTACTCGCAGACAAAGGCGAAATACGAGTGGGTAGTAGATATCAAGCCGATGGAATATCATCGACAGTTCTGACCCCGGCGGAAAGGGAGGCGGATCCTCGAAGATTGCAGGACCTCGAAACGTTAGTTTGGACCCCGAGACACGCATTGACGGATCGTCAGATAGATCAATTTTTGGTCGTCAGTCGATCGGTAGGAACTTTTGCCCGGGCGCTCGACTGTTCGTCGTCCGTGAAACAACCGTCACTGCACATGTCCGCGGCGGCAGCTTCACGAGATATTACCCTTTTTCACGCGATGGATACCTTACACCGGCACAATTACGATGTAGCGAAAGCAATGTCATCATTAGTGCCCAGTTCGGGTCCGGTATTATGTAGAGACGAGATGGAAGAGTGGAGCGCGTCTGAGGCTAATCTGTTCGAAGAGGCTCTTGATAAATATGGAAAGGATTTCTCGGACATTCGTCAGGATTTTCTGCCGTGGAAAACATTGAAGAACGTGATCGAGTACTATTATATGTGGAAAACGACGGATCGTTACGTACAGCAGAAGAGAGTGAAAGCGGTCGAAGCCGAGAGCAAATTGAAACAAGTATACATACCGAATTACAACAAAGCTCCGGCATCGGGCAACGCGCCGACATCCGCGAGCATACTACCGCTGAGCAACAGCAATAATAGCAGTAACGGAAAGGCGATCAGTGTTCTCAACGGAAGTAGTAACGGCAATATCACGAGCGATAATAGCGGTATGCTTATGGTAGGACTCGGCGGTAAACCCTGTGAGAGTTGTCAAAATTCGCAGAGTCCCCAGTGGTACGCCTGGGGACCCGCGAACATGCAATGTCGGCTGTGTCAACCGTGTTGGATGTACTGGAAGAAGTACGGAGGGCTCAAGGTACCAACACGCATCGACGACGCCGATTTGGAGCGCAAAAGAGGAGGCGTCGGTTCCGATGAGGAGAGTAAAGGAATGAGTGGAGCTCATCGGCCTCACCGATGCAGCATACCCGCGTGCGGCAAAGAGTTCAAATTGAAAGCGCATCTGAGTCGTCACTACGCGAGCGCTCACGGCGTCGACTTACGAGGAAGCGGAGCCGGCGGAGGCGGTGGAAGCGGATCTCCGCGTCCCGTCATGAAAACCCGATCGGCGTTCTATCTAAGAACCTCGGCGCTTGCACGCGCCGCTCGTAGACTTTGTGCGGCGCAATTGCGCACCCGTCACGCGGCGCGAGCTCCTCATCAACCGATCAACGCCGCACCATTGCGGCATCTCTGCGCTTCACCCCAACTCACCTCG
AAGAGCCCCGCCGAACTGCGAATTCTCGCACGTGCCGTGAGACCGCGACCGCGTCCGCGAGTCACCGACATCGCGACTCGTCTCGGCGATCATCCCTCGCCCAGACAACCCGGAGACTGGGATTGGCTCGCGCTCACATTACCCGCCCAACGCAAGCAACCCGATCGCGTATCGTTTCCGCGACCACCTAAAGCCGCCGACGGCAGTCTATTGTACGAAAGAGTACCGAACAAGCCAGAAGTCGACAGGTTGCCATTAACGCCACCACAACCGCAGCCCGCGATGCAGGCTCAGCAAACGATATTAAAGCGCACAAGGCCGCCCTTCGATGAGATCAACGGTTCGGATGGTATTGCCCTGAATGCAGGGCTCCCGGGTGGACCGCCTGCGAAACGGGCTCACCATTCTCAACAAGTGCACCCGAAGCACAATTTGGAGCACGCGACACCACCGGTTGTGCCACTTGCGCCACCGCTCAACGGCAGAGCCGCTCATCCACATTTTTTGCCCCACGGACCGCCGCTGTCTAGAAGCAACGCCCGCAAACAGGTCATTTCGTGGATGGACGCACCTGACGATGTTTACTTCCGCGCCTCGGATCAGACAAAAAGAATCCGAAGAACCCTAACGTCGGTTGAGTTGCGAAGAGCGGCGCGCAAGCCCTGGCGCAGATTGCCAACCCCTCTGCATCCTCCTCATCCGCAGAGAGCGGCATCGCGGGGCGACGATATGGTCGTCATCCTCGACTGA
- Protein Sequence
- MPMPAEYHESHGNHESTNFGLAAAQDASNMTANMYRVGDYVYFETSSTSPYQIRRIEELNKTASGNVEAKVMCFFRRRDLPSTLIMLADKHQLASVEQQRSDSPASTQSQKLETPDSTKEMTSKDGIGPKVMSKGGGKGGWLKAPLSEAHEPHGMEEGIVGAGGVADLSSKQRHQMKHRELFLSRQVETMPATHIRGKCCVTLLNETESLLSYLNKEDSFFYCLVFDPTQRTLLADKGEIRVGSRYQADGISSTVLTPAEREADPRRLQDLETLVWTPRHALTDRQIDQFLVVSRSVGTFARALDCSSSVKQPSLHMSAAAASRDITLFHAMDTLHRHNYDVAKAMSSLVPSSGPVLCRDEMEEWSASEANLFEEALDKYGKDFSDIRQDFLPWKTLKNVIEYYYMWKTTDRYVQQKRVKAVEAESKLKQVYIPNYNKAPASGNAPTSASILPLSNSNNSSNGKAISVLNGSSNGNITSDNSGMLMVGLGGKPCESCQNSQSPQWYAWGPANMQCRLCQPCWMYWKKYGGLKVPTRIDDADLERKRGGVGSDEESKGMSGAHRPHRCSIPACGKEFKLKAHLSRHYASAHGVDLRGSGAGGGGGSGSPRPVMKTRSAFYLRTSALARAARRLCAAQLRTRHAARAPHQPINAAPLRHLCASPQLTSKSPAELRILARAVRPRPRPRVTDIATRLGDHPSPRQPGDWDWLALTLPAQRKQPDRVSFPRPPKAADGSLLYERVPNKPEVDRLPLTPPQPQPAMQAQQTILKRTRPPFDEINGSDGIALNAGLPGGPPAKRAHHSQQVHPKHNLEHATPPVVPLAPPLNGRAAHPHFLPHGPPLSRSNARKQVISWMDAPDDVYFRASDQTKRIRRTLTSVELRRAARKPWRRLPTPLHPPHPQRAASRGDDMVVILD
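The Coding Sequence and Protein Sequence fields can be checked against each other by translating the nucleotides. The following is a minimal sketch assuming the standard genetic code (NCBI translation table 1); the function name is hypothetical. Whitespace is stripped first, so a coding sequence that is wrapped over several lines can be pasted in directly.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate_cds(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 1.

    Whitespace and line breaks are removed first. Codons that contain
    characters other than A, C, G, T are translated as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First and last codons of the Coding Sequence in this record.
print(translate_cds("ATGCCGATGCCGGCCGAATACCACGAG"))
print(translate_cds("GTCGTCATCCTCGACTGA"))
```

Translating the full coding sequence and comparing the result with the Protein Sequence field is a quick consistency check for a record like this one.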
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00050810;
- 90% Identity
- iTF_00343115; iTF_00059393; iTF_01364581; iTF_00721195; iTF_00058637; iTF_00059853; iTF_00059120; iTF_00905360; iTF_01365072; iTF_00057874; iTF_00721684; iTF_00058358; iTF_00905785; iTF_00397735; iTF_00397230; iTF_00398510; iTF_00398016; iTF_01129737; iTF_01129282; iTF_00459690; iTF_00460166; iTF_01102730; iTF_01101990; iTF_01103945; iTF_01100527; iTF_01101263; iTF_01101008; iTF_01101749; iTF_01102457; iTF_01103228; iTF_01103487; iTF_00414731; iTF_00414245; iTF_00439884; iTF_00829157; iTF_00266857; iTF_01299115; iTF_00828719; iTF_00266128; iTF_00265618; iTF_00439412; iTF_00829908; iTF_00266393; iTF_00829437; iTF_01299679; iTF_00263024; iTF_00262596; iTF_00798514; iTF_00798950; iTF_00050810; iTF_00051415; iTF_00263386; iTF_00263949; iTF_01508799; iTF_01509252; iTF_00841521; iTF_00841009; iTF_00629490; iTF_00629040; iTF_01497874; iTF_01497352; iTF_00653267; iTF_00652641; iTF_01055926; iTF_01056467; iTF_01379703; iTF_01379152; iTF_01058041; iTF_01057582; iTF_01207294; iTF_01207787; iTF_01057311; iTF_01056753; iTF_01306911; iTF_01306388;
- 80% Identity
- iTF_00343115;