Basic Information

Insect
Cydia amplana
Gene Symbol
-
Assembly
GCA_948474715.1
Location
OX419679.1:7841898-7868636[+]
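
The location string above packs a sequence accession, a coordinate range, and a strand into one field. A minimal Python sketch for splitting it apart follows. The `Location` type and `parse_location` helper are hypothetical names introduced for illustration, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """One genomic interval (hypothetical helper type, not part of the record)."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings shaped like "ACCESSION:START-END[STRAND]".
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into accession, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["seqid"], int(match["start"]), int(match["end"]), match["strand"]
    )


loc = parse_location("OX419679.1:7841898-7868636[+]")
print(loc.seqid, loc.start, loc.end, loc.strand, loc.span)
```

The span covers the whole gene locus on the scaffold, so it is expected to be much longer than the coding sequence listed further down.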

Transcription Factor Domain

TF Family
ARID
Domain
ARID domain
PFAM
PF01388
TF Group
Helix-turn-helix
Description
This domain is known as ARID, for AT-Rich Interaction Domain [2], and is also known as the BRIGHT domain [1].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  6   0.081     2.9e+02   2.4    0.0   3         23      589       609     587       635     0.76
2  6   0.15      5.4e+02   1.5    0.0   4         23      653       672     651       697     0.77
3  6   0.15      5.4e+02   1.5    0.0   4         23      716       735     714       760     0.77
4  6   0.16      5.7e+02   1.5    0.0   4         22      779       797     777       820     0.77
5  6   0.15      5.4e+02   1.5    0.0   4         23      842       861     840       886     0.77
6  6   1.8e-12   6.4e-09   36.6   0.0   4         89      905       977     903       977     0.89
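
The six domain rows above differ widely in their independent E-values, so it can help to separate the confident hit from the weak ones. The sketch below parses the rows as whitespace-separated fields and keeps those under an i-Evalue cutoff. The `DomainHit` type, the helper names, and the `1e-3` cutoff are assumptions chosen for illustration, not settings taken from this record.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    """Selected columns of one domain row (hypothetical helper type)."""
    index: int
    i_evalue: float
    score: float
    ali_from: int
    ali_to: int


# Rows copied from the Hmmscan Out table; 13 whitespace-separated columns each.
ROWS = """\
1 6 0.081 2.9e+02 2.4 0.0 3 23 589 609 587 635 0.76
2 6 0.15 5.4e+02 1.5 0.0 4 23 653 672 651 697 0.77
3 6 0.15 5.4e+02 1.5 0.0 4 23 716 735 714 760 0.77
4 6 0.16 5.7e+02 1.5 0.0 4 22 779 797 777 820 0.77
5 6 0.15 5.4e+02 1.5 0.0 4 23 842 861 840 886 0.77
6 6 1.8e-12 6.4e-09 36.6 0.0 4 89 905 977 903 977 0.89
"""


def parse_hits(text: str) -> List[DomainHit]:
    """Parse domain rows, skipping any line that does not have 13 fields."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 13:
            continue
        hits.append(
            DomainHit(
                index=int(fields[0]),
                i_evalue=float(fields[3]),
                score=float(fields[4]),
                ali_from=int(fields[8]),
                ali_to=int(fields[9]),
            )
        )
    return hits


def significant(hits: List[DomainHit], max_i_evalue: float = 1e-3) -> List[DomainHit]:
    """Keep hits at or below the cutoff (the default is an assumed value)."""
    return [hit for hit in hits if hit.i_evalue <= max_i_evalue]


hits = parse_hits(ROWS)
confident = significant(hits)
for hit in confident:
    print(hit.index, hit.i_evalue, hit.score, hit.ali_from, hit.ali_to)
```

With this cutoff only the sixth row passes, at protein positions 905 to 977. The other five rows cover only about the first 20 positions of the model and have i-Evalues above 1.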

Sequence Information

Coding Sequence
ATGAAAGGGCAGATTACTTCTTGCTGTGGTGTTATAAATGGACAAAATAGGCTACTTACATATCAGAGTAGATGTCTTCAAAAAATCTTGGGCACTCTCACAAACACTAATATAGATCTGCCCTTGACTAGTACACAAGAAGCACTAAATCAGTCCGGTCCGGAAGGATTGATATTTCTTCAAGATGATTTGCCGCTGGATATCCAAAATGAAGAACTGATATCATCACCAGCACCACCTCCAACCCAGCCTGTCTCTGTTGCCATGACCCCTGGGCATGAGAGCCCTATCCTTCCAGTTCCGAAGCCAGCAGATCCAGAGCCGTCGGTGAAGTCTAGTGCTCAACAGCCAGATCCAGCGCCCACAAATCCTGACCATCTGACTGAGACGCGCCGATCTCGACCGCGTCGTGGAGTTGTTTCGGCCCAAAGTAGGACGTGCGCCACAAAAAAAGATGACTGTCTTCCAACGGCCCCTGCTGACGAGGTGCTGGCCATAAACGACAAGGTGGTGCTGAAGGCCGAGGACCTGCTGTCGTGGGTGTGCGGCGGCGAGGAGTGGCGCTGGGGGCTCCGCGCCGTGTGGCGCGGCGACTGCGCCCCGCCCGCGCAGCCGCGCCGCTCGCAGCCGCTCCACCACACCCGGCTCGACTTTAGCGACGTCGACAAGGAGAAAAACGCCATCAACGAGGACTCGGACAGCCCCGGCGTGGTAGTTTTCTCATACCCTCGCTACTGCCGCTACCGGGCGCTCCTCTCCCGGCTCGAGGGCCTCCAGGGGGACTGGCTCAGGGACAGCCTGGTGTGCGCGCTCGGGGGCTACGCGGCGCCCACCAAGAACACCAGGATACTCTACTGCAAGGACACATTCGAGTACCCAGAGCTGGAAGGCCACGAGTTCGTCTGCAACCAGCTCGCGCCGAAGCTGAAGGGGCGACCACGGGGTCGGCGCCGGAAACGAGCATCCAACCACTCTCCGTCCCTCTCCGACAGATCTAGTGACAGCGACTCGCAGAAACGCGTCGAGCAACCTGCTGAGGTGAGTCACAGACACCTCGTGAGGCGCTCACGAAGAGGAGTTAGGCTACAACCAGCTCGCGCCGAAGCTGAAGGGGCGACCGCGGGGTCGGCGTCGGAAACGAGCCTCCAACCACTCTCCCTCGCGCCGAAGCTGAAGGGGCGACCGCGGGGTCGGCGTCGGAAACGAGCCTCCAACCACTCTCCGTCCCTCTCCGACAGATCTAGTGACAGCGACTCGCAGAAACGCGTCGAGCAACCTGCTGAGCTCGCGCCGAAGCTGAAGGGGCGACCGCGGGGTCGGCGTCGGAAACGAGCCTCCAACCACTCTCCGTCCCTCTCCGACAGATCTAGTGACAGCGACTCGCAGAAACGCGTCGAGCAACCTGCTGAGCTCGCGCCGAAGCTGAAGGGGCGACCGCGGGGTCGGCGTCGGAAACGAGCCTCCAACCACTCTCCGTCCCTCTCCGACAGATCTAGTGACAGCGACTCGCAGAAACGCGTCGAGCAACCTGCTGAGGAGTTAGGCTACAACCAGCTCGCGCCGAAACTGAAGGGGCGACCGCGGGGTCGGCGTCGGAAACGAGCCTCCAACCACTCCCCGTCCCTCTCCGACAGATCTAGTGACAGCGACTCGCAGAAACGCGTCGAGCAACCTGCTGAGCTTCAGCCCCCGCGGCGGCTGTCGCTCCGCAACGGCGCGCCCAAATACAGCGAGGAAGATGACAAGGAAACCAGCGCCGAGGACAGAGCCTTCATCGGCCATCTCAAGCAGTTCTACAAGCAGAGGGGGGAAGCCTTCAAGCCTGCACACTCGCTCAAAGAGCGTGAGTATTTAAATTACTGGTTAAAAGTGGCCGCCATTTTGAATGGCTTACTTACTAAGTTGCGAACGTGTATAAAGTATGGTGAGGATGAGGACAGAGACTATGGAGATGGAACCTTCATCGGACACCTCAAGCAGTTCTACAAGCAGAGGGGGGA
AGCCTTCAAGCCTGCACACTCGCTCAAAGAGCGTGAGTATTTAAATTACTGGTTAAAAGTGGCCGCCATTTTGAATGGCTTACTTACTAAGTTGCGAACGTGTATAAAGTATGGTGAGGATGAGGACAGAGACTATGGAGATGGAACCTTCATCGGACACCTCAAGCAGTTCTACAAGCAGAGGGGGGAAGCCTTCAAGCCTGCACACTCGCTCAAAGAGCGTGAGTATTTAAATTACTGGTTAAAAGTGGCCGCCATTTTGAATGGCTTACTTACTAAGTTGCGAACGTGTATAAAGTATGGTGAGGATGAGGACAGAGACTATGGAGATGGAACCTTCATCGGACACCTCAAGCAGTTCTACAAACAGAGGGGGGAAGCCTTCAAGCCTGCACACTCGCTCAAAGAGCGTGAGTATTTAAATTACTGGTTAAAAGTGGCCGCCATTTTGAATGGCTTACGTACTAAGTTGCGAACGTGTATAAAGTATGGTGAGGATGAGGACAGAGACTATGGAGATGGAACCTTCATCGGACACCTCAAGCAGTTCTACAAGCAGAGGGGGGAAGCCTTCAAGCCTGCACACTCGCTCAAAGAGCGTGAGTATTTAAATTACTGGTTAAAAGTGGCCGCCATTTTGAATGGCTTACTTACTAAGTTGCGAACGTGTATAAAGTATGGTGAGGATGAGGACAGAGACTATGGAGATGGAGCCTTCATCGGCCATCTCAAGCAGTTCTACAAGCAGAGGGGGGAGGCGTTCAAGCCTGCACACTCGCTCAAAGAGCTGTCGCTGAGAGCGCTGTACCACTCCGTGACAAACGCTGGCGGCTACGAATCGGCCTGCAGGCACAAATTGTGGCGACAGATCCACCCCCCTCAGCCCAATACTGCGCGACGACACTACGAGAGATTCCTGCTGCCCCTAGAAAAGCACGAAGTGTCCACCGGACTCCGAACGCTACCCAAACTGAACGGCGCCGTCGAACCACCCACAGTCACCATAGAGCTGGCTGACAGTCGTGAACCCAGTCCCGTGCCCAAAGAAGAAGCGAAGGACTTCAGCGTCACTTCCAAACCTGCTGAGGAACTGAACAGGGAGTTCCTAGACTCCTTACCACCGAAGGAAGAAAATAAAGTAAGATTATTGACATAA
Protein Sequence
MKGQITSCCGVINGQNRLLTYQSRCLQKILGTLTNTNIDLPLTSTQEALNQSGPEGLIFLQDDLPLDIQNEELISSPAPPPTQPVSVAMTPGHESPILPVPKPADPEPSVKSSAQQPDPAPTNPDHLTETRRSRPRRGVVSAQSRTCATKKDDCLPTAPADEVLAINDKVVLKAEDLLSWVCGGEEWRWGLRAVWRGDCAPPAQPRRSQPLHHTRLDFSDVDKEKNAINEDSDSPGVVVFSYPRYCRYRALLSRLEGLQGDWLRDSLVCALGGYAAPTKNTRILYCKDTFEYPELEGHEFVCNQLAPKLKGRPRGRRRKRASNHSPSLSDRSSDSDSQKRVEQPAEVSHRHLVRRSRRGVRLQPARAEAEGATAGSASETSLQPLSLAPKLKGRPRGRRRKRASNHSPSLSDRSSDSDSQKRVEQPAELAPKLKGRPRGRRRKRASNHSPSLSDRSSDSDSQKRVEQPAELAPKLKGRPRGRRRKRASNHSPSLSDRSSDSDSQKRVEQPAEELGYNQLAPKLKGRPRGRRRKRASNHSPSLSDRSSDSDSQKRVEQPAELQPPRRLSLRNGAPKYSEEDDKETSAEDRAFIGHLKQFYKQRGEAFKPAHSLKEREYLNYWLKVAAILNGLLTKLRTCIKYGEDEDRDYGDGTFIGHLKQFYKQRGEAFKPAHSLKEREYLNYWLKVAAILNGLLTKLRTCIKYGEDEDRDYGDGTFIGHLKQFYKQRGEAFKPAHSLKEREYLNYWLKVAAILNGLLTKLRTCIKYGEDEDRDYGDGTFIGHLKQFYKQRGEAFKPAHSLKEREYLNYWLKVAAILNGLRTKLRTCIKYGEDEDRDYGDGTFIGHLKQFYKQRGEAFKPAHSLKEREYLNYWLKVAAILNGLLTKLRTCIKYGEDEDRDYGDGAFIGHLKQFYKQRGEAFKPAHSLKELSLRALYHSVTNAGGYESACRHKLWRQIHPPQPNTARRHYERFLLPLEKHEVSTGLRTLPKLNGAVEPPTVTIELADSREPSPVPKEEAKDFSVTSKPAEELNREFLDSLPPKEENKVRLLT
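
The protein sequence above should correspond to a codon-by-codon translation of the coding sequence. The sketch below shows that relationship on the first and last few codons of the record. It assumes the standard genetic code (NCBI translation table 1), and the `translate` helper is a hypothetical name. Short fragments are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper().replace("U", "T")
    residues = []
    # Step through complete codons only; trailing bases are ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        residues.append(aa)
    return "".join(residues)


# First nine codons and last seven codons of the coding sequence in this record.
start_fragment = translate("ATGAAAGGGCAGATTACTTCTTGCTGT")
end_fragment = translate("AAAGTAAGATTATTGACATAA")
print(start_fragment, end_fragment)
```

The two fragments translate to the first nine and last six residues of the listed protein, with the final `TAA` codon read as a stop.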

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-