Basic Information

Insect
Cydia amplana
Gene Symbol
KAT7
Assembly
GCA_948474715.1
Location
OX419655.1:15582834-15615470[+]

Transcription Factor Domain

TF Family
zf-C2HC
Domain
zf-C2HC domain
PFAM
PF01530
TF Group
Zinc-Coordinating Group
Description
This is a DNA binding zinc finger domain.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 2 1.4e-12 5.7e-09 36.3 1.9 1 29 355 383 355 383 0.97
2 2 0.72 3e+03 -1.2 0.1 5 24 1128 1147 1128 1148 0.88

Sequence Information

Coding Sequence
ATGACAAATCTTGATGTGAGCAACAGTAGCACAGACAGTAGTTCTGGGTCCTCGTCAGACTCAAGTAGCAGCGGCAGCTCTTCCACCAGCTCTGGGTCCGGGTCAAGCAGCTCTGACTCTGAATCTTCTACCTCAGACACACCTGCAGTTGCACAACCTGCCCCTAAATCACCCGCTAAGCCTGATGAAGAACCACCAAAAAAGAAGGATGAATCAAAACCTAGGCAAACAAGTAATAAAAAAGTATCATCATCCAGCGATGATGATGTGCCCAAGCCAGACCCACCAAAACCTGCACCACCCCGTAGACGTTCTTCTACCAAACCTAAATCAGCAGCTAGTGTCTTGGCTAAGGGGAAACCGCCTCGAGCAGCAGCAGGAGCTAAAACACCTAAACCAATACCTAAGACCATGACCAAACAGCAAACAAAAAATGACCAAAAAAAGAAGAGTATATTCTCTCCTGACAACAGCTCAGAATCTGAATCAGAGAGCAAATCATCAAAGAGTCCTAAAGGTTCACCGAAAAATAAAAGAGGTACCAGGAAATCTAGCGAGGACAAGGACACGTCTCCTCCATCACCATTGACAGCTGCAGAAAATGATGATTCAAATTCCAAAGACAATATAAAACGTAGAAGTGGCAGTAGGCCCAGTGGGCCCCCCTCCAAAAAAGTTGTGGACAATAAAAGCGCCTCTTCATGTTCCTCGAGTCAATCTTCAGCAGAAACCGTTTCTTCCGAAAGTGATAGTGAACCAACTGCTAAAAAGGAGCCAATTAAAACTGGAAAGACAAAACCACCTTCGTCAGCTAATAAAGTTGAAAAAGGTAAAAGCGACTCGGGCGAGTCTCAGGCGAACAACATGCCTCGGAAGCTGACGCGCTCGCTGTCAGCTCGCGTGTCGCGCATGGCCACGGCCACAAGGCCCAGAACCACGGACACCGACTCTGATGTTGATGAAAAATCAAACGACAAAAGTAAGGAGATAAAGAAGAAAGGCCGTCCTCCCATTTCTCGGTCTCCAATCGCTGAGCTCGCGCCGCCGCAGCCGAGCGAGCGGCGCTGCCCCGTGCGCGGTTGCGACTCCACCGGACACCTCGGTGGCAAAGCGGGCCGACACTTCACTTGGGACGCGTGCCCGCTGTACCACAACGTGACGGGTGCGTGGTGCGTGGCCGCGGCCGAGGAGCGCGCCGCCGCCGCCGCCGCGCGCCGCCGCGCGCTCGCCAACATGCAGCAGCGGCCGCGCGCCATGCCCACTATAGAGCAACGGGCGTACCAGCTCAAGGTCAAGGTGAGTAGCGTGCTGTACCACCACGTGACGGGCGCGTGGTGCGTGGCCGCGGCCGAGGAGCGCGCCGCCGCCGCCGCCGCGCGCCGCCGCGCGCTCGCCAACATGCAGCAGCGGCCGCGCGCCATGCCCACTATAGAGCAACGGGCGTACCAGCTCAAGGTCAAGGTGAGTAGCGTGCTGTACCACCACGTGACGGGCGCGTGGTGCGTGGCCGCGGCCGAGGAGCGCGCCGCCGCCGCCGCCGCGCGCCGCCGCGCGCTCGCCAACATGCAGCAGCGGCCGCGCGCCATGCCCACTATAGAGCAACGGGCGTACCAGCTCAAGGTCAAGGTGAGTAGCGTGCTGTACCACCACGTGACGGGCGCGTGGTGCGTGGCCGCGGCCGAGGAGCGCGCCGCCGCCGCCGCCGCGCGCCGCCGCGCGCTCGCCAACATGCAGCAGCGGCCGCGCGCCATGCCCACTATAGAGCAACGGGCGTACCAGCTCAAGGTCAAGGTGAGTAGCGTGCTGTACCACCACGTGACGGGCGCGTGGTGCGTGGCCGCGGCCGAGGAGCGCGCCGCCGCCGCCGCCGCGCGCCGCCGCGCGCTCGCCAACATGCAGCAGCGGCCGCGCGCCATGCCCACTATAGAGCAACGGGCGTACCAGCTCAAGGTCAAGGTGAGTAGCGTGCTGTACCACCACGTGACGGGCGCGTGGTGCGTGGCCGCGGCCGAGGAGCGCGCCGCCGCCGCCGCCGCGCGCCGCCGCGCGCTCGCCAACATGCAGCAGCGGCCGCGCGCCATGCCCACTATAGAGCAACGGGCGTACCAGCTCAAGGTCAAGGTGAGTAGCGTGCTGTACCACCACGTGACGGGCGCGTGGTGCGTGGCCGCGGCCGAGGAGCGCGCCGCCGCCGCCGCCGCGCGCCGCCGCGCGCTCGCCAACATGCAGCAGCGGCCGCGCGCCATGCCCACTATAGAGCAACGGGCGTACCAGCTCAAGGTCAAGGTGAGTAGCGTGCTGTACCACCACGTGACGGGCGCGTGGTGCGTGGCCGCGGCCGAGGAGCGCGCCGCCGCCGCCGCCGCGCGCCGCCGCGCGCTCGCCAACATGCAGCAGCGGCCGCGCGCCATGCCCACTATAGAGCAACGGGCGTACCAGCTCAAGGTCAAGGTGAGTAGCGTGCTGTACCACCACGTGACGGGCGCGTGGTGCGTGGCCGCGGCCGAGGAGCGCGCCGCCGCCGCCGCCGCGCGCCGCCGCGCGCTCGCCAACATGCAGCAGCGGCCGCGCGCCATGCCCACTATAGAGCAACGGGCGTACCAGCTCAAGGTCAAGCGGCCGCGCGCCATGCCCACTATAGAGCAACGGGCGTACCAGCTCAAGGTCAAGGTGAGTAGCGTGCTGTACCACCACGTGACGGGCGCGTGGTGCGTGGCCGCGGCCGAGGAGCGCGCCGCCGCCGCCGCCGCGCGCCGCCGCGCGCTCGCCAACATGCAGCAGCGGCCGCGCGCCATGCCCACTATAGAGCAACGGGCGTACCAGCTCAAGGTCAAGGATATGCGCTCCAAATGGAAGGGCAGCCAGGAGCTGCGCGAGAAGCTGGCAGCGTCAGGCGGAGACGACATGAACGAAGAACGAGAGCCAATATTGGAGGGATTCGCGCCGGATTACGACCTGCGGCTGTTCCGGGAGGCTCAAGCGCTTGCTGCTATTAAGATAGAAGAGGAGCTAGGGGACATCCCCACCGATAAGGGAACTAGGTTCGTGGTGATGGGCAAGTACATGATGGAGGTGTGGTACCAGTCGCCGTACCCGAACGACGCGGCGCGCGTGCCCCGCCTGTTCGTGTGCGAGTTCTGCCTCAACCACCACAAGAGCGCCACCGGCGCCGAGCGGCACCGCGCCAAGTGCGTGTGGCGGCACCCGCCCGGGGACGAGGTGTACAGAAAGGACAAACTCAGCGTGTGGCAGGTGGACGGTCGGAAGCACAAGCAATACTGCCAGCAGCTGTGTCTGCTCGCCAAGTTCTTCCTGGACCACAAGACGTTATACTACGACGTGGAACCTTTCCTGTTCTACGTCATGACGTGCGCCGACGAGGAGGGCTGCCACGTCGTTGGGTACTTCAGTAAGTTCTTCCTGGACCACAAGACTTTATACTACGACGTGGAACCGTTCCTGTTCTACGTCATGACGTGCGCCGACGAGGAGGGCTGCCACGTCGTTGGGTACTTCAGTAAGCTGTGTCTGCTCACCAAGTTCTTCCTGGACCACAAGACGTTATACTACGACGTGGAACCGTTCCTGTTCTACGTCATGACGTGCGCCGACGAGGAGGGCTGCCATGTCGTCGGGTACTTCAGTAAGTTCTTCGACCACAAGACGTTATACTACGACGTGGAACCTTTCCTGTTCTACGTCATGACGTGCGCCGACGAGGAGGGCTGCCACGTCGTCGGGTACTTCAGTAAGCTGTGTCTGCTCGCCAAGTTCTTCGACCACAAGACGTTATACTACGACGTGGAACCGTTCCTGTTCTACGTCATGACGTGCGCCGATGAGGAGGGCTGCCACGTTGTCGGGTACTTCAGTAAGGAAAAGAATTCGTTCCTGAACTACAACGTCTCGTGCATCCTGACGCTTCCTCCGTACCAAAGGCAAGGCTATGGAAGACTACTCATCGACTTCAGTTACCTTCTAACCAAAGTCGAAGGCAAAGTCGGTTCTCCAGAAACTCCGCTCTCAGATTTGGGTCTCATCTCGTACCGCTCCTATTGGAAGGAAGCGCTCTTGAAAAGACTATGCTCCGCGCCGGGGCCTACTCTGTGCATCAGGGATCTGAGCAAGGACCTCGCCATTGCATCCTCGGATATAGTGTCCACGTTGCAGGAGAGAGGACTCATGAAGTATTGGAAGGGGAAGCACATTGTACTAAAGAAACAGGACGTCCTGGAGGAGTTGTCTCGGCGGGCGGACCGCGCGCGCTGCGTGGAGCCGAGCTGCCTGCGTTGGACGCCGCCGGGCGCGCCGCCGCCGCGCTGA
Protein Sequence
MTNLDVSNSSTDSSSGSSSDSSSSGSSSTSSGSGSSSSDSESSTSDTPAVAQPAPKSPAKPDEEPPKKKDESKPRQTSNKKVSSSSDDDVPKPDPPKPAPPRRRSSTKPKSAASVLAKGKPPRAAAGAKTPKPIPKTMTKQQTKNDQKKKSIFSPDNSSESESESKSSKSPKGSPKNKRGTRKSSEDKDTSPPSPLTAAENDDSNSKDNIKRRSGSRPSGPPSKKVVDNKSASSCSSSQSSAETVSSESDSEPTAKKEPIKTGKTKPPSSANKVEKGKSDSGESQANNMPRKLTRSLSARVSRMATATRPRTTDTDSDVDEKSNDKSKEIKKKGRPPISRSPIAELAPPQPSERRCPVRGCDSTGHLGGKAGRHFTWDACPLYHNVTGAWCVAAAEERAAAAAARRRALANMQQRPRAMPTIEQRAYQLKVKVSSVLYHHVTGAWCVAAAEERAAAAAARRRALANMQQRPRAMPTIEQRAYQLKVKVSSVLYHHVTGAWCVAAAEERAAAAAARRRALANMQQRPRAMPTIEQRAYQLKVKVSSVLYHHVTGAWCVAAAEERAAAAAARRRALANMQQRPRAMPTIEQRAYQLKVKVSSVLYHHVTGAWCVAAAEERAAAAAARRRALANMQQRPRAMPTIEQRAYQLKVKVSSVLYHHVTGAWCVAAAEERAAAAAARRRALANMQQRPRAMPTIEQRAYQLKVKVSSVLYHHVTGAWCVAAAEERAAAAAARRRALANMQQRPRAMPTIEQRAYQLKVKVSSVLYHHVTGAWCVAAAEERAAAAAARRRALANMQQRPRAMPTIEQRAYQLKVKVSSVLYHHVTGAWCVAAAEERAAAAAARRRALANMQQRPRAMPTIEQRAYQLKVKRPRAMPTIEQRAYQLKVKVSSVLYHHVTGAWCVAAAEERAAAAAARRRALANMQQRPRAMPTIEQRAYQLKVKDMRSKWKGSQELREKLAASGGDDMNEEREPILEGFAPDYDLRLFREAQALAAIKIEEELGDIPTDKGTRFVVMGKYMMEVWYQSPYPNDAARVPRLFVCEFCLNHHKSATGAERHRAKCVWRHPPGDEVYRKDKLSVWQVDGRKHKQYCQQLCLLAKFFLDHKTLYYDVEPFLFYVMTCADEEGCHVVGYFSKFFLDHKTLYYDVEPFLFYVMTCADEEGCHVVGYFSKLCLLTKFFLDHKTLYYDVEPFLFYVMTCADEEGCHVVGYFSKFFDHKTLYYDVEPFLFYVMTCADEEGCHVVGYFSKLCLLAKFFDHKTLYYDVEPFLFYVMTCADEEGCHVVGYFSKEKNSFLNYNVSCILTLPPYQRQGYGRLLIDFSYLLTKVEGKVGSPETPLSDLGLISYRSYWKEALLKRLCSAPGPTLCIRDLSKDLAIASSDIVSTLQERGLMKYWKGKHIVLKKQDVLEELSRRADRARCVEPSCLRWTPPGAPPPR

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-