Nio001123.1
Basic Information
- Insect
- Nymphalis io
- Gene Symbol
- cora
- Assembly
- GCA_905147045.1
- Location
- LR989917.1:821229-831224[+]
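The location string above packs the contig accession, the coordinate range, and the strand into one field. The following is a minimal sketch of how it could be split into parts. The `parse_location` helper and the `Location` type are hypothetical names introduced here for illustration, and the span length assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "CONTIG:START-END[STRAND]".
_LOCATION_RE = re.compile(r"^([^:]+):(\d+)-(\d+)\[([+-])\]$")


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return Location(contig, int(start), int(end), strand)


loc = parse_location("LR989917.1:821229-831224[+]")
# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = loc.end - loc.start + 1
print(loc, span)
```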
Transcription Factor Domain
- TF Family
- zf-C2HC
- Domain
- zf-C2HC domain
- PFAM
- PF01530
- TF Group
- Zinc-Coordinating Group
- Description
- This is a DNA-binding zinc finger domain.
- Hmmscan Out
-
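| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 18 | 0.0035 | 30 | 5.3 | 0.1 | 14 | 29 | 354 | 369 | 353 | 369 | 0.88 |
| 2 | 18 | 0.0035 | 30 | 5.3 | 0.1 | 14 | 29 | 406 | 421 | 405 | 421 | 0.88 |
| 3 | 18 | 0.0035 | 30 | 5.3 | 0.1 | 14 | 29 | 458 | 473 | 457 | 473 | 0.88 |
| 4 | 18 | 0.0035 | 30 | 5.3 | 0.1 | 14 | 29 | 510 | 525 | 509 | 525 | 0.88 |
| 5 | 18 | 0.0035 | 30 | 5.3 | 0.1 | 14 | 29 | 562 | 577 | 561 | 577 | 0.88 |
| 6 | 18 | 0.0035 | 30 | 5.3 | 0.1 | 14 | 29 | 614 | 629 | 613 | 629 | 0.88 |
| 7 | 18 | 0.0035 | 30 | 5.3 | 0.1 | 14 | 29 | 666 | 681 | 665 | 681 | 0.88 |
| 8 | 18 | 0.0035 | 30 | 5.3 | 0.1 | 14 | 29 | 718 | 733 | 717 | 733 | 0.88 |
| 9 | 18 | 0.0035 | 30 | 5.3 | 0.1 | 14 | 29 | 770 | 785 | 769 | 785 | 0.88 |
| 10 | 18 | 0.0035 | 30 | 5.3 | 0.1 | 14 | 29 | 822 | 837 | 821 | 837 | 0.88 |
| 11 | 18 | 0.0035 | 30 | 5.3 | 0.1 | 14 | 29 | 874 | 889 | 873 | 889 | 0.88 |
| 12 | 18 | 0.0035 | 30 | 5.3 | 0.1 | 14 | 29 | 926 | 941 | 925 | 941 | 0.88 |
| 13 | 18 | 0.0035 | 30 | 5.3 | 0.1 | 14 | 29 | 978 | 993 | 977 | 993 | 0.88 |
| 14 | 18 | 0.0035 | 30 | 5.3 | 0.1 | 14 | 29 | 1030 | 1045 | 1029 | 1045 | 0.88 |
| 15 | 18 | 0.0035 | 30 | 5.3 | 0.1 | 14 | 29 | 1082 | 1097 | 1081 | 1097 | 0.88 |
| 16 | 18 | 0.0035 | 30 | 5.3 | 0.1 | 14 | 29 | 1134 | 1149 | 1133 | 1149 | 0.88 |
| 17 | 18 | 0.0035 | 30 | 5.3 | 0.1 | 14 | 29 | 1186 | 1201 | 1185 | 1201 | 0.88 |
| 18 | 18 | 0.0035 | 30 | 5.3 | 0.1 | 14 | 29 | 1238 | 1253 | 1237 | 1253 | 0.88 |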
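The domain hits above share one 13-column layout, so they lend themselves to simple programmatic checks such as the spacing between consecutive hits. The sketch below parses whitespace-separated rows into typed records. `DomainHit` and `parse_hit` are hypothetical names, the field order is taken from the header of this record, and only the first three rows are embedded as sample data.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# One converter per column, in the same order as the DomainHit fields.
_CASTS = (int, int, float, float, float, float,
          int, int, int, int, int, int, float)


def parse_hit(row: str) -> DomainHit:
    """Convert one whitespace-separated hit row into a DomainHit."""
    fields = row.split()
    if len(fields) != len(_CASTS):
        raise ValueError(f"expected {len(_CASTS)} fields, got {len(fields)}")
    return DomainHit(*(cast(value) for cast, value in zip(_CASTS, fields)))


# Sample data: the first three rows of the table in this record.
SAMPLE_ROWS = [
    "1 18 0.0035 30 5.3 0.1 14 29 354 369 353 369 0.88",
    "2 18 0.0035 30 5.3 0.1 14 29 406 421 405 421 0.88",
    "3 18 0.0035 30 5.3 0.1 14 29 458 473 457 473 0.88",
]

hits = [parse_hit(row) for row in SAMPLE_ROWS]
# Distance between the alignment starts of consecutive hits.
spacing = [b.ali_from - a.ali_from for a, b in zip(hits, hits[1:])]
# Envelope length of each hit, counting both end coordinates.
env_lengths = [h.env_to - h.env_from + 1 for h in hits]
print(spacing, env_lengths)
```

For the sample rows, every hit starts 52 residues after the previous one, which matches the periodic repeat visible in the protein sequence.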
Sequence Information
- Coding Sequence
- ATGCCGGAGGGAGTTGCCAAAGATGGCAAGGAGGCGAAGACGAAAGCGAAGGAGTCGCCGAAACGCCGCACCAACCTCGCCAAGATAAAACTCGAGCTGCTAGATGGATCGGCTATGGAACTTGAGGCCGATAGGAAAATTCGCGGTCATGATCTACTTAGCAAGGTCTGCGACAGCCTCAACCTGGTCGAGAAAGACTATTTCGGGTTGTTGTACGAGGATCGAGGGGATCCACGTGTCTGGATCGACCTGGAGAAGCGAGTTTCTAAGATGCTGAAACATGAGCCGTGGGTGGTGAGGTTCGCGGTGAAGTTCTACCCGCCAGAGCCGACGCAGCTGCGGGAGGAGCTGACGCGGTACCAGCTCGTGCTGGCCATCAGGAAAGACCTACTGGAAGGTCGCCTTCCATGCTCTACGGTAACTCACGCATTGCTCGCCAGCTATCTGCTACAGTCCGAGCTGGGCGACTACGAGGAGACGGAGAACGGTGCTGGTCTCTGCAAACAGCTCAAGCTGGCCCCTCCCGCGACATGCACGGCGGAATTCGAGGAGAAGGTCGTAGAACTGCACAAAACCCACAGAGGGCAAACTCCGGCGGAAGCGGAACTGAACTACCTGGAGAACGCGAAGAAGCTCGCCATGTACGGCGTGGACCTGCACCCCGCCAAGGACTCGGAGAACGTCGACATCGCGCTCGGGGTCTGCTCGTCTGGCTTGCTCGTGCACCGCGAGAAACTCCGCATAAACCGTTTCGCGTGGCCCAAGATCCTCAAGATCAGCTACAAGCGCCACAACTTTTACGTGAAGCTGCGTCCGGGTGAGTTCGAGCAGTTCGAGTCCACCGTCGGCTTCAAGCTCGCCAACCACCGCGCCGCCAAGAAGCTCTGGAAGACCTGCGTCGAGCATCACACTTTCTTCAGGCTGATGTCCCCGGAGCCGGCGACGCGCAGCACGCTGTTCCCACGGCTGGGCTCGCGCTTCCACTACAGCGGCCGCACGCACTACGAGTCCCGCGCCGCGCCGCCCAACCGCTCGCAGCCGCACTTCGCGCGCACGCTCTCCTCGCGCAGGCTGTCCTCGCGCAGCGCCGACGGCTGTCCTCGCGCAGCGCCGACGGTGAGTGACGTCACACGCGCACACACACAGACACGCACGCACTACGAGTCCCGCGCCGCGCCGCCCAACCGCTCGCAGCCGCACTTCGCGCGCACGCTCTCCTCGCGCAGGCTGTCCTCGCGCAGCGCCGACGGCTGTCCTCGCGCAGCGCCGACGGTGAGTGACGTCACACGCGCACACACACAGACACGCACGCACTACGAGTCCCGCGCCGCGCCGCCCAACCGCTCGCAGCCGCACTTCGCGCGCACGCTCTCCTCGCGCAGGCTGTCCTCGCGCAGCGCCGACGGCTGTCCTCGCGCAGCGCCGACGGTGAGTGACGTCACACGCGCACACACACAGACACGCACGCACTACGAGTCCCGCGCCGCGCCGCCCAACCGCTCGCAGCCGCACTTCGCGCGCACGCTCTCCTCGCGCAGGCTGTCCTCGCGCAGCGCCGACGGCTGTCCTCGCGCAGCGCCGACGGTGAGTGACGTCACACGCGCACACACACAGACACGCACGCACTACGAGTCCCGCGCCGCGCCGCCCAACCGCTCGCAGCCGCACTTCGCGCGCACGCTCTCCTCGCGCAGGCTGTCCTCGCGCAGCGCCGACGGCTGTCCTCGCGCAGCGCCGACGGTGAGTGACGTCACACGCGCACACACACAGACACGCACGCACTACGAGTCCCGCGCCGCGCCGCCCAACCGCTCGCAGCCGCACTTCGCGCGCACGCTCTCCTCGCGCAGGCTGTCCTCGCGCAGCGCCGACGGCTGTCCTCGCGCAGCGCCGACGGTGAGTGACGTCACACGCGCACACACACAGACACGCACGCACTACGAGTCCCGCGCCGCGCCGCCCAACCGCTCGCAGCCGCACTTCGCGCGCACGCTCTCC
TCGCGCAGGCTGTCCTCGCGCAGCGCCGACGGCTGTCCTCGCGCAGCGCCGACGGTGAGTGACGTCACACGCGCACACACACAGACACGCACGCACTACGAGTCCCGCGCCGCGCCGCCCAACCGCTCGCAGCCGCACTTCGCGCGCACGCTCTCCTCGCGCAGGCTGTCCTCGCGCAGCGCCGACGGCTGTCCTCGCGCAGCGCCGACGGTGAGTGACGTCACACGCGCACACACACAGACACGCACGCACTACGAGTCCCGCGCCGCGCCGCCCAACCGCTCGCAGCCGCACTTCGCGCGCACGCTCTCCTCGCGCAGGCTGTCCTCGCGCAGCGCCGACGGCTGTCCTCGCGCAGCGCCGACGGTGAGTGACGTCACACGCGCACACACACAGACACGCACGCACTACGAGTCCCGCGCCGCGCCGCCCAACCGCTCGCAGCCGCACTTCGCGCGCACGCTCTCCTCGCGCAGGCTGTCCTCGCGCAGCGCCGACGGCTGTCCTCGCGCAGCGCCGACGGTGAGTGACGTCACACGCGCACACACACAGACACGCACGCACTACGAGTCCCGCGCCGCGCCGCCCAACCGCTCGCAGCCGCACTTCGCGCGCACGCTCTCCTCGCGCAGGCTGTCCTCGCGCAGCGCCGACGGCTGTCCTCGCGCAGCGCCGACGGTGAGTGACGTCACACGCGCACACACACAGACACGCACGCACTACGAGTCCCGCGCCGCGCCGCCCAACCGCTCGCAGCCGCACTTCGCGCGCACGCTCTCCTCGCGCAGGCTGTCCTCGCGCAGCGCCGACGGCTGTCCTCGCGCAGCGCCGACGGTGAGTGACGTCACACGCGCACACACACAGACACGCACGCACTACGAGTCCCGCGCCGCGCCGCCCAACCGCTCGCAGCCGCACTTCGCGCGCACGCTCTCCTCGCGCAGGCTGTCCTCGCGCAGCGCCGACGGCTGTCCTCGCGCAGCGCCGACGGTGAGTGACGTCACACGCGCACACACACAGACACGCACGCACTACGAGTCCCGCGCCGCGCCGCCCAACCGCTCGCAGCCGCACTTCGCGCGCACGCTCTCCTCGCGCAGGCTGTCCTCGCGCAGCGCCGACGGCTGTCCTCGCGCAGCGCCGACGGTGAGTGACGTCACACGCGCACACACACAGACACGCACGCACTACGAGTCCCGCGCCGCGCCGCCCAACCGCTCGCAGCCGCACTTCGCGCGCACGCTCTCCTCGCGCAGGCTGTCCTCGCGCAGCGCCGACGGCTGTCCTCGCGCAGCGCCGACGGTGAGTGACGTCACACGCGCACACACACAGACACGCACGCACTACGAGTCCCGCGCCGCGCCGCCCAACCGCTCGCAGCCGCACTTCGCGCGCACGCTCTCCTCGCGCAGGCTGTCCTCGCGCAGCGCCGACGGCTGTCCTCGCGCAGCGCCGACGGTGAGTGACGTCACACGCGCACACACACAGACACGCACGCACTACGAGTCCCGCGCCGCGCCGCCCAACCGCTCGCAGCCGCACTTCGCGCGCACGCTCTCCTCGCGCAGGCTGTCCTCGCGCAGCGCCGACGGCTGTCCTCGCGCAGCGCCGACGGTGAGTGACGTCACACGCGCACACACACAGACACGCACGCACTACGAGTCCCGCGCCGCGCCGCCCAACCGCTCGCAGCCGCACTTCGCGCGCACGCTCTCCTCGCGCAGGCTGTCCTCGCGCAGCGCCGACGGCTGTCCTCGCGCAGCGCCGACGGTGAGTGACGTCACACGCGCACACACACAGACACGCACGCACTACGAGTCCCGCGCCGCGCCGCCCAACCGCTCGCAGCCGCACTTCGCGCGCACGCTCTCCTCGCGCAGGCTGTCCTCGCGCAGCGCCGACGCCGCACTTCGCGCGCACGCTCTCCTCGCGCAGGCTGTCCTCGCGCAGCGCCGACGGTGA
- Protein Sequence
- MPEGVAKDGKEAKTKAKESPKRRTNLAKIKLELLDGSAMELEADRKIRGHDLLSKVCDSLNLVEKDYFGLLYEDRGDPRVWIDLEKRVSKMLKHEPWVVRFAVKFYPPEPTQLREELTRYQLVLAIRKDLLEGRLPCSTVTHALLASYLLQSELGDYEETENGAGLCKQLKLAPPATCTAEFEEKVVELHKTHRGQTPAEAELNYLENAKKLAMYGVDLHPAKDSENVDIALGVCSSGLLVHREKLRINRFAWPKILKISYKRHNFYVKLRPGEFEQFESTVGFKLANHRAAKKLWKTCVEHHTFFRLMSPEPATRSTLFPRLGSRFHYSGRTHYESRAAPPNRSQPHFARTLSSRRLSSRSADGCPRAAPTVSDVTRAHTQTRTHYESRAAPPNRSQPHFARTLSSRRLSSRSADGCPRAAPTVSDVTRAHTQTRTHYESRAAPPNRSQPHFARTLSSRRLSSRSADGCPRAAPTVSDVTRAHTQTRTHYESRAAPPNRSQPHFARTLSSRRLSSRSADGCPRAAPTVSDVTRAHTQTRTHYESRAAPPNRSQPHFARTLSSRRLSSRSADGCPRAAPTVSDVTRAHTQTRTHYESRAAPPNRSQPHFARTLSSRRLSSRSADGCPRAAPTVSDVTRAHTQTRTHYESRAAPPNRSQPHFARTLSSRRLSSRSADGCPRAAPTVSDVTRAHTQTRTHYESRAAPPNRSQPHFARTLSSRRLSSRSADGCPRAAPTVSDVTRAHTQTRTHYESRAAPPNRSQPHFARTLSSRRLSSRSADGCPRAAPTVSDVTRAHTQTRTHYESRAAPPNRSQPHFARTLSSRRLSSRSADGCPRAAPTVSDVTRAHTQTRTHYESRAAPPNRSQPHFARTLSSRRLSSRSADGCPRAAPTVSDVTRAHTQTRTHYESRAAPPNRSQPHFARTLSSRRLSSRSADGCPRAAPTVSDVTRAHTQTRTHYESRAAPPNRSQPHFARTLSSRRLSSRSADGCPRAAPTVSDVTRAHTQTRTHYESRAAPPNRSQPHFARTLSSRRLSSRSADGCPRAAPTVSDVTRAHTQTRTHYESRAAPPNRSQPHFARTLSSRRLSSRSADGCPRAAPTVSDVTRAHTQTRTHYESRAAPPNRSQPHFARTLSSRRLSSRSADGCPRAAPTVSDVTRAHTQTRTHYESRAAPPNRSQPHFARTLSSRRLSSRSADGCPRAAPTVSDVTRAHTQTRTHYESRAAPPNRSQPHFARTLSSRRLSSRSADGCPRAAPTVSDVTRAHTQTRTHYESRAAPPNRSQPHFARTLSSRRLSSRSADAALRAHALLAQAVLAQRRR
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -