Aali004040.1
Basic Information
- Insect
- Anopheles albimanus
- Gene Symbol
- KIF4A
- Assembly
- GCA_013758885.1
- Location
- NC:53734243-53738531[-]
Transcription Factor Domain
- TF Family
- CSRNP_N
- Domain
- CSRNP_N domain
- PFAM
- PF16019
- TF Group
- Unclassified Structure
- Description
- This presumed domain is found at the N-terminus of cysteine/serine-rich nuclear proteins. These proteins act as transcriptional activators [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 6 0.31 1.7e+03 -2.0 0.3 62 103 450 492 438 504 0.55 2 6 2 1.1e+04 -5.9 6.2 80 135 600 655 558 672 0.42 3 6 2 1.1e+04 -5.4 8.2 55 102 634 684 587 724 0.46 4 6 0.57 3.2e+03 -2.9 4.1 63 103 729 770 701 856 0.57 5 6 0.3 1.7e+03 -2.0 0.8 88 109 983 1004 928 1042 0.51 6 6 2.3e-09 1.3e-05 24.6 16.3 70 188 1058 1179 1015 1187 0.76
Sequence Information
- Coding Sequence
- ATGTCGAACAACCCGGAGTGTGTGAAGGTGGCGGTGCGAATCCGGCCGATGGCAACGTCGGAAGCGGAACGGGGATGCCAGAGTGTGGTCGAGATATcgccaccgaacgaaccgcaGGTGGTCATCTGCGGAGGGCGCACGAAATCGGACATCTTCACGTACAACTACGCGTTCGCGCCGGACGCGTCGCAAGCCTTACTGTacgaccgatcggtggccccggtgctggagaagctgtTCGAGGGTTACAACGTCACGATCCTTGCCTACGGCCAAACCGGATCCGGTAAAACCTACACCATGGGGACGGACTTCAGTGGCGATATGGTGGAGAGTGCGGGCGTCATTCCGCGCGCCATCGTCGACATCTTTCGGAAGGTGGAAGAAACGGACGGTAGCATAAGCACGAGCTGTTCCTTCGTCGAGCTGTACCAGGAGAACGTGTACGATCTGCTGTCCGAGAAGGGAACCGCGAGCGATCGACAAACGGTCGACATTCGTGAGCTGGCCAGTGGGATCGTAGTCCTGCAGGGCCTCACTGAGATCCCCGTTCGGAGTGTCCCGGAAACGTTCGATTGTCTGGTGCGCGGTTCGTCCGGCCGGATGGTACGTGCGACGGCCATGAACGCCGTCTCGAGCCGTAGCCACTCGATCTTCACCATCACGCTGCAGCAACCATCGGCCGAGGATCCGAAGTCACTGTTGACCTCCAAGTTCCATCTGGTCGATCTGGCCGGTTCGGagcgatcgaagaaaacgaaaacgacggGCGATGGGTTCCGGGAAGGTGTCAAGATCAACCAGGGACTGCTGGCACTGGGCAATGTCATCTCCGCGCTCGGGACGGCCGTTACGACGGGCTCGAACAATCACGTGCCCTACCGGGACTCGAAACTGACCCGCCTGCTGCAGGACTCGCTCGGCGGCAACTCGTACACGCTGATGATCGCTTGCGTTTCACCGGCCGACTACAATCTGAACGAAACGATCAGTACGCTCCGATACGCGGATCGGGTGCGGAAGATCAAGAACAAACCGATCGTCAATCAGGATCCGCACCAGGCGAAGATCAAGCAGCTGGAAGCGATCATTCAGGATTTGCGCGTCGAGATACTGACGCTCAAAGGTGGGGACAGCTCAATGGCCACATCGGCCGAGTTTAGGCTACCGAAGGTACCGTCTACTTCCGCCCGACCATCCGGGATTGCCGGACCGCTTTCGAAACGTCCAACGCTCGCGACATCCGCGGCGAATCAGAACAGTTCCTCCTCTACCATGGCGAGCGATGGAAGTGCACCGGCGGGAGCGCCGAATGGAGCTCGCAAATCGGCGTTACAGGAGCTGCTCGATAagaaccagctgctgcaggcacAGCTACAGGCGATGGCTCAGGACCTGGCCACCAACGAGATGCGTGCGTTGGCCGCTGAGAAAACGCTTGAAATGCTGGATGAGAAGGTCGACGATGAGACGGACATTCGGGAGCACCTGGGCAGCATCCTGAGCACGTATCACGAGGAGATGGTGGCGCTCGGTATGGCGGAACCACCTGCGGTAACTGTCGGTGCGACCAGTGATTCAACCGGCCAGCGGCCATCGATCGCCTCGTCCGTCCTTAGCTCAACGTCATCGGGCGAACCCATCGATCTGCCCCTTACCGAGGATGGCCAGAAGCGCTCGGACAATCATACGCAGCAACAGATCCGTCTGCACAAGGAGCTACGCCAGCTGAACatggagctgaagctgaaggaggaacTGCACCGGCGCTGCATGGGTAACAGTGCCGCCGTCCAGAGCAACGGGCccgcccagcagcagatggCGGAGGTACTGGCCGAGTATCAACAAACCATCGCCagcctcgagcagcagctggccgagctgAACGGGCAGCTCGAGAACACGAAGGCGAGCGACAAAAAGTCGAAGCTCTCCGAGGAGCGTCGCAagaaggtgcagcagctcgaggcaCAGCTGAGTGAGCTGCGTACTAAGAGTATGCGCCAGGCGAAGCTGCTCAAgctgaaggaaaaggatgCCCAGCGGATCGAAACGCTCAGCACGGAGATCCAATCGATGAAGGCGACACGCGTGAAGCTGCTGAAAACGATGCGCGCCGAGAGCGAAAACTTCCGCAAATGGCGCCTGACGCACGAGAAGGAGATCTGCCAGCTGAAGGCGAAGGATCGCAAGCGGCAGAATGAGCTGCAAACGATGGAATCGATGTACgccaagcagaagaagatcaTGCAGCGCAAGATGGAGGAAACGATCGCGGTGAACAAGCGGCTGAAGGCGGCGCTCGATCGGCGTCAGCAGCGTAACGAGGGTAAGGGTGGGACCGGGCCGACCGGGGATAAGTCGATGCTGCGCGGTACCGAAGCGATCCGCTGGATcgagcaggagctggagctgctgtACAGTATGGTCGAAGCATCCGTAACGCTGGACGTGCTGATGGAGCAGCGGACGCAGCAGGCGGCCAAGCTGGCTGGTTTGAAGGGCGCGAAAAGCAAGGATCCGGCCACGCTCGAGGAGATCCGCCAGTGCGAGAACGAGATGGCACTACGGAACGCGCAGATTAGTGATCTGCAGCAGCGCGGTCACGATATCGATGGGCAGCTCGCGCTGTTCAGCAAGACACTCGATACGTTGCCCGAGAAACGGGAAGCCTACCGGCGCCTACTGGCGGCGAGCGTCGCGGATCGCAAGCAGCTCAGCGGACTACGCTTCCAGCTGGTCGAGTGTCAGGCGGCGAACGAGTGTCTCGAGGAGTCGCTGGCCCAGCTAcgcaccgaccagcagcaaaccgagcaACAGTATGCGGTGCAGGTGGCCGAACTCGAAAAGGCTTACGAGGACAAGCTGgccctgctgttgctccatCAGCAACAACGCACCCCAGCTAGGAAGCCCCCAGCGGCCGGGCCgtgcgaagcagcaacagagcaggcggtacaacagcaaaccgagaCTGAAAGCAGCTCCGAGCAGCACACTGCCGACAACAATAATGTGCTGGTCGAGGCAATCGAACGTATCGAGATGCTGCGCACCGAACTGGAGCTCCACAAGGAGGGCAATCgtcggctgcagcagcaactgatcGCTACCAAGGAGAAGCTGGCGGCAAGTAACGTGCCCCGGGTGTCCGGCAATCGGAAACCAAAATCACGATCGCTGAGCTATGTGACcaagcagcaagaagaggaagaagaggaagaggaggaggaatacgacgatgacgatgacgacgacgacgatgagttgGATGAGTTGGAGCGCGAGCGTGATCCGGACTTCCGTGGAACGCCCATCAGCAAACGGCGAAAGTTGAGTACCGTATCGTTATTGGGTCAGTCGCAGCGTTCCAACGAAAGCGTTACGAATGAaacgaccaacagcagcggcgcGACGGGACCACACTGCACCTGCTCCGGTAACTGCTCAACAATGCGCTGCGGCTGCCGGAAGGCTGGCCTTTCCTGTCAGACCAGCTCTTGTAAGTGTCCGGCGACCTGCGCCAACAAGGCGCTGGATTCGATCTCCGAAGACGCACCGTCGGACAAGGAGGACGAAAAGGAGAACACGTTTAGCGCctctgatggtgatgatagtCGGAAGCTACTGGAGAAGCTCTGCACTCCTCAGCAGAGCAAAAGGACCGCAGAAATGGACATTCTTACGTACGTGTCGACGCACAGAAAGCTGAAACCGCTGCTCAGTGACTAA
- Protein Sequence
- MSNNPECVKVAVRIRPMATSEAERGCQSVVEISPPNEPQVVICGGRTKSDIFTYNYAFAPDASQALLYDRSVAPVLEKLFEGYNVTILAYGQTGSGKTYTMGTDFSGDMVESAGVIPRAIVDIFRKVEETDGSISTSCSFVELYQENVYDLLSEKGTASDRQTVDIRELASGIVVLQGLTEIPVRSVPETFDCLVRGSSGRMVRATAMNAVSSRSHSIFTITLQQPSAEDPKSLLTSKFHLVDLAGSERSKKTKTTGDGFREGVKINQGLLALGNVISALGTAVTTGSNNHVPYRDSKLTRLLQDSLGGNSYTLMIACVSPADYNLNETISTLRYADRVRKIKNKPIVNQDPHQAKIKQLEAIIQDLRVEILTLKGGDSSMATSAEFRLPKVPSTSARPSGIAGPLSKRPTLATSAANQNSSSSTMASDGSAPAGAPNGARKSALQELLDKNQLLQAQLQAMAQDLATNEMRALAAEKTLEMLDEKVDDETDIREHLGSILSTYHEEMVALGMAEPPAVTVGATSDSTGQRPSIASSVLSSTSSGEPIDLPLTEDGQKRSDNHTQQQIRLHKELRQLNMELKLKEELHRRCMGNSAAVQSNGPAQQQMAEVLAEYQQTIASLEQQLAELNGQLENTKASDKKSKLSEERRKKVQQLEAQLSELRTKSMRQAKLLKLKEKDAQRIETLSTEIQSMKATRVKLLKTMRAESENFRKWRLTHEKEICQLKAKDRKRQNELQTMESMYAKQKKIMQRKMEETIAVNKRLKAALDRRQQRNEGKGGTGPTGDKSMLRGTEAIRWIEQELELLYSMVEASVTLDVLMEQRTQQAAKLAGLKGAKSKDPATLEEIRQCENEMALRNAQISDLQQRGHDIDGQLALFSKTLDTLPEKREAYRRLLAASVADRKQLSGLRFQLVECQAANECLEESLAQLRTDQQQTEQQYAVQVAELEKAYEDKLALLLLHQQQRTPARKPPAAGPCEAATEQAVQQQTETESSSEQHTADNNNVLVEAIERIEMLRTELELHKEGNRRLQQQLIATKEKLAASNVPRVSGNRKPKSRSLSYVTKQQEEEEEEEEEEYDDDDDDDDDELDELERERDPDFRGTPISKRRKLSTVSLLGQSQRSNESVTNETTNSSGATGPHCTCSGNCSTMRCGCRKAGLSCQTSSCKCPATCANKALDSISEDAPSDKEDEKENTFSASDGDDSRKLLEKLCTPQQSKRTAEMDILTYVSTHRKLKPLLSD
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -