Lko_34015-RA
Basic Information
- Insect
- Laupala kohalensis
- Gene Symbol
- -
- Assembly
- GCA_002313205.1
- Location
- NNCF01129918.1:16288-76148[-]
Transcription Factor Domain
- TF Family
- zf-GAGA
- Domain
- zf-GAGA domain
- PFAM
- PF09237
- TF Group
- Zinc-Coordinating Group
- Description
- Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 18 0.031 27 4.5 0.1 22 44 425 447 407 455 0.81 2 18 0.62 5.4e+02 0.4 0.1 18 43 449 474 447 477 0.83 3 18 0.08 70 3.2 0.0 11 44 469 503 464 507 0.80 4 18 0.074 65 3.3 0.0 23 43 599 619 589 629 0.87 5 18 0.086 75 3.1 0.0 26 46 630 650 620 655 0.87 6 18 0.026 23 4.8 0.2 22 44 736 758 731 762 0.87 7 18 0.017 15 5.4 0.0 21 45 763 787 758 793 0.89 8 18 0.18 1.5e+02 2.1 0.0 8 46 863 899 861 902 0.85 9 18 0.002 1.7 8.4 0.0 22 52 931 961 926 963 0.93 10 18 0.087 76 3.1 0.0 24 45 989 1010 986 1017 0.85 11 18 0.0088 7.7 6.3 0.1 21 45 1014 1038 1010 1045 0.87 12 18 0.027 24 4.7 0.0 21 47 1042 1068 1038 1074 0.85 13 18 0.0074 6.5 6.5 0.4 27 51 1075 1099 1069 1102 0.91 14 18 1.1 9.6e+02 -0.4 0.0 20 45 1096 1121 1094 1124 0.89 15 18 0.17 1.5e+02 2.2 0.1 22 44 1126 1148 1120 1158 0.85 16 18 0.31 2.7e+02 1.3 0.1 14 43 1145 1175 1140 1181 0.81 17 18 0.00046 0.4 10.4 0.0 21 44 1181 1204 1173 1208 0.89 18 18 0.14 1.2e+02 2.5 0.0 21 45 1209 1233 1204 1237 0.87
Sequence Information
- Coding Sequence
- aattttcaGAGAGCTTCCCATAAGgcatttatacatttatacatttataaggTTTTGCGTATTTTTGTATGCAATGGGTGGACAAAAGGGGTTGCAAATAAAGCAGAGTCTGCTTTCACAGCTGAGAGATCTGAACTTGGTACAAGTACCCAAACCTGCAGAGGATTTAAAAATCCGTGCGCAGGTCTGAAgtatgatgTTCTTGTATCCTCTTCCGATTCATCTGAACCTTCCTCACTTTTGCTACAGaAGCTGTTGATGGATCACCAGGGGATGACTATGATTCCTTTTAAAGAAGTGAAGTACAAGGACATGAAACTCATTTTAGATTTCATGTATCTCGGAGAAGTGGATGTTCCCCTTGTTGATGTGGCTTCTCTAATGGACACTGCTGAATTATTGCAAATCTCATCCTTAAGATGTGCCACTCACCATTTTGCAGAGGAATCGGCGACTGCTGATGTTTGTACGAATGAAAAGAGAGCAATTCCTAACAAAGATTTACATAGCGGCAGGAAAGACAGATTCGATTCCATCGGCCGGAACATTGAAGTGATATCAGTGAAGGGTTCTAAAAGGAAAGcaatatatgttttgaaacaactACCAGAGAAAGCAAATGAGGATCAACCAAACGTCAGTGTGTTTGAAACAGACGTTATCGGTTCCGATCATGTGAATAATATACGGGATGAATCTCTATCGATTACCTTTGTTGCTGTTCCTTCAGAACCTGAGGAGAGAGGAGAAGAGGAACAGAACAAAGGAGGTGAAGTGTGCGTTCACTCTGCTGATCAGAGTGTGTCCTACGATGCAAGAGATGGTACTGTTCCGGACGGAGCAGATGGTTTGGAAGTACAATACAAATCAATAAGTGTGATTTCTGTACCTGACTTGGAAGCAGGGAATTGGATTGACACAAGTGTGGTTGTTGAAGATACCAAAGTGAGTCGGGAATCATCCTCGTTAGGGGGAGAGGGAGAGATTTCAAATTGTGAATCACAGGAAACGTACCTCATTCATTACGAGCCAGCAGCAACCGAAGGATTAGGGGACGATGCGAATGTGAGTCATCTGGAGCCGAAGGAAGCCATTGTGCTTTTGGAGACTGCGGATCCCGATGGTCATCAAATGGAAGTTAGCCACGATACAGttaCAGGGCACAGGCAGCCCGAggaaaaagaagtaaagaaacgGTTTCCTTGTATTCATTGCGAGAAATCCTTCCCCACCGCAGACAGACTCAAGTCGCACAAGCGCTATCACAGCGGGACGAAGCCCCATCTCTGCGGCGTCTGCGGAGCGTCTTTCGTCGAGAAGAGTAACCTCAAGAGACACGCGAGACGTCACACGGGAGAGAGACCCTACTCGTGCGACGTTTGCGGTGCGTCTTACAGCGAAGGAGGCGCGCTCAAGAAACACTCGAGAACCCACACGGGCGAGAAACCCTACGCGTGCGGTATCTGCGGAGAATCCTATAGGTTGAAAAGTTCGCTGAAATATCACGTGACGAAACACACGGGCGAGAAGAGTTATCCGTGTTGTATTTGTAAAGAATCGTTCGGTGACATCGACTGTCGGAAGGCGCACATGAAGACCATTCATCCGGACGAGTATCCTTTCGCGTGTAGCGCCTGCGACGCCACCTATCTTCACAAGGCCATGTTGAAGAAACACGTGGCGACGCATTCGGGCGTGAGAGCGGCGTTCTCGTGCGACGCGTGCGACGCCTCCTTCTTTAGAAAGGATCTGCTGAAGAACCACCGCGCGGCGAAGCACTCGACGGCGGGTGCGAAGCCGTTTCTGTGCAGCGTCTGCGGAGCGTCGTTCGCCGAGAGAAGCAACTTGAAGAGGCACGAGCGGAGTCACGTGGGCGACAGGTCGTTCCTCTGCAGCTTCTGCGCGGCGTCTTTCGTGGAGAGCGGGGATCTGAAGAGGCACCTTAGGATTCACACGGGCGAGAAGAAGTTTCCGTGTCGCTTCTGTAGAGTGTCGTTTGGGGATTATAGTAGTCGTACAGCTCACGTGAAAACCGAGCACAGTGAATTTCTATTCGCGTGTAGTGTCTGCGACGCGTCGTTTGTGGAGAAGAGTAGTTTAAAGTCGCATCGGAAGAAACACGACAGGGAGTTTTCCTGCGTGTTGTGCGAAGCGTCTTTCGCGTGCAAGAGCGACCTCAAGTGCCACGTGGCGACTCACACGGGTGAGAAGACGCACGCGTGTGGCGCCTGCGGAGCGCGCTTCGTCCAGAGAGGAAACCTCAGGCGACACCAGAGGATCCACACGGGGGAGAAACCTTTCTCCTGCGACATCTGCGGAGCCGCCTTCGCCGAACGAGGGACTCTGAAGAGACACGTCAAGACTCACGTCGCAGAGACAAAGACGCATTCaAATCAGATCGGGGCTTCAGTGAATCGTAGGGTTAGATCAACGATGGCAACCTCCAGCGAAACTCGTGTGTCCTCGTGGCCATTCATTTCACCCACCACAGAACAGAAGGAAGTGGTGGAATGTTCTAGAGACGACCAGCAATTATCGGAAGGTGCCGCCGGATCTGAAATTAACTTTGTCGATCATCTTTACGTACATCCTCCTACCGACTACGGTCATTCTAGTTCACAAGAGAAATCGTTTCGTTGTAGCTTTTGTGAAGCGTCTTTCACTCTCAGAGGTAATTTGAAGACGCACATGAAAATTCACTCTGGTGAGCGACAACTCGAATGCCGCGTGTGTAAAAAGTCGTTTTCCGATAAAAGTTCCCTACACAAACACGTGCTGACACACACTAACGACAAACCGTTTGTTTGTCCGGTGTGTGAAGCGACGTTCGCTCTTCAGAAGTATCTGAAGGCTCACATGAAAATCCACTCGTATGACAAACCCTATCTGTGTGACAAGTGCCCGTTGTCCTTCGCCCAGAAGAGTATTCTGGAGAGACACGCGCTCACTCACATCGGTTGGAAGCCGTTTCCCTGTAGCTACTGCGGAGCGACCTTCCGCGACAAGAACAACCTCAAGACTCACGTGCGAACCCACACGGGAGAGAAGCCCTTCGCGTGTACTTTCTGTTCCGCTTCTTTCTCTCAAAACTGTTCTCTGAAGAAGCACTTGCGATCCCACACCGGCGAGAAACCCTTCACCTGCGACATATGCGGCAAATCGTTCGTGAACAAGGGTAGTCTGACGTCCCACATGGAACTTCACGCGGGCGTCAAACACCGATGTCGACACTGCGAAGCGTCCTTTAGGCAGATGAACAATCTGATAACGCACATGCGGCTGCAGCACAGCGAGAAGACGTTTTCCTGCGCTCTGTGTGAGGAGTCCTTTCAACAGAAGGAAGACCTGCAGGGTCACATGAGGATCCACACGGGCGAGAAGGCGCCCACCTGCGACATATGTGGCAAATCTTTCAACACGGAGAGAGCCCTGAAGAAGCACGCGCGCAATCACTCGGAGCGGAAACCCTTCTCTTGCGGCACCTGCAACGCGTCTTTTCGATACAACAGTCACCTGACTGTTCACACGAGGACGCACACGGGGGAGAAGCCCTTCGCGTGTACCTTGTGCGAAGCTTCGTTTCGTCAGAGCGGAGACTTAAAGTGTCACATGCGATCTCACACCGGGGAGACTCCCTTCTCGTGTAAATTCTGTGATTCCGCATTCCGATACGCGAGCAGTCTGAAGAAACACGTCCAAACCCACGCATCGTTTGTCGGTGAGAAATCTCTAGAGGAACACGGAAAATTTGTCGCAAGAGAATCTTGTGACATGAATGGAAGTCCCAACGGTAGCTGTTGTGGAGAAGAAAACAATTCGTCTTTAGAAGACAACATGGAGATTATAGACTTTGATTTGGCCAATGTGAAAGAAGAAATGCATTCTGTAGTGAGATGA
- Protein Sequence
- NFQRASHKAFIHLYIYKVLRIFVCNGWTKGVANKAESAFTAERSELGTSTQTCRGFKNPCAGLKYDVLVSSSDSSEPSSLLLQKLLMDHQGMTMIPFKEVKYKDMKLILDFMYLGEVDVPLVDVASLMDTAELLQISSLRCATHHFAEESATADVCTNEKRAIPNKDLHSGRKDRFDSIGRNIEVISVKGSKRKAIYVLKQLPEKANEDQPNVSVFETDVIGSDHVNNIRDESLSITFVAVPSEPEERGEEEQNKGGEVCVHSADQSVSYDARDGTVPDGADGLEVQYKSISVISVPDLEAGNWIDTSVVVEDTKVSRESSSLGGEGEISNCESQETYLIHYEPAATEGLGDDANVSHLEPKEAIVLLETADPDGHQMEVSHDTVTGHRQPEEKEVKKRFPCIHCEKSFPTADRLKSHKRYHSGTKPHLCGVCGASFVEKSNLKRHARRHTGERPYSCDVCGASYSEGGALKKHSRTHTGEKPYACGICGESYRLKSSLKYHVTKHTGEKSYPCCICKESFGDIDCRKAHMKTIHPDEYPFACSACDATYLHKAMLKKHVATHSGVRAAFSCDACDASFFRKDLLKNHRAAKHSTAGAKPFLCSVCGASFAERSNLKRHERSHVGDRSFLCSFCAASFVESGDLKRHLRIHTGEKKFPCRFCRVSFGDYSSRTAHVKTEHSEFLFACSVCDASFVEKSSLKSHRKKHDREFSCVLCEASFACKSDLKCHVATHTGEKTHACGACGARFVQRGNLRRHQRIHTGEKPFSCDICGAAFAERGTLKRHVKTHVAETKTHSNQIGASVNRRVRSTMATSSETRVSSWPFISPTTEQKEVVECSRDDQQLSEGAAGSEINFVDHLYVHPPTDYGHSSSQEKSFRCSFCEASFTLRGNLKTHMKIHSGERQLECRVCKKSFSDKSSLHKHVLTHTNDKPFVCPVCEATFALQKYLKAHMKIHSYDKPYLCDKCPLSFAQKSILERHALTHIGWKPFPCSYCGATFRDKNNLKTHVRTHTGEKPFACTFCSASFSQNCSLKKHLRSHTGEKPFTCDICGKSFVNKGSLTSHMELHAGVKHRCRHCEASFRQMNNLITHMRLQHSEKTFSCALCEESFQQKEDLQGHMRIHTGEKAPTCDICGKSFNTERALKKHARNHSERKPFSCGTCNASFRYNSHLTVHTRTHTGEKPFACTLCEASFRQSGDLKCHMRSHTGETPFSCKFCDSAFRYASSLKKHVQTHASFVGEKSLEEHGKFVARESCDMNGSPNGSCCGEENNSSLEDNMEIIDFDLANVKEEMHSVVR
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00872453;
- 90% Identity
- iTF_00872453;
- 80% Identity
- iTF_00872453;