Pspi013586.1
Basic Information
- Insect
- Philonthus spinipes
- Gene Symbol
- -
- Assembly
- GCA_963082785.1
- Location
- OY720304.1:32229577-32259396[+]
Transcription Factor Domain
- TF Family
- zf-GAGA
- Domain
- zf-GAGA domain
- PFAM
- PF09237
- TF Group
- Zinc-Coordinating Group
- Description
- Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 24 0.0054 1.2e+02 4.2 0.4 26 49 102 125 93 128 0.90 2 24 0.0023 54 5.3 0.8 20 49 186 215 170 218 0.88 3 24 0.0029 66 5.1 0.1 5 44 257 299 253 305 0.84 4 24 0.00034 7.7 8.0 0.3 26 49 309 332 302 335 0.91 5 24 0.015 3.4e+02 2.8 0.0 21 49 381 409 375 412 0.89 6 24 1.9 4.4e+04 -4.0 0.2 23 45 440 462 437 465 0.76 7 24 0.0069 1.6e+02 3.8 0.0 26 51 472 497 467 499 0.87 8 24 6.6e-06 0.15 13.5 0.0 26 49 531 554 523 558 0.91 9 24 0.00061 14 7.2 0.0 19 48 628 657 625 661 0.90 10 24 7.1e-05 1.6 10.2 0.7 16 53 711 747 707 748 0.86 11 24 3.6e-06 0.083 14.3 0.0 25 49 781 805 757 809 0.82 12 24 0.18 4.1e+03 -0.7 0.0 26 48 853 875 850 878 0.88 13 24 0.0035 80 4.8 0.1 19 49 876 906 873 909 0.89 14 24 0.00012 2.7 9.5 0.2 16 49 1088 1121 1079 1125 0.88 15 24 0.00072 16 7.0 0.1 26 48 1171 1193 1149 1198 0.86 16 24 0.016 3.8e+02 2.6 0.0 23 45 1225 1247 1218 1254 0.86 17 24 0.0095 2.2e+02 3.4 0.0 21 48 1337 1364 1325 1368 0.84 18 24 0.00031 7 8.2 0.1 23 49 1420 1446 1404 1449 0.82 19 24 8.5e-05 1.9 10.0 0.2 16 50 1503 1534 1478 1537 0.84 20 24 0.065 1.5e+03 0.7 0.0 23 48 1627 1652 1624 1655 0.91 21 24 0.00054 12 7.4 0.0 17 49 1702 1734 1690 1737 0.77 22 24 0.26 5.9e+03 -1.2 0.1 25 48 1799 1819 1791 1821 0.78 23 24 0.11 2.4e+03 0.0 0.2 21 49 1849 1876 1844 1881 0.79 24 24 0.11 2.4e+03 0.0 0.1 27 49 1887 1906 1879 1910 0.77
Sequence Information
- Coding Sequence
- ATGGCGGAACGCAACTACATTCCGGTGTGTCTTGACATGTCAACCGATATTCTTTGTGAACCCTGGTCTGATTTAAGTATCACCAATGAAGAATTAGTTTTAAAGTCTGAAGACCAAAGCGAAGACATATCCAGCGCTGAAGAGGGAGGAGAAACCCCCAGAAAAAACGGCCGAGACAATCTGATTAAGACCAAAAATGGTAAAGCCCTTTACTCCTGCGACGAGTGCGATAATATCTCTAAGGACGCTAAAAAATTCGTCATTCACTACCTCTCGCACAACAAAGACAAATCTCGCATATACACCTGCGAGCAGTGCCTCCAAAGATTCACGCAGCCCGGCAACTTGAAACGCCACATAGACGTGGTCCACCTGCGGATCAGGAAGTTTTGCTGTAAAATATGCGAGACCACGTTTTCGACCAAGTCCGCCAGGGACGACCACATGACGGCCCATTACGATGGGCGATCGAATGTGTGCGCCACTTGCGATGAGAGTTTCGAAAGTAAACGCGCTTTAAGAAATCACAGGAAGTGTCACTTGGGGAGGAGCGATCATAGTAATGAGAGGAAGACGTGTGATACTTGTAATCAATCGTTTAAGTTGAAACAGAGCTTGAAGCGGCATATCGAGAGGAAGCATTTGCAGcttaaatattattgtgaagTTTGCAAGAAGTTGTTTACTAATAAGTCGTCTGAAGATGCACATCCTAATTCCACCAATTGGAAAGAATCAACCGATGACGCGCAAACAAAAACAGACGAAGAAGCGAAGCACCATCAATCATTAATCAAACAGCGCTGTAGAATTAAGAATGATGATGGCACCACGTGCTTCAAATGTGACATTTGCGATAAGATAATCAAAAAAACGAGCAATTTCCTCGACCATTACACGATGCACAGCAACAATTTAACGTACACTTGCGAAATTTGCTGCAAATCCTTAAAACTCCCCTCCAACTTAAAACGCCACATGGAAACGATACACATGCAAATCAGAAAGTTCGATTGTAAAATTTGCAACAAGAGCTACGCCACTAGATCCGGGCTATACAAGCACACGTACAACCACACCAGCAGCCGACCTTACGAGTATTCCAAAAAGGCGAGTTACCCTGAACTCGCTAAAGAATTGGCAGAGAAAAAATATACCTGCGATGTTTGCGATAAGAGCTTCGCGAAGCCGGACAGGTTAAAACGGCACGTCGAGTCGGTGCATATGCAGATTAAGAAGTTCGTGTGCGATCTTTGTGATAAACGATTTAATTCGAAAGTTACGCGGGATGAGCATATGAATAAGCACAGTAATAGGCGGCCTTTTAAATGCGAGGTCTGTAGCAAACGTTACAGCTATGGGTCTGGATTAAGGGCGCACTTAAAGAGGCATATTTTAGATGGGAAACAGTACACTTGTGATATTTGTGATAAGCGGTTTATACAAAAGAGTCACATGACTCGGCATATGAAGTTGGTGCATGCGAAAAGTCCGGAAAGTGATCTTGGGATAAGGCACACGGAGAATGATTTTGTTAACTCTGAGAGTTGCACAAATAGGAAAGATGATGAAATTGGCAATCTTACCTATACTTGCGATATCTGCTACAAGACCTTCAATAaaccaacaaatttaaaacgacACGTGGAGCGTATGCACATGCAAGCCAAGAAGTTTAACTGCAACCTCTGCGACAAAGTCTATTCCTCTTATTCCGGCCTATACAGGCATAGGTTCATCCACTCTAATAACAAATCGTATAAGGCCATCAAAAACCCGATTAAAAAAGAAGTCGATGAGGGAACAAAGCACAGTTGcgatatttgcaataaaaacttTGAGTGTCCAGCGAAGGTTAAGCGACACCTAGATACTGTCCACAAGCAAGCGGGGAAGCCGCACACTTGcgatatttgcaataaaagcTTCGCGATGGCGTACGGGCTGAGGCAACACGTCGAGGGGGTGCATATGCAGATTAAGAATTTCGTGTGCAATATTTGCGATAAGAGATTCATTGCGAAACGCGTTCTCGACGAGCATATGAATGCTCATACTAACAGTCGGCCGTTTAAATGCGAAACCTGCGACAAgcaatttaattacttatctAGTTTACGGATGCACTCGAGGATACATTCGGCCAATGGGAAGCCGTTTTCTTGTGAGCTTTGCTATAAACAGTTCTCGCAAAAGATTCACCTCAAGCGGCATATGTTGAGGCACGCGAAACGTTCCGGAATTGACTGTACAGAGAAGAACGAGGAAAAACCCTTTAGTAGTACTGAGATTATGGAGAGTGACAGTAATAGCAAGGACGATGCATATGGCAATCCCACCTATACTTGCGATATCTGCTACAAGACCTTCAATAAGCCAACAAATTTAAAACGCCACGTGGAGCGTATGCACATGCCAGCCAAGAAGTTTAACTGCAAACTCTGCGACAAAGTATTTTCCTCTCATTCAGGTCTATACAGGCATAGATTCATCCACTCTAATAACAAATCGTATAGCGCCATGAATAAAAAAGAAGTTGATGAAGGAACGAAGCACAGTTGcgatatttgcaataaaaacttCGAGGGTCCAGCGAAGCTTAAACGGCACCTTGATACTGTCCACAAGCAAGCGGGGAAACCGCACACTTGCGAGATTTGCAATAAAAGCTTCGCGATACCGTACGGGCTGAGGCAACACGTTAAGGTGGTGCATATGCAGATCAAGAATTTCGTGTGCAATATTTGCGATAAGAGATTCATTGCGAAACGCCTTCTTAACGAGCACATGAATGCTCATACTAACAGTCGGTCCTTTAAATGCGAAACCTGTGATAAACAATTCCATTACTTATCTAGTTTGCAAATGCACTCTAGGATAAATTCGGCAAATGGGAAGCAGTTATCTTGTGAGCTTTGTTGTAAGCAGTTCTCGCAAAAGATTCGCAACAAGTGGCACATGTCGAAGCACGAGAAACATTCTGGGATTGATTGTACGAAGAAGCACGAGGAGAAACCTTTTAGTAGTACCGAGATTGACTGTAATAGCAAGGACGATGTATTTGGGTCATCTGAAGGCGCACATCCTGTTACAATCAATTTGGAAGAAACAACAACCGACGCATATAAAAAACCGATTCAGctcgaaaaattattaattaaacagcgTTGTAAAATCAGAAACGACGACGGCACCGTTGCGTTCAAATGCGATATATGcgataaaatagttaaaaaaggTAGTAACTTCCTCGACCATTACAGGACGCACGATCGAGATAAGTCGCCGTTTAGTTGTGAACTTTGCGAGAAAACCTTCAGGCATCCCTCTAATTTAAAACGCCACGTCGAGACGATACACATGCAAATCAAAAAATTCGGCTGTGAACTCTGTGACGAGAGGTACACTTCGAAAGTCGCACTATACCGGCACATGCTCACCCACCCGGGTAGCCGGCGTAATAATTCGGACAAAGAAACGCGCAAAGAAGAAGCGACTGTGAAAAAACTCATTTGCGATGTTTGCAATAAAGTCTTTAAAACAGTTCCTAATTTAAGGAGACACGTCAGGACTGTCCATATGCAGATTAAGAATTTCGCGTGCAATATTTGCGAAAGGAGATTCTGCACGAAGGTTAATCTAGACGAGCATATGAACACGCACGATAACAGCCGGCCTTTGAAATGCGATATTTGCAACCAACATTTTAACAATGGGTCTGGTTTAAGGAGGCACATGAAGAGCCACATTGCCCACGGGAATAAGAGGATGTATAAGTGTGGTATTTGTGATAAGCAGTATTTACACATGCCCGACATAAAGCGGCACGTGATGGCGGTGCACTTGAAAATTTCGCTCACTGAACACACGAAACAACTCAGGGAGAAACAGTGTATTCAGAGTAAAGATGATAATTATagGTCATCTCAAGACACTCGGTCTACTTCCTCCAGTTGGAAAGAAACAACCGACAGTGGATTTAAGAAAACGAGCAAGAGGttacgcaataaaaataatccgaATAATTGCGATATTTGCAATAAATCATTTACGGATCAATGGGATTTGAGATGCCACGTCGAGACGATACACAAGCAAAACAAAAAGTATATCTGCGACGGTTGCTATTTAAGATTTTCAAACCGAACCAGTTTAAAGAATCACAGGGCTACATGTGCACAAGTTGGTTTGGATAATGATAACAACATCCACTCCGAGAGCGATGAGGATAATGAACGAAGGGAAAGAGAAGCGCAGATATCGAACACTTGTGTCATTTGCAATAAAACCTTGTCGTCGTTGACTAATTTAAAGCGGCACGTCGAGTCGGTGCacatgcaaattaaaaaattcgaaTGCCACATTTGCAATAAGAGATTTTCTTCAAAATGTGTTAGAGACGAGCATTTGGACACGCACAGCAACAAGCGGCCTCTTAAATGCGATTTGtgcgataaatattttaagaacaaGCCTGGTTTGAGGACTCACAAGAAGGTGCATTTTAACAGCAGGACGAAAAATACTAGAATTTATACGTGTAATATTTGCGATAAGCAGATTTCTAACTTAAAACGGCACCTTCAGTTAGTGCACAAGATAGCTCCGGATAATGGCAGTACGAAAGAGCAAACGAGGAACGAGTTTGGTAGCGCTGAGATTATTGAGAATGATACGAATTCTAAAGAGAGTGAATATGGGTCATCTCAAGACACTCAGCCTACTTCCTCCAATAGGAAAGAAACAACCGACAATGAATATAAAGAAACGGACGAAGCGAAAATAAGGAATGATGACGGTACTGTATGCAATATTTGTGATGAAACACTCGAAACGGGAAGTAATTTAGTCGACCATTACAtgatacacaataaaatgaatccGTATAATTGCGATATTTGCAATACATCATTCACGCATCATTCGGATTTGAAATCCCACGTGGAGACGACGCACAAGCAAGACAAAAGGTACATCTGCGAAGGTTGCAATATAAGATTTAGAAACCGAAAGAGTTTAAAGAATCACATGGCTAAATGTAAAGAAGGCGGTTCGGATAATGATAACAACACCTACCCCGGGAGCGATGAGGATAATGAAGAAAGTGAAACAGAAGCGCAGAAAACGAACACTTGCGTCCTTTGCAACAAAAGCTTCGCGACGATGACCAATTTAAAGCGGCACGTCGAGTCGGTGCAcatgcatataaaaaaattcgagtgcgatatttgcaataaaagattttttacgAGATTTATAAGGGACGAGCATATGGACACGCACAGTAATAACCGACCTCTTAAATGCGATATGtgcgataaatattttaagaaccGGTCTGGTTTAAGGGATCACAAGAAGATTCATCTTAacaagaagaggaagaagaaacTACATACGTGCAATATTTGCGATAAGCAAATATATAACTTGAACCAACACCTTGCGGCGGTGCACACGCAAATTAGAAACTTCGAGTGCGATTTTTGCGATaaaatttttttaacgaaatttctTAAAGACGAGCATATGGCCACGCACAATGACGAACGGCCTGTTAAATGCGACATCTGTGATAAATGTTTTAAGAACCGGTCTGGTTTAACGGTTCACAAAAGGAGGCATTTTCCTCAgaagaggaaaaataaaatgtacagaTGTAATATTTGCGATAAGCAGATATATAACTTAAATCGGCACCTTGCGGCGGTGCACAAGGTAACTCCAGAAAAATACACAACGGTAGAGCAAATAAAGAACGAGTTTGATCGCGCTGGCATTATTGAGAATGATACAAATTCTAAAGAGAGTGAATATggaTCTGATGACCTTATCATCAAAGAAGAGTTACCCTCAGATTCCGAAGATCCACTACAAGATGATACGATGGAATTGTCGAGCTACTGA
- Protein Sequence
- MAERNYIPVCLDMSTDILCEPWSDLSITNEELVLKSEDQSEDISSAEEGGETPRKNGRDNLIKTKNGKALYSCDECDNISKDAKKFVIHYLSHNKDKSRIYTCEQCLQRFTQPGNLKRHIDVVHLRIRKFCCKICETTFSTKSARDDHMTAHYDGRSNVCATCDESFESKRALRNHRKCHLGRSDHSNERKTCDTCNQSFKLKQSLKRHIERKHLQLKYYCEVCKKLFTNKSSEDAHPNSTNWKESTDDAQTKTDEEAKHHQSLIKQRCRIKNDDGTTCFKCDICDKIIKKTSNFLDHYTMHSNNLTYTCEICCKSLKLPSNLKRHMETIHMQIRKFDCKICNKSYATRSGLYKHTYNHTSSRPYEYSKKASYPELAKELAEKKYTCDVCDKSFAKPDRLKRHVESVHMQIKKFVCDLCDKRFNSKVTRDEHMNKHSNRRPFKCEVCSKRYSYGSGLRAHLKRHILDGKQYTCDICDKRFIQKSHMTRHMKLVHAKSPESDLGIRHTENDFVNSESCTNRKDDEIGNLTYTCDICYKTFNKPTNLKRHVERMHMQAKKFNCNLCDKVYSSYSGLYRHRFIHSNNKSYKAIKNPIKKEVDEGTKHSCDICNKNFECPAKVKRHLDTVHKQAGKPHTCDICNKSFAMAYGLRQHVEGVHMQIKNFVCNICDKRFIAKRVLDEHMNAHTNSRPFKCETCDKQFNYLSSLRMHSRIHSANGKPFSCELCYKQFSQKIHLKRHMLRHAKRSGIDCTEKNEEKPFSSTEIMESDSNSKDDAYGNPTYTCDICYKTFNKPTNLKRHVERMHMPAKKFNCKLCDKVFSSHSGLYRHRFIHSNNKSYSAMNKKEVDEGTKHSCDICNKNFEGPAKLKRHLDTVHKQAGKPHTCEICNKSFAIPYGLRQHVKVVHMQIKNFVCNICDKRFIAKRLLNEHMNAHTNSRSFKCETCDKQFHYLSSLQMHSRINSANGKQLSCELCCKQFSQKIRNKWHMSKHEKHSGIDCTKKHEEKPFSSTEIDCNSKDDVFGSSEGAHPVTINLEETTTDAYKKPIQLEKLLIKQRCKIRNDDGTVAFKCDICDKIVKKGSNFLDHYRTHDRDKSPFSCELCEKTFRHPSNLKRHVETIHMQIKKFGCELCDERYTSKVALYRHMLTHPGSRRNNSDKETRKEEATVKKLICDVCNKVFKTVPNLRRHVRTVHMQIKNFACNICERRFCTKVNLDEHMNTHDNSRPLKCDICNQHFNNGSGLRRHMKSHIAHGNKRMYKCGICDKQYLHMPDIKRHVMAVHLKISLTEHTKQLREKQCIQSKDDNYRSSQDTRSTSSSWKETTDSGFKKTSKRLRNKNNPNNCDICNKSFTDQWDLRCHVETIHKQNKKYICDGCYLRFSNRTSLKNHRATCAQVGLDNDNNIHSESDEDNERREREAQISNTCVICNKTLSSLTNLKRHVESVHMQIKKFECHICNKRFSSKCVRDEHLDTHSNKRPLKCDLCDKYFKNKPGLRTHKKVHFNSRTKNTRIYTCNICDKQISNLKRHLQLVHKIAPDNGSTKEQTRNEFGSAEIIENDTNSKESEYGSSQDTQPTSSNRKETTDNEYKETDEAKIRNDDGTVCNICDETLETGSNLVDHYMIHNKMNPYNCDICNTSFTHHSDLKSHVETTHKQDKRYICEGCNIRFRNRKSLKNHMAKCKEGGSDNDNNTYPGSDEDNEESETEAQKTNTCVLCNKSFATMTNLKRHVESVHMHIKKFECDICNKRFFTRFIRDEHMDTHSNNRPLKCDMCDKYFKNRSGLRDHKKIHLNKKRKKKLHTCNICDKQIYNLNQHLAAVHTQIRNFECDFCDKIFLTKFLKDEHMATHNDERPVKCDICDKCFKNRSGLTVHKRRHFPQKRKNKMYRCNICDKQIYNLNRHLAAVHKVTPEKYTTVEQIKNEFDRAGIIENDTNSKESEYGSDDLIIKEELPSDSEDPLQDDTMELSSY
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01186412;
- 90% Identity
- iTF_01186412;
- 80% Identity
- iTF_01186412;