Basic Information

Gene Symbol
grh
Assembly
GCA_008121235.1
Location
NC:10181158-10232901[+]

Transcription Factor Domain

TF Family
CP2
Domain
CP2 domain
PFAM
PF04516
TF Group
Beta-Scaffold Factors
Description
This family represents a conserved region in the CP2 transcription factor family.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 4 0.69 4.5e+03 -3.8 1.3 94 109 174 189 165 212 0.49
2 4 0.58 3.8e+03 -3.5 0.2 88 97 475 484 441 520 0.61
3 4 0.17 1.1e+03 -1.8 1.3 77 111 745 779 737 792 0.68
4 4 3.4e-63 2.2e-59 200.0 0.4 24 221 910 1102 896 1104 0.94

Sequence Information

Coding Sequence
ATGTCCACATCCACCGCCACAACGAGTGTCATCACGTCCAACGAGCTCTCGTTGTCCGCCCACGGGCAcgcccactcccacacacacaactcccACACGCACAACTCCCACTCGCACGGCCACGCgctgcaccagcaccagcacagccGCATCGGAGGAGGCCTAGGAATCGGAGTCCTCAGTGACGCCTCCCTATCACCCATCCAGCagggcggaggaggaggaggcggcggcggcggaggtgcCAACAGCTCACCCCTGGCGCCCAACGGTGTACCCCTGCTGACCACCATGCACCGCTCCCCGGACAGTCCCCAGCCGGAGCTGGCGACCATGACGAACGTCAACGTCCTCGACCTGCACACAGACTCCTCCAAGCTGTACGACAAGGAGGCGGTCTTCATCTACGAAACGCCCAAGGTGGTGATGCCCGCCGACGGTGCCACGGAAGATGGCCATGCCATCGATGCGCGGATCGCGGCCCAGCTGGGCACACagcacgcccagcagcagcaacaacagcagcaacagcagcagcagcagcaacagcagcagcaacagacggAGCACCAGCCGCTGGCCAAGATCGAGTTCGATGAGAACCAAATCATCCGCGTGGTGGGACCGaacggagagcagcagcagatcatcTCGCGAGAGATCATCAACGGGGAGCACCACATTCTGTCGCGGAACGAGGCCGGGGAGCACATCCTCACGCGCATCGTCAGCGATCCCTCGAAGCTGATGCCCAACGACAATGCGGTGGCCACGGCCATGTACAACCAGGCCCAGAAGATGAACAACGACCACCAGGGCGTCTACCAGACCTCGCCCCTGCCCCTGGACGCCTCCGTGCTGCACtacagcaacgacaacagcaacgtGATCAAAACGGAGGCAGACCTCTACGAGGACCACAAGAAGCATGCGGCCGGCGGTGGCTCCATCATCTACACCACCTCCGACCCGAACGGCGTCAATGTGAACGTGAAGCAGCTGCCCCACCTGGCCGTGCCGCAGAAGCTCGACCCGGACCTCTACCAGACGGACAAGCACATCGATCTGATCTACAACGACGGCAGCAAGACCGTCATCTACTCCACCACCGACCAGAAGGGTCTCGAGATCTACTCAGGCGGCGACATCGGCAGCCTCGTCTCCGACGGCCAAGTGGTGGTCCAGGCGGGCCTGCCCtatgccaccaccaccagtgCCACAGGCCAGCCCGTCTACATCGTGGCCGATGTCAGCGAGGACCATCTACAAAGTGGAAAGCTCAATGGCCAGACCACACCCATCGATGTCTCGGGCCTATCGCAGAATGAGATCCAAGGCTTTCTGCTCGGCTCACACCCCTCATCATCGGCCTCGGCCGTCAGCACCACTGGCGTTgtctccaccaccaccatctcgcagcaccaccagcagcagcaacagcagcagcagcagcagcaacagcaacagcagcagcatcccggCGACATTGTGAGTGCCGCAGGCGTGGGGAGCGCAGGCTCCATTGTCTCATCGGcggtgcaacaacagcagcagcagcagcagctgataaCCATCAAAAGGGAGCCGGAGGATCTGCGCAAGGATCCGAAGAACGGTAACATTGCCGGAGCGACGGCAGCGAACGGCGGCGCGGGTTCGGTCATAACGCAAAAGATCTTGCACGTGGATGCGTCGCCACCAGCCGACGAAGCTGAGATTAGCGAGATtagccacagacacagtccCTGCACCACCAGAACcaccagaaacaacagaaccaacagctgcagcagtagTAGTCCTGCCACCGAACTAGAGATGTATGCTACCACGGGCGGCACACAGATTTACCTACAGACCTCACATCCCAGCACGGCCAGCGGCGCGGGAggaggtggcggcggcagcggcgtcgGTGGTGGCGGATCTGGCGGAGCCGGCGGCTCCCTGCAGGCACAAAGCCCCAGTCCGGGGCCGTACATAACCGCCAACGACTATGGAATGTACACGGCCAGCCGACTGCCGCCCGGCCCCCCgcccaccaccgccaccaccttCATAACGGAGCCCTACTACCGGGAGTACTTTGCGCCGGACGGACAGGGCGGCTATGTGCCCGCCTCCACGCGCTCCATCTACGGCGACGTGGACGTTTCGGTGTCGCAGCCAGGCGGAGTGGTCACCTACGAGGGTCGCTTCACAGGCAACGCCCCCCCGCCCACCACCACGACGCTGCTCACGAGCAGCGTCAGCgtgcaccaccagcaacagcagcagcagcagcagcaacaacagcaacagcaacagccccaacaccagcagcacctccaccaacagcagcagcagcagcagcaccaccatccGCAGGACGGCaagggcggcggcagcacgCCACTCTATGCCAAGGCCATCACGGCGGCGGGACTCACCGTCGATCTGCCCAGTCCCGACTCGGGCATCGGCACGGATGCCATCACGCCGCGGGATCAGACCAACATACAGCAGTCCTTCGATTACACGGAACTGTGCCAGCCGGGCACCCTCATCGATGCCAATGGCAGCATACCGGTGTCCGTTaacagcatccagcagcgGACAGTGGTGCATGGCAGCCAGAACAGTCCCACCACCTCCCTGGTGGACACGAGCACAAACGGTTCGACGCGTTCGCGGCCCTGGCACGACTTTGGCCGCCAGAACGATGCcgataaaatacaaataccaAAAATATTCACAAATGTTGGCTTCCGCTATCACCTGGAGAGCCCCATCAGCTCGTCACAGAGGCGCGAGGACGATCGCATCACCTACATCAACAAGGGCCAGTTCTATGGGATCACGCTGGAGTATGTGCACGATGCGGATAAGCCCATCAAGAATACGACGGTTAAGAGTGTGATCATGCTCATGTTCCGCGAGGAGAAGAGTCCCGAAGATGAGATCAAGGCCTGGCAATTCTGGCACAGTCGCCAGCATTCCGTGAAGCAAAGAATCTTGGATGCAGATACGAAGAACTCGGTTGGCCTGGTTGGCTGCATCGAGGAAGTGTCGCACAATGCCATCGCCGTTTACTGGAATCCGCTGGAGAGCTCCGCCAAGATCAACATTGCGGTGCAGTGCCTGAGCACGGATTTCAGCAGTCAAAAAGGAGTCAAGGGCCTGCCGCTGCACGTACAAATCGACACATTCGAGGACCCCAGAGATGCGGCCGTCTTCCACCGGGGCTACTGTCAGATAAAGGTCTTCTGCGATAAGGGCGCCGAGCGCAAGACGCGCGATGAGGAGCGACGGGCCGCCAAGCGGAAGATGACGGCCACCGGCAGAAAGAAGCTGGATGAGCTCTACCATCCGGTGACAGATCGGTCCGAGTTCTATGGCATGCAGGACTTTGCCAAGCCGCCGGTGCTCTTCTCGCCAGCGGAGGACATGGAGAAGGTAGGTCAGCCGGGCCTGACTGGCTTGACATTCATCCacacgaatacgaatacgcacacgaatacgaatacgaactcgaactcgaactgcaactccaactcgaaTTCGCCCTTGCAGAGCTTCTACGGCCATGAGACTGACTCGCCGGACCTGAAGGGTGCCTCGCCGTTCCTGCTCCACGGCCAGAAGGTGGCCACGCCGACGCTCAAGTTCCACAATCACTTTCCGCCCGACATGCAGACCGATAAAAAGGATCACATACTGGACCAGAACATAATGACCAGCACACCCATGGCGGACTTTGGTCCGCCCATGAAGCGGGGACGCATGACGCCGCCGACCTCGGAGCGTGTCATGCTGTACGTGCGGCAGGAGAACGAGGAGGTCTACACACCGCTGCATGTGGTTCCCCCCACCACAATCGGTCTGCTGAATGCGattgaaaacaaatacaaaatctcAACAACGAGCATAAATAACATTTATCGCACAAACAAGAAGGGGATTACTGCGAAAATTGACGACGATATGATATCGTTCTACTGCAATGAGGACATCTTCTTGCTGGAGGTGCAGCAGATCGAGGACGATCTGTACGATGTCACGCTGACGGAGCTGCCCAATCAGTAG
Protein Sequence
MSTSTATTSVITSNELSLSAHGHAHSHTHNSHTHNSHSHGHALHQHQHSRIGGGLGIGVLSDASLSPIQQGGGGGGGGGGGANSSPLAPNGVPLLTTMHRSPDSPQPELATMTNVNVLDLHTDSSKLYDKEAVFIYETPKVVMPADGATEDGHAIDARIAAQLGTQHAQQQQQQQQQQQQQQQQQQQTEHQPLAKIEFDENQIIRVVGPNGEQQQIISREIINGEHHILSRNEAGEHILTRIVSDPSKLMPNDNAVATAMYNQAQKMNNDHQGVYQTSPLPLDASVLHYSNDNSNVIKTEADLYEDHKKHAAGGGSIIYTTSDPNGVNVNVKQLPHLAVPQKLDPDLYQTDKHIDLIYNDGSKTVIYSTTDQKGLEIYSGGDIGSLVSDGQVVVQAGLPYATTTSATGQPVYIVADVSEDHLQSGKLNGQTTPIDVSGLSQNEIQGFLLGSHPSSSASAVSTTGVVSTTTISQHHQQQQQQQQQQQQQQQQHPGDIVSAAGVGSAGSIVSSAVQQQQQQQQLITIKREPEDLRKDPKNGNIAGATAANGGAGSVITQKILHVDASPPADEAEISEISHRHSPCTTRTTRNNRTNSCSSSSPATELEMYATTGGTQIYLQTSHPSTASGAGGGGGGSGVGGGGSGGAGGSLQAQSPSPGPYITANDYGMYTASRLPPGPPPTTATTFITEPYYREYFAPDGQGGYVPASTRSIYGDVDVSVSQPGGVVTYEGRFTGNAPPPTTTTLLTSSVSVHHQQQQQQQQQQQQQQQPQHQQHLHQQQQQQQHHHPQDGKGGGSTPLYAKAITAAGLTVDLPSPDSGIGTDAITPRDQTNIQQSFDYTELCQPGTLIDANGSIPVSVNSIQQRTVVHGSQNSPTTSLVDTSTNGSTRSRPWHDFGRQNDADKIQIPKIFTNVGFRYHLESPISSSQRREDDRITYINKGQFYGITLEYVHDADKPIKNTTVKSVIMLMFREEKSPEDEIKAWQFWHSRQHSVKQRILDADTKNSVGLVGCIEEVSHNAIAVYWNPLESSAKINIAVQCLSTDFSSQKGVKGLPLHVQIDTFEDPRDAAVFHRGYCQIKVFCDKGAERKTRDEERRAAKRKMTATGRKKLDELYHPVTDRSEFYGMQDFAKPPVLFSPAEDMEKVGQPGLTGLTFIHTNTNTHTNTNTNSNSNCNSNSNSPLQSFYGHETDSPDLKGASPFLLHGQKVATPTLKFHNHFPPDMQTDKKDHILDQNIMTSTPMADFGPPMKRGRMTPPTSERVMLYVRQENEEVYTPLHVVPPTTIGLLNAIENKYKISTTSINNIYRTNKKGITAKIDDDMISFYCNEDIFLLEVQQIEDDLYDVTLTELPNQ

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00580456;
90% Identity
iTF_00574576;
80% Identity
-