Basic Information

Gene Symbol
-
Assembly
GCA_933228835.1
Location
CAKOGE010000177.1:2541845-2551828[+]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. Its two conserved cysteines and two conserved histidines coordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either histidine or cysteine. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
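The pattern in the description can be approximated as a regular expression. The following is a minimal sketch, not the method used to produce this record: it assumes that both X and # positions accept any residue, and the names `C2H2_PATTERN` and `find_c2h2` are hypothetical. The test fragment is residues 191-213 of the Protein Sequence field, which is the first hit in the Hmmscan Out table.

```python
import re

# Regex rendering of #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C].
# Assumption: '#' (fold-stabilising) and 'X' positions both match any residue.
C2H2_PATTERN = re.compile(
    r"."        # '#'
    r"."        # X
    r"C"        # first zinc-coordinating cysteine
    r".{1,5}"   # X(1-5)
    r"C"        # second zinc-coordinating cysteine
    r".{3}"     # X3
    r"."        # '#'
    r".{5}"     # X5
    r"."        # '#'
    r".{2}"     # X2
    r"H"        # first zinc-coordinating histidine
    r".{3,6}"   # X(3-6)
    r"[HC]"     # final histidine or cysteine
)


def find_c2h2(protein):
    """Return (start, end, motif) tuples, 1-based inclusive, non-overlapping."""
    return [(m.start() + 1, m.end(), m.group())
            for m in C2H2_PATTERN.finditer(protein.upper())]


# Residues 191-213 of the protein sequence in this record.
fragment = "HKCNQCSKSFRTAKSLCAHMKSH"
print(find_c2h2(fragment))
```

A plain regex like this is far more permissive than a profile HMM such as PF00096, so it is best treated as a quick sanity check rather than a replacement for the hmmscan results.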
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 29 3.4e-05 0.0018 19.0 6.7 1 23 191 213 191 213 0.98
2 29 0.0047 0.25 12.2 0.3 2 20 269 287 268 289 0.93
3 29 0.00016 0.0085 16.8 0.1 3 23 323 344 321 344 0.96
4 29 0.0084 0.45 11.4 0.5 1 23 395 417 395 417 0.98
5 29 0.022 1.2 10.1 0.4 1 23 444 466 444 466 0.95
6 29 0.0056 0.3 12.0 3.7 2 23 515 536 514 536 0.94
7 29 0.00015 0.0079 16.9 2.3 2 23 580 602 580 602 0.97
8 29 2.2e-05 0.0012 19.5 1.0 1 23 652 675 652 675 0.97
9 29 0.0003 0.016 16.0 0.2 1 23 725 747 725 747 0.97
10 29 0.028 1.5 9.8 5.4 2 23 759 780 758 780 0.96
11 29 0.022 1.1 10.1 1.0 2 23 792 813 791 813 0.96
12 29 0.017 0.88 10.5 0.4 1 23 823 845 823 845 0.98
13 29 2.7 1.4e+02 3.5 6.8 1 23 874 896 874 896 0.91
14 29 0.00076 0.04 14.7 2.5 1 23 907 929 907 929 0.98
15 29 3e-05 0.0016 19.1 1.4 1 23 937 959 937 959 0.98
16 29 0.00038 0.02 15.6 0.5 2 23 979 1001 978 1001 0.97
17 29 1.2 66 4.6 0.9 1 23 1019 1041 1019 1041 0.91
18 29 0.13 7.1 7.6 3.7 1 23 1048 1070 1048 1070 0.97
19 29 0.011 0.61 11.0 0.3 2 23 1081 1105 1080 1105 0.94
20 29 0.0044 0.23 12.3 0.2 2 23 1128 1149 1127 1149 0.94
21 29 1.7 92 4.1 0.9 2 20 1153 1171 1152 1172 0.91
22 29 0.0031 0.17 12.8 0.2 2 23 1230 1251 1229 1251 0.95
23 29 1.5 80 4.3 3.7 5 23 1261 1279 1258 1279 0.94
24 29 0.02 1.1 10.2 0.9 2 23 1290 1312 1289 1312 0.95
25 29 3.6 1.9e+02 3.2 6.5 2 23 1329 1347 1329 1347 0.89
26 29 0.47 25 5.9 1.4 2 21 1351 1370 1351 1371 0.93
27 29 0.29 15 6.6 0.4 2 23 1420 1441 1419 1441 0.93
28 29 0.023 1.2 10.0 1.1 2 23 1452 1474 1451 1474 0.95
29 29 6.6 3.5e+02 2.3 3.1 2 19 1512 1529 1511 1532 0.91
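Each row of the table above has thirteen whitespace-separated values, so the rows can be loaded into typed records and filtered. The sketch below assumes that layout; the `DomainHit` field names, the `parse_hits` helper and the 0.01 i-Evalue cutoff are illustrative choices, not part of the record. The three sample rows are copied from the table.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    index: int       # '#': domain number
    total: int       # 'of': total number of domains reported
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hits(text):
    """Parse table rows; header and malformed lines are skipped."""
    hits = []
    for line in text.strip().splitlines():
        f = line.split()
        if len(f) != 13 or not f[0].isdigit():
            continue
        hits.append(DomainHit(
            int(f[0]), int(f[1]), float(f[2]), float(f[3]),
            float(f[4]), float(f[5]), int(f[6]), int(f[7]),
            int(f[8]), int(f[9]), int(f[10]), int(f[11]), float(f[12]),
        ))
    return hits


rows = """
1 29 3.4e-05 0.0018 19.0 6.7 1 23 191 213 191 213 0.98
2 29 0.0047 0.25 12.2 0.3 2 20 269 287 268 289 0.93
13 29 2.7 1.4e+02 3.5 6.8 1 23 874 896 874 896 0.91
"""

hits = parse_hits(rows)
# Assumed cutoff: keep domains with an independent E-value of at most 0.01.
strong = [h for h in hits if h.i_evalue <= 0.01]
for h in strong:
    print(h.index, h.ali_from, h.ali_to, h.i_evalue)
```

Filtering on the independent E-value rather than the conditional one is a design choice in this sketch; a different cutoff or column may suit other analyses.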

Sequence Information

Coding Sequence
ATGGCTCTCAAGCTTGGGAAATGCAGGCTCTGCCTCAAACTGGGCGACTTTTACTCCATCTTCACGGTGGACAACAACATGCAGCTGGCCGAAATGGTCATGGAATGTGCCCGGGTGAaaATATACGACGGCGACGGACTGCCGGACAAGATTTGCAGCGAATGTATTCAGAAGCTGAGTAGTGCGCATATTTTCAAACAACAGTGCGAGCGCTCCGATCAGGAGCTGAGACGGAACTATGTTCCTCCTCCAGGTTTTAGTCCAACTCCGCCGCCAAACAGACAGAGCAGTGATTCGGCGTTCTCGTCTCACACTGAAGTTTCCAAGCCATCATCTTCCACCGAGAGCCAAGTCACTCCCGTCAGTCGGACGAGAAAGCGCAGTAGAGACAGTATCGACGAAGCTTCCACAAGCAGTCGCTCGAACTACAAGCCGGGAAGCTCAAAGAGAGTCGATGAATTGCGCATGTCCCAAAAGAAACCGAGAATTTCACAGAATTCGGATTCAGACTATGAGGACAACAGCGCGTCGTTGTACTCCGCAGAAACTGACTCTGACGAACCCTTGCAGCACAAGTGTAACCAGTGCTCCAAATCATTCAGGACTGCCAAAAGTTTGTGTGCACATATGAAAtcacataaaagaaaatttaacttaCAAAACGCTCAAAATGCTACTCCCACTAATGAATCGCCTGCTAAAAAACAAGCACCCAAAGAAACCCCTACCAAGGAAGCCAAGGAAACCCTCACCAAGGAAGGTTACAAAGAAATGCGAGACGCCAAGGACGACGACGATAAATTAAACTGCGATAAATGCGGAAAACAgttcaaattaaatataatgCTGAAAAGGCATTACGATCTCTGCGGGAAATCTCCTCAAAAAGAACTCTTAGTCTCATTAGAGCCTATAGATATGGTACAAGCTATTACTACTAGTAGTACTCAATCCGGCAAGATCGATTGCGAAATATGCACCGCTAAGTTTAAAACTATAGACTATCTAGAGAAACACATGAGAATTGTACATGCGGCTGTGCCAAAAAAAGAAAGACTCTCAATTACAAACGAGAACGGTATTTCATGCGTACCGTGCGTCTTTTGCAACCAACCATTTGAGGATTACTACGTGCATAGTGCTCACCTTTCTGCGTGTCCTAAAAAGACGGATACAATAAACTTCGAATGTACAGTTTGCAAGAaggttataattaaaaaatcttcaTACATCCTCCATGCTAAGATGCATTTTTTTCAATTAGCCGCAATAAAAGAGCCACCTGAGAAGGCTTCTAATACCAGTAATAACGAAGGTTGTAATAACAGTCATCAATGTCGGATGTGCACTAAGAAGCTACCTTCGCAGGAAGCGTTGATCAGTCATCTCGCTGCTCATATGAACAAAGCAGAAGAGGAAGACTATGAAGATAATGGCGATACAATGGCCGCTGACGATGATAACGATTCGAGAACTAGTACAGTCGAAGAATCGGCGTCCGTGCATTCGGACTACAACTCTTATATCACCAGCGGACCCCTCCAGTGTAGGTACTGCGACAAAAACTTCAAATACAAGAAGGCTCTGCATTCACACGAAGTTAAACACACGACTGGTGATATAAAAATCGAGAAGTCAGACAAGAAATCCAAAAATTTCTTGAACCAGACCAATAATTCGTATTTATCCGACGAGTCCGATATGGAGTCTAGTCAAGACGAAGGCGAAGAAGACAACACCTGTGATATTTGCGAGAAGCAGTTTTCCTACAAGCGGTTGCTTATAAAGCACAAGCGAACCAAACATTGTATGACTTCTGGCACGAAGCGAGCGAAAATTAACCTAAAGGACTGCTCGGTGCGCTGCCTGATCTGCGACCTGGAGATGAAGGTGAGCGCGATCAACGAGCACAACCAGACGCACATCACGGCCAACATGAAGCCGCGCAACCTGTACACGTGCAAGGAGTGCGGCGAGCAGTTTAAGAGTTGCAGTGGGCT
CGCGAACCACATCAAGCTGGTGCACCGGCTGCACCAGCCGCCCGCCAAGAAGATCGTCGTGCCCAACGCCGATCTGGCGGATTTTTGTGAAGTCGTTGTGACGAAGGCGGAACCCCTGGACGAGCTCCAGAGTCACAACGGCTTTGGTGAGGTTTCCGTCGACGCAAGCGGCTTTACGTGCCCCGTGTGCAGCAAGACGCTGCCTACCCTCGTCTCGCTCAAGCGCCACGTCAATTGGCACAAGAATGTCGGGAATAACATCGAGAAGAAGTTGGAGTGCTTTGTTTGCAAAGAGatcTTCCGTTTCCAATGTCACTACAAGATCCACATGCGGCAGCATTACCAGGATCCCAACCTGGACCCCAAGCTGCTCACGTGCGACATCTGCGGCCGCAAGAGCAAGCACCTGCGCGCCGCTCAGGCGCACATGAACTTCCACAAGCAGACGCGCTTCAAGAACAAGGACTACGAGTGCGCCATCTGCAAGCGAGTGTTCCAGTACAGGAAGGTGTATCTGTCGCACATGGCCATCCACTTCAAGCGCGGCGAGAGCGCCGCCACCGCCGTCGTGGGCGACGTGGTACCGCTCACGCAGGACAACAAGCGGTTCGACGGCACGCACACCTGCCACCTGTGCGGCAAGGTGTGCGACTCCGAGAACTCGCTCAAGTGCCACGTCAGCTGGCACAACTCCAAGACGCTGCTGTACGGCGCGCGCCACGAGTGCGAGATCTGCAACGTGCAGTTCACCAACAAGCGGCGGCTCGAGCTGCACACCCGCACGCACTTCGAGGACGAGAACGGGCCCTACAAGTGCCACATATGCGGGAAAGGATTCATCGTAGAGGACTATTTTAAACGACACGTTAAAGGCCACAACTTCGATCACCAATCCCACAAAAAGCGTATCGAAAAATTGCGGAAGGACAAAGTAAAATGTCCCATCTGCGAGCGATACTACCCCGATCTGATCCATCTCATCCGACATCTTCGACGAACTCACCCCGAAAGCAAAATGATCAAGGAAGACCCCGACGCCCCGCCTCCCATCTACTACTCGTGTAAACTGTGCGCCAAGGTTTTCTTAGACGAGAGAAGATTGCAGCATCACGAGGAGGCTCATCTGAGAAAACCAGAATTCTTCAAATGCAAGTTCTGCGGCAAGAAAACTATCTCGCTCAAGAATCACAGGGTTCACATCAAGGGCCACCTGACTCAGAAGTATATTGACGAGCCTCTGAAGTGTCCGAGGGAGGACTGCGACGAGACGTTCGGGCGCGGCTACGATTTACACTACCACTTGCGCGACGCGCACGGCATCACGGAAACGTGGATCGCCGAGCGCGGCCCGAGAACGCTGGACGGCCCGCTCAAGGAGCTGCAGTGCTCGATATGCTACAAAGTACTAGCTAGCAAGGGCAACTACGAGAGGCATGTAGACTACCACAACTCTCTGCGATGTAACTACTGCTTCGAATTCTTCAACAGTTTCCGCTTCCTCGAGGGCCACTTGACGTTCAGCTGCGAGAAGAAGAAACTAATCGGTGACTCTGAGGTCTACCTTAAGAGAGTGAAATGCCATATCTGCTACAAGGCCTTCCACTTGCAGGTGAAGCTAGATTGCCACCTGCGCACGCAGCACGATATCAAGACATTCAAGGAGGCCTCGGAGGGCAAGAAGGAGATCGTGTGCGACTACTGCTTCAAGGTGTTCGAGAACGAGTACGCGCTCAGCACGCACAAGATCTACCACCGCACCATCGGCTACTACGGTTGTATCTACTGCAACAGGAAGTTCAACACTATGACAGCGTACCGCAAGCACAAGAACCACCACTTCTCTCAACTAAACGTCGACAACCCCACCAAGTGCGAACACTGTGACGAGACGTTTGTCGCGTTCCGAGAGATGATATACCACATGCGGGACGAGCACGGTGACGACAAGGAGTGGATCGTGCTGCCCAAGGAGTCAATCGAAGAGAAGTGCAACATATGCC
ATAAAACTTTCTTCAATCTTCACAGACATTTGGATTACCATGAAGAGAACCGATGTAAAAAGTGCGGGGAGTACTTCTTCTCGCAAATGGACTACGACAATCATTTGTGCGCAGTCGACAGCGACGAGGAAGTCACCGAAGTCAACACCAACGGCGTTCGGGCAGTTTACGAAGAGTGCACCTTCTGCTTTAAACCCATTACCAAGAAGAACTCGAAAAAGAAACACGACATAATCCACAAGGGATCCGGAACTATATCGTGTCGGTTCTGCCCTCTTAAGTTTAAAACTATAGACGCGTTCAACATCCACGCATTCTCCCATCGAAGCAGAAAGTATAAGAAGAAGCCAATCAAGTGTCGTAAATGTAAGGAGAAGTTTGTGAAATACGGTCCATTCATGAGGCACATGAAGGAAGTCCACAAGTCTTCTAAGAAGCTTCACTATAGAACAACAGTCATGGCGGAGGAATGCGTCGTGTGTCACGACAACTTTCCCAATTTGCATAATCATTACCGCGCACATTTACAGAACCAGTGTCAGCAGTGCTTCAAGTATTTCACTTCGTTCAAACTATTTTCTATTCACGAATGCGATAAGGAAGACTCGAATCCATCTAAAGTGTTTACGTGTGACCAAAACTTGATTGAACTTATTAACAACTACGTTCCTAAAGATGAGAAGGATGATGAAAAATACTATGGTTATGAGGAAGGAGACGAAGTtgaagaagatgaagatgaagTTGATATTGAGACGATAGAGGCTCCCGTCACTTCAAAACCCATAAATCAAACTTACGTCCCTGATATAACATCACAGGATGAGGACAGCCAAAACTCTATAGACATAGACGAACAAAATGTCCATGAGATGGCGCATGCACCGATTATATCAGACGTTCTGTCCCtctttaaaaagaaagaagaaaaaattaaattagatgATGACAAAGGTAATGATAGTGATGTTGTAGTTTTGACAGATGAAGACTCTGTAGGTTTTGAGAACAACATTATGACTGTCATAACTATAGAAGATTAA
Protein Sequence
MALKLGKCRLCLKLGDFYSIFTVDNNMQLAEMVMECARVKIYDGDGLPDKICSECIQKLSSAHIFKQQCERSDQELRRNYVPPPGFSPTPPPNRQSSDSAFSSHTEVSKPSSSTESQVTPVSRTRKRSRDSIDEASTSSRSNYKPGSSKRVDELRMSQKKPRISQNSDSDYEDNSASLYSAETDSDEPLQHKCNQCSKSFRTAKSLCAHMKSHKRKFNLQNAQNATPTNESPAKKQAPKETPTKEAKETLTKEGYKEMRDAKDDDDKLNCDKCGKQFKLNIMLKRHYDLCGKSPQKELLVSLEPIDMVQAITTSSTQSGKIDCEICTAKFKTIDYLEKHMRIVHAAVPKKERLSITNENGISCVPCVFCNQPFEDYYVHSAHLSACPKKTDTINFECTVCKKVIIKKSSYILHAKMHFFQLAAIKEPPEKASNTSNNEGCNNSHQCRMCTKKLPSQEALISHLAAHMNKAEEEDYEDNGDTMAADDDNDSRTSTVEESASVHSDYNSYITSGPLQCRYCDKNFKYKKALHSHEVKHTTGDIKIEKSDKKSKNFLNQTNNSYLSDESDMESSQDEGEEDNTCDICEKQFSYKRLLIKHKRTKHCMTSGTKRAKINLKDCSVRCLICDLEMKVSAINEHNQTHITANMKPRNLYTCKECGEQFKSCSGLANHIKLVHRLHQPPAKKIVVPNADLADFCEVVVTKAEPLDELQSHNGFGEVSVDASGFTCPVCSKTLPTLVSLKRHVNWHKNVGNNIEKKLECFVCKEIFRFQCHYKIHMRQHYQDPNLDPKLLTCDICGRKSKHLRAAQAHMNFHKQTRFKNKDYECAICKRVFQYRKVYLSHMAIHFKRGESAATAVVGDVVPLTQDNKRFDGTHTCHLCGKVCDSENSLKCHVSWHNSKTLLYGARHECEICNVQFTNKRRLELHTRTHFEDENGPYKCHICGKGFIVEDYFKRHVKGHNFDHQSHKKRIEKLRKDKVKCPICERYYPDLIHLIRHLRRTHPESKMIKEDPDAPPPIYYSCKLCAKVFLDERRLQHHEEAHLRKPEFFKCKFCGKKTISLKNHRVHIKGHLTQKYIDEPLKCPREDCDETFGRGYDLHYHLRDAHGITETWIAERGPRTLDGPLKELQCSICYKVLASKGNYERHVDYHNSLRCNYCFEFFNSFRFLEGHLTFSCEKKKLIGDSEVYLKRVKCHICYKAFHLQVKLDCHLRTQHDIKTFKEASEGKKEIVCDYCFKVFENEYALSTHKIYHRTIGYYGCIYCNRKFNTMTAYRKHKNHHFSQLNVDNPTKCEHCDETFVAFREMIYHMRDEHGDDKEWIVLPKESIEEKCNICHKTFFNLHRHLDYHEENRCKKCGEYFFSQMDYDNHLCAVDSDEEVTEVNTNGVRAVYEECTFCFKPITKKNSKKKHDIIHKGSGTISCRFCPLKFKTIDAFNIHAFSHRSRKYKKKPIKCRKCKEKFVKYGPFMRHMKEVHKSSKKLHYRTTVMAEECVVCHDNFPNLHNHYRAHLQNQCQQCFKYFTSFKLFSIHECDKEDSNPSKVFTCDQNLIELINNYVPKDEKDDEKYYGYEEGDEVEEDEDEVDIETIEAPVTSKPINQTYVPDITSQDEDSQNSIDIDEQNVHEMAHAPIISDVLSLFKKKEEKIKLDDDKGNDSDVVVLTDEDSVGFENNIMTVITIED
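The relationship between the Coding Sequence and Protein Sequence fields can be checked by translating the nucleotides. This is a minimal sketch that assumes the standard genetic code and treats lowercase (presumably soft-masked) bases the same as uppercase ones; the `translate` helper is hypothetical, and the short inputs are copied from the start and end of the coding sequence above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()  # lowercase bases are treated like uppercase ones
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nucleotides of the coding sequence in this record.
print(translate("ATGGCTCTCAAGCTTGGGAAATGC"))
```

Applied to the full coding sequence with its line breaks removed, the same function can be compared against the Protein Sequence field as a consistency check.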

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01092303;
90% Identity
iTF_01092303;
80% Identity
-