Cnig000585.1
Basic Information
- Insect
- Cataglyphis niger
- Gene Symbol
- -
- Assembly
- GCA_004329405.1
- Location
- SJPC01003574.1:3301-14598[+]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers give the number of residues (a range where shown in parentheses). The positions marked # are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
 #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
 1  32   3.2e-06    0.0022   20.2   1.0         1      23       271     293       271     293  0.98
 2  32    0.0046       3.2   10.2   3.7         1      23       316     339       316     339  0.96
 3  32   0.00031      0.22   13.9   0.5         2      23       346     368       345     368  0.96
 4  32    0.0046       3.2   10.3   0.1         1      21       388     408       388     409  0.95
 5  32   0.00044      0.31   13.4   0.2         2      23       508     529       507     529  0.94
 6  32   0.00015       0.1   15.0   0.3         1      23       535     558       535     558  0.96
 7  32   4.5e-05     0.031   16.6   0.1         1      23       628     651       628     651  0.96
 8  32    0.0041       2.9   10.4   1.1         1      23       657     680       657     680  0.95
 9  32     0.032        22    7.6   0.8         1      23       693     715       693     715  0.97
10  32   1.8e-05     0.013   17.8   0.9         3      23       760     781       759     781  0.96
11  32      0.79   5.5e+02    3.2   0.4         1      23       814     837       814     837  0.92
12  32    0.0025       1.7   11.1   0.8         2      23       844     867       843     867  0.94
13  32      0.46   3.2e+02    3.9   0.0         2      23       876     898       876     898  0.90
14  32     0.013       9.2    8.8   0.0         2      23       905     927       904     927  0.93
15  32   6.7e-06    0.0046   19.2   1.2         1      23       935     957       935     957  0.98
16  32      0.56   3.9e+02    3.7   1.2         2      23      1003    1026      1002    1026  0.95
17  32    0.0021       1.5   11.3   1.0         1      23      1029    1052      1029    1052  0.97
18  32   0.00035      0.25   13.7   0.1         2      23      1059    1081      1058    1081  0.95
19  32   0.00047      0.33   13.4   3.9         1      21      1109    1129      1109    1130  0.95
20  32    0.0014      0.96   11.9   1.8         2      23      1144    1165      1143    1166  0.93
21  32      0.18   1.3e+02    5.2   1.1         1      23      1169    1192      1169    1192  0.94
22  32    0.0054       3.7   10.0   0.9         2      23      1199    1222      1198    1222  0.90
23  32       1.6   1.1e+03    2.3   0.2         2      23      1230    1252      1230    1252  0.94
24  32       1.7   1.2e+03    2.2   0.0         2      23      1259    1281      1258    1281  0.88
25  32      0.12        81    5.8   2.7         2      23      1290    1312      1290    1312  0.96
26  32      0.59   4.1e+02    3.6   0.4         2      23      1318    1340      1317    1340  0.91
27  32    0.0086       5.9    9.4   0.1         1      23      1402    1425      1402    1425  0.95
28  32     8e-05     0.056   15.8   0.9         1      21      1453    1473      1453    1474  0.96
29  32      0.16   1.1e+02    5.4   1.9         2      23      1573    1595      1572    1595  0.94
30  32      0.47   3.3e+02    3.9   0.4         2      23      1602    1624      1601    1624  0.94
31  32      0.36   2.5e+02    4.3   2.5         1      23      1635    1657      1635    1657  0.97
32  32     0.068        47    6.6   0.1         1      20      1661    1680      1661    1682  0.92
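The consensus pattern quoted in the Description field can be approximated with a regular expression. The sketch below is illustrative only and is not how the Hmmscan hits in this record were produced (those come from a profile HMM, PF00096). It assumes that both `X` and `#` may be matched as "any residue", since the description does not list which residues are allowed at the `#` positions. The names `C2H2_PATTERN` and `find_c2h2_candidates` are hypothetical, and the test fragment is a 23-residue stretch copied from the protein sequence of this record.

```python
import re

# Pattern from the Description field:
#   #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
# Assumption: '#' and 'X' are both treated as "any residue" ('.'),
# because the record does not say which residues '#' permits.
C2H2_PATTERN = re.compile(
    "".join([
        "..",       # #-X
        "C",        # first zinc-coordinating cysteine
        ".{1,5}",   # X(1-5)
        "C",        # second cysteine
        ".{3}",     # X3
        ".",        # #
        ".{5}",     # X5
        ".",        # #
        ".{2}",     # X2
        "H",        # first histidine
        ".{3,6}",   # X(3-6)
        "[HC]",     # final His or Cys
    ])
)


def find_c2h2_candidates(protein):
    """Return (start, end, text) for non-overlapping pattern matches.

    Coordinates are 1-based and inclusive, like the ali/env
    coordinates in the Hmmscan table.
    """
    return [
        (m.start() + 1, m.end(), m.group())
        for m in C2H2_PATTERN.finditer(protein.upper())
    ]


# Fragment copied from the protein sequence of this record.
fragment = "YVCSLCNKAFSSKGHLSLHARIH"
print(find_c2h2_candidates(fragment))
```

A plain regular expression is much cruder than the profile HMM: it neither scores partial matches nor reports overlapping candidates, so its hits should be read as rough candidates rather than domain calls.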
Sequence Information
- Coding Sequence
- ATGGCGACGAtatcgacgacgatgacgatgacgacgacaacgacgacgatgacgacgacgacgacgaggacgatGTTGACATCGACGTCAACGTTTGCGGCTACAACGAAGGACACGATGGCGATGCCGATGGAACGAAGGAAATCTTATCATGATCTTTGTCGTCTGTGCGCCTCTTACGATAACGTCAGCATGCACATATTCGGCCAGGAAGGCAGAAACCGGCAGCTCGTTGACAAAATTCAAACGTGCCTGCCGTTCAAGATAGAGGAAGATGATCGTTTGCCAAAAGTCCTGTGTTATCGATGCATgtataatttggaaaatttctaCGACTTCAGAACCGCATGCGTGAATGCATCTGCTTGGTTGGAAAGAAATAGACCCAAGGAAGGAGCGAATGATGACGGTGCAAATAATAGCGCACAGTGCAGCGACATGCACGCGGAGCTTCTCaagggaaagaaaaatatgccaATACTTATCCCGGAGGCGCCCGTGGTCAATCCGAATGCCGCATTAGGTACACCACCGAGATTGAATTCCGATGGAGAGGCCGATCCCGAGATCGAAGAGATTCTCGATGCGAGCGAAGGCACCGACGAAGTGCTCGATGATTCGACGGAGGATCGACGGTCGGAATACGAATACGAGATGGACATGGAGACGAATCCTAGCGACTTTTTAGAAATGACGCCGATGGTAACCGAAGAAAATGAGGAGGAATGCGGCGCAAATAATACGAGTGCTACGGCCGGTCAGCAAGAGGCCGCGACGGTCTTTCCACCTACTTCGCAGCAGCACGAGGTCTATGTATGTTCTCTGTGCAACAAGGCATTCAGCTCCAAAGGTCATCTATCGTTGCACGCGAGGATTCACGTGGGTGAAGGTGATGTGATCGGCGAAAGGGTTATTACTGATGACCATACTTCGTATCAACGACCGTATCAATGCGATCTCTGTCATAAGTCGTATTCTACCGCGAAGCATCGCTGGGGACATGTTTCTACGACACATCGGGGACATCCTGCTGTAACGTGTGCCTATTGCTCGCGAATATACTCGACACGGTACAACCTCGACGAGCATATAAAATCGCGACATGCCGGTTTACCGCCACCCCCAGAATTATCGGTTTCTCTTTCCCGCGCAGAGACTCGTTACCAATGCCAAACGTGTCCGatgatttataaagatttggCAGATTTCAACGCGCATCGTCAGATATGCATTGAGGAACAACGCACTGATCTACTAGGACAAACCGACACGCAAaacaataagaatttaatcgATATGTCCGATATGTCGAGTATTATAGACTCGGACGACGAGAACAAAGACTTTAGGAGTGCCGAGGCTAAACTGGCCAAGAATCCGCAATTGACTATATTGAAACAAGCTTTGACTAAAGGAGACAGTTTGAAAAGGATTTACGACGATGAGGGCTCAACGTCGAGCTGCAAGGCGAGAAAAATGGTCAAGAtagAAGGCGAGACGAATCCTGAGATAAAAAGGTGGTATTGCGAATATTGTCCGCAAAGTTTTACATCAGTGGACAGTTTAAAGTTGCACGAAGTCAGGCATGATGCCGAGAAGCCGTTTATCTGCGTACTATGCAAGAAggattttgttttgaaatcttCATTGATAAGGCATATTACGACGATACACGGCGTTGATCCTACTCCTATCATTGACAGCGACAAGTGTCTAAAAACATCTGTGATGCCTCAGAATTGGAATCGAATGGATGTCAGCGTTTACGAGCAGAACGATATAAAGGAACCACCAGAATTTTCGGCGTCACTCGAgACAAATTTGGACAATGATgagaaagatttgaaaaacaacCACGAGAATATAGAAATCGAAACAGTATTTATATGTGAGATTTGTACGAGGGACTTTAACGATCGAGCGTCGTTATGGTTGCACATGCGTGCAACACATAAGGAACTTGCTGCATATGCCTGTGGTGTGTGTTTAAAGATTTGT
TCCGACAATACACAACTCCAGAGTCATCTTTACATGTATCACGGAAAATCCAAGCTTTTAATATCGGAACAAAGAAGGTACAGTTGCACGATATGCGGCAGACAGCATGATTCAAGAAAGAAGCTAATAGCTCATGTCTCGATACACAATATCGATTCTGGGTTTGATCCTGCAATTTTTGTACAGTTAAACAGTAATTATTACAATGAGAACTTAAACGGTACCGAAGGAAATGAACAAGTAATGGATTTCGATGGAGAAGACGGCGAGAAAGTCGATTGTTACATTTGTTACAAATCTTTTCCAAACGAGGATCATCTTATACGACATCAGAGAAATGCGCATaaaTCCGAACAAATAATTCCGTTAGGAGATGCCGCGGCAAGTGGAAACGCTCCGAATATCAATGGCAGCGGTAATAGGGCACAGtatcatttgttttttgtttgtgaaaTTTGCGGTAGTTCTCATTCAAGTAAATGGGAACGCTGGTTGCATATCAACAACATGCATAATAACGAATCTTCCATCAAGTGCGAATGGGAAAACTGTGGGAAAATATTCGCGACGAAATCACTGCGTAATGACCATCTCCAGCATCATTTGATCCAAGGCCCGTCACCAAACACCTGTGAGATATGCGGTAAATTGTGGCCTACTCGTGTCGATTACTGGAAACACGTGATGGGTGTTCACGCGGACACGGTGCCCCTGATCTGCGGCGTTTGTCTGAAAGTATTTTCCGATGTGATGCAGTTAAGTGCTCACGTCAAGGCAAAACATTGGCCACTCACCAATGGTGATTTTAGCTGTGATATTTGCGGTAGACCATATTCCAATAAATCCAAGATGTCCCGACATAGAAAGATCCACGGTTTGGAAATggcagcagcggcagcggcAGCGATGATTGCGGATGTCGCGTGTGATAATAGCAATCTCAATGAGACAACCAACGAATCGATAAAACTCGAACACGGCAACAACAGGGCCGTagatttcaaattgaaatgCGAACAATGCCCCGAGCACAAGTTTACGACTCTGGACATTTTACGCAATCATCGTCGGGTAGTGCACAATCTCTTCCCGTGTGATCTATGCGTTAAGTATTACGGTAGGACATCTCACTTGTGGAAACACGTGAACAGGGTACATAAGGGTCACGCGGACGTGACTTGTCCATACTGCGCGAAAACGAGCGCGTCGAGAGATCATCTTGCGGCGCACATCGCGAAGATTCACAGATATGTGCCCACGATGAGTGGCAAAGATAATCAGAACTGCGTCGTTTCTAAGTCCTTGAATATGGAAGATGGTGTCCTGCATTACTGCGAGAAATGTAACAAAGGATTTCACAAACGCTATCTGCTCCGACGTCACATGAAGGGCTGTCAAAACTACCGTAAGGATCCTGGAGCATTATTGACCCGCTGCCGAGCGTGCGAGAGGATATTCAAGGATCGTGCAAGTCTACAGAAGCACATCGAGAATCACCACAGTACATATACCTGTCATTTATGTAATGAGACCATTACGTCCAAACTGGGCATTATGACGCACAATCGCGTCAATCATATGGATCACCCGGATCTGACGTGCGATTATCCGAGCTGCAAGAAGCTTTTCCGCACCAAGGAGGATCTCGAGTCTCATCGAAAGGATCACAAATATCACAGTAACCCGAATGTCTGCGATTTTTGCGGCGACACCGTGGAGaacaaattaaagttaaagatGCACGTGCTATCGTTACACCGGAATGAGATCGGTGTATCCTGCGGCGTCTGTCTCATTCCTATGAAGGATCCGAAAGATTTGAAGAAACACGTCGAGGCGGAGCACAGTAGCGTTCTTTCCAATCCGAATACATGTCAAGTATGTGGTAAGCAATATGCATCCAAGTGGAAGGCGTTTGATCACACGAAAAAGTGTCATGGAAAAGTTTTTCTCACGTGCAAACAATGTTTAGCAGTTTTCACAGATGAGAACGCTATACG
CGATCATTACGAACATGTACATAACGTTCCAAAGGATCAGTTAGCCgttttcgaatataaaatggACATCGGTGCGAAGAGGGAAGATTACGAGACACCTGATATCATCGTGAAGGAAGAACCGGATGATCTTGAGTTCGATGAAGAGATGTGCGATGAAAGTTCGAGCGATTCCCGTAAACGCAGACGATCGCCGAACGATACGTATGATTGTGAAATGTGCCCCGAGATCTTTCTCAATTCGGATACACTCGCCAAGCATTATCAGAATGTTCACAACACCGATCCCATCCGTATGTTCAAAAAGTTTAAGAAGTACAGTGGCGACGGCAAACGTAGAatgagaaatagaaataattacgaGTGCAAGAATTGCAAGAAGCAGTTCTCTACCAAAACTTTATTCTGGAATCACATAAACGCGTGCACGCGACGAAACTCGGTATGCAAATTCGACGTTCCGAATAATATCTCGATTCTGGAGTCGCATTTgaagaataataatcagattcAGCGAGAAGAACCGTTATCGCTGATGAACGAATCTAATTTGAACATTCCTGATTTTAACCTATTCGAGGACATCAACATGCAATTGTCAGCCCAGAAACCGGTGCCGAGTCTCATGCCGTTGTCGCAGGTGAAGTCGGCAGGTAATGGCAAGTGCTCGCGCAAAGACTCGCGCAAGGTGTATGACGAATCGACCAATACTGAGTGCACATGCGAGGTCTGCGGCAAACAGTGGCCCGCTAAGAAGCACTTGTGGCAACACTTGATTCGTTTCCATCGTGCCGAAGCGGCCGTTACCTGCGGCGTATGTTTGAAGTTATGCAAATCCTATCAAGATTTAGCCGATCACCTGAAGGCGGAGCACGCCCCTGTTTTGTCACCGGAGGGCAACAACTTCACGTGCAAGACATGCGGCAGATATCACAATGCGAGAAGTAAACTGCTGCTGCACATGAGCATCCATATCGGATACTTCCGATGCGAGAAGTGTCAGCAAGGTTTTGCGAGTGAGGAAAAACTCGGCGAGCACGTGACAAACTGCAACGGCAAGTCGGAATTTGAGGATCATGCAGTAACTGCGGATATTGAAGATAACGCGAAAAATGACAATGATGAAAAGGGCAGTTTAATCGCTGACGAAACGTCGGTCATCGAAGAGGAAGTTGAAGAAGCTTTTGAATCGGAGGGCGAAGGTAGTAGAGGTATgcaaaatgaagaaaacaatAGCGAAGAAGACAATTCTGATAGCGATGACTCGGATAGCGGTAGCAATAGTAGTTCGAGCGAGAACGAAGgcgaagaggaagaagaggaggaagaggaagaagagaacgAAAACGAAAATGAGTCTGATACAAGGACTGAGCCGGACACGAGAACTCCGAGCAGAGCAAGCGGTGACAGTGAGTCGTGTAATTCCGAAAGTGACGAATCGAATATGGACGAAGCAGAAGTGAGCGCGATGCAGAAGAAAGCACCGCGATTGAACGATAACGATAGTTGCACGATACAAGGCGAAGAGAACTCGCCGGCGATGGAAAAGTACGTCGAAGATCAAAGCACCGTTTCCGTTATCGCAACGATTGGTGGAGATCGAATTAAACAGAGTACCTTGAATAATCTAGTGATTTCCGGTGCATCTGCGCACGTAGATAAATTTAAGGCATTACGTCTCCAGGAATCTATTACAGCTACGGCGAGTGATGTAGACTTCtctaatgataatgaaaatgataatgaggaagatgatgaaaataatgaaaatgaagaaaatgaagaaaaagagggagatgACGGTGAGGATGAAGGCGAAGATGAAGGTGAAGGTGAAGGTGAAGGTGAAGGTGAAGGTGAAGGTGAAGGTGAAGGTGAAAGtggagatgaagatgaagatgaagatgaagatgaaagtGAAGGTGAGGGTACAAGCGAGGCCGAAGCTGAGAGTGGACAAGATGAGGGTGAAGACGACGCTGATGGTGAAggggaag
aagaggaagaggaggaaaatGATGTGGACGACGACGATGGACCGCCTGTATTGAGTCCGATAATGCCTTTGCTGACTGAAAATGAATCCGAGGAGCACAGCAATACGATGGATCGTACAAGGCATAAGCTTAGTCCGATGGTATCGCTGAGTATGGACAAATTAATAGAGGGATGCCAGATAACGgagatcaaaaataatacgGAGAATACGACGGCACTCTCGAATGCCTCCAACTTCTTTGCAGCTAATAACAACGATGTAGCCGTGACGTGGGACGAGGATGAGGAACGCGATTGTAATTCCGATGTCGGGGACAGAGATATGATGGTAATGAAAAACGAGGAATTTGATAAGGAGTATGTTAAGAGGAATATCAATGATTTGGAAGGGGACGACGATTATGAGGAAGATTCCGCGGACGAAAATGTGATGGACGATAGAGATGGTGATGGTGGGGATCAAGTGCACGAAATACATAGTCTAGACGGGACAGTGTTAATGATGACTAATGATGCGGAAGGTAATCCGATTTTGATAGAACATAATGTGTTAGATATCGATAACGAAGACTCTAACGCCGAGGTGGCGCAGTATATTTATCCAGAGAATGCTTATGAGATTGAGGAAGAGGATGAGGAAGATTTTGCGAGTCGAAATGAAACCGACGCCATGCAGACGGGTGAGATACAAGGTATGTCTTACGTTCAGGATATGTCAGAGAACGAGGATAGTACGGGGGATGATGTAGAGGAGAATAGTAATGATGCCCAGAAATAG
- Protein Sequence
- MATISTTMTMTTTTTTMTTTTTRTMLTSTSTFAATTKDTMAMPMERRKSYHDLCRLCASYDNVSMHIFGQEGRNRQLVDKIQTCLPFKIEEDDRLPKVLCYRCMYNLENFYDFRTACVNASAWLERNRPKEGANDDGANNSAQCSDMHAELLKGKKNMPILIPEAPVVNPNAALGTPPRLNSDGEADPEIEEILDASEGTDEVLDDSTEDRRSEYEYEMDMETNPSDFLEMTPMVTEENEEECGANNTSATAGQQEAATVFPPTSQQHEVYVCSLCNKAFSSKGHLSLHARIHVGEGDVIGERVITDDHTSYQRPYQCDLCHKSYSTAKHRWGHVSTTHRGHPAVTCAYCSRIYSTRYNLDEHIKSRHAGLPPPPELSVSLSRAETRYQCQTCPMIYKDLADFNAHRQICIEEQRTDLLGQTDTQNNKNLIDMSDMSSIIDSDDENKDFRSAEAKLAKNPQLTILKQALTKGDSLKRIYDDEGSTSSCKARKMVKIEGETNPEIKRWYCEYCPQSFTSVDSLKLHEVRHDAEKPFICVLCKKDFVLKSSLIRHITTIHGVDPTPIIDSDKCLKTSVMPQNWNRMDVSVYEQNDIKEPPEFSASLETNLDNDEKDLKNNHENIEIETVFICEICTRDFNDRASLWLHMRATHKELAAYACGVCLKICSDNTQLQSHLYMYHGKSKLLISEQRRYSCTICGRQHDSRKKLIAHVSIHNIDSGFDPAIFVQLNSNYYNENLNGTEGNEQVMDFDGEDGEKVDCYICYKSFPNEDHLIRHQRNAHKSEQIIPLGDAAASGNAPNINGSGNRAQYHLFFVCEICGSSHSSKWERWLHINNMHNNESSIKCEWENCGKIFATKSLRNDHLQHHLIQGPSPNTCEICGKLWPTRVDYWKHVMGVHADTVPLICGVCLKVFSDVMQLSAHVKAKHWPLTNGDFSCDICGRPYSNKSKMSRHRKIHGLEMAAAAAAAMIADVACDNSNLNETTNESIKLEHGNNRAVDFKLKCEQCPEHKFTTLDILRNHRRVVHNLFPCDLCVKYYGRTSHLWKHVNRVHKGHADVTCPYCAKTSASRDHLAAHIAKIHRYVPTMSGKDNQNCVVSKSLNMEDGVLHYCEKCNKGFHKRYLLRRHMKGCQNYRKDPGALLTRCRACERIFKDRASLQKHIENHHSTYTCHLCNETITSKLGIMTHNRVNHMDHPDLTCDYPSCKKLFRTKEDLESHRKDHKYHSNPNVCDFCGDTVENKLKLKMHVLSLHRNEIGVSCGVCLIPMKDPKDLKKHVEAEHSSVLSNPNTCQVCGKQYASKWKAFDHTKKCHGKVFLTCKQCLAVFTDENAIRDHYEHVHNVPKDQLAVFEYKMDIGAKREDYETPDIIVKEEPDDLEFDEEMCDESSSDSRKRRRSPNDTYDCEMCPEIFLNSDTLAKHYQNVHNTDPIRMFKKFKKYSGDGKRRMRNRNNYECKNCKKQFSTKTLFWNHINACTRRNSVCKFDVPNNISILESHLKNNNQIQREEPLSLMNESNLNIPDFNLFEDINMQLSAQKPVPSLMPLSQVKSAGNGKCSRKDSRKVYDESTNTECTCEVCGKQWPAKKHLWQHLIRFHRAEAAVTCGVCLKLCKSYQDLADHLKAEHAPVLSPEGNNFTCKTCGRYHNARSKLLLHMSIHIGYFRCEKCQQGFASEEKLGEHVTNCNGKSEFEDHAVTADIEDNAKNDNDEKGSLIADETSVIEEEVEEAFESEGEGSRGMQNEENNSEEDNSDSDDSDSGSNSSSSENEGEEEEEEEEEEENENENESDTRTEPDTRTPSRASGDSESCNSESDESNMDEAEVSAMQKKAPRLNDNDSCTIQGEENSPAMEKYVEDQSTVSVIATIGGDRIKQSTLNNLVISGASAHVDKFKALRLQESITATASDVDFSNDNENDNEEDDENNENEENEEKEGDDGEDEGEDEGEGEGEGEGEGEGEGEGESGDEDEDEDEDESEGEGTSEAEAESGQDEGEDDADGEG
EEEEEEENDVDDDDGPPVLSPIMPLLTENESEEHSNTMDRTRHKLSPMVSLSMDKLIEGCQITEIKNNTENTTALSNASNFFAANNNDVAVTWDEDEERDCNSDVGDRDMMVMKNEEFDKEYVKRNINDLEGDDDYEEDSADENVMDDRDGDGGDQVHEIHSLDGTVLMMTNDAEGNPILIEHNVLDIDNEDSNAEVAQYIYPENAYEIEEEDEEDFASRNETDAMQTGEIQGMSYVQDMSENEDSTGDDVEENSNDAQK
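The coding and protein sequences above can be checked against each other by translating the coding sequence. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the record itself does not state which table was used. The helper names `CODON_TABLE` and `translate` are hypothetical. Mixed-case input is upper-cased and whitespace is removed before translation, so a sequence split across several lines can be passed in as-is.

```python
# Standard genetic code (assumed: NCBI translation table 1),
# built from the conventional T, C, A, G ordering of bases.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is dropped and case is ignored. Codons containing
    characters other than A, C, G, T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 15 nucleotides of the coding sequence in this record.
print(translate("ATGGCGACGAtatcgacgacgatgacgatg"))
print(translate("GATGCCCAGAAATAG"))
```

Under the stated assumption, the two excerpts translate to the first ten and last four residues of the protein sequence listed in this record.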
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00869056;
- 90% Identity
- iTF_00280104;
- 80% Identity
- -