Acoo000141.1
Basic Information
- Insect
- Atta colombica
- Gene Symbol
- -
- Assembly
- GCA_001594045.1
- Location
- NW:939618-954185[-]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 32 5.9e-06 0.00032 20.2 1.0 1 23 252 274 252 274 0.98 2 32 0.0086 0.46 10.3 3.7 1 23 297 320 297 320 0.96 3 32 0.00058 0.031 13.9 0.5 2 23 327 349 326 349 0.96 4 32 0.00061 0.032 13.9 0.1 1 21 369 389 369 390 0.95 5 32 0.012 0.63 9.8 0.1 2 23 488 509 487 509 0.95 6 32 0.0059 0.31 10.8 0.2 1 23 515 538 515 538 0.91 7 32 0.00018 0.0094 15.6 0.2 1 23 609 632 609 632 0.96 8 32 0.064 3.4 7.5 3.1 1 23 638 661 638 661 0.95 9 32 0.11 6.1 6.7 1.5 1 23 674 696 674 696 0.97 10 32 3.3e-05 0.0017 17.9 0.9 3 23 741 762 740 762 0.96 11 32 2.8 1.5e+02 2.4 0.4 1 23 794 817 794 817 0.92 12 32 0.005 0.26 11.0 1.0 2 23 824 847 823 847 0.94 13 32 0.86 45 4.0 0.0 2 23 856 878 856 878 0.90 14 32 0.025 1.3 8.8 0.0 2 23 885 907 884 907 0.93 15 32 8.5e-05 0.0045 16.6 3.2 1 23 915 937 915 937 0.97 16 32 0.033 1.7 8.4 1.3 3 23 975 997 973 997 0.93 17 32 0.0002 0.011 15.4 3.3 1 23 1000 1023 1000 1023 0.97 18 32 0.00065 0.035 13.8 0.1 2 23 1030 1052 1029 1052 0.96 19 32 0.00087 0.046 13.4 3.9 1 21 1079 1099 1079 1100 0.95 20 32 0.0026 0.14 11.9 1.8 2 23 1114 1135 1113 1136 0.93 21 32 0.36 19 5.2 1.1 1 23 1139 1162 1139 1162 0.94 22 32 0.025 1.3 8.8 2.1 2 23 1169 1192 1168 1192 0.90 23 32 2.3 1.2e+02 2.6 0.1 2 23 1200 1222 1199 1222 0.94 24 32 3.1 1.6e+02 2.2 0.0 2 23 1229 1251 1228 1251 0.88 25 32 0.21 11 5.9 2.7 2 23 1260 1282 1260 1282 0.96 26 32 0.66 35 4.3 0.5 2 23 1288 1310 1287 1310 0.91 27 32 0.016 0.84 9.4 0.1 1 23 1372 1395 1372 1395 0.95 28 32 0.00016 0.0086 15.7 1.1 1 21 1423 1443 1423 1444 0.95 29 32 0.3 16 5.4 1.9 2 23 1545 1567 1544 1567 0.94 30 32 0.87 46 3.9 0.4 2 23 1574 1596 1573 1596 0.94 31 32 0.66 35 4.3 2.5 1 23 1607 1629 1607 1629 0.97 32 32 0.024 1.3 8.8 0.2 1 19 1633 1651 1633 1653 0.96
Sequence Information
- Coding Sequence
- ATGGCGACGATATCGACGATCACGACGACAACGTCGATGTCGACGTCAACGACGAGAATCGAGACAGCGATGGCGACTGAGCCGAGAAGGAACTATAACGAACTGTGTCGTTTATGCGCTTCTTATGATGCCGTTAGGATGCACATATTTGGCCAGGAAGGCAAGAATCGTCAACTCGTCGACAAGATTGAAACGTGCTTGCCGTTCAAGATAATGGAAGATGATTATTTGCCAAAAGTCTTGTGTTATCGATGTATGTACAACTTGGAGAATTTCTATGACTTTAGGACTGCGTGTGTTAACGCAGCTGTTTTGTTGGAAAGAAATAGGCCGAAGGAAAGCATGAGCGACGAAGGTGCAAATGACACTGCGCAATCTTGCGAGATGCGCGCGGCGTTtcttaagaaagaaaatatgccAGTACTTATCCCGGAAGCACCCGTAGTCAATCCCAATGCAGCATTGGGTACGCCACCGAGATTAAATTCGGATGGTGAAGCAGATCCCGAGATCGAAGAGATTCTTGACACAAGTGAAGGTACGGATGAAGcCACAATGATCGACGATTCGGAAGATCGGCGATCGGAATATGAGATGGCATATGAAATGGACATGGAGACGAATCCTAGCGACTTCTTGGAAATGACCCCGATGGTAACTGAGGAAAATGAAGAAGAATGCAATACAAATACGAATGCTGCGACTGCTCAAGATGCCACGATTTTTCCACATACGTCACAACAACATGAGGTTTATGTCTGTTCTCTATGCAACAAAGCGTTCAGTTCCAAGGGTCACTTGTCTCTGCACGCAAGAATTCATGTGGGTGAGGGTGATGTGATTGGCGAAAGAGTAATTACCGACGACCATACTTCATACCAACGACCATATCAATGTGATCTTTGTCATAAATCGTATTCTACTGCGAAACATCGCTGGGGGCACGTTTCTACAACGCATCGGGGACATCCTGCAGTAACGTGTGCGTACTGTTCTCGTATATACTCGACGCGATATAATCTCGATGAACATATAAAATCACGACACGCTGGGCTACCACCACCTCCAGAATTATCGGTTCCTCTAACGCGTACGGAAACACGTTACCAGTGCCAAACATGTCCGATGGTGTTTACGGATCTAGCAGATTTCAATGCACATCGTCAGATATGTATCCAGGAACAACGTACAGATTTATTAGGTCAAACTGAAacacagaataataaaattttcgctGATACATCCGATGTCTCGAGTATAGATTTGGATGACGAGAATAAAGACTTTAAGAATGCCGAGGCTAAATTGGCGAAAAATCCACAGTTAACCATATTGAAACAGGCGCTGACTAAAGGAGACAATTTAAAACGGAATTTTGATGATGATGGTTCGATGTCTAGCGGCAAGccgaaaaaaatagttaagtcagaaGAAGGCGAGGCAAATCCTCAGAAGAGATGGTATTGTGAAGCTTGTCCACAAAGTTTTACATCAGTGGAGAGTTTAAAAGAACATGAGATTAGACACGATGCCGAAAAGCCATATATTTGCATACTATGCAACAaagattttgttttgaaatcttCATTAATTAGGCATATTACAGTGTTACATGGTGTTGATCCTACTCCTATCATCGACAGTGATAAGTGTCTAAAGGCAACAGTAATGTCTCAGAATTGGAACGATCGAGTGGACGTCAGCGTTTATGAGCAAAGTGAAATAAAAGACCCACCAGAGCTCTCGTCGTCACCTGagataaatttagagAATGATGACAAGGATTATAAAAACAACCATGAGAATGTAGAAATTGAAACGGTATTTGTGTGTGAGACTTGTAAGCGAGATTTTAACGATCGAGCATCATTATGGCTGCACATACGAGCAATGCACAAGGAATATGCAGCATTTACCTGTGGAGTGTGTTTAAAGATGTGCTTTAATAATACACAACTTCAAAATCATGTCTTCATGTATCACGGAAGATCTAAGCTTTTAATCTCGGAACAAAGAAGGTATAGTTGCACGATATGCGGTAGACAGCATGACTCGAGAAAAAAGCTAATCACTCATGTCTCAATACATAATGTCGATCCTGCCTATGATCCTGGAAGTTTTGTACAATTGaacagtaattattataatgaaaacttGAATGGTAACGAAGGCAATGAACAAGTATTAGATTTTGACGGAGAAGATGGTGAAAAGGTTGATTGttacatttgttataaatctTTTCCGACTGAGGATCATCTTATACGACATCAGAGAAATGCTCACAAgTCTGACCAGATTTCGTTAGGAGATGCTGCGGGTGGTGGAAATACTCTAAGTGTCAATGGCAATGGTAATAGAGCGcagtatcatttattttttgtttgcgaAGTATGTGGTAGTTCACATTCGAGCAAATGGGAACGTTGGTTGCATATCAACAACACACATAACAATGAATCTTCTATTAAATGTGAATTGGAAAATTGCGGAAAGATATTTGCGACGAAATCGTTACGTAATGAACATCTTCAGCATCATGCGATACAAGGACCTTCGCCAAACACCTGCGAGATATGTGGAAAATTGTGGCCTACTCGCGTTGATTATTGGAAACATGTAATGGGCGTACATGCAGACACAGTGCCTCTAATTTGTGGCGTTTGTTTGAAAGTATTCTCCGACGTTATGCAATTGAGCGCACACGTTAAGGCGAAACATTGGCCACTAACCAGTGGTGATTTTTGTTGTGATATTTGCGGTAGACCATATTCcaataaatcgaaaatgtcTCGGCATAGAAAGATCCATGGTTTGGAGGCGGCTATGGACGCTGCATGCGATAATAGCAGTTTTAATGAAACGACCAATGAATCGGTGAAACCTGATCACAGTAATGGCGCTtcagaaatagaattaaattgcgAGCAGTGTCCTGAACTTAGTTTCACGACATTGGATAGTTTGTGTAATCATCGACGGATAACGCATAATCTTTTTCCATGCGATTTGTGTAACAAATGTTATGGTAGAACATCACATTTATGGAAACATGTAAATAGGGTACACAAAGGACACACAGACGTGACCTGTCCTTATTGCGCAAAGACAAGTGCGTCAAGGGATCATCTGGCAGCACATATTGCAAAGATTCATAGATTCATGCCCACAGTAGGTAAGGACAATCAAAACTGCGTCACTTCCAAGTCCTTGAGTGCAGATGATGGTGTCTTgcattattgtgaaaaatgtaacaaaggATTCCATAAACGCTATTTGCTCCGTCGTCATATGAAAGGCTGTCAAAATTATCGTAAAGATCCTGGGGCATTGTTGACTCGTTGTCGGGCCTGTGAAAGGATATTCAAAGATCGCGCGAGTCTGCAGAAACATATTGAGAATCATCATACTACATATACTTGTCATTTATGTAATGAGACGATCACTTCCAAGCTAGGTATCATGACACACAATCGTATCAATCATATGGATCATCCGGATCTGACGTGCGACCACCCGAGCTGTAAAAAACTTTTCCGCACCAAAGAAGATTTGGAATCTCATCGAAAAGATCACAAATATCATAGCAATCTGGACGTTTGTGATTTTTGCGGTGACACCGTGGAAAATAAACTGAAATTGAAGATGCATGTATTATCGTTGCATCGAAACGAGATCGGCGTATCCTGTGGGGTTTGTCTCATTCCGATGAAGGATCCCAAAGATTTGAAGAAACATGTCGAAGCGGTGCATAGCAGCGTTCTCTCTAAACCAAACACGTGTCAGGTATGTGGCAAGCAATATGCATCCAAATGGAAGGCTTTTGATCACACGAAGAAGTGTCATGGTAAAGTTTTCCTTACGTGCAAACAGTGTTTAGCCGTTTTTACAGATGAGAATGATATACGCGATCACTATGAACATGTGCATAACGTTCCAAAGGACCAATTAGCTATTTTCGAATATAGAATGGATATTGGTGCAAAGCGGGAGGGTTACGAGACTCCTGATATCATTGTAAAGGAGGAACCAGATGATTTGGAGTTTGACGAAGAAATGTGTGACGAAAGTTCGAATGATTCTCGCAAACGTAGAAGATCACCAAACGATACGTATGATTGCGAGATGTGTCCTGAGATCTTCCTCAATTCAGACACTCTCGCTAAGCATTATCAGAACGTCCATAATACAGACCCCGTCCGTATGTTCAAGAAATTCAGAAAGGATAATAGCGATGGCAAGCGCAAGATGAGAAATAGAAACAATTTTGAATGCAAGAATTGCAAAAAGCAATTCTCTACTAAGACTCTGTTCTGGAATCACATCAATGTATGTACACGACGAAACTCGATAGGTAGATTTGACATGCCGAATAATGTTCCATCATCAATTCTGGAGTCTCATCTGAAAAACAATAATCAGATTCAACGAGAAGAACCGGTACCGCTTACGAACGAGTCCAATTTGAACATTCccgatttcaatctattcGAAGACATCAATTTGCAATTGTCAGCTCAGAAACCTGTGCCGAATCTTATGCCATTGTCGCAGATGAAAGGGACAGGTAACGGCAAATGCTCGAGAAAAGACTCACGCAAGGTGTATGATGAATCGACTAATACCGAGTGTACGTGCGAAGTTTGTGGTAAACAATGGCCCGCCAAGAAACATTTGTGGCAACATCTAATTCGTTTCCATCGCGCTGAAGCTGCTGTTACGTGTGGCGTATGTTTGAAGCTGTGTAAATCTTACCAAGACTTGGCTGATCACCTGAAGGCTGAGCATGCTCCCGTTTTATCGCCGGAGGGCAATAATTTCACATGTAAGACATGCGGTAGATATCATAACGCGAGAAGTAAATTGCTGTTGCATATGAGCATTCATATCGGAAACTTCCGATGTCAAAAGTGCCAGCAAGGTTTTGCAAGTGAGGAGAAACTCACCGAGCATGCAACGGGTTGCAATGGCAAATCGGAGTTTGAGGATAATACAGTGGCAGATGAagataatgcaaaaaatgatAACGATGAGAAGGGCAGTTTAATCGCCGACGAGACATCAGTCATTGAGGAAACAGAGGAAGCAGAGGAAGCAGATTTTGAATCAGAAGGTGAAGGAAGTAGGGATATTCACGATGGAGACAATTCGGAAAATAGCGAAGAAGACAATTCAGAAAATAGTGATGATTCAGACAGTGATAGTAATAGTAGTTCGAGCGAAGACGAAAACGAAAATGAAGgagaagaagaggaggaggaagagaatGAAAACGAAAATGAATCTGATACGAGAACTACTAGTAGAGCGAGCGGTGATAGTGTATCATGTAATTCCGATAGCGACGATGAATCAGAAGAAACGGAAGTGGACGCAACGGAAAAGAAAGCTGTACAATTGAACGATATTAGATTCAGGATATGTGATGATGGTATTCAGGAGAATATGCCAGAGGAGACAAATATTGAGGATCAGAAAACTGATCTTACATCCATAGCAACAGTTGCAGAACAGACTACatcgaataatttaaatgatttgattTCTAATGTACCTGGAAACATAGGCACCGACAAATTTGAGGAATTTCGTGttgaaaaattacaacaaGCTAAAGCTACGAGCGATGTAGACTCTGATAATGAAGACGATGATGATAACGAGGAAGATGATgaggaaaataatgaaaatgaagaagaggaagaagatgGTGAAGAAGGTGATCAGGATGAAGGTGAAGATGAGGATGAAGGTGAAGCTGAAAGTGAGgctgaaaatgaaaatgaaaatgaaaatgagaatGAAAATGAGAATGAAAATGAGAATGAGAATGAGGATGAGGATGAGAGTGAAGCTGCAGccgaggaagaggaagaggaagaagaagaggaagaagaggaggaTGATGAAGATGACGACGATGGACCGCCTGTGTTAAGTCCAATAATGCCTTTGTTACCAGAAAACGAATCTGAAGAGTACAGCGGTGCGACAGATCGCACGAGACACAAGCTTAGTCCAATGGTTTCGTTGAGTATAGCTAAAGATTTAGAGGAGTGTGAAATAACCGAAATACCAAATGACACGGACAATGTATCAAACACAGTCAACTTCTTCGCggctaataataatgatttgtcTGTAACATGGGACGAGAATTTGGATGGTAACGTAATCAACAATGCCGGTTGCAACTCCGACGCTGgaaacagagagaaaaatgAGGAACTTAATACGGagtacataaaaatagaagTAAATGAAGGCGACTTTGAGGAAGATTCTGTAGATGAGAACTTAGTGGATAACAGAGAAGATGATGGAGACAATCAAGTGCATGAGGTGCATAACCTGGACGGAACTGTGTTAATGGTGACTAATGACGCGGAAggtaatcaaattttgatagaaCAAAATGTGTTAGATATCGATAACGAGGACTCTAATGTTGAAGCGACGCAGTATATTTATCCAGAGAACACTTATGAGATCGAGGAAGAGGATTATCCAACGCGAAACGAAACCGACATCATGCAGACGGACGAGATGCAAGGTAGTATGTCCTATGTTCATGATACATCGGAAAACGAAGATAGTATGCAGAATGATATCGAGGAGAACAGTGATGTGCAAAAACAGTAG
- Protein Sequence
- MATISTITTTTSMSTSTTRIETAMATEPRRNYNELCRLCASYDAVRMHIFGQEGKNRQLVDKIETCLPFKIMEDDYLPKVLCYRCMYNLENFYDFRTACVNAAVLLERNRPKESMSDEGANDTAQSCEMRAAFLKKENMPVLIPEAPVVNPNAALGTPPRLNSDGEADPEIEEILDTSEGTDEATMIDDSEDRRSEYEMAYEMDMETNPSDFLEMTPMVTEENEEECNTNTNAATAQDATIFPHTSQQHEVYVCSLCNKAFSSKGHLSLHARIHVGEGDVIGERVITDDHTSYQRPYQCDLCHKSYSTAKHRWGHVSTTHRGHPAVTCAYCSRIYSTRYNLDEHIKSRHAGLPPPPELSVPLTRTETRYQCQTCPMVFTDLADFNAHRQICIQEQRTDLLGQTETQNNKIFADTSDVSSIDLDDENKDFKNAEAKLAKNPQLTILKQALTKGDNLKRNFDDDGSMSSGKPKKIVKSEEGEANPQKRWYCEACPQSFTSVESLKEHEIRHDAEKPYICILCNKDFVLKSSLIRHITVLHGVDPTPIIDSDKCLKATVMSQNWNDRVDVSVYEQSEIKDPPELSSSPEINLENDDKDYKNNHENVEIETVFVCETCKRDFNDRASLWLHIRAMHKEYAAFTCGVCLKMCFNNTQLQNHVFMYHGRSKLLISEQRRYSCTICGRQHDSRKKLITHVSIHNVDPAYDPGSFVQLNSNYYNENLNGNEGNEQVLDFDGEDGEKVDCYICYKSFPTEDHLIRHQRNAHKSDQISLGDAAGGGNTLSVNGNGNRAQYHLFFVCEVCGSSHSSKWERWLHINNTHNNESSIKCELENCGKIFATKSLRNEHLQHHAIQGPSPNTCEICGKLWPTRVDYWKHVMGVHADTVPLICGVCLKVFSDVMQLSAHVKAKHWPLTSGDFCCDICGRPYSNKSKMSRHRKIHGLEAAMDAACDNSSFNETTNESVKPDHSNGASEIELNCEQCPELSFTTLDSLCNHRRITHNLFPCDLCNKCYGRTSHLWKHVNRVHKGHTDVTCPYCAKTSASRDHLAAHIAKIHRFMPTVGKDNQNCVTSKSLSADDGVLHYCEKCNKGFHKRYLLRRHMKGCQNYRKDPGALLTRCRACERIFKDRASLQKHIENHHTTYTCHLCNETITSKLGIMTHNRINHMDHPDLTCDHPSCKKLFRTKEDLESHRKDHKYHSNLDVCDFCGDTVENKLKLKMHVLSLHRNEIGVSCGVCLIPMKDPKDLKKHVEAVHSSVLSKPNTCQVCGKQYASKWKAFDHTKKCHGKVFLTCKQCLAVFTDENDIRDHYEHVHNVPKDQLAIFEYRMDIGAKREGYETPDIIVKEEPDDLEFDEEMCDESSNDSRKRRRSPNDTYDCEMCPEIFLNSDTLAKHYQNVHNTDPVRMFKKFRKDNSDGKRKMRNRNNFECKNCKKQFSTKTLFWNHINVCTRRNSIGRFDMPNNVPSSILESHLKNNNQIQREEPVPLTNESNLNIPDFNLFEDINLQLSAQKPVPNLMPLSQMKGTGNGKCSRKDSRKVYDESTNTECTCEVCGKQWPAKKHLWQHLIRFHRAEAAVTCGVCLKLCKSYQDLADHLKAEHAPVLSPEGNNFTCKTCGRYHNARSKLLLHMSIHIGNFRCQKCQQGFASEEKLTEHATGCNGKSEFEDNTVADEDNAKNDNDEKGSLIADETSVIEETEEAEEADFESEGEGSRDIHDGDNSENSEEDNSENSDDSDSDSNSSSSEDENENEGEEEEEEENENENESDTRTTSRASGDSVSCNSDSDDESEETEVDATEKKAVQLNDIRFRICDDGIQENMPEETNIEDQKTDLTSIATVAEQTTSNNLNDLISNVPGNIGTDKFEEFRVEKLQQAKATSDVDSDNEDDDDNEEDDEENNENEEEEEDGEEGDQDEGEDEDEGEAESEAENENENENENENENENENENEDEDESEAAAEEEEEEEEEEEEEDDEDDDDGPPVLSPIMPLLPENESEEYSGATDRTRHKLSPMVSLSIAKDLEECEITEIPNDTDNVSNTVNFFAANNNDLSVTWDENLDGNVINNAGCNSDAGNREKNEELNTEYIKIEVNEGDFEEDSVDENLVDNREDDGDNQVHEVHNLDGTVLMVTNDAEGNQILIEQNVLDIDNEDSNVEATQYIYPENTYEIEEEDYPTRNETDIMQTDEMQGSMSYVHDTSENEDSMQNDIEENSDVQKQ
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01476424;
- 90% Identity
- iTF_00015543;
- 80% Identity
- -