Carc001870.4
Basic Information
- Insect
- Coenonympha arcania
- Gene Symbol
- USP
- Assembly
- GCA_036785405.1
- Location
- CM072056.1:12541688-12567052[+]
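
The Location value packs a sequence accession, a coordinate range and a strand into one string. A minimal sketch of splitting it apart is shown here; the `Location` type and `parse_location` helper are hypothetical names introduced for illustration, and the span length assumes the coordinates are 1-based and inclusive, which the entry itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'seqid:start-end[strand]' string."""
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for the layout seen in this entry: accession, colon, start-end, [+] or [-].
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into its four components."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["seqid"], int(match["start"]), int(match["end"]), match["strand"]
    )


loc = parse_location("CM072056.1:12541688-12567052[+]")
# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = loc.end - loc.start + 1
print(loc, span)
```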
Transcription Factor Domain
- TF Family
- ESR
- Domain
- zf-C4|ESR-like
- PFAM
- AnimalTFDB
- TF Group
- Zinc-Coordinating Group
- Description
- This entry represents CLAVATA3/ESR (CLE)-related protein 14 from Arabidopsis thaliana (CLE14) and similar proteins predominantly found in plants. CLE14 is an extracellular signal peptide that regulates cell fate. It represses root apical meristem maintenance and acts as an elicitor of root meristem differentiation through the CLV2/CRN complex signalling pathway. The protein irreversibly inhibits root growth by reducing cell division rates in the root apical meristem [1, 2].
- Hmmscan Out
-
 #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
 1  19       2.8     1e+04   -3.3   0.0        14      82       428     491       418     493  0.47
 2  19   1.1e-29   3.9e-26   93.2   0.4        55     199       550     710       504     712  0.90
 3  19   5.2e-08   0.00019   22.0   0.1       139     199       700     760       699     762  0.86
 4  19   7.2e-08   0.00026   21.6   0.1       139     197       750     808       749     811  0.85
 5  19   6.3e-08   0.00023   21.8   0.1       140     199       801     860       800     862  0.86
 6  19   5.2e-08   0.00019   22.0   0.1       139     199       850     910       849     912  0.86
 7  19   7.2e-08   0.00026   21.6   0.1       139     197       900     958       899     961  0.85
 8  19   5.2e-08   0.00019   22.0   0.1       139     199       950    1010       949    1012  0.86
 9  19   7.2e-08   0.00027   21.6   0.1       139     197      1000    1058       999    1061  0.85
10  19   5.2e-08   0.00019   22.0   0.1       139     199      1050    1110      1049    1112  0.86
11  19   6.3e-08   0.00023   21.8   0.1       140     199      1101    1160      1100    1162  0.86
12  19   5.2e-08   0.00019   22.0   0.1       139     199      1150    1210      1149    1212  0.86
13  19   5.2e-08   0.00019   22.0   0.1       139     199      1200    1260      1199    1262  0.86
14  19   5.2e-08   0.00019   22.0   0.1       139     199      1250    1310      1249    1312  0.86
15  19   5.2e-08   0.00019   22.0   0.1       139     199      1300    1360      1299    1362  0.86
16  19   7.2e-08   0.00026   21.6   0.1       139     197      1350    1408      1349    1411  0.85
17  19   5.2e-08   0.00019   22.0   0.1       139     199      1400    1460      1399    1462  0.86
18  19   6.4e-08   0.00023   21.7   0.1       140     199      1451    1510      1450    1512  0.86
19  19   1.7e-07   0.00063   20.3   0.0       139     188      1500    1550      1499    1555  0.90
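
The per-domain rows above can be read programmatically. The sketch below assumes the 13-column layout exactly as it appears in this entry (domain number, total, c-Evalue, i-Evalue, score, bias, three from/to coordinate pairs, acc); the `DomainHit` type, the `parse_domain_row` helper and the 0.01 i-Evalue cutoff are illustrative assumptions rather than anything specified by the database. Only three rows are embedded to keep the example short.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """Hypothetical record for one per-domain row of the table."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_domain_row(line: str) -> DomainHit:
    """Parse one whitespace-separated row with exactly 13 fields."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]), float(fields[4]), float(fields[5]),
        *(int(x) for x in fields[6:12]),
        float(fields[12]),
    )


# First three rows copied from the table in this entry.
rows = """\
1 19 2.8 1e+04 -3.3 0.0 14 82 428 491 418 493 0.47
2 19 1.1e-29 3.9e-26 93.2 0.4 55 199 550 710 504 712 0.90
3 19 5.2e-08 0.00019 22.0 0.1 139 199 700 760 699 762 0.86
"""
hits = [parse_domain_row(line) for line in rows.splitlines()]

# Assumed cutoff for illustration only: keep domains with i-Evalue below 0.01.
I_EVALUE_CUTOFF = 0.01
significant = [h for h in hits if h.i_evalue < I_EVALUE_CUTOFF]

for h in significant:
    # Aligned length on the protein, treating ali coordinates as inclusive.
    print(h.index, h.ali_from, h.ali_to, h.ali_to - h.ali_from + 1)
```

With this cutoff the first row (i-Evalue 1e+04) is dropped, while the remaining rows are kept.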
Sequence Information
- Coding Sequence
- ATGAAAATGCTTACTCTGAACTTAAGGAGATTAAAGCTGGAGTACCACAAGGAAGTGTCCTGGGGCCTGTCTTATACCTTCTCTATACATGTGATATTCCAGAACTCGAACATAACACTATCGCTACCTTTGCCGATGACACTGCCATCATCGCTGTGGGGAATACTCATGAAGAGGCAGTGGGAAAGAATGGATTCCAAGAAACAGGAAAAGGAAGAAAGGAAGACAAAGGAGAAGATGGAGAGATATCTTCAATCAAATTGTGGGACCAAACTGGATGACGGGCTGAACTTAGAGGCGGGCTTCATGTCGCCCATGTCGCCGCCGGAGATGAAGCCCGACACGGCCATGCTGGACGGCATGCGGGACGACGCCACGTCTCCGCCGGCGATGAGGAACTACCCGCCGAACCATCCCCTCAGCGGCTCCAAGCACCTCTGCTCCATATGCGGCGACAGGGCGTCGGGGAAGCATTACGGAGTTTATAGGTTAGTCAACATAGAGACGAGCTTCATGTCGCCCATGTCGCCGCCGGAGATGAAGCCCGACACGGCCATGCTGGACGGCATGCGGGACGACGCCACGTCTCCGCCGGCGATGAGGAACTACCCGCCGAACCATCCCCTCAGCGGCTCCAAGCACCTCTGCTCCATATGCGGCGACAGGGCGTCGGGGAAGCATTACGGAGTTTATAGGTTAGTCAACATAGAGACGAGCTTCATGTCGCCCATGTCGCCGCCGGAGATGAAGCCCGACACGGCCATGCTGGACGGCATGCGGGACGACGCCACGTCTCCGCCGGCGATGAGGAACTACCCGCCGAACCATCCCCTCAGCGGCTCCAAGCACCTCTGCTCCATATGCGGCGACAGGGCGTCGGGGAAGCATTACGGAGTTTATAGGTTAGTCAACATAGAGACGAGCTTCATGTCGCCCATGTCGCCGCCGGAGATGAAGCCCGACACGGCCATGCTGGACGGCATGCGGGACGACGCCACGTCTCCGCCGGCGATGAGGAACTACCCGCCGAACCATCCCCTCAGCGGCTCCAAGCACCTCTGCTCCATATGCGGCGACAGGGCGTCGGGGAAGCATTACGGAGTTTATAGTTGCGAAGGCTGCAAGGGTTTCTTCAAGCGGACCGTGAGGAAGGACCTCACGTACGCGTGTCGTGAGGAGAGGAATTGCATAATCGACAAGCGACAGCGGAACCGATGCCAGTACTGTCGATATCAGAAGTGCCTCGCGTGCGGCATGAAGCGCGAGGCCGTGCAGGAGGAgcggcagcgggcggcgcgcggcgcggaggATGCTCATCCCAGCAGTTCTGTTCAGGAGCTGTCGATCGAGCGGCTGCTGGAGATGGAGTCGCTGGTGGCGGACCCCAGCGAGGAGTTCCAGTTCCTGCGCGTGGGGCCCGACAGCAACGTgcccgcgcggtaccgcgcgccCGTCTCCAGCCTGTGCCAGATAGGTAACGTGCCGCGACCTGGAGCGCTTTATGTATCGAAGCAGAGCGAGGAGTTCCAGTTCCTGCGCGTGGGGCCCGACAGCAACATgcccgcgcggtaccgcgcgccCGTCTCCAGCCTGTGCCAGATAGGTAACGTGCCGCGACCTGGAGCGCTTAATGTATCGAAGCAGAGCGAGGAGTTCCAGTTCCTGCGCGTGGGGCCCGACAGCAACGTgcccgcgcggtaccgcgcgccCGTCTCCAGCCTGTGCCAGATAGGCAACAAGCAGATCGCGGCGCTGGTGGTGTGGGCGCGCGACATCCCCCACTTCAGCCAGCTGGAGCTCGACGACCAGGTGGTGCTCATCAAGGCGTCCTGGAACGAGCTGCTGCTCTTCGCCATCGCCTGGCGCTCCATGGAGTACCTGGAAGATGAGCGGGAGAACATGGACGGCACGcgaagcgccgcgccgccgcagctcATGTGTCTAATGCCTGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAG
ATCTTCGACCGCGTGCTGTCGGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTCGCGCTCAAGGCCATCATCCTGCTCAACCCAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCGGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTCGCGCTCAAGGCCATCATCCTGCTCAACCCAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCGGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTCGCGCTCAAGGCCATCATCCTGCTCAACCCAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCGGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTCGCGCTCAAGGCCATCATCCTGCTCAACCCAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCGGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTCGCGCTCAAGGCCATCATCCTGCTCAACCCAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCGGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTCGCGCTCAAGGCCATCATCCTGCTCAACCCAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCGGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTCGCGCTCAAGGCCATCATCCTGCTCAACCCAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCGGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTCGCGCTCAAGGCCATCATCCTGCTCAACCCAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCGGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTCGCGCTCAAGGCCATCATCCTGCTCAACCCAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCGGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTCGCGCTCAAGGCCATCATCCTGCTCAACCCAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCGGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTCGCGCTCAAGGCCATCATCCTGCTCAACCCAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCGGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTCGCGCTCAAGGCCATCATCCTGCTCAACCCAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCGGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTCGCGCTCAAGGCCATCATCCTGCTCAACCCAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCGGAGCTGTCGCTCAAGATGCGCGCGCTGCG
CATGGACCAGGCCGAGTACGTCGCGCTCAAGGCCATCATCCTGCTCAACCCAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCGGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTCGCGCTCAAGGCCATCATCCTGCTCAACCCAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCGGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTCGCGCTCAAGGCCATCATCCTGCTCAACCCAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCGGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTCGCGCTCAAGGCCATCATCCTGCTCAACCCAGGCATGACGCTGCACCGCAACTCGGCGCTGCAGGCGGGCGTGGGGCAGATCTTCGACCGCGTGCTGTCGGAGCTGTCGCTCAAGATGCGCGCGCTGCGCATGGACCAGGCCGAGTACGTCGCGCTCAAGGCCATCATCCTGCTCAACCCAGGTAACACTCGGCAGGCATGA
- Protein Sequence
- MKMLTLNLRRLKLEYHKEVSWGLSYTFSIHVIFQNSNITLSLPLPMTLPSSLWGILMKRQWERMDSKKQEKEERKTKEKMERYLQSNCGTKLDDGLNLEAGFMSPMSPPEMKPDTAMLDGMRDDATSPPAMRNYPPNHPLSGSKHLCSICGDRASGKHYGVYRLVNIETSFMSPMSPPEMKPDTAMLDGMRDDATSPPAMRNYPPNHPLSGSKHLCSICGDRASGKHYGVYRLVNIETSFMSPMSPPEMKPDTAMLDGMRDDATSPPAMRNYPPNHPLSGSKHLCSICGDRASGKHYGVYRLVNIETSFMSPMSPPEMKPDTAMLDGMRDDATSPPAMRNYPPNHPLSGSKHLCSICGDRASGKHYGVYSCEGCKGFFKRTVRKDLTYACREERNCIIDKRQRNRCQYCRYQKCLACGMKREAVQEERQRAARGAEDAHPSSSVQELSIERLLEMESLVADPSEEFQFLRVGPDSNVPARYRAPVSSLCQIGNVPRPGALYVSKQSEEFQFLRVGPDSNMPARYRAPVSSLCQIGNVPRPGALNVSKQSEEFQFLRVGPDSNVPARYRAPVSSLCQIGNKQIAALVVWARDIPHFSQLELDDQVVLIKASWNELLLFAIAWRSMEYLEDERENMDGTRSAAPPQLMCLMPGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIILLNPGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIILLNPGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIILLNPGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIILLNPGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIILLNPGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIILLNPGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIILLNPGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIILLNPGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIILLNPGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIILLNPGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIILLNPGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIILLNPGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIILLNPGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIILLNPGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIILLNPGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIILLNPGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIILLNPGMTLHRNSALQAGVGQIFDRVLSELSLKMRALRMDQAEYVALKAIILLNPGNTRQA
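
As a consistency check between the two sequences above, the coding sequence can be translated and compared with the protein. The sketch below assumes the standard genetic code (NCBI translation table 1) and upper-cases the input first, since the coding sequence contains lower-case stretches; `translate` is a hypothetical helper, and only the first 27 and last 24 nucleotides of the coding sequence are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()  # lower-case (soft-masked) bases are treated like upper-case
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 27 and last 24 nucleotides of the coding sequence in this entry.
cds_start = "ATGAAAATGCTTACTCTGAACTTAAGG"
cds_end = "CCAGGTAACACTCGGCAGGCATGA"

print(translate(cds_start))  # expected to match the protein's first residues
print(translate(cds_end))    # expected to match the protein's last residues
```

The two fragments translate to `MKMLTLNLR` and `PGNTRQA`, which agree with the start and end of the protein sequence listed above.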
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00353127; iTF_00353134; iTF_00353244;
- 90% Identity
- iTF_00353127; iTF_00353134; iTF_00353244;
- 80% Identity
- iTF_00353127; iTF_00353134; iTF_00353244;