Expand my Community achievements bar.

Guidelines for the Responsible Use of Generative AI in the Experience Cloud Community.
SOLVED

translation data multiplication

Avatar

Level 4

Hi guys,

I noticed the translation framework will create a new copy of the original content under the target language folder. So my data size increases by whatever size the original content was every time I use the translation framework to add a new language. Even though all languages use the same binary file and it's only the metadata that gets translated, the translation framework makes a new copy of the binary for each of the translated languages.

This is not a problem when the size of the original content is small, but when you are dealing with petabytes, adding a few more petabytes every time a new language is added is expensive.

I know in some other systems, when dealing with multilingual asset metadata, the binary is unique and it's only the metadata that gets translated.

My question to you guys is, what are the reasons why AEM needs/wants to create a copy of the asset binaries every time a new language is added?

Thanks in advance for all your help.

1 Accepted Solution

Avatar

Correct answer by
Level 2

Hi,

Below are the steps confirmed by the Engineering team to manage Asset metadata translation without impacting Repository size.

  1. Setup File Data Store / Amazon S3 Data Store by following the instructions from https://docs.adobe.com/docs/en/aem/6-2/deploy/platform/data-store-config.html and/or https://jackrabbit.apache.org/oak/docs/osgi_config.html
  2. Disable DAM MetaData Writeback workflow
    As the name says, DAM Metadata Writeback workflow writes the metadata back in the binary file. As the metadata will change after translation, writing it back to binary will lead to different binaries for the language copies. Therefore, to use this optimization, one has to Disable DAM MetaData Writeback workflow using “Disable the workflow launchers” option documented at https://docs.adobe.com/docs/en/aem/6-2/administer/content/workflow-offloader.html
    NOTE – The above procedure will turn off XMP metadata write back on asset binaries. Therefore, future metadata changes will no longer be saved into assets. Please evaluate the consequences before disabling the workflow.
  3. Enable Set last modified date workflow
    DAM MetaData Writeback workflow sets asset’s last modified date. As this was disabled in above step, Set last modified date workflow should be enabled to keep last modified date up to date. This is because, Asset workflows and UI expect last modified date to be accurate. Keeping stale date could result in unexpected behavior.
  4. Configure Translation Integration Framework to stop translating asset binaries
    Use “Creating translation integration framework configurations” documented at https://docs.adobe.com/docs/en/aem/6-2/author/assets/managing-assets-touch-ui/multilingual-assets.ht... and uncheck “Translate Assets option” under Assets tab to stop translation of Asset binaries.

 

Sridhar

View solution in original post

2 Replies

Avatar

Level 10

I asked the translation team to take a look at this question.

Avatar

Correct answer by
Level 2

Hi,

Below are the steps confirmed by the Engineering team to manage Asset metadata translation without impacting Repository size.

  1. Setup File Data Store / Amazon S3 Data Store by following the instructions from https://docs.adobe.com/docs/en/aem/6-2/deploy/platform/data-store-config.html and/or https://jackrabbit.apache.org/oak/docs/osgi_config.html
  2. Disable DAM MetaData Writeback workflow
    As the name says, DAM Metadata Writeback workflow writes the metadata back in the binary file. As the metadata will change after translation, writing it back to binary will lead to different binaries for the language copies. Therefore, to use this optimization, one has to Disable DAM MetaData Writeback workflow using “Disable the workflow launchers” option documented at https://docs.adobe.com/docs/en/aem/6-2/administer/content/workflow-offloader.html
    NOTE – The above procedure will turn off XMP metadata write back on asset binaries. Therefore, future metadata changes will no longer be saved into assets. Please evaluate the consequences before disabling the workflow.
  3. Enable Set last modified date workflow
    DAM MetaData Writeback workflow sets asset’s last modified date. As this was disabled in above step, Set last modified date workflow should be enabled to keep last modified date up to date. This is because, Asset workflows and UI expect last modified date to be accurate. Keeping stale date could result in unexpected behavior.
  4. Configure Translation Integration Framework to stop translating asset binaries
    Use “Creating translation integration framework configurations” documented at https://docs.adobe.com/docs/en/aem/6-2/author/assets/managing-assets-touch-ui/multilingual-assets.ht... and uncheck “Translate Assets option” under Assets tab to stop translation of Asset binaries.

 

Sridhar