I am working on an Adobe Guides set-up project and I have not been able to find any documentation about how to import an MS Word document and have it be ingested and converted to a working DITA file with Adobe Guides. Is there any documentation posted about this? It is touted as a feature of the application, but there does not appear to be any information about it, nor does it "just work" with the platform.
Thank you for any insight!
Solved! Go to Solution.
Views
Replies
Total Likes
AEM Guides allows you to migrate your existing Word documents (.docx) into DITA topic type docu- ments. You need to specify the input and output folder locations along with other parameters and the document gets converted into DITA document. Depending on the content, you could have a .dita file and a .ditamap file.
To be able to convert a Word document successfully, your document should be well structured. For example, your document should have a Title, followed by Heading 1, Heading 2, and so on. Each of the headings should have some content in it. If your document is not well structured, the process might not work as expected.
By default, AEM Guides uses the Word-to-DITA (Word2DITA) transformation framework. This transfor- mation depends on the style-to-tag mapping configuration file.
To be able to use the Word2DITA trans- formation successfully, you must consider the following guidelines for preparing your Word document for conversion:
Perform the following steps to convert your existing Word documents into DITA topic type document:
– In the inputDir element, specify the location of the input folder wherein your source Word documents are available. For example, if your Word documents are stored in a folder named wordtodita in projects folder, then specify the location as: /content/dam/projects/wordtodita/ – In theoutputDir element, specify the location of the output folder or keep the default output location to save the converted DITA document. If the specified output folder does not exist on DAM, then the conversion workflow creates the output folder. – For the createRev element, specify whether a new version of the converted DITA topic is to be created (true) or not (false). – In the s2tMap element, specify the location of the map file that contains mappings for Word document styles to DITA elements. The default mapping is stored in the file located at: /libs/fmdita/word2dita/word-builtin-styles-style2tagmap.xml NOTE: For more information about the structure of word-builtin-styles-style2tagmap.xml file and how you can customize it, see Style to Tag Mapping in DITA For Publishers User Guide. – In the props2Propagate element, specify the properties that should be passed on to the DITA map. This property is required to pass on the default metadata like dc:title,dc:subject,dam:keywords,dam:category from document metadata to converted DITA assets.
Using the <config> </config> block, you can define one or multiple blocks of configurations for conversion. The conversion workflow gets executed and the final output in the form of a DITA topic is saved in the location specified in the <outputDir> element.
AEM Guides allows you to migrate your existing Word documents (.docx) into DITA topic type docu- ments. You need to specify the input and output folder locations along with other parameters and the document gets converted into DITA document. Depending on the content, you could have a .dita file and a .ditamap file.
To be able to convert a Word document successfully, your document should be well structured. For example, your document should have a Title, followed by Heading 1, Heading 2, and so on. Each of the headings should have some content in it. If your document is not well structured, the process might not work as expected.
By default, AEM Guides uses the Word-to-DITA (Word2DITA) transformation framework. This transfor- mation depends on the style-to-tag mapping configuration file.
To be able to use the Word2DITA trans- formation successfully, you must consider the following guidelines for preparing your Word document for conversion:
Perform the following steps to convert your existing Word documents into DITA topic type document:
– In the inputDir element, specify the location of the input folder wherein your source Word documents are available. For example, if your Word documents are stored in a folder named wordtodita in projects folder, then specify the location as: /content/dam/projects/wordtodita/ – In theoutputDir element, specify the location of the output folder or keep the default output location to save the converted DITA document. If the specified output folder does not exist on DAM, then the conversion workflow creates the output folder. – For the createRev element, specify whether a new version of the converted DITA topic is to be created (true) or not (false). – In the s2tMap element, specify the location of the map file that contains mappings for Word document styles to DITA elements. The default mapping is stored in the file located at: /libs/fmdita/word2dita/word-builtin-styles-style2tagmap.xml NOTE: For more information about the structure of word-builtin-styles-style2tagmap.xml file and how you can customize it, see Style to Tag Mapping in DITA For Publishers User Guide. – In the props2Propagate element, specify the properties that should be passed on to the DITA map. This property is required to pass on the default metadata like dc:title,dc:subject,dam:keywords,dam:category from document metadata to converted DITA assets.
Using the <config> </config> block, you can define one or multiple blocks of configurations for conversion. The conversion workflow gets executed and the final output in the form of a DITA topic is saved in the location specified in the <outputDir> element.
Thank you! I suspected we would need to update the config file to support this, but that is what I was not able to find. I will have our development team take on that action.
Hi vijay,
I tried the mentioned above steps to convert the word doc to dita file, but at the destination i can't able to see any files.
below is log I can see, can you please help me in this
16.06.2023 17:19:31.132 *DEBUG* [JobHandler: /var/workflow/instances/server0/2023-06-16/word2dita_16:/content/dam/test-dita/hello.docx] com.adobe.fmdita.publishworkflow.ConvertWordTODita Adding jcr:content node in folders where it is missing
16.06.2023 17:19:31.225 *INFO* [JobHandler: /var/workflow/instances/server0/2023-06-16/word2dita_16:/content/dam/test-dita/hello.docx] com.adobe.fmdita.conversionutils.ConversionUtils Sending conversion complete event for path /content/dam/test-dita/hello.docx
16.06.2023 17:44:27.868 *ERROR* [[0:0:0:0:0:0:0:1] [1686917667866] POST /bin/referencelistener HTTP/1.1] com.adobe.fmdita.versioncontrol.VersionUtils Index 0 out of bounds for length 0
16.06.2023 17:45:58.127 *ERROR* [JobHandler: /var/workflow/instances/server0/2023-06-16/word2dita_18:/content/dam/test-dita/hello.docx] com.adobe.fmdita.publishworkflow.ConvertWordTODita Error caught : Node with path /content/dam/fmdita-outputs/hello does not exist.
Views
Replies
Total Likes
I hope you have created source and destination folders i.e "word files" and "w2d".
Refer to Migrating Word Documents using AEM Guides
If you are still getting issues, Take the help of Adobe support.
Views
Replies
Total Likes
It worked fine now, the issue is with the word document, as it is not structured as per dita rules.
Views
Replies
Total Likes
Can you give more insight into the problem?
Which element is not structured as per dita rules ?
Views
Replies
Total Likes
Views
Likes
Replies
Views
Likes
Replies