How to classify Chinese/Japanese/Korean/Asian languages properly?

Question

For those who import Classifications with Chinese/Japanese/Korean characters in the “Key” column, how do you do so?
This is what I did:

Classifications Importer > Browser Export > select the report suite and data set, configure all settings > use “UTF-8 to UTF-16" encoding > Export File
In Windows, start Excel, create a new file. Data > Get Data > choose the downloaded SiteCatalyst Classifications.tab file > update the classifications (using English) > Save as Unicode Text (.txt)
Classifications Importer > Import File > select the report suite and data set, choose my TXT file from step 2 > Overwrite data on conflicts > Import File

After waiting for 2 days, my Chinese keys are still unclassified :weary:
When I export my classifications again — even with “<Default>” encoding, I can see that my Chinese keys have the expected English classifications.

elBKL · Accepted Answer

Hello, it needs to be UTF-8 with no BOM character encoding.

https://experienceleague.adobe.com/docs/analytics/components/classifications/classifications-importer/c-saint-data-files.html

I can confirm this as I have worked with Korean customers.

Jennifer_Dungan · Answer

Ouch... I haven't had to deal with those types of characters.. but that sounds horrible that the rules aren't working for you...

It sounds like the export is converting to UTF-16 but when the rules are re-imported, they go back to UTF-8?

Have you tried getting client care to open a ticket with the engineering team? This sounds like it might be a rather large issue / lack of support for the asian customers.....

Sign up

Login with SSO

Login to the community

Login with SSO

Scanning file for viruses.

This file cannot be downloaded