How to classify Chinese/Japanese/Korean/Asian languages properly?
yuhuisg
Community Advisor
November 30, 2022
Solved

For those who import Classifications with Chinese/Japanese/Korean characters in the “Key” column, how do you do so?

This is what I did:

  1. Classifications Importer > Browser Export > select the report suite and data set, configure all settings > use “UTF-8 to UTF-16” encoding > Export File
  2. In Windows, start Excel, create a new file. Data > Get Data > choose the downloaded SiteCatalyst Classifications.tab file > update the classifications (using English) > Save as Unicode Text (.txt)
  3. Classifications Importer > Import File > select the report suite and data set, choose my TXT file from step 2 > Overwrite data on conflicts > Import File
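
One way to see what step 2 actually produced is to look at the first bytes of the saved file. The following is a minimal Python sketch, not part of the original workflow; `detect_bom` and `detect_file_bom` are hypothetical helper names. It assumes that Excel's "Unicode Text" format writes UTF-16 with a byte order mark (BOM), which is its usual behavior on Windows but worth verifying on your own file.

```python
import codecs


def detect_bom(data: bytes) -> str:
    """Return a label for the byte order mark at the start of data, if any."""
    # UTF-32 is checked before UTF-16 because the UTF-32 LE BOM
    # begins with the same two bytes as the UTF-16 LE BOM.
    boms = [
        (codecs.BOM_UTF32_LE, "utf-32-le"),
        (codecs.BOM_UTF32_BE, "utf-32-be"),
        (codecs.BOM_UTF8, "utf-8-sig"),
        (codecs.BOM_UTF16_LE, "utf-16-le"),
        (codecs.BOM_UTF16_BE, "utf-16-be"),
    ]
    for bom, label in boms:
        if data.startswith(bom):
            return label
    return "no BOM"


def detect_file_bom(path: str) -> str:
    """Read the first four bytes of a file and report its BOM."""
    with open(path, "rb") as f:
        return detect_bom(f.read(4))
```

Calling `detect_file_bom` on the TXT file from step 2 shows whether it starts with a BOM. A result of "no BOM" does not prove the file is UTF-8; it only rules out a leading BOM.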

After waiting for 2 days, my Chinese keys are still unclassified 😩

When I export my classifications again, even with “<Default>” encoding, I can see that my Chinese keys have the expected English classifications.

2 replies

Jennifer_Dungan
Community Advisor and Adobe Champion
November 30, 2022

Ouch... I haven't had to deal with those types of characters, but it sounds horrible that the rules aren't working for you.

It sounds like the export is converting to UTF-16 but when the rules are re-imported, they go back to UTF-8?
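
To illustrate why an encoding mismatch could matter here, the sketch below shows that the same key is stored as different bytes in UTF-8 and UTF-16. This is only an illustration in Python: `encode_key` is a hypothetical helper, and whether the importer compares keys in this way is an assumption, not something confirmed in this thread.

```python
def encode_key(key: str) -> dict:
    """Return the raw bytes of a classification key under two encodings."""
    return {
        "utf-8": key.encode("utf-8"),
        "utf-16-le": key.encode("utf-16-le"),
    }


encoded = encode_key("中文")

# Print the bytes of each encoding as hex so they can be compared by eye.
print(encoded["utf-8"].hex(" "))
print(encoded["utf-16-le"].hex(" "))

# Reading UTF-16 bytes as if they were UTF-8 does not give back the key.
misread = encoded["utf-16-le"].decode("utf-8", errors="replace")
print(misread == "中文")
```

If a system expects UTF-8 and receives UTF-16, a key like this would no longer match the value collected in the reports, even though it looks identical in a spreadsheet.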

Have you tried getting client care to open a ticket with the engineering team? This sounds like it might be a rather large issue or a lack of support for Asian customers.

elBKL (accepted solution)
Adobe Employee
December 2, 2022

Hello, the file needs to be encoded as UTF-8 with no BOM.

https://experienceleague.adobe.com/docs/analytics/components/classifications/classifications-importer/c-saint-data-files.html

I can confirm this as I have worked with Korean customers.
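
For anyone who has already saved the file from Excel as Unicode Text, here is a minimal Python sketch of re-encoding it as UTF-8 without a BOM before importing. The function name `convert_to_utf8_no_bom` and the file names in the usage note are hypothetical, and the sketch assumes the source file is UTF-16 (pass a different `src_encoding` if yours is not).

```python
def convert_to_utf8_no_bom(src_path: str, dst_path: str,
                           src_encoding: str = "utf-16") -> None:
    """Re-encode a text file as UTF-8 without a byte order mark."""
    # newline="" keeps the original line endings untouched.
    # The "utf-16" codec uses the BOM to pick the byte order and drops it.
    with open(src_path, "r", encoding=src_encoding, newline="") as src:
        text = src.read()

    # Remove a leading BOM character if the decoder left one in place.
    if text.startswith("\ufeff"):
        text = text[1:]

    # Plain "utf-8" (not "utf-8-sig") writes no BOM.
    with open(dst_path, "w", encoding="utf-8", newline="") as dst:
        dst.write(text)
```

For example, `convert_to_utf8_no_bom("classifications.txt", "classifications_utf8.tab")` would produce a file that can then be checked in a text editor that displays the encoding before it is uploaded.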

Jennifer_Dungan
Community Advisor and Adobe Champion
December 2, 2022

Always good to have someone with real-world experience! 🙂