Hello everyone !
I have implemented new bucketed dimension using Traffic Variable Classification.
In most cases it works perfect, but some values get into 'Unspecified' in new classified dimension, while I expect them to get in the another value. It isn't a problem with the regular expressions because the same value like '1' can be in the '1 - 10' item and in the Unspecified.
Maybe there are any limits on the count of values in the prop, I have near 8-10 millions Occurances per day.
Please write if you have any ideas on the reasons of that.
Solved! Go to Solution.
Views
Replies
Total Likes
@AMinakov Look back window would be one of the reasons, for this. Below are some points you would need to keep in mind
This is the workaround for those who have more than 10 million classified values for a dimension: You will need to export unclassified values via FTP, in 10-million batches, and manually classify them.
Views
Replies
Total Likes
First question how are you classifying things.
Using classification rule builder or Uploading files to be classified?
Views
Replies
Total Likes
Pablo, thank you for your attention !
I used Classification Rule Builder.
Views
Replies
Total Likes
I am not aware of any limitation by sheer numbers processed.
I suspect the sheer volume you encounter is just having instances/variations that though they seem to should fall into one classification are somehow being processed as unspecified.
Could there be some logic that when so many are sent they are getting corrupted so that some have extra characters in them?
Tough to understand without examples and without seeing classification logic. I do know I have had to re adjust my classification regex from time to time as some new variations can be acting as you describe.
Views
Replies
Total Likes
Do the "Unspecified" ones appear when looking at the current day's data only, or with old data too? If old data, how far back do "Unspecified" appear?
Views
Replies
Total Likes
The current data too. I thought about lookback window too, but the problem most likely caused by the big data, because with the time % of Unspecified decrease.
Views
Replies
Total Likes
@AMinakov Look back window would be one of the reasons, for this. Below are some points you would need to keep in mind
This is the workaround for those who have more than 10 million classified values for a dimension: You will need to export unclassified values via FTP, in 10-million batches, and manually classify them.
Views
Replies
Total Likes