Expand my Community achievements bar.

Check out the November edition of the Analytics Community Lens newsletter to see what's been trending in the last two months!
SOLVED

Clickstream data feed into database

Avatar

Level 1

I am working to bring in our Clickstream data into a local database for deeper analysis correlated with internal data.  Is there a recommended combination of columns to be used as a primary key?  I assume it would be post_visid_high, post_visid_low and one of the time based columns like post_t_time_info.  Or, would date_time be better than post_t_time in this case?  I can't tell what the difference is other than "post_" suggests it would be subject to post processing rules.
 

1 Accepted Solution

Avatar

Correct answer by
Level 5

The key depends of what you have in the other dataser to match it with. You may have a login ID or similar, that are used across platforms?

The visid-fields are the old-school visitor ID's, use the experience cloud ID instead in case you need the visitor ID.

/Løjmann

View solution in original post

3 Replies

Avatar

Correct answer by
Level 5

The key depends of what you have in the other dataser to match it with. You may have a login ID or similar, that are used across platforms?

The visid-fields are the old-school visitor ID's, use the experience cloud ID instead in case you need the visitor ID.

/Løjmann

Avatar

Level 1

Well, my goal is to find a primary key for the hit data.  I need to be able to identify each row uniquely.  Just a user id won't be enough.  Matching to do a join to my other datasets won't be a problem.  But, I need a PK in the table for optimization.

Avatar

Level 5

Got you! I would build a PK out of hitid_high and hitid_low...

/Løjmann