Trying to tie out Adobe Workspace to Downstream Databricks | Community
Skip to main content
skatofiabah
Level 5
December 30, 2024
Question

Trying to tie out Adobe Workspace to Downstream Databricks

  • December 30, 2024
  • 2 replies
  • 1973 views

Hi,

 

I'm trying to tie out UVs, Visits, and Page Views from Adobe Workspace to Downstream Databricks from the adobe daily web traffic table. Here is my code. I don't know why it's not lining up and I cannot seem to find a good answer. I used the documentation to have certain filters. My page views by month start to fly off, along with visits. UVs seem to be fine though. Can anybody help with this? If not, can someone point me to a Downstream analytics consultant?

 

 

 

Thanks!

2 replies

MandyGeorge
Community Advisor and Adobe Champion
Community Advisor and Adobe Champion
December 30, 2024

The first thing that comes to mind for me is whether or not you're using a VRS in workspace? If so, any segments that are applied to the VRS would need to be recreated in code to make the data match. Same for any bot filtering or IP exclusions that are being done to the data. 

For the page views being too high, do you have an event that fires on page views. If so, have you tried counting instances of that event? 

skatofiabah
Level 5
December 30, 2024

Hi @mandygeorge,

 

Yes, so I think my SQL commented out code if I included that would be the mimicking the VRS I think. I commented it out to just compare to the regular report suite and I'm still getting these discrepancies. Also, what bots or IP filters would I need to exclude? My numbers downstream fall short in the more recent months, so filtering would only reduce it. Also, shouldn't Adobe's Out-of-the-Box Page Views metric upstream and downstream still align anyway? If I do do a page event instance, how do I get regex or custom code to only filter that out of event_list? I did find a report suite filter in my code that I added here (username), but I still cannot get a match.

 

I'm close in UVs and visits overall for almost all 2024. I'm only 12 UVs less downstream and about 202 Visits short downstream. However, I am 50,000 Page Views Short in Downstream vs. Workspace. I might be missing more filters potentially? Any help or thoughts?

 

Thanks!

 

skatofiabah
Level 5
December 30, 2024

@mandygeorge might be right then, it might have something to do with the "replication" of your VRS? Maybe all the rules aren't quite right and it causing issues.

 

What happens if you compare data from the full suite to your Databricks data? Does it match better in that scenario?


Hi @jennifer_dungan,

 

My variance is around 0.5% and I mentioned that earlier in the thread when I do the report suite (without VRS). It's just odd that just adding 2 post_ values throws it all off and that they surge in the fall and onward.

Sukrity_Wadhwa
Community Manager
Community Manager
March 3, 2025

Hi @skatofiabah,

Were you able to resolve this query with the help of the provided solutions, or do you still need further assistance? Please let us know. If any of the answers were helpful in moving you closer to a resolution, even partially, we encourage you to mark the one that helped the most as the 'Correct Reply.'
Thank you!

Sukrity Wadhwa