How to get deduplicated visits within a segment? | Community
Skip to main content
Level 2
February 9, 2024
Solved

How to get deduplicated visits within a segment?

  • February 9, 2024
  • 1 reply
  • 2155 views

Hello,

 

Curious how I could build a segment for undeduplicated (sum of total visits in these pages)/deduplicated visits of the segment 

 

In the segment below, I was able to get the denominator. However when i put grand total, this is still the same value. With this metric im looking to see how many pages are visited within "vision" pages section. 

 

logic: sum of page 1 +2 ...x = deduplicated total visits

 

 

This post is no longer active and is closed to new replies. Need help? Start a new post to ask your question.
Best answer by Jennifer_Dungan

"Page" is essentially a special reserved prop, and either props or eVars (set at a HIT level expiry) could be used. You don't have to leave your eVars with a persisting value... Since props have a max length of 100 and eVars have a max length of 255, I have many eVars set up to track like props, to capture larger value items.... (and also because 75 props isn't enough for us.. I had to spill over into using eVars like props). And for this type of data, i.e. "Page Name" I wouldn't recommend using anything other than a HIT level dimension... 

 

 

No matter what you use, even multiple of the same page within the visit will automatically de-duplicate at the visit level, next sample:

 

Visit 1

  • Page A
  • Page C
  • Page D
  • Page A

 

Visit 2

  • Page B
  • Page E
  • Page B
  • Page A

 

 

    Page Views Visits
Pages   8 2
  Page A 3 2
  Page B 2 1
  Page C 1 1
  Page D 1 1
  Page E 1 1

 

 

So Page A was viewed twice in Visit 1, and once in Visit 2; you can see that all 3 PVs are accounted for, and only the two Visits as a whole show up in your Visits column.

Likewise, Page B was viewed twice in Visit 2, and not at all in Visit 1... both PVs are counted, and only the single Visit is shown in the row for that page.

 

You don't need to sum the Visits... in this sample, if you summed them, you would be ended up with 6 Visits (but in no way did that data come from that many visits).. this would inflate your data, and any division done on that would also be divided by an inflated value.... this seems like a way to build inaccuracies than anything...

 

Even if you were looking at a specific sub-set of pages, adding the individual visits for Page A, Page B and Page C for example could end up being a greater number than your total visits. Is let's say you added these up and every visit had page A and Page B and Page C during the visit..

 

If you added visits to Page A and Visits to Page B and Visits to Page C... you would have three times the value of your total visits, resulting in something like:

300/100 (Visits to A, B and C / Total Visits) < Which I do not think is really what you want.

 

 

Sticking with the A/B/C sample for now, are you trying to get a "Percent of Visits that came to A, B or C"? Or maybe a "Percent of Visits that came to A, B and C"? 

 

Because if you are trying to do that, like my earlier suggestion stated, you can create a segment:

 

 

HIT

      Page equals A

      OR

      Page equals B

      OR

      Page equals C

 

OR

 

VISIT

      Page equals A

      AND

      Page equals B

      AND

      Page equals C

 

 

The second means that all three must occur within the same visit.

 

Then you can just do a calculated metric of:

 

(segment + Visits) / Visits

 

You leave it formatted as decimal to get something like "0.48" or format it as a percent like "48%" so that you can say that 48% of Visits came to A/B/C (whether you are using the OR or AND segments)

 

If you need this by Visitor, are you looking for Visitors that Visited those pages? In the same visit or over multiple visits? So something like "Percent of Visitors that came to A, B or C"? etc.. then you will need some new segments, but the logic for the calculated metric would be similar...

 

Or maybe I am not understanding what you are trying to do.. but summing the columns that will result in inflated Visit numbers does not make sense to me the way you have described it....

1 reply

Jennifer_Dungan
Community Advisor and Adobe Champion
Community Advisor and Adobe Champion
February 9, 2024

You don't need a calculated metric for that...

 

You should be able to create a segment which is:

 

HIT

      Page equals A

      OR

      Page equals B

      OR

      Page equals C

 

Then stack that with Page Views to get all page views, and with Visits to get Visits... 

 

So for example:

 

Visit 1

  • Page A
  • Page C
  • Page D

 

Visit 2

  • Page B
  • Page E

 

Visit 3

  • Page C
  • Page B

 

Visit 4

  • Page D

 

You will get a report like this:

    Page Views Visits Segment A/B/C
Page Views Visits
Pages   8 4 5 3
  Page A 1 1 1 1
  Page B 2 2 2 2
  Page C 2 2 2 2
  Page D 2 2 0 0
  Page E 1 1 0 0

 

You can see that Adobe de-duplicates the Total Visits by the vary nature of the reporting.

Level 2
February 16, 2024

Hi @jennifer_dungan ,

 

I see your point but considering I am using props, I would like this to deduplicate those at a session level and sum visits to each page within the key section pages.

 

Considering we are using props and these do not persist by default, this would give a more accurate representation of all visits to each page within this section in my opinion. Then I would like to divide it by total unique visitors in order to see how consumers are moving within these sections. While I know I can add column sum of each page, some sections have over 300 pages. So I would like to see another more efficient way to build this. 

 

 

Jennifer_Dungan
Community Advisor and Adobe Champion
Community Advisor and Adobe Champion
February 20, 2024

Hello, 

 

Just following up here @jennifer_dungan. Would love to hear any thoughts on the above 🙂


Segments are as complicated as you need them to get the data back that you require... Your definitions as posted here still leave room for interpretation which is why I provided so many options.

 

Why would total visits in key section(deduplicated)/total visits work in this case regarding exposure rate?

You said you wanted to get the the % of visits that hit key sections.. (again I don't know if you definition means ALL pages need exposure or ANY of the pages count)... let's say "ANY" for now.

 

Visit 1

  • Page A  (key section)
  • Page B (key section)
  • Page X

 

Visit 2

  • Page A (key section)
  • Page Z

 

You cannot count Visit 1 Page A as it's own Visit and Visit 1 Page B as it's own Visit (and then of course you have Visit 2)

 

That would be 3/2... you can't say you have 150% Visits that hit key sections.

 

But Visit 1 would count as a Visit (whether it hit 1, 2, 20, 100, etc pages within your "key sections"), and Visit 2 would count as a Visit... in this scenario, all Visits hit key sections, therefore 2/2 = 100%

 

Let's throw a third visit into this:

 

Visit 3

  • Page W

 

Now you have 3 visits, and one of those visit doesn't hit a "key section" at all... if you counted each hit of sections as it's own Visit (non-de-duplicated), then you will see 3/3 = 100% and that isn't the case, Visit 3 clearly didn't hit a key section.

 

 

But using proper de-duplicated Visits, 2/3 will show 67% of Visits that hit Key Sections.