Adobe Target - Deriving reward probabilities for MAB algorithms

anils5920589
Level 1

12-07-2019

Hi Adobe Team,

I have a question about how Target derives reward probabilities for the multi-armed bandit (MAB) algorithms implemented for Auto Allocate, Auto Target and Automated Personalization activities.
While going through your docs, I found that there are three ways of feeding data into Target:

  • mbox parameters
  • Profile parameters/attributes
  • Server-side APIs for profile updates

MAB algorithms take the reward probability of each experience/variant as an input, and these probabilities change over time as more visitors participate in an activity. Does Target derive the reward probabilities from the data supplied through the methods above?
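
To make the question concrete, here is a minimal sketch of what "a reward probability that changes over time" means: a running estimate that is updated after every visitor. The class `ArmEstimate` and its fields are hypothetical names used only for illustration; they are not part of any Target API, and this is not a description of how Target computes anything internally.

```python
from dataclasses import dataclass


@dataclass
class ArmEstimate:
    """Running reward estimate for one experience (hypothetical helper, not a Target API)."""

    visitors: int = 0
    conversions: int = 0

    def update(self, converted: bool) -> None:
        # Each visitor contributes a reward of 1 (converted) or 0 (did not convert).
        self.visitors += 1
        self.conversions += int(converted)

    @property
    def reward_probability(self) -> float:
        # Empirical conversion rate so far; 0.0 before any visitor has been seen.
        return self.conversions / self.visitors if self.visitors else 0.0


arm = ArmEstimate()
history = []
for converted in [True, False, False, True]:
    arm.update(converted)
    history.append(arm.reward_probability)  # the estimate moves with every visitor
```

The estimate is only as good as the conversion signal that reaches it, which is why the source of that signal (mbox parameters, profile attributes, or server-side updates) matters for the question.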

Accepted Solutions (1)

mikewebguy
Employee

12-07-2019

anils5920589,

I'm not too familiar with reward probabilities or how they are derived. Please share the documentation on reward probability that you were looking at.

Mihnea Docea | Technical Support Consultant | Customer Experience | Adobe | 1 (800) 497-0335

Answers (3)

anils5920589
Level 1

15-07-2019

Hi Mike,

Thanks for the response. I was going through the following content:

1. Methods https://docs.adobe.com/content/help/en/target/using/implement-target/before-implement/methods/method...

2. Data collection for Target's personalization algorithms

3. Personalization Insights reports overview

Additionally, here is some quick context on what reward probabilities are.

For MAB algorithms, suppose I have two variants, A (control) and B (variation).

Reward probabilities for A and B can be derived from visitor interactions, for example from the click-through rate (CTR) on each variant. Say 1000 visitors interact with A and B in a single day, and traffic is split 50/50. Of the 500 visitors who see A, 150 convert; of the 500 who see B, 300 convert. Each conversion counts as a reward (a Boolean 0 or 1). In this case, the reward probability of A is 0.3 (150/500) and that of B is 0.6 (300/500). Of course, these values will change as more visitors interact, as in a typical A/B test activity. These reward probabilities ideally serve as the input data for training the algorithms' models. This example is extremely simple; in practice, many more factors may be involved in determining the reward probability of each experience.
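
The arithmetic in this example can be written out as a short sketch. The first part reproduces the 0.3 and 0.6 figures from the counts above. The second part shows how a bandit could use such counts to pick a variant; Thompson sampling with a Beta(1, 1) prior is used here purely as an assumption, since it is one common MAB strategy and nothing in this thread confirms which strategy Target uses. The function name `thompson_pick` is hypothetical.

```python
import random

# (conversions, visitors) per variant, taken from the example above.
counts = {"A": (150, 500), "B": (300, 500)}

# Empirical reward probability: conversions divided by visitors.
reward_probability = {name: conv / n for name, (conv, n) in counts.items()}


def thompson_pick(counts, rng):
    """Pick a variant by Thompson sampling (an assumed strategy, not Target's documented one).

    Each variant's reward probability is drawn from Beta(1 + conversions,
    1 + non-conversions), and the variant with the highest draw wins.
    """
    draws = {
        name: rng.betavariate(1 + conv, 1 + (n - conv))
        for name, (conv, n) in counts.items()
    }
    return max(draws, key=draws.get)


rng = random.Random(0)  # fixed seed so the sketch is repeatable
picks = [thompson_pick(counts, rng) for _ in range(1000)]
share_b = picks.count("B") / len(picks)  # fraction of traffic the sketch would send to B
```

With 500 visitors per variant and rates this far apart, the sampled values for B are almost always higher than those for A, so nearly all subsequent traffic would go to B. With fewer visitors or closer rates, the split would be less one-sided, which is how this kind of algorithm keeps exploring.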


I hope this gives you some insight into reward probabilities.

Please let me know if you have any other questions.

Thanks.

mikewebguy
Employee

18-07-2019

anils5920589,

Thank you for the information. I'm not too familiar with reward probabilities, so let's see what others have to say.

Mihnea Docea | Technical Support Consultant | Customer Experience | Adobe | 1 (800) 497-0335

anils5920589
Level 1

17-07-2019

Hi Mike,

I was waiting for your reply on this topic. Please let me know if you have any questions.

This would help us better understand how Target uses Analytics data to derive the reward probabilities of experiences.

Thanks.