Does having duplicate records in dataset impact licensing? | Community
RyanMoravick
Level 4
May 7, 2024
Solved

Hello, I'm wondering whether having duplicates in a dataset has any impact on licensing costs. Thanks!

1 reply

Kumar29917170hcyp (Adobe Employee), Accepted solution
May 7, 2024

Hi @ryanmoravick,

Duplicate records count toward the size of the data lake, so they consume more of your licensed storage limit. That said, data in the data lake is stored in compressed Parquet format, so the impact will be minimal unless you ingest millions of duplicate rows every day.

Regards,

Kumar Saurabh
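
To illustrate the compression point in the answer above, here is a minimal sketch of why repeated rows tend to add little to compressed storage. It is only an approximation: it uses Python's standard-library `zlib` (DEFLATE) as a stand-in, not Parquet or the platform's actual storage layer, and the sample rows and helper names (`make_rows`, `sizes`) are made up for the example. Real ratios depend on the Parquet encoding, the compression codec, and how the duplicates are laid out in the files.

```python
import hashlib
import zlib


def make_rows(n):
    """Build n distinct, hard-to-compress rows (hypothetical sample data)."""
    return [hashlib.sha256(str(i).encode()).hexdigest() + "\n" for i in range(n)]


def sizes(rows):
    """Return (raw_bytes, compressed_bytes) for the rows joined together."""
    raw = "".join(rows).encode()
    return len(raw), len(zlib.compress(raw))


unique_rows = make_rows(1000)
# Each row repeated 10 times back to back, mimicking duplicate ingestion.
duplicated_rows = [row for row in unique_rows for _ in range(10)]

unique_raw, unique_compressed = sizes(unique_rows)
dup_raw, dup_compressed = sizes(duplicated_rows)

print(f"unique:     raw={unique_raw} compressed={unique_compressed}")
print(f"duplicated: raw={dup_raw} compressed={dup_compressed}")
```

In this sketch the raw size grows tenfold while the compressed size grows far less, because each repeat can be encoded as a short back-reference to the earlier copy. Duplicates that are spread far apart in the data may not compress as well, so treat this as a rough intuition rather than a measurement of license usage.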