Expand my Community achievements bar.

SOLVED

Does having duplicate records in dataset impact licensing?

Avatar

Level 3

Hell, im wondering if having duplicates in a dataset has any impact on licensing costs. Thanks!

1 Accepted Solution

Avatar

Correct answer by
Employee

Hi @RyanMoravick,

The duplicate records in the dataset make up the Datalake size. This increases the license storage limit. Having said that, the data in the Datalake is compressed into Parquet file format, so the impact will be minimal unless you ingest millions of duplicate rows every day.

 

Regards,

Kumar Saurabh

View solution in original post

1 Reply

Avatar

Correct answer by
Employee

Hi @RyanMoravick,

The duplicate records in the dataset make up the Datalake size. This increases the license storage limit. Having said that, the data in the Datalake is compressed into Parquet file format, so the impact will be minimal unless you ingest millions of duplicate rows every day.

 

Regards,

Kumar Saurabh