Hi Everyone,
I would like to check if someone has come across this type of request for data nalysis with Adobe data feed.To achieve this i am trying to build a data pipeline. i have a medallion architecture in mind. And the goal is to have one table at the visit level and aggregated tables as per business requirement. Please share your thoughts/challenges/experience if you have any regarding this.
Thanks
Solved! Go to Solution.
Views
Replies
Total Likes
Hi @SasikalaEs ,
Key Design Considerations:
Data Ingestion
Suggested Tools:
Data Lakehouse: Databricks (Delta Lake), Snowflake, BigQuery
Orchestration: Airflow, Azure Data Factory, dbt
Storage: S3 / ADLS Gen2 (Bronze/Silver/Gold folders)
Analytics: Power BI, Tableau, Looker
Schema Evolution: Apache Iceberg or Delta for handling schema changes
Hi @SasikalaEs ,
Key Design Considerations:
Data Ingestion
Suggested Tools:
Data Lakehouse: Databricks (Delta Lake), Snowflake, BigQuery
Orchestration: Airflow, Azure Data Factory, dbt
Storage: S3 / ADLS Gen2 (Bronze/Silver/Gold folders)
Analytics: Power BI, Tableau, Looker
Schema Evolution: Apache Iceberg or Delta for handling schema changes
One more thing to keep in mind.
Rad Data feeds have every row of data collected... including rows that have been excluded (bots, internal traffic, malformed data, etc).
When processing your raw data, don't forget to check the exclude_hit
and make sure that you don't include these rows, or your data will be inflated.
Also, make sure you are using the "post" version of the data where ever possible.. this is the post-processed version of the data (so your processing rules, vista rules, etc).
Views
Likes
Replies