Best approach to update a target field referencing a source field
Hi Everyone,
We have PGP-encrypted CSV source files (.csv.pgp) stored in Azure Blob Storage. These files are ingested into the Data Landing Zone using an API-based ingestion process.
Our source and target schemas are different. We would like to reference a field from the source file (e.g., CRMID) and use it to update an existing attribute in the target schema (customer_status). However, the customer_status attribute does not exist in the source schema/source file.
For example: if CRMID = 12345 appears in the source file, then customer_status should be set to "moved" for that record in the target.
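To make the intended rule concrete, here is a minimal sketch in plain Python. It is not Data Prep or Data Distiller syntax; the function name `apply_status_update` and the dictionary shapes are hypothetical and only illustrate the logic we are after.

```python
def apply_status_update(target_profiles, source_rows, new_status="moved"):
    """Return a copy of the target profiles where customer_status is set to
    new_status for every CRMID that appears in the source rows.

    target_profiles: dict keyed by CRMID (hypothetical shape).
    source_rows: list of dicts, each holding a "CRMID" key (hypothetical shape).
    """
    # CRMIDs present in the source file; the source carries no customer_status.
    source_ids = {row["CRMID"] for row in source_rows}

    updated = {}
    for crmid, profile in target_profiles.items():
        profile = dict(profile)  # copy so the input is left untouched
        if crmid in source_ids:
            profile["customer_status"] = new_status
        updated[crmid] = profile
    return updated


target = {
    "12345": {"customer_status": "Active"},
    "67890": {"customer_status": "Active"},
}
source = [{"CRMID": "12345"}]
result = apply_status_update(target, source)
```

In this sketch only the profile whose CRMID is in the source file changes; every other target profile keeps its existing customer_status.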
Could you please advise on the best approach to implement this logic? Should we use Data Prep or Data Distiller after ingestion? Also, what steps or actions should we take to achieve this?
Additional questions:
- CRMID is present in both the source and target schemas. Should this be the primary identifier in both schemas?
- Do we need to add a customer_status field to the source schema, or is that unnecessary?
- If customer_status currently has the value "Active" in the target and is updated to "Moved" based on the source input, the "Moved" value should be retained for only 90 days. What is the recommended way to enforce this?
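
The 90-day requirement in the last point could be sketched as below, again in plain Python rather than any platform syntax. The function name `effective_status`, the idea of storing the date the status was set, and the fallback value are all assumptions made for illustration; what the status should become after 90 days is not something we have decided yet.

```python
from datetime import date, timedelta

RETENTION_DAYS = 90  # "Moved" should be kept for only this many days


def effective_status(status, status_updated_on, today, fallback="Active"):
    """Return the status that should apply on `today`.

    status_updated_on is the date the status was last set (assumed to be
    stored alongside customer_status). fallback is a placeholder for
    whatever value should replace "Moved" once it expires.
    """
    expired = today - status_updated_on > timedelta(days=RETENTION_DAYS)
    if status == "Moved" and expired:
        return fallback
    return status


# 1 January 2024 to 31 March 2024 is exactly 90 days, so "Moved" still holds.
still_moved = effective_status("Moved", date(2024, 1, 1), date(2024, 3, 31))
# One day later the retention window has passed.
after_window = effective_status("Moved", date(2024, 1, 1), date(2024, 4, 1))
```

Whether this check belongs in a scheduled query, in a mapping expression, or in an expiration setting on the data is part of what we are asking.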
Thanks!