Record deletion not reflected in Data Lake – Expected behavior or an issue?
Hi Team,
I’m experiencing an issue where records deleted at the application level are not being fully removed from the Data Lake.
Even after I delete a record, the data still appears to persist in the underlying storage. I’m unsure whether this is expected behavior caused by delayed processing (e.g., background cleanup jobs or retention policies) or whether it indicates a problem in the deletion workflow.
Could anyone clarify:
- Does deletion propagate asynchronously to the Data Lake?
- Is there a recommended way to verify that the data has been physically removed from storage?
- Are there configurations that control this behavior?
Any insights or best practices would be greatly appreciated.
Thanks in advance!