RTCDP SFTP Source Connector - Ingest only net new data | Community
Skip to main content
RyanMoravick
Level 4
April 26, 2024
Solved

RTCDP SFTP Source Connector - Ingest only net new data

  • April 26, 2024
  • 1 reply
  • 2521 views

Hello, I am wondering if there is a setting in the SFTP source connector to enable only net new records to be ingested. Im noticing that every time the data flows runs, it ingests the old records as well as any new records, which will inevitably lead to millions of records to be ingested over the long run. Is this a setting in RTCDP or the source (SFMC).

This post is no longer active and is closed to new replies. Need help? Start a new post to ask your question.
Best answer by Kumar29917170hcyp

It is overwriting the current data in the data extension and then being extracted to the same file on the SFTP location. 


Hi @ryanmoravick,

 

In the process of overwriting the data, the last modified time of the file changes and the system considers it as a new file.

The system is designed in such a way that it checks the last execution time and any file placed after its last execution time will be picked.

 

To overcome this problem, limit your export to run once daily.

If you cannot limit the writing process, write the file to a staging folder during the entire day and copy the file to the scheduled folder once daily.

 

Regards,
Kumar Saurabh

1 reply

brekrut
Adobe Employee
Adobe Employee
April 29, 2024

Hello @ryanmoravick 

 

The SFTP source connector can ingest data from a file location and will process records based upon the timestamp of the file.  If you create a dataflow which is mapped to a folder only new files will be ingested by the data flow into the Adobe RTCDP platform.

 

I can see you have indicated you are using SFTP and ingesting from SFMC.  Just to confirm are you exporting from SFMC into an SFTP location and then ingesting to Adobe RTCDP?

RyanMoravick
Level 4
April 29, 2024

Hey @brekrut correct I am exporting from SFMC into the SFTP location and ingesting to Adobe RTCDP. For additional context, the screenshot is showing ingestion of email performance data from SFMC into Adobe CDP. You can see every hour it is ingesting the same 38 profiles. Ideally I would only want the data flow to ingest new records only.

brekrut
Adobe Employee
Adobe Employee
April 29, 2024

I can see based upon the screenshot the dataflow is running every hour and picking up 38 records. With the data which is being exported out of SFMC is this writing to a new file or updating an existing file on the SFTP location?