Hi everyone,
I have a scenario where 10 files are available in an SFTP location, and I’d like to load all of them into Adobe Experience Platform (AEP) at once. However, these files don’t all have the same number of fields — some contain additional or fewer fields than others.
My questions are:
Can all 10 files be ingested into AEP in a single load?
How does AEP handle files with different field counts or mismatched schemas?
Is there a best practice or recommended approach for handling such variations during ingestion?
Thank You
Solved! Go to Solution.
Topics help categorize Community content and increase your ability to discover relevant content.
Views
Replies
Total Likes
@SiddarthK there are 2 ways to resolve it, but first create a master Schema with all the fields included across the 10 files.
1) Single Dataflow:
2) Multiple Dataflows:
@SiddarthK there are 2 ways to resolve it, but first create a master Schema with all the fields included across the 10 files.
1) Single Dataflow:
2) Multiple Dataflows:
Hi @SiddarthK if the answer is helpful, can you mark it as "Correct Answer", it would help wider community members.
Views
Replies
Total Likes
Hi @Devyendar if I select folder the "Next" option is disabled.
I don’t think using multiple dataflows is very practical from a business perspective.
Views
Replies
Total Likes
@SiddarthK as I suggested earlier you can consider single or multiple data flow approach.
Single data flow requires you to standardize the files with all having the same column/fields even if they are empty records. This actually might be more challenging aligning with file creator to add those empty fields, but if it is possible for sure take this approach and this works.
Multiple data flow is least disruptive where you take existing files and building the data flow into 1 standard Schema and dataset (depending on you want all files to go into 1 dataset or multiple). And the multiple data flows are internal to AEP there should not be any business impact or considerations.
You can choose what works best for your requirements.
Generally speaking, it's better to ensure that all of your files have the same format, and only source a dataset from one folder in your cloud storage location.
Different datasets should be sourced from different folders, but each folder should be homogenous in terms of its contents.
Then whenever you decide later to add additional attributes to a dataset (schema), create a new folder and start sourcing your files from there.
Views
Likes
Replies