The DW extracts often include a substantial number of records that should have specifically been excluded by the segment applied in the extract.
For example, segment container is Hit, and the condition is that eVarX does not equal "Unspecified" (we have also tried it with the condition eVarX exists - same result). And yet, the extract contains tens of thousands or records where eVarX is blank. Checking these records in Workspace produces "Unspecified" as the value for eVarX .
The impact is that the extracts are oftentimes bloated x10, they take longer, and require additional cleansing. Any suggestions on how to prevent the extracts from picking up these records?
This is good to know, although why wouldn't Adobe filter these records out of the extract is not clear to me.
For example, if I want to extract the transactions per visitor where the visitor is "registered" and not a "guest", I would apply the "not a guest" segment, but I absolutely do need the visitor dimension in the extract, to know the visitor for each transaction.