I am currently working at a company where we're in the phase of choosing our new analytics tool. In this process we're (obviously) also comparing tools with each other and we're currently looking into "out-of-the-box" data availability.
This means that we're looking at how many rows are directly (not through an API) available in:
a normal report (as far as I know this is 50k, but I would like to know whether this is the right amount).
a data export/extract (scheduled or directly requested) (afaik: 50k)
a data warehouse export (The adobe help pages state this is unlimited and as I can recall this is correct but I would really like to know how many rows DWH can export from the interface).
a workspace report (afaik: 200 rows per breakdown).
Unique values (cardinality) (afaik: 500k, but could this be extended?)
This can be changed by contacting Client Care at email@example.com. It is done using one of the back end tools.
However, please be advised that the limit of 500,000 is designed to optimize the reporting performance. Increasing the limit can impact reporting performance especially when using breakdowns, where the report can take time to fetch.
But after digging deeper into the (pdf) API documentation I found an additional parameter for the element of a report to describe the number of rows returned for an element. This parameter is called top:
reportDefinitionElement --> top
Using this I am able to bump up the number of returned rows:
Having dug into the python libraries code I found that there is no setting of this value at all. This leads me to the believe, that the API has a default value set for top if it is not set in the report definition.