Optimizing Retrieving of Nodes from Iterator after Query

Question

Hi everyone,

We are in a situation where we have to perform up to 1000 queries per request, looking for asset nodes. The queries we use are similar to this one:

    SELECT * FROM [dam:Asset] As s WHERE ISDESCENDANTNODE ([/content/dam/msi-dam]) AND s.[jcr:content/metadata/cq:productReference] IN ("/etc/commerce/products/msi/smo/smot-wqsav-hon10mm")

Using an index, the query execution itself seems to be quick (1-2 ms). Here's what the query performance monitor shows for these queries:

    Query execution time: 2 ms
    Get nodes time: 992 ms
    Result node count time: 2421 ms
    Number of nodes in result: 2

When the time comes to work with the nodes returned as the result, things get slow. Here's an example of our code:

    QueryResult result = query.execute();
    @SuppressWarnings("unchecked")
    Iterator<Node> iterator = result.getNodes();
    while (iterator.hasNext()) {
        Node assetNode = iterator.next();
        String path = assetNode.getPath();
        if (!assetsUrls.contains(path)) {
            assetsUrls.add(path);
        }
    }

The two lines of code are what slow down the whole process, to a little more than 1 s for each iteration of this code. This will eventually result in spending around 1200 s on the whole process when we need to run 1000 queries. We understand that there are some limitations when accessing the repository this way, but we are really trying to find a way to optimize the process.

In another piece of functionality we built, we were able to use one long query for all 1000 things we are searching for, but we cannot implement that here unless there is a way to retrieve the number of resulting nodes without getting the nodes themselves. That would be very helpful as well.

So my questions are: is there a way to optimize this whole process, and is there a way to retrieve the number of resulting nodes without accessing them?

We are using AEM 6.3.2.1.

Thank you very much in advance for your help,
Bobby

joerghoh · Accepted Answer

The query engine works lazily: it does not load all results when the query is executed, but only when you explicitly request them (via the iterator). This explains why the nodeIterator.next() call is actually an operation that can take a significant amount of time.
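To make the lazy behaviour concrete, here is a minimal sketch in plain Java. It does not use the JCR API: LazyResults, its loader function, and the example paths are hypothetical stand-ins, assumed only for illustration. The point it shows is that creating the result is cheap and the cost is paid inside each next() call.

```java
import java.util.Iterator;
import java.util.List;
import java.util.function.Function;

// Hypothetical stand-in for a lazy query result; this is NOT the JCR API.
class LazyResults<T> implements Iterator<T> {
    private final Iterator<String> ids;        // cheap part: what the index lookup yields
    private final Function<String, T> loader;  // expensive part: loading one result
    private int loads = 0;                     // counts how often the loader ran

    LazyResults(List<String> ids, Function<String, T> loader) {
        this.ids = ids.iterator();
        this.loader = loader;
    }

    @Override
    public boolean hasNext() {
        return ids.hasNext();
    }

    @Override
    public T next() {
        loads++;                               // the real work happens here, per item
        return loader.apply(ids.next());
    }

    int loadCount() {
        return loads;
    }
}

class LazyResultsDemo {
    public static void main(String[] args) {
        // The path prefix is made up for the example.
        LazyResults<String> results = new LazyResults<>(
                List.of("a", "b"), id -> "/content/dam/example/" + id);

        // Nothing has been loaded at this point.
        System.out.println("loads before iterating: " + results.loadCount());

        while (results.hasNext()) {
            System.out.println(results.next());
        }

        // One load per item that was actually requested.
        System.out.println("loads after iterating: " + results.loadCount());
    }
}
```

In this sketch, timing only the construction of the result would look fast, much like the 2 ms "Query execution time" in the question, while the iteration carries the real cost.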

1000 queries per request will never perform well; you should really change your approach! It looks like your query could quite easily be changed into a traversal, which would probably perform much better.

But even then I would definitely think about the content model. Looking at the query, assets link to products, and it seems to me that you want to display all assets which belong to a product. Why don't you add the references to the product and link all assets from there (effectively reversing the relation)? Or create a folder for each product in DAM and place the related assets there. If you build your content model in a clever way, you can often reduce the amount of searching or querying and replace it with direct lookups at known locations.
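The reversed relation can be sketched in plain Java as follows. This is only an in-memory illustration under stated assumptions: ProductAssetIndex is a hypothetical class, the map stands in for a list of asset references stored on each product, and the asset paths are invented. In a repository, the equivalent would be reading that list at the known product location instead of running a query.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical in-memory model of "product links to its assets".
class ProductAssetIndex {
    // product path -> asset paths; stands in for references stored on the product
    private final Map<String, List<String>> assetsByProduct = new LinkedHashMap<>();

    void link(String productPath, List<String> assetPaths) {
        assetsByProduct.put(productPath, List.copyOf(assetPaths));
    }

    // Direct lookup at a known location: no search involved.
    List<String> assetsFor(String productPath) {
        return assetsByProduct.getOrDefault(productPath, List.of());
    }

    // The count is available without touching the assets themselves.
    int assetCount(String productPath) {
        return assetsFor(productPath).size();
    }
}

class ProductAssetIndexDemo {
    public static void main(String[] args) {
        ProductAssetIndex index = new ProductAssetIndex();
        String product = "/etc/commerce/products/msi/smo/smot-wqsav-hon10mm";

        // Asset paths below are invented for the example.
        index.link(product, List.of(
                "/content/dam/msi-dam/example-front.jpg",
                "/content/dam/msi-dam/example-side.jpg"));

        System.out.println(index.assetCount(product));
        System.out.println(index.assetsFor(product));
    }
}
```

With the relation stored this way, both of the original questions (the asset list and the number of assets per product) become lookups keyed by the product path.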

Jörg

arunpatidar · Answer

Hi,

I am not sure whether the code below will help, but you can try it:

    NodeIterator searchResults = query.execute().getNodes();
    if (searchResults != null) {
        while (searchResults.hasNext()) {
            String path = searchResults.nextNode().getPath();
            if (!assetsUrls.contains(path)) {
                assetsUrls.add(path);
            }
        }
    }
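As a side note on the contains/add pattern used in both snippets: if assetsUrls is a List, each contains call scans the whole list. A minimal sketch, assuming the collection can be a Set instead, is shown below. PathCollector is a hypothetical helper, and this does not remove the cost of loading the nodes; it only simplifies the de-duplication.

```java
import java.util.Iterator;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical helper: collects unique paths while keeping first-seen order.
class PathCollector {
    static Set<String> collect(Iterator<String> paths) {
        Set<String> unique = new LinkedHashSet<>();
        while (paths.hasNext()) {
            unique.add(paths.next()); // add() ignores values that are already present
        }
        return unique;
    }
}

class PathCollectorDemo {
    public static void main(String[] args) {
        // Invented paths, with one duplicate.
        List<String> found = List.of("/dam/a.jpg", "/dam/b.jpg", "/dam/a.jpg");
        System.out.println(PathCollector.collect(found.iterator()));
    }
}
```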
