Your achievements

Level 1

0% to

Level 2

Tip /
Sign in

Sign in to Community

to gain points, level up, and earn exciting badges like the new
BedrockMission!

Learn More

View all

Sign in to view all badges

excluding properties from fulltext lucene index searches not working for deep nodes

Avatar

Avatar
Validate 1
Level 4
cqsapientu69896
Level 4

Likes

22 likes

Total Posts

76 posts

Correct Reply

4 solutions
Top badges earned
Validate 1
Give Back 5
Give Back 3
Give Back 10
Give Back
View profile

Avatar
Validate 1
Level 4
cqsapientu69896
Level 4

Likes

22 likes

Total Posts

76 posts

Correct Reply

4 solutions
Top badges earned
Validate 1
Give Back 5
Give Back 3
Give Back 10
Give Back
View profile
cqsapientu69896
Level 4

11-10-2020

I want to exclude certain properties from AEM full text search so that there are no matching results when page author's name is searched

 

for example if I search for Amit - it is providing few pages as results as there are few pages authored by Amit. I don't want these search results

 

I am using the default OOTB cqPageLucene index

 

I already checked the document  - https://jackrabbit.apache.org/oak/docs/query/lucene.html where it is mentioned that to

 

exclude a property we can add index(boolean) false on the property

 

I have added index(boolean) false on

 

jcr:content/cq:lastRolledoutBy 

jcr:content/cq:lastModifiedBy

jcr:content/cq:lastReplicatedBy

 

but the issue is that even below jcr:content node - there are various nodes in a page - a responsive grid inside another responsive grid and when an author drops a component that component always has jcr:createdBy ; jcr:lastModifiedBy properties which have the content author's id/name

 

So I am planning to use 

isRegexp 

as mentioned in the doc to write a regex and then set index(boolean) false

 

Has anyone else faced the same issue and can help in excluding these jcr:createdby and jcr:lastModifiedBy properties in deep nodes ? Am I going in the right direction using isRegexp ?

 

If yes  - what can be the right regex to exclude these properties from certain (n) level of nodes?

 

I read this https://experienceleaguecommunities.adobe.com/t5/adobe-experience-manager/indexing-data-indexing-rul... but there is no solution for excluding properties in nested nodes

 

Any oak experts; lucene indexing gurus can help me on this?

 

thanks in advance

fulltext search lucene oak oak:index query string
View Entire Topic

Avatar

Avatar
Validate 1
Level 4
cqsapientu69896
Level 4

Likes

22 likes

Total Posts

76 posts

Correct Reply

4 solutions
Top badges earned
Validate 1
Give Back 5
Give Back 3
Give Back 10
Give Back
View profile

Avatar
Validate 1
Level 4
cqsapientu69896
Level 4

Likes

22 likes

Total Posts

76 posts

Correct Reply

4 solutions
Top badges earned
Validate 1
Give Back 5
Give Back 3
Give Back 10
Give Back
View profile
cqsapientu69896
Level 4

12-10-2020

Thanks @Vijayalakshmi_S for the descriptive answer

 

however it is not correct - I also added a comment to my question yesterday which mentioned that isRegexp does not support child nodes -

 

as it is also mentioned in the document  - https://jackrabbit.apache.org/oak/docs/query/lucene.html 

 

Note that the regular expression doesn’t match intermediate nodes, so, jcr:content/.*/.* would not index all properties for all children of jcr:content. OAK-5187 is an open improvement to track supporting arbitrary intermediate child nodes.

 

I tried adding a node with 

isRegexp  true;

analyzed false;

nodeScopeIndex false

and name as jcr:content/*/*/*/jcr*

 

and it still returned the result with author name (it is not excluding the property)

 

it is the same with these regex

 

jcr:content/*/*/*/jcr.*

 

jcr:content/*/*/*/jcr:lastModifiedBy

 

So it is not working  - and the reason for this is https://issues.apache.org/jira/browse/OAK-5187 

 

can you please let me know if my understanding is correct ? cc @kautuk_sahni