Expand my Community achievements bar.

Guidelines for the Responsible Use of Generative AI in the Experience Cloud Community.
SOLVED

Facing issue in the Google Index!

Avatar

Level 4

We have a multiple sites in AEM Cloud. One of the sites is taking the content path from other website and indexed in Google. 

Example:-

website https://example.com/content/us/en/demo/sample.html

Here the /content/us/en/demo/sample.html is not the actual path of the example.com.

And it is indexed in Google as well with the wrong path. 

How to resolve this? Kindly let me know if there is any solution.

Topics

Topics help categorize Community content and increase your ability to discover relevant content.

1 Accepted Solution

Avatar

Correct answer by
Community Advisor

@Ameen_Dev Does your website have a sitemap.xml which tells Google what to Index from your website ? Plese check few things 

 

1. Check if you have proper sitemap.xml 

2. Robots.txt  

3. Values in the canonical URL 

 

Please go through the documentation related to indexing and make sure your website has evrything mentioned for picking up the right values 

https://developers.google.com/search/docs/crawling-indexing/sitemaps/overview

 

https://developers.google.com/search/docs/crawling-indexing/canonicalization#:~:text=Consequently%2C....

 

https://ahrefs.com/blog/google-index/

https://developers.google.com/search/docs/crawling-indexing/robots/intro

 

 

 

View solution in original post

4 Replies

Avatar

Correct answer by
Community Advisor

@Ameen_Dev Does your website have a sitemap.xml which tells Google what to Index from your website ? Plese check few things 

 

1. Check if you have proper sitemap.xml 

2. Robots.txt  

3. Values in the canonical URL 

 

Please go through the documentation related to indexing and make sure your website has evrything mentioned for picking up the right values 

https://developers.google.com/search/docs/crawling-indexing/sitemaps/overview

 

https://developers.google.com/search/docs/crawling-indexing/canonicalization#:~:text=Consequently%2C....

 

https://ahrefs.com/blog/google-index/

https://developers.google.com/search/docs/crawling-indexing/robots/intro

 

 

 

Avatar

Community Advisor

@Ameen_Dev , you'd also want to redirect all /content/${my_brand}/* to non /content paths.

Avatar

Community Advisor

Hi, 

Most likely, your sitemaps are misconfigured and need adjustment. Once you've corrected these maps, you'll need to reindex the content in the Google Search Console: https://www.youtube.com/watch?v=_-KnG-EeDn0&ab_channel=CHRISPALMERSEO 

 

Hope this helps



Esteban Bustamante

Avatar

Community Advisor

@Ameen_Dev You may want to look at the https://example.com/robots.txt file as this is most likely the incorrect url is being read by the crawler.