Question
LLMs Crawling Origin /content/ URLs: An LLMO Visibility Issue
Crawlers for Large Language Models (LLMs) are crawling and ingesting origin-level URLs such as /content/yoursite, even though those URLs are rewritten or hidden from human users. This is happening especially for home page URLs.
How can we identify where these requests are coming from, and how can we minimize them?
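
For context on the "identify" part, this is a rough sketch of how the access logs could be checked for these hits. It assumes the web server writes the Apache/NGINX "combined" log format. The user-agent tokens listed are an assumed, non-exhaustive sample of AI crawler names, and `origin_hits` is a hypothetical helper name rather than part of any product.

```python
import re
from collections import Counter

# Assumption: a small, non-exhaustive sample of AI crawler user-agent tokens.
AI_BOT_TOKENS = ("GPTBot", "ChatGPT-User", "OAI-SearchBot",
                 "ClaudeBot", "PerplexityBot", "CCBot")

# Assumption: logs use the Apache/NGINX "combined" format.
LOG_LINE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[[^\]]+\] '
    r'"(?P<method>\S+) (?P<path>\S+) [^"]*" '
    r'(?P<status>\d{3}) \S+ '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"'
)


def origin_hits(lines, prefix="/content/"):
    """Count AI-crawler requests to origin paths, keyed by (bot, path, referer)."""
    hits = Counter()
    for line in lines:
        m = LOG_LINE.match(line)
        if not m or not m["path"].startswith(prefix):
            continue  # unparseable line, or not an origin-level path
        agent = m["agent"].lower()
        bot = next((t for t in AI_BOT_TOKENS if t.lower() in agent), None)
        if bot:
            hits[(bot, m["path"], m["referer"])] += 1
    return hits
```

Calling `origin_hits(open("access.log"))` (the file name is a placeholder) gives a count per crawler, path, and referer. The referer column is the part that might show which page or sitemap exposes the /content/ link, although many crawlers send no referer at all.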