The third result (and several others on the first page of results) links all the way down to the component level:
http://www.rand.org/content/rand/blog/jcr:content/par/bloglist.ajax.topic.postsecondary-education-pr...
We would expect this to link to the actual site page:
http://www.rand.org/blog.html
We've thought of two possible solutions, neither of them good:
1. Add an Apache-level rewrite that sends anything matching .*/jcr:content(/.*) to the parent page. This would almost certainly fix the links from Google, but it would break site functionality that addresses components directly, such as AJAX calls (pagination) or alternate page renderings (XML).
2. Add the affected pages to robots.txt. This is even worse: we would not get indexed at all for those URLs, and the list would likely be impossible to keep up to date.
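
To make the first option concrete, here is a minimal Python sketch of the mapping such a rewrite would perform. It is an illustration only, not Apache configuration: the function name, the example.com URLs, and the assumption that a page is served at its path plus ".html" are all hypothetical. It also shows the drawback: the mapping cannot tell a crawler following a stale link from the site's own AJAX request for the same component.

```python
import re
from urllib.parse import urlsplit, urlunsplit

# Matches any path that reaches into a page's jcr:content node.
# "page" captures everything before "/jcr:content".
COMPONENT_PATH = re.compile(r"^(?P<page>.*)/jcr:content(?:/.*)?$")


def parent_page_url(url: str, extension: str = ".html") -> str:
    """Map a component-level URL to its parent page URL.

    Assumption (not from the original post): the parent page is served
    at the page path plus ``extension``. URLs that do not address a
    component are returned unchanged.
    """
    parts = urlsplit(url)
    match = COMPONENT_PATH.match(parts.path)
    if match is None:
        return url
    # Drop the query string and fragment; they belong to the component request.
    return urlunsplit(
        (parts.scheme, parts.netloc, match.group("page") + extension, "", "")
    )


if __name__ == "__main__":
    # Hypothetical component URL, shaped like the one Google indexed.
    component = "http://www.example.com/content/site/blog/jcr:content/par/bloglist.ajax.html"
    print(parent_page_url(component))
    # prints "http://www.example.com/content/site/blog.html"
```

Because the same function would also fire for the pagination and XML requests, a blanket rewrite of this shape needs some way to exempt the site's own calls, which is the part we have not figured out.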
Any good strategies out there to force Google to index at the page level *only*?