Nobody really knows but I am trying to make a decision and opinions will help.
The problem is a real estate site that has the normal search button. However, the result of the click is a listing but there is another search button at the top of the listing page as well as links to the full description but they all have a search button as well.
FIRST QUESTION: My belief is that this is a no-go for most robots because their job is to follow links and, in this case, they would get into a virtual loop. Is this the case or does the robot stop after so many iterations, pull back out and start over?
SECOND QUESTION: On this particular site the link created by the code under the search button contains a long, long string of parameters with one "?" and is .php. Is that bad for robots?
THIRD QUESTION: Let us assume that we don't want any of these pages indexed anyway because they are not good to optimise and we have lots of static pages that are much better. How do we keep the robot out -
The robots.txt will obviously do this in a positive manner.
... but would a "noindex,nofollow" tag in the header keep the robot out of the whole page?
... or would a "nofollow" tag in the search button code prevent the robot from following the link.
Important to bear in mind that the reason for all this is to have a relatively small number of pages which can be perfectly optimised as opposed to literally 1,000's that are out of control.
Additionally, the Google Webmaster "comment" was that -
a) There were 100's of indexed pages with duplicated title and description tags in the header - the pages produced by the search code.
b) There were many pages with the 404 Error: File not found - where the property was deleted from the database.
Can anyone help with this?
Homefinder