+ Reply to Thread
Results 1 to 4 of 4

Thread: Google not accepting robots.txt rules

  1. #1
    Cormac's Avatar
    Cormac is offline Cormac Moylan Cormac has much to be proud of Cormac has much to be proud of Cormac has much to be proud of Cormac has much to be proud of Cormac has much to be proud of Cormac has much to be proud of Cormac has much to be proud of Cormac has much to be proud of Cormac has much to be proud of
    Join Date
    Jan 2006
    Location
    Cork
    Posts
    1,260

    Default Google not accepting robots.txt rules

    One of my sites has 141 pages indexed in Google in little over a fortnight. The site uses a shopping cart application which hooks up to Amazon and displays Amazon listings.

    The shopping cart is powered by associate-o-matic which brings down a LOT of content from Amazon. I was concerned about duplicated content so I setup some modrewrite rules and I restricted indexing (robots.txt) of all URLS which contain a query string.

    I tested this robots.txt file against a number of pages from my site via the Google Webmaster Console. Each and every time the robots.txt analyzer said that the page are restricted.

    I permitted the inclusion of 14 entry pages via robots.txt and via a sitemap.xml file. These 14 entry pages are the only ones indexed in Yahoo.com. Yahoo has prevented indexing of the duplicated content (as it should do, well done Yahoo).

    Google on the other hand has completely ignorned the robots.txt file and has indexed over a 100 pages of duplicate content which I said not to index.

    In the Google Webmaster Console I have an alert stating that approx 250 URLs are restricted by robots.txt. But a lot of those 250 URLs are appearing in Google's index.

    I can't understand why Google is doing this. Yahoo is playing ball and being correct by following my rules but Google is potentially lining me up for possible dup content issues further down the line.

    Has anybody encountered any similar issues to that of mine? I can't disclose the URL at this time as the site is a work in progress.

  2. #2
    glengara is offline Wannabe Geek glengara is a splendid one to behold glengara is a splendid one to behold glengara is a splendid one to behold glengara is a splendid one to behold glengara is a splendid one to behold glengara is a splendid one to behold
    Join Date
    May 2006
    Posts
    427

    Default

    You might find something on it here -

    robots.txt:

  3. #3
    paul's Avatar
    paul is offline ninja SEO paul has much to be proud of paul has much to be proud of paul has much to be proud of paul has much to be proud of paul has much to be proud of paul has much to be proud of paul has much to be proud of paul has much to be proud of paul has much to be proud of paul has much to be proud of
    Join Date
    Dec 2006
    Location
    .de
    Posts
    1,277

    Default

    Did you always have a robots.txt present ?

  4. #4
    Cormac's Avatar
    Cormac is offline Cormac Moylan Cormac has much to be proud of Cormac has much to be proud of Cormac has much to be proud of Cormac has much to be proud of Cormac has much to be proud of Cormac has much to be proud of Cormac has much to be proud of Cormac has much to be proud of Cormac has much to be proud of
    Join Date
    Jan 2006
    Location
    Cork
    Posts
    1,260

    Default

    Yeah, had one from the start.
    I'm hoping to see the pages drop out of the index in the next fortnight. If not then I'm going to have to get onto Google about this.

+ Reply to Thread

Similar Threads

  1. Robots.txt
    By distressed in forum Search Engine Optimisation
    Replies: 16
    Last Post: 15-07-2008, 12:32 PM
  2. Googlebot blocked from robots.txt + sitemap.xml
    By Cormac in forum Search Engine Optimisation
    Replies: 8
    Last Post: 17-04-2008, 08:48 PM
  3. Accepting Credit Card Payments
    By paulocon in forum E-Commerce
    Replies: 15
    Last Post: 01-04-2008, 03:28 PM
  4. Bad Robots
    By Cormac in forum Webmaster Discussion
    Replies: 1
    Last Post: 17-10-2006, 03:31 PM
  5. Accepting Payments on a Website
    By Coby McNulty in forum Webmaster Discussion
    Replies: 3
    Last Post: 17-02-2006, 09:47 PM

Tags for this Thread

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts

Search Engine Optimization by vBSEO

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64