Irish SEO,  Marketing & Webmaster Discussion

 
Googlebot blocked from robots.txt + sitemap.xml

This is a discussion on Googlebot blocked from robots.txt + sitemap.xml within the Search Engine Optimisation forums, part of the Online Marketing category.


#1 Cormac Moylan (Cormac), 17-04-2008, 10:49 AM

I'm running WordPress MU for the ammado blog. All is going well, but I'm having some search visibility issues.

I have a robots.txt file in the blog root of the MU site (using a plugin for MU robots.txt files):
http://en.blog.ammado.com/robots.txt

I also have a sitemap.xml file in the same location
http://en.blog.ammado.com/sitemap.xml

I have added en.blog.ammado as a Google Sitemap account and I tried to add the sitemap.xml file to my account. Google is throwing up a "Googlebot is blocked from http://en.blog.ammado.com/" error message.

I'm not sure where Googlebot (GB) is being blocked though. Any ideas on this one?

I have also added a robots.txt file to blog.ammado.com just in case - http://blog.ammado.com/robots.txt
#2 Leo Fogarty (Drang3d), 17-04-2008, 04:04 PM

Cormac, it might be something to do with your .htaccess file.
#3 Cormac Moylan (Cormac), 17-04-2008, 04:53 PM

Hi,
I'm pretty sure the .htaccess file which I am using is the bog standard one for WPMU:
Code:
RewriteEngine On
RewriteBase /

# Uploaded files: hand requests under files/ to WPMU's file handler
RewriteRule ^(.*/)?files/$ index.php [L]
RewriteRule ^(.*/)?files/(.*) wp-content/blogs.php?file=$2 [L]

# Serve existing files and directories as-is
RewriteCond %{REQUEST_FILENAME} -f [OR]
RewriteCond %{REQUEST_FILENAME} -d
RewriteRule . - [L]
# Strip an optional blog-path prefix from wp-* and .php requests
RewriteRule ^([_0-9a-zA-Z-]+/)?(wp-.*) $2 [L]
RewriteRule ^([_0-9a-zA-Z-]+/)?(.*\.php)$ $2 [L]
# Everything else goes to WordPress
RewriteRule . index.php [L]

I added my sitemap.xml file to MSN without any trouble earlier on this afternoon. It's just Google who don't want to play.
#4 Leo Fogarty (Drang3d), 17-04-2008, 05:20 PM

Hey Cormac

What's the exact error code that's being thrown up, e.g. 403?
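
One way to find that status code is to request the files while identifying as Googlebot. This is a rough sketch using only Python's standard library; the function names are made up for illustration, and it only imitates Googlebot's user-agent string, so a server that filters by IP address could still treat the real crawler differently.

```python
import urllib.error
import urllib.request

# Googlebot's published user-agent string.
GOOGLEBOT_UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"


def build_request(url, user_agent=GOOGLEBOT_UA):
    """Build a GET request that identifies itself with the given user agent."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


def fetch_status(url, user_agent=GOOGLEBOT_UA, timeout=10):
    """Return the HTTP status code the server sends back for url."""
    try:
        with urllib.request.urlopen(build_request(url, user_agent), timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as error:
        # 4xx and 5xx responses are raised as exceptions; the code is still what we want.
        return error.code
```

Calling fetch_status("http://en.blog.ammado.com/robots.txt"), and the same for sitemap.xml, would show whether the server answers 200, 403 or something else.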

#5 ghost, 17-04-2008, 05:48 PM
robots and sitemaps

Just viewed the robots.txt; it just gives the address of the sitemap.
I am not sure here, but if you provide a robots.txt without specifying who or what is allowed, will Google assume it's not allowed and bugger off?
Try to allow all and see what happens:
User-agent: *
Disallow:
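
For what it's worth, a robots.txt that only lists a sitemap should not block anything: with no matching Disallow rule, crawlers treat every URL as allowed. This can be checked offline with Python's standard urllib.robotparser. The sketch below uses a made-up helper name, googlebot_allowed, and Python's parser is only an approximation of how Googlebot itself reads the file.

```python
from urllib.robotparser import RobotFileParser


def googlebot_allowed(robots_txt, url):
    """Return True if the given robots.txt text lets Googlebot fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch("Googlebot", url)


home = "http://en.blog.ammado.com/"

# A file that only points at the sitemap has no Disallow rules.
sitemap_only = "Sitemap: http://en.blog.ammado.com/sitemap.xml"
# The explicit "allow all" form suggested above.
allow_all = "User-agent: *\nDisallow:"
# For contrast, a file that blocks every crawler.
block_all = "User-agent: *\nDisallow: /"

for name, text in [("sitemap only", sitemap_only),
                   ("allow all", allow_all),
                   ("block all", block_all)]:
    print(name, "->", googlebot_allowed(text, home))
```

Only the last file should report the home page as blocked, so adding the allow-all lines is harmless but not required.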
#6 ghost, 17-04-2008, 06:01 PM

Quote:
Originally Posted by Cormac
I have also added a robots.txt file to blog.ammado.com just in case - http://blog.ammado.com/robots.txt

Ignore the above, I was looking at this one.
#7 Dave Davis (Redfly), 17-04-2008, 08:25 PM

The problem is the robots meta tag.

<meta name='robots' content='noindex,nofollow' />

That will stop GB.
Remove it and you're dandy.
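
A quick way to spot this on other pages is to scan the HTML for a robots meta tag. This is a minimal sketch using Python's standard html.parser; the class and function names are made up for illustration, and it only looks at meta tags, not at X-Robots-Tag HTTP headers.

```python
from html.parser import HTMLParser


class RobotsMetaFinder(HTMLParser):
    """Collect the directives of every <meta name="robots"> tag."""

    def __init__(self):
        super().__init__()
        self.directives = []

    def handle_starttag(self, tag, attrs):
        # Self-closing tags such as <meta ... /> also end up here.
        if tag != "meta":
            return
        attributes = {name: (value or "") for name, value in attrs}
        if attributes.get("name", "").lower() == "robots":
            content = attributes.get("content", "")
            self.directives += [d.strip().lower() for d in content.split(",") if d.strip()]


def robots_directives(html):
    """Return the list of robots meta directives found in an HTML string."""
    finder = RobotsMetaFinder()
    finder.feed(html)
    return finder.directives


page = "<html><head><meta name='robots' content='noindex,nofollow' /></head></html>"
directives = robots_directives(page)
print("noindex" in directives, "nofollow" in directives)
```

If "noindex" shows up for a page that should be in the search results, the tag (or the setting that generates it) needs to go.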
#8 Cormac Moylan (Cormac), 17-04-2008, 08:42 PM

Nice one.
Didn't spot that at all.

I must have selected 'do not allow my WordPress installation to appear in search engines' when I was testing out stuff locally. When I uploaded the test environment, I forgot to change this setting. Yikes!
#9 Dave Davis (Redfly), 17-04-2008, 08:48 PM

Yup, that was probably it. Happens all the time.

When working on a site, we turn this setting on until the site is "ready for indexing". Sometimes we forget too.

Anyway, you should be all good now.
All times are GMT +1.

