Irish SEO,  Marketing & Webmaster Discussion

 
Make money - save the planet!

fake googlebots gobbling my band width

This is a discussion on fake googlebots gobbling my band width within the Server / Technical Administration Tips and Queries forums, part of the Webmaster Help category; Has any one got a solution for these creatures. I have a googlebot in one site nearly every hour , ...


Go Back   Irish SEO, Marketing & Webmaster Discussion > Webmaster Help > Server / Technical Administration Tips and Queries

Register Forum Rules FAQDonate Members List Calendar Search Today's Posts Mark Forums Read


Notices

Reply

 

LinkBack Thread Tools Display Modes
  #1 (permalink)  
Old 11-01-2008, 04:25 PM
ghost's Avatar
Wannabe Geek
 
Join Date: Dec 2007
Location: Ennis
Posts: 160
Nominated 0 Times in 0 Posts
TOTW/F/M Award(s): 0
ghost will become famous soon enough
Default fake googlebots gobbling my band width

Has any one got a solution for these creatures.
I have a googlebot in one site nearly every hour , yesterday it cost me 30.33 MB
I am sure its not the real googlebot as my google Webmaster tools show the last visit on the 7th jan.
How do others deal with this problem or do you just ignore it.
In addition to this the bot activity I am seeing now started just before Christmas ,
I all so use a gmail account on the site to keep spam pressure off my server , my Gmail spam has more then doubled
and the subjects have changed I always get porn invites + 1001 ways to enlarge my manhood now I have a whole new range of ads.
any thoughts.
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit! Wong this Post!
Reply With Quote
  #2 (permalink)  
Old 11-01-2008, 05:24 PM
Forbairt's Avatar
respect my AW-THOR-IT-AYY
 
Join Date: Jun 2007
Location: My Office, Dublin
Posts: 2,022
Nominated 2 Times in 1 Post
Nominated TOTW/F/M Award(s): 1
Forbairt will become famous soon enoughForbairt will become famous soon enoughForbairt will become famous soon enoughForbairt will become famous soon enoughForbairt will become famous soon enoughForbairt will become famous soon enoughForbairt will become famous soon enoughForbairt will become famous soon enoughForbairt will become famous soon enoughForbairt will become famous soon enoughForbairt will become famous soon enough
Send a message via AIM to Forbairt Send a message via MSN to Forbairt Send a message via Yahoo to Forbairt Send a message via Skype™ to Forbairt
Default

Quote:
Originally Posted by ghost View Post
and the subjects have changed I always get porn invites + 1001 ways to enlarge my manhood now I have a whole new range of ads.
any thoughts.

In some cases it can work ... but you want to be careful and make sure its from a reputable seller


but seriously ... why do you think its googlebot ?

is 30mbs a day of bandwidth an issue ? ... can you implement an no follow rule ?

Quote:
<meta name="robots" content="noindex,nofollow" />
or alternatively the robot.txt to exclude that specific bot ?

Quote:
User-agent: BadBot
Disallow: /
where BadBot is the name of the bot ?
__________________
Forbairt Media | Web Design & Development Galway / Dublin, Ireland - coming soon ... ( vague but descriptive isn't it )
Recent Work: Safari Club African Safari Holidays - South Africa Safaris
Other Stuff: FluffyLinkulator Rapid Inclusion Service Tools
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit! Wong this Post!
Reply With Quote
  #3 (permalink)  
Old 11-01-2008, 05:34 PM
louie's Avatar
Senior Member
 
Join Date: Jan 2006
Location: Dublin, Ireland
Posts: 2,010
Nominated 5 Times in 3 Posts
TOTW/F/M Award(s): 0
louie has much to be proud oflouie has much to be proud oflouie has much to be proud oflouie has much to be proud oflouie has much to be proud oflouie has much to be proud oflouie has much to be proud oflouie has much to be proud of
Send a message via Yahoo to louie Send a message via Skype™ to louie
Default

before you block it make sure is not Google.

Just because it shows in the Webmaster Central the date of last visit, it doesn't mean Google doesn't spider your website everyday.
__________________
:. Web Design & Development Web Design Ireland
:. Search Engines Optimization Search Engines Optimization
:. Directory Submission Directory Submission
:. News & Press Release Ireland GiveItSocks.com
:. Used Cars Ireland, Car Parts & Car Audio Cars For Sale, Car Parts & Accessories
:. I Have 2 Find It Directory SEF Directory
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit! Wong this Post!
Reply With Quote
  #4 (permalink)  
Old 11-01-2008, 05:36 PM
blacknight's Avatar
Web Slave
 
Join Date: Jan 2006
Location: Ireland
Posts: 6,278
blacknight is a splendid one to beholdblacknight is a splendid one to beholdblacknight is a splendid one to beholdblacknight is a splendid one to beholdblacknight is a splendid one to beholdblacknight is a splendid one to beholdblacknight is a splendid one to behold
Send a message via ICQ to blacknight Send a message via AIM to blacknight Send a message via MSN to blacknight
Default

30 megs a day is neglible.

In any case you could check the logs for the IP ranges and see if they belong to Google or not...
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit! Wong this Post!
Reply With Quote
  #5 (permalink)  
Old 11-01-2008, 05:48 PM
ghost's Avatar
Wannabe Geek
 
Join Date: Dec 2007
Location: Ennis
Posts: 160
Nominated 0 Times in 0 Posts
TOTW/F/M Award(s): 0
ghost will become famous soon enough
Default

Quote:
Originally Posted by Forbairt View Post
In some cases it can work ... but you want to be careful and make sure its from a reputable seller
In situations where most men need to shake I have to kick mine

No problem with the band width unless its been wasted with no benefits to me I am just presuming its a fake google bot .this site has about 20 odd visitors a day with very few inward links the only changing content is a news feed with a cach time of 8 hours.
Is it usual for google to visit sites every hour or so if it is I am more them happy to accommodate it.
I have already tried to capitalize on it by adding links to other sites on the main pages.
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit! Wong this Post!
Reply With Quote
  #6 (permalink)  
Old 11-01-2008, 05:54 PM
Forbairt's Avatar
respect my AW-THOR-IT-AYY
 
Join Date: Jun 2007
Location: My Office, Dublin
Posts: 2,022
Nominated 2 Times in 1 Post
Nominated TOTW/F/M Award(s): 1
Forbairt will become famous soon enoughForbairt will become famous soon enoughForbairt will become famous soon enoughForbairt will become famous soon enoughForbairt will become famous soon enoughForbairt will become famous soon enoughForbairt will become famous soon enoughForbairt will become famous soon enoughForbairt will become famous soon enoughForbairt will become famous soon enoughForbairt will become famous soon enough
Send a message via AIM to Forbairt Send a message via MSN to Forbairt Send a message via Yahoo to Forbairt Send a message via Skype™ to Forbairt
Default

Quote:
Originally Posted by ghost View Post
In situations where most men need to shake I have to kick mine
Ah well maybe its for you then .. sometimes things just need a good kick start

Quote:
No problem with the band width unless its been wasted with no benefits to me I am just presuming its a fake google bot .
30*30days = less than a gb ... most plans these days are giving 30 ish gbs a month (warning may not be true) ... (most plans I've seen in the budgetish range at any rate). Unless it really starts to become an issue I wouldn't worry about it

Quote:
this site has about 20 odd visitors a day with very few inward links the only changing content is a news feed with a cach time of 8 hours.
Is it possible google is seeing the news feed on every page ? and updating thinking your content is changing daily ? (and may increase the rate it indexes your site)

You could try changing the rate in google webmaster tools to something more reasonable.
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit! Wong this Post!
Reply With Quote
  #7 (permalink)  
Old 11-01-2008, 06:11 PM
RedCardinal's Avatar
Richard Hearne
Recent Blog: Irish Banks In UK
 
Join Date: Feb 2006
Posts: 941
Nominated 0 Times in 0 Posts
TOTW/F/M Award(s): 0
RedCardinal is a splendid one to beholdRedCardinal is a splendid one to beholdRedCardinal is a splendid one to beholdRedCardinal is a splendid one to beholdRedCardinal is a splendid one to beholdRedCardinal is a splendid one to beholdRedCardinal is a splendid one to behold
Default

Quote:
Originally Posted by ghost View Post
I am sure its not the real googlebot as my google Webmaster tools show the last visit on the 7th jan.
The date they show in GWT is something akin to the fairy tales of Ireland.

There is actually a way to identify real Googlebots. Cant find the code right now, but if you PM me I'll dig it out. Google it - you'll find out how to verify Googlebot.
__________________
Search Engine Optimisation - Red Cardinal Internet Marketing
Internet Consultant Ireland | Search Engine Optimisation Services
Catering Company Dublin - My sister's handmade canape company!
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit! Wong this Post!
Reply With Quote
  #8 (permalink)  
Old 11-01-2008, 06:16 PM
ghost's Avatar
Wannabe Geek
 
Join Date: Dec 2007
Location: Ennis
Posts: 160
Nominated 0 Times in 0 Posts
TOTW/F/M Award(s): 0
ghost will become famous soon enough
Default

Quote:
Originally Posted by ghost View Post
No problem with the band width unless its been wasted
Sorry band width was not an issue , I am just finding it hard to believe google is giving this site so much attention.
News feeds 2 on the home page only
Have a robots,txt
have a google xml sitemap
have a yahoo txt sitemap
Will check the ip later
Maybe I have done something right and hit pay dirt and should stop moaning.

Last edited by ghost; 11-01-2008 at 06:24 PM.
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit! Wong this Post!
Reply With Quote
  #9 (permalink)  
Old 11-01-2008, 07:32 PM
ghost's Avatar
Wannabe Geek
 
Join Date: Dec 2007
Location: Ennis
Posts: 160
Nominated 0 Times in 0 Posts
TOTW/F/M Award(s): 0
ghost will become famous soon enough
Default

Quote:
Originally Posted by blacknight View Post
In any case you could check the logs for the IP ranges and see if they belong to Google or not...
Yes the server logs revealed all , and a its genuine Googlebot eight visits to day.
The problem I am having
The site has an Events Calender on the side bar of each page (an include page)
also has a photo gallery page , Each gallery displays 10 pics with links to larger images
in each case google is following the link for the larger image also finding the calender on each page and chasing so many links deep in to the calender as well, ie /gallery.php?picture_ref=1&user_eventsDate=2007-09
so the possiblities are 10 picture links multiplied by 30 calender links on each page = not good for google or me as most of the events are empty.
I will have to rectify this.
Thanks for the Help.
Ghost

Last edited by ghost; 11-01-2008 at 09:21 PM.
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit! Wong this Post!
Reply With Quote
  #10 (permalink)  
Old 11-01-2008, 08:24 PM
blacknight's Avatar
Web Slave
 
Join Date: Jan 2006
Location: Ireland
Posts: 6,278
blacknight is a splendid one to beholdblacknight is a splendid one to beholdblacknight is a splendid one to beholdblacknight is a splendid one to beholdblacknight is a splendid one to beholdblacknight is a splendid one to beholdblacknight is a splendid one to behold
Send a message via ICQ to blacknight Send a message via AIM to blacknight Send a message via MSN to blacknight
Default

You can use the robots.txt to stop the googlebots from spidering certain sections of your site.

NB: Not all bots behave
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit! Wong this Post!
Reply With Quote
Reply

Tags
band, fake, gobbling, googlebots, width

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On
Forum Jump


All times are GMT +1. The time now is 06:38 AM.


Powered by: vBulletin Version 3.7.3, Copyright ©2000 - 2008, Jelsoft Enterprises Limited.
Hosted in Ireland by Blacknight - Test your ISP |Irish Hosting Directory| Armchair.ie|Logo by Eden Web Design|Avatars by Afterglow |Latest Blog Entries | VPS HostingAd Management by RedTyger