p2p.wrox.com Forums

Need to download code?

View our list of code downloads.


Go Back   p2p.wrox.com Forums > Web Programming > HTML > HTML Code Clinic
I forgot my password Register Now
Register | FAQ | Members List | Calendar | Search | Today's Posts | Mark Forums Read
HTML Code Clinic Do you have some HTML code you'd like to share and get suggestions from others for tweaking or improving it? This discussion is the place.

Welcome to the p2p.wrox.com Forums.

You are currently viewing the HTML Code Clinic section of the Wrox p2p Programmer to Programmer discussion community. This is a community of more than 40,000 computer programmers including Wrox book authors and readers. As a guest, you can read any forum posting. By joining our free Wrox p2p community you can post your own programming questions and respond to other programmers’ questions. Registered users also don't have to see the ads that are displayed to guests. Registration is fast, simple and absolutely free so please, join today!
Join today and post to win prizes! Post more to increase your chances of being Wrox’s top poster of the month.

Reply
 
Thread Tools Search this Thread Display Modes
  #1 (permalink)  
Old April 5th, 2009, 07:26 PM
Friend of Wrox
Points: 6,811, Level: 35
Points: 6,811, Level: 35 Points: 6,811, Level: 35 Points: 6,811, Level: 35
Activity: 0%
Activity: 0% Activity: 0% Activity: 0%
 
Join Date: Jan 2005
Location: Mauchline, East Ayrshire, Scotland
Posts: 1,518
Thanks: 0
Thanked 0 Times in 0 Posts
Send a message via ICQ to crmpicco Send a message via AIM to crmpicco Send a message via MSN to crmpicco Send a message via Yahoo to crmpicco
Default prevent Google/Yahoo!/MSN spidering my webpage

Hi,


Am i safe enough with the following meta tags to prevent Google/Yahoo!/MSN Search from spidering my webpage?

Code:
<meta name="robots" content="noindex" />
<meta name="robots" content="nofollow" />
<meta name="robots" content="noarchive" />
<meta name="robots" content="noodp" />
<meta name="robots" content="noimageindex,nomediaindex" />
<meta name="robots" content="unavailable_after: 05-Apr-2009 22:00:00 CET" />
<meta name="googlebot" content="noindex">
<meta name="googlebot" content="nosnippet" />
<meta name="slurp" content="noydir">
My robots.txt file:
Code:
User-agent: *
Disallow: /picco.html
Also - is there anyway to test if Google will pick it up?

Thanks,
Picco
__________________
_______________________
Ayrshire Minis - a Mini E-Community
http://www.ayrshireminis.com
http://www.crmpicco.co.uk
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Reddit!
Reply With Quote
  #2 (permalink)  
Old April 5th, 2009, 09:01 PM
jminatel's Avatar
Wrox Staff
Points: 7,285, Level: 36
Points: 7,285, Level: 36 Points: 7,285, Level: 36 Points: 7,285, Level: 36
Activity: 21%
Activity: 21% Activity: 21% Activity: 21%
 
Join Date: May 2003
Location: Indianapolis, IN, USA.
Posts: 1,349
Thanks: 27
Thanked 49 Times in 40 Posts
Default

I think you have it more than covered. To verify it with Google, sign up for a free Google webmaster tools account
www.google.com/webmasters/tools/
and from there, you can verify which pages Google is and isn't spidering under "URLs restricted by robots.txt."
__________________
Jim Minatel
Associate Publisher
Wiley Technology Publishing
WROX Press
Blog: http://p2p.wrox.com/content/blogs/jminatel
Wrox online library: http://wrox.books24x7.com
Wrox on Twitter: http://twitter.com/wrox
Did someone here help you? Click on their post!
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Reddit!
Reply With Quote
  #3 (permalink)  
Old April 6th, 2009, 06:47 PM
Friend of Wrox
Points: 6,811, Level: 35
Points: 6,811, Level: 35 Points: 6,811, Level: 35 Points: 6,811, Level: 35
Activity: 0%
Activity: 0% Activity: 0% Activity: 0%
 
Join Date: Jan 2005
Location: Mauchline, East Ayrshire, Scotland
Posts: 1,518
Thanks: 0
Thanked 0 Times in 0 Posts
Send a message via ICQ to crmpicco Send a message via AIM to crmpicco Send a message via MSN to crmpicco Send a message via Yahoo to crmpicco
Default

Thanks Jim, i'll do that. What about MSN Search and Yahoo!

Will this prevent those two search engines from spidering the page?

Picco
__________________
_______________________
Ayrshire Minis - a Mini E-Community
http://www.ayrshireminis.com
http://www.crmpicco.co.uk
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Reddit!
Reply With Quote
  #4 (permalink)  
Old April 7th, 2009, 05:28 PM
jminatel's Avatar
Wrox Staff
Points: 7,285, Level: 36
Points: 7,285, Level: 36 Points: 7,285, Level: 36 Points: 7,285, Level: 36
Activity: 21%
Activity: 21% Activity: 21% Activity: 21%
 
Join Date: May 2003
Location: Indianapolis, IN, USA.
Posts: 1,349
Thanks: 27
Thanked 49 Times in 40 Posts
Default

I think you're covered with the robots.txt, as I think both Yahoo and MSN (Live) search are respectable crawlers and follow the robot.txt rules. Where you may run in to problems are some of the lesser known crawlers, and even malicious crawlers that deliberately ignore robots. If there is a link to this page from some other page, there's a good chance some crawlers will index it.
__________________
Jim Minatel
Associate Publisher
Wiley Technology Publishing
WROX Press
Blog: http://p2p.wrox.com/content/blogs/jminatel
Wrox online library: http://wrox.books24x7.com
Wrox on Twitter: http://twitter.com/wrox
Did someone here help you? Click on their post!
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Reddit!
Reply With Quote
  #5 (permalink)  
Old April 21st, 2009, 05:13 AM
Friend of Wrox
Points: 941, Level: 11
Points: 941, Level: 11 Points: 941, Level: 11 Points: 941, Level: 11
Activity: 0%
Activity: 0% Activity: 0% Activity: 0%
 
Join Date: Jun 2007
Location: San Diego, CA, USA.
Posts: 278
Thanks: 0
Thanked 4 Times in 4 Posts
Default

Yeah, the robots.txt file is all you really need to disallow the page. None of the major engines are looking to catalog things you don't want indexed.

If it's something really sensitive, you may want to look at ASP.NET 2.0. There are some fairly basic ways of setting up a login system to password protect sensitive files like that. Check out www.asp.net for video tutorials on it if you're interested.
__________________
-------------------------

Whatever you can do or dream you can, begin it. Boldness has genius, power and magic in it. Begin it now.
-Johann von Goethe

When Two Hearts Race... Both Win.
-Dove Chocolate Wrapper

Chroniclemaster1, Founder of www.EarthChronicle.com
A Growing History of our Planet, by our Planet, for our Planet.
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Reddit!
Reply With Quote
  #6 (permalink)  
Old May 8th, 2009, 03:24 PM
Authorized User
Points: 66, Level: 1
Points: 66, Level: 1 Points: 66, Level: 1 Points: 66, Level: 1
Activity: 0%
Activity: 0% Activity: 0% Activity: 0%
 
Join Date: Nov 2008
Location: virudhunagar, tamil nadu, India.
Posts: 22
Thanks: 0
Thanked 0 Times in 0 Posts
Default Try php

Try php. It provide some ways to satisfy your needs.
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Reddit!
Reply With Quote
Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is Off
HTML code is Off
Trackbacks are Off
Pingbacks are On
Refbacks are Off
Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
msn help muteblaster Beginning VB 6 2 December 21st, 2006 02:51 PM
Can we connect yahoo,msn mail accounts using TCP? suryasimha ASP.NET 2.0 Professional 1 September 22nd, 2006 12:35 PM
Can we connect yahoo,msn mail accounts using TCP? suryasimha ASP.NET 1.0 and 1.1 Professional 3 September 22nd, 2006 10:53 AM
MSN / Yahoo messenger like pop up window tact_259 General .NET 4 May 12th, 2004 03:30 PM
How can I search at Yahoo or Google ? bapechun Classic ASP Basics 1 March 26th, 2004 11:08 PM



All times are GMT -4. The time now is 02:22 AM.


Powered by vBulletin® Version 3.6.8
Copyright ©2000 - 2009, Jelsoft Enterprises Ltd.
© 2008 Wiley Publishing, Inc