Announcement

Collapse
No announcement yet.

robots.txt

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

    #16
    Bots are programmed by the people running them to use Unix and Windows commands to look for file lists - this is why, even on a Unix server, you will see a Windows folder path showing up in your logs. If not blocked by an exclusion in robots.txt even a 'good' bot will index every link it finds.

    "Bad" bots will ignore the robots.txt and just index everything it can find anyway. I password protect the folders I don't want the bots to enter, as well as posting a robots.txt for the good bots.

    I view a properly written robots.txt file as beneficial to the SE's telling their bot where not to waste its time. Imagine how much more often G could crawl your site content, if it was not wasting time following all the cgi links on all the sites that don't use a robots.txt file?
    Bill
    www.egyptianwonders.co.uk
    Text directoryWorldwide Actinic(TM) shops
    BC Ness Solutions Support services, custom software
    Registered Microsoft™ Partner (ISV)
    VoIP UK: 0131 208 0605
    Located: Alexandria, EGYPT

    Comment


      #17
      This being the case how can the robot get into stats and test folders etc?
      It al depends on where the test folders are.

      eg catalog test mode puts the folder within acatalog

      Comment

      Working...
      X