Today's Posts Follow Us On Twitter! TFL Members on Twitter  
Forum search: Advanced Search  
Navigation
Marketplace
  Members Login:
Lost password?
  Forum Statistics:
Forum Members: 24,254
Total Threads: 80,792
Total Posts: 566,472
There are 1859 users currently browsing (tf).
 
  Our Partners:
 
  TalkFreelance     Business and Website Management     Advertising, SEO and Social Marketing :

What's the point of robots.txt?

Thread title: What's the point of robots.txt?
Closed Thread  
Page 1 of 2 1 2 >
    Thread tools Search this thread Display Modes  
05-23-2005, 10:59 PM
#1
DateinaDash is offline DateinaDash
Status: The BidMaster
Join date: Nov 2004
Location: England
Expertise:
Software:
 
Posts: 10,821
iTrader: 0 / 0%
 

DateinaDash is on a distinguished road

  Old  What's the point of robots.txt?

From my limited understanding, this file is for detailing the files/pages that you don't want the search engine to index, correct? I'm just wondering why anyone wouldn't want the search engines to index their site?

05-23-2005, 11:09 PM
#2
sysblnk is offline sysblnk
Status: I love this place
Join date: Mar 2005
Location:
Expertise:
Software:
 
Posts: 640
iTrader: 0 / 0%
 

sysblnk is on a distinguished road

  Old

Say you're a designer and you make a site for someone and keep a live preview for your portfolio. You would want to block that preview so that the search engine doesnt index the same content twice. Other reasons is for some reason you have personal information on a page and dont want the world to know about it. There's different reasons but so far I havent used it.

05-23-2005, 11:45 PM
#3
schroder is offline schroder
schroder's Avatar
Status: Member
Join date: Nov 2004
Location:
Expertise:
Software:
 
Posts: 159
iTrader: 0 / 0%
 

schroder is on a distinguished road

  Old

There is also bandwidth. You may have a big site and only a few pages that should really be of interest to the spider and you don't want all your bandwidth used up by a spider.

http://www.robotstxt.org - resource

If you create a spider your actually suppose to register it. I think few do though.

To add to the mystery, I looked at ebays robots.txt:

Code:
User-agent: *
Disallow: /help/confidence/ 
Disallow: /help/policies/ 
Disallow: /disney/
So apparently disney is a no touchy. I wonder why they wouldn't allow their policies to be spidered though. Maybe they don't want someone just copying the policy for their own use?

05-23-2005, 11:49 PM
#4
DateinaDash is offline DateinaDash
Status: The BidMaster
Join date: Nov 2004
Location: England
Expertise:
Software:
 
Posts: 10,821
iTrader: 0 / 0%
 

DateinaDash is on a distinguished road

  Old

Humm, that is very weird. I can understand the policies I guess, but why Disney?

05-24-2005, 03:30 AM
#5
madpenguin2 is offline madpenguin2
Status: I'm new around here
Join date: May 2005
Location: PA
Expertise:
Software:
 
Posts: 24
iTrader: 0 / 0%
 

madpenguin2 is on a distinguished road

  Old

Some other things you may not want indexed by the SE's. An 'admin' area. Or, maybe a stats page that is vulnerable to 'referral spamming'.

Brett

06-17-2005, 12:07 AM
#6
BXD is offline BXD
Status: I'm new around here
Join date: Jun 2005
Location:
Expertise:
Software:
 
Posts: 19
iTrader: 0 / 0%
 

BXD is on a distinguished road

  Old

disney was a promotion once but came up at the very top of most search enginey. I guess disney asked them to exclude it as they want to be the search result number one

06-29-2005, 02:18 AM
#7
Royalty Hosting is offline Royalty Hosting
Status: I'm new around here
Join date: Jun 2005
Location:
Expertise:
Software:
 
Posts: 18
iTrader: 0 / 0%
 

Royalty Hosting is on a distinguished road

  Old

Robots.txt simply specifes which folder/file should be index by search engine spiders.

06-29-2005, 02:47 AM
#8
elkjar is offline elkjar
elkjar's Avatar
Status: Junior Member
Join date: Jun 2005
Location: Detroit, MI
Expertise:
Software:
 
Posts: 73
iTrader: 0 / 0%
 

elkjar is on a distinguished road

Send a message via AIM to elkjar

  Old

I use robots.txt to prevent search engines from finding my secret content.

06-29-2005, 02:50 AM
#9
Royalty Hosting is offline Royalty Hosting
Status: I'm new around here
Join date: Jun 2005
Location:
Expertise:
Software:
 
Posts: 18
iTrader: 0 / 0%
 

Royalty Hosting is on a distinguished road

  Old

That is what the robots.txt is for, I use it to prevent google from addming my website images to Google Images

07-14-2005, 06:57 AM
#10
WireNine.com is offline WireNine.com
Status: Junior Member
Join date: Jun 2005
Location:
Expertise:
Software:
 
Posts: 53
iTrader: 0 / 0%
 

WireNine.com is on a distinguished road

  Old

Does robots.txt work for all search engines?

Closed Thread  
Page 1 of 2 1 2 >


Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 

  Posting Rules  
Smilies are On
[IMG] code is On
HTML code is Off
Forum Jump:
 
  Contains New Posts Forum Contains New Posts   Contains No New Posts Forum Contains No New Posts   A Closed Forum Forum is Closed