HTFC Forums

#1
09-14-2007, 03:38 PM
YMA
How to index HTML files locally even with ROBOTS noindex?

I have a local mirror of the Web sites I manage. I don't want some of the
HTML pages to be indexed by Web spiders / robots, so I put the ROBOTS
meta-tag with "noindex" in them.

However, I would like those files to be indexed locally, so that I can find
things in them with the local Windows indexed search function. But the
Windows HTML filter intentionally does NOT index files with ROBOTS noindex,
so I don't get those files in my local searches.
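
To see which files in the mirror are affected, a minimal Python sketch along these lines could list them. It only reports the files and does not change how Windows indexes anything. The names `has_noindex` and `find_noindex_files` are made up for illustration, and it assumes the pages are readable as UTF-8 and use the usual `<meta name="robots" content="...">` form.

```python
from html.parser import HTMLParser
from pathlib import Path


class RobotsMetaParser(HTMLParser):
    """Records whether a page carries a ROBOTS meta-tag containing noindex."""

    def __init__(self):
        super().__init__()
        self.noindex = False

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        # Attribute values may be None (e.g. <meta name>), so normalise them.
        attr = {name: (value or "") for name, value in attrs}
        if attr.get("name", "").strip().lower() != "robots":
            return
        # The content attribute is a comma-separated list of directives.
        directives = [d.strip().lower() for d in attr.get("content", "").split(",")]
        if "noindex" in directives:
            self.noindex = True


def has_noindex(html_text):
    """Return True if the HTML text contains a ROBOTS noindex meta-tag."""
    parser = RobotsMetaParser()
    parser.feed(html_text)
    parser.close()
    return parser.noindex


def find_noindex_files(root):
    """Return the .htm/.html files under root that carry ROBOTS noindex."""
    hits = []
    for path in Path(root).rglob("*"):
        if path.is_file() and path.suffix.lower() in (".htm", ".html"):
            text = path.read_text(encoding="utf-8", errors="replace")
            if has_noindex(text):
                hits.append(path)
    return sorted(hits)
```

Calling `find_noindex_files` on the mirror's top folder (whatever its path is on the local machine) returns the pages that the local search would be expected to skip.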

Is there a way to tell the HTML filter to go ahead and index HTML files even
if they have the ROBOTS noindex meta-tag? I want my local and remote copies
to be identical, so I don't want to have ROBOTS index locally and ROBOTS
noindex remotely.

Has anybody else run into this problem? Does anyone have a solution?

Thanks!

YMA
#2
09-14-2007, 03:49 PM
Synapse Syndrome
Re: How to index HTML files locally even with ROBOTS noindex?

I just put a robots.txt file in the root folder of the website instead. I
do not know the metatag, but the text file may be more flexible, since you
can define which folders the spiders may or may not index.
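
As a rough illustration of the per-folder rules, Python's standard `urllib.robotparser` can check what a robots.txt file permits. The folder names and the `example.com` URLs below are invented for the example, and `is_allowed` is just a hypothetical helper. Because the exclusion lives in one file rather than in each page, the pages themselves would not need the metatag in this setup.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: keep all spiders out of two folders.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
Disallow: /drafts/
"""


def is_allowed(robots_txt, url, agent="*"):
    """Return True if the given robots.txt text lets `agent` fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


print(is_allowed(ROBOTS_TXT, "http://example.com/private/notes.html"))
print(is_allowed(ROBOTS_TXT, "http://example.com/public/index.html"))
```

The first URL falls under a `Disallow` rule and the second does not. Keep in mind that robots.txt is only a request to well-behaved spiders, not an access control.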

Loads more info here:
http://www.google.co.uk/search?sourc...q=robots%2etxt

ss.


#3
09-14-2007, 03:55 PM
Synapse Syndrome
Re: How to index HTML files locally even with ROBOTS noindex?

Also, according to this page, not all spiders honor the metatag:
http://www.robotstxt.org/wc/exclusion.html#meta

ss.


#4
09-14-2007, 04:08 PM
YMA
Re: How to index HTML files locally even with ROBOTS noindex?

Thanks for your answer, but I do not have access to the root folder of my
websites (with just one exception). So, I really need to be able to tweak the
local HTML filter on my machine...

BTW, I am aware of the limitations of the ROBOTS noindex meta-tag, but I can
live with them.

YMA
