Spider help

Phil L

New Member
Okay, I want to find software that'll enable me to locate instances of a specific keyword on a site and its subpages. I'm not sure exactly how I'd go about this; I tried downloading a web spider, though I can't find an .htm searcher to pair it with. Part of my problem may stem from the fact it's 2 AM and I am very tired, but regardless the fact remains I've searched up and down for such a tool. I'm looking for something more specific than what your typical crawler'll turn up; as I said, I want to be able to pinpoint an exact query/word from a massive website if that's at all possible. Thanks in advance.
 
Phil L said:
I tried downloading a web spider, though I can't find an .htm searcher to pair it with.

Yes, your idea of downloading a web spider is good. There are plenty of file-searching programs that meta search in files, including .htm files. Two examples are Google Desktop Search (only works on the desktop :( ) and Blinkx (I think that's how you spell it). Both require a preliminary search pass over your files first, and how long that takes depends on the number of files on your hard drive/desktop.
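For anyone who would rather script the "spider first, then search the saved files" approach than install a desktop search tool, here is a minimal Python sketch. It assumes the spider has already mirrored the site into a local folder; the function name grep_html_files and the folder layout are illustrative, not part of any tool mentioned in this thread. It matches against the raw file contents, markup included.

```python
from pathlib import Path


def grep_html_files(root, keyword, patterns=("*.htm", "*.html")):
    """Return (path, line_number, line) for every line containing keyword.

    Hypothetical helper: `root` is assumed to be the folder where a
    spider saved the site. Matching is case-insensitive and is done on
    the raw file text, so markup and attributes can match too.
    """
    needle = keyword.lower()
    hits = []
    for pattern in patterns:
        for path in sorted(Path(root).rglob(pattern)):
            # Read one file at a time, line by line, so memory use stays
            # small no matter how large the mirrored site is.
            with path.open(encoding="utf-8", errors="replace") as handle:
                for number, line in enumerate(handle, start=1):
                    if needle in line.lower():
                        hits.append((str(path), number, line.strip()))
    return hits
```

Calling grep_html_files("mirror", "spider") would list every matching line under a folder named mirror, with the file path and line number for each hit.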
 
So I should download this "blinkx" program?

If there was a way to search directly through the site itself it'd be much easier. This page is huge, and I'm not looking forward to downloading some odd gigabytes of text.

By the way, if by "meta search" you mean "searching the meta tags", that's not what I was referring to. I meant a keyword visible in the rendered HTML itself. For example, finding every instance in which you've posted in this forum.
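To make the distinction between meta tags and visible text concrete, here is a rough Python sketch using only the standard library's html.parser. The class VisibleText and the function count_visible are made-up names for illustration. It counts a keyword only in text that sits outside script, style, and title elements, and it never looks at tag attributes, so a keyword inside a meta tag is ignored. It is an approximation: it does not apply CSS, so text hidden by a stylesheet still counts.

```python
from html.parser import HTMLParser


class VisibleText(HTMLParser):
    """Collect text outside <script>, <style>, and <title> elements."""

    SKIP = {"script", "style", "title"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        # Attributes (such as a meta tag's content) are never collected.
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.chunks.append(data)


def count_visible(html, keyword):
    """Count case-insensitive occurrences of keyword in the visible text."""
    parser = VisibleText()
    parser.feed(html)
    parser.close()
    # Chunks are joined with spaces, so a word split across inline tags
    # (for example spi<b>der</b>) will not be matched.
    text = " ".join(" ".join(parser.chunks).split())
    return text.lower().count(keyword.lower())
```

Running count_visible over each saved page gives a per-page count that reflects what a reader would actually see, not what is in the page's metadata.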
 
Phil L said:
So I should download this "blinkx" program?
Yes, that is what I suggested.

Phil L said:
If there was a way to search directly through the site itself it'd be much easier. This page is huge, and I'm not looking forward to downloading some odd gigabytes of text.
Try Google, or any other search engine. They have a specific string for restricting a search to one website (on Google it's the site: operator, e.g. keyword site:example.com), but I'm not sure how good it is.

Phil L said:
By the way, if by "meta search" you mean "searching the meta tags", that's not what I was referring to. I meant a keyword visible in the rendered HTML itself. For example, finding every instance in which you've posted in this forum.
I know what you mean. Blinkx indexes the text inside common files like .htm and .doc so you can search the text inside them. As for the last bit: open my profile and click the link to view all posts by me.
 
Well, that was an example; I meant assuming the "find all posts" feature was not available. Does it index the text in multiple files or one huge file? If the latter, it'll probably crash my PC despite my having 1.5GB RAM.

I've already tried Google, and other search engines. Nothing.
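If neither a search engine nor a desktop indexer works out, searching the site directly without saving it to disk can be sketched in a few dozen lines of Python. This is only a rough outline under several assumptions: the names crawl_for_keyword, fetch, and LinkParser are invented for this example, it follows only ordinary links on the same host, it matches against raw HTML (markup included), and it does no robots.txt checking or rate limiting, which a real crawl of someone else's site should add.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse
from urllib.request import urlopen


class LinkParser(HTMLParser):
    """Collect the href value of every <a> tag on a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def fetch(url):
    """Download one page as text (no retries or politeness delays)."""
    with urlopen(url, timeout=10) as response:
        return response.read().decode("utf-8", errors="replace")


def crawl_for_keyword(start_url, keyword, fetch_page=fetch, max_pages=50):
    """Breadth-first crawl of one host; return URLs whose HTML has keyword."""
    host = urlparse(start_url).netloc
    needle = keyword.lower()
    queue = deque([start_url])
    seen = {start_url}
    matches = []
    fetched = 0
    while queue and fetched < max_pages:
        url = queue.popleft()
        try:
            html = fetch_page(url)
        except OSError:
            continue  # skip pages that fail to download
        fetched += 1
        # Only one page is held in memory at a time; nothing is saved.
        if needle in html.lower():
            matches.append(url)
        parser = LinkParser()
        parser.feed(html)
        for href in parser.links:
            link = urldefrag(urljoin(url, href)).url
            if urlparse(link).netloc == host and link not in seen:
                seen.add(link)
                queue.append(link)
    return matches
```

The max_pages cap is there so a huge site does not turn into an open-ended download; raise it once the results on a small run look right. The fetch_page parameter exists so the crawl logic can be tried against canned pages before pointing it at a live site.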
 