The Anti-Thesaurus: Unwords For Web Searches
Nicholas Carroll writes: "In the continual struggle between search engine administrators, index spammers, and the chaos that underlies knowledge classification, we have endless tools for 'increasing relevance' of search returns, ranging from much ballyhooed and misunderstood 'meta keywords,' to complex algorithms that are still far from perfecting artificial intelligence. Proposal: there should be a metadata standard allowing webmasters to manually decrease the relevance of their pages for specific search terms and phrases."
Isn't that what - is for? (Score:2, Informative)
For example, if I'm looking for info on a Toyota Supra and too many Celica-related pages come up, I'll type:
toyota supra -celica
On a related note, does anyone else find it really aggravating that Google's built-in exclusion list of universal keywords (a, 1, of) also drops those words when they appear inside phrases?
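For anyone who hasn't used the minus operator, here is a rough sketch of what "toyota supra -celica" asks for. It is a toy filter over a made-up list of page titles, not how any real engine works; `parse_query`, `matches`, and `pages` are names invented for illustration.

```python
def parse_query(query):
    """Split a query into required terms and excluded ('-' prefixed) terms."""
    required, excluded = [], []
    for token in query.lower().split():
        if token.startswith("-") and len(token) > 1:
            excluded.append(token[1:])
        else:
            required.append(token)
    return required, excluded


def matches(text, query):
    """True if text has every required term and none of the excluded ones."""
    required, excluded = parse_query(query)
    words = set(text.lower().split())
    has_required = all(term in words for term in required)
    has_excluded = any(term in words for term in excluded)
    return has_required and not has_excluded


# Hypothetical page titles, standing in for an index.
pages = [
    "Toyota Supra turbo specs",
    "Toyota Celica and Supra history",
    "Honda Civic review",
]
hits = [page for page in pages if matches(page, "toyota supra -celica")]
```

Note that the exclusion is chosen by the searcher at query time, whereas the proposal in the story puts the choice in the webmaster's hands.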
Re:Proposal won't work: No incentive! (Score:4, Informative)
http://www.robotstxt.org/wc/exclusion.html [robotstxt.org]
robots.txt ? (Score:3, Informative)
If more people used robots.txt, a lot of 'only useful to internal users' sites would drop right off the engines, leaving relevant results for the rest of the world...
Just a thought...
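To make the robots.txt idea concrete, here is a minimal sketch using Python's standard `urllib.robotparser` to show how a well-behaved crawler would read such a file. The `/internal/` path, the example.com URLs, and the `ExampleBot` agent name are assumptions made up for the example.

```python
from urllib.robotparser import RobotFileParser

# A hypothetical robots.txt that keeps crawlers out of an internal-only area.
ROBOTS_TXT = """\
User-agent: *
Disallow: /internal/
"""


def allowed(url, user_agent="ExampleBot"):
    """Return True if the rules above permit user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, url)


internal_ok = allowed("http://intranet.example.com/internal/wiki")
public_ok = allowed("http://intranet.example.com/public/index.html")
```

This only helps with crawlers that honor the file; it is a request, not access control.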
Re:How about this? (Score:4, Informative)
Disclaimer: I'm in no way associated with Google.
What about !keyword? (Score:3, Informative)
Presumably the same could be done for <meta name="keywords"> in HTML.
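A rough sketch of how an indexer might read "!keyword" entries out of a keywords meta tag, using Python's standard `html.parser`. To be clear, no engine is known to support this syntax; the "!" convention, `KeywordParser`, and `extract_terms` are all hypothetical, and this is just one way the idea could be wired up.

```python
from html.parser import HTMLParser


class KeywordParser(HTMLParser):
    """Collect normal keywords and hypothetical '!'-prefixed unwords."""

    def __init__(self):
        super().__init__()
        self.keywords = []
        self.unwords = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        if (attr.get("name") or "").lower() != "keywords":
            return
        for term in (attr.get("content") or "").split(","):
            term = term.strip().lower()
            if term.startswith("!"):
                # Assumed convention: a leading '!' marks a term to demote.
                unword = term[1:].strip()
                if unword:
                    self.unwords.append(unword)
            elif term:
                self.keywords.append(term)


def extract_terms(html):
    """Return (keywords, unwords) found in the page's keywords meta tag."""
    parser = KeywordParser()
    parser.feed(html)
    return parser.keywords, parser.unwords


page = '<html><head><meta name="keywords" content="toyota, supra, !celica"></head></html>'
keywords, unwords = extract_terms(page)
```

An engine could then lower the page's rank for any query containing one of the unwords, which is the manual "decrease relevance" knob the story proposes.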
mod_rewrite reference, examples (Score:3, Informative)
Well, some docs are here [apache.org], and the mod_rewrite reference is here [apache.org].
Here is a goofy example that redirects visitors back to their Google query, except with the word "porn" appended to it. As an added bonus, it only does so when the clock's seconds value is an even number. (Or apply the same test to the last digit of their IP address.) Replace the plus sign before "porn" with about 100 plus signs and they won't see the addition, because each plus sign becomes a space. The "%1" refers to their original query.
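For readers who don't speak mod_rewrite, the logic described above can be sketched in Python. This is not the rewrite rule itself, only an illustration of the same decision: check the referrer, check the clock, rebuild the query. The function name `append_word_redirect` and its parameters are made up for the example, and the referrer is assumed to carry the search terms in a `q` parameter.

```python
from urllib.parse import parse_qs, urlencode, urlsplit


def append_word_redirect(referer, seconds, extra_word):
    """Return a search URL with extra_word appended, or None for no redirect."""
    if seconds % 2 != 0:
        return None  # only fire when the clock's seconds value is even
    parts = urlsplit(referer)
    host = parts.hostname or ""
    if host != "google.com" and not host.endswith(".google.com"):
        return None  # visitor did not arrive from a Google search
    query = parse_qs(parts.query).get("q")
    if not query:
        return None  # no search terms to send back
    # urlencode turns the spaces back into plus signs.
    return "http://www.google.com/search?" + urlencode({"q": query[0] + " " + extra_word})


target = append_word_redirect("http://www.google.com/search?q=toyota+supra", 10, "kittens")
```

Here `query[0]` plays the role of "%1" in the description: the visitor's original search terms.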
Here's another one that checks the User-Agent for a URL, and then redirects to it. This keeps most spiders and the like off your pages, since they usually put their URLs in the User-Agent:
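The same check, sketched in Python rather than as a rewrite rule. The regular expression is a guess at what counts as "a URL in the User-Agent", not a complete URL grammar, and `redirect_target` is a made-up name.

```python
import re

# Assumption: a URL is "http://" or "https://" followed by anything up to
# whitespace or the punctuation that usually surrounds it in a User-Agent.
URL_IN_UA = re.compile(r"https?://[^\s;,()]+")


def redirect_target(user_agent):
    """Return the first URL found in the User-Agent string, or None."""
    match = URL_IN_UA.search(user_agent or "")
    return match.group(0) if match else None


spider = redirect_target("Googlebot/2.1 (+http://www.google.com/bot.html)")
browser = redirect_target("Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)")
```

A server would send the spider a redirect to `spider` and serve the page normally when the result is None.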
Anything you can think of is possible. I think you can even hook it into external scripts.