Here's another similar idea, someone please do this: a search engine that excludes all "commercial" sites. "Commercial" means the site contains ads or accepts payments.
It's almost as if there's a need for a top-level domain to distinguish between commercial organisations and non-profits. This [1] was working brilliantly until everyone decided to abuse it left, right, and center.
.io is a generic TLD in most books now anyway. There's lots of other ccTLD that are fairly useless, though it's obvious why ones like .co.ck (the uninhabited Cook Islands) were never picked up as generics.
This would exclude SO (ads) and Wikipedia (accepts payments as donations).
Even of you refined your criteria, what do you have against sites that try to recoup costs of hosting or content? Or, are you referring to completely different sites, lumping them all under "commercial sites" label?
Okay, it seems I was trigger-happy in excluding nonprofits that accept donations. Maybe "accept payments for products or services" would work better? Not sure how to formalize that though.
More generally, I don't have anything against any sites, I'm trying to come up with a way of viewing the internet that would have a high signal to noise ratio. It seems plausible to me that excluding all sites with ads would improve the signal to noise ratio over what we have today. If you think the signal to noise ratio would be improved further by carefully including some sites with ads (SO is a good example), can you take a stab at defining the criteria?
Maybe include only sites whose ads are deemed acceptable by adblock? But then we might include many content farms...
Indeed, the first part (search that filters out site with ads, including affiliate links and the like) is something I wish for frequently. It isn't that I mind seeing ads (I can use AdBlock for that, and sometimes do), it's that I don't want to wade through spam sites when looking for info, and sites with ads seems like an obvious hard filter. Maybe DDG or the like could implement filter.
I remember Yahoo actually had this as a Yahoo Labs thing way back in the day (when they actually did search). You had a slider that would make your results all research (if you are researching a bike) to all commercial (if you want to buy one).
"Contains ads" by itself is a pretty bad definition of "commercial site," and will exclude a whole lot of useful, high-quality sites that run ads to pay the hosting bills.
Your idea started out good. Sometimes I want to research a product before I go looking for a merchant. Blocking all sites that contain ads? Just use adblock if you're that fanatical.
Presumably the idea is to define 'commercial' as a website with ads or accept payments, rather than removing sites with ads to increase viewing pleasure.
I think it's a good idea. It'd certainly help with the problem of blogspam etc.
I think the idea was that the presence of ads indicates the advertisers had influence on content, which isn't true for a lot of websites which run ads.
I don't think that just because a site makes money it is a bad result. If you searched for "Used Jeep Grand Cherokee" Craigslist would be a good result. So would AutoTrader. Both break your rule.
I agree - such a search engine would not be as general as Google or Yahoo. But in sacrificing generality it might become better at handling certain types of requests. I am sure that given such a service the community of HN will make light work of finding out what they are :)
Our Natural Language / Language Heuristics Engine can do this. We do to a small extent now, we haven't gone whole hog because I'm not sure I agree that commercial sites shouldn't rank. For things we determine to be reference searches we downrank sales pages, for things we determine to be shopping searches we uprank them.