Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Google Raises Word Limit

Posted by timothy on Sat Jan 22, 2005 08:12 PM
from the 32-is-nearly-the-answer-to-everything dept.
Philipp Lenssen writes "Google quietly raised their web search limit to 32 words. Previously, only up to 10 words were allowed per query, with succeeding words being ignored. This is not only important to specific approaches of advanced searching (for example, when you need to exclude many different keywords using the minus operator), but it's also of great help to certain tools using the Google API. While there doesn't seem to be any official statement from Google yet, some more details can be found at my Google blog."
This discussion has been archived. No new comments can be posted.
Display Options Threshold:
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
  • I wonder.... (Score:2, Insightful)

    by Heftklammerdosierer! (846009) on Saturday January 22 2005, @08:17PM (#11444941)
    ...what the first 32 word google bomb will be.
    • 1 reply beneath your current threshold.
  • Finnally. (Score:3, Insightful)

    by Phantombantam (850513) on Saturday January 22 2005, @08:18PM (#11444944)
    (http://www.lemursandlakes.tk/)
    About time. I always thought of the 10 word limit as gogle's biggest setback.
    • Re:Finnally. by smitty_one_each (Score:2) Saturday January 22 2005, @09:31PM
    • 2 replies beneath your current threshold.
  • Great (Score:5, Interesting)

    by lastninja (237588) on Saturday January 22 2005, @08:27PM (#11444983)
    Now you can search for quotes, without having to strip half of the words away. Just cut and paste it in to the browser. I guess this will also make it easier to search for source-code, as it is now you will likely end up at a documentation - site. When you want is some sourcefile from some Sourceforge project.
  • 32 word searching increases the complexity of the search many times over. For a ten word search you're usually talking about finding all documents with all ten words, ordering them by how many of the searched terms were found, and then by their linked-to values. With 32 you're finding ~3.2x as many documents, comparing for 3.2x as many words in each documents, and then finding how popular they were.

    So, um, wow.
  • searching for non a-z characters (Score:5, Insightful)

    by fluor2 (242824) on Saturday January 22 2005, @08:56PM (#11445098)
    characters like !,.'$ is pretty much not supported by google. i would like those to be included in the future.
  • Matching MSN Search? (Score:5, Interesting)

    by Utopia (149375) on Saturday January 22 2005, @10:07PM (#11445482)
    Looks like the limit was raised to match
    MSN's new search [msn.com] whih has has sported a bigger word limit for quite some time.

  • Great! (Score:1, Funny)

    by Anonymous Coward on Saturday January 22 2005, @10:12PM (#11445507)
    Now when I do really specific searchs I can get truly relevant google ads!
  • Good for searching multiple sites (Score:3, Interesting)

    by prostoalex (308614) * on Saturday January 22 2005, @10:13PM (#11445510)
    (Last Journal: Monday October 23 2006, @12:44PM)
    I discovered how to make a Firefox plugin for limiting Google searches [moskalyuk.com] to select few sites, but the problem before was that each site:domainname.com directive was treated as a term. So if you wanted to search 7 sites at once, then google would let you enter maximum of 3 keywords to span that search across multiple sites. So this keywords increase, you can do stuff like 5-word searches across 10 domain names, for example.
    • Re:Good for searching multiple sites (Score:5, Insightful)

      by gl4ss (559668) on Sunday January 23 2005, @01:15AM (#11446282)
      (http://--/ | Last Journal: Monday December 09 2002, @05:12PM)
      though.. it's still not good enough.

      what I would hope for them to introduce would be a word blacklist that would be personal, and that you could include at least a thousand terms in it.

      why? TO AVOID THOSE FUCKING LINKFARMS, they usually have the same advert links in them so just adding the referral id of the owner of a certain farm will get a lot of meaningless sites out of the search. it's doable now if you make your own program that does the filtering(using googleapi. there's two ways, either go to the sites yourself or request the cache from google.. massive traffic in any case for you and the search will take ages to complete).
      [ Parent ]
  • The problem with getting good search results are synonyms (different words that mean the same thing) and homonyms (the same word that means different things). With the 32 word limit, you can avoid both of these problems by following a few simple steps- Let's say, for instance, that you live in new york city and are looking for a moving company that specializes in fragile antiques... typically, the vagueness of such a query makes it hard to find good results, but not if you follow these steps:

    1. Break your search into 2-4 principal, independent concepts- In my example, the concepts are NYC (the location) moving company (the company type) and antiques (the specialty)

    2. For each concept, come up with as many terms as you can that are descriptions or examples of the concept that are very specific and won't trigger homonyms- For instance, you wouldn't want to use the word "New York" because it is too vague and could refer to the state (a company in Albany, NY won't help you). However, "NYC" "Long Island" "Brooklyn" "Queens" "New York City" are great, even if they seem overly specific- You just need one of them to cause a hit on a relevant page.

    3. Put parenthesis around the terms for each concept (be sure to put quotes around each compound term) and OR together the items inside parentheses.

    This is what the entire search might look like:

    ("NYC" OR "Long Island" OR "Brooklyn" OR "Queens" OR "Manhattan" OR "Bronx" OR "New York City" OR "Big Apple") ("moving company" OR "moving companies" OR "specialy movers" OR "professional movers" OR "u-haul" OR "apartment movers") ("fragile" OR "antiques" OR "china" OR "difficult to move")

    It takes a bit of time to put together (and google will run slooooow because this kind of logic is very difficult for the search engine), but a search like this will give you the best possible results on hard queries.
  • I thought so... (Score:1)

    by Anti_Climax (447121) on Saturday January 22 2005, @11:01PM (#11445756)
    I was searching last night for Warez^H^H^H^H^HOpen Source Software downloads and it wasn't giving me any greif about what seemed to ba a fairly long search string.

    [/curiousity]
  • Regexp (Score:4, Insightful)

    by John Hasler (414242) on Saturday January 22 2005, @11:19PM (#11445840)
    Now, if they will just accept regular expressions.
    • Re:Regexp by Scaba (Score:2) Sunday January 23 2005, @02:43AM
      • 1 reply beneath your current threshold.
    • Re:Regexp by Alomex (Score:2) Sunday January 23 2005, @09:47AM
      • Re:Regexp by vladd_rom (Score:2) Sunday January 23 2005, @12:59PM
        • Re:Regexp by Alomex (Score:2) Sunday January 23 2005, @03:25PM
          • Re:Regexp by Minna Kirai (Score:2) Sunday January 30 2005, @08:54AM
            • Re:Regexp by Alomex (Score:2) Sunday January 30 2005, @11:10AM
              • Re:Regexp by Minna Kirai (Score:2) Monday January 31 2005, @05:24PM
              • Re:Regexp by Alomex (Score:2) Monday January 31 2005, @08:12PM
            • Re:Regexp by jasonwea (Score:1) Thursday February 03 2005, @06:06AM
  • Google API? Useless. (Score:3, Insightful)

    by Guspaz (556486) on Saturday January 22 2005, @11:57PM (#11445989)
    (http://novasearch.net/)
    "it's also of great help to certain tools using the Google API"

    Hardly. The Google API is limited to 1000 searches per day, making it useless for any sort of web application. About the only thing I can think of that it would be useful for is a desktop program in which the user would only perform a limited number of searches.
  • Google Grid (Score:2)

    by digitalgimpus (468277) on Sunday January 23 2005, @12:57AM (#11446225)
    Seems like another step in the evolution towards Google Grid / EPIC [broom.org]
  • by SufficatedDeveloper (816025) on Sunday January 23 2005, @04:03PM (#11449767)
    The 32 word thing is cool... But adding the ability to add distribution lists to my contacts in GMail would be WAY more useful
  • by binaryspiral (784263) on Sunday January 23 2005, @09:36PM (#11452048)
    Now I'll be able to search for the exact error message my windows boxes toss at me. Woo hoo!

    If google had raised it's limits earlier, I could have skipped that school diploma and just went right into I.T. support.
  • by alexo (9335) on Monday January 24 2005, @04:46PM (#11461212)
    (Last Journal: Wednesday December 01 2004, @10:15PM)

    In no particular order:

    * A better query language, with wildacrds ("Word*") or stemming, proximity operators, parentheses, complex boolean expressions (something like what Dejanews and the pre-Yahoo AltaVista used to offer).

    * Filtering out linkfarms and search-pages.
  • Mod Parent Up (Score:2)

    by zarthrag (650912) on Saturday January 22 2005, @10:18PM (#11445529)
    It's true, I haven't been able to check it ALL DAY- - That's almost as bad as slashdot being down. Some of my other friends have access still, but my account is teh suxxor at the moment.
    [ Parent ]
  • 4 replies beneath your current threshold.