Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Slashdot Log In

Log In

Create Account  |  Retrieve Password

Google Releases Web APIs

Posted by Hemos on Fri Apr 12, 2002 06:57 AM
from the developing-with-google dept.
skunkeh writes "Google have released the first beta of their Web APIs package. Used in conjunction with a free license key this SOAP based web service allows developers to execute up to 1000 automated queries a day, but is currently available for non-commercial use only. The download comes with Java and .NET code examples and includes a WSDL description for use with other SOAP supporting languages." There's also a write up about uses on Userland.
+ -
story
This discussion has been archived. No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More
Loading... please wait.
  • by skunkeh (410004) on Friday April 12 2002, @07:00AM (#3328731)
    A list of implementations of the Google Web API can be found on SoapWare:

    http://www.soapware.org/directory/4/services/googl eApi/implementations [soapware.org]

    At the time of posting languages catered for were for AppleScript, Frontier/Radio, Perl, Python and Visual Basic. I've written a basic implementation in PHP which has yet to be added to the list - you can find it here:

    http://toys.incutio.com/php/php-google-web-api.htm l [incutio.com]

    This is a very cool toy.

  • This story refers to (Score:3, Informative)

    by sydneyfong (410107) on Friday April 12 2002, @07:08AM (#3328748) Homepage Journal
  • http://radio.weblogs.com/0100012/stories/2002/04/1 1/applescriptForGoogleApi.html

    has some Applescript for your use
  • by shankark (324928) on Friday April 12 2002, @07:16AM (#3328770)
    Other than being a really cool idea, this is a great tactical move from Google. On the one hand, by restricting the number of queries made to Google, they ensure that their APIs aren't misused/compromised, it also gives companies an initiative to purchase Google products and deploy this API (probably an unrestricted-query API) on their own network. Furthermore, an API such as this will easily muscle out any sniff of a competition from other search engine wannabes. Google has managed to do all this and yet be as compliant
    to an Open Source initiative as possible. Remarkable.
    • by skunkeh (410004) on Friday April 12 2002, @07:21AM (#3328786)
      From the FAQ [google.com]:
      2. Does Google have any plans to sell Google Web APIs as a service?

      Not at this time.

      Which seems very strange seeing as this could be a huge money spinner. Surely a license system which allows commercial users to subscribe to a certain number of queries a day, or just buy queries in bulk would generate a lot of income for Google and provide a valuable service to the internet business community at large.
      • by gowen (141411) <gwowen@gmail.com> on Friday April 12 2002, @07:53AM (#3328872) Homepage Journal
        2. Does Google have any plans to sell Google Web APIs as a service?

        Not at this time.
        Sure, thats what they say, but what they mean is :

        "of course we do. You think we're doing this out of love? But we don't know what they are just yet, and we want to get things right. So go away, and we'll put out a press release as and when we're ready."

        Which is fair enough.
      • I wonder if Google is already providing some sort of paid service to large corporations. On my website on day I actually got a hit coming from a chap at the Redmond campus of Microsoft, and he was searching via http://www.google.com/microsoft [google.com] : BTW, at the time I'm quite sure that that page actually displayed the Microsoft logo as well.

        • Google makes these really, really cool little search appliance things that you set up on your network. Slashdot has a story about it, and I'm sure you can find some stuff on the google site. They also provide site indexing services.
    • ...not to mention that the 1,000 query limit/ day is only whetting an appetite. Any wagers on whether or not there will be a sweet little pricetag on 10,000 queries/ day or unlimited queries/ day? A pricetag corporate clients will gobble up? Remarkable indeed!
  • Example of use (Score:5, Informative)

    by dtr20 (442135) on Friday April 12 2002, @07:23AM (#3328795)
    I just had a go with this and some example output is displayed below. Basically you can do a search of their main web pages, request a cached page or use their spellchecker.

    Dave

    $ java -cp googleapi.jar com.google.soap.search.GoogleAPIDemo XXmykeyXX search "british empire"
    Parameters:
    Client key = XXmykeyXX
    Directive = search
    Args = british empire
    Google Search Results:
    ======================
    {
    TM = 0.117071
    Q = "british empire"
    CT = ""
    TT = ""
    CATs =
    {
    {SE="", FVN="Top/Regional/Europe/United_Kingdom/Society_an d_Culture/History"}
    }
    Start Index = 1
    End Index = 10
    Estimated Total Results Number = 688000
    Document Filtering = true
    Estimate Correct = false
    Rs =
    {

    [
    URL = "http://www.btinternet.com/~britishempire/empire/e mpire.htm"
    Title = "The British Empire"
    Snippet = "| Introduction | Articles | Biographies | Timelines
    | Discussio
    n | Map Room | Armed Forces | Art ... "
    Directory Category = {SE="", FVN=""}
    Directory Title = ""
    Summary = ""
    Cached Size = "5k"
    Related information present = true
    Host Name = ""
    ],
    ...
  • by bodin (2097) on Friday April 12 2002, @07:25AM (#3328799) Homepage
    O'Reilly has a good article here with some code as well in both Java and Perl.

    http://www.oreillynet.com/cs/weblog/view/wlg/1283 [oreillynet.com]

  • by AVee (557523) <{gro.eeva} {ta} {todhsals}> on Friday April 12 2002, @07:29AM (#3328809) Homepage
    To create an Google account [google.com] you have te agree with the Google Terms of Service [google.com]. These state the following:

    No Automated Querying

    You may not send automated queries of any sort to Google's system without express permission in advance from Google. Note that "sending automated queries" includes, among other things:
    • using any software which sends queries to Google to determine how a website or webpage "ranks" on Google for various queries;
    • "meta-searching" Google; and
    • performing "offline" searches on Google.
    Now, how can I use the web API?!
    Note that this is not in the Google Api TOS [google.com] wich you must agree to before downloading [google.com] the api. But in the Google Terms of Service [google.com] wich you must agree to before creating a Google account needed to use the Google Api.

    Still, it's fun and i'll play with it!
  • Here's a copy of the write up. My machine barely made it to the site :)
    Google is just the juice
    Thu, Apr 11, 2002; by Dave Winer.
    Good afternoon

    A very quick piece today, a story, a question, an answer and a pointer.

    The story -- 1995. A new release of Netscape. Can't get through to their servers. This thing is exploding. A mind bomb every minute. Wow. I love this. End of story.

    The question: Can it happen again?

    The answer..

    Yes!

    This afternoon Google opened a public SOAP 1.1 interface.

    Now, from scripts, we can call Google as if it were a script running locally.

    What comes back? Data.

    What questions should we ask?

    That's where the mind bombs will come from.

    In the loop

    We've been in the loop with Google, privately, for the last few weeks, so we've had a chance to play with ideas and actually have some.

    Yesterday, as a tease, I put a Google Box on Weblogs.Com. Every hour it recalcs, showing the top 10 hits on Google for the term weblog. To my surprise, it changes, it's not constant. And it took me to places I didn't know about. The serendipity of queries that run for a long time. That, imho, is where the juice is in the Google API; and probably many or most of the APIs that are sure to follow; because Google is so popular.

    Google hits the ball over the net, then we return the volley. Finally, once again, signs of life. Let's hope we learn from the past -- and keep the spark going -- welcoming competition and learning from it instead of snuffing it out. The intoxication of a new idea every day is too good to not want to be there once again.

    Maybe the dark ages are over? I hope so.

    Google is just the juice

    It's happening in real time. As I write this I'm waiting for the embargo to lift. As soon as that happens, we'll start releasing new parts and samples for Radio and Frontier users that connect to Google's SOAP interface, with simple but geekish instructions for getting started.

    Later today Google Boxes will start showing up on Radio weblogs, which you can follow through Weblogs.Com. You'll see SOAP developers, on all platforms, getting to work, creating and publishing the glue that turns the Internet, finally, into a fantastic scripting environment. Google is just the juice we need.

    Dave Winer
  • by Captain Large Face (559804) on Friday April 12 2002, @07:36AM (#3328829) Homepage

    Whilst the potential of a regular Google search is large enough, when you consider the Google search modifiers, the potential becomes staggering. Imagine using the following features:

    • Business Address Lookup
    • File Type Specific Search (.PDF etc..) (filetype:)
    • Stock Quotes
    • Cached Links (/. Favourite) (cache:)
    • Similar Pages (related:)
    • Linked Sites (link:)
    • Site Specific (site:)
    • Maps

    Does anyone happen to know if you can use the other sections of Google (e.g. news, images etc.)?

    Is Google the best company ever or what?!

  • by Captain Large Face (559804) on Friday April 12 2002, @07:53AM (#3328871) Homepage

    I think I speak for most when I ask if you can have your results back in the "interesting" language sets:

  • by sanermind (512885) on Friday April 12 2002, @07:54AM (#3328877)
    If they are going to limit you to only 1000 queries, I fail to see the point. It wouldn't be hard at all to write a simple API on your own to, say, a c++ class that spits out the necessary url's [like http://www.google.com/search?hl=en&q=example]
    or the like, dispatch them to google port 80, and then parse the results into easily program readable data sets/results? A third party could write this sort of thing easily enough if there was demand for it. I mean, esentially the google search API isn't going to be offering anything not available in the standard forms, is it? Except their spell checker, I believe. [Which you could use via html too, actually, "Did you mean: ______" ]
  • What about slashdot? (Score:3, Interesting)

    by aozilla (133143) on Friday April 12 2002, @07:54AM (#3328878) Homepage
    How long until slashdot offers this service?
  • NNTP tunneling ? (Score:3, Insightful)

    by Bert Peers (120166) on Friday April 12 2002, @07:57AM (#3328882) Homepage
    In case the engineers at google are bored now that it's released, here's an idea ;) Open up groups.google.com via a similar API so that an application can get the latest Usenet info even through proxies blocking NNTP and/or newsservers. Showing the latest threads/posts etc on a webpage could be useful too.


    It's not something you have to go to google for, but it'd be nice :)

    • Re:NNTP tunneling ? (Score:4, Interesting)

      by km790816 (78280) <wqhq3gx02NO@SPAMsneakemail.com> on Friday April 12 2002, @08:58AM (#3329152)
      This is where things get interesting.

      Companies have become happy blocking ports to restrict no-nos: messaging, newsgroups, etc.

      I'm wondering how long it will be until we start seeing firewalls that can filter/block SOAP calls for the very reasons you mention. SOAP just forces network admins to move up from ports and protocals to sniffing HTTP requests to keep people from having too much fun.

      Enjoy it while it lasts.
  • The 1000 searches a day is very nice....I know I would never need that many (if results were unlimited anyway).

    HOWEVER...you only get 10 results per search??
  • Anyone else feels that if google ever dissapears, they will become very unproductive?
  • Synthesis (Score:5, Insightful)

    by Asprin (545477) <gsarnold.yahoo@com> on Friday April 12 2002, @09:07AM (#3329189) Homepage Journal
    Ummmmm. Ok, check this out.

    This morning on /. we have an article about Google releasing their SOAP 1.1 API followed immediately by an article from a guy that set up a spambot trap on his web site, and in the margin a poll about giving spammers what they deserve. Putting 2 and 2 and 2 together, I got 4, popped open a google box and started playing.

    All I did was ask google to search for "mailto" and "@msn.com" and lo and behold, she spit back 111,000 hits - hits that contain what look like legit email addresses IN THE THREE LINE SUMMARIES.

    The point is, now that google can be automated, what's to stop spammers from SOAPing their way into Google to do their harvesting? Would there be any point over what they're doing now? It might be cheaper, because you only have to run over the google results not the whole sites and since Google caches pages, you can even grab addresses from the past, somewhat.

    IT ALSO DEFEATS SPAMBOT TRAPS.

    Doesn't this give spammers whole new avenues to exploit?

    Worse, are webmasters going to have to put a halt to Google crawls?
    • &ltsarcasm&gt
      LOL maybe we should just dismantle the whole internet, as clearly the internet is the channel used by spammers! Oh wait. The internet has many many positive uses. Gee!
      &lt/sarcasm&gt


      LOL a 4 for Interesting? Oh come on, this is ignorance, not information.


      Horrors! Spammers can use this!


      Uh 'scuse me but I can write a 10 line perl script that does the same thing. All I have to do is craft a query to google, and put a bunch of work into parsing out the real content from the HTML that comes back. Kind of a pain, but nothing a few regexp can't handle. This API is nothing new, it's just something handy. I'm seriously thinking I can replace a component of a research project here at our research facility with this. Why reinvent the wheel after all?


      Worse, are webmasters going to have to put a halt to Google crawls?


      It's called robots.txt. Ever run a web server? All this API does is let you do searches to google. Google is google is always searching. That's what robots.txt is for. You are not going to get crawled by this! This is not a BOT, just a QUERY TOOL.

  • It seems there is still time to enter the Google Programming Contest [google.com] and although I have neither the time nor the skill to do it, I do have an interesting idea if someone else wants to take a shot at it.

    Years ago, The Hollywood Stock Exchange [hsx.com] was a somewhat popular game (maybe it still is, but it doesn't really interest me). The general idea being that you could "Buy shares of your favorite actors, movies, and music artists and watch their values rise or fall based on the success of their careers and personal life."

    It would be interesting to see a similar game based on the popularity of queries. It's clear from the Google Zeitgeist [google.com] that certain search terms do gain and lose popularity on a regular basis, and for someone tapped in to mainstream culture, it may not be too hard to predict.

    I suppose you could do the same thing with the other info there (Browsers, OSs, Current Events, etc.) but I don't think it would be as interesting. Although... Anime searches might be neat.

    Anyhow, just an idea I'd love to see someone run with.

    -Tommy
  • The Google Rights include rights to the following:......(3) the search results and spell checking you obtain when you use Google Web APIs.

    I never thought I'd read the words "Google Rights" in a legal document, but anyway, how can Google own the rights to "spell checking".. what exactly do they own? The words that come back? The association of misspelled words to spelled words? How could you abuse that??

    I must say this is incredibly cool though.. however I would much rather see a generic "Search Engine API" that isn't owned by Google, and can be implemented by anyone.

  • Not needed (Score:3, Interesting)

    by NineNine (235196) on Friday April 12 2002, @12:12PM (#3330263) Homepage
    Why is this needed? I've been using Google programatically for a while now. What does this offer that I can't use on my own?
    • Legality [google.com] for one thing:
      No Automated Querying

      You may not send automated queries of any sort to Google's system without express permission in advance from Google. Note that "sending automated queries" includes, among other things:

      • using any software which sends queries to Google to determine how a website or webpage "ranks" on Google for various queries;
      • "meta-searching" Google; and
      • performing "offline" searches on Google.
      It also stops your scripts from breaking every time Google redesign their results page.
    • Except maybe in Europe. And some other places, too. I know when I was in London I noticed that collective nouns were treated as plural: IBM have released... and so on.

      Same's true if you watch enough Britcoms or other British imports (damn do they make good crime dramas!).

      My 2 cents...

      GTRacer
      - Should be returning to England in a year or so...

    • by Software (179033) on Friday April 12 2002, @11:50AM (#3330128) Homepage Journal
      OK, your script parses Google's HTML output today, but what about a year from now when Google changes its output, to say, XHTML or plain text or something. How well will your script work then? Although the Google API could change tommorow like some companies' [microsoft.com], in general APIs are more stable. I haven't looked at their API, but I'm guessing it's also easier to develop against their API, and it should be less processor- and network-intensive.