Google Programming Contest Winner 229
asqui writes "The First Annual Google Programming Contest, announced about 4 months ago has ended. The winner is Daniel Egnor, a former Microsoft employee. His project converted addresses found in documents to latitude-longitude coordinates and built a two-dimensional index of these coordinates, thereby allowing you to limit your query to a certain radius from a geographical location. Good for difficult questions like "Where is the nearest all-night pizza place that will deliver at this hour?". Unfortunately there is no mention whether this technology is on its way to the google labs yet. There are also details of 5 other excellent project submissions that didn't quite make it."
if i'd only known (Score:2, Interesting)
The winning idea was cool, but the rest looks
like free development for google rather
than something novel.
Re:if i'd only known (Score:4, Informative)
Let me quote from the homepage of the annual contest:
"Grand Prize
$10,000 in cash
VIP visit to Google Inc. in Mountain View, California
Potentially run your prize-winning code on Google's multi-billion document repository (circumstances permitting)"
Re:if i'd only known (Score:2)
Re: (Score:1)
Re:if i'd only known (Score:2)
Re:if i'd only known (Score:3, Funny)
As previously designed by me (Score:1)
Dosn't do the document lookup thing, but we were using it for finding the neariest piza on a now defunct e-commerce website.
Re:As previously designed by me (Score:1)
I suppose those maps are a bit out-of-date now, aren't they
There always out of date (Score:1)
Get a book on basic qauntum mechanics and it will tell you that your observations are always out-of-date.
What's wrong with Afganistan anyhow, they seemed a nice bunch of people, with a strong religious following before the US regieme ousted there lovley government.
Re:There always out of date (Score:1)
like it or not, the us millitary did some MAJOR terraforming over there.
pot-holes (Score:1)
Lets join up (Score:1)
Who the hell was judging that thing anyhow, there's geo searches all over the place, and i've done plenty of address parsing code in my time!!!
I see one being implemented soon (Score:5, Interesting)
This may help to defeat the current practice of overloading the PageRank results of a given key word as to point to a given page by having people link to that page with a link containing that keyword, aka "Googlebombing". I do think that the winner is a very interesting and useful project, this latter one will probably be implemented ASAP.
Re:I see one being implemented soon (Score:2)
Re:I see one being implemented soon (Score:1)
Re:I see one being implemented soon (Score:2, Informative)
Markovian Dependece [google.com]- The condition where observations in a time series are dependent on previous observations in the near term. Markovian dependence dies quickly, while long-memory effects like Hurst dependence, decay over very long time periods.
Markov processes (Score:3, Informative)
A Markov process is basically a series of random variables where the value of random variable X^(i+1) only depends on X^i. The idea is that if you want to predict the value of X^(i+1), all of the information you could possibly use is in the value of X^i.
Lots of processes are Markovian- for instance, a random walk. If you're at point x at time t, then you know that there's a fifty-fifty chance you will be at x-1 or x+1 at time t+1. Knowing all of the previous points along the random walk won't help you predict the next point any better than that.
Re:I see one being implemented soon (Score:1)
I sent something into the contest. (Score:4, Funny)
Mod parent up! (Score:1)
Re:I sent something into the contest. (Score:1)
Re:I sent something into the contest. (Score:1)
Re:I sent something into the contest. (Score:2)
What a great idea (Score:4, Funny)
Re:What a great idea (Score:1)
Re:What a great idea (Score:1, Offtopic)
Re:What a great idea (Score:1, Offtopic)
BTW, this isn't O/T in the context of this thread.
Google Search (Score:4, Funny)
Latitude/Longitude => 37/180, Pak
Capture
If this would have come out before we could have saved a country
Search = Osama Bin Laden (Score:1)
Re:saved a country (Score:2)
US gov has been trying to
capture Bin Laden before 9/11.
At least some changes would
not have occured that fast.
Idea for a Google Query..... (Score:5, Funny)
"Google Sets" (Score:2)
more details (Score:5, Informative)
This is impressive bit of database manipulation. Somehow I didn't think that all of the datatypes, etc would be so easily parsed.
Although I do recall telephone directories that used to give you results for a specified radius for certain types of businesses
Re:more details (Score:3, Insightful)
Re:more details (Score:5, Insightful)
Re:more details (Score:2)
Google chose his project because his code was clean and robust. It shouldn't be to difficult to get it to work with other data.
Re:more details (Score:4, Informative)
Although I do recall telephone directories that used to give you results for a specified radius for certain types of businesses
That's just a standard spatial query. It's easy to implement an R-Tree to be able to do (relatively) quick "give me points within x meters of this one" type of searches on a database. There's nothing extremely revolutionary about Daniel's project, anyone with some basic geometry knowledge and the patience to download the 33GB of TIGER data could have done it within the course of a few weeks. (Ironically enough I've been doing the same thing with 1.2 million addresses against TIGER data for the past month.)
But that's the true genius and beauty of it. Now that it's been said, it's such a mindbogglingly obvious and useful application of web search and spatial search technology that it's hard to believe nobody thought of it before.
I'd be honestly surprised if Google doesn't run with the ball and fold it into their main search engine. The only thing standing in the way is the storage space and CPU time to do it.
Re:more details (Score:2)
. . . Two things Google should have no trouble finding. [slashdot.org]
404 Page Not Found ? (Score:5, Interesting)
Shouldn't Google automatically check results that a user follows and flag those that cannot be displayed ?
Re:404 Page Not Found ? (Score:3, Insightful)
Thomas Phelps and Robert Wilensky, for their project, Robust Hyperlinks. Traditional hyperlinks are very brittle, in that they are useless if the page later moves to a different URL. This project improves upon traditional hyperlinks by creating a signature of the target page, selecting a set of very rare words that uniquely identify the page, and relying on a search engine query for those rare words to find the page in the future. For example, the Google programming contest can be found using this link.
Re:404 Page Not Found ? (Score:3, Informative)
Re:404 Page Not Found ? (Score:3, Interesting)
What should they do if a page is unavailable, though? What if it's only down for a few seconds?
Re:404 Page Not Found ? (Score:1)
Re:404 Page Not Found ? (Score:5, Insightful)
I would much prefer to see them improve the ease of browsing their cache. Specifically, if a cached site is 404, then present a cached version of the site where all clicks within the site simply link to the cached version, unlike today where all clicks are native (and therefore lead to more 404's). Granted that wouldn't be of any use for links to dynamic pages, but anything is better than what they have today.
Re:404 Page Not Found ? (Score:1)
Re:404 Page Not Found ? (Score:3, Informative)
Re:404 Page Not Found ? (Score:2)
winner is... (Score:1, Troll)
All kidding aside, sounds pretty neat.
Re:winner is... (Score:2)
n0 u 533, h3 0wnz0r3d g00g13 w14h h15 31337 m1cr0$0f4 h4x0r1ng 5ki11z!!!
God, help us all... I *knew* we couldn't go an entire story without someone freaking out about the whole "used to work for Microsoft bit."
Ho hum, back to my OBSD boxen...
Geographical Approximation (Score:1)
Re:Geographical Approximation (Score:3, Interesting)
Re:Geographical Approximation (Score:1)
Dig it?
Not earth shattering, but useful (Score:2, Informative)
Service already exists (Score:2)
Last time I used Lasoo was on Mother's Day, to find the closet florist to my mom's house.
Re:Service already exists (Score:3, Insightful)
The cool thing about the winning google entry is that it actually deduces the location of the search result by finding and parsing any address information that appears on the site!
I think that's pretty clever. - Does anyone know if it's limited to the US?
--
Andy
Nice (Score:3, Interesting)
What would be cool, would be the option to right click on the hyperlink and have the option "Find alternative location".
Or even cooler, have IE (or your favourite browser) on putting up the 404 message have a hyperlink which does the same. Hell, easy enough to do with apache.
Re:Nice (Score:1)
Would be a nice addition to the "google toolbar" though.
Re:Nice (Score:1)
Re: (Score:2)
Funny. (Score:1)
I knew I should have patented it... (Score:2)
But the idea of using it just to find business within a certain radius is very limited thinking.
Mobile phones will soon be broadcasting their position. You want interactive guided tours of a city? How about playing full size monopoly? Driving directions? Any sign you currently see could be removed and replaced with a virtual sign? Any number of VR worlds played out in meat space? etc etc
I think that the ability to automatically tell someone where you are will prove to be a boon.
Kudos to the developer for carrying through, rather than my lazy ass postulating
runner up (Score:2)
sounds a lot like Google sets [google.com]
Robust Hyperlinks has to be my favourite.
NetGeo (Score:5, Informative)
Re:NetGeo (Score:2)
Cached Longitude and Latitude (Score:1, Troll)
More uses (Score:5, Funny)
Results: Apt 2D
Re:More uses (Score:2, Funny)
But how many milliseconds would it take before she's slashdotted?
Re:More uses (Score:2, Funny)
Error: This server is too busy to deal with your request
locatity of businesses and services (Score:1)
have a new meta tag class for "location" which would include the GPS (or long/lat, etc.) co-ordinates for the business, etc. along with human readable country, such as UK, FR, NL, US, NL, etc. and region.
then have another tag for how far the service provider/company does business for with rules for excluding and including areas.
so if a copmany operates, i.e. delivers, within a 3 mile radius then a search engine will know. if they operate in their local state/county it will know. if it operates regionally such as Europe wide or Globally it would be easy to locate a provider of goods and services which will be able to help. The exclusions could cover things like states which the product is deemed illegal or that the merchant doesn't wish to do business with.
Of course you'd need to let the search tools know where abouts you are in order to refine the results. But this needn't be the source of any privacy issues as the engine could submit a complete set of results and be filtered out at the client end dynamically if the user is paranoid.
what do you think?
Mb
Re:Quick! Patent that idea! (Score:1)
but my submitting my idea on this public forum I think I've lost any chance of getting a patent, at least in the UK. Our patent office might be slow, but they're a lot better than the US one. Part of getting a patent over here is that you aren't allowed to disclose anything about the idea until the patent is granted. If that was true in the US you've would have a lot less problems with the patent office and dumb patents such as "1-Click Shopping", etc.
Plus I don't think company's could patent the "discovery" of stuff that we already have in our bodies, etc. such as genes, unless they are artificially produced or altered ones that don't exist in nature.
Also, software shouldn't be patentable imho. Copyright takes enough away from us already without a dumb patent saying that I can't write a program which manipulates some data in a certain way without paying company X some money.
Our kids and grandkids are going to grow up in a world where the only public domain material is the same as what we have... i.e. all from the 19th century and earlier. Star Wars, Mickey Mouse and such like will remain in copyright *forever*, as long as a company can milk them for more money. Just think about that, in 2902 you will probably get locked up in an iso-tube for 55 years for copying Star Wars (6 film collectors edition - now in 3d) using your fingernail media duplicator.
watch out, google. (Score:2)
so as a former microsoft employee he's going to run his code on the google servers.
Nah, this one's too easy!
A project that really could be good for Google... (Score:5, Informative)
So you think that Google's results are fair? You're wrong. The best ranked results are from sites that heavily cheat.
Since Google has aggressively removed fake generated sites linking to each other, new ways of cheating have been immediately adopted.
Apart from cloaking (what the Google crawler sees is different from what user see), generated sites now include fake generated english-like sentences in order to make Google think the text is real. Spam indexing is now distributed on multiple IPs. Content is dynamic, it changes everyday (random links and texts are generated) . Temporary sites are hosted on external (yet non-blacklisted) cheap colocated servers. Invisible frames are added, etc.
I'm not innocently talking about that because the company I'm working for is actively doing it. And it works. And they say "Spam? Uh? Who's talking about Spam? It bring us money, so it's not spam, it's our business".
There are ways to prevents cheating on Google. It's probably very complex, but it's realisable. If any human looks at our 'spam site', he will immediately discover that it's not a real site. It's a mess, just for keywords and links.
If such a project had been made for the Google content, it would have been wonderful.
Google is still the best search engine out there. Their technology rocks, and they are always looking for innovation. But what could make an huge difference between and other search engines is : fair results. Same wheel of fortune for everybody.
Yet this is not the case. Trust me, all well ranked web sites for common keywords belong to a few companies that are actively cheating.
Not all well-ranked sites for common words cheat (Score:3, Interesting)
This statement is easily refuted. Type "linux". The 10 sites you see all belong there, and I can guarantee you that most of them are not engaging in cheating.
But since you admit that you work for a company that engages in this practice, perhaps it helps you sleep at night to believe that "everybody does it".
Re:Not all well-ranked sites for common words chea (Score:2)
Linux is not a common keyword. Linux is not something that bring money. Therefore people don't need to cheat for Linux-related sites.
A good source of innovation... (Score:1)
Geographically limited browsing (Score:1)
As well as any other of the many geography based rules, laws, taxes, restrictions, etc that we have seen talked about on
Daniel Egnor's "Iocaine Powder" (Score:4, Interesting)
In a weird coincidence, I just spent a half-hour last night lecturing about Daniel Egnor's Iocaine Powder [ofb.net], winner of the First International RoShamBo Programming Competition [ualberta.ca]. Credit this guy with two award-winning pieces of extreme programming cleverness!
More Information About the Winner (Score:5, Informative)
Why I never would have written that program. (Score:2)
How many hosts implement their coordinates in their info any more?
5%? 10%?
80% omit it because admins are lazy, and 10% omit it for security reasons.
So Google just gave an award to a tool with half the batting average of a bad baseball player.
--Blair
Re:Why I never would have written that program. (Score:2)
RTFA - it's nothing to do with the hosts record. It parses the addresses from the pages themselves.
Smooth Move Google (Score:3, Insightful)
I'm a little disappointed. (Score:2)
I wonder what this person worked on at Microsoft.. (Score:2)
Speech Recognition (Score:2)
Am I the only one who thinks this would be useful for speech recognition? If you just detected a "federal" and you have two possibilities for the next word, "law" and "paw" say, the software would know it's more likely to be "law". Federal paw is probably fairly uncommon and yet this is exactly the mistake that current software makes.
Wired article (Score:2)
Re:good news for open source? (Score:1)
Re:good news for open source? (Score:2)
Re:good news for open source? (Score:2)
Re:About the "Former Microsoft Employee" bit.. (Score:2, Insightful)
Re:About the "Former Microsoft Employee" bit.. (Score:5, Funny)
Actually he was employed by XYZFind Corp. Literally. And it didn't show up.
Re:About the "Former Microsoft Employee" bit.. (Score:2, Informative)
Yes really, it's not a large room full of monkeys!
Re:About the "Former Microsoft Employee" bit.. (Score:1)
Re:About the "Former Microsoft Employee" bit.. (Score:1, Informative)
Re:About the "Former Microsoft Employee" bit.. (Score:3, Funny)
Being a former M$ employee tells me he learned quite a bit.
Re:About the "Former Microsoft Employee" bit.. (Score:2)
So I think there are some programmers at Microsoft that you could learn from (not by seeing their mistakes).
-jfedor
Re:About the "Former Microsoft Employee" bit.. (Score:2)
Re:About the "Former Microsoft Employee" bit.. (Score:1)
Come on...
Re:About the "Former Microsoft Employee" bit.. (Score:1)
Sigh...
In submitted articles, italics designate the submitted article, while normal text indicates Michael's or CmdrTaco's, etc., additions.
Note that this one was all italics, meaning the 'former Microsoft bit' was included by the person submitting the article.
Re:Some might think.. (Score:2)
Though their operating systems may be riddled with bugs and security flaws of all sorts, look at their applications. They tend to be the epitome of quality software.
Yeah, right. That one dancing PaperclipDude was the "epitome of quality software".
Me: (starts writing a letter in Word)
PaperclipDude: "Hi there! It looks like you're writing a letter!"
No shit, Shirlock. What gave it away? The "Dear Sirs" opening line? Shees.
GMD
Re:Anyone here who got the CDROM with data mailed? (Score:2)
I requested the data CDs on February 6th, and got an acknowlegdement email from Google the same day. But I never received the CDs, either, so more or less forgot about the contest until now. And I'm in the US, so they're not discriminating against Germans.
Some fine way to run a contest!
So I'll second the question... did they follow up with anybody?