Augmenting Data Beats Better Algorithms 179
eldavojohn writes "A teacher is offering empirical evidence that when you're mining data, augmenting data is better than a better algorithm. He explains that he had teams in his class enter the Netflix challenge, and two teams went two different ways. One team used a better algorithm while the other harvested augmenting data on movies from the Internet Movie Database. And this team, which used a simpler algorithm, did much better — nearly as well as the best algorithm on the boards for the $1 million challenge. The teacher relates this back to Google's page ranking algorithm and presents a pretty convincing argument. What do you think? Will more data usually perform better than a better algorithm?"
Hold on a sec... (Score:5, Funny)
I need more data.
Re:attn computer scientists: stop renaming stuff (Score:5, Funny)
Re:attn computer scientists: stop renaming stuff (Score:5, Funny)
Re:attn computer scientists: stop renaming stuff (Score:5, Funny)
Re:attn computer scientists: stop renaming stuff (Score:5, Funny)
Mathematics is physics without purpose, Chemistry is physics without thought, Engineering is physics without tenure.
Re:attn computer scientists: stop renaming stuff (Score:5, Funny)
Riiiht. And mathematical research is just finding a Hamiltonian cycle in a graph defined by the set of axioms used.
Re:Um, Yes? (Score:3, Funny)
Re:attn computer scientists: stop renaming stuff (Score:1, Funny)
Re:attn computer scientists: stop renaming stuff (Score:3, Funny)
Adapted from a joke I saw on Jester the other day:
A physicist, a computer scientist and a mathematician are sharing a hotel room. It must have bad wiring or something.
Late at night when they're all asleep a small fire starts in the room. The smell of smoke wakes the physicist. He gets up, notices the fire and looking round the room, sees a bucket and a sink. He calculates how much water will be required, fills the bucket with precisely that much, douses the flames and goes back to bed.
A little later, another small fire starts. This time the smell of smokes wakes the computer scientist. He wakes up and sees the flames. He looks around and sees the bucket and the sink. He reasons that calculating the quantity of water required would take at least as long as filling the bucket, so he fills it right up, douses the flames and goes back to bed.
Again there is a fire. This time the mathematician smells the smoke and wakes up. He sees the flames, sees the bucket and the sink. He exclaims "there is a solution!" and goes back to bed.