Follow Slashdot blog updates by subscribing to our blog RSS feed

Collaborative Filtering and the Rise of Ensembles 58

Posted by timothy on Tuesday September 01, 2009 @02:49PM from the soon-there-will-be-symphonies dept.

igrigorik writes "First the Netflix challenge was won with the help of ensemble techniques, and now the GitHub challenge is over, and more than half of the top entries are also based on ensembles. Good knowledge of statistics, psychology and algorithms is still crucial, but the ensemble technique alone has the potential to make the collaborative filtering space a lot more, well, collaborative! Here's a look at the basic theory behind ensembles, how they shaped the results of the GitHub challenge, and how this pattern can be used in the future."

This discussion has been archived. No new comments can be posted.

Collaborative Filtering and Rise of Ensembles

Search 58 Comments Log In/Create an Account

Comments Filter:

Group Labor (Score:2, Informative)

by r6_jason ( 893331 ) writes: on Tuesday September 01, 2009 @03:04PM (#29276973) Homepage

Of course having a group of people working together is a strength. If you are having a bad day or just feel like slacking off some one else is there to pick up the slack and keep the project moving. See also Division of labour http://en.wikipedia.org/wiki/Division_of_labour [wikipedia.org]

Share
twitter facebook
Re:Management Summary ? (Score:3, Informative)

by Trepidity ( 597 ) writes: <delirium-slashdotNO@SPAMhackish.org> on Tuesday September 01, 2009 @04:04PM (#29277561)

Here's a stab at 3 sentences:
Making a prediction by running multiple statistical prediction algorithms and combining their results often seems to work well. This is called an "ensemble method". Ensemble methods seem to work particularly well on collaborative filtering problems.

Parent Share
twitter facebook
Re:Open source governance (Score:4, Informative)

by kthejoker ( 931838 ) writes: on Tuesday September 01, 2009 @04:16PM (#29277679)

"Fitness" doesn't always have to be related to the output; it can be related to the quality of a guessed input.
Consider the corollary of a poll test: a model in which "trusted" voters receive extra votes while everyone else still gets on vote. You can determine "trustworthiness" (or "karma", if you will) the same way Slashdot does - through moderation and meta-moderation, or you can use a more objective "de minimis research" criteria (like a poll test but without the punishment for failure.)
So someone voting on a school board bond election who can correctly answer questions about the stated usage of that bond, or the school district's financial bond rating, or who attends a school board meeting discussing the bond, could get 2 votes for the price of one.
This would a) allow "passionate" (albeit informed) voters to have more of a say than someone who is indifferent, and b) encourage people to do research and get involved in politics.
In a way, it's anti-democratic, but if you are going to insert any sort of elitism into the system, it might as well be a meritocracy.

Parent Share
twitter facebook
Re:I'm sorry (Score:5, Informative)

by Trepidity ( 597 ) writes: <delirium-slashdotNO@SPAMhackish.org> on Tuesday September 01, 2009 @04:21PM (#29277731)

Yeah, the term dates back at least to the 1990s. The classic survey paper (over 1000 citations!) on the subject is "Ensemble Methods in Machine Learning" [pdf] [wsu.edu] by Tom Dietterich (2000), for those who want to glance through a survey. Though be warned that some of its specific conclusions are now dated--- e.g. there's been a *lot* written in both statistics and machine learning since then on what boosting "really" is and why it works.
Dietterich presents the more machine-learning view of it, focused on algorithms, combination of predictions, iterative refinement, etc. The best survey from a statistical approach is probably Ch. 16 of this book [amazon.com] by three [stanford.edu] Stanford [stanford.edu] profs [stanford.edu], which you can probably read some of on Google Books [google.com].

Parent Share
twitter facebook

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

Collaborative Filtering and the Rise of Ensembles 58

Collaborative Filtering and Rise of Ensembles More Login

Collaborative Filtering and Rise of Ensembles

Group Labor (Score:2, Informative)

Re:Management Summary ? (Score:3, Informative)

Re:Open source governance (Score:4, Informative)

Re:I'm sorry (Score:5, Informative)

Related Links Top of the: day, week, month.

Slashdot Top Deals

Slashdot