R Throwdown Challenge 185
theodp (442580) writes "'R beats Python!' screams the headline at Prof. Norm Matloff's Mad (Data) Scientist blog. 'R beats Julia! Anyone else wanna challenge R?' Not that he has anything against Python, Matloff adds, but he just doesn't believe that Python or Julia will become 'the new R' anytime soon, or ever. Why? 'R is written by statisticians, for statisticians,' explains Matloff. 'It matters. An Argentinian chef, say, who wants to make Japanese sushi may get all the ingredients right, but likely it just won't work out quite the same. Similarly, a Pythonista could certainly cook up some code for some statistical procedure by reading a statistics book, but it wouldn't be quite same. It would likely be missing some things of interest to the practicing statistician. And R is Statistically Correct.'"
Re:Bad analogy (Score:5, Interesting)
Exactly. Julia will eat R for lunch soon enough, I think. It's an elegant, well designed and efficient language. It's only been around for a couple of years, and has a very vibrant and rapidly growing community.
Check it out for yourself: The Julia Language Homepage [julialang.org]. It's got a lot to offer anyone with an interest in mathematics, including statisticians. It's based on the LLVM, and interfaces trivially with C libraries - plus it's a very fast language in it's own right, unlike R or Python.
Re:true, but not really because of R itself (Score:4, Interesting)
Re:Bad analogy (Score:5, Interesting)
my friend uses julia, and every few weeks complains about some bug. the other day he mentioned that the latest release broke Bernoulli sampling (wtf?). the others have been pretty fundamental too.
this is a serious problem, of course. the other one is lack of libraries. R is an abysmal pile of shit, but at least it's a standard; pretty much 95%+ of applied stats is at least partially supported by someone's hacked-up library/package. julia is far, far short of that, and it appears that much of its community is more interested in pretty graphics, meta-wankery, and interface methodology than actual working statistics (not that there's anything wrong with that per se).
yeah, yeah, "fix it yourself," and it's on my list to write at least a basic survival analysis package for it. but i wouldn't blame anyone for not using it, and i wouldn't recommend it for doing stats as it is now.
Re:Data mining (Score:2, Interesting)
You got the title wrong.
_Numerical Recipes in C_, by Press, W. et al
http://www.amazon.com/Numerical-Recipes-Scientific-Computing-Edition/dp/0521431085
IIRC there was also a _Numerical Recipes in FORTRAN_ as well.
Also see http://www.nr.com/ . I think they only have a single book now called _Numerical Recipes_ and it is in its third edition.
Re:true, but not really because of R itself (Score:4, Interesting)
R has some pretty unique graphing packages. Nothing that I know of matches the way you can do 2D and 3D plots in R. Not Python, not Gnuplot, not Julia, not Matlab, not Excel, not Mathematica, nothing.
Re:I dislike Python (Score:4, Interesting)
Believe it or not, most statisticians are not programming wizards.
Most stats guys use R, matlab, mathematica, or something similar. Even if it takes days to run a program that would take 20 minutes in C. Sort of like how the business guys will use VBA when they need anything, because that's what they know.
Languages like R are used because they are accessible. And once they reach a critical mass, everyone learns them in a field.
Sort of like how Fortran just won't die.
Beats python at what? (Score:4, Interesting)