Follow Slashdot stories on Twitter


Forgot your password?
Open Source Programming Python

Comparing R, Octave, and Python for Data Analysis 61

Here is a breakdown of R, Octave and Python, and how analysts can rely on open-source software and online learning resources to bring data-mining capabilities into their companies. The article breaks down which of the three is easiest to use, which do well with visualizations, which handle big data the best, etc. The lack of a budget shouldn't prevent you from experiencing all the benefits of a top-shelf data analysis package, and each of these options brings its own set of strengths while being much cheaper to implement than the typical proprietary solutions.
This discussion has been archived. No new comments can be posted.

Comparing R, Octave, and Python for Data Analysis

Comments Filter:
  • by ACK!! ( 10229 ) on Wednesday May 23, 2012 @04:24PM (#40093013) Journal
    The whole article was not much more than a high level review. The graphic naturally draws attention to the parameters the writer wanted to cover but he did not back up his graphic with any sort of serious textual review of what he felt were the weaknesses or advantages of the different programming language at least not in any detail.
  • Julia? (Score:4, Informative)

    by Chrisq ( 894406 ) on Wednesday May 23, 2012 @05:05PM (#40093473)
    There was a previous article about Julia [] which looked cool. I wonder how this measures up
  • I don't understand (Score:5, Informative)

    by utkonos ( 2104836 ) on Wednesday May 23, 2012 @09:33PM (#40095821)
    This article compares three languages that have different purposes. R's purpose is statistical analysis and visualization. Octave is a general mathematical analysis and visualization language. Python is a generalist language that has it's own focuses on code readability among other things.

    These languages also have a target audience. R is for statisticians and scientists. Octave is for mathematicians, and Python is for programmers.
  • by csirac ( 574795 ) on Wednesday May 23, 2012 @11:19PM (#40096487)

    Through pandas [], for a start. The SciPy/NumPy stack is quite nifty, I'm especially interested in how to apply it for working with irregular time series data.

    Not to say anybody should ditch R, I still support our researchers most weeks at work in using it. But it's not as clear-cut as you seem to think it is, especially in terms of memory efficiency.

If a 6600 used paper tape instead of core memory, it would use up tape at about 30 miles/second. -- Grishman, Assembly Language Programming