Catch up on stories from the past week (and beyond) at the Slashdot story archive

 



Forgot your password?
typodupeerror
×
Linux Software

Benchmarking Linux Filesystems In New 2.6 Kernel 56

An anonymous reader writes "KernelTrap has an interesting article about a recent benchmark conducted to compare five journaling filesystems available with the current 2.6.0-test2 Linux development kernel. The tests were conducted with a very simple shell script, mainly timing how long it takes to copy, tar, and remove directories. Looks like reiser4 is the fastest filesystem at the expense of consuming much more CPU, with ext3 trailing a ways behind."
This discussion has been archived. No new comments can be posted.

Benchmarking Linux Filesystems In New 2.6 Kernel

Comments Filter:
  • fs issues (Score:5, Insightful)

    by Tirel ( 692085 ) on Thursday August 07, 2003 @10:00AM (#6634396)
    speed is nice, but I think the more important question is; how stable is it?
    • Re:fs issues (Score:5, Informative)

      by BrokenHalo ( 565198 ) on Thursday August 07, 2003 @10:24AM (#6634612)
      I don't know about 2.6 yet, but Reiserfs hasn't let me down so far.

      For a long time I resisted moving away from ext2, as I didn't want the journalling overhead and didn't mind the occasional e2fsck. After I got a few power outages I changed my mind :-) but I haven't been disappointed with Reiserfs' speed.

    • Another thought... (Score:3, Interesting)

      by BrokenHalo ( 565198 )
      It seems to me to me that these benchmarks would probably have been more meaningful (and useful) if they had run the same tests against 2.4.21. Anybody got any thoughts on this? I know there are low-latency features and so forth with the new kernel, but I would be curious to know if that has any real impact on disk I/O.
    • Stability bugs will be fixed if the performance is good enough, but speed problems can be hard to fix without altering the most basic things about a filesystem.
      • Speed problems can be solved by throwning hardware at the problem. Faster disks, more ram, more servers. A filesystem not desigined with stability in mind wont be stable regardless of the hardware.

        Stability should be at the top of the list when desigining a filesystem. If its a quetsion of stability or speed, stability should win.

        • I think for "Stability" you mean "Robustness", so that data loss can be avoided in a power outage or other out-of-spec situations (things supposed to be fixed with data journaling, atomic transactions, etc.), while I mean "lack of Bugs". It is true that robustness is a problem that should be figured out at design time and it can be more important than speed, but I think the original poster means unbugginess for stability. Anyway, reiser4, ext3 (ordered/data journaling mode) are robust enough, but the form
        • Which is why Reiser4 is so great. While it may yet have some bugs (understandable for a brand-new FS running on a development-series kernel!) it was designed with robustness in mind. It does data-journaling in addition to metadata-journaling, so your file data is protected as well. And it does all this while being really fast!
    • ReiserFS sucks in stability (atleast the eariler versions did).
      There were times when the power used to suddenly go off and when I rebooted there were parts of my files which had been swapped!! Really pissed me off and I switched to ext3 since then...

      Nandz.
  • by Anonymous Coward on Thursday August 07, 2003 @10:03AM (#6634419)
    The first item number is time, in seconds, to complete the test (lower
    is better). The second number is CPU use percentage (lower is better).

    reiser4 171.28s, 30%CPU (1.0000x time; 1.0x CPU)
    reiserfs 302.53s, 16%CPU (1.7663x time; 0.53x CPU)
    ext3 319.71s, 11%CPU (1.8666x time; 0.36x CPU)
    xfs 429.79s, 13%CPU (2.5093x time; 0.43x CPU)
    jfs 470.88s, 6%CPU (2.7492x time 0.20x CPU)

    What's interesting:
    * ext3's syncs tended to take the longest 10 seconds, except
    * JFS took a whopping 38.18s on its final sync
    * xfs used more CPU than ext3 but was slower than ext3
    * reiser4 had highest throughput and most CPU usage
    * jfs had lowest throughput and least CPU usage
    * total performance of course depends on how IO or CPU bound your task is
    • by dustman ( 34626 ) <dleary.ttlc@net> on Thursday August 07, 2003 @02:37PM (#6637414)
      I think another interesting metric to look at would be cpu time used. If one of them took 90% CPU for 10 seconds, that would be a big winner imo.

      reiser4 171.28s @ 30% CPU = 51.384s CPU
      reiserfs 302.53s @ 16% CPU = 48.4048s CPU
      ext3 319.71s @ 11% CPU = 35.1681s CPU
      xfs 429.79s @ 13% CPU = 55.8727s CPU
      jfs 470.88s @ 6% CPU = 28.2528s CPU
      • CPU is certainly a good metric to look at. But over the long term, CPUs speed up a lot faster than disks speed up, so using more CPU to save a little disk access is usually a good tradeoff.

        Many systems, however, do not run many entirely CPU-bound processes. If the CPU would otherwise be idle, it doesn't matter (apart from power consumption in a laptop) whether the usage by the filesystem is 10% or 90%. Only if there are other things wanting to run does it make a difference.

        An interesting benchmark woul
  • by mnmn ( 145599 ) on Thursday August 07, 2003 @10:17AM (#6634534) Homepage
    There are other things about filesystems us sysadmins to know about. Which is the most stable and crashproof filesystem? Ive suspected it to be XFS from which I recovered data after doing dd if=/dev/zero of=/dev/hda1 count=2048576

    Also what filesystem would require the lest or no syncing at all? Befs?

    In server environements with stripped 15K cheetah SCSI drives, you'd worry more about stability than speed.
    • I recovered data [from XFS] after doing
      dd if=/dev/zero of=/dev/hda1 count=2048576
      Cool. How?
      • by Molina the Bofh ( 99621 ) on Thursday August 07, 2003 @01:06PM (#6636235) Homepage
        Pick one of the three answers:

        With the new and improved Magnetic Forensics Tunnelling Electronic Microscope, used by the FBI

        He had a backup

        XFS takes so long to sync that by the time it would start to sync, he turned his computer off

      • by mnmn ( 145599 ) on Thursday August 07, 2003 @04:58PM (#6639221) Homepage
        I did a dd to dump the whole filesystem to a file in a larger filesystem for recovery purposes. The XFS tools that came with RedHat 7.1 seemed to crash so I did a small slackware install and went from there. xfs_restore and other tools sounding like xfs_sync or something found my third superblock. I used it to reconstruct the other superblocks. The root and early directories were gone, and the files I needed were in the root directory, well the xfs_restore linked them all to numbered directories and files. A few greps later I was in a directory that had those files all intact. I recovered around 80% of all the files, and 100% of the ones I was seeking. Just because I succeeded in one major recovery op from an XFS filesystem, I feel confident with it more than the ext3 or reiser guys. I havent really compared them.
    • In server environements with stripped 15K cheetah SCSI drives, you'd worry more about stability than speed.

      I think if I had stripped drives I'd be more worried about what sort of sicko strips a hard drive, and is he still hiding in the server room?
    • In server environements with stripped [sic] 15K cheetah SCSI drives, you'd worry more about stability than speed.

      I'm not convinced. ext3 under 2.4 on SMP can be cripplingly slow entirely for software reasons--lock_kernel() being the biggest culprit.

      Having said that, my new big fileserver is going to be ext3 on 2.6 (eventually!); the data volumes being on a 12-way RAID-5 set and the journals on a RAID-1 pair. It seems to perform adequately with dir_index and sparse_super.

      The main thing that swings i

  • by Futurepower(R) ( 558542 ) on Thursday August 07, 2003 @10:29AM (#6634648) Homepage

    From the article:
    "reiser4 171.28s, 30%CPU (1.0000x time; 1.0x CPU)
    reiserfs 302.53s, 16%CPU (1.7663x time; 0.53x CPU)

    What's interesting:
    * reiser4 had highest throughput and most CPU usage"


    The comparison seems incomplete to me. Reiser4 took about half the time, with twice the CPU usage. The

    Total Work Done by the CPU = Percent * Time.

    Reiser4 did the work in half the time, but the total work was roughly equal. Actually, ReiserFS was more efficient considering total CPU cycles.
    • by makapuf ( 412290 ) on Thursday August 07, 2003 @11:16AM (#6635123)
      Except that if you have a mostly idle CPU and your task is more & more waiting for the disk to complete, you don't care about 11 or 30 % if the other 89 or 70 % of the CPU are idle.

      Comparing CPU cycles needed is NOT a fair benckmark, unless your task is CPU _AND_ IO Bound.(if it's not io bound, take whatever fs you have, it doesn't matter.)

      The benchmark was right in giving results as TWO parameters : CPU used and time spent. Might be interesting to see how it depends of the drive type or the CPU arch, though)
      • Right, but they gave CPU % used, rather than the number of cycles used during the test - Reiser's CPU numbers looked higher *because* it was fast in terms of wall time.
      • by p3d0 ( 42270 ) on Saturday August 09, 2003 @07:38AM (#6653803)
        I disagree. Consider:
        • FS1: 1sec CPU time, 1sec IO time. CPU usage=50%.
        • FS2: 2sec CPU time, 6sec IO time. CPU usage=25%.
        FS1 is undoubtably better. It consumes less CPU and IO time. Yet noobs would complain about the high CPU usage of FS1. The truth is, CPU usage is higher only because IO is more efficient.

        I can think of no corresponding pathological case in which CPU load would be a more appropriate predictor than CPU time.

        if you have a mostly idle CPU and your task is more & more waiting for the disk to complete, you don't care about 11 or 30 % if the other 89 or 70 % of the CPU are idle.
        Exactly. That's why CPU load doesn't matter.
        Comparing CPU cycles needed is NOT a fair benckmark, unless your task is CPU _AND_ IO Bound
        No, CPU cycles matters for CPU-bound tasks. I think that's pretty self-evident.
        (if it's not io bound, take whatever fs you have, it doesn't matter.)
        Wrong. In a compute-bound task, CPU time adds directly to the total execution time of the task, so you should choose the FS with the lower total CPU time.
        • Exactly. That's why CPU load doesn't matter.

          I don't completely agree. Sure, CPU load doesn't matter in a single-tasking system, but if I'm running a multi-tasking system I don't want it to noticably "hiccup" each and every time some background task has to do some disk I/O. The system should _not_ bog down just because Mozilla is cleaning out its cache -- it should still be responsive.

          Obviously, priority levels can have a big effect on this sort of thing -- but in a multitasking environment where you a

  • by molo ( 94384 ) on Thursday August 07, 2003 @10:46AM (#6634837) Journal
    FYI, kerneltrap just moved to a new server (this week!). It used to be run on Jeremy's DSL line, which was why it would get shot to shit whenever it got slashdotted.

    Now its in a colo on new (fast) hardware (paid for by users' donations), and upgraded to drupal 4.2 (faster here too).

    So slashdot away!

    -molo
  • by j.e.hahn ( 1014 ) on Thursday August 07, 2003 @11:29AM (#6635275)
    And a number of people complained that it wasn't a great benchmark. Hans Reiser admitted it was just a quickie, and I forget who it was that said it, but ext3 has some performance enhancements that are on the cusp of merging into Linus' tree from the -mm kernels.

    Wait until 2.6 is out folks. These numbers are still open for mass fluctuation.
  • by Stone316 ( 629009 ) on Thursday August 07, 2003 @03:23PM (#6638076) Journal
    With respect to filesystems and database performance. EXT3 came out on top. [oracle.com]
  • CPU Usage?! (Score:4, Funny)

    by metalmaniac1759 ( 600176 ) on Thursday August 07, 2003 @04:02PM (#6638568) Homepage Journal
    30%-40% CPU Usage - whoa. What happens to poor me - with a PIII 550 MHz, 128 MB SDRAM and KDE running all the time.

    I switched from RedHat to Gentoo and KDE stopped crawling. God knows what will happen with ReiserFS4.

    Nandz.
    • The CPU usage of ReiserFS is higher partially because it spends less time on IO. For IO-bound tasks, even your lowly machine would see an improvement.

      Besides, your P3 550 would have a slower disk.

  • Stupid question (Score:3, Insightful)

    by Hard_Code ( 49548 ) on Thursday August 07, 2003 @06:20PM (#6640073)
    Why do we need fast file systems? Or rather, why should we spend so much effort on performance differences of such small magnitude (take out reiser4 and its high cpu usage)? The only things I can think of are: swapping memory - this will really slow the system down, because disk is orders of magnitude slower than main memory; and databases that absolutely require high performance. But both of these typically *already* use custom file systems (or raw partitions) tailored to their exact needs. As normal user (somebody who is not constantly copying Mozilla source tries around), why should this matter? I can't remember the last time I thought "boy, this file access was slow, I wish I had a faster file system".
    • I care about performance because I have a few hundred thousand files on servers that 100 employees need to access. Caching will only get you so far when you have a lot of people going after a lot of files through out the day. Since we have the choice of which fs to use, it is nice to have more info to pick between them.

      There is more to this world than home users and databases.
    • Something I haven't seen anyone mention...

      Could ReiserFS be used as the custom file system for a database? With it's extensibility and metadata, it's suitable for the basis of at least some databases.

      As normal user (somebody who is not constantly copying Mozilla source tries around), why should this matter? I can't remember the last time I thought "boy, this file access was slow, I wish I had a faster file system".

      Well... a journaling file system is useful for data integrity, which is useful to
    • > Why do we need fast file systems?

      Try streaming a high quality video or two from your drive. An efficient file system will really make it go smoothly, leaving most of the CPU free for the decoder.
      • Re:Stupid question (Score:3, Informative)

        by Arandir ( 19206 )
        When you're streaming a file, you're streaming a single file. All file systems will be roughly equivalent at this point. After all, once the harddrive has seeked to the starting sector, odds are very high that the data will be contiguous.

        Where you want the efficiency is when you have to deal with a large number of files. For example, I use UFS on FreeBSD. Before I used softupdates, untarring the ports tree severly bogged down the system, because there were tens of thousands of very small files. But after I
    • One word: Maildir (Score:3, Informative)

      by Gothmolly ( 148874 )
      If I have 2500 4k files in a directory, and my filesystem uses a stupid algorithm to store that information, it might take 2 or 3 times longer to retrieve a list of files than it does on a decent filesystem. When you've got a webmail frontend to your Maildir, and your webmail code has to grope through all directories, all files, even small differences are HUGE. Clever FS design will win the day.
    • I work with video files that are average size of 5 gigabytes per file, some as large as 11 gigabytes each.

      Also there are people that deal with huge scanned images (as I have in the past) that are many hundreds of megabytes per image.

      Speed is critical.
    • By having several file systems competing, the developers feel the stakes are higher and work harder to be the best. We also see a bit of differentiation, in that the different file systems typically are good at different things. In my experience, XFS is great for largish mysql databases, while ReiserFS does an excellent job on handling many, small files (like your /tmp).

      So, as long as Linus only allows stable filesystems in the stable kernels, I'm all for file system innovations making their way into the k
  • quick question (Score:3, Interesting)

    by perlchild ( 582235 ) on Thursday August 07, 2003 @06:34PM (#6640190)
    quick question, what mount options were used on each? Wouldn't e3fs appear slower than the others if it was data-journaling? (I read the article a bit fast, but I didn't see any "how we tested") Also wouldn't a good test take at least two hours on each filesystem?
  • JFS's figures (Score:1, Informative)

    by Anonymous Coward
    One thing not mentioned in the kerneltrap discussion is the fact that JFS uses lazy allocation, and a complete dissociation of inum and block number. Under JFS, data space on disk isn't allocated until it's ready to be flushed from the buffer, which accounts for the 38 seconds of sync time at the end of the tests.
  • Greetings gentlebeings,

    Since I recently reinstalled my Debian system, I decided to put some effort into implementing my filesystems right. I decided on XFS for various reasons, mainly that it has always been rock solid for me (I've had some problems with ReiserFS causing heavy data corruption -- it's a long time ago and they've undoubtedly improved the system since, but still I prefer XFS since I've never even once had a problem with it, and know nobody who has), and that its good large-file performance is more useful to me than Reiser's kickass small-file performance. Also EAs and ACLs are neat. What sucks about XFS is poor small-file performance and abysmal delete performance.

    Anyways, I made a few bonnie++ runs and messed around with some of the many mkfs and mount options of XFS. In the end, my tweaked XFS filesystems beat ext3 (mode=ordered) for delete performance, which was a substantial improvement over a standard XFS mount.

    I made a writeup about the whole procedure [everything2.com] at Everything2. Go slashdot the poor bastards. Warning: The language is tailored to the fact that not all E2 users are geek hardcore.

Get hold of portable property. -- Charles Dickens, "Great Expectations"

Working...