Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Tuning Linux VM swapping

Posted by michael on Fri Apr 30, 2004 08:28 AM
from the got-swap? dept.
Lank writes "Kernel developers started discussing the pros and cons of swapping to disk on the Linux Kernel mailing list. KernelTrap has coverage of the story on their homepage. Andrew Morton comments, 'My point is that decreasing the tendency of the kernel to swap stuff out is wrong. You really don't want hundreds of megabytes of BloatyApp's untouched memory floating about in the machine. Get it out on the disk, use the memory for something useful.' Personally, I just try to keep my memory usage below the physical memory in my machine, but I guess that's not always possible..."
This discussion has been archived. No new comments can be posted.
Tuning Linux VM swapping | Log In/Create an Account | Top | 324 comments (Spill at 50!) | Index Only | Search Discussion
Display Options Threshold:
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
  • Ob. /. joke by Anonymous Coward (Score:2) Friday April 30 2004, @08:32AM
  • God no... (Score:5, Interesting)

    by 0123456 (636235) on Friday April 30 2004, @08:32AM (#9017815)
    "You really don't want hundreds of megabytes of BloatyApp's untouched memory floating about in the machine. Get it out on the disk, use the memory for something useful."

    I absolutely despise the way that XP swaps out applications in order to make the disk cache larger. I have 1GB of RAM on my machine precisely so I don't have to wait two minutes for it to swap my web browser back in after it's swapped out... yet if I copy a 2GB file from one drive to another, the stupid operating system will swap out all the applications it can just to make the cache larger.

    Please, please, don't take Linux down the same braindead route as Microsoft has done for XP. It's utterly insane to swap out my browser so that a 2GB file can be copied two seconds faster when I then have to wait two minutes for the browser to swap back in. Or at least provide some kind of '#define STOP_VM_SWAPPING_STUPIDITY' so that I can disable it.
    • Re:God no... by ninewands (Score:3) Friday April 30 2004, @08:39AM
      • Re:God no... by Anonymous Coward (Score:1) Friday April 30 2004, @10:14AM
        • Re:God no... by Gilk180 (Score:3) Friday April 30 2004, @10:34AM
          • Re:God no... by julesh (Score:2) Friday April 30 2004, @12:02PM
            • Re:God no... by Gilk180 (Score:1) Friday April 30 2004, @12:32PM
    • Re:God no... by Anonymous Coward (Score:2) Friday April 30 2004, @08:45AM
    • Re:God no... (Score:5, Informative)

      by petabyte (238821) on Friday April 30 2004, @08:45AM (#9017952)
      Actually, you can change it on the fly with /proc/sys/vm/swappiness Increasing the number will increase the agressiveness of the swapout. Mr. Morton runs with his set at 100 (the max). 0, I believe would turn swap.

      My kernel has autoswappiness enabled so it figures out the number on its own. I'm running at 64 ATM on a 256 Meg system (ram donations accepted) :).
      [ Parent ]
      • Re:God no... by SkunkPussy (Score:1) Friday April 30 2004, @10:07AM
      • Re:God no... by mdfst13 (Score:3) Friday April 30 2004, @09:59AM
      • 2 replies beneath your current threshold.
    • Re:God no... (Score:4, Informative)

      by kinema (630983) on Friday April 30 2004, @08:46AM (#9017959)
      All you need to do is: "echo 0 > /proc/sys/vm/swappiness" and the VM will do it's best to keep from swapping pages to disk.
      [ Parent ]
      • Re:God no... by Spackler (Score:3) Friday April 30 2004, @11:36AM
        • Re:God no... by dasmegabyte (Score:2) Friday April 30 2004, @04:38PM
        • 1 reply beneath your current threshold.
      • Re:God no... by dasmegabyte (Score:3) Friday April 30 2004, @12:57PM
    • Re:God no... (Score:5, Interesting)

      by Shakrai (717556) on Friday April 30 2004, @08:48AM (#9017980)
      (Last Journal: Monday November 26, @06:13PM)

      Please, please, don't take Linux down the same braindead route as Microsoft has done for XP. It's utterly insane to swap out my browser so that a 2GB file can be copied two seconds faster when I then have to wait two minutes for the browser to swap back in

      Does it really make it faster anyway? Unless parts of that 2GB file were already in the cache then how is the cache going to make it transfer any faster?

      As a side note I haven't noticed Linux swapping much out in favor of the cache. My home grown samba/sql/dhcp/nat/intranet server has 768 megs of memory. As of today (43 day uptime -- Linux 2.4.25) there is only 2,528k in SWAP. 8,444k of free memory, 191,952k used for buffers, 296,004 used for cache and the rest for applications.

      I wouldn't mind seeing Linux swap out programs that aren't touched in several days/weeks (like the 12 agetty processes on my monitor less machine -- yes I know I could disable them if I wanted) but I definitely don't want to see it swapping out that browser I used 5 minutes ago in favor of increasing the disk cache size. Now if I launch Quake that's a different story.

      As far as the other posts about rule of thumb for swap size go -- I stopped using the 1:1 or 2:1 ratio a long time ago. I have a 256meg swap partition on my 768meg Linux box. That's pretty much as big as I go with swap spaces. Are you seriously going to setup a 768 (or worse x2) swap space? A) You'll never use it, B) If you do use it your machine will barely be useable.

      As far as XP's stupidity goes look under My Computer -> Properties -> Advanced -> Performance Settings -> Advanced and make sure both options (processor scheduling and memory usage) are set to "Programs" and not "background services" or "System cache". That may (or may not -- it is Windows after all) help you a little. On the flipside of the coin I discovered that I needed to reverse the memory option on my Windows 2000 Terminal Server to prevent stupid HP print drivers from sucking up 100% of the CPU and 90% of the physical memory.

      [ Parent ]
      • Re:God no... by sweet cunny muffin (Score:1) Friday April 30 2004, @09:23AM
      • Re:God no... by illtud (Score:2) Friday April 30 2004, @09:43AM
        • Re:God no... by 5E-0W2 (Score:2) Friday April 30 2004, @10:25AM
          • Re:God no... by Anonymous Coward (Score:1) Friday April 30 2004, @12:29PM
            • Re:God no... by 5E-0W2 (Score:1) Friday April 30 2004, @04:51PM
      • Re:God no... by straybullets (Score:1) Friday April 30 2004, @10:03AM
      • Re:God no... by ostrich2 (Score:1) Friday April 30 2004, @11:04AM
      • Re:God no... by scrubmuffin (Score:1) Friday April 30 2004, @11:26AM
    • Re:God no... by Elm Tree (Score:2) Friday April 30 2004, @08:49AM
      • Re:God no... by psavo (Score:2) Friday April 30 2004, @10:48AM
      • Re:God no... by Fjord (Score:1) Friday April 30 2004, @10:53AM
    • Re:God no... by nikster (Score:2) Friday April 30 2004, @09:09AM
      • 1 reply beneath your current threshold.
    • Re:God no... by Monkelectric (Score:2) Friday April 30 2004, @09:11AM
      • Re:God no... by ckaminski (Score:3) Friday April 30 2004, @09:27AM
      • Re:God no... by CommandNotFound (Score:2) Friday April 30 2004, @11:40AM
        • 1 reply beneath your current threshold.
    • Re:God no... by ByteSlicer (Score:1) Friday April 30 2004, @09:15AM
    • Re:God no... by colinleroy (Score:2) Friday April 30 2004, @09:24AM
    • Re:God no... (Score:5, Informative)

      by The Spoonman (634311) on Friday April 30 2004, @09:51AM (#9018664)
      (http://www.workorspoon.com/)
      Right-click My Computer -> Advanced -> Performance -> Advanced -> Memory Usage. Set to Programs. Now, click Change under Virtual Memory. Set your cache size small. For 1G of RAM, you prolly don't need a biggun. I usually set it to 100M for Inital and Max and then up it based on how often the machine swaps.

      Then, make the following changes to the registry:

      HKLM\SYSTEM\CurrentControlSet\Control\Session Manager\Memory Management\ClearPageFileAtShutdown, set to 1. I don't shut my machine down very often, but occasionally XP will increase the size of the pagefile if it absolutely needs to depending on circumstances. This forces it back to the size you want it when you restart.

      HKLM\System\CurrentControlSet\Control\FileSystem\N tfsDisable8dot3NameCreation, set to 1 ONLY IF YOU USE NO 16-BIT APPS ON YOUR MACHINE. Speeds up writes.

      HKLM\System\CurrentControlSet\Control\FileSystem\N tfsDisableLastAccessUpdate, set to 1 if you don't care when files are accessed. This is rarely needed, and the setting speeds up writes.

      HKEY_LOCAL_MACHINE\System\CurrentControlSet\Contro l\Session Manager\Memory Management\IoPageLockLimit. Little more complex:

      Set to 4096 if you've got more than 32M RAM

      Set to 8192 if you've got more than 64M RAM

      16384, 128M

      32768, 160M

      65536, 256M

      131072, 512M

      This changes the maximum number of bytes that can be locked for I/O operations. The default is 512Kb. While the above are the recommendations, I've found stepping down one level to provide the most performance for my needs, YMMV. (For example, I have 256M, but I set my IO limit to 32768.)

      HKLM\System\CurrentControlSet\Control\Session Manager\Memory Management\DisablePagingExecutive, Set to 1 to disable paging of the kernel.

      There, that wasn't so hard, was it? For those who want to flame that statement, keep in mind, that the information above is easier to find than some of the tuning suggestions I've heard for Linux. I've used Linux for 10 years, and only today heard about /proc/sys/vm/swappiness. Oh, and all of the above apply from at least NT4+.
      [ Parent ]
      • Re:God no... by The Spoonman (Score:2) Friday April 30 2004, @10:06AM
      • Re:God no... by jbayes (Score:2) Friday April 30 2004, @11:15AM
      • Re:God no... by Darth Daver (Score:3) Friday April 30 2004, @12:01PM
        • Re:God no... by dasmegabyte (Score:2) Friday April 30 2004, @12:36PM
          • Re:God no... by Darth Daver (Score:1) Friday April 30 2004, @12:49PM
          • Re:God no... by MenTaLguY (Score:2) Friday April 30 2004, @01:55PM
            • Re:God no... by Foolhardy (Score:1) Friday April 30 2004, @03:07PM
              • Re:God no... by MenTaLguY (Score:2) Friday April 30 2004, @05:24PM
              • Re:God no... by Foolhardy (Score:1) Friday April 30 2004, @06:50PM
              • Re:God no... by Foolhardy (Score:1) Friday April 30 2004, @07:23PM
              • 2 replies beneath your current threshold.
        • Re:God no... (Score:4, Interesting)

          by The Spoonman (634311) on Friday April 30 2004, @04:42PM (#9023089)
          (http://www.workorspoon.com/)
          I have been using Windows for 12 years (and Linux for 10), and this is the first time I have heard of the obscure registry hacks you just listed.

          The above hacks aren't for users, they're for administers and geeks. The average user will boot their machine, do what they have to do, and shut it back down. For those who aren't users, we like to leave our machines on for months at a time and these tweaks will help with that. If you were doing tech support, then you'd know them. If you ARE doing tech support and don't know them, please consider another field. These are the basics...IT's already filled up with enough paper MCSEs who can't spell NT unless it's in the 6-week course.

          When I just searched for '/proc linux vm swap' in Google, /proc/sys/vm/swappiness was in the fourth hit from the top. There, that wasn't so hard, was it?

          No, when you know EXACTLY what you're looking for, it never is. Now, search for +linux +performance +tweaks, and tell me if it shows up. Didn't, did it? Now, search for +windows +performance +tweaks. How many of those pages DIDN'T list the tweaks I just gave? Not many.

          I can tell you one thing. I would rather poke around the /proc filesystem than wander through the Windows registry any day.

          Because the difference is...? One's a collection of key-value pairs organized in a virtual filesystem analogy and another is a collection of key-value pairs organized on a filesystem? Or, is it because MS puts a warning that if you don't know what you're doing, editing the registry can fuck your system, but the Linux developers fail to give you the same warning?

          By the way, if you are not shutting your XP system down often, you must not be rebooting for the security patches, and that can be a problem for everyone.

          Could be, but I keep my machines fairly secure to begin with, and few of the security patches issues by MS affect well locked-down machines. They're more for user's PCs, like yours. Also, the last few security updates I've done haven't required a reboot. Unlike the latest kernel updates...

          claiming to release within hours versus the weeks they claim FOSS takes

          Or, years. How long was that latest flaw in the kernel sources that took down the Debian servers? Years? I thought the "many eyes" theory said something like that wouldn't reach production as there's so many people reviewing the code. I'll give you a clue: just 'cause the code's available doesn't mean many more people outside the development team is looking at it. Most are doing ./configure && make && su && make install and trusting it'll all be okay. It must be, right?
          [ Parent ]
      • Re:God no... by Telex4 (Score:2) Friday April 30 2004, @12:32PM
        • Re:God no... by The Spoonman (Score:2) Friday April 30 2004, @04:29PM
          • 1 reply beneath your current threshold.
      • Re:God no... by dasmegabyte (Score:2) Friday April 30 2004, @12:41PM
        • Re:God no... by The Spoonman (Score:2) Friday April 30 2004, @04:50PM
          • Re:God no... by dasmegabyte (Score:1) Friday April 30 2004, @05:26PM
      • Re:God no... by whitmer (Score:1) Friday April 30 2004, @03:11PM
        • Re:God no... by The Spoonman (Score:2) Friday April 30 2004, @04:45PM
          • Re:God no... by Hast (Score:2) Saturday May 01 2004, @05:38PM
      • Re:God no... by diegocgteleline.es (Score:1) Friday April 30 2004, @03:53PM
    • Re:God no... by Fjord (Score:3) Friday April 30 2004, @10:43AM
    • XP paging registry settings by ScrappyLaptop (Score:2) Friday April 30 2004, @11:10AM
    • Re:God no... by WaltFrench (Score:1) Friday April 30 2004, @11:16AM
    • Re:God no... by ctr2sprt (Score:2) Friday April 30 2004, @12:39PM
    • Re:God no... by Wolfrider (Score:2) Saturday May 01 2004, @01:37AM
    • Re:God no... by Progman3K (Score:2) Thursday May 06 2004, @08:56PM
    • Re:God no... (Score:4, Interesting)

      by 0123456 (636235) on Friday April 30 2004, @08:40AM (#9017907)
      "I hate to tell you this but it actually sort of does this already."

      Maybe 2.4 does, but my 2.2 system has never suffered from this problem (haven't got around to upgrading it yet). Small amounts get swapped out, but nothing noticeable in interactive use.

      "Why waste ram that could be used for live data?"

      Because when I want to use my web browser again after playing a game for an hour, I don't want to have to sit there for two minutes watching it slowly swap back in... interactivity is far more important to me than small performance benefits from an extra 64MB of disk cache.
      [ Parent ]
      • Re:God no... by scrubmuffin (Score:1) Friday April 30 2004, @11:14AM
      • Re:God no... by Shakrai (Score:2) Friday April 30 2004, @12:12PM
        • Re:God no... by Anonymous Coward (Score:2) Friday April 30 2004, @02:30PM
          • 1 reply beneath your current threshold.
        • 1 reply beneath your current threshold.
    • Re:God no... by Anonymous Coward (Score:1) Friday April 30 2004, @09:05AM
    • Re:God no... by AKAImBatman (Score:2) Friday April 30 2004, @10:05AM
    • Re:God no... by greppling (Score:2) Friday April 30 2004, @05:26PM
    • 5 replies beneath your current threshold.
  • How fast is swapping really these days? by jaf (Score:2) Friday April 30 2004, @08:32AM
  • by DikSeaCup (767041) on Friday April 30 2004, @08:33AM (#9017827)
    (http://ericdives.com/ | Last Journal: Monday June 12 2006, @12:11PM)
    I had this conversation with a fellow sysadmin, about the time that RAM was fairly cheap and we had a budget.

    She had just procured a new Sun machine with 2 GB of RAM. Mind you, disk space hadn't grown all that significantly and you could still get machines with 9 GB drives.

    The original practice was to make swap 2xRAM. So when the student she had putting the machine came to her and said, "What do I make swap?" she responded "Twice the RAM."

    He said, "Are you sure? That's like almost half the boot drive."

    She thought about it for a second and said, "Oh, yeah. I guess just make it the same as the RAM."

    So this begs the questions: What do you make your swap now? When does your rule of thumb change? And remember when you could run a "fast" linux box on a P100 with 64MB of RAM and 128MB of swap?

  • Problem (Score:5, Interesting)

    by FreeLinux (555387) on Friday April 30 2004, @08:34AM (#9017836)
    Personally, I just try to keep my memory usage below the physical memory in my machine, but I guess that's not always possible..."

    No it isn't possible. With today's RAM prices I almost always have more physical RAM than the system requires. But, due to aggressive VM swapping there are still hundreds of megs swapped out to disk when there is no need at all. This means that those applications, when their time does finally come, are slow because they must be retrieved from disk first. It's really annoying sometimes. Yet, even with excess RAM turning off swap is disasterous.

    • Re:Problem (Score:5, Interesting)

      by Trifthen (40989) on Friday April 30 2004, @08:43AM (#9017935)
      (http://bonesmoses.org/)

      No, turning off swap is not disastrous. We've turned it off on our production web server cluster that routinely serves 60Mb sustained traffic. We've turned it off because we have 2GB of ram in these machines, and Linux insisted on preferring buffers and cache over our running applications. Fuck that, we said. With over 1GB Of buffers and cache, we had RAM to spare; bye-bye swap.

      [ Parent ]
      • Re:Problem by YetAnotherDave (Score:2) Friday April 30 2004, @11:05AM
    • Re:Problem by mcx101 (Score:2) Friday April 30 2004, @09:05AM
      • Re:Problem by Chaostrophy (Score:2) Friday April 30 2004, @10:28AM
        • Re:Problem by mcx101 (Score:2) Friday April 30 2004, @10:36AM
    • Re:Problem by drinkypoo (Score:2) Friday April 30 2004, @09:13AM
    • Re:Problem by gr8_phk (Score:2) Friday April 30 2004, @09:14AM
    • Andrew Morton's patches reflect this by bangular (Score:2) Friday April 30 2004, @09:14AM
    • Re:Problem by DenniRuz (Score:1) Friday April 30 2004, @09:43AM
    • no swap here by BinLadenMyHero (Score:2) Friday April 30 2004, @09:59AM
    • Re:Problem by Jebediah21 (Score:2) Friday April 30 2004, @11:17PM
    • 1 reply beneath your current threshold.
  • Memory by nuggz (Score:2) Friday April 30 2004, @08:34AM
  • 2Gb of RAM, 300Mb of apps running. by Moderation abuser (Score:2) Friday April 30 2004, @08:34AM
  • The big issue (Score:5, Interesting)

    by MrIrwin (761231) on Friday April 30 2004, @08:36AM (#9017858)
    (Last Journal: Thursday April 29 2004, @05:55PM)
    The main cuase of memory usage on Linux is the use of many different shared libraries and not bloated apps.

    I think developers could do more at a library level. For example.....dare I suggest using common sub libraries within libraries, that is people like KDE and GTK get thier heads together and say "are thier functions we include in our libraies that could just as well be linked to an underlying library?"

  • It doesn't work that way (Score:5, Informative)

    by Pharmboy (216950) on Friday April 30 2004, @08:37AM (#9017861)
    (http://www.tanningbeds.org/ | Last Journal: Sunday November 05 2006, @07:23AM)
    Personally, I just try to keep my memory usage below the physical memory in my machine, but I guess that's not always possible..."

    I keep my memory usage much below the total ram on the servers, but in real life, the machine still swaps. This is because even tho the machine NEVER needs more ram than is available at any given time, over a period of days, it will use more than the available ram. It caches out the old data that was used 12 hours ago.

    Unless you reboot every day (as in a client machine) you will use swap on just about any machine. Using swap is not bad. Using swap for a currently running application is not so good. This isn't a bug, its a feature. Reading data from swap after it has been accessed is still faster than reading new data from the drives, especially if its a network drive.
  • by YetAnotherName (168064) on Friday April 30 2004, @08:37AM (#9017862)
    (http://seankelly.biz/)
    You really don't want hundreds of megabytes of BloatyApp's untouched memory floating about in the machine...

    Why not? BloatyApp, if it's that bloaty is probably an object oriented program with template instantiation (or is by Micro$oft); these programs are notoriously huge, but also have notoriously poor locality of reference. The user will get better perceived response if you can keep more of BloatyApp resident.

    If there's space in memory, I don't see the point of pre-emptively ejecting as many LRU pages of BloatyApp. (Of course, I haven't RTFA, but this is /. so you're not supposed to!)
  • Bloaty apps? Are you kidding me? (Score:5, Insightful)

    by Trifthen (40989) on Friday April 30 2004, @08:37AM (#9017866)
    (http://bonesmoses.org/)

    Ah yes. It's all the fault of bloaty apps. Apps like database daemons and high-traffic httpd daemons. We've turned swapping off on our servers because we were sick of seeing almost a GB of cache/buffer memory, while it was swapping 500MB of shit to disk. Want a bloaty app? How about the linux Kernel? I love the thing, but Jesus Tapdancing Christ it would rather swap our starting DB process to disk, than free up the fucking buffers and cache. Is there something wrong with wanting it to give precedence to not swapping?

  • Swapless since 1997 (Score:4, Insightful)

    by redelm (54142) on Friday April 30 2004, @08:38AM (#9017893)
    (http://pages.sbcglobal.net/redelm)
    Ever since I've had a 32 MB machine (1997), I've not bothered to even set up a swap partition. On the rare occasions when I need swap, I'll create a swapfile. Sure it's slower, but swap is already hugely slow.

    With read-only & demand code-page loading and copy-on-write even bloatware really doesn't eat memory. And bloatware has to be frequently restarted to recover the memory it leaks.

    Sure, there are some jobs that needs swap -- lots of seldom used memory pages.

    But not mine. I prefer to save myself the complexity and performance headaches.

  • VM you say? (Score:3, Insightful)

    by freeze128 (544774) on Friday April 30 2004, @08:40AM (#9017902)
    At what point does VM stop meaning Virtual Machine and start meaning Virtual Memory.

    Or is it just the Virtual "M"?
    • Re:VM you say? by jepaton (Score:1) Friday April 30 2004, @09:03AM
      • TLA by Anonymous Coward (Score:1) Friday April 30 2004, @09:33AM
    • Re:VM you say? by julesh (Score:2) Friday April 30 2004, @09:04AM
    • Re:VM you say? by Kardamon (Score:1) Friday April 30 2004, @09:24AM
    • Re:VM you say? by diamondsw (Score:2) Friday April 30 2004, @09:35AM
  • swapoff /dev/hda2 by datastalker (Score:1) Friday April 30 2004, @08:41AM
  • they missed one of the biggest points! by whateverdude (Score:1) Friday April 30 2004, @08:41AM
  • Other reasons (Score:5, Interesting)

    Another reason to gradually and pro-actively swap things out, is that when another program later needs a lot of memory, your system doesn't come to a grinding halt because suddenly a lot of stuff has to be swapped out at once (followed by zeroing all that memory, since you don't want to have one program leaking data to another).

    At least, that's the rationale I've read behind OS X's strategy of swapping things out long before all physical memory is used (and of keeping a pool of zeroed memory pages ready to fulfill most requests). Note that this does not require superfluous swap-ins if your reuse strategy is balanced properly, as the fact that something is swapped out doesn't mean that the memory which contained that data will be cleared/reused immediately (i.e., if it's needed again shortly afterwards, that page can be reactivated without having to go to disk).

    Under most desktop OS'es, programs can even give some hints to the system regarding their usage of a memory region using e.g. the madvise() system call.
    • My vote.... (Score:4, Informative)

      by tsmithnj (738472) on Friday April 30 2004, @09:00AM (#9018081)
      is to do something like AIX does, where I can use "vmtune" to customize the percentages of memory I devote (hard or soft limit) to filesystem pages or computational pages. This way I can tune for my Bloatware, tune for file copying a la XP, or tune for my DBMS, whatever suits me.... The developers could take it one step further and provide a simple, understandable (as opposed to AIX's) interface for configuration......
      [ Parent ]
    • OS X's strategy may not be the best by Cysgod (Score:2) Friday April 30 2004, @09:32AM
    • Re:Other reasons by Just Some Guy (Score:3) Friday April 30 2004, @09:33AM
  • La, La, La by malia8888 (Score:1) Friday April 30 2004, @08:52AM
  • Windows already has this by novakane007 (Score:2) Friday April 30 2004, @08:53AM
  • RTFA - swappiness is tweakable by 42forty-two42 (Score:2) Friday April 30 2004, @08:54AM
  • Return of the Sticky Bit by Anonymous Coward (Score:2) Friday April 30 2004, @08:55AM
    • 1 reply beneath your current threshold.
  • Please set up a proper default by dorfsmay (Score:1) Friday April 30 2004, @09:01AM
  • by buserror (115301) * on Friday April 30 2004, @09:05AM (#9018131)
    I dont mind the kernel swapping out "old" stuff to grow a huge disk cache. Really, thats OK, it makes things faster for disk hungry processes allright.

    However, what I mind is the fact that the pages that are swapped out STAY there!
    Why not aging the disk cache the same way the RAM pages are aged ? On an idle machine, the disk cache would gradualy decay and be replaced by the pages back from the swap, and the machine would be all responsive again.

    It means that if the user leaves for lunch and a cron wants to eat all the disk, with some luck, when the user gets back, his machine is as responsive as it was when he left.

    I have a laptop with 192Mb of ram, I always hate when 2/3 of the ram is "free" while it takes 10 seconds for the kmail window to move to the front. Even if the machine has been idle for hours.

    I even regularly do a "swapoff -a;swapon" to claim back the cache!
    • I have a laptop with 192Mb of ram, I always hate when 2/3 of the ram is "free" while it takes 10 seconds for the kmail window to move to the front. Even if the machine has been idle for hours.

      I know what you mean, but in this case, it seems like your machine is making a reasonable guess: you haven't used Kmail in hours, so the odds of you wanting to resume using it at any particular instant is pretty low. On the other hand, reading from a drive is quite a bit faster than writing, so the penalty for incorrectly swapping out old pages when the system is idle is significantly less than incorrectly not swapping out old pages before users launch giant processes that want to allocate a lot of RAM very quickly.

      [ Parent ]
  • by aksansai (56788) <michael @ y o h e . net> on Friday April 30 2004, @09:06AM (#9018142)
    Many people don't realize how smart modern page caches are designed to speed their system. Linux, MacOS X, Win2K+, etc. all boast aggressive page caches that make loading applications from disk more efficient.

    Without a swap file, the kernel has no place to stick memory segments that are rarely used. They stay in resident memory la-la land until the process is terminated. Those segments add up over time and erode the memory available to the page cache.

    Page caches are wonderful. When you load an application (like Firefox [mozilla.org]), you're not just getting the web browser. You're firing up a large chain of shared objects/DLLs that support the widgets, I/O, and components of the application. All of these components must be read into memory anyhow for program operation, so the kernel tends to just leave it in there for future use (the page cache).

    When you shutdown Firefox, you're also releasing the necessity of those libraries (provided nothing else is using them). Those libraries also remove themselves from memory. If you load another application (like Thunderbird [mozilla.org]) that uses the same type of libraries, the kernel will not have to go to disk in order to fetch those libraries. It will instead opt for the page cache contents.

    Turning off the swap file in the historic era of VM infancy was the best way to remove the hard drive bottleneck from system. The operating systems of yester-year did not have good page cache schemes that took advantage of all that unused memory. It is a little different now.

    Applications are so modularized that they are broken up into a billions of smaller libraries so that code can be shared. This increases memory efficiency by keeping a shared library resident for multiple processes. These libraries are frequently accessed, more often than many people realized. Getting THOSE into memory is better than making sure my 500+ Linux applications stay resident.
    $ cat /proc/meminfo
    total: used: free: shared: buffers: cached:
    Mem: 1055653888 1036296192 19357696 0 70488064 892309504
    Swap: 542367744 235892736 306475008
    Notice that on a web server with 1GB of RAM the Linux kernel is still putting things out to swap. These processes that stay asleep for long periods of time do not need to waste the memory that page cache is currently using (892309504 bytes or 753.7MB). What would be stored in that 753.7MB of memory? The database that drives the website (instead of having to seek the disk). The entire web page hierarchy used to display pages on the web site. All the scripts that are used to display dynamic content on the web site (etc. etc.)

    Now, if we subtracted from the page cache the amount of memory that was stored in the swap file, we would have over 200MB less that we could keep cached in memory. That could be an entire database that the kernel would then waste needless CPU cycles to fetch from disk.

    The only advantage to turning off a swap file on these modern machines would be for a machine that runs only a select few applications, and not having a lot of processes in the background doing things.
  • What algorithm are they going to use (Score:4, Interesting)

    by dorfsmay (566262) on Friday April 30 2004, @09:06AM (#9018143)
    (http://www.cuug.ab.ca/~dorfsmay)
    Hopefully they'll use something modern like ARC [python.org] that tries to keep in cache stuff that have been read at least twice, NOT LRU !!

    AIX uses LRU today, so when you do a backup, the system tries to keep all filesystems in cache (well that what was read last !!), and will happily swap your apps out to disk in order to do so (with default tuning parameter).

    I fondly remember the days when I was running Linux with no swap, none whatsoever...

  • Dumb Swapping is Computer Abuse (Score:5, Interesting)

    by stuffduff (681819) on Friday April 30 2004, @09:07AM (#9018153)
    (Last Journal: Thursday October 11, @08:26AM)
    Programmers have put a lot of time and effort into the VM swapping algorithm; mostly with the intention of being prepared to have a lot of memory ready and waiting for the next thing it will be asked to do. Unfortunately that's not so much of an issue with cheap ram and disk storage and faster and faster front side buses. What we really need is more intelligent swapping, which can only come about when the VM gets a set of API hooks (would make for a great 'shared object') that would enable the system administrator (and maybe someday the end user) to assist an intelligent VM manager to establish priorities and consistently respect those priorities.

    Unfortunately the current crop of best guess VM managers end up denying the end user the experience of their computer's peak performance. Coupled with the horrible state of application bloat, modern 'state of the art' hardware and software combine to give us less and less in terms of overall performance. Software developers throw more code at the cpu to add functionality with little or no concern for performance. And hardware manufacturers add more and more 'special instructions' and 'pipelining' which the majority of software is completely unable to access. If anything it's more like a bunch of dysfunctional co-dependents than an industry that is cogent as to what really needs to be going on. If the folks dealing with processors and the application software could take a page from the gamers (look at the high levels of integration between game engines and video cards) for example, and more effort put into consolidating functionality in dlls and shared libraries; we would be amazed at how truly fast these machines could perform.

  • Copy vs Swap? by SpinyNorman (Score:2) Friday April 30 2004, @09:10AM
  • Not amused (Score:4, Interesting)

    by MrLaminar (774857) <laminar@@@linuxmail...org> on Friday April 30 2004, @09:11AM (#9018195)
    Actually, I haven't been very impressed by the whole swapping thing under Linux lately. I'm running 2.4.22 with a 400MB swapfile.

    Some apps _can_ make the system unresponsive enough to ignore keystrokes, which is *very* annoying. At other times, xmms will stop playing while the disk goes crazy... Switching from emacs to Firefox after 10 minutes usually takes an extra 5 seconds to redraw the window and load all the stuff again.

    Running GNOME2 on this laptop is also quite noisy on the disk. It swaps all the time...
  • Swapping on servers vs. Desktops... by Jonny Royale (Score:2) Friday April 30 2004, @09:15AM
  • Just out of curiosity... by puntloos (Score:1) Friday April 30 2004, @09:23AM
    • 1 reply beneath your current threshold.
  • Swappiness vs. buffering by chthon (Score:2) Friday April 30 2004, @09:27AM
  • Swapping back in. (Score:4, Insightful)

    by AlecC (512609) <alec@aleccawley.com> on Friday April 30 2004, @09:42AM (#9018548)
    (http://www.aleccawley.com/)
    I ffel that there should be some tunable propensity for applications to swap back in. Generally speaking, disk cache is most effective over a pretty short timescale - seconds or a few minutes. It is vey effective with a multi-pass compiler to cache the output of one pass so it can be read in by the next. But this sort of thing has a relatively narrow window.

    So way you want to do is:
    • Apps which haven't been used for a time get swapped out.
    • Cached blocks decay with time, decaying faster if the system idles a lot (presumably the big jobs have stopped), slower if the system is very busy (more likely there is something to re-use cache)
    • As cache blocks decay out, BloatyApp is gradually sucked back in. In Gui Environments, the Window Manager flags the pressure to return as proportional to (say) the number of pixels of visible screen it occupies. Of course, having swapped out once, if it never restarts, you can throw it out second time if you need your cache back.

    So if the guy goes to leaving a big make running, it gradually pushed the big apps out while it runs. But if the big make completes, the apps start crawling slowly back in. If it hasn't finished when he comes back from lunch, he probably wants it to carry on running the make: since the CPU is at 100% load, he is probably not surprised it is sluggish.
  • by chrysalis (50680) on Friday April 30 2004, @10:03AM (#9018784)
    (http://00f.net/)
    http://00f.net/item/14/
    describe why swapping is _good_.

  • From what I've read by Wedge1212 (Score:2) Friday April 30 2004, @10:17AM
  • Turning off swap rocks (Score:4, Interesting)

    by squarooticus (5092) on Friday April 30 2004, @10:37AM (#9019122)
    (http://www.krose.org/~krose/)
    Best performance improvement I ever got with the 2.4 series kernels was shutting off swap. My machine immediately became more responsive. From that point forward, I wouldn't come back to the machine after an hour away and encounter a jerky X mouse cursor because the instant I turned off the screensaver the kernel had to page all 128MB of my applications back into the 512MB RAM because it decided buffer cache was more important than code.

    The 2.4 VM changes causing this behavior were awful, and it's too bad that I have to sacrifice a large (disk-based) physical address space, but I'm not going to put up with my applications being paged out when I have 4x as much RAM as code I'm running. Just allowing the system admin to put a limit on the size of the buffer cache would probably solve most of my problems, but instead I have to turn off swap. Too bad.
  • Hmph! What's the big deal? by Vee Schade (Score:2) Friday April 30 2004, @10:40AM
  • Keep two copies (Score:3, Insightful)

    by kasperd (592156) on Friday April 30 2004, @10:42AM (#9019184)
    (http://kasperd.net/~kasperd/ | Last Journal: Thursday July 08 2004, @10:18AM)
    Swapping out data before you need the free RAM would be a great idea if you kept two copies. One copy on disk and one copy in RAM. In fact it would be fine if the system swapped out 90% or more of the process memory this way. There will now be three different cases to think about.
    1. The process needs to read the page - no problem, one copy is in RAM just read it, and keep both copies.
    2. The process needs to write the page - no problem, you can modify the copy in RAM and discard the copy on disk. Notice that discarding the copy on disk doesn't require any disk access, as the list of swap allocations will typically be in RAM (it is much smaller than the swapspace).
    3. You actually need memory - no problem, discard a not recently used RAM page, you still have a copy on disk.
    The only problem is, that you need to make the page readonly, so you can trap the write and discard the on disk copy. In other words don't do this for pages that are frequently changed. But usually you don't have many pages that are frequently changed, and you certainly don't want to swap out those you have. And should you occationally happen to swap out one, it is not really a major problem. It will cost you a pagefault, but no disk I/O. And a pagefault is compared to a disk I/O. A system that behaves like I have described here would use a lot more space than Linux typically does, but still it should be faster. I wonder why this isn't done more often, it is not like the idea hasn't been known for years.

    Another problem that many have noticed, and that isn't easy to deal with, is heavy diskaccess causing the cache to grow and stuff getting swapped out. Yes even some Linux versions suffer from this problem. A Red Hat 9 system I had running for months was really slow in the morning, because all the programs had been swapped out while cron jobs where running during the night. But you never know when it is a good idea to swap the stuff out and when it is not. When the disk access is going on, the process page might not have been used for hours. But still you might want it to be kept in RAM. File pages that have been accessed just once shouldn't be kept in cache for long time. But of course you shouldn't remove them unless the memory was needed for something else. Removing the pages too early is also bad, because you wouldn't notice, that this was really a page that was going to be accessed frequently. Some people are fanatic, and don't want process pages to ever get swapped out to make room for cache. That isn't a good idea either. You can really have process pages that may not be needed even once, do you want such a page to be kept in ram for months just in case? And notice how disabling swap is not going to solve the problem. You still have to think about memory mapped files, that in many ways must be treated like anonymous mappings.
  • The answer is . . . "It depends" (Score:4, Interesting)

    by kmankmankman2001 (567212) on Friday April 30 2004, @10:52AM (#9019298)
    The universal IT answer of "It depends" applies here as well. Yes, having Mr. Bloaty App glob onto scads of memory that are then not referenced for long periods of time can have a negative impact on other apps if the system becomes memory constrained. And, Yes, if the memory manager swaps a bunch of unreferenced memory out to disk and Mr. User has to wait a long time for Mr. Bloaty App to become responsive because it was his memory that got swapped out, that's a problem, too. The ideal is to be able to address this (haha, bad pun) at the application level and not simply at a global level. This has been the standard on the mainframe (MVS, OS/390, z/OS) operating systems for a long time, where there is a very sophisticated virtual memory manager. If there are, say, a 100 apps and 2 of them are very sensitive to response time, most of them aren't, and 10 are just dead dogs you couldn't care less about how nice is it to be able to actually tell the system that? The 2 "loved ones" then receive preferential storage treatment at the expense of the other, "less loved ones" and the dead dogs are always first on the pecking order of who to steal storage from. The memory manager then is acting to maintain the responsiveness of the applications (the reasons we run OS's in the first place) to meet the needs and expectations of the user(s) (the reasons we run the Apps). Without that ability, arguing over "more swappy" vs. "less swappy" when it's only applied at a global, default, level is not especially productive except within the context of attempting to establish, perhaps, where the best general-use default happy setting is - for the general-use default system we all use (is that you? I know it's not me).
    • 1 reply beneath your current threshold.
  • Personally, I just try to keep my memory usage below the physical memory in my machine, but I guess that's not always possible...

    I've seen a number of posts echoing this point, overlooking one of the key reasons for swapping. It's not just because you're out of memeory for applications, it's because sometimes there are better things to be doing with your memory. Mainstream operating systems use otherwise unused memory to cache disk access, dramatically speading things up. If you've got an process that hasn't been run for a a while it may actually be more efficient to swap it to disk. This frees up memory to cache data that may be being hit quite frequently. inetd hasn't been needed for a while? Swap it out so that your disk cache is larger, benefitting your heavily used web server.

    To be fair, when to make that trade off is very tricky and will never work perfectly 100% of the time. Inevitably you'll occasionally be burned by a bad decision. But there are real benefits. The real question is not how to turn it off, the question is how to improve it and perhaps how to allow users to tune it for their needs.

  • For me, this issue is moot by WhiteWolf666 (Score:2) Friday April 30 2004, @12:11PM
  • adaptive algorithms (Score:3, Interesting)

    by mugnyte (203225) * on Friday April 30 2004, @12:13PM (#9020263)
    (http://morningcuppa.blogspot.com/ | Last Journal: Tuesday September 25, @02:23PM)
    Can someone please describe any adaptive algorithms that could be used. Specifically, I'm thinking of:

    - dirty marking unreferenced pages when swaped. if these mem pages are not used after the swap out, no need to swap them in again. i'm prety sure this already occurs

    - for process using high swap demands, increase their weighted priority for pages, with a window-averaged for swaps. so then, my database process could hog under load while my less-used apps may swap because they're used less often. could be taylored differently for code versus data segs.

    - page-impage comparisons to avoid holding duplicate code segment pages in memory. this plays with the concept of shared libs a bit, but could avoid duplicate pages, especially if this information is saved in a precalc'd hash table that is stored.

    just ideas.
  • Swap vs. disk/file cache (Score:3, Interesting)

    by Peaker (72084) <gnupeaker&yahoo,com> on Friday April 30 2004, @12:27PM (#9020442)
    (http://slashdot.org/)
    Is a difficult dilemma, but that's because an overly complicated scheme is used.

    There is a simpler and more powerful scheme that unifies swapping and disk caches, while allowing applications to persist between reboots, all with better performance than current systems!

    EROS implements [eros-os.org] such a system. Generally it is referred to as "Orthogonal persistence", and functionally it behaves as though the computer is "always on", and returns to the exact state it was in after a reboot. The thing is, with orthogonal persistence, the structure on the disk is not a file system, but just the application data.

    Since applications no longer work with the disk explicitly (open/read/write) but only with one type of memory (persistent memory), the OS manages all of the disk I/O, and it allows it to eliminate almost completely the largest delay in disk-work - the seek time in all writes. Since all application memory is just mapped to disk transparently, all RAM is just considered a "disk cache", and the kernel does not have to make nasty tradeoffs between disk caches (of explicit open/read/write calls) and virtual memory.

    Of course there is still a problem if large work-areas of unimportant applications "swap out" smaller areas of important applications. I suggest solving that by prioritizing pages to the memory manager. In a system like *nix it is not a problem. In more secure systems however (EROS, for instance), it may create additional covert channels between applications so it was avoided.
  • What's the default value? by menscher (Score:2) Friday April 30 2004, @12:57PM
    • 1 reply beneath your current threshold.
  • Swapping out unneeded stuff on bootup (Score:3, Interesting)

    by avij (105924) * on Friday April 30 2004, @01:23PM (#9021029)
    (http://eurobilltracker.net/)
    My approach has been to start all the needed services and then run this small perl script (which I named memhog.pl) to create a process that hogs quite a bit of memory:

    #!/usr/bin/perl -w
    use strict;

    my $a = "xxxxxxxxxx" x (131 *1024*1024);


    This is just a quick hack, you may want to adjust the size to suit your memory size. The server from where this script was copied has 2GB of memory. Essentially I want to page out all the stuff that doesn't get used after starting the server and the related server processes. Of course, given enough time the server would swap out those pages anyway, but this method just does it quicker. After the script has been run, the server will gradually swap in those pages it really needs. OK, doing this may be pointless but I don't care ;)
  • my swap is already turned off by xpyr (Score:1) Friday April 30 2004, @01:26PM
  • /proc? by shish (Score:2) Friday April 30 2004, @02:35PM
  • No swap involved by Handyman (Score:2) Friday April 30 2004, @03:48PM
  • Best of Both World by uslinux.net (Score:2) Friday April 30 2004, @04:17PM
  • These guys are on crack... (Score:3, Insightful)

    by shaitand (626655) * on Friday April 30 2004, @05:44PM (#9023669)
    (http://www.ganjablogger.com/ | Last Journal: Thursday January 05 2006, @05:36PM)
    Sorry but this is not a complex equation and I think these guys are getting wrapped up in too many details and missing the big picture.

    The harddrive is really, really, really, really fscking slow. In comparison Ram is really really really fast. As a result, you want to interact with the hard drive as little as you possibly can, and interact with ram instead as much as you possibly can (the only thing which beats that is interacting with only the cpu registers and avoiding ram and harddrive altogether).

    As is, linux doesn't even begin touching the disk until there is only enough ram left to turn on VM. Now this has a negative impact when that limit is reached because there is overhead turning it on.. this impact is negligable and tweakable since you can wait and see if you hitting the limit, add more memory, see again and reevaluate until you simply aren't swapping. This is a good thing.

    One of the worst things windows does is swap constantly. In fact beyond a certain point (read enough ram to run an XP desktop) the system swaps MORE if you have more ram. You boot the system with all uneeded services turned off and no startup processes and all the eyecandy turned off. And you've got 4gb of ram in the system, guess what, it's already using VM.

    Maybe VM management itself could be tweaked more, but it certainly shouldn't be used unless it absolutely has to (and if you don't have enough ram and it has to all the time then it's not like you suffer that performance hit more than once).

    The only exception to this I've found is a linux desktop running kde or gnome with about 256mb of ram, at that point the numbers seem to work out just about right(or wrong I should say) and the system is constantly turning VM on and off, encountering the performance hit again and again and again, with pretty much every operation you perform.
  • Gee ... two I/Os to swap, One I/O to reclaim cache by Totally_Lost (Score:2) Friday April 30 2004, @07:41PM
  • Sticky Bit!!! (Score:4, Interesting)

    by cgleba (521624) on Saturday May 01 2004, @12:13AM (#9025889)
    Here's a solution to the whole debate -- make the sticky bit have meaning under Linux like it does on other UNIXen -- if the sticky bit is set on the execuatble, do not swap it. If it is not set, the executable is free to be swapped. This solves the entire debate (for instance, if you don't want the 'interactive' mozilla process swapped, set the sticky bit on the executable).
    • 1 reply beneath your current threshold.
  • swapping and interactivity by nothings (Score:2) Saturday May 01 2004, @07:20PM
  • No "N*RAM" rule is logical. by argent (Score:2) Monday May 03 2004, @01:11PM
  • Swapping by GenomeX (Score:1) Tuesday May 04 2004, @02:03PM
  • 9 replies beneath your current threshold.