C Coding Tip - Self-Managed Memory Allocation
An anonymous reader writes: "The C programming language defines two standard memory management functions: malloc() and free(). C programmers frequently use these functions to allocate buffers at run time to pass data between functions. In many situations, however, you cannot predetermine the actual sizes required for the buffers, which causes several fundamental problems for constructing complex C programs. This article advocates a self-managing, abstract data buffer. It outlines a pseudo-C implementation of the abstract buffer and details the advantages of adopting this mechanism."
Um (Score:4, Interesting)
Re:Um (Score:5, Insightful)
And malloc is of course free, right? ("Well no, Wally, they're opposites.")
Good GCs operate incrementally. Good GCs let you turn collection on and off at will and disable it altogether for designated arenas. Good GCs can run in a separate thread on another CPU, whereas malloc/free cannot.
The reason Java's GC goes wiggy is not that the GC is bad (it's just not very tunable except on Solaris); it's that Java allocates new objects all over the place (and is happily helped at it by the standard libraries). If you go hog wild with resource consumption, yes, you're going to pay for it later.
For the 99.99% of programs that do NOT need hard realtime, you're better off with gc. Cripes, it's like saying homes shouldn't have thermostats because a home thermostat isn't suitable for a reactor sensor.
Re:Um (Score:5, Informative)
Of course, there are also hard real-time garbage collectors (ie Cheng's [nec.com]), though I don't think you'll find them in general-purpose production compilers. However, you will find good garbage collectors in a number of real production compilers (say, in mlton [mlton.org]). It's definitely worth benchmarking.
Re:Um (Score:5, Insightful)
While I don't think GC is quite to the point where it is free or beneficial to the performance of the average application, it is a lot less harmful than most people think. Given that it simplifies the code and eliminates a lot of bugs (usually more than it introduces), it is definitely worthwhile in almost all new application code (kernel-mode code isn't quite there yet, but it's coming), with only a small performance penalty. And I suspect that it won't be too long before it starts to be more of a speed booster, not a perf hit.
I think this is just another step in the process of handing another menial task over to the CPU. We moved from binary to assembly, assembly to low-level languages, low-level languages to higher-level languages, etc. At each step, the new method had a performance penalty at first, then as the new method matured, it turned out to actually be faster than the old method it replaced, while dramatically increasing programmer productivity (i.e. modern optimizers can usually do a better job than an assembly language programmer; often C++ code is faster than the equivalent C code since the compiler has more information to work with and the programmer can make use of more effective techniques like templates).
Re:Um (Score:1)
Re:Um (Score:2, Informative)
Forget C++! High-level modern multi-paradigm languages like Common Lisp and OCaml, which can do most things C++ can do and a lot of things it can't, often produce code as fast as the average C implementation of any given algorithm, despite their relying heavily on GC. And while it's generally possible to tweak the C version t
Re:Um (Score:1)
So the point is that people shouldn't be afraid of "high level" features like GC. While
Re:Um (Score:2)
Re:Um (Score:1)
You lucky dog. I've currently got the fun job of trying to write a hard realtime application for a DSP chip which does not have a C++ (or high level language except C) compiler. For me, this technique looks pretty interesting.
Re:Um (Score:1)
Re:GC in one thread is the fatal bottleneck (Score:1)
Re:Um (Score:3, Interesting)
I've heard that for years, it's yet to be true for the general case ... and most people doing manual allocation don't call malloc/free
Re:Um (Score:2, Insightful)
Caching slab, lookaside lists, whatever, you're still calling some kind of allocator with some kind of cost. There are definitely applications where memory allocation/deallocation follows a pattern that allows that kind of optimization to work well. But here is the
Re:Um (Score:2)
Let me prefix this by saying that I didn't want to get into a GC flamewar; as I said, I think GC can be used in some applications where you don't care about the negative side effects. I only replied to you because you seemed to be giving one of the better "Let's burn everything and rebuild using only GC" arguments. Possibly some of the difference is due to you working for MSFT and me being at RHAT, but I doubt it ... and if you don't hold it against me I'll do the same :). So anyway...
Re:Um (Score:1)
To make sure that I wasn't just blowing steam, I did a little bit of looking around, reading some research papers, etc. Here is what I came up with:
Re:Um (Score:2)
Well it's hard to argue with no links :). But your point that large objects are easier on a malloc()/free() seems like common sense. It's much more often that you'd write a custom allocator for small objects.
Re:Um (Score:5, Interesting)
So in the presence of exceptions, you won't leak memory on the heap. But you will leak mutexes, file handles, etc. You need another idiom to handle those cases.
In the .NET world, C# introduced synchronization blocks to handle the leaking mutex problem. But it is a pain in Managed C++ and VB.NET.
Garbage collection is not the be all and end all.
If I ruled the world, I would create a multi-paradigm (object-oriented, generic, functional, and modular) strongly-typed low-level language that let you program at a high level, plus a second high-level language that was loosely typed, garbage collected, and could be interpreted or natively compiled. Then I would define a standard to interface the two languages.
In other words, take C++ and add the concept of components/packages. Take Python and add the features (such as generics) that are missing from C++. And then define an interface between components written in both languages.
Currently Boost.Python and SWIG exist. But I wish that they would just work automagically, every time I typed make at the command line or build in VC++.
Re:Um (Score:5, Insightful)
You just described Scheme/CL/Dylan.
Re:Um (Score:2)
Re:Um (Score:1, Insightful)
Re:Um (Score:1)
Re:Um (Score:3, Funny)
My GC does something like this:
1) Allocate memory, reference count 1
N-1) When ref count reaches 0, we call destroy() on the object
N) Free the memory
Re:Um (Score:2)
1) Allocate objects A, B, and C, each with reference count 1
2) Link them into a cycle: A->B, B->C, C->A (each count rises to 2)
3) Dispose of the root references to A, B, and C (each count falls back to 1, never 0)
4) Have fun with your memory leak.
Re:Um (Score:2)
Re:Um (Score:3, Interesting)
So in the presence of exceptions, you won't leak memory on the heap. But you will leak mutexes, file handles, etc. You need another idiom to handle those cases.
Java is probably the most widely-used garbage-collected language in existence. I think I speak for all Java programmers when I say "WTF are you talking about?"
It is true that you
Re:Um (Score:5, Interesting)
He's talking about the unpredictability of resource release using GC. If unpredictability isn't a problem, fine. If you need to synchronize your resources carefully (which is what a mutex is
Now, this article is about C, so let's compare the two.
The post you're responding to wasn't about C: it was about a weakness of GC compared to, say, RAII (which is the idiom C++, among others, uses). But just for fun, let's go on to see how little you know about C.
Java: failing to close a file == usually no problem whatsoever
Unless you need to open the file again before the garbage collector decides to reclaim the handle.
C: failing to close a file == permanently leaked handle
Bzzt... wrong. All file handles are released upon program termination. I like how you used '==' to try to impress people with your programming skillz, though.
As far as the other case you mention, mutexes, goes, Java has two means of providing mutual exclusion. The "synchronized" keyword
Wait, wait... you're saying... hold on a second, now... that Java uses a different idiom to handle mutexes? That's exactly what the parent post said it would have to do... because GC isn't as useful as RAII when it comes to general resource allocation (not just memory).
But you make it sound as if garbage collection is a step backwards from malloc/free
He made no such comparison. He compared it (unfavorably) to RAII.
Re:Um (Score:2)
"Next, all open streams with unwritten buffered data are flushed, all open streams are
closed, and all files created by the tmpfile function are removed."
Maybe you should learn the language a little better.
Re:Um (Score:2)
Re:Um (Score:2, Interesting)
My guess is that he or she was referring to C++ using the RAII (resource acquisition is initialization) paradigm. Every resource is wrapped in an object and all objects are put on the stack. Voila, no resource leaks.
Also, I think your example is far from complete. While failing to close a file handle may be "no problem whatsoever" for the application itself, it can be a pr
Re:Um (Score:2, Insightful)
Re:Um (Score:2)
Would you call C++ strongly typed if the keyword were "typealias" instead of "typedef"?
Even enums aren't distinguishable from ints!
Bzzt, wrong! Try to assign an int to an enum without a cast... You'll get an error.
Re:Um (Score:1)
I'm afraid I only know of 'typealias' in Fortran (where it is synonymous with 'typedef') and in the Meta Object Facility (MOF) of CORBA. I have found reference to a python class of that name, but know nothing of it.
Would you be able to provide me a reference? Ta.
You are correct in
Re:Um (Score:2)
I was trying to make it clear that typedef doesn't define or declare a new type, but an alias to an existing type. IOW the keyword "typedef" is misleading, but it doesn't make C++ weakly typed.
The cast, however, provides no type checking.
Yes, that's a design feature of the language. The compiler errs out if you implicitly try to cast incompatible types, but when you explicitl
Re:Um (Score:2)
And if I ruled the world, me and Salma Hayek would... I'm sorry, what was the question?
Re:Um (Score:2)
Which means you don't have predictable destruction. Which means you don't have destructors. Which means you can't use idioms like resource-allocation-is-initialization.
In standard Python you have both garbage collection and predictable destruction.
In other words, take C++ and add the concept of components/packages. Take Python and add the features (such as generics) that are missing from C++. And then define an interface between components written in both langauges.
If you are using Python for your
Re:Um (Score:2, Informative)
In the
File/mutex/whatever leakage in the presence of exceptions? Two words:
unwind-protect
Re:Um (Score:1)
Weeeee Lets implement Slab allocators (Score:2)
Gee, isn't that handy (Score:5, Interesting)
The fundamental problem is that this sort of thing needs to be done at the C library level. And if it's not done in a flexible fashion, you end up with a library call that rarely gets used. Anyone used hsearch() lately?
If only clib streams (FILE* and friends) were extensible, this article would never have had to be written.
c.
Re:Gee, isn't that handy (Score:2)
Take a look at Vstr [and.org]. I think it's pretty flexible ... and it certainly has much better-researched documentation than the content of this IBM "research" article.
Re:Gee, isn't that handy (Score:2)
glibc's fmemopen() moots most of the IBM article, I think, but since I don't code exclusively in a glibc environment... Grrr... If only POSIX specced out FILE* a bit tighter...
c.
Re:Gee, isn't that handy (Score:2)
The io_* API is part of the examples, and not in the library itself. But it's a pretty small wrapper over what's in the library ... and, yes, the library itself was designed so it could do non-blocking IO [and.org] (which FILE* can't). So in that regard, yeh, I don't tend to use FILE* anymore.
Please (Score:5, Funny)
realloc (Score:2)
Re:realloc (Score:2)
Re:realloc (Score:5, Interesting)
When you call realloc, you're very likely to cause the data to be copied from the old buffer to the new buffer, which is very high overhead. The article discusses how to do similar things without this unnecessary copying (i.e., lower overhead). It's actually not that interesting an article, as what it describes is hardly new, and I believe any competent programmer could come up with that solution when faced with the particular circumstances that inspired it.
Realloc works by seeing if there is free memory after the end of the allocated block, and changing the block's size if so. Realloc can do this because it knows about the internals of the malloc/free implementation. If there is allocated memory right after the block in question, a new block must be allocated, as you cannot "move" the later block in a language like C where any memory location can be a pointer. You could try this kind of stuff in other languages (or in some bastardized C where you do not have direct access to memory, but go through more indirection, the next logical abstraction after the article), but when you start automatically finding/checking/updating memory pointers, you get into GC.
You may be able to overcome some overhead on realloc if you move the problem down into the kernel. The kernel could play page table games so there is little or no actual copying involved, just updating of page tables. This would be fairly easy to implement, but I don't think anyone's done it because (a) flushing the relevant TLB entries could hurt performance more than the copying, and (b) the system call overhead might be more overhead than the copying. Realloc is generally only used for small buffers (due to programmers knowing about the copying overhead) and this trick would only have gains for large buffers spanning multiple pages. For small buffers, the library-level realloc could avoid the system call and do the copying itself, avoiding system call overhead and TLB entry flushes.
This scheme I describe could make for an interesting paper (especially determining for what size of buffer and what type of program it has gains), but I doubt it would make much difference in real system performance as programmers avoid realloc for large buffers, and there are very few cases where one needs direct linear access to a large range of memory rather than being better-served by organizing that memory into some data structure.
Re:realloc (Score:2)
One way to use realloc better is to keep a current length and a max capacity, and to allocate more memory than needed each time you grow. This gives better locality than a linked list, and you can always use realloc to trim the block down to the current length. Linked lists cause many problems for prefetching and fast memory access, so they are slow when you are walking down a list. You can do a tree using one list also, by having an invers
Re:realloc (Score:3, Insightful)
Care to back that up with some [and.org] facts [and.org].
So you a) have the library code do that ... and b) have lots of tests. Of course you want to do both of those for something using a single block of memory too, and if you want it to be efficient it usually does something clever to avoid copies ... and so is probably more likely to screw up and use/deallocate th
Re:realloc (Score:2)
Linked lists require links -- this puts pressure on the L1 cache. Linked list access is also necessarily serial, meaning that you lose all parallel execution potential (from high level software all the way down to the CPU core). Sub-blocked operations require full iteration through the intersection. And finally, of course, there is the small matter of random access.
Re:realloc (Score:3, Informative)
I said facts, not theories. Yes, I know all about cache and his friend random access. Maybe you'll take a noticeable cache hit when moving between nodes ... and yes, certainly doing memcpy() over a single large block X will be faster than doing it X/20 times for 20-byte nodes (20 was the figure given in the IBM "research" article). But that doesn't take into account the time taken to call realloc() each time to expand the large block ... o
Re:realloc (Score:2)
AFAIK even glib has these.
Re:realloc (Score:2)
Re:realloc (Score:2)
Realloc can work well if you always malloc the largest buffer you possibly need, then realloc smaller when the real size is known. For that to work well, the allocations either need to be serial (that is malloc, load, realloc), or you also have a need for smaller blocks that will fit in to the space made by reallocing down.
In some systems such as where you are the only process on the CPU, you can do compression (copy down) when waiting for external events (like an interrupt). At that point, the cycles and
Memory Allocation (Score:5, Funny)
Hmmm (Score:5, Interesting)
This is simple to do, and avoids a lot of errors. It's also not much of a headline.
Re:Hmmm (Score:3, Insightful)
Um, this is reasonable how? If you're coding on a 3GHz dream machine w/4GB of 400MHz RAM, there will be somebody out there who, quite reasonably, will want to run your code on a 64MB Pentium 133... Testing only on unencumbered machines makes for delusional developers.
I'm split: either yer trollin'
Hmmmm (Score:2)
Also, I'm not suggesting you allocate a 30 meg buffe
Re:Hmmm (Score:2)
Note that in Linux and many other OSes, malloc and friends don't actually cause physical memory to be committed to the process; they just create an unmapped virtual area. RAM is only committed when a write to a page faults it in. Reading from such a virtual area faults in THE zero page (one page in the whole system containing zeros); writing to a zero-mapped page faults in a fresh, private page.
So, you can allocate a 1GB buffer and read it all to verify that it contains only zeros, at a cost
Re:Hmmm (Score:3, Insightful)
While I agree the IBM "research" article is terrible, the idea behind it isn't.
Actually, having done tests [and.org] and benchmarks [and.org], I can safely say:
Just to be clear... (Score:2)
Your benchmarks, on the other hand, are a good headline. Going into a project, you usually have a fair idea of your options for memory management and how long they'll take to implement. However, you don't always have a good grasp of the performance implications - your breakdown is handy.
Re:Hmmm (Score:1)
It has been my experience that this was done when systems were constrained, i.e., test to see if there is enough system memory at start-up instead of running out of memory in the middle of execution. This was apparently so prevalent that some vendors (SGI for one) changed malloc to use a "first touched" memory model. In this model,
Indeed (Score:2)
Mostly I just hate people to be doing lots of work in C to save 30k of system memory - and ending up with a buggy program full of memory leaks. Many apps have data sets this small (30k) and yet are spending lot
Wow! (Score:5, Funny)
Now, I'll need a nice short catchy name for it... oh! I know! I'll call it a heap!
Useful (Score:3, Interesting)
The solution is one of the "bad ones"... (Score:1)
And the proposed solution requires both parties to stick to the common abstract buffer interface.
Hmmm!
A memory leak IN THE SPECIFICATION?!? (Score:5, Funny)
From the article:
pLostBlock?!? This almost sounds as if it's designed to leak!
-- MarkusQ
P.S. Seriously, I think this is a fine idea, if not particularly earth shaking. But the typo was too ironic not to point out.
Vstr (Score:5, Informative)
The article basically proposes a very bad implementation of Vstr [and.org]. Most of the advice is extremely simplified at best, and more likely just uninformed: an "efficient" abstract buffer that mixes shorts and pointers (words almost fail me); solving the problem of "what do you do with the data when it's all in the buffer" by just copying it back out again (hey, what's a couple of extra copies between friends?); representing in-memory object sizes with "long int". *sigh*
If you are interested in the article, go read this explanation of why you want it for security [and.org] and this explanation of why you want it for speed [and.org].
Vstr is LGPL, has actual benchmark data behind the block sizes it picks, has an extensive test suite ... and has documentation for the many functions that come with the library (including a fully compliant printf-like function). Of course, I don't have a PhD ... but after reading this, you might well count that as a plus too
Re:Vstr (Score:2)
FWIW, this is not as trivial as one might think (from your project, I suspect you're aware of the issues involved).
I've been working on a program that ptrace()s another program and has to keep track of ranges of memory. Just because I like doing things properly, I figured that using a void * to store memory range offsets would be sufficient. Problem is, C doesn't define some operations that I needed to do (like modulus) on pointers. So I get t
Re:Vstr (Score:2)
Well I meant the sizes, not
Not News (Score:4, Informative)
Re:Not News (Score:2)
I do a lot of work in vxWorks. The version that I routinely use in supporting a legacy product (5.4) doesn't do any collection on the free'd spaces. Thus, you can very easily kill the system by doing something like this:
char *ptr;
while (1)
{
    ptr = malloc(rand());
    free(ptr);
}
Eventually you'll wind up where memory is so fragmented that it can't find an appropriately sized chunk for a given size, even though there's plenty of memory available.
It's easier
Re:Not News (Score:2)
SCO owns C++ (and C too?) (Score:1, Offtopic)
Darl McBride, SCO
WTF?! It's true, he said that. Read more here [zdnet.com] and here [groklaw.net]
So use C++ already (Score:4, Insightful)
Yes, C++ has a host of problems, and Stroustrup and the C++ committee refuse to fix them. But the STL is a huge improvement on malloc/free. (They still can't get auto_ptr right, though.)
I must be missing something (Score:1)
unsigned char *buffer = 0;
b(&buffer);
free(buffer);
Re:2 words for this article.. (Score:1, Offtopic)
Re:This isn't Slashdot worthy (Score:2)
On my Linux box, if I malloc() a megabyte buffer, but only ever write to the first page of that buffer, the VM system will only ever hand me a page or so to use.
Probably a bit oversimplified, since overcommits might cause pressure to write out buffer data, but still...WTF is this guy thinking?
Re:What a crap story... (Score:2, Funny)