- Path: sparky!uunet!charon.amdahl.com!pacbell.com!sgiblab!zaphod.mps.ohio-state.edu!pacific.mps.ohio-state.edu!linac!att!bu.edu!news.bbn.com!noc.near.net!news.centerline.com!jimf
- From: jimf@centerline.com (Jim Frost)
- Newsgroups: comp.windows.x
- Subject: Re: FLAME, FLAME ON X!!!
- Message-ID: <1ebs7rINNp9k@armory.centerline.com>
- Date: 17 Nov 92 22:37:47 GMT
- References: <1683@igd.fhg.de> <1992Nov9.235741.25166@dsd.es.com> <RJC.92Nov13144522@daiches.cogsci.ed.ac.uk> <1992Nov16.162457.2923@osf.org>
- Organization: CenterLine Software, Inc.
- Lines: 70
- NNTP-Posting-Host: 140.239.3.202
-
- drand@spinner.osf.org (Douglas S. Rand) writes:
- >|> The word is `Fragmentation'.
-
- >There are a number of reasonable solutions to memory fragmentation. It really
- >is up to a decent malloc implementation to attempt to avoid fragmentation.
- >I'll illustrate with a well known method which we used on Prime's EMACS and
- >which I believe libg++ uses.
-
- >Allocate all memory in set-size chunks.  Sizes can be powers of two,
- >for example.  Freeing a block places it on a free list for that size.
- >Allocation requests first go to the most natural next size block and
- >progress upward in size.  Optionally break larger blocks if no smaller
- >blocks are available.  Optionally run the block lists on occasion and
- >join smaller contiguous blocks into larger blocks.
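-
- (For readers who haven't seen this kind of allocator, here's a minimal
- sketch in C of the power-of-two free-list scheme described above.  The
- bucket sizes, the names, and the plain malloc() fallback are my own
- assumptions, and the optional splitting and coalescing steps are left
- out.)
-
-     #include <stddef.h>
-     #include <stdlib.h>
-
-     #define MIN_SHIFT 4                   /* smallest bucket: 16 bytes */
-     #define NBUCKETS  16                  /* largest bucket: 512K      */
-     #define MAX_BLOCK ((size_t)1 << (MIN_SHIFT + NBUCKETS - 1))
-
-     struct block { struct block *next; };
-
-     static struct block *freelist[NBUCKETS];    /* one list per size */
-
-     /* Map a request to the bucket whose block size is the next power
-        of two at or above it. */
-     static int bucket_for(size_t n)
-     {
-         int b = 0;
-         size_t sz = (size_t)1 << MIN_SHIFT;
-
-         while (sz < n && b < NBUCKETS - 1) {
-             sz <<= 1;
-             b++;
-         }
-         return b;
-     }
-
-     void *p2_alloc(size_t n)
-     {
-         int i, b;
-
-         if (n > MAX_BLOCK)
-             return malloc(n);             /* too big for any bucket */
-
-         /* Try the natural size first, then progress upward. */
-         b = bucket_for(n);
-         for (i = b; i < NBUCKETS; i++) {
-             if (freelist[i] != NULL) {
-                 struct block *blk = freelist[i];
-                 freelist[i] = blk->next;
-                 return blk;
-             }
-         }
-         /* Nothing suitable is free: get fresh memory. */
-         return malloc((size_t)1 << (MIN_SHIFT + b));
-     }
-
-     /* A real allocator would record each block's true bucket in a
-        header; this sketch trusts the size the caller passes in. */
-     void p2_free(void *p, size_t n)
-     {
-         struct block *blk = p;
-         int b;
-
-         if (n > MAX_BLOCK) {
-             free(p);
-             return;
-         }
-         b = bucket_for(n);
-         blk->next = freelist[b];
-         freelist[b] = blk;
-     }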
-
- Unfortunately, power-of-two allocation is remarkably wasteful of
- virtual memory (which means wasteful of real live swap space on many
- systems).  It works fine for very tiny allocations, but once you start
- allocating "unusual" sizes like 80-byte segments you see major memory
- waste as well as relatively slow free-element location compared to
- several other techniques.
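-
- (To put rough numbers on that claim, here's a throwaway C snippet, not
- taken from any real malloc, showing how much of each block goes unused
- when "unusual" request sizes are rounded up to the next power of two.)
-
-     #include <stdio.h>
-
-     /* Smallest power of two >= n, for n > 0. */
-     static unsigned long round_pow2(unsigned long n)
-     {
-         unsigned long p = 1;
-
-         while (p < n)
-             p <<= 1;
-         return p;
-     }
-
-     int main(void)
-     {
-         unsigned long sizes[] = { 80, 200, 520, 1100 };
-         int i;
-
-         for (i = 0; i < 4; i++) {
-             unsigned long blk = round_pow2(sizes[i]);
-             unsigned long waste = blk - sizes[i];
-
-             printf("request %lu -> block %lu: %lu bytes (%.0f%%) wasted\n",
-                    sizes[i], blk, waste, 100.0 * (double)waste / (double)blk);
-         }
-         return 0;
-     }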
-
- The two best generic allocation systems I've run into are
- limited-cluster (usually just called cluster) and a nifty tree system
- done by a research guy at IBM.
-
- Limited-cluster systems predict that most programs do a lot of little
- allocations of the same few sizes (up to, say, 128 bytes) but fewer
- large ones of widely varying sizes, and therefore set aside space for
- those little allocations up front.  Managing allocations and frees of
- the little allocations is dirt cheap, and you get very little
- fragmentation (and a lot of locality to boot).  Some systems I've seen
- use power-of-two fitting for the clustered allocations, but I
- personally believe you get better overall behavior from linear size
- increases, since they are far less wasteful after the first few power
- steps (and in fact most linear fitting schemes are practically the
- same as power fitting for those first few steps).  Large allocations
- are done as necessary and fitted as exactly as possible.  The big win
- is keeping the many small allocations in most programs away from the
- few big ones, which keeps fragmentation down.
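-
- (A rough illustration of the idea in C; this is not code from any
- particular system.  The 16-byte class step, the 128-byte small/large
- cutoff, the cluster size, and the names are all my own assumptions,
- and large blocks simply fall through to malloc() and free().)
-
-     #include <stddef.h>
-     #include <stdlib.h>
-
-     #define STEP         16                /* linear size-class step   */
-     #define SMALL_MAX    128                /* anything bigger is "big" */
-     #define NCLASSES     (SMALL_MAX / STEP)
-     #define CLUSTER_SIZE 4096               /* bytes carved per cluster */
-
-     struct cell { struct cell *next; };
-
-     static struct cell *small_free[NCLASSES];   /* per-class free lists */
-
-     /* Carve a fresh cluster into equal cells and thread them onto the
-        class's free list. */
-     static void add_cluster(int cls)
-     {
-         size_t cellsz = (size_t)(cls + 1) * STEP;
-         size_t off;
-         char *base = malloc(CLUSTER_SIZE);
-
-         if (base == NULL)
-             return;
-         for (off = 0; off + cellsz <= CLUSTER_SIZE; off += cellsz) {
-             struct cell *c = (struct cell *)(base + off);
-
-             c->next = small_free[cls];
-             small_free[cls] = c;
-         }
-     }
-
-     void *cl_alloc(size_t n)
-     {
-         struct cell *c;
-         int cls;
-
-         if (n == 0 || n > SMALL_MAX)
-             return malloc(n);       /* big: fit as exactly as possible */
-
-         cls = (int)((n - 1) / STEP);     /* round up to the next class */
-         if (small_free[cls] == NULL)
-             add_cluster(cls);
-         c = small_free[cls];
-         if (c == NULL)
-             return NULL;
-         small_free[cls] = c->next;
-         return c;
-     }
-
-     /* As with the sketch above, the caller passes the size; a real
-        allocator would find the owning cluster from the address. */
-     void cl_free(void *p, size_t n)
-     {
-         struct cell *c;
-         int cls;
-
-         if (n == 0 || n > SMALL_MAX) {
-             free(p);
-             return;
-         }
-         cls = (int)((n - 1) / STEP);
-         c = p;
-         c->next = small_free[cls];
-         small_free[cls] = c;
-     }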
-
- Limited-cluster allocation systems are commonly used when memory is
- not a major issue but heap maintenance overhead is.
-
- The tree allocation system done at IBM is more complicated, ordering a
- tree of memory fragments by both address and size.  The address
- ordering makes defragmentation a snap, since fragments that are
- adjacent in memory end up as neighbors in the tree, and the size
- ordering makes finding a good fit easy.
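-
- (I haven't seen the IBM code, but that description matches what is
- sometimes called a Cartesian tree of free fragments: a binary search
- tree on address that is simultaneously heap-ordered on size, so the
- root is always the largest free block.  Here's a bare-bones node
- layout and a leftmost-fit search to show the idea; the names are mine,
- and insertion, deletion, and coalescing are omitted.)
-
-     #include <stddef.h>
-
-     /* One free fragment.  The left subtree holds lower addresses, the
-        right subtree higher addresses, and no child is ever larger than
-        its parent. */
-     struct frag {
-         char        *addr;
-         size_t       size;
-         struct frag *left;
-         struct frag *right;
-     };
-
-     /* Find the lowest-addressed fragment of at least `want' bytes.
-        The size ordering lets us skip any subtree whose root is
-        already too small. */
-     struct frag *find_fit(struct frag *t, size_t want)
-     {
-         struct frag *l;
-
-         if (t == NULL || t->size < want)
-             return NULL;          /* nothing below here is big enough */
-         l = find_fit(t->left, want);
-         return l != NULL ? l : t;
-     }
-
- The address ordering pays off when a block is freed: the fragments
- just below and just above it in memory are the new node's in-order
- neighbors in the tree, so merging adjacent free memory never needs a
- separate scan of the heap.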
-
- It's an interesting system -- interesting enough that IBM shifted from
- a power-of-two allocation system to their research system between
- release 3.1 and 3.2 of AIX, largely due to customer complaints about
- overwhelming memory waste in the 3.1 malloc(*). There's an article in
- the IBM Journal of Research and Development on the topic but I don't
- have the issue handy.
-
- Informationally yours,
-
- jim frost
- jimf@centerline.com
-
- (*) Many of the observed performance problems may have been due to
- their technique of recycling freed allocations only when the available
- heap is exhausted, rather than using a more aggressive reuse policy.
- Long recycle times lead to poor heap locality and
- much-higher-than-necessary paging rates.  A power-of-two allocator
- that aggressively reuses allocations would have fewer problems in this
- regard, although locality would still be hurt for allocations that
- were not particularly well fitted to a power of two.  The AIX 3.2
- system is far more aggressive about reuse.
-