NetNews Usenet Archive 1993 #1

home *** CD-ROM | disk | FTP | other *** search

/ NetNews Usenet Archive 1993 #1 / NN_1993_1.iso / spool / comp / clients / 257 < prev next >

Wrap

Internet Message Format | 1993-01-06 | 4.5 KB

Path: sparky!uunet!usc!rpi!think.com!enterpoop.mit.edu!eru.mt.luth.se!lunic!sunic!aun.uninett.no!nuug!ifi.uio.no!steinar From: steinar@ifi.uio.no (Steinar Kj{rnsr|d) Newsgroups: comp.client-server Subject: Re: NFS slowness Message-ID: <CMM.0.90.2.726329963.steinar@fram.ifi.uio.no> Date: 6 Jan 93 14:19:23 GMT References: <1993Jan4.175456.20845@spectrum.xerox.com> Sender: steinar@ifi.uio.no (Steinar Kj{rnsr|d) Organization: Dept. of Informatics, University of Oslo, Norway Lines: 81 Nntp-Posting-Host: fram.ifi.uio.no In-Reply-To: weltman@adoc.xerox.com (Rob Weltman)'s message of Mon, 4 Jan 1993 17:54:56 GMT Originator: steinar@fram.ifi.uio.no In article <1993Jan4.175456.20845@spectrum.xerox.com> weltman@adoc.xerox.com (Rob Weltman) writes: > I received a number of informative responses to my question > about why large NFS writes are so slow, and a few suggestions on > how to improve performance, and where to look for more info on > tuning NFS. The following are all the responses (hope that's OK > to post; everybody responded directly to me, so people on the > net can't really know what's been communicated). > > Rob > For the sake of completeness, I would like to add a few comments to one of Guy Harris' very informative answers, partly cited below: >> >> From auspex.com!guy Thu Dec 31 11:08:24 1992 >> From: guy@auspex.com (Guy Harris) >> To: jmm@imatron.com >> Subject: Re: NFS and RPC discussion. >> Cc: weltman@parc.xerox.com >> Status: RO >> Content-Length: 5983 >> X-Lines: 116 >> >> [stuff deleted] >> >> Normally, at least on most UNIX systems, "some form of storage that >> survives a server crash" means "the disk", because the UNIX buffer cache >> or page pool is *not* necessarily preserved if the system crashes and >> reboots (although a crash *might* cause the OS to try to sync data out >> to disk, that isn't guaranteed to work). >> >> The Prestoserve board acts as "some form of storage that survives a >> server crash", but can be written to faster than can a disk; it's static >> RAM with a battery backup (so that even if the crash is caused by a >> power failure, the data will still survive the crash). >> >> Some vendors, including SGI, HP, and Auspex, have an *option* for their >> NFS server code that allows the system administrator to specify that >> writes to a particular file system by the NFS server should *not* wait >> until the data makes it to "some form of storage that survives a server >> crash" before being marked as "done". True. However, if not exactly documented, it's piece-of-cake to patch up this functionality on any UNIX system with NFS code derived from the original SUN src. The patch can be applied directly to the binaries. (in fact, we use Emacs for this purpose ..) The NFS server source-module in question is 'nfs_server.c', and all that needs to be done is to change the flag IO_SYNC (=4) to zero, two or three places down the code if I remember right. As said above, the net effect is that NFS writes are now asynchronous, more or less like local buffered writes. Instead of the (expensive?) Prestoserve card, you are now using the server's buffer cache for write caching, of course with the potential (but small) risk of loosing data when the server crashes. We have been running with this patch on all our DECstation and SPARC NFS servers for long time now, and have encountered no problems due to server crashes so far. Initially, we ran the original Legato NFS benchmark (nhfsstone) and measured NFS write throughput with the patch installed and compared the results with those supplied by Legato to document the functionality of the Prestoserve card. Not surprisingly, the throughput was equally good (and even slightly better some times). And the interactive performance improvement is really BIG, believe me! >> We (Auspex) do not make that the default; we use it here only for >> diskless clients' swap space. SGI *does* make it the default, claiming >> that crashes are rare enough that you're not actually going to lose any >> data. Sun and SGI folk spend a fair bit of time yelling at one another >> about this.... >> Yes they do, and rumours says that when speaking of NFS "write-security", the SGI folks thinks it's better to have UDP checksumming turned *ON* than running with the IO_SYNC flag set. Although affecting NFS performance, this is another issue though. ! Steinar Kjaernsroed Dpt. of Informatics email:steinar@ifi.uio.no ! University of Oslo, or | NORWAY ..!mcsun!ifi!steinar