I have a problem with NFS and NIS data corruption across my network. Some
of the "blame" seems to be in my cisco routers - HOWEVER - I'm not willing to
point any fingers yet. I want to get more data. Since the problem could
be in a lot of different places and I have a couple of vendors who don't want
to take 'credit' for the problem, I'm soliciting suggestions on the best way to build an 'iron clad' demonstration of the problem.
I have a short script that runs 'sum' on large files, which forces them to be moved across the network to be summed. I know what the sum 'should' be, and when a file is corrupted I get an error message. This tells me that an error occurred, but not why. Any suggestions?
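One way to get closer to the 'why' might be to keep the corrupted copy whenever the sum fails and compare it byte for byte against a known-good local copy. The rough Python sketch below does that comparison. The names `digest` and `diff_ranges`, the two file arguments, and the 8 KB block size are my own assumptions for illustration, not part of the existing script.

```python
import hashlib
import sys

BLOCK = 8192  # assumed block size; adjust to the NFS read size in use


def digest(path):
    """Return the SHA-256 hex digest of a file, read BLOCK bytes at a time."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(BLOCK), b""):
            h.update(chunk)
    return h.hexdigest()


def diff_ranges(good_path, bad_path):
    """Return (start, end) byte ranges, end exclusive, where the files differ.

    A length mismatch is reported as a differing range at the tail.
    """
    ranges = []
    start = None  # start of the differing run currently open, if any
    offset = 0    # file offset of the first byte in the current block
    with open(good_path, "rb") as g, open(bad_path, "rb") as b:
        while True:
            gc = g.read(BLOCK)
            bc = b.read(BLOCK)
            if not gc and not bc:
                break
            n = max(len(gc), len(bc))
            for i in range(n):
                same = i < len(gc) and i < len(bc) and gc[i] == bc[i]
                if not same and start is None:
                    start = offset + i
                elif same and start is not None:
                    ranges.append((start, offset + i))
                    start = None
            offset += n
    if start is not None:
        ranges.append((start, offset))
    return ranges


if __name__ == "__main__" and len(sys.argv) == 3:
    # Hypothetical usage: python nfsdiff.py known_good_copy corrupted_copy
    good, bad = sys.argv[1], sys.argv[2]
    if digest(good) == digest(bad):
        print("files match")
    for start, end in diff_ranges(good, bad):
        print(f"bytes {start}-{end - 1} differ "
              f"({end - start} bytes, offset {start % BLOCK} within its block)")
```

If the differing ranges turn out to have a regular size or alignment (a whole block, a fixed offset within each block, a single flipped byte), that pattern may help narrow down which device is at fault. The corrupted copy should be saved locally at the moment the sum fails, since a later re-read over NFS may not reproduce the same bytes.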
I've also looked at the LAN traffic and discovered that UDP checksums are turned off for NFS packets. Unfortunately, the net isn't reliable enough to run without them. We are getting the AIX change that will turn the checksums on. Any other ideas on this point?
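For checking captured packets by hand, here is a minimal Python sketch of the UDP checksum rule as I understand it from RFC 768: a checksum field of zero means the sender generated no checksum, and otherwise the one's complement sum over the pseudo-header plus the whole datagram should come out to 0xFFFF. The function names are hypothetical. It assumes IPv4 and a complete, already reassembled UDP datagram (header plus data, no trailing padding), which matters because large NFS datagrams are normally IP-fragmented on the wire.

```python
import socket
import struct


def ones_complement_sum(data):
    """Return the 16-bit one's complement sum of data, zero-padded to even length."""
    if len(data) % 2:
        data += b"\x00"
    total = 0
    for (word,) in struct.iter_unpack("!H", data):
        total += word
    # Fold any carries back into the low 16 bits.
    while total >> 16:
        total = (total & 0xFFFF) + (total >> 16)
    return total


def udp_checksum_status(src_ip, dst_ip, udp_datagram):
    """Classify a UDP datagram's checksum as 'disabled', 'ok', or 'bad'.

    src_ip and dst_ip are dotted-quad IPv4 strings taken from the IP header.
    udp_datagram is the UDP header plus data as raw bytes.
    """
    if len(udp_datagram) < 8:
        raise ValueError("UDP header is 8 bytes; datagram is too short")
    (field,) = struct.unpack("!H", udp_datagram[6:8])
    if field == 0:
        return "disabled"  # sender did not generate a checksum
    # Pseudo-header: source, destination, zero byte, protocol 17, UDP length.
    pseudo = (socket.inet_aton(src_ip) + socket.inet_aton(dst_ip)
              + struct.pack("!BBH", 0, 17, len(udp_datagram)))
    total = ones_complement_sum(pseudo + udp_datagram)
    return "ok" if total == 0xFFFF else "bad"
```

With checksums disabled every NFS datagram would report 'disabled', so this only becomes a useful test once the checksum change is in place. At that point, comparing results from captures taken on each side of a router might show where a datagram goes bad.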
Finally, can someone point me to some literature that will show me what an NFS/NIS packet is supposed to look like? Some books on NFS might also be helpful.