NetNews Usenet Archive 1993 #1

home *** CD-ROM | disk | FTP | other *** search

/ NetNews Usenet Archive 1993 #1 / NN_1993_1.iso / spool / comp / arch / 12234 < prev next >

Wrap

Text File | 1993-01-11 | 5.5 KB | 137 lines

Newsgroups: comp.arch Path: sparky!uunet!haven.umd.edu!darwin.sura.net!zaphod.mps.ohio-state.edu!cs.utexas.edu!wotan.compaq.com!twisto.eng.hou.compaq.com!croatia.eng.hou.compaq.com!leigh From: leigh@croatia.eng.hou.compaq.com (Kevin Leigh) Subject: re: COMPAQ PROPOSED SCALABLE I/O ARCHITECTURE Message-ID: <1993Jan11.225229.16027@twisto.eng.hou.compaq.com> Sender: news@twisto.eng.hou.compaq.com (Netnews Account) Organization: Compaq Computer Corp. Date: Mon, 11 Jan 1993 22:52:29 GMT Lines: 126 ********************************************************* On what I've said in my previous mails there are two words I need to explain - "endianess" and "coherency bag". ********************************************************* ENDIANESS In comp.arch, David James (dvj@apple.com) and Glen Stone (stone@apple.com) wrote: > ... > So, with this background, please reconsider your > "no-endian-order" statement. > ... We wanted to say "endian independent" NOT "no-endian-order". I agree that we didn't make it clear and thanks for pointing it out. I will be careful on this word. Specifically... Making the data path 8-bit wide and by sequencing the bytes temporaly in ascending address order, one does not worry about spatial positioning of the bytes on the transfer path. The registers within Anet ports bi-endian with endianess established at system initialization. Yes, the most significant address byte starts first in the packet making Anet channel protocol to be big-addressan. COHERENCY BAG In comp.arch, I wrote: > QuickRing is Apple's highspeed I/O solution and it > is a derivative of SCI. Our rational behind Anet > is very much in-line with SCI guys', except we > wanted a solution that will be > A. for I/O (we don't want to carry the coherency bag) ^^^^^^^^^^^^^^^^^ And Haakon O. Bugge (Dolphin SCI Tech. A.S.) responded: > You can certainly use SCI without any coherency. You > can perform read/write/lock 1-16 bytes, or read/write > of 64-byte entities. None of these have anything to > do with coherency. The addition length of the 64-byte > response packet (due to forwId, backId, stat.cStat) is > 6.25%. > Based on these facts, I would very much like to know > what you mean with "the coherency bag". Also, in comp.arch, David B. Gustavson (SLAC) responded: > 1) Coherency in SCI costs you aobut 2 bits in the packet > header, to allow for more command codes. Otherwise, it > costs you nothing if you don't use it. I was trying to refer to (a) the overhead in the node controller logic to deal with the command codes and the node buffers, and (b) in the transfer protocol. The overhead that I will elaborate below is not only due to cache/coherency but it is one of the major contributors, the others being due to the MP issues (starvation, deadlock, etc.) I am responding specifically to those comments on SCI. The following responses may not apply to other SCI derivatives. Command Codes The command codes are designed to do many things, coherency, broadcast, address range, data symbol length. Currently, 10 commands are reserved for cache related transactions. Node Buffers A SCI packet can hold 16-, 64- or 256-byte data. A send packet has at least 16-byte overhead (address, command, CRC, etc.). The minimum packet data size for each coherent node is 64 bytes (cache line size). Hence the FIFOs and queues need to be at least 80-byte long. If a non-coherent node is to work together with coherent nodes in a SCI topology, it's FIFOs need to be at least 80-byte long. Cache-to-cache (cread00, cread64, cwrite64) and extended coherent memory read (mread00, mread64) transactions have an extended header, which adds another 16-bytes to the FIFOs. There may be several of these FIFOs and queues. That's why SCI chips have "70K-100K gates, most of which are used to implement fast 2-port memory cells!". (An Anet port requires only a few thousand gates.) Protocol Efficiency Yes, the "unused symbol overhead" (USO - I made this one up) for the coherent transaction status (stat.cStat, fid and bid) is 5-byte long and it takes only 6.25% for 64- byte data transfers using a 64-byte command. But, - for 16-byte non-cached data transfers using a 16-byte command, the USO will be 15.63%. The total USO can go up even more (a lot more) if there are "dead" data symbols. For example, - for 34-byte data transfers using three 16-byte commands or one 64-byte command the USO will be 30.21% or 43.75%, respectively, and - for 2-byte data transfers using a 16-byte command the USO will be 59.38%. Note that USO does not take into account the other overheads associated with each packet, such as, targetId, command, sourceId, control and CRC (and idle symbols). What would be the data efficiency if all the overheads are considered? ********************************************************* This is not Compaq's announcement or specification. This is only an example case to point out the cost and performance effectiveness of Anet as an I/O interface. ********************************************************* Assume we are designing an Ethernet NIC for a chosen I/O subsystem. What will be the (IC or card) cost, if the I/O protocol is Anet, SCI, QuickRing, RamLink and other SCI derivatives? [We can do an Anet Ethernet NIC for much cheaper than any existing wide-bus implementation (including ISA). This is just one of the many places we can shave cost by using Anet and offer reasonably high performance on the interface. Anet card form factor can even smaller than PCMCIA if we so desired.]