GEMini Atari

home *** CD-ROM | disk | FTP | other *** search

/ GEMini Atari / GEMini_Atari_CD-ROM_Walnut_Creek_December_1993.iso / files / gnu / libsrc87 / _addsubd.cpp next >

Wrap

Text File | 1993-07-30 | 6.7 KB | 300 lines

| | double floating point add/subtract routine | #ifndef __M68881__ .text .even .globl __subdf3, ___subdf3 .globl __adddf3, ___adddf3 # ifndef sfp004 | | written by Kai-Uwe Bloem (I5110401@dbstu1.bitnet). | Based on a 80x86 floating point packet from comp.os.minix, written by P.Housel | patched by Olaf Flebbe (flebbe@tat.physik.uni-tuebingen.de) | | Revision 1.3.4 olaf 11-92 : | + added support for NaN and infinities | > floating point is now excellent! | | -- still lacks trap handling for exceptions | -- dont know the external representation of quiet and signaling NaN | I decided 0x7fffffff,ffffffff to be a quiet NaN | the rest should be signaling (but isnt) | | Revision 1.3.3 olaf 11-92 : | + changed to get rid of rounding bits. a sticky register (d3) is | sufficient. | | Revision 1.3.2 olaf 10-92 : | + increased comparson by one again. (Dont understand, but it works) | + corrected negation of rounding bits and mantissa | >enquire now detects correct IEEE precision | >paranoia now qualifies add/sub as correctly rounded | | Revision 1.3.1 olaf 10-92 : | + increased comparison of exponents by one. | + initialized sticky byte | + corrected handling of rounding bits | >paranoia now detects no SERIOUS DEFECTS any more | ** Patches need _normdf Rev 1.6.1 (or higher) ** | | Revision 1.3, kub 01-90 : | added support for denormalized numbers | | Revision 1.2, kub 01-90 : | replace far shifts by swaps to gain speed | | Revision 1.1, kub 12-89 : | Ported over to 68k assembler | | Revision 1.0: | original 8088 code from P.S.Housel __subdf3: ___subdf3: eorb #0x80,sp@(12) | reverse sign of v __adddf3: ___adddf3: lea sp@(4),a0 | pointer to u and v parameter moveml d2-d7,sp@- | save registers moveml a0@,d4-d5/d6-d7 | d4-d5 = v, d6-d7 = u movel d6,d0 | d0 = u.exp swap d0 movel d6,d2 | d2.h = u.sign movew d0,d2 lsrw #4,d0 andw #0x07ff,d0 | kill sign bit movel d4,d1 | d1 = v.exp swap d1 eorw d1,d2 | d2.l = u.sign ^ v.sign lsrw #4,d1 andw #0x07ff,d1 | kill sign bit andl #0x0fffff,d6 | remove exponent from u.mantissa andl #0x0fffff,d4 | remove exponent from v.mantissa | | Now perform testing of NaN And infinities | cmpw #0x07ff,d0 beq 0f cmpw #0x07ff,d1 bne nospec bra 1f | first operand is special | 0: cmpw d0,d1 beq bothspec | | u is special | movel d7,d0 orl d6,d0 bne retnan | arith with Nan gives always Nan movel a0@(8),d0 | copy infinity clrl d1 bra return | | v is special | 1: movel d5,d0 orl d4,d0 bne retnan movel a0@,d0 clrl d1 bra return | | u and v are both special | bothspec: movel d7,d0 orl d6,d0 orl d5,d0 orl d4,d0 beq bothinf bra retnan | | Both are infinities Test if cancellation | bothinf: tstw d2 bpl retinf | | return a quiet NaN | retnan: movel #0x7fffffff,d0 moveql #-1,d1 bra return retinf: movel #0x7ff00000,d0 clrl d1 tstl d2 bpl return bchg #31,d0 return: moveml sp@+,d2-d7 rts | | Ok, no inifinty or Nan involved.. | nospec: tstw d0 | check for zero exponent - no leading "1" beq 0f orl #0x100000,d6 | restore implied leading "1" bra 1f 0: addw #1,d0 | "normalize" exponent 1: tstw d1 | check for zero exponent - no leading "1" beq 0f orl #0x100000,d4 | restore implied leading "1" bra 1f 0: addw #1,d1 | "normalize" exponent 1: clrl d3 | init sticky register negw d1 | d1 = u.exp - v.exp addw d0,d1 beq 5f | exponents are equal - no shifting neccessary bgt 1f | not equal but no exchange neccessary exg d4,d6 | exchange u and v exg d5,d7 subw d1,d0 | d0 = u.exp - (u.exp - v.exp) = v.exp negw d1 tstw d2 | d2.h = u.sign ^ (u.sign ^ v.sign) = v.sign bpl 1f bchg #31,d2 1: cmpw #55,d1 | is u so much bigger that v is not bge 7f | significant ? | | shift mantissa left two digits, to allow cancellation of | most significant digit, while gaining an additional digit for | rounding. | moveql #1,d3 2: addl d7,d7 addxl d6,d6 subw #1,d0 | decrement exponent subw #1,d1 | decrement counter dbeq d3,2b clrl d3 | | now shift other mantissa right as fast as possible (almost). | 3: cmpw #16,d1 | see if fast rotate possible blt 4f orw d5,d3 | set sticky word movew d4,d5 | rotate by swapping register halfs swap d5 clrw d4 swap d4 subw #16,d1 bra 3b 0: moveb d5,d2 | use d2.b as scratch andb #1,d2 | test if 1 is shifted out orb d2,d3 | and put it in sticky lsrl #1,d4 | shift v.mant right the rest of the way roxrl #1,d5 | to line it up with u.mant 4: dbra d1,0b | loop 5: tstw d2 | are the signs equal ? bpl 6f | yes, no negate necessary | | negate secand mantissa. One has to check the sticky word in order | to correct the twos complement. | tstw d3 | beq 9f | No cerrection necessary clrl d1 addql #1,d5 addxl d1,d4 9: negl d5 negxl d4 6: addl d5,d7 | u.mant = u.mant + v.mant addxl d4,d6 bcs 7f | need not negate tstw d2 | opposite signs ? bpl 7f | do not need to negate result negl d7 negxl d6 notl d2 | switch sign 7: movel d6,d4 | move result for normalization movel d7,d5 clrl d1 tstl d3 beq 8f moveql #-1,d1 8: swap d2 | put sign into d2 (exponent is in d0) jmp norm_df | leave registers on stack for norm_df #else sfp004 | double precision floating point stuff for Atari-gcc using the SFP004 | developed with gas | | double floating point add/subtract routine | | M. Ritzert (mjr at dmzrzu71) | | 4.10.1990 | | no NAN checking implemented since the 68881 treats this situation "correct", | i.e. according to IEEE | addresses of the 68881 data port. This choice is fastest when much data is | transferred between the two processors. comm = -6 resp = -16 zahl = 0 | waiting loop ... | | wait: | ww: cmpiw #0x8900,a0@(resp) | beq ww | is coded directly by | .long 0x0c688900, 0xfff067f8 __subdf3: ___subdf3: | double precision subtraction | sub second arg from fp0 lea 0xfffffa50:w,a0 movew #0x5400,a0@(comm) | load first argument to fp0 cmpiw #0x8900,a0@(resp) | check movel a7@(4),a0@ movel a7@(8),a0@ movew #0x5428,a0@(comm) .long 0x0c688900, 0xfff067f8 movel a7@(12),a0@ movel a7@(16),a0@ movew #0x7400,a0@(comm) | result to d0/d1 .long 0x0c688900, 0xfff067f8 movel a0@,d0 movel a0@,d1 rts __adddf3: ___adddf3: lea 0xfffffa50:w,a0 movew #0x5400,a0@(comm) | load fp0 cmpiw #0x8900,a0@(resp) | got it? movel a7@(4),a0@ | take a hi from stack to FPU movel a7@(8),a0@ | take a lo from stack to FPU movew #0x5422,a0@(comm) | add second arg to fp0 .long 0x0c688900, 0xfff067f8 movel a7@(12),a0@ | move b hi from stack to FPU movel a7@(16),a0@ | move b lo from stack to FPU movew #0x7400,a0@(comm) | result to d0/d1 .long 0x0c688900, 0xfff067f8 movel a0@,d0 | download result movel a0@,d1 | download result rts #endif sfp004 #endif __M68881__