NetNews Offline 2

home *** CD-ROM | disk | FTP | other *** search

/ NetNews Offline 2 / NetNews Offline Volume 2.iso / news / comp / sys / amiga / programmer / 6849 < prev next >

Wrap

Internet Message Format | 1996-08-05 | 4.6 KB

Path: nntp.teleport.com!sschaem From: sschaem@teleport.com (Stephan Schaem) Newsgroups: comp.sys.amiga.programmer,comp.sys.amiga.games,alt.sys.amiga.demos,comp.sys.amiga.misc Subject: Re: AB3D II beats Quake.... Followup-To: comp.sys.amiga.programmer,comp.sys.amiga.games,alt.sys.amiga.demos,comp.sys.amiga.misc Date: 3 Apr 1996 21:13:34 GMT Organization: Teleport - Portland's Public Access (503) 220-1016 Message-ID: <4juplu$n44@nadine.teleport.com> References: <DosA8o.17B@info.uucp> <6538.6658T1031T2526@mbox.vol.it> <4jgn4o$l2b@hell.team17.com> <4jk515$q3l@nadine.teleport.com> <4jkdpu$1o6@hell.team17.com> <4joj14$ghj@nadine.teleport.com> <4js275$98s@hell.team17.com> <4jt8bh$a70@nadine.teleport.com> <4ju1qf$k4@hell.team17.com> NNTP-Posting-Host: kelly.teleport.com X-Newsreader: TIN [version 1.2 PL2] Stefan Boberg (boberg@team17.com) wrote: : sschaem@teleport.com (Stephan Schaem) wrote: : >Stefan Boberg (boberg@team17.com) wrote: : >: > I think they should do more then just the scan conversion... : >: Such as? : > Clipping, geometry transform, bsp tree traversal, visibility test : BSP tree traversal? I have to disagree. That's the type of thing : compilers are good at optimizing. : Clipping/geometry transform: Possibly, although as far as I can see : from my experiments, the gain is quite small. If quake support a lores mode, it become more and more imporant. the rasterization time can drop 400%, this mean other loop now weigh that much more. This can be true if you include basic 3d object in your scene... like quake does with shield and else. little rendering for the transformation involved, so optimizing the tranformation might be a good idea. Look at a game like "into the shadows" , now with the number of vertices its defenetly worth getting a few cycles out of the matrix operations. : >: I'm not sure about that. In DOOM, very little was in ASM. : > This just mean Doom pass most of its time in very little code. : So does any game. I mean VERY little code, like basicly 80% of the time in a few hundreds line routine :) : >: Yes you can do that. You simply use in-line assembly macros for : >: fixed-point multiplies / divides etc. I have such a system and it : >: works fine. Just change a switch (such as '-DFIXED_MATH') on the : >: command line and all code will use fixed-point. : > Do you get good performance from this? : Yes. Watcom is very good at allocating registers for inline ASM. : BTW, Id used the same approach in Doom. You also need to change your structures? or do you do unions and keep 64bit or more alocated per variable? How about file format? : > Also I often find myself : > otimizjng the fix point setting. not alway 16:16 or 8:8, but maybe : > 8:24, 4:28 ... : You can do this with macros too, if you really need to. It gets less : manageable though. To tell you the truth I dont like macro or conditionaly compliled/assembled code for major purposes. : >: It's not bad practice. It's clever - on the Pentium, at least. Can : >: you figure out why it's done? : > Nope... because the scanline loop use no MUL. and FPU add are slower : > and also you cant use FPU register directly as index or else : > so FPU make sense in the setup when fmul are faster then imul and you : > dont need to convert from ftoi all the time, otherwise not. : You've missed it completely. You can use floats in scan conversion, : and it's faster than fixed-point on the Pentium (about twice as fast, : as a matter of fact). I'm sure if you just think about it, you'll see : why. But you're right, it's NOT in the core pixel-drawing loop. I use float in the rasterizing stage on pentium... like I said fmul are faster then imul. : >: Taking polygon structures (vertices and edges) and turning them into : >: pixels. : > does the edge include slope information? if so no reason to see a float : > operation in the 'scan-conversion' loop. : No. In my case, an edge is a reference to two vertices. It could : include slope information though, as a further optimization. Then it make sense to get slope information via float on pentium... But do you think that this is why its so slow on 486? How much speedup do you get by using your fixedpoint macro VS float on a 486 in that rendering function? : I should add that I focus most of my attention on : perspective-correct texture mapping. But the same optimization : strategies (involving floats) can be applied to affine tmapping. I use affine tmapping in my perspective correct texture mapping... I tryed 7 variant, from 3 class out of 6 of perspective correct rendering methode.... I choose one based on affine tmapping. Stephan