Mail Archives: pgcc/1998/01/23/19:58:54

X-POP3-Rcpt: mlehmann AT universe DOT sgh-net DOT de
Message-ID: <XFMail.980123195854.fortunato@heavymetal.org>
X-Mailer: XFMail 1.3-alpha-011998 [p0] on Linux
X-Priority: 3 (Normal)
MIME-Version: 1.0
In-Reply-To: <Pine.LNX.3.96.980124001554.4337A-100000@goliath.csn.tu-chemnitz.de>
Date: 23 Jan 1998 19:58:54 -0500 (EST)
From: Scott Lampert <fortunato AT heavymetal DOT org>
To: Ronald Wahl <Ronald DOT Wahl AT Informatik DOT TU-Chemnitz DOT DE>
Subject: Re: Status of AMD K6 Support
Cc: beastium-list AT Desk DOT nl, Steve Bergman <steve AT netplus DOT net>
Sender: Marc Lehmann <pcg AT goof DOT com>
Status: RO
Lines: 34

On 23-Jan-98 Ronald Wahl wrote:
> Try -funroll-loops or -funroll-all-loops. This may give a speedup or a
> slowdown. Maybe you should also check the latest snapshots. But note:
> the code generated by these two options is still broken in some cases.
> You could also try to optimize the assembler routines in Mesa. Try to
> minimize the FPU instructions (e.g. remove the fxch instructions and
> adapt the code) and use integer code in parallel with FPU code. You
> will find further recommendations in the optimization guide located
> here: http://www.amd.com/K6/k6docs.

        From what I understand, something on the order of 95% of the
strength-reduction optimizations are currently disabled because of bugs
in all current versions of egcs.  Without strength reduction,
-funroll-loops and -funroll-all-loops are more likely to generate slower
code, and in practice that seems to be the case.
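
        To make the strength-reduction point concrete, here is a minimal
sketch in C of what the optimization does to a strided loop, and what an
unrolled loop looks like once it has been applied. The function names are
made up for illustration; this is hand-written source, not egcs output,
and it makes no claim about what any particular egcs snapshot generates.

```c
#include <stddef.h>

/* Naive form: every iteration pays for a multiply to form the index. */
long sum_column_mul(const long *m, size_t rows, size_t stride)
{
    long sum = 0;
    size_t i;

    for (i = 0; i < rows; i++)
        sum += m[i * stride];
    return sum;
}

/* Strength-reduced form: the multiply becomes a running addition. */
long sum_column_add(const long *m, size_t rows, size_t stride)
{
    long sum = 0;
    size_t i, off = 0;

    for (i = 0; i < rows; i++) {
        sum += m[off];
        off += stride;
    }
    return sum;
}

/* Unrolled by two on top of the strength-reduced form.  Unrolling the
 * naive form instead would simply duplicate the multiply in each copy
 * of the loop body. */
long sum_column_unrolled(const long *m, size_t rows, size_t stride)
{
    long sum = 0;
    size_t i = 0, off = 0;

    for (; i + 2 <= rows; i += 2) {
        sum += m[off];
        sum += m[off + stride];
        off += 2 * stride;
    }
    if (i < rows)               /* leftover element when rows is odd */
        sum += m[off];
    return sum;
}
```

All three functions return the same sum; only the amount of index
arithmetic per element differs.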
 
> ... but you will get the best optimization with a 3D accelerator by
> 3dfx, since these chips are supported by Mesa.

        That is true by a wide margin. However, 3dfx cards use the main
CPU for all triangle setup, so optimizing floating-point operations can
increase performance even further. That is why 3dfx performs much better
with a Pentium or AMD than with a Cyrix.
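
        For a rough idea of the floating-point work involved in triangle
setup, here is a minimal sketch in C that computes the depth gradients of
one triangle. The Vertex layout and the triangle_setup name are
assumptions made for illustration; this is not code from Mesa or from the
3dfx libraries.

```c
/* Hypothetical vertex layout: screen-space x, y and depth z. */
typedef struct {
    float x, y, z;
} Vertex;

/* Compute dz/dx and dz/dy for the plane through a, b and c.
 * Returns 0 for a degenerate (zero-area) triangle, 1 otherwise. */
int triangle_setup(const Vertex *a, const Vertex *b, const Vertex *c,
                   float *dzdx, float *dzdy)
{
    float abx = b->x - a->x, aby = b->y - a->y;
    float acx = c->x - a->x, acy = c->y - a->y;
    float abz = b->z - a->z, acz = c->z - a->z;
    float area2 = abx * acy - acx * aby;    /* twice the signed area */
    float inv;

    if (area2 == 0.0f)
        return 0;

    inv = 1.0f / area2;                     /* one divide per triangle */
    *dzdx = (abz * acy - acz * aby) * inv;
    *dzdy = (acz * abx - abz * acx) * inv;
    return 1;
}
```

Every interpolated attribute (color, texture coordinates) needs the same
kind of subtract, multiply and divide work per triangle, which is why the
host FPU matters even with an accelerator doing the rasterization.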
                -Scott

---
Scott Lampert             | Home Page: http://www.heavymetal.org
fortunato AT heavymetal DOT org  | PGP Key: finger fortunato AT heavymetal DOT org
"Sing the Hare Hare,      |_________________________________________
   Dance the Hoochie Koo"

