www.delorie.com/archives/browse.cgi   search  
Mail Archives: djgpp/1997/02/25/06:08:03

From: nikki AT gameboutique DOT co (nikki)
Newsgroups: comp.os.msdos.djgpp
Subject: Re: memcpy(); is there something faster?
Date: 25 Feb 1997 09:35:14 GMT
Organization: GameBoutique Ltd.
Lines: 21
Message-ID: <5euboi$296@flex.uunet.pipex.com>
References: <59g08k$758_001 AT cpe DOT Maroochydore DOT aone DOT net DOT au>
<32be2c51 DOT 87056746 AT nntp DOT southeast DOT net>
<5eq27n$uaf$1 AT doffen DOT uninett DOT no>
NNTP-Posting-Host: www.gameboutique.com
Mime-Version: 1.0
To: djgpp AT delorie DOT com
DJ-Gateway: from newsgroup comp.os.msdos.djgpp

> Actually... The fpu memcpy and MMX memcpy which both copy 8 bytes at a time
> are basically the same. They're both using the same processor and fpu stack.
> 
> hmm.. gotta learn djgpp style asm so I can port it to djgpp..

if y'all want an fpu memcopy in inline asm for djgpp i wrote one here :)
sadly it's not as good as the old rep movsd though :( basically fpu memcopy
will outperform anything given that you can guarentee none of the 8bytes you
are moving will cause an fpu exception. if they do you have to do it the
'safe' way which gives you a transfer rate of 16bytes/16cycles and 1/2 as
many cache write misses. if it's 686 or higher you get 1/4 write cache misses.
sadly, if it's 686 or higher the rep movsd will go faster ;(
basically fpu memcopy is not all it's cracked up to be.

regards,
nik


-- 
Graham Tootell           
nikki AT gameboutique DOT com  

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019