[XviD-devel] Request: optimized version of image_setedges

Christoph Lampert xvid-devel@xvid.org
Sun, 7 Jul 2002 12:24:38 +0200 (CEST)


Hi,

I just saw in profiling SMP that  image_setedges()  is one
of the slowest parts in XviD now. I doubt that this is needed!

I guess the reason is many loops and many calls to library functions
memcpy/memset for very small memory blocks of 32 or even 16 bytes, which
could be done by loop onrolling or MMX-copy much faster. 

There could be a fixed "copy 16 bytes by MMX" inlined function and
something tricky for memset(), too. However, I don't know enough
MMX/assembler for that.

Anyone else?

Christoph 

-- 
Christoph H. Lampert chl@math.uni-bonn.de | Diese Signature wurde maschi-     
Beringstr. 6, Raum 14 Tel. (0228) 73-2948 | nell erstellt und bedarf
Sprechstunden: keine, aber meistens da    | keiner Unterschrift. AZ 27B-6