[XviD-devel] [PATCH] Some patches on the queue for CVS
Edouard Gomez
ed.gomez at free.fr
Sun Aug 1 15:50:45 CEST 2004
Hey hey,
Here is a list of patches that I committed to my branch but
that are waiting a bit before being pushed to CVS, because
someone here may object.
These patches aim at:
- having faster decoding of halfpel bvops on SIMD
platforms (C runs at ~the same speed as before)
Speedup is around 12% to 15%
- having only one code path for qpel using skal's framework
even for C (this obsoletes part of the PPC port which will
now use more C code again). This patch requires skal's
approval to license his code under GPL (I'm waiting for his
answer)
- use the same trick for qpel bvops as for halfpel bvops...
The speedup is not so great, read my comment in the
changelog entry.
For more details about the patches, you can visit
http://archzoom.sourcecontrol.net/demo/ed.gomez@free.fr--2004-1/xvidcore--head--0.0
The patches first need to be mirrored by the sourcecontrol.net
daemon, so you may have to wait a bit.
I'm open to any suggestion/objection.
-----------------------8<-------------------------------------
2004-08-01 13:38:36 GMT patch-67
Summary:
Faster bframe decoding (qpel this time)
Revision:
xvidcore--head--0.0--patch-67
From ed.gomez:
* Used the same trick as for halfpel bvops, merge backward
interpolation and dst averaging steps.
NB: I'm currently not able to say if it's a real speedup or not
because my linux kernel uses a process scheduler that gives great
variance to results... so far I'm sure this isn't a slowdown
either for C or for ia32 SIMD.
modified files:
src/decoder.c src/image/qpel.c src/image/qpel.h
2004-08-01 11:24:07 GMT patch-66
Summary:
Unified qpel code path for all platforms
Revision:
xvidcore--head--0.0--patch-66
From ed.gomez and skal:
* Unified qpel code path for all platforms. Next step is
to fully exploit this code path to speed up qpel bframe
decoding
NB: this also makes life easier for ports, as they no longer
need to port obsolete function sets...
modified files:
src/decoder.c src/image/interpolate8x8.h src/image/qpel.c
src/image/qpel.h src/motion/motion_comp.c
2004-07-31 15:08:19 GMT patch-65
Summary:
Faster bframe decoding.
Revision:
xvidcore--head--0.0--patch-65
From ed.gomez and skal:
* Faster direct/interpolated bvop blocks decoding for halfpel
sequences. The trick is to compute and average directly with
destination during one of the forward/backward interpolations.
At the moment, this patch covers only halfpel decoding; the
same trick is expected to be adapted for qpel.
modified files:
src/decoder.c src/image/interpolate8x8.c
src/image/interpolate8x8.h
src/image/x86_asm/interpolate8x8_mmx.asm
src/image/x86_asm/interpolate8x8_xmm.asm src/xvid.c
-----------------------8<-------------------------------------
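To illustrate the trick from patch-65 and patch-67, here is a minimal
sketch in plain C. It is not the code from the patches: the function
names (halfpel_h, avg_block, halfpel_h_add), the 8x8 horizontal-only
case and the (a + b + 1) >> 1 final average are assumptions made for
the example. The point is only that the separate averaging pass over a
temporary buffer can be folded into the second interpolation and still
give bit-identical output.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

#define BLK    8   /* block width and height */
#define STRIDE 16  /* row stride; must be at least BLK + 1 */

/* Horizontal halfpel interpolation of one 8x8 block (hypothetical helper). */
static void halfpel_h(uint8_t *dst, const uint8_t *src, int stride, int rounding)
{
    assert(rounding == 0 || rounding == 1);
    for (int y = 0; y < BLK; y++)
        for (int x = 0; x < BLK; x++) {
            int a = src[y * stride + x];
            int b = src[y * stride + x + 1];
            dst[y * stride + x] = (uint8_t)((a + b + 1 - rounding) >> 1);
        }
}

/* Old final step: a separate pass averaging a temporary block into dst. */
static void avg_block(uint8_t *dst, const uint8_t *tmp, int stride)
{
    for (int y = 0; y < BLK; y++)
        for (int x = 0; x < BLK; x++)
            dst[y * stride + x] =
                (uint8_t)((dst[y * stride + x] + tmp[y * stride + x] + 1) >> 1);
}

/* Merged step: interpolate and average with what is already in dst,
 * so no temporary block and no extra pass are needed. */
static void halfpel_h_add(uint8_t *dst, const uint8_t *src, int stride, int rounding)
{
    assert(rounding == 0 || rounding == 1);
    for (int y = 0; y < BLK; y++)
        for (int x = 0; x < BLK; x++) {
            int a = src[y * stride + x];
            int b = src[y * stride + x + 1];
            int p = (a + b + 1 - rounding) >> 1;   /* interpolated pixel */
            dst[y * stride + x] = (uint8_t)((dst[y * stride + x] + p + 1) >> 1);
        }
}

/* Returns 1 if the merged path matches the two-pass path on sample data. */
int merged_matches_two_pass(int rounding)
{
    uint8_t fwd[STRIDE * BLK], bwd[STRIDE * BLK];
    uint8_t two_pass[STRIDE * BLK] = {0};
    uint8_t tmp[STRIDE * BLK] = {0};
    uint8_t merged[STRIDE * BLK] = {0};

    /* Deterministic stand-ins for the forward and backward references. */
    for (int i = 0; i < STRIDE * BLK; i++) {
        fwd[i] = (uint8_t)((i * 7 + 3) & 255);
        bwd[i] = (uint8_t)((i * 13 + 5) & 255);
    }

    /* Two-pass: forward into dst, backward into tmp, then average. */
    halfpel_h(two_pass, fwd, STRIDE, rounding);
    halfpel_h(tmp, bwd, STRIDE, rounding);
    avg_block(two_pass, tmp, STRIDE);

    /* Merged: forward into dst, backward averaged straight into dst. */
    halfpel_h(merged, fwd, STRIDE, rounding);
    halfpel_h_add(merged, bwd, STRIDE, rounding);

    return memcmp(two_pass, merged, sizeof merged) == 0;
}
```

Calling merged_matches_two_pass() with rounding 0 and 1 compares both
paths. The merged version saves one store and one reload of the
temporary block per pixel, which is presumably where SIMD code gains
more than C does.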
--
Edouard Gomez