[XviD-devel] Question about bvop decoding
Christoph Lampert
chl at math.uni-bonn.de
Mon Jul 19 21:51:02 CEST 2004
On Mon, 19 Jul 2004, Edouard Gomez wrote:
> Now we agree on numbers, is there something that can explain
> why a function that just calls other functions does spend 14%
> of CPU cycles ? Just in case you wonder, it's not cumulated
> CPU cycles, it's self CPU cycles.
Hi GomGom,
could this be an error in measurement/profiling? With gprof I get 1%-2% for
decoder_bf_interpolate_mbinter() if ASM is switched on. This was the case
for QCIF foreman, and also for 720p parkrun.
Since the stream was encoded with a low quantizer, the clearly dominating
routine is get_coeff(), followed by get_inter_block_h263().
     % cumulative     self               self    total
  time    seconds  seconds     calls  ms/call  ms/call  name
 33.97       0.53     0.53  12445358     0.00     0.00  get_coeff
 12.82       0.73     0.20    489109     0.00     0.00  get_inter_block_h263
  7.69       0.85     0.12                              idct_3dne
  7.05       0.96     0.11                              transfer8x8_copy_3dne
  7.05       1.07     0.11                              yv12_to_yv12_xmm
  3.85       1.13     0.06        17     3.53    29.34  decoder_bframe
  3.85       1.19     0.06                              interpolate8x8_halfpel_h_3dne
  3.21       1.24     0.05                              interpolate8x8_halfpel_v_3dne
  2.56       1.28     0.04    113085     0.00     0.01  decoder_mb_decode
  2.56       1.32     0.04     60449     0.00     0.01  decoder_mbinter
  2.56       1.36     0.04                              image_brightness_mmx
  2.56       1.40     0.04                              interpolate8x8_halfpel_hv_3dne
  1.92       1.43     0.03     54701     0.00     0.00  decoder_bf_interpolate_mbinter
  1.92       1.46     0.03        15     2.00    28.93  decoder_pframe
  1.28       1.48     0.02    489109     0.00     0.00  get_inter_matrix
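
For anyone who wants to cross-check such numbers, here is a minimal Python sketch (not part of XviD) that parses the flat-profile rows above and estimates how many profiler samples stand behind each self-time figure. The helper names (parse_flat_profile, estimate_total_seconds, samples_for) are made up for illustration, and the 0.01 s sample period is an assumption based on gprof's usual default, not something stated in this profile.

```python
# Flat-profile rows copied from the listing above (header omitted).
PROFILE = """\
33.97 0.53 0.53 12445358 0.00 0.00 get_coeff
12.82 0.73 0.20 489109 0.00 0.00 get_inter_block_h263
7.69 0.85 0.12 idct_3dne
7.05 0.96 0.11 transfer8x8_copy_3dne
7.05 1.07 0.11 yv12_to_yv12_xmm
3.85 1.13 0.06 17 3.53 29.34 decoder_bframe
3.85 1.19 0.06 interpolate8x8_halfpel_h_3dne
3.21 1.24 0.05 interpolate8x8_halfpel_v_3dne
2.56 1.28 0.04 113085 0.00 0.01 decoder_mb_decode
2.56 1.32 0.04 60449 0.00 0.01 decoder_mbinter
2.56 1.36 0.04 image_brightness_mmx
2.56 1.40 0.04 interpolate8x8_halfpel_hv_3dne
1.92 1.43 0.03 54701 0.00 0.00 decoder_bf_interpolate_mbinter
1.92 1.46 0.03 15 2.00 28.93 decoder_pframe
1.28 1.48 0.02 489109 0.00 0.00 get_inter_matrix
"""

# ASSUMPTION: gprof's common default of one sample per 0.01 s.
SAMPLE_PERIOD = 0.01


def parse_flat_profile(text):
    """Parse rows with either 4 fields (no call counts) or 7 fields."""
    rows = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) not in (4, 7):
            continue  # skip headers, blank lines, anything unexpected
        pct, _cumulative, self_s = (float(f) for f in fields[:3])
        # Routines built without -pg instrumentation (e.g. the ASM ones)
        # have no call count, so the calls column is missing.
        calls = int(fields[3]) if len(fields) == 7 else None
        rows.append({"name": fields[-1], "pct": pct,
                     "self_s": self_s, "calls": calls})
    return rows


def estimate_total_seconds(rows):
    """Total sampled time implied by the largest entry: self / (pct / 100)."""
    top = max(rows, key=lambda r: r["self_s"])
    return top["self_s"] / (top["pct"] / 100.0)


def samples_for(row, period=SAMPLE_PERIOD):
    """Approximate number of profiler samples behind a self-time value."""
    return round(row["self_s"] / period)


rows = parse_flat_profile(PROFILE)
total = estimate_total_seconds(rows)
print(f"implied total sampled time: {total:.2f} s")
for r in rows:
    print(f"{r['name']:<34} {r['pct']:6.2f}%  ~{samples_for(r)} samples")
```

Under that sample-period assumption, the whole run covers roughly 1.56 s of sampled time, and the 0.03 s attributed to decoder_bf_interpolate_mbinter corresponds to only about three samples, so small percentages from a short run like this are coarse and a longer run would give steadier figures.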
chl