[XviD-devel] Question about bvop decoding

Christoph Lampert chl at math.uni-bonn.de
Mon Jul 19 21:51:02 CEST 2004


On Mon, 19 Jul 2004, Edouard Gomez wrote:
> Now we agree on numbers, is there something that can explain
> why a function that just calls other functions does spend 14%
> of CPU cycles ? Just in case you wonder, it's not cumulated
> CPU cycles, it's self CPU cycles.

Hi GomGom,

could this be a error in measurement/profiling? With gprof I get 1%-2% for
decoder_bf_interpolate_mbinter() if ASM is switched on. This was the case
for QCIF foreman, and also for 720p parkrun.

Since the encode was a low quant, the clearly dominating routine 
is get_coeff(), follwoed by get_inter_block_h263. 

 33.97      0.53     0.53 12445358     0.00     0.00  get_coeff
 12.82      0.73     0.20   489109     0.00     0.00  get_inter_block_h263
  7.69      0.85     0.12                             idct_3dne
  7.05      0.96     0.11                             transfer8x8_copy_3dne
  7.05      1.07     0.11                             yv12_to_yv12_xmm
  3.85      1.13     0.06       17     3.53    29.34  decoder_bframe
  3.85      1.19     0.06                             interpolate8x8_halfpel_h_3dne
  3.21      1.24     0.05                             interpolate8x8_halfpel_v_3dne
  2.56      1.28     0.04   113085     0.00     0.01  decoder_mb_decode
  2.56      1.32     0.04    60449     0.00     0.01  decoder_mbinter
  2.56      1.36     0.04                             image_brightness_mmx
  2.56      1.40     0.04                             interpolate8x8_halfpel_hv_3dne
  1.92      1.43     0.03    54701     0.00     0.00  decoder_bf_interpolate_mbinter
  1.92      1.46     0.03       15     2.00    28.93  decoder_pframe
  1.28      1.48     0.02   489109     0.00     0.00  get_inter_matrix

chl



More information about the XviD-devel mailing list