[Ffmpeg-devel] Re: about mmx instructions

Chengji Zhao czhao
Fri Sep 2 01:42:17 CEST 2005


Hi, all
Firstly, I did rewrite that part of the code in SSE2 instructions and  I 
achieved about 20% performance gain on my Pentium 4 pc
for that part of the code.
Secondly, if you take a closer look at  the original implementation. you 
will find that the code can still be improved  in terms of  CPU resource
 utilization . For example , if you changing some of the movq 
instructions into pshufw xxx, xxx, 0xe4 ( which means use  shift  unit 
instead of
load unit), you can gain some improvement.

By the way, I had a hard time to understand that magic too. I finally 
gave up( but I did write a small code to verify that formula :)
I will try again this weekend.


-- 

Regards

Chengji Zhao

Mediaware International
Phone: 61-2-6163 8776





More information about the ffmpeg-devel mailing list