Quote:
Quote:
Quote:
BETA024:
No improvement for me. No surprise though.
Bojcha even reported that it got worse.
On which processor? If I'm remembering correctly that he has an X2, and it was on the X2, then what is also needed to be known is if the X2 is an "Athlon 64 X2", or if it is an "Athlon X2". The former is based on K8 (Hammer), while the latter is based on K10 (Barcelona), with the minor exception of a few very early X2s that were still Hammer-based.
Quote:
I inlined some code, maybe that wasn't a good idea - turned it off again now. Except for that I only removed code, so I really expect the next beta to be (very! slightly) faster.
I'll need to go back and get in the zone I was in while reading stuff last night, but I think inlining some things is suggested for AMD.
Page 155 of the following PDF lists potential bottlenecks for K8 and K10.
http://www.agner.org/optimize/microarchitecture.pdf
In that PDF is discussion about processors from the original Pentium (P1), clear on up to i5 and i7 (Sandy Bridge) on the Intel side, Athlon (K7) through Bulldozer and Bobcat (post-K10, called family 15h) on the AMD side, and also the VIA Nano 2000 and 3000 cores.
Quote:
For the compressor, I can optimize the code for certain situations (several sliders that might be useful make the code more complex, if I can assume them to be at 0 or at 1 I can remove several lines of code that are executed for each sample; will do that).
The bottleneck for my system does not reside in the code you're currently working on.

Can I have the debug file(s) for StereoTool so I can profile what's happening on my specific system?
I know what I'm asking for there, but as I've said, I have no desire to do anything other than figure out why my processor is choking so badly.
As for the pow() and log stuff, I would guess you're talking about the last link I gave you about the adjustable accuracy code?