I've updated the Windows and 32 bit Linux applications to mostly fix the very slow workunit problem. The problem ended up being in the math libraries, which affected the Windows applications as well as 32bit Linux. This should also fix the workunits failing on systems without SSE2 from the previous builds. I had built the BOINC libraries before with SSE2, since it was required for the N-body. However, since it is irrelevant if they use SSE2 or not, it only ended up polluting the non-SSE2 separation binaries. Non-SSE2 versions will still be quite a bit slower. The SSE2 builds (which also covers 64bit) should be in the range of 25-30% faster than the old 0.19s. The 32bit Linux version is now using crlibm, and will be slightly slower than it should be now as a quick way to get math using SSE2. The 64bit Linux + clang build is still the fastest, but only by < 10% now over the Windows + SSE2 version.

More...