status update on server errors



10-25-11, 05:11 PM
We essentially have the web site, scheduler, and validator running. The assimilator, perfecting, etc. are not running. The error rate has been 0 since we dropped the other daemons, watched the load drop, and restarted the machine. However, this is not to say that the problem(s) are resolved, as we are still testing whether our current theories are correct.

Martin, our host, and I have been exploring thermal and power issues in particular. (Martin found that the upload handler was writing 16 KiB chunks to disk; optimizing that helped the load but did not stop uploaded files from getting corrupted.) Also, the upload handler is the least changed part of our setup; until now it has been basically the untouched upstream BOINC code. At present the load is staying low and so are the CPU thermals.

Our I/O upgrade only changed the RAID controller and the drives, but 3 of the drives were running hotter than they should have been. This extra heat of course adds to the heat in the system.

We have the disk space to keep running in a minimal mode like we are now. Our host already has a heavier power supply and high-CFM fans ready. We are also planning some contingencies in case we still have problems at that point.

More... (http://boinc.freerainbowtables.com/distrrtgen/forum_thread.php?id=57)