I think I may know what's going on. Are your tasks being suspended at all? For example, do they suspend when you use the keyboard, run benchmarks, or reboot? These tasks seem to be very sensitive to that kind of thing. Mine sometimes error out when I just change the number of GPUs in the app_config.xml. Maybe if you avoid changing anything, or doing anything that causes the tasks to suspend, you can get them to complete?