For a while, the collection servers (CS's) haven't been working well, and we've been looking into why. A few weeks ago we overhauled the server-CS interaction and added more CS's to help with the load. We have continued to investigate the CS issues and have some new ideas.
Based on these new ideas, we've updated how the collection server works. We think this will significantly improve clients' ability to send back results. The updated code is running on two CS's right now (171.64.122.76 and 171.64.122.86), and we're going to watch to see whether it improves the situation. We have a few other tweaks that might help as well, but we want to try this one first.
[18:54:29] Folding@home Core Shutdown: FINISHED_UNIT
[18:54:32] CoreStatus = 64 (100)
[18:54:32] Sending work to server
[18:54:32] Project: 4732 (Run 3, Clone 25, Gen 74)
[18:54:32] - Read packet limit of 540015616... Set to 524286976.
[18:54:32] + Attempting to send results [July 31 18:54:32 UTC]
[18:54:53] - Couldn't send HTTP request to server
[18:54:53] + Could not connect to Work Server (results)
[18:54:53] (171.64.65.103:8080)
[18:54:53] + Retrying using alternative port
[18:55:14] - Couldn't send HTTP request to server
[18:55:14] + Could not connect to Work Server (results)
[18:55:14] (171.64.65.103:80)
[18:55:14] - Error: Could not transmit unit 08 (completed July 31) to work server.
[18:55:14] Keeping unit 08 in queue.
[18:55:14] Project: 4732 (Run 3, Clone 25, Gen 74)
[18:55:14] - Read packet limit of 540015616... Set to 524286976.
[18:55:14] + Attempting to send results [July 31 18:55:14 UTC]
[18:55:35] - Couldn't send HTTP request to server
[18:55:35] + Could not connect to Work Server (results)
[18:55:35] (171.64.65.103:8080)
[18:55:35] + Retrying using alternative port
[18:55:56] - Couldn't send HTTP request to server
[18:55:56] + Could not connect to Work Server (results)
[18:55:56] (171.64.65.103:80)
[18:55:56] - Error: Could not transmit unit 08 (completed July 31) to work server.
[18:55:56] - Read packet limit of 540015616... Set to 524286976.
[18:55:56] + Attempting to send results [July 31 18:55:56 UTC]
[18:55:56] - Couldn't send HTTP request to server
[18:55:56] (Got status 503)
[18:55:56] + Could not connect to Work Server (results)
[18:55:56] (171.64.122.76:8080)
[18:55:56] + Retrying using alternative port
[18:55:57] - Couldn't send HTTP request to server
[18:55:57] (Got status 503)
[18:55:57] + Could not connect to Work Server (results)
[18:55:57] (171.64.122.76:80)
[18:55:57] Could not transmit unit 08 to Collection server; keeping in queue.
[18:55:57] - Preparing to get new work unit...
[18:55:57] + Attempting to get work packet
[18:55:57] - Connecting to assignment server
[18:55:58] + No appropriate work server was available; will try again in a bit.
[18:55:58] + Couldn't get work instructions.
[18:55:58] - Attempt #1 to get work failed, and no other work to do.
Waiting before retry.
...
Posted by: alswieich | July 31, 2008 at 12:20 PM
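
A client log like the one above can be tallied by server and port to see which endpoints refused the results. The following is a minimal Python sketch and is not part of the Folding@home client. The function name `count_failed_connections` is hypothetical, and the parsing rule (the address line directly follows each "Could not connect to Work Server" line) is an assumption based only on the excerpt above.

```python
import re
from collections import Counter

# Matches a log line that holds only a server address, for example:
# "[18:54:53] (171.64.65.103:8080)"
ADDRESS_LINE = re.compile(
    r"^\[\d{2}:\d{2}:\d{2}\]\s+\((\d{1,3}(?:\.\d{1,3}){3}):(\d+)\)\s*$"
)


def count_failed_connections(log_text):
    """Count 'Could not connect' reports per (ip, port) pair.

    Assumption: the address appears on the line immediately after the
    'Could not connect to Work Server' message, as in the excerpt.
    """
    failures = Counter()
    expecting_address = False
    for line in log_text.splitlines():
        if "Could not connect to Work Server" in line:
            expecting_address = True
            continue
        match = ADDRESS_LINE.match(line.strip())
        if expecting_address and match:
            failures[(match.group(1), int(match.group(2)))] += 1
        expecting_address = False
    return failures


# A few lines copied from the log above, used as sample input.
SAMPLE = """\
[18:54:53] - Couldn't send HTTP request to server
[18:54:53] + Could not connect to Work Server (results)
[18:54:53] (171.64.65.103:8080)
[18:54:53] + Retrying using alternative port
[18:55:14] - Couldn't send HTTP request to server
[18:55:14] + Could not connect to Work Server (results)
[18:55:14] (171.64.65.103:80)
[18:55:56] - Couldn't send HTTP request to server
[18:55:56] (Got status 503)
[18:55:56] + Could not connect to Work Server (results)
[18:55:56] (171.64.122.76:8080)
"""

if __name__ == "__main__":
    for (ip, port), count in sorted(count_failed_connections(SAMPLE).items()):
        print(f"{ip}:{port} failed {count} time(s)")
```

Grouping by port as well as address keeps the 8080 attempt and the port 80 retry separate, which makes it easier to tell a server that is down from one that is answering with a 503.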