What's gotten better:
- Reliability has improved significantly due to a variety of debugging tools that have helped me find some bugs in DOVE
- Ease of use has improved due to creation of shell scripts for common tasks
- Flexibility has improved as the analyzer can now output the resulting data in two formats, with the new spreadsheet mode being particularly illuminating
- The analyzer also bins the datapoints into 100ms bins in the spreadsheet mode, which makes comparison across the servers (which have clock skew relative to one another) much easier.
What's still broken:
- The network still doesn't run successfully on certain computers, and I'm not sure why.
- Some of the computers on the network are really, REALLY underpowered: the algorithm, which is supposed to tick every 100ms, is actually ticking every second. This might be because each tick's processing takes too long, or because the networking calls block when the send buffer fills up.
- The analysis tool still can't quite do the processing I need
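The 100ms binning mentioned in the list above can be sketched roughly as follows. This is only an illustration of the idea, not the analyzer's actual code: the function name `bin_datapoints` and the `(timestamp_ms, value)` sample format are assumptions made for the example.

```python
from collections import defaultdict

BIN_MS = 100  # bin width, matching the 100ms bins described in the post


def bin_datapoints(samples, bin_ms=BIN_MS):
    """Group (timestamp_ms, value) samples into fixed-width time bins.

    Returns a dict mapping each bin's start time (in ms) to the list of
    values whose timestamps fall inside that bin.
    The sample format is hypothetical, chosen for illustration.
    """
    bins = defaultdict(list)
    for ts_ms, value in samples:
        # Round the timestamp down to the start of its bin.
        bin_start = (int(ts_ms) // bin_ms) * bin_ms
        bins[bin_start].append(value)
    return dict(bins)


example = bin_datapoints([(5, "a"), (99, "b"), (100, "c"), (250, "d")])
```

With every server's data keyed by the same bin boundaries, one spreadsheet row per bin lets the servers be compared side by side even when their individual timestamps don't match exactly.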
For next month:
- The network needs to run reliably, or I'm hosed. Find out what's going on with those dropped connections. Perhaps a non-zero exit could email me the output? Simpler: just save the output of every run into some output.txt for later retrieval.
- Identify the exact bottleneck behind the slow ticks by profiling a worker during execution.
- There's a bug where starting a run, stopping it, and then immediately starting another captures some of the peers that were still trying to connect from the previous run. Make sure a new run handles these leftover connection attempts correctly.
- Make the analysis tool spit out graphable data
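For the "save the output every time" item above, a minimal sketch of a wrapper might look like this. It assumes the worker is launched as an ordinary command; `run_and_save` is a hypothetical helper, not part of DOVE, and it only captures output and reports the exit code (the emailing idea is left out).

```python
import subprocess
import sys


def run_and_save(cmd, log_path="output.txt"):
    """Run cmd, write its combined stdout/stderr to log_path, return the exit code.

    cmd is a list such as ["./some_script.sh"]; the command itself is
    whatever launches a worker (hypothetical here).
    """
    with open(log_path, "w") as log:
        # stderr is merged into stdout so the log keeps everything in one place.
        result = subprocess.run(cmd, stdout=log, stderr=subprocess.STDOUT)
    return result.returncode
```

A caller can then check the return value: anything other than 0 means the run failed and output.txt is worth reading (or mailing).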