Tweaks to microbenchmarks; the one-stall-semaphore-per-thread changes have been backed...