dynamically generate flag expectations in tests

Many of the tests are currently statically generating the flags for expectations in BuildPlanTests.swift based upon #if os which changes behaviour based on the OS rather than the target. This can cause subtle bugs to arise. We should dynamically compute the arguments.