I'm trying to simplify a monitoring system we have.
It has a variety of views for looking at the CPU usage of a server including:
Average CPU usage (overall, this accounts for all cores).
Low and high core occupancy (the # of cores used over 20% or 70% at a given time)
We also have specific metrics on the usage of each individual core.
Core occupancy is useful because you might have 4 cores, 1 core at 100% and 3 cores at 0%. In this case, you can go down to having 1 or 2 cores without affecting your workload, where as the average CPU usage would just show 25% (useless).
Keeping all of these separate metrics is very taxing as we have thousands of servers reporting metrics multiple times a minute.
Is there a standard way people use to measure CPU usage that accounts for both total power and # of cores used (how well you're parallelizing your work)?
Edit: We're getting some awesome/helpful responses in terms of engineering an actual system. But let's focus on the general problem of "How can you score/make a single or small set of metrics to represent a computer's usage of it's CPU resources including its use parallelism?".