I've got a build server where dmesg is reporting that it is having to kill processes because it is running out of memory. As the system is running many builds and other processes concurrently, I need to figure out which process or processes are really using too much memory. i.e. I'm not convinced the process being killed is the one hogging the memory.
Ideally I'd like to dump the memory usage at the point when the out of memory killer kicks in, with a full command line of each process. Is there a way to do this? Alternatively, if I can't dump it at that specific point, I plan to set up a cron job to dump memory usage every minute or two, but I still need some help to get the correct output.
The output from smem is pretty good, but it truncates the command line:
PID User Command Swap USS PSS RSS 39090 user /usr/bin/Xvfb +extension RA 4732 144 148 264 20837 user -bash 0 780 1100 2144 21144 user python /usr/bin/smem 0 12120 12320 13248 19224 user /opt/atlassian/bamboo_home/ 0 234940 235303 237144 12414 user /usr/java/jdk1.8.0_121/bin/ 176128 2249180 2249338 2250428 Is there a way to tell smem to show the full command line? Alternatively a simple way of piping the output to show me what I need? I can pipe into xargs and ps to get the full command line like this:
smem -H -c "pid" | xargs ps However then I've lost the memory usage values from smem.
dmesg) contains the below information about every single process before OOM killer kicks in:[ pid ] uid tgid total_vm rss nr_ptes nr_pmds swapents oom_score_adj nameI believe you're interested in: - total_vm Virtual memory use - rss Resident memory use