This repository was archived by the owner on Sep 30, 2022. It is now read-only.

@jjhursey
Member

Bring timeout feature introduced in open-mpi/ompi#1724 to the 2.x series. Target is 2.1.0 per the discussion on the original PR.

This required a bit of a custom patch due to architecture differences between master and the 2.x series. I kept this as two commits to show what I changed. These should be squashed into one commit once reviewed.

Three options introduced:

  • --timeout
  • --get-stack-traces
  • --report-state-on-timeout
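
For illustration, here is a minimal sketch of how a test harness (such as a Jenkins job) might assemble a command line that uses all three options together. The helper name `build_mpirun_command`, the `launcher` default, and the example application arguments are hypothetical and not part of this PR; the sketch assumes `--timeout` takes a whole number of seconds, as the `mpirun` man page describes.

```python
import shlex


def build_mpirun_command(app_argv, timeout_s, stack_traces=True,
                         report_state=True, launcher="mpirun"):
    """Return an argv list that runs app_argv under the launcher with a timeout.

    Hypothetical helper for illustration only. It assumes --timeout takes a
    positive whole number of seconds.
    """
    if timeout_s <= 0:
        raise ValueError("timeout_s must be a positive number of seconds")

    cmd = [launcher, "--timeout", str(int(timeout_s))]
    if stack_traces:
        # Ask for stack traces of the application processes on timeout.
        cmd.append("--get-stack-traces")
    if report_state:
        # Ask for the run-time state of each process on timeout.
        cmd.append("--report-state-on-timeout")
    cmd.extend(app_argv)
    return cmd


if __name__ == "__main__":
    # "./my_test" is a placeholder application name.
    print(shlex.join(build_mpirun_command(["-np", "4", "./my_test"], 300)))
```

Both reporting options only make sense when paired with `--timeout`, which is why the sketch always emits the timeout first and treats the other two as optional add-ons.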

bot:assign: @rhc54
bot:milestone:v2.1.0
bot:label:enhancement

@rhc54

rhc54 commented Aug 16, 2016

👍

@jjhursey
Member Author

Would you prefer to have these squashed to 1 commit, or keep them as two (first original from master and second fixes to make it work for v2.x)?

@jsquyres
Member

@jjhursey Thanks for asking. I like the trackability of the 2 commits: one directly traceable back to master is a Good Thing.

@jjhursey
Member Author

Ok - That sounds fine to me. So we'll leave it as two commits. I think this PR is ready to go once the RMs are ready for it.

Ralph Castain and others added 2 commits August 25, 2016 09:12
… timeout to assist with debugging Jenkins tests

If requested, obtain stack traces for each application process and report them to stderr upon timeout

stack traces: minor improvements

  • Also include the hostname and PID of each process for which we're sending the stack traces (vs. just including the ORTE process name)
  • Send a specific error message if we couldn't find "gstack" in the $PATH (e.g., on OS X)
  • Send a specific error message if gstack fails to run
  • Print a message that obtaining the stack traces may take a few seconds so that users don't wonder what's happening

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>

help-orterun.txt: minor tweaks

Trivial update: show "--timeout" (instead of "-timeout") in the help message, just to encourage the use of double-dash options.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>

trivial: stacktrace -> stack trace

Trivial wordsmithing.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>

(cherry picked from commit ebe159a)

--
Josh Hursey: Had to remove 2 conflicts, which will be added back shortly

  • orte/orted/orted_submit.c
  • orte/orted/orted_submit.h
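
The stack trace behavior described in the commit message above can be sketched roughly as follows. This is an illustrative Python approximation, not the ORTE implementation (which is written in C): the function name `collect_stack_trace`, the header format, and the error message wording are all assumptions made for the example.

```python
import shutil
import socket
import subprocess


def collect_stack_trace(pid, tool="gstack", timeout_s=30):
    """Return a text report for one process, labeled with hostname and PID.

    Hypothetical sketch: the report layout and messages are invented here.
    """
    host = socket.gethostname()
    header = f"STACK TRACE FOR PROC (host {host}, PID {pid})"

    # Specific message when the tool is not in $PATH (e.g., on OS X).
    path = shutil.which(tool)
    if path is None:
        return f"{header}\n\tstack trace unavailable: '{tool}' not found in $PATH"

    try:
        result = subprocess.run([path, str(pid)], capture_output=True,
                                text=True, timeout=timeout_s)
    except (OSError, subprocess.TimeoutExpired) as exc:
        # Specific message when the tool could not be run to completion.
        return f"{header}\n\tstack trace unavailable: '{tool}' failed to run ({exc})"

    if result.returncode != 0:
        return (f"{header}\n\tstack trace unavailable: '{tool}' exited "
                f"with status {result.returncode}")
    return f"{header}\n{result.stdout}"


if __name__ == "__main__":
    import os
    # Tell the user up front, since attaching to processes can be slow.
    print("Obtaining stack traces; this may take a few seconds...")
    print(collect_stack_trace(os.getpid()))
```

Distinguishing "tool not found" from "tool failed" mirrors the two separate error messages listed in the commit, so a user on a platform without gstack gets an actionable explanation rather than an empty report.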
Three options introduced:

  • --timeout 2
  • --get-stack-traces
  • --report-state-on-timeout