@metalcycling
Collaborator

  • Fixed an issue where an AppWrapper was re-queued after it completed. At the end of a job, when all of its pods have completed, MCAD saw no 'running' pods and concluded that the job needed to be re-queued.
  • Also added a fix to prevent MCAD from deleting AppWrappers after the pods or the application are done.

Signed-off-by: Pedro D. Bello-Maldonado <metalcycling@gmail.com>
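The re-queue condition described above can be sketched in isolation. This is a minimal illustration, not MCAD's actual code: `appWrapperStatus`, `needsRequeueOld`, and `needsRequeue` are hypothetical names, and the struct only mirrors the `Running` and `Succeeded` counters that appear in this PR's diff.

```go
package main

import "fmt"

// appWrapperStatus is a hypothetical, trimmed-down stand-in for the
// AppWrapper status counters touched by this PR.
type appWrapperStatus struct {
	Running   int32
	Succeeded int32
}

// needsRequeueOld mirrors the pre-fix check: only running pods count,
// so a fully completed job looks like it has too few pods.
func needsRequeueOld(st appWrapperStatus, minAvailable int) bool {
	return st.Running < int32(minAvailable)
}

// needsRequeue mirrors the post-fix check: succeeded pods count
// toward the minimum as well.
func needsRequeue(st appWrapperStatus, minAvailable int) bool {
	return (st.Running + st.Succeeded) < int32(minAvailable)
}

func main() {
	// A job whose three pods have all completed.
	done := appWrapperStatus{Running: 0, Succeeded: 3}
	fmt.Println(needsRequeueOld(done, 3)) // true: the completed job would be re-queued
	fmt.Println(needsRequeue(done, 3))    // false: the completed job is left alone
}
```

With the old predicate, a finished job is indistinguishable from one that never got enough pods scheduled; counting succeeded pods removes that ambiguity.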

- if aw.Status.Running < int32(aw.Spec.SchedSpec.MinAvailable) {
+ if (aw.Status.Running + aw.Status.Succeeded) < int32(aw.Spec.SchedSpec.MinAvailable) {
      message = fmt.Sprintf("Insufficient number of Running pods, minimum=%d, running=%v.", aw.Spec.SchedSpec.MinAvailable, aw.Status.Running)
Member

Should we change the logger message as well? It should reflect both Running and Completed pods.
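One possible shape for such a message is sketched below. The helper `insufficientPodsMessage` and its wording are hypothetical, chosen only to illustrate the suggestion; the wording actually adopted in the follow-up commit is not shown in this conversation.

```go
package main

import "fmt"

// insufficientPodsMessage is a hypothetical helper that reports both the
// running and the completed (succeeded) pod counts against the minimum.
func insufficientPodsMessage(minAvailable int, running, succeeded int32) string {
	return fmt.Sprintf(
		"Insufficient number of Running and Completed pods, minimum=%d, running=%d, completed=%d.",
		minAvailable, running, succeeded)
}

func main() {
	fmt.Println(insufficientPodsMessage(3, 1, 1))
}
```

Reporting the two counters separately keeps the log line consistent with the condition that triggers it, which sums them.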

…leted pods Signed-off-by: Pedro D. Bello-Maldonado <metalcycling@gmail.com>
@asm582
Member

asm582 commented Dec 7, 2022

Thanks @metalcycling, LGTM!

@asm582 merged commit ec6cfb4 into project-codeflare:quota-management on Dec 7, 2022