You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It seems we're depending entirely on the agent to decide what task it's supposed to be doing right now. I think I more or less understand the idea that the content of the markdown files adhere to a schema that unambiguously determines what the next task should be. But if the agent doesn't read the files, or if it has an older copy in context, it could hallucinate and do the wrong task. Why don't we have code that's like find_next_task.py that scans the markdown files looking for unchecked boxes? And probably it should also be checking the git status because sometimes git commit or github pull request would block moving on to the next task. It can then point to the correct snippet of the protocol that describes the current task so that if the agent has for some reason neglected to read the protocol it will have another chance to do so.
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
It seems we're depending entirely on the agent to decide what task it's supposed to be doing right now. I think I more or less understand the idea that the content of the markdown files adhere to a schema that unambiguously determines what the next task should be. But if the agent doesn't read the files, or if it has an older copy in context, it could hallucinate and do the wrong task. Why don't we have code that's like find_next_task.py that scans the markdown files looking for unchecked boxes? And probably it should also be checking the git status because sometimes git commit or github pull request would block moving on to the next task. It can then point to the correct snippet of the protocol that describes the current task so that if the agent has for some reason neglected to read the protocol it will have another chance to do so.
Beta Was this translation helpful? Give feedback.
All reactions