Skip to content

feat: add WorkloadExecuting status to evaluation runs#1391

Open
mjnovice wants to merge 2 commits intomainfrom
feat/add-workload-executing-eval-status
Open

feat: add WorkloadExecuting status to evaluation runs#1391
mjnovice wants to merge 2 commits intomainfrom
feat/add-workload-executing-eval-status

Conversation

@mjnovice
Copy link
Contributor

@mjnovice mjnovice commented Feb 28, 2026

Summary

  • Adds WorkloadExecuting status to distinguish agent execution from evaluation/scoring in the eval run lifecycle
  • Eval runs are now created with WorkloadExecuting (4) status instead of IN_PROGRESS (1)
  • After agent execution completes, a new AGENT_EXECUTION_COMPLETED event transitions the run to IN_PROGRESS before evaluators execute
  • New flow: WorkloadExecuting → Running → Completed/Failed

Depends on https://github.com/UiPath/Agents/pull/4529

Changes

  • events.py: Add AGENT_EXECUTION_COMPLETED event type and AgentExecutionCompletedEvent model
  • runtime.py: Publish AGENT_EXECUTION_COMPLETED after agent finishes, before evaluators run
  • _progress_reporter.py: Add WORKLOAD_EXECUTING=4 to enum, create eval runs with new status, handle event to transition to IN_PROGRESS

Related

Test plan

  • Run evaluations end-to-end and verify eval run starts in WorkloadExecuting status
  • Verify status transitions to Running after agent execution completes
  • Verify status transitions to Completed/Failed after evaluators finish
  • Verify resume mode still works correctly
  • Run existing eval tests (pytest tests/cli/eval/)

🤖 Generated with Claude Code

Create eval runs with WorkloadExecuting status (4) instead of
IN_PROGRESS (1) to distinguish agent execution from evaluation/scoring.
After agent execution completes, transition to IN_PROGRESS before
running evaluators.

- Add WORKLOAD_EXECUTING=4 to EvaluationStatus enum
- Add AGENT_EXECUTION_COMPLETED event and AgentExecutionCompletedEvent model
- Create eval runs with WorkloadExecuting status in _create_eval_run_spec
- Publish AGENT_EXECUTION_COMPLETED after agent finishes in _execute_eval
- Handle event in StudioWebProgressReporter to update status to IN_PROGRESS

Flow: WorkloadExecuting (agent runs) → Running (evaluators run) → Completed/Failed

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@github-actions github-actions bot added test:uipath-langchain Triggers tests in the uipath-langchain-python repository test:uipath-llamaindex Triggers tests in the uipath-llamaindex-python repository labels Feb 28, 2026
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

test:uipath-langchain Triggers tests in the uipath-langchain-python repository test:uipath-llamaindex Triggers tests in the uipath-llamaindex-python repository

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant