Final evaluation

Did ShipRepo complete what you wanted?

Compare the original task with the actual result. Figures are aggregated across your last 30 days.

Completion (PRs created / attempts)
Expectation match (avg rating)
Tokens used (last 30 days): 0
Avg time (queue → done)

Quality breakdown

Failure reasons (last 30 days)

Recent tasks — rate the result

Your feedback tunes the agent over time.