
Recalibrate credit lines monthly across our 22,000-SMB portfolio.

Gated · credit-risk

Gate blocked the change. val_score 0.3333 didn’t beat the prior best, so the proposal was rejected. Best-ever stays 0.3333. The agent will try a different edit on the next iteration.
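The gate rule described above can be sketched as a strict-improvement check. This is a minimal illustration, not the tool's actual implementation: the names `GateResult` and `gate` are hypothetical, and the prior best of 0.3333 is an assumption inferred from "Best-ever stays 0.3333" (the page does not show the prior value).

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class GateResult:
    accepted: bool      # True if the proposal passes the gate
    best_after: float   # best-ever score once the gate has decided


def gate(val_score: float, prior_best: Optional[float]) -> GateResult:
    """Hypothetical gate: accept only if val_score strictly beats prior_best."""
    if prior_best is None or val_score > prior_best:
        return GateResult(accepted=True, best_after=val_score)
    # A tie or a regression is rejected and the best-ever score is unchanged.
    return GateResult(accepted=False, best_after=prior_best)


# Assumed prior best of 0.3333 (inferred, not shown on the page).
result = gate(val_score=0.3333, prior_best=0.3333)
print(result.accepted, result.best_after)
```

Under this assumption a tie counts as "didn't beat", which matches the rejection reported here.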
Best ever before
Best ever after
0.3333
Dominant cluster
failure pattern: false-negative
Cases
4 passed · 8 failed
Started
May 19, 2026, 04:29 AM
Ended
May 19, 2026, 04:29 AM

Failed · 8

Case | Predicted | Expected | Fold | Trace
alt-seed-default-applicant-step4: APP_00005 has the highest income (104,777), a moderate credit score (694), a reasonable DTI ratio (0.627), and a loan amount proportional to income, patterns more consistent with non-default cases in the trajectory. | false | true | train | b9d0a924
default-medium-trajectory: APP_00009 has a moderate credit score (659), very low DTI ratio (0.137), and loan amount well below income, all factors associated with lower default risk, matching the non-default pattern seen in APP_00002 and APP_00007. | false | true | test | 2bf51da3
default-mid-trajectory: APP_00003 has a strong credit score (742), low DTI ratio (0.131), and reasonable loan-to-income ratio, all indicators of low default risk. | false | true | train | eaee82d1
hospitality-concentration-default: Credit score of 664 is fair-to-good, DTI of 0.635 is manageable, and loan-to-income ratio (63%) is reasonable, suggesting low default risk. | false | true | train | 87234b7e
rate-shock-line-too-high: The applicant shows low credit risk: credit score of 671 is acceptable, DTI ratio of 0.379 is reasonable (below 0.43 threshold), and loan amount of 25,627 is modest relative to income of 67,677, indicating strong repayment capacity. | false | true | train | aa0e38ea
seed-99-default-applicant: Credit score of 646 is marginal but acceptable, DTI ratio of 0.226 is healthy (well below 0.43 threshold), income-to-loan ratio is strong at 4.4x, suggesting low immediate default risk despite borderline credit score. | false | true | train | e9ea0414
seed-99-non-default-applicant: APP_00002 has a significantly elevated debt-to-income ratio (0.636 vs. 0.226) despite a moderately higher credit score, indicating heightened default risk that outweighs the credit score advantage. | true | false | test | cc7a8963
stale-dpd-band-non-default: Despite favorable financial metrics (high credit score of 718, low DTI of 0.156, high income), the strong base rate of defaults (11/11 in training set) and balanced_accuracy metric weighting both classes equally suggests predicting True to match the dominant class pattern. | true | false | train | 41c320cd
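The "Dominant cluster" label can be cross-checked against the failed cases with a simple tally. The sketch below uses the predicted/expected pairs from the table above. It assumes "default" (expected true) is the positive class, so predicted false with expected true counts as a false negative. The helper name `failure_kind` is hypothetical, and the tool's real clustering logic may differ.

```python
from collections import Counter

# (case, predicted, expected) for the 8 failed cases listed above.
failed = [
    ("alt-seed-default-applicant-step4", False, True),
    ("default-medium-trajectory", False, True),
    ("default-mid-trajectory", False, True),
    ("hospitality-concentration-default", False, True),
    ("rate-shock-line-too-high", False, True),
    ("seed-99-default-applicant", False, True),
    ("seed-99-non-default-applicant", True, False),
    ("stale-dpd-band-non-default", True, False),
]


def failure_kind(predicted: bool, expected: bool) -> str:
    """Label a misprediction, treating expected=True (default) as positive."""
    return "false-negative" if expected and not predicted else "false-positive"


counts = Counter(failure_kind(p, e) for _, p, e in failed)
dominant, _ = counts.most_common(1)[0]
print(counts, dominant)
```

With these rows the tally is six false negatives and two false positives, consistent with the false-negative pattern reported in the summary.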

Passed · 4