Add run_diff: LCS-aligned diff of two run step-traces by JE-Chen · Pull Request #411 · Integration-Automation/AutoControlGUI

JE-Chen · 2026-06-24T17:40:19Z

Why

A run history tells you a run failed, but not what changed from the run that passed: which step was added or dropped, which step flipped pass→fail, which step got slower. run_diff aligns the two step sequences with a longest-common-subsequence walk — so an inserted/removed step shifts the rest into place instead of mis-pairing everything — and classifies the differences:

added / removed — steps present in only one run
status_flips — an aligned step whose status changed, with the new failure's failure_signature when it carries an error
timing_regressions — an aligned step that got regress_factor× slower

summarize_run_diff renders a one-line summary. The second item of the test-robustness lane (consumes failure_signature from v191).

Design

Pure stdlib LCS DP over the step name sequences; classification helpers (_status_flip / _regression / _aligned_changes / _unmatched) keep every function under CC 10 (radon-clean). A step is any dict with a name key + optional status/duration/error.
5 layers wired: core → facade __all__ → AC_diff_runs (returns the diff + a summary) → read-only ac_diff_runs MCP tool → Script Builder (Testing). Qt-free verified; pytest.approx for the ratio (no float ==).

Tests

test/unit_test/headless/test_run_diff_batch.py — LCS isolating an insert (no mis-pairing), status flip carrying a 12-char signature, timing regression with ratio + sub-factor non-regression, removed-step detection, identical runs → "no change", summary contents, and the executor summary path + 5-layer wiring. 18 passed with the failure_signature sibling.

A run history says a run failed but not what changed from the run that passed. Align two step sequences with an LCS walk (so an inserted or removed step shifts the rest into place instead of mis-pairing) and classify the differences: added/removed steps, status flips (with the new failure's signature), and timing regressions. summarize_run_diff renders a one-line summary. Pure stdlib over step dicts.

codacy-production · 2026-06-24T17:41:55Z

Up to standards ✅

🟢 Issues 0 issues

Results:
0 new issues

View in Codacy

🟢 Metrics 53 complexity · 0 duplication

Metric Results

Complexity 53

Duplication 0

View in Codacy

_{NEW Get contextual insights on your PRs based on Codacy's metrics, along with PR and Jira context, without leaving GitHub. Enable AI reviewer}
_{TIP This summary will be updated as you push new changes.}

sonarqubecloud · 2026-06-24T17:46:40Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

JE-Chen merged commit 20ec65f into dev Jun 24, 2026
16 checks passed

JE-Chen deleted the feat/run-diff-batch branch June 24, 2026 17:45

JE-Chen mentioned this pull request Jun 24, 2026

Release: test-robustness lane — failure signatures, run diff, flake clustering, step timeline (v191–v194) #414

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add run_diff: LCS-aligned diff of two run step-traces#411

Add run_diff: LCS-aligned diff of two run step-traces#411
JE-Chen merged 1 commit into
devfrom
feat/run-diff-batch

JE-Chen commented Jun 24, 2026

Uh oh!

codacy-production Bot commented Jun 24, 2026

Uh oh!

Uh oh!

sonarqubecloud Bot commented Jun 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

JE-Chen commented Jun 24, 2026

Why

Design

Tests

Uh oh!

codacy-production Bot commented Jun 24, 2026

Up to standards ✅

Uh oh!

Uh oh!

sonarqubecloud Bot commented Jun 24, 2026

Quality Gate passed

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant