Conversation
…hat changed Existing diffs return raw pixel regions or a11y-element diffs; the gap is 'given a frame diff and a list of element boxes, which of those changed?'. localize_changes diffs reference vs current and scores each element box by its mean per-pixel change; rank_changes is the pure ranker (changed when score >= threshold, sorted most-changed first). cv2/numpy lazy.
…ize-batch Add change_localize: attribute a screen change to the element boxes that changed
Set-of-Marks/element proposers return boxes but not what each box is;
form_fields.checkbox_state reads a box already known to be a checkbox.
box_features extracts {aspect, fill, edge_density, circularity};
classify_widget is the pure heuristic classifier (round->radio,
wide-rounded->toggle, square-sparse->checkbox, wide-hollow->text_field,
wide-filled->button, else icon); classify_icon composes them. cv2 lazy.
…y-batch Add icon_classify: classify a widget from its pixel shape
Set-of-Marks/observation/grounding assume you already have element boxes, but a game/custom-drawn app/remote desktop has no accessibility tree. propose_elements builds the top-of-funnel list from pixels: widget blobs (Canny+morphology+connected_boxes) + text regions, fused via element_parse (ocr>icon priority is the drop-widget-that-is-really-text cross-check), reading-ordered and tagged text/widget. tag_kinds is the pure labeller.
…osal-batch Add element_proposal: propose a clean element list from raw pixels
Up to standards ✅🟢 Issues
|
| Metric | Results |
|---|---|
| Complexity | 89 |
| Duplication | 0 |
NEW Get contextual insights on your PRs based on Codacy's metrics, along with PR and Jira context, without leaving GitHub. Enable AI reviewer
TIP This summary will be updated as you push new changes.
|
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.



Release — ROUND-15 perception lane (part 2, lane COMPLETE)
Three new headless features completing the perception lane (7/7). Each has a pure / injectable-seam core, full 5-layer wiring, headless tests, and EN/Zh docs; all merged to dev CI-green first try.
change_localize(Add change_localize: attribute a screen change to the element boxes that changed #447):localize_changes/rank_changes— attribute a frame diff to which element boxes changed (the gap between raw pixel-region diffs and a11y-element diffs). cv2 lazy; ranker pure.icon_classify(Add icon_classify: classify a widget from its pixel shape #448):box_features/classify_widget/classify_icon— type a box (checkbox/radio/button/text_field/toggle/icon) from geometric features. Classifier pure; cv2 lazy.element_proposal(Add element_proposal: propose a clean element list from raw pixels #449):propose_elements/tag_kinds— template-free top-of-funnel element list from pixels (widget blobs + text regions fused with the ocr>icon cross-check, reading-ordered). Reuses text_regions/element_parse/connected_boxes; cv2 lazy.Perception lane now COMPLETE: cvd_simulate, marks_layout, contrast_map, theme_normalize, change_localize, icon_classify, element_proposal. Package stays Qt-free; all logic tested without a screen via pure cores + per-function importorskip for the cv2 paths.