diff options
Diffstat (limited to 'docs/design/coverage.org')
| -rw-r--r-- | docs/design/coverage.org | 206 |
1 files changed, 0 insertions, 206 deletions
diff --git a/docs/design/coverage.org b/docs/design/coverage.org deleted file mode 100644 index acd8b4c43..000000000 --- a/docs/design/coverage.org +++ /dev/null @@ -1,206 +0,0 @@ -#+TITLE: Design: Coverage Reporting -#+AUTHOR: Craig Jennings -#+DATE: 2026-04-22 - -* Status - -Implemented for Elisp. - -The shipped path is local-first: =make coverage= produces -=.coverage/simplecov.json= with Undercover, and =cj/coverage-report= reads that -artifact to show either diff-aware coverage or a whole-project summary from -Emacs. Python, TypeScript, and Go backends remain future work. - -* Problem - -Before this work, there was no quick way to answer "are the lines I just -changed actually covered by tests?" Line-level coverage for the *whole* -project was also missing, and there was no local artifact to inspect. - -The primary user-facing need is the first one: point-in-time feedback on -in-flight changes, triggered from Emacs. The implemented system also supports a -whole-project summary and writes a local SimpleCov JSON artifact. - -The tooling should be pluggable so the same workflow covers Elisp today and Python, TypeScript, and Go later — without rebuilding the UI for each language. - -* Non-Goals - -- Continuous in-buffer overlays (fringe marks, line highlights). Parked over performance concerns. -- Mutation testing or any signal other than line coverage. -- CI integration beyond emitting a simplecov JSON artifact. No coveralls, no GitHub Actions wiring. -- Shadowing or replacing existing test-running commands (=make test=, =make test-file=, etc.). - -* Approaches Considered - -** Recommended: diff-aware report with pluggable backends - -Core engine reads a simplecov JSON file, shells to ~git diff~ at a selectable scope, intersects, and displays the result in a compilation-mode-derived buffer. Language-specific "backends" each produce simplecov in their own way and register themselves with the core. - -*Pros:* Directly serves the primary use case. Simplecov is broadly supported across language coverage tools, and Undercover's ~:merge-report t~ option works for simplecov (but not for LCOV), which is essential for the per-file coverage-run strategy. Compilation-mode inheritance gives free =next-error= / =previous-error= navigation. - -*Note on format:* An earlier draft of this design used LCOV. That was changed to simplecov after discovering that Undercover's LCOV writer does not implement report-merging — per-file coverage runs would require custom merge logic or an external ~lcov~ tool. Simplecov's native merge-report support made it the cleaner fit without changing anything about the pluggable backend story. - -*Cons:* More code than a "just run coverage and read the output" approach. Backend registry adds one layer of indirection (small — ~30 lines). - -** Rejected: non-interactive pre-commit hook - -Would run coverage on every commit and report uncovered-changed-lines to stderr. Literal fit for the use case but adds a long delay to every commit and offers no way to inspect non-staged scopes. - -** Rejected: coverage as a =review-code= skill criterion - -Would fold coverage into the existing pre-commit review skill. Clean in principle, but couples =review-code= to Emacs-specific tooling and makes ad-hoc inspection (outside a review) awkward. - -** Rejected: mutation testing instead of line coverage - -Stronger signal than coverage but minutes-to-hours runtime on the current 265-file suite, and no polished Elisp tool exists. Different conversation. - -* Design - -** Architecture - -Three files: - -- =modules/coverage-core.el= — engine + backend registry + user-facing command. Language-agnostic. -- =modules/coverage-elisp.el= — the initial backend. Registers itself on load. -- (Future) =modules/coverage-python.el=, =coverage-typescript.el=, =coverage-go.el= — each ~30 lines, self-registering. - -=init.el= requires the core and the active backends. - -*** Elisp coverage producer - -For the Elisp backend, =make coverage= is the only supported producer of the -coverage artifact. It removes stale compiled files for instrumented sources, -then runs each unit test file in its own batch Emacs process. Before loading -the test file, the Makefile loads =tests/run-coverage-file.el=, which -initializes packages and configures Undercover: - -#+begin_src emacs-lisp -(undercover "modules/*.el" - "gptel-tools/*.el" - (:report-format 'simplecov) - (:report-file ".coverage/simplecov.json") - (:merge-report t) - (:send-report nil)) -#+end_src - -Undercover is therefore the instrumentation layer: it instruments -=modules/*.el= and =gptel-tools/*.el=, records Edebug stop-point hits while -tests execute, and writes the line hit arrays. SimpleCov is the local -interchange format consumed by the rest of this design. The split-per-test-file -Makefile strategy depends on =:merge-report t=; Undercover can merge SimpleCov -reports across separate Emacs processes, while its LCOV writer cannot merge -reports. This is the concrete reason the artifact is -=.coverage/simplecov.json= rather than =coverage.lcov=. - -The Makefile excludes tests that are incompatible with instrumented source -loading, such as byte-compilation checks. If =.coverage/simplecov.json= is not -created, the coverage run is considered failed; downstream report commands -should not infer partial coverage from a missing artifact. - -*** Backend protocol - -Each backend is a plist registered into =cj/coverage-backends=: - -#+begin_src emacs-lisp -(:name 'elisp - :detect (lambda () ...) ; non-nil if current project matches - :run (lambda (cb) ...) ; kick off coverage build; invoke CB with report path - :report-path (lambda () ...)) ; where the simplecov JSON lives (for re-reading without running) -#+end_src - -Detection precedence: =.dir-locals.el= override (=cj/coverage-backend= set to a backend name), then project-root fingerprints (=go.mod=, =pyproject.toml=, =package.json=, =.el= files + Makefile, etc.). First =:detect= that matches wins. No silent fallback — if nothing matches, the command errors with guidance. - -*** Pure helpers - -- =cj/--coverage-parse-simplecov FILE= → hash-table ={file → covered-line-set}=. -- =cj/--coverage-changed-lines SCOPE BASE= → hash-table ={file → changed-line-set}= by shelling a =git diff --unified=0= for the selected scope and parsing hunk headers. -- =cj/--coverage-intersect COVERED CHANGED= → per-file records with three buckets: covered, uncovered, not-tracked. - -These helpers are pure and covered by focused ERT tests. - -** Data Flow - -1. User invokes =cj/coverage-report= (bound to =F7=). -2. Core resolves the backend for the current project. -3. =completing-read= prompts for scope: - - "Working tree — all uncommitted changes" - - "Staged — about to commit" - - "Branch vs parent" (uses =@{upstream}= unless a caller passes an explicit base to the helper) - - "Branch vs main" (explicit) - - "Whole project — all executable lines" -4. If =simplecov.json= is missing, prompt to run coverage. A prefix argument - (=C-u F7=) forces a fresh run. Otherwise the existing report is used as-is. -5. Parse simplecov, compute changed lines or all executable lines, intersect. -6. Display a report buffer in a mode derived from =compilation-mode=. - -** Persistence - -- =.coverage/simplecov.json= at the project root, gitignored. Overwritten on each run. -- No long-term storage. Historical tracking is explicitly out of scope for v1. - -** Error Handling - -*Pre-flight:* -- No backend matches → =user-error= with instructions to register a backend or set =.dir-locals.el=. -- =.dir-locals.el= names an unknown backend → error listing registered backends. -- Not in a git repository → error; don't swallow git's stderr. -- Branch comparison on a repo with no common ancestor (orphan branch, shallow - clone missing the fork point, or missing upstream) reports the underlying git - failure. - -*During the coverage run:* -- Backend =:run= fails (test failure, Make error) → keep the =compile= buffer visible, do *not* proceed to display a report. Partial data is worse than no data. -- Run completes but no simplecov.json produced → error naming the expected path. - -*Post-flight classification:* three buckets, not two. -- *Covered* — changed line in the simplecov covered-line set. -- *Uncovered* — changed line in a tracked file but not covered. -- *Not tracked* — changed file isn't in the simplecov data at all (test files, READMEs, config). Reported separately — don't conflate "coverage didn't look here" with "tests didn't exercise this code." - -*Happy-path degenerates:* -- Zero changed lines in scope → "No changes in this scope; nothing to report." -- All changed lines covered → "N of N changed lines covered. " - -** Keybindings - -*Global:* -- =F7= → =cj/coverage-report= (prompts scope, shows report). -- =C-u F7= → force re-run regardless of report freshness. - -*In the report buffer* (compilation-mode derived, most inherited for free): -- =RET= → jump to source under point. -- =n= / =p= → next / previous uncovered line. -- =q= → bury buffer. - -*Globally available via compilation-mode integration:* -- =M-g n= / =M-g p= → =next-error= / =previous-error= on the last compilation buffer. -- =C-x `= → visit next uncovered line without leaving the current buffer. - -The =F4=–=F7= developer block currently uses =F6= for project-aware test -dispatch and =F7= for coverage. - -** Testing - -*Pure helpers, fully tested* (Normal / Boundary / Error for each): -- =cj/--coverage-parse-simplecov= — handcrafted simplecov JSON in temp files; empty object, all-null coverage arrays, spaces in filenames, multiple test-name keys unioned, malformed JSON. -- =cj/--coverage-simplecov-executable-lines= — whole-project executable-line set, including zero-hit executable lines. -- =cj/--coverage-changed-lines= — =cl-letf= over =shell-command-to-string= to return canned =git diff= output; single hunk, new-file hunk, deletion-only hunk, binary marker, no-diff case. -- =cj/--coverage-intersect= — pure table-in / table-out; covered ⊇ changed, unknown files, nil/empty inputs. -- =cj/--coverage-format-report= and =cj/--coverage-format-summary= — report text and whole-project per-file summary. - -*Backend registry, structurally tested:* -- =cj/coverage-backend-for-project ROOT= — synthetic temp project roots with marker files; assert correct backend. Registration-order test: two backends match, first-registered wins. - -*Not tested:* -- =cj/coverage-report= interactive command — one smoke test with a prepared simplecov report and a stubbed git-diff. No tests for the prompt UI or the compilation-buffer display. -- The elisp backend's =:run= function — shells to =make coverage=; integration-test-shaped, low value, slow. Skipped by design. - -* Current Limitations - -- =make coverage= runs all unit test files except known instrumentation - conflicts. It does not try to select only tests related to changed modules. -- Existing reports are not checked for staleness. Use =C-u F7= or - =make coverage= when a fresh report matters. -- Only the Elisp backend is implemented. -- There is no CI coverage publishing. The generated - =.coverage/simplecov.json= file is local and gitignored. |
