aboutsummaryrefslogtreecommitdiff
path: root/docs/2026-06-10-tool-evaluation-criteria.org
diff options
context:
space:
mode:
Diffstat (limited to 'docs/2026-06-10-tool-evaluation-criteria.org')
-rw-r--r--docs/2026-06-10-tool-evaluation-criteria.org31
1 files changed, 31 insertions, 0 deletions
diff --git a/docs/2026-06-10-tool-evaluation-criteria.org b/docs/2026-06-10-tool-evaluation-criteria.org
new file mode 100644
index 0000000..f5fec34
--- /dev/null
+++ b/docs/2026-06-10-tool-evaluation-criteria.org
@@ -0,0 +1,31 @@
+#+TITLE: Tool Evaluation Criteria and Trade-offs
+#+DATE: 2026-06-10
+#+DESCRIPTION: The standing criteria for evaluating tool replacements in this setup, distilled from the 2026-06-10 evaluation round.
+
+* Purpose
+A replacement tool earns adoption only when it beats the incumbent on a dimension that matters here, without losing on a gating dimension. These criteria make that test explicit so future evaluations (and the annual pain-point review) apply the same bar instead of re-deriving it.
+
+* Gating criteria (fail any → reject)
+1. Wayland-native. No XWayland dependency, ever. X11-only preview/clipboard machinery counts as failure (see: ranger's ueberzug path).
+2. Actively maintained. A release or meaningful commit activity within the last ~12 months, and no open "is this abandoned?" signals. Verify with the repo, not aggregator articles — stale narratives persist for years (see: paru-vs-yay, where the 2021 story had fully inverted by 2026).
+3. Automation-compatible where the installer touches it. Anything archsetup provisions non-interactively must have working unattended flags (see: the --answerdiff/--skipreview mapping check).
+4. Stowable, declarative config. File-based config that lives in the dotfiles repo and survives a stow. GSettings/GUI-only config fails (see: ptyxis as a theming dead end).
+
+* Weighting criteria (compare on these)
+- Daily-use win: does it remove a real friction, or is it novelty? "Answers in one command what took a pipeline" (dust) beats "same answer, prettier" (gping).
+- Workflow fit: Emacs (dired, vterm, magit) and tmux already own many niches. A candidate duplicating them adds maintenance surface without adding capability (see: terminal tabs/splits, broot vs dired+zoxide).
+- Theme-system fit: can set-theme drive it (INI/TOML include, runtime color escape, reload signal)? Tools that can't follow dupre/hudson stay second-class.
+- Packaging: official repo > AUR > git build. Every step down adds update fragility to fresh installs.
+- Migration cost: config translation, muscle-memory retraining, installer changes. Cost scales with how load-bearing the incumbent is.
+
+* Process
+1. Re-verify maintenance claims live (repo releases/commits, not blog posts).
+2. Compare against the incumbent on the weighting criteria; the incumbent wins ties — switching costs are real and "not worth adopting" is a valid verdict.
+3. Adoption decisions are Craig's; evaluations produce a recommendation plus the condition that would change it ("revisit if X").
+4. Adopted tools get codified in archsetup the same cycle, so fresh installs match the live machines.
+5. Cadence: the annual pain-point review (todo) triggers re-evaluation; otherwise re-check only when an incumbent's maintenance lapses or a workflow change opens a gap.
+
+* Trade-offs accepted in the 2026-06-10 round
+- foot keeps its no-kitty-graphics stance; accepted because sixel covers current preview needs.
+- yay's weaker PKGBUILD-review UX accepted for its maintenance health and unattended-install reliability.
+- nautilus's modest keyboard ergonomics accepted for native libadwaita fit and zero extra dependency weight.