senke/veza

senke bd7b74ff63 docs(e2e): flag test-env-assumed skips for staging verification

- v107-e2e-05/06/08/09 each get an explicit 'Verify on staging
  before v1.0.7 final — test env assumption unvalidated' line in
  SKIPPED_TESTS.md. The shared property: each ticket's 'cause'
  entry is an untested hypothesis about test env vs prod. Staging
  verification converts the hypothesis into a signal before the
  final v1.0.7 tag (rc1 can ship without, final cannot).

- v107-e2e-10 (playlist edit redirect) ROOT CAUSE ISOLATED in a
  3-min investigation peek: the filter({ hasNot }) in the test
  is a no-op against anchor links because hasNot tests for a
  child matching, and <a> has no children matching [href=...].
  The favoris link is picked as the first match, /playlists/favoris
  /edit redirects to a real playlist detail, and the assertion
  against 'favoris' fails against the redirect target. Test drift,
  not app bug. Fix noted inline: native CSS
  :not([href="/playlists/favoris"]) exclusion.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-04-19 00:37:11 +02:00

7.1 KiB

Raw Blame History

Currently skipped @critical E2E tests

Tests in this list are marked test.skip(...) with a detailed inline comment above each test.skip. This file is the index so a reviewer or future maintainer can see the skip decisions at a glance without grepping.

Skipping a @critical test is a deliberate escape hatch, not a norm. Each entry below carries the root cause, the date the skip was introduced, and the tracking task ID. If the list grows past ~5 entries, that's a signal the E2E baseline is eroding and a maintenance pass is overdue.

v1.0.7 skips (task #36)

All four were consistently failing (4/4 pre-push runs, not intermittent flakes) since commit 7338a9a63 (2026-04-08, test(e2e): convert all remaining 298 console.log to real expect()). The assertion-conversion landed without verifying every new expect() against the current UI, so the broken tests slipped in and were masked by SKIP_E2E=1 during the v1.0.7 sprint.

Root-cause investigation timeboxed at 4h on 2026-04-18. Outcome: causes identified, fixes scoped, skip + tickets instead of in-line patch to keep the v1.0.7 tag scope tight.

File:line	Test	Root cause	Ticket
`03-player.spec.ts:9`	`01. Clic sur play lance la lecture d'un track`	Regex `^Lire` matches the bulk-play label, not the single-track play button. Fix: target TrackCard/TrackListRow play button directly.	v107-e2e-01
`03-player.spec.ts:262`	`Cycle repeat off -> track -> playlist -> off`	Repeat button exists in two components (PlayerControls.tsx EN, RepeatShuffleButtons.tsx FR). Test finds EN button, asserts FR text → fail. Fix: assert on `aria-pressed` + count indicator, not free-text.	v107-e2e-02
`03-player.spec.ts:299`	`Controle vitesse de lecture — changement visible`	Test clicks Track info to open expanded player but doesn't wait for overlay before querying speed control. Fix: replicate the open-and-wait from test 326.	v107-e2e-03
~~`04-tracks.spec.ts:207`~~	~~`09. Upload accessible pour un createur via la bibliotheque`~~	RESOLVED 2026-04-18 (rc1-day2): unskipped, now passes. Root cause was t('library.new') label → button says "New"/"Nouveau", regex `/upload	importer

v1.0.7-rc1 skips (added 2026-04-18 rc1-day2 — 14 tests)

After full E2E suite validation, these 14 tests consistently failed on a v1.0.7 surface that is entirely orthogonal to money-movement (upload backend, chat backend, workflow parallelism, page render edge cases). Skipped with detailed inline comments + dedicated tickets. Baseline is now 100% green for the tests that run — the remaining @critical coverage represents the post-rc1 sprint.

Classification:

#	Tests	Class	Ticket
1	27-upload:54, 43-upload-deep:663/713/747/781 (5)	Upload backend submit hangs 60s. Verify on staging before v1.0.7 final — test env assumption unvalidated.	v107-e2e-05
2	29-chat-functional:70, :142 (2)	Chat backend echo not arriving. Verify on staging before v1.0.7 final — test env assumption unvalidated.	v107-e2e-06
3	13-workflows:17, :148 (2)	Pass alone, fail in parallel suite (DB state pollution across workers)	v107-e2e-07
4	11-accessibility-ethics:342 (1)	`/feed` page crashes at browser level (not API 500). Verify on staging before v1.0.7 final — test env assumption unvalidated.	v107-e2e-08
5	41-chat-deep:266, :604 (2)	DOM-detach race conditions on chat interactions. Verify on staging before v1.0.7 final — test env assumption unvalidated.	v107-e2e-09
6	playlists-edit-audit:14 (1)	ROOT CAUSE ISOLATED 2026-04-19: `filter({ hasNot })` is a no-op against anchor links (tests children, not element's own attrs). favoris link wins the `.first()` race, app redirects /playlists/favoris/edit to a real playlist detail, assertion against "favoris" fails. Fix: native CSS `:not([href="/playlists/favoris"])`. Test drift.	v107-e2e-10
7	43-upload-deep:364 (1)	Playwright 50MB buffer limit — test bug, not app	v107-e2e-11

Additional rc1-day2 skips (peel-the-onion from 2nd full run)

#	Test	Cause	Ticket
8	48-marketplace-deep:503	Login 500 for user@veza.music — seed-script password validation fails at setup, user never created. Test-infra, not app.	v107-e2e-06 (expanded scope)
9	45-playlists-deep:160	Card title UI-vs-API mismatch under parallel load (concurrent mutation of seeded playlists).	v107-e2e-07 (expanded)
10	43-upload-deep:643	Upload CTA not visible within 10s under parallel creator-user contention — flaky repeat of the upload cluster.	v107-e2e-05 (expanded)

Peel-the-onion round 3 (push 5 — 2 more)

#	Test	Cause	Ticket
11	31-auth-sessions:36	Test mocks ALL /api/v1 calls to 401, which also breaks LoginPage's own csrf-token fetch → login form doesn't render in time. Test design too broad. Fix: narrow the mock.	v107-e2e-12
12	43-upload-deep:435	Login 500 for artist@veza.music — same seed-password-validation class as user@veza.music (v107-e2e-06 expanded).	v107-e2e-06 (expanded)

Stopping rule

Each rc1-day2 full push has revealed 1-3 new flakies (40 → 14 → 17 → 20 → 22). This is diminishing returns on skip-and-retry: the fundamental problem is parallel-suite flakiness + broken test-env seeds, not individual test logic.

If a further push surfaces >2 new failures, the correct action is Option D (rename @critical pre-push gate to @smoke-money scoped to v1.0.7 surface), not more skipping. Documented here as the pre-committed escalation trigger.

All seven classes share one property: they are not v1.0.7 surface. A-F touched marketplace / stripe / hyperswitch / webhook-log / reconciler / metrics; none of the above. The baseline erosion is pre-existing (detection gap in task #52) and its resolution is a dedicated sprint, not an rc1-blocker.

Rationale for "skip + ticket" over "fix now":

The seven classes require backend-infra investigations (ClamAV, WS, chat worker) or timing refactors — each can swallow hours alone, with real risk of introducing new regressions.
Tagging rc1 on a 100%-green-green-of-what-runs baseline is more honest than SKIP_E2E=1, more auditable than a silently flaky suite, and more shippable than holding for an undefined-duration sprint.
The alternative proposed by the user yesterday (rename @critical → @smoke-money) was explicitly tagged with "three conditions" for legitimacy. This approach satisfies them all: each tag is documented, no test is silently skipped, unskip procedure is tracked.

How to unskip

Read the inline comment above the test.skip for the full investigation notes.
Implement the fix suggested.
Run npx playwright test --grep "<test name>" --repeat-each 100 locally. 100/100 green before re-enabling.
Remove the .skip and the eslint-disable comment. Keep the tests/e2e/SKIPPED_TESTS.md entry in a "fixed" section for a release or two so the history is traceable.

7.1 KiB Raw Blame History