rename(phase 1): CapaKraken → Nexus across code, UI, docs, CI

- @capakraken/* → @nexus/* across 12 packages (root + 11 workspaces), 1551 import lines migrated via codemod - User-visible brand strings renamed (emails, page titles, PWA manifest, mobile header, MFA backup-codes header, tooltips, signin page, invite page, weekly digest, install prompt) - TOTP issuer "CapaKraken" → "Nexus" (existing secrets still valid; re-enrollment relabels them in users' authenticator apps) - Function rename: assertCapaKrakenDbTarget → assertNexusDbTarget - LocalStorage migration shim in apps/web/src/app/layout.tsx copies capakraken_* → nexus_* on first load (guarded by nexus_migrated_v1 sentinel; runs once per browser, then never again) - Service-worker cache name capakraken-v2 → nexus-v2 with one-time caches.delete('capakraken-v2') from the same shim - Email-domain fixtures @capakraken.{dev,app} → @nexus.{dev,app} in seed data, e2e specs, SMTP default fallback - Dockerfile.dev / Dockerfile.prod / all .github/workflows/*.yml pnpm --filter @capakraken/* → @nexus/* - README, CLAUDE.md, LEARNINGS.md, all docs/*.md, .env.example, tooling/deploy/.env.production.example brand sweep Phase 1 deliberately leaves untouched (handled in Phase 3 cutover): - PostgreSQL DB name "capakraken" and POSTGRES_USER "capakraken" - Volume names capakraken_pgdata etc. - Compose project name "capakraken" / "capakraken-prod" - db-target-guard default expectedDatabase - env-var CAPAKRAKEN_EXPECTED_DB_NAME - Container DNS names in docker-compose.ci.yml Quality gates green: pnpm typecheck (7/7), pnpm test:unit (7/7), pnpm lint (0 errors), check:exports/imports/architecture all pass. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-21 15:10:44 +02:00
parent d9a7ec0338
commit 4a5edeef3e
941 changed files with 24475 additions and 16760 deletions
@@ -1,17 +1,19 @@
-# CapaKraken V2 Architecture Proposal
+# Nexus V2 Architecture Proposal

 **Date:** 2026-03-11  
 **Scope:** Codebase review, v2 direction, architecture rethink, parallel agent strategy

 ## Executive Summary

-CapaKraken already has a good base:
+Nexus already has a good base:
+
 - monorepo boundaries are mostly clean
 - `engine` and `staffing` contain useful pure domain logic
 - Next.js + tRPC + Prisma keeps product iteration fast
 - Redis-backed SSE is already a reasonable realtime baseline

 The main issue is not the stack. The issue is that domain logic is split across:
+
 - large client components
 - large tRPC routers
 - JSONB-heavy persistence models
@@ -44,33 +46,39 @@ This gives you a v2 that is safer, easier to change, and still realistic for a s
 ## 1. Critical correctness and security issues exist today

 ### Auth hashing is inconsistent
- Login verifies Argon2 hashes in [`apps/web/src/server/auth.ts#L20`](/home/hartmut/Documents/Copilot/capakraken/apps/web/src/server/auth.ts#L20).
- Admin-created users are still stored with SHA-256 in [`packages/api/src/router/user.ts#L41`](/home/hartmut/Documents/Copilot/capakraken/packages/api/src/router/user.ts#L41).
+
+- Login verifies Argon2 hashes in [`apps/web/src/server/auth.ts#L20`](/home/hartmut/Documents/Copilot/nexus/apps/web/src/server/auth.ts#L20).
+- Admin-created users are still stored with SHA-256 in [`packages/api/src/router/user.ts#L41`](/home/hartmut/Documents/Copilot/nexus/packages/api/src/router/user.ts#L41).
 - Impact: users created from the admin flow are likely unable to log in.

 ### Notification creation is open to any authenticated user
- `notification.create` is only `protectedProcedure` in [`packages/api/src/router/notification.ts#L66`](/home/hartmut/Documents/Copilot/capakraken/packages/api/src/router/notification.ts#L66).
+
+- `notification.create` is only `protectedProcedure` in [`packages/api/src/router/notification.ts#L66`](/home/hartmut/Documents/Copilot/nexus/packages/api/src/router/notification.ts#L66).
 - Impact: any logged-in user can create notifications for arbitrary users.

 ### AI connection testing is Azure-shaped even when provider is OpenAI
- `testAiConnection` always constructs an Azure deployment URL in [`packages/api/src/router/settings.ts#L122`](/home/hartmut/Documents/Copilot/capakraken/packages/api/src/router/settings.ts#L122).
+
+- `testAiConnection` always constructs an Azure deployment URL in [`packages/api/src/router/settings.ts#L122`](/home/hartmut/Documents/Copilot/nexus/packages/api/src/router/settings.ts#L122).
 - Impact: provider abstraction is not actually reliable.

 ### Repo health checks are currently failing
- `pnpm test:unit` fails because `@capakraken/shared` has a Vitest script but no tests in [`packages/shared/package.json`](/home/hartmut/Documents/Copilot/capakraken/packages/shared/package.json).
- `pnpm typecheck` fails because `crypto.randomUUID()` is used without a visible import/global typing in [`packages/shared/src/schemas/project.schema.ts#L5`](/home/hartmut/Documents/Copilot/capakraken/packages/shared/src/schemas/project.schema.ts#L5).
+
+- `pnpm test:unit` fails because `@nexus/shared` has a Vitest script but no tests in [`packages/shared/package.json`](/home/hartmut/Documents/Copilot/nexus/packages/shared/package.json).
+- `pnpm typecheck` fails because `crypto.randomUUID()` is used without a visible import/global typing in [`packages/shared/src/schemas/project.schema.ts#L5`](/home/hartmut/Documents/Copilot/nexus/packages/shared/src/schemas/project.schema.ts#L5).

 These are not “v2 someday” items. They should be fixed before deeper refactoring.

 ## 2. Large surfaces are carrying too much responsibility

 The biggest modules are already a warning sign:
- [`apps/web/src/components/timeline/TimelineView.tsx`](/home/hartmut/Documents/Copilot/capakraken/apps/web/src/components/timeline/TimelineView.tsx) is 1720 lines.
- [`apps/web/src/components/projects/ProjectWizard.tsx`](/home/hartmut/Documents/Copilot/capakraken/apps/web/src/components/projects/ProjectWizard.tsx) is 1171 lines.
- [`packages/api/src/router/resource.ts`](/home/hartmut/Documents/Copilot/capakraken/packages/api/src/router/resource.ts) is 908 lines.
- [`packages/api/src/router/timeline.ts`](/home/hartmut/Documents/Copilot/capakraken/packages/api/src/router/timeline.ts) is 631 lines.
+
+- [`apps/web/src/components/timeline/TimelineView.tsx`](/home/hartmut/Documents/Copilot/nexus/apps/web/src/components/timeline/TimelineView.tsx) is 1720 lines.
+- [`apps/web/src/components/projects/ProjectWizard.tsx`](/home/hartmut/Documents/Copilot/nexus/apps/web/src/components/projects/ProjectWizard.tsx) is 1171 lines.
+- [`packages/api/src/router/resource.ts`](/home/hartmut/Documents/Copilot/nexus/packages/api/src/router/resource.ts) is 908 lines.
+- [`packages/api/src/router/timeline.ts`](/home/hartmut/Documents/Copilot/nexus/packages/api/src/router/timeline.ts) is 631 lines.

 That usually means:
+
 - transport, orchestration, validation, business rules, and data access are mixed
 - testing becomes expensive
 - one change touches too many concerns
@@ -78,12 +86,14 @@ That usually means:
 ## 3. The core planning model is overloaded

 The Prisma schema uses JSONB heavily in core workflows:
- blueprints and role presets in [`packages/db/prisma/schema.prisma#L147`](/home/hartmut/Documents/Copilot/capakraken/packages/db/prisma/schema.prisma#L147)
- resource availability, skills, and dynamic fields in [`packages/db/prisma/schema.prisma#L208`](/home/hartmut/Documents/Copilot/capakraken/packages/db/prisma/schema.prisma#L208)
- project staffing requirements and dynamic fields in [`packages/db/prisma/schema.prisma#L267`](/home/hartmut/Documents/Copilot/capakraken/packages/db/prisma/schema.prisma#L267)
- allocation metadata in [`packages/db/prisma/schema.prisma#L301`](/home/hartmut/Documents/Copilot/capakraken/packages/db/prisma/schema.prisma#L301)
+
+- blueprints and role presets in [`packages/db/prisma/schema.prisma#L147`](/home/hartmut/Documents/Copilot/nexus/packages/db/prisma/schema.prisma#L147)
+- resource availability, skills, and dynamic fields in [`packages/db/prisma/schema.prisma#L208`](/home/hartmut/Documents/Copilot/nexus/packages/db/prisma/schema.prisma#L208)
+- project staffing requirements and dynamic fields in [`packages/db/prisma/schema.prisma#L267`](/home/hartmut/Documents/Copilot/nexus/packages/db/prisma/schema.prisma#L267)
+- allocation metadata in [`packages/db/prisma/schema.prisma#L301`](/home/hartmut/Documents/Copilot/nexus/packages/db/prisma/schema.prisma#L301)

 The bigger modeling problem is that **`Allocation` currently represents both demand and assignment**:
+
 - placeholder demand is modeled with `resourceId = null`
 - headcount is stored on the same entity
 - legacy `role` text and `roleId` coexist
@@ -93,11 +103,13 @@ This is the wrong aggregate for v2.
 ## 4. Staffing logic is not yet trustworthy enough to become a differentiator

 `staffing.getSuggestions` currently:
+
 - loads all active resources with overlapping allocations
 - computes utilization in the router
- uses only Monday availability as the denominator in [`packages/api/src/router/staffing.ts#L45`](/home/hartmut/Documents/Copilot/capakraken/packages/api/src/router/staffing.ts#L45)
+- uses only Monday availability as the denominator in [`packages/api/src/router/staffing.ts#L45`](/home/hartmut/Documents/Copilot/nexus/packages/api/src/router/staffing.ts#L45)

 That means the suggestion layer is:
+
 - hard to scale
 - not consistent with calendar-aware engine logic
 - not a strong base for “AI-assisted staffing”
@@ -105,8 +117,9 @@ That means the suggestion layer is:
 ## 5. Routers are doing application-service work

 Representative examples:
- timeline queries and update workflows live directly in [`packages/api/src/router/timeline.ts#L12`](/home/hartmut/Documents/Copilot/capakraken/packages/api/src/router/timeline.ts#L12)
- allocation creation, placeholder fill, validation, vacation handling, cost calc, audit log, and event emission all live in [`packages/api/src/router/allocation.ts#L8`](/home/hartmut/Documents/Copilot/capakraken/packages/api/src/router/allocation.ts#L8)
+
+- timeline queries and update workflows live directly in [`packages/api/src/router/timeline.ts#L12`](/home/hartmut/Documents/Copilot/nexus/packages/api/src/router/timeline.ts#L12)
+- allocation creation, placeholder fill, validation, vacation handling, cost calc, audit log, and event emission all live in [`packages/api/src/router/allocation.ts#L8`](/home/hartmut/Documents/Copilot/nexus/packages/api/src/router/allocation.ts#L8)

 The pure `engine` package exists, but the application layer that should orchestrate it does not.

@@ -119,6 +132,7 @@ The pure `engine` package exists, but the application layer that should orchestr
 **V2 should be a modular monolith plus worker processes, not a microservice split.**

 Why:
+
 - the product is still changing fast
 - most failures are domain modeling and module-boundary problems, not network topology problems
 - a microservice split would increase operational cost before domain seams are stable
@@ -185,6 +199,7 @@ Replace the current overloaded `Allocation` concept with:
  - supports undo/redo and reasoning

 This removes:
+
 - nullable resource meaning two different business states
 - headcount logic from real assignments
 - placeholder branching across the whole codebase
@@ -192,6 +207,7 @@ This removes:
 ## 2. Normalize the skill model

 Today `Resource.skills` is JSONB. For v2, use:
+
 - `Skill`
 - `ResourceSkill`
 - optional `RoleSkillProfile`
@@ -199,6 +215,7 @@ Today `Resource.skills` is JSONB. For v2, use:
 Keep JSONB only for imported raw skill matrix payloads if needed.

 Benefits:
+
 - real filtering
 - better analytics
 - reusable recommendation features
@@ -207,12 +224,14 @@ Benefits:
 ## 3. Normalize calendar capacity

 Today availability is template-like JSON plus vacation overlays. For v2:
+
 - `AvailabilityTemplate`
 - `ResourceAvailabilityOverride`
 - `CalendarException`
 - `PublicHolidayCalendar`

 This lets the engine answer:
+
 - “what is capacity on this exact date?”
 - “why is this person unavailable?”
 - “what changed after a vacation approval?”
@@ -220,6 +239,7 @@ This lets the engine answer:
 ## 4. Keep blueprints, but narrow their role

 Blueprints should remain for:
+
 - custom fields
 - UI configuration
 - optional default demand templates
@@ -229,10 +249,12 @@ Blueprints should **not** continue to carry too much core planning state in JSON
 ## 5. Add an outbox

 Introduce:
+
 - `DomainEventOutbox`
 - `Job`

 Every important mutation writes:
+
 - domain row changes
 - audit row
 - outbox event
@@ -257,12 +279,14 @@ Every important user action should map to a use case, for example:
 - `GenerateAiSummary`

 Each use case should:
+
 - load aggregates via repositories
 - call pure domain policies
 - persist through a transaction
 - publish outbox events

 Routers then become simple wrappers:
+
 - validate input
 - call use case
 - map result to DTO
@@ -279,6 +303,7 @@ V2 should use a **CQRS-lite** pattern:
 - heavy timeline/dashboard/staffing reads use query services or read models

 Examples:
+
 - `timeline_read_model`
 - `resource_capacity_snapshot`
 - `project_budget_snapshot`
@@ -297,16 +322,19 @@ These are the v2 agents I would actually build. They should run as worker proces
 ## 1. Match Agent

 Input:
+
 - `DemandRequirementCreated`
 - `DemandRequirementChanged`
 - `ResourceSkillChanged`
 - `CalendarChanged`

 Output:
+
 - ranked candidate snapshots
 - recommendation explanations

 Responsibility:
+
 - candidate filtering
 - deterministic scoring
 - optional AI explanation layer after deterministic ranking
@@ -314,72 +342,87 @@ Responsibility:
 ## 2. Conflict Agent

 Input:
+
 - `AssignmentCreated`
 - `AssignmentChanged`
 - `VacationApproved`
 - `CalendarExceptionChanged`

 Output:
+
 - overallocation/conflict records
 - blocked-demand warnings

 Responsibility:
+
 - recompute exact day-level conflicts
 - explain why a conflict exists

 ## 3. Budget Risk Agent

 Input:
+
 - assignment changes
 - project budget changes
 - project date changes

 Output:
+
 - burn snapshots
 - over-budget warnings
 - forecast deltas

 Responsibility:
+
 - separate financial forecasting from request/response latency

 ## 4. Notification Agent

 Input:
+
 - all user-visible domain events

 Output:
+
 - in-app notifications
 - email sends
 - digest batches

 Responsibility:
+
 - centralize fan-out
 - remove notification logic from feature routers

 ## 5. Import Agent

 Input:
+
 - uploaded Excel/CSV/HRIS files

 Output:
+
 - staged import rows
 - validation results
 - normalized upserts

 Responsibility:
+
 - make imports resumable and auditable

 ## 6. AI Agent

 Input:
+
 - explicit AI jobs only

 Output:
+
 - summaries
 - staffing rationale
 - project risk narratives

 Responsibility:
+
 - all model interaction happens asynchronously
 - stores prompt/result metadata for traceability

@@ -394,11 +437,13 @@ If you want to execute v2 with parallel coding agents, use these lanes to avoid
 ## Agent A: Core Model Refactor

 Owns:
+
 - `packages/db`
 - `packages/shared`
 - new domain packages

 Tasks:
+
 - introduce `DemandRequirement`
 - introduce normalized skill/calendar models
 - add outbox and job tables
@@ -407,10 +452,12 @@ Tasks:
 ## Agent B: Application Service Extraction

 Owns:
+
 - `packages/application` new package
 - router-to-service extraction in `packages/api`

 Tasks:
+
 - move create/update/fill/approve workflows out of routers
 - standardize transaction boundaries
 - standardize audit + outbox emission
@@ -418,10 +465,12 @@ Tasks:
 ## Agent C: Timeline V2

 Owns:
+
 - `apps/web/src/components/timeline/*`
 - timeline read models and UI contracts

 Tasks:
+
 - break `TimelineView` into screen shell + view model + row renderers
 - move timeline state machine into dedicated hooks/store
 - consume new query DTOs instead of raw Prisma-shaped payloads
@@ -429,10 +478,12 @@ Tasks:
 ## Agent D: Project Creation And Staffing UX

 Owns:
+
 - `apps/web/src/components/projects/*`
 - staffing query DTO consumers

 Tasks:
+
 - split `ProjectWizard`
 - convert wizard from local mega-state to step reducers / use cases
 - integrate recommendation snapshots from Match Agent
@@ -440,12 +491,14 @@ Tasks:
 ## Agent E: Security, Platform, And Notifications

 Owns:
+
 - auth
 - user management
 - settings
 - notification workflows

 Tasks:
+
 - unify password hashing
 - close permission gaps
 - move secret handling behind infrastructure services
@@ -470,6 +523,7 @@ Do this before any architecture refactor:
 ## Phase 1: Extract The Application Layer

 Without changing the UI yet:
+
 - add use-case services
 - move router logic into them
 - introduce outbox writes
@@ -502,6 +556,7 @@ This phase creates the seam for the rest of v2.
 ## Phase 5: Optional Service Extraction

 Only after the domain seams hold:
+
 - extract workers into separate deployables if load justifies it
 - keep the transactional core close to the DB

@@ -530,4 +585,4 @@ It should be:
 - read models for planning screens
 - normalized planning entities with JSONB reserved for extension points

-That will make CapaKraken better at the thing it claims to be: a planning system, not just a CRUD app with a timeline.
+That will make Nexus better at the thing it claims to be: a planning system, not just a CRUD app with a timeline.