cameleer-server

Author	SHA1	Message	Date
hsiegeln	7677df33e5	ui(api): regen types + drop perExchangeLingerSeconds from SPA Follows backend removal of the field (Task 3.1). Typechecker confirms zero remaining references. The ExchangeMatchForm linger-input is visually removed in Task 4.4. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-22 17:40:43 +02:00
hsiegeln	98cbf8f3fc	refactor(search): drop dead SearchIndexer subsystem After the ExecutionController removal (`0f635576`), SearchIndexer subscribed to ExecutionUpdatedEvent but nothing publishes that event. Every SearchIndexerStats metric returned always-zero, and the admin /api/v1/admin/clickhouse/pipeline endpoint that surfaced those stats carried no signal. Backend removed: - core: SearchIndexer, SearchIndexerStats, ExecutionUpdatedEvent - app: IndexerPipelineResponse DTO, /pipeline endpoint on ClickHouseAdminController (field + ctor param) - StorageBeanConfig.searchIndexer bean UI removed: - IndexerPipeline type + useIndexerPipeline hook in api/queries/admin/clickhouse.ts - Indexer Pipeline card in ClickHouseAdminPage.tsx (plus ProgressBar import and pipeline* CSS classes) OpenAPI schema.d.ts + openapi.json regenerated (stale /pipeline path and IndexerPipelineResponse schema removed). SearchIndex interface + ClickHouseSearchIndex impl kept — those are live and used by SearchService + ExchangeMatchEvaluator. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-21 23:32:49 +02:00
hsiegeln	207ae246af	chore(ui): regenerate OpenAPI schema for alerts inbox redesign New endpoints visible to the SPA: DELETE /alerts/{id}, POST /alerts/{id}/restore, POST /alerts/bulk-delete, POST /alerts/bulk-ack. GET /alerts gains tri-state acked / read query params. AlertDto now includes readAt. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-21 18:58:26 +02:00
hsiegeln	414f7204bf	feat(alerting): AGENT_LIFECYCLE condition kind with per-subject fire mode Allows alert rules to fire on agent-lifecycle events — REGISTERED, RE_REGISTERED, DEREGISTERED, WENT_STALE, WENT_DEAD, RECOVERED — rather than only on current state. Each matching `(agent, eventType, timestamp)` becomes its own ackable AlertInstance, so outages on distinct agents are independently routable. Core: - New `ConditionKind.AGENT_LIFECYCLE` + `AgentLifecycleCondition` record (scope, eventTypes, withinSeconds). Compact ctor rejects empty eventTypes and withinSeconds<1. - Strict allowlist enum `AgentLifecycleEventType` (six entries matching the server-emitted types in `AgentRegistrationController` and `AgentLifecycleMonitor`). Custom agent-emitted event types tracked in backlog issue #145. - `AgentEventRepository.findInWindow(env, appSlug, agentId, eventTypes, from, to, limit)` — new read path ordered `(timestamp ASC, insert_id ASC)` used by the evaluator. Implemented on `ClickHouseAgentEventRepository` with tenant + env filter mandatory. App: - `AgentLifecycleEvaluator` queries events in the last `withinSeconds` window and returns `EvalResult.Batch` with one `Firing` per row. Every Firing carries a canonical `_subjectFingerprint` of `"<agentId>:<eventType>:<tsMillis>"` in context plus `agent` / `event` subtrees for Mustache templating. - `NotificationContextBuilder` gains an `AGENT_LIFECYCLE` branch that exposes `{{agent.id}}`, `{{agent.app}}`, `{{event.type}}`, `{{event.timestamp}}`, `{{event.detail}}`. - Validation is delegated to the record compact ctor + enum at Jackson deserialization time — matches the existing policy of keeping controller validators focused on env-scoped / SQL-injection concerns. Schema: - V16 migration generalises the V15 per-exchange discriminator on `alert_instances_open_rule_uq` to prefer `_subjectFingerprint` with a fallback to the legacy `exchange.id` expression. Scalar kinds still resolve to `''` and keep one-open-per-rule. Duplicate-key path in `PostgresAlertInstanceRepository.save` is unchanged — the index is the deduper. UI: - New `AgentLifecycleForm.tsx` wizard form with multi-select chips for the six allowed event types + `withinSeconds` input. Wired into `ConditionStep`, `form-state` (validation + defaults: WENT_DEAD, 300 s), and `enums.ts` options. Tests in `enums.test.ts` pin the new option array. - `alert-variables.ts` registers `{{agent.app}}`, `{{event.type}}`, `{{event.timestamp}}`, `{{event.detail}}` leaves for the new kind, and extends `agent.id`'s availability list to include `AGENT_LIFECYCLE`. Tests (all passing): - 5 new JSON-roundtrip cases on `AlertConditionJsonTest` (positive + empty/zero/unknown-type rejection). - 5 new evaluator unit tests on `AgentLifecycleEvaluatorTest` (empty window, multi-agent fingerprint shape, scope forwarding, missing env). - `NotificationContextBuilderTest` switch now covers the new kind. - 119 alerting unit tests + 71 UI tests green. Docs: `.claude/rules/{core,app,ui}` and CLAUDE.md migration list updated.	2026-04-21 14:52:08 +02:00
hsiegeln	f037d8c922	feat(alerting): server-side state+severity filters, ButtonGroup filter UI Backend: `GET /environments/{envSlug}/alerts` now accepts optional multi-value `state=…` and `severity=…` query params. Filters are pushed down to PostgresAlertInstanceRepository, which appends `AND state::text = ANY(?)` / `AND severity::text = ANY(?)` to the inbox query (null/empty = no filter). `AlertInstanceRepository.listForInbox` gained a 7-arg overload; the old 5-arg form is preserved as a default delegate so existing callers (evaluator, AlertingFullLifecycleIT, PostgresAlertInstanceRepositoryIT) compile unchanged. `InAppInboxQuery.listInbox` also has a new filtered overload. UI: InboxPage severity filter migrated from `SegmentedTabs` (single-select, no color cues) to `ButtonGroup` (multi-select with severity-coloured dots), matching the topnavbar status-filter pattern. `useAlerts` forwards the filters as query params and cache-keys on the filter tuple so each combo is independently cached. Unit + hook tests updated to the new contract (5 UI tests + 8 Java unit tests passing). OpenAPI types regenerated from the fresh local backend.	2026-04-21 12:47:31 +02:00
hsiegeln	09b49f096c	feat(alerting): per-severity breakdown on unread-count DTO Spec §13 calls for the notification bell to colour-code by highest unread severity (CRITICAL → error, WARNING → amber, INFO → muted). The old { count } DTO forced the UI to pick one static colour, so NotificationBell shipped with a TODO. Grow the contract instead: UnreadCountResponse = { total, bySeverity: { CRITICAL, WARNING, INFO } } Guarantees: - every severity is always present with a >=0 value (no undefined keys on the wire), so the UI can branch without defaults. - total = sum of bySeverity values — kept explicit on the wire for cheap top-line display, not recomputed client-side. Backend - AlertInstanceRepository: replaces countUnreadForUser(long) with countUnreadBySeverityForUser returning Map<AlertSeverity, Long>. One SQL round-trip per (env, user) — GROUP BY ai.severity over the same NOT EXISTS(alert_reads) filter. - UnreadCountResponse.from(Map) normalises and defensively copies; missing severities default to 0. - InAppInboxQuery.countUnread now returns the DTO, caches the full response (still 5s TTL) so severity breakdown gets the same hit-rate as the total did before. - AlertController just hands the DTO back. Breaking change — no backwards-compat shim: the `count` field is gone. UI and tests updated in the same commit; there are no other API consumers in the tree. Frontend - Regenerated openapi.json + schema.d.ts against a fresh build of the new backend. - NotificationBell branches badge colour on the highest unread severity (CRITICAL > WARNING > INFO) via new CSS variants. - Tests cover all four paths: zero, critical-present, warning-only, info-only. Tests: 7 unit tests + 12 ITs (incl. new grouping + empty-map) + 49 vitest (was 46; +3 severity-branch assertions). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-20 18:15:56 +02:00
hsiegeln	39a134a0db	chore(ui): regenerate openapi.json + schema.d.ts from deployed Plan 02 backend All checks were successful CI / cleanup-branch (pull_request) Has been skipped Details CI / build (pull_request) Successful in 2m37s Details CI / docker (pull_request) Has been skipped Details CI / deploy (pull_request) Has been skipped Details CI / deploy-feature (pull_request) Has been skipped Details Fetched from http://192.168.50.86:30090/api/v1/api-docs via `npm run generate-api:live`. Adds TypeScript types for the new alerting REST surface merged in #140: - 15 alerting paths under /environments/{envSlug}/alerts/** (rules CRUD, enable/disable, render-preview, test-evaluate, inbox, unread-count, ack/read/bulk-read, silences CRUD, per-alert notifications) - 1 flat notification retry path /alerts/notifications/{id}/retry - 4 outbound-connection admin paths (from Plan 01 #139) Verified tsc -p tsconfig.app.json --noEmit exits 0 — no existing SPA call sites break against the fresh types. Plan 03 UI work can consume these directly. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-20 10:56:43 +02:00
hsiegeln	bfb5a7a895	chore: regenerate openapi.json + schema.d.ts Captures the cursor-paginated /agents/events response shape (AgentEventPageResponse with data/nextCursor/hasMore and a new ?cursor param). Also folds in pre-existing drift from `62dd71b` (environment field on agent event rows). Consumer UI hooks are updated in Tasks 9-11. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-17 12:39:03 +02:00
hsiegeln	59de424ab9	chore: regenerate openapi.json + schema.d.ts from deployed server Fetched from http://192.168.50.86:30090/api/v1/api-docs (running origin/main through `b7a107d` — full P3B/P3C env-scoping migration live there). SPA TS types now match the env-scoped URL shape used at runtime: - /environments/{envSlug}/... for data, config, search, logs, routes, agents - /agents/config (agent-authoritative) - /admin/environments/{envSlug}/... (env CRUD) Note: ExecutionDetail.environment isn't in the regenerated schema yet — commit `d02fa73` (local, not yet pushed/deployed) adds that backend field. The local type extension in ui/src/components/ExecutionDiagram/types.ts covers the gap until the next redeploy + regen. UI typecheck (tsc -p tsconfig.app.json --noEmit) passes. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-17 10:24:24 +02:00
hsiegeln	4b264b3308	feat: add CPU usage to agent response and compact cards Backend: - Add cpuUsage field to AgentInstanceResponse (-1 if unavailable) - Add queryAgentCpuUsage() to AgentRegistrationController — queries avg CPU per instance from agent_metrics over last 2 minutes - Wire CPU into agent list response via withCpuUsage() Frontend: - Add cpuUsage to schema.d.ts - Compute maxCpu per AppGroup (max across all instances) - Show "X% cpu" on compact cards when available (hidden when -1) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 14:12:23 +02:00
hsiegeln	0827fd21e3	feat: persist and display exchange properties from agent All checks were successful CI / cleanup-branch (push) Has been skipped Details CI / build (push) Successful in 1m59s Details CI / docker (push) Successful in 2m13s Details CI / deploy (push) Successful in 58s Details CI / deploy-feature (push) Has been skipped Details Add support for exchange properties sent by the agent alongside headers. Properties flow through the same pipeline as headers: ClickHouse columns (input_properties, output_properties) on both executions and processor_executions tables, MergedExecution record, ChunkAccumulator extraction, DetailService snapshot, and REST API response. UI adds a Properties tab next to Headers in the process diagram detail panel, with the same input/output split table layout. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-14 14:23:53 +02:00
hsiegeln	d4b530ff8a	refactor: remove PKCE from OIDC flow (confidential client) All checks were successful CI / cleanup-branch (push) Has been skipped Details CI / build (push) Successful in 1m16s Details CI / docker (push) Successful in 1m2s Details CI / deploy-feature (push) Has been skipped Details CI / deploy (push) Successful in 37s Details Backend holds client_secret and does the token exchange server-side, making PKCE redundant. Removes code_verifier/code_challenge from all frontend auth paths and backend exchange method. Eliminates the source of "grant request is invalid" errors from verifier mismatches. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 10:22:13 +02:00
hsiegeln	03ff9a3813	feat: generic OIDC role extraction from access token All checks were successful CI / cleanup-branch (push) Has been skipped Details CI / build (push) Successful in 1m48s Details CI / docker (push) Successful in 1m1s Details CI / deploy-feature (push) Has been skipped Details CI / deploy (push) Successful in 38s Details The OIDC login flow now reads roles from the access_token (JWT) in addition to the id_token. This fixes role extraction with providers like Logto that put scopes/roles in access tokens rather than id_tokens. - Add audience and additionalScopes to OidcConfig for RFC 8707 resource indicator support and configurable extra scopes - OidcTokenExchanger decodes access_token with at+jwt-compatible processor, falls back to id_token if access_token is opaque or has no roles - syncOidcRoles preserves existing local roles when OIDC returns none - SPA includes resource and additionalScopes in authorization requests - Admin UI exposes new config fields Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 10:16:52 +02:00
hsiegeln	c502a42f17	refactor: architecture cleanup — OIDC dedup, PKCE, K8s hardening Some checks failed CI / cleanup-branch (push) Has been skipped Details CI / build (push) Successful in 1m6s Details CI / docker (push) Successful in 59s Details CI / deploy-feature (push) Has been skipped Details CI / deploy (push) Failing after 2m59s Details - Extract OidcProviderHelper for shared discovery + JWK source construction - Add SystemRole.normalizeScope() to centralize role normalization - Merge duplicate claim extraction in OidcTokenExchanger - Add PKCE (S256) to OIDC authorization flow (frontend + backend) - Add SecurityContext (runAsNonRoot) to all K8s deployments - Fix postgres probe to use $POSTGRES_USER instead of hardcoded username - Remove default credentials from Dockerfile - Extract sanitize_branch() to shared .gitea/sanitize-branch.sh - Fix sidebar to use /exchanges/ paths directly, remove legacy redirects - Centralize basePath computation in router.tsx via config module Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-06 21:57:29 +02:00
hsiegeln	0c77f8d594	feat: add User ID Claim field to OIDC admin config UI Some checks failed CI / cleanup-branch (push) Has been skipped Details CI / build (push) Successful in 1m5s Details CI / deploy (push) Has been cancelled Details CI / deploy-feature (push) Has been cancelled Details CI / docker (push) Has been cancelled Details New input in the Claim Mapping section lets admins configure which id_token claim is used as the unique user identifier (default: sub). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-06 10:19:38 +02:00
hsiegeln	69055f7d74	fix: persist environment selection in Zustand store instead of URL params All checks were successful CI / cleanup-branch (push) Has been skipped Details CI / build (push) Successful in 1m5s Details CI / docker (push) Successful in 57s Details CI / deploy-feature (push) Has been skipped Details CI / deploy (push) Successful in 36s Details Environment selector was losing its value on navigation because URL search params were silently dropped by navigate() calls. Moved to a Zustand store with localStorage persistence so the selection survives navigation, page refresh, and new tabs. Switching environment now resets all filters, clears URL params, invalidates queries, and remounts pages via Outlet key. Also syncs openapi.json schema with running backend. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 17:12:16 +02:00
hsiegeln	694d0eef59	feat: add environment filtering across all APIs and UI Some checks failed CI / cleanup-branch (push) Has been skipped Details CI / build (push) Successful in 1m8s Details CI / deploy (push) Has been cancelled Details CI / deploy-feature (push) Has been cancelled Details CI / docker (push) Has been cancelled Details Backend: Added optional `environment` query parameter to catalog, search, stats, timeseries, punchcard, top-errors, logs, and agents endpoints. ClickHouse queries filter by environment when specified (literal SQL for AggregatingMergeTree, ? binds for raw tables). StatsStore interface methods all accept environment parameter. UI: Added EnvironmentSelector component (compact native select). LayoutShell extracts distinct environments from agent data and passes selected environment to catalog and agent queries via URL search param (?env=). TopBar shows current environment label. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 15:42:26 +02:00
hsiegeln	e26266532a	fix: regenerate OpenAPI types, fix search scoping by applicationId All checks were successful CI / cleanup-branch (push) Has been skipped Details CI / build (push) Successful in 1m10s Details CI / docker (push) Successful in 59s Details CI / deploy-feature (push) Has been skipped Details CI / deploy (push) Successful in 40s Details The identity rename (application→applicationId) broke search filtering because the stale schema.d.ts still had 'application' as the field name. The backend silently ignored the unknown field, returning unfiltered results. - Regenerate openapi.json and schema.d.ts from live backend - Fix Dashboard: application→applicationId in search request - Fix RouteDetail: application→applicationId in search request (2 places) - LayoutShell: scope command palette search by appId/routeId - LayoutShell: pass sidebarReveal state on sidebar click navigation Note for DS team: the Sidebar selectedPath logic (line 5451 in dist) has a hardcoded pathname.startsWith("/exchanges/") guard. This should be broadened to simply `S ? S : $.pathname` so sidebarReveal works on all tabs (dashboard, runtime), not just exchanges. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-01 20:55:19 +02:00
hsiegeln	a028905e41	fix: update agent field names in frontend to match backend DTO All checks were successful CI / cleanup-branch (push) Has been skipped Details CI / build (push) Successful in 1m6s Details CI / docker (push) Successful in 57s Details CI / deploy-feature (push) Has been skipped Details CI / deploy (push) Successful in 37s Details The AgentInstanceResponse backend DTO uses instanceId, displayName, applicationId, status — but the stale schema.d.ts still had id, name, application, state. This caused the runtime table to show no data. - Update schema.d.ts AgentInstanceResponse fields - Fix AgentHealth: row.id→instanceId, row.name→displayName, row.application→applicationId, inst.id→instanceId - Fix AgentInstance: agent.id→instanceId, agent.name→displayName - Fix ExchangeHeader: agent.id→instanceId, agent.state→status - Fix LayoutShell search: agent.state→status, agentTps→tps Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-01 20:36:31 +02:00
hsiegeln	188810e54b	feat: remove TimescaleDB, dead PG stores, and storage feature flags Some checks failed CI / cleanup-branch (push) Has been skipped Details CI / build (push) Failing after 32s Details CI / docker (push) Has been skipped Details CI / deploy (push) Has been skipped Details CI / deploy-feature (push) Has been skipped Details Complete the ClickHouse migration by removing all PostgreSQL analytics code. PostgreSQL now serves only RBAC, config, and audit — all observability data is exclusively in ClickHouse. - Delete 6 dead PostgreSQL store classes (executions, stats, diagrams, events, metrics, metrics-query) and 2 integration tests - Delete RetentionScheduler (ClickHouse TTL handles retention) - Remove all 7 cameleer.storage.* feature flags from application.yml - Remove all @ConditionalOnProperty from ClickHouse beans in StorageBeanConfig - Consolidate 14 Flyway migrations (V1-V14) into single clean V1 with only RBAC/config/audit tables (no TimescaleDB, no analytics tables) - Switch from timescale/timescaledb-ha:pg16 to postgres:16 everywhere (docker-compose, deploy/postgres.yaml, test containers) - Remove TimescaleDB check and /metrics-pipeline from DatabaseAdminController - Set clickhouse.enabled default to true Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-01 20:10:58 +02:00
hsiegeln	4cdbcdaeea	fix: update frontend field names for identity rename (applicationId, instanceId) Some checks failed CI / cleanup-branch (push) Has been skipped Details CI / build (push) Failing after 32s Details CI / docker (push) Has been skipped Details CI / deploy (push) Has been skipped Details CI / deploy-feature (push) Has been skipped Details The backend identity rename (applicationName → applicationId, agentId → instanceId) was not reflected in the frontend. This caused drilldown to fail (detail.applicationName was undefined, disabling the diagram fetch) and various display issues. Updated schema.d.ts, ExchangeHeader, ExecutionDiagram, Dashboard, AgentHealth, AgentInstance, LayoutShell, LogTab, InfoTab, DetailPanel, ExchangesPage, and tracing-store. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-01 18:22:16 +02:00
hsiegeln	cf439248b5	feat: expose iteration/iterationSize fields for diagram overlay All checks were successful CI / cleanup-branch (push) Has been skipped Details CI / build (push) Successful in 1m12s Details CI / docker (push) Successful in 1m5s Details CI / deploy-feature (push) Has been skipped Details CI / deploy (push) Successful in 52s Details Replace synthetic wrapper node approach with direct iteration fields: - ProcessorNode gains iteration (child's index) and iterationSize (container's total) fields, populated from ClickHouse flat records - Frontend hooks detect iteration containers from iterationSize != null instead of scanning for wrapper processorTypes - useExecutionOverlay filters children by iteration field instead of wrapper nodes, eliminating ITERATION_WRAPPER_TYPES entirely - Cleaner data contract: API returns exactly what the DB stores Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-01 17:14:36 +02:00
hsiegeln	ab7031e6ed	feat: add is_replay flag to execution pipeline and UI Detect replayed exchanges via X-Cameleer-Replay header during ingestion, persist the flag through PostgreSQL and OpenSearch, and surface it in the dashboard (amber replay icon) and exchange detail chain view. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-31 14:39:40 +02:00
hsiegeln	a517785050	chore: regenerate OpenAPI types and remove type assertion hacks All checks were successful CI / cleanup-branch (push) Has been skipped Details CI / build (push) Successful in 1m2s Details CI / docker (push) Successful in 56s Details CI / deploy-feature (push) Has been skipped Details CI / deploy (push) Successful in 39s Details Regenerated schema.d.ts from live backend — now includes slaCompliance on ExecutionStats/RouteMetrics, filterMatched/duplicateMessage on ProcessorNode, and all new dashboard endpoints (timeseries/by-app, timeseries/by-route, punchcard, errors/top, app-settings). Removed Record<string, unknown> casts that were working around the stale schema. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-30 15:36:44 +02:00
hsiegeln	3d71345181	feat: trace data indicators, inline tap config, and detail tab gating All checks were successful CI / build (push) Successful in 1m46s Details CI / cleanup-branch (push) Has been skipped Details CI / docker (push) Successful in 1m25s Details CI / deploy-feature (push) Has been skipped Details CI / deploy (push) Successful in 1m57s Details Trace data visibility: - ProcessorNode now includes hasTraceData flag computed from captured body/headers during tree conversion - ConfigBadge shows teal for tracing configured, green when data captured - Search results show green footprints icon for exchanges with trace data - New has_trace_data column on executions table (V11 migration with backfill) - OpenSearch documents and ExecutionSummary include the flag Inline tap configuration: - Extracted reusable TapConfigModal component from RouteDetail - Diagram context menu opens tap modal inline instead of navigating away - Toggle-trace action works immediately with toast feedback - Modal closes only on ESC, Cancel, Save, or Delete (not backdrop click) Detail panel tab gating: - Headers, Input, Output tabs disabled when no data is available - Works at both exchange and processor level - Falls back to Info tab when active tab becomes empty Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-29 13:08:58 +02:00
hsiegeln	30344d29b1	feat: store raw processor tree JSON and add error categorization fields All checks were successful CI / cleanup-branch (push) Has been skipped Details CI / build (push) Successful in 1m0s Details CI / docker (push) Successful in 53s Details CI / deploy-feature (push) Has been skipped Details CI / deploy (push) Successful in 37s Details Fixes iteration overlay corruption caused by flat storage collapsing duplicate processorIds across loop iterations. Server: - Store raw processor tree as processors_json JSONB on executions table - Detail endpoint serves from processors_json (faithful tree), falls back to flat record reconstruction for older executions - V10 migration: processors_json, error categorization (errorType, errorCategory, rootCauseType, rootCauseMessage), OTel (traceId, spanId), circuit breaker (circuitBreakerState, fallbackTriggered), drops erroneous splitDepth/loopDepth columns - Add all new fields through full ingestion/storage/API chain UI: - Fix overlay wrapper filtering: check wrapper type before status filter - Add new fields to schema.d.ts Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-28 21:44:54 +01:00
hsiegeln	c4b396e618	feat: persist and expose resolvedEndpointUri for execution-level drill-down Wire resolvedEndpointUri through the full chain: - V9 migration adds resolved_endpoint_uri column - IngestionService extracts from ProcessorExecution - PostgresExecutionStore persists and reads the column - ProcessorNode includes field in detail API response - UI schema updated for ProcessorNode and PositionedNode Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-28 18:37:11 +01:00
hsiegeln	3928743ea7	feat: update OpenAPI spec and TypeScript types for execution overlay Add iteration fields (loopIndex, loopSize, splitIndex, splitSize, multicastIndex) to ProcessorNode schema. Add new endpoint path /executions/{executionId}/processors/by-id/{processorId}/snapshot. Remove stale diagramNodeId field that was dropped in V6 migration. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-27 18:38:09 +01:00
hsiegeln	ac32396a57	feat: add interactive ProcessDiagram SVG component (sub-project 1/3) All checks were successful CI / cleanup-branch (push) Has been skipped Details CI / build (push) Successful in 1m0s Details CI / docker (push) Successful in 56s Details CI / deploy-feature (push) Has been skipped Details CI / deploy (push) Successful in 38s Details New interactive route diagram component with SVG rendering using server-computed ELK layout coordinates. TIBCO BW5-inspired top-bar card node style with zoom/pan, hover toolbars, config badges, and error handler sections below the main flow. Backend: add direction query parameter (LR/TB) to diagram render endpoints, defaulting to left-to-right layout. Frontend: 14-file ProcessDiagram component in ui/src/components/ with DiagramNode, CompoundNode, DiagramEdge, ConfigBadge, NodeToolbar, ErrorSection, ZoomControls, and supporting hooks. Dev test page at /dev/diagram for validation. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-27 13:55:29 +01:00
hsiegeln	3b31e69ae4	chore: regenerate openapi.json and schema.d.ts from live server All checks were successful CI / cleanup-branch (push) Has been skipped Details CI / build (push) Successful in 54s Details CI / docker (push) Successful in 48s Details CI / deploy-feature (push) Has been skipped Details CI / deploy (push) Successful in 36s Details Updated types now include attributes on ExecutionDetail, ProcessorNode, and ExecutionSummary from the actual API. Removed stale detail.children fallback that no longer exists in the schema. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-26 19:22:55 +01:00
hsiegeln	ae1ee38441	feat: add attributes fields to schema.d.ts types Add optional `attributes?: Record<string, string>` to ExecutionSummary, ExecutionDetail, and ProcessorNode in the manually-maintained OpenAPI schema to reflect the new backend attributes support. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-26 18:29:47 +01:00
hsiegeln	6a24dd01e9	fix: add exchange body fields to schema.d.ts for CI tsc check All checks were successful CI / cleanup-branch (push) Has been skipped Details CI / build (push) Successful in 54s Details CI / docker (push) Successful in 9s Details CI / deploy (push) Successful in 19s Details CI / deploy-feature (push) Has been skipped Details The CI build runs tsc --noEmit which failed because the ExecutionDetail type in schema.d.ts was missing the new inputBody/outputBody/inputHeaders/ outputHeaders fields added to the backend DTO. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-25 22:06:26 +01:00
hsiegeln	ff76751629	refactor: rename agent group→application across entire codebase All checks were successful CI / build (push) Successful in 1m22s Details CI / cleanup-branch (push) Has been skipped Details CI / docker (push) Successful in 52s Details CI / deploy (push) Successful in 39s Details CI / deploy-feature (push) Has been skipped Details Complete the group→application terminology rename in the agent registry subsystem: - AgentInfo: field group → application, all wither methods updated - AgentRegistryService: findByGroup → findByApplication - AgentInstanceResponse: field group → application (API response) - AgentRegistrationRequest: field group → application (API request) - JwtServiceImpl: parameter names group → application (JWT claim string "group" preserved for token backward compatibility) - All controllers, lifecycle monitor, command controller updated - Integration tests: JSON request bodies "group" → "application" - Frontend: schema.d.ts, openapi.json, agent queries, AgentHealth RBAC group references (groups table, GroupAdminController, etc.) are NOT affected — they are a separate domain concept. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-24 08:48:12 +01:00
hsiegeln	8ad0016a8e	refactor: rename group/groupName to application/applicationName Some checks failed CI / build (push) Failing after 40s Details CI / cleanup-branch (push) Has been skipped Details CI / docker (push) Has been skipped Details CI / deploy (push) Has been skipped Details CI / deploy-feature (push) Has been skipped Details The execution-related "group" concept actually represents the application name. Rename all Java fields, API parameters, and frontend types from groupName→applicationName and group→application for clarity. - Java records: ExecutionSummary, ExecutionDetail, ExecutionDocument, ExecutionRecord, ProcessorRecord - API params: SearchRequest.group→application, SearchController @RequestParam group→application - Services: IngestionService, DetailService, SearchIndexer, StatsStore - Frontend: schema.d.ts, Dashboard, ExchangeDetail, RouteDetail, executions query hooks Database column names (group_name) and OpenSearch field names are unchanged — only the API-facing Java/TS field names are renamed. RBAC group references (groups table, GroupRepository, GroupsTab) are a separate domain concept and are NOT affected by this change. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-23 21:21:38 +01:00
hsiegeln	c8c62a98bb	fix: add groupName to ExecutionSummary in schema.d.ts Some checks failed CI / build (push) Successful in 1m12s Details CI / cleanup-branch (push) Has been skipped Details CI / docker (push) Successful in 1m10s Details CI / deploy (push) Failing after 2m19s Details CI / deploy-feature (push) Has been skipped Details The Java record was updated but the OpenAPI schema was not regenerated, causing a TypeScript build error in CI. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-23 21:03:45 +01:00
hsiegeln	313d871948	chore: update design system to v0.0.2, regenerate schema.d.ts Bumped @cameleer/design-system from ^0.0.1 to ^0.0.2 (adds onLogout prop to TopBar). Fetched openapi.json from remote backend, stripped /api/v1 prefix, patched ExecutionDetail with groupName and children fields to match UI expectations, then regenerated schema.d.ts via openapi-typescript. TypeScript compiles clean. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 18:16:15 +01:00
hsiegeln	f2744e3094	fix: correct response field mappings and add logout button All checks were successful CI / build (push) Successful in 1m28s Details CI / cleanup-branch (push) Has been skipped Details CI / docker (push) Successful in 50s Details CI / deploy (push) Successful in 38s Details CI / deploy-feature (push) Has been skipped Details - SearchResult uses 'data' not 'items', 'total' not 'totalCount' - ExecutionStats uses 'p99LatencyMs' not 'p99DurationMs' - TimeseriesBucket uses 'time' not 'timestamp' - Add user Dropdown with logout action to LayoutShell Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 18:06:49 +01:00
hsiegeln	ea5b5a685d	fix: correct SearchRequest field names (offset/limit, sortField/sortDir) All checks were successful CI / build (push) Successful in 1m19s Details CI / cleanup-branch (push) Has been skipped Details CI / docker (push) Successful in 50s Details CI / deploy (push) Successful in 40s Details CI / deploy-feature (push) Has been skipped Details Dashboard was sending page/size but backend expects offset/limit. Schema also had sort/order instead of sortField/sortDir. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 17:55:27 +01:00
hsiegeln	2b111c603c	feat: migrate UI to @cameleer/design-system, add backend endpoints Some checks failed CI / build (push) Failing after 47s Details CI / cleanup-branch (push) Has been skipped Details CI / docker (push) Has been skipped Details CI / deploy (push) Has been skipped Details CI / deploy-feature (push) Has been skipped Details Backend: - Add agent_events table (V5) and lifecycle event recording - Add route catalog endpoint (GET /routes/catalog) - Add route metrics endpoint (GET /routes/metrics) - Add agent events endpoint (GET /agents/events-log) - Enrich AgentInstanceResponse with tps, errorRate, activeRoutes, uptimeSeconds - Add TimescaleDB retention/compression policies (V6) Frontend: - Replace custom Mission Control UI with @cameleer/design-system components - Rebuild all pages: Dashboard, ExchangeDetail, RoutesMetrics, AgentHealth, AgentInstance, RBAC, AuditLog, OIDC, DatabaseAdmin, OpenSearchAdmin, Swagger - New LayoutShell with design system AppShell, Sidebar, TopBar, CommandPalette - Consume design system from Gitea npm registry (@cameleer/design-system@0.0.1) - Add .npmrc for scoped registry, update Dockerfile with REGISTRY_TOKEN arg CI: - Pass REGISTRY_TOKEN build-arg to UI Docker build step Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 17:38:39 +01:00
hsiegeln	17ef48e392	fix: return rotated refresh token from agent token refresh endpoint All checks were successful CI / build (push) Successful in 1m22s Details CI / cleanup-branch (push) Has been skipped Details CI / docker (push) Successful in 56s Details CI / deploy (push) Successful in 47s Details CI / deploy-feature (push) Has been skipped Details Previously the refresh endpoint only returned a new accessToken, causing agents to lose their refreshToken after the first refresh cycle and forcing a full re-registration every ~2 hours. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-18 16:44:16 +01:00
hsiegeln	708aae720c	chore: regenerate OpenAPI spec and TypeScript types for RBAC endpoints All checks were successful CI / build (push) Successful in 1m11s Details CI / cleanup-branch (push) Has been skipped Details CI / docker (push) Successful in 51s Details CI / deploy (push) Has been skipped Details CI / deploy-feature (push) Successful in 36s Details Remove dead UserInfo type export, patch PositionedNode.children. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-17 18:11:10 +01:00
hsiegeln	4bc48afbf8	chore: regenerate OpenAPI spec and TypeScript types for admin endpoints All checks were successful CI / build (push) Successful in 1m11s Details CI / cleanup-branch (push) Has been skipped Details CI / docker (push) Successful in 52s Details CI / deploy (push) Has been skipped Details CI / deploy-feature (push) Successful in 37s Details CI / build (pull_request) Successful in 1m9s Details CI / cleanup-branch (pull_request) Has been skipped Details CI / docker (pull_request) Has been skipped Details CI / deploy (pull_request) Has been skipped Details CI / deploy-feature (pull_request) Has been skipped Details Downloaded from deployed feature branch server. Patched PositionedNode to include children field (missing from server-generated spec). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-17 16:37:43 +01:00
hsiegeln	7778793e7b	Add route diagram page with execution overlay and group-aware APIs All checks were successful CI / build (push) Successful in 1m10s Details CI / docker (push) Successful in 1m3s Details CI / deploy (push) Successful in 31s Details Backend: Add group filtering to agent list, search, stats, and timeseries endpoints. Add diagram lookup by group+routeId. Resolve application group to agent IDs server-side for ClickHouse IN-clause queries. Frontend: New route detail page at /apps/{group}/routes/{routeId} with three tabs (Diagram, Performance, Processor Tree). SVG diagram rendering with panzoom, execution overlay (glow effects, duration/sequence badges, flow particles, minimap), and processor detail panel. uPlot charts for performance tab replacing old SVG sparklines. Ctrl+Click from ExecutionExplorer navigates to route diagram with overlay. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-14 21:35:42 +01:00
hsiegeln	b64edaa16f	Server-side sorting for execution search results All checks were successful CI / build (push) Successful in 1m12s Details CI / docker (push) Successful in 50s Details CI / deploy (push) Successful in 33s Details Sorting now applies to the entire result set via ClickHouse ORDER BY instead of only sorting the current page client-side. Default sort order is timestamp descending. Supported sort columns: startTime, status, agentId, routeId, correlationId, durationMs. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-14 19:34:22 +01:00
hsiegeln	a6f94e8a70	Full OIDC logout with id_token_hint for provider session termination Some checks failed CI / build (push) Successful in 1m10s Details CI / docker (push) Successful in 48s Details CI / deploy (push) Has been cancelled Details Return the OIDC id_token in the callback response so the frontend can store it and pass it as id_token_hint to the provider's end-session endpoint on logout. This lets Authentik (or any OIDC provider) honor the post_logout_redirect_uri and redirect back to the Cameleer login page instead of showing the provider's own logout page. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-14 16:14:07 +01:00
hsiegeln	463cab1196	Add displayName to auth response and configurable display name claim for OIDC Some checks failed CI / build (push) Successful in 1m11s Details CI / docker (push) Successful in 49s Details CI / deploy (push) Failing after 2m9s Details - Add displayName field to AuthTokenResponse so the UI shows human-readable names instead of internal JWT subjects (e.g. user:oidc:<hash>) - Add displayNameClaim to OIDC config (default: "name") allowing admins to configure which ID token claim contains the user's display name - Support dot-separated claim paths (e.g. profile.display_name) like rolesClaim - Add admin UI field for Display Name Claim on the OIDC config page - ClickHouse migration: ALTER TABLE adds display_name_claim column Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-14 16:09:24 +01:00
hsiegeln	465f210aee	Contract-first API with DTOs, validation, and server-side OpenAPI post-processing All checks were successful CI / build (push) Successful in 1m27s Details CI / docker (push) Successful in 2m6s Details CI / deploy (push) Successful in 30s Details Add dedicated request/response DTOs for all controllers, replacing raw JsonNode parameters with validated types. Move OpenAPI path-prefix stripping and ProcessorNode children injection into OpenApiCustomizer beans so the spec served at /api/v1/api-docs is already clean — eliminating the need for the ui/scripts/process-openapi.mjs post-processing script. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-14 15:33:37 +01:00
hsiegeln	50bb22d6f6	Add OIDC logout, fix OpenAPI schema types, expose end_session_endpoint All checks were successful CI / build (push) Successful in 1m8s Details CI / docker (push) Successful in 51s Details CI / deploy (push) Successful in 29s Details Backend: - Expose end_session_endpoint from OIDC provider metadata in /auth/oidc/config - Add getEndSessionEndpoint() to OidcTokenExchanger Frontend: - On OIDC logout, redirect to provider's end_session_endpoint to clear SSO session - Strip /api/v1 prefix from OpenAPI paths to match client baseUrl convention - Add schema-types.ts with convenience type re-exports from generated schema - Fix all type imports to use schema-types instead of raw generated schema - Fix optional field access (processors, children, duration) with proper typing - Fix AgentInstance.state → status field name Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-14 14:43:18 +01:00
hsiegeln	103b14d1df	Regenerate OpenAPI spec and TypeScript types from live server Some checks failed CI / build (push) Failing after 39s Details CI / docker (push) Has been skipped Details CI / deploy (push) Has been skipped Details Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-14 14:24:33 +01:00
hsiegeln	3641dffecc	Add comparison stats: failure rate %, vs-yesterday change, today total All checks were successful CI / build (push) Successful in 1m11s Details CI / docker (push) Successful in 48s Details CI / deploy (push) Successful in 37s Details Stats endpoint now returns current + previous period (24h shift) values plus today's total count. UI shows: - Total Matches: "of 12.3K today" - Avg Duration: arrow + % vs yesterday - Failure Rate: percentage of errors vs total, arrow + % vs yesterday - P99 Latency: arrow + % vs yesterday - In-Flight: unchanged (running executions) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-14 09:29:14 +01:00

1 2

59 Commits