cameleer-server

Author	SHA1	Message	Date
hsiegeln	188810e54b	feat: remove TimescaleDB, dead PG stores, and storage feature flags Some checks failed CI / cleanup-branch (push) Has been skipped Details CI / build (push) Failing after 32s Details CI / docker (push) Has been skipped Details CI / deploy (push) Has been skipped Details CI / deploy-feature (push) Has been skipped Details Complete the ClickHouse migration by removing all PostgreSQL analytics code. PostgreSQL now serves only RBAC, config, and audit — all observability data is exclusively in ClickHouse. - Delete 6 dead PostgreSQL store classes (executions, stats, diagrams, events, metrics, metrics-query) and 2 integration tests - Delete RetentionScheduler (ClickHouse TTL handles retention) - Remove all 7 cameleer.storage.* feature flags from application.yml - Remove all @ConditionalOnProperty from ClickHouse beans in StorageBeanConfig - Consolidate 14 Flyway migrations (V1-V14) into single clean V1 with only RBAC/config/audit tables (no TimescaleDB, no analytics tables) - Switch from timescale/timescaledb-ha:pg16 to postgres:16 everywhere (docker-compose, deploy/postgres.yaml, test containers) - Remove TimescaleDB check and /metrics-pipeline from DatabaseAdminController - Set clickhouse.enabled default to true Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-01 20:10:58 +02:00
hsiegeln	aa2d203f4e	feat: add UI usage analytics tracking All checks were successful CI / cleanup-branch (push) Has been skipped Details CI / build (push) Successful in 1m9s Details CI / docker (push) Successful in 1m14s Details CI / deploy (push) Successful in 46s Details CI / deploy-feature (push) Has been skipped Details Tracks authenticated UI user requests to understand usage patterns: - New ClickHouse usage_events table with 90-day TTL - UsageTrackingInterceptor captures method, path, duration, user - Path normalization groups dynamic segments ({id}, {hash}) - Buffered writes via WriteBuffer + periodic flush - Admin endpoint GET /api/v1/admin/usage with groupBy=endpoint\|user\|hour - Skips agent requests, health checks, and data ingestion Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-01 17:53:32 +02:00
hsiegeln	d739094a56	fix: update ClickHouse DDL files with new column names instead of ALTER RENAME All checks were successful CI / cleanup-branch (push) Has been skipped Details CI / build (push) Successful in 1m9s Details CI / docker (push) Successful in 45s Details CI / deploy-feature (push) Has been skipped Details CI / deploy (push) Successful in 43s Details ClickHouse can't rename columns that are part of ORDER BY keys. Updated V1-V8 DDL files directly with new column names (instance_id, application_id) and removed V9 migration. Wipe ClickHouse and restart. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-01 12:40:54 +02:00
hsiegeln	91400defe9	fix: add missing V9 (ClickHouse) and V14 (PostgreSQL) identity column rename migrations All checks were successful CI / cleanup-branch (push) Has been skipped Details CI / build (push) Successful in 1m7s Details CI / docker (push) Successful in 45s Details CI / deploy-feature (push) Has been skipped Details CI / deploy (push) Successful in 39s Details Migration files were lost during worktree merge — recreated. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-01 12:33:02 +02:00
hsiegeln	f7daadaaa9	feat(clickhouse): add DDL for route_diagrams, agent_events, and logs tables Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-31 23:30:38 +02:00
hsiegeln	606f81a970	fix: align server with protocol v2 chunked transport spec All checks were successful CI / cleanup-branch (push) Has been skipped Details CI / build (push) Successful in 1m45s Details CI / docker (push) Successful in 59s Details CI / deploy-feature (push) Has been skipped Details CI / deploy (push) Successful in 46s Details - ChunkIngestionController: /data/chunks → /data/executions (matches PROTOCOL.md endpoint the agent actually posts to) - ExecutionController: conditional on ClickHouse being disabled to avoid mapping conflict - Persist originalExchangeId and replayExchangeId from ExecutionChunk envelope through to ClickHouse (was silently dropped) - V5 migration adds the two new columns to executions table Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-31 23:18:35 +02:00
hsiegeln	1a00eed389	fix: schema initializer skips comment-only SQL segments The V4 DDL had a semicolon inside a comment which caused the split-on-semicolon logic to produce a comment-only segment that ClickHouse rejected as empty query. Fixed the comment and made the initializer strip comment-only segments before execution. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-31 22:06:31 +02:00
hsiegeln	052990bb59	feat(clickhouse): add ClickHouseStatsStore with -Merge aggregate queries Implements StatsStore interface for ClickHouse using AggregatingMergeTree tables with -Merge combinators (countMerge, countIfMerge, sumMerge, quantileMerge). Uses literal SQL for aggregate table queries to avoid ClickHouse JDBC driver PreparedStatement issues with AggregateFunction columns. Raw table queries (SLA, topErrors, activeErrorTypes) use normal prepared statements. Includes 13 integration tests covering stats, timeseries, grouped timeseries, SLA compliance, SLA counts by app/route, top errors, active error types, punchcard, and processor stats. Also fixes AggregateFunction type signatures in V4 DDL (count() takes no args, countIf takes UInt8). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-31 21:49:22 +02:00
hsiegeln	eb0d26814f	feat(clickhouse): add stats materialized views DDL (5 tables + 5 MVs) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-31 20:11:38 +02:00
hsiegeln	b30dfa39f4	feat(clickhouse): add executions and processor_executions DDL for chunked transport Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-31 19:04:19 +02:00
hsiegeln	b1c5cc0616	fix: cast DateTime64 to DateTime in ClickHouse TTL expression Some checks failed CI / cleanup-branch (push) Has been skipped Details CI / build (push) Successful in 1m23s Details CI / cleanup-branch (pull_request) Has been skipped Details CI / build (pull_request) Successful in 1m46s Details CI / docker (pull_request) Has been skipped Details CI / deploy (pull_request) Has been skipped Details CI / deploy-feature (pull_request) Has been skipped Details CI / docker (push) Successful in 1m8s Details CI / deploy (push) Has been skipped Details CI / deploy-feature (push) Failing after 2m19s Details	2026-03-31 18:10:20 +02:00
hsiegeln	08934376df	feat: add ClickHouse schema initializer with agent_metrics DDL Adds ClickHouseSchemaInitializer that runs on ApplicationReadyEvent, scanning classpath:clickhouse/*.sql in filename order and executing each statement. Adds V1__agent_metrics.sql with MergeTree table, tenant/agent partitioning, and 365-day TTL. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-31 16:51:21 +02:00
hsiegeln	565b548ac1	refactor: remove all ClickHouse code, old interfaces, and SQL migrations - Delete all ClickHouse storage implementations and config - Delete old core interfaces (ExecutionRepository, DiagramRepository, MetricsRepository, SearchEngine, RawExecutionRow) - Delete ClickHouse SQL migration files - Delete AbstractClickHouseIT - Update controllers to use new store interfaces (DiagramStore, ExecutionStore) - Fix IngestionService calls in controllers for new synchronous API - Migrate all ITs from AbstractClickHouseIT to AbstractPostgresIT - Fix count() syntax and remove ClickHouse-specific test assertions - Update TreeReconstructionTest for new buildTree() method Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-16 18:56:13 +01:00
hsiegeln	035356288f	Fix stats rollup: AggregateFunction(count) takes no type argument All checks were successful CI / build (push) Successful in 1m13s Details CI / docker (push) Successful in 40s Details CI / deploy (push) Successful in 31s Details ClickHouse count() accepts no arguments, so the column type must be AggregateFunction(count) not AggregateFunction(count, UInt64). The latter causes countMerge() to fail with ILLEGAL_TYPE_OF_ARGUMENT. Drop and recreate the table/MV to apply the corrected schema. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-15 10:52:29 +01:00
hsiegeln	adf13f0430	Add 5-minute AggregatingMergeTree stats rollup for dashboard queries All checks were successful CI / build (push) Successful in 1m14s Details CI / docker (push) Successful in 42s Details CI / deploy (push) Successful in 30s Details Pre-aggregate route execution stats into 5-minute buckets using a materialized view with -State/-Merge combinators. Rewrite stats() and timeseries() to query the rollup table instead of scanning the wide base table. Active count remains a real-time query since RUNNING is transient. Includes idempotent backfill migration. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-15 10:46:26 +01:00
hsiegeln	463cab1196	Add displayName to auth response and configurable display name claim for OIDC Some checks failed CI / build (push) Successful in 1m11s Details CI / docker (push) Successful in 49s Details CI / deploy (push) Failing after 2m9s Details - Add displayName field to AuthTokenResponse so the UI shows human-readable names instead of internal JWT subjects (e.g. user:oidc:<hash>) - Add displayNameClaim to OIDC config (default: "name") allowing admins to configure which ID token claim contains the user's display name - Support dot-separated claim paths (e.g. profile.display_name) like rolesClaim - Add admin UI field for Display Name Claim on the OIDC config page - ClickHouse migration: ALTER TABLE adds display_name_claim column Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-14 16:09:24 +01:00
hsiegeln	b024f83c26	Add ALTER TABLE migration for auto_signup column on existing ClickHouse All checks were successful CI / build (push) Successful in 1m14s Details CI / docker (push) Successful in 43s Details CI / deploy (push) Successful in 31s Details The CREATE TABLE IF NOT EXISTS won't add new columns to an existing table. Add 05-oidc-auto-signup.sql with ALTER TABLE ADD COLUMN IF NOT EXISTS and register it in ClickHouseConfig startup schema + test init. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-14 14:01:45 +01:00
hsiegeln	9d2e6f30a7	Move OIDC config from env vars to database with admin API All checks were successful CI / build (push) Successful in 1m9s Details CI / docker (push) Successful in 41s Details CI / deploy (push) Successful in 2m11s Details OIDC provider settings (issuer, client ID/secret, roles claim) are now stored in ClickHouse and managed via admin REST API at /api/v1/admin/oidc. This allows runtime configuration from the UI without server restarts. - New oidc_config table (ReplacingMergeTree, singleton row) - OidcConfig record + OidcConfigRepository interface in core - ClickHouseOidcConfigRepository implementation - OidcConfigAdminController: GET/PUT/DELETE config, POST test connectivity, client_secret masked in responses - OidcTokenExchanger: reads config from DB, invalidateCache() on config change - OidcAuthController: always registered (no @ConditionalOnProperty), returns 404 when OIDC not configured - Startup seeder: env vars seed DB on first boot only, then admin API takes over - HOWTO.md updated with admin OIDC config API examples Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-14 13:01:05 +01:00
hsiegeln	a4de2a7b79	Add RBAC with role-based endpoint authorization and OIDC support Some checks failed CI / build (push) Successful in 1m19s Details CI / docker (push) Successful in 1m38s Details CI / deploy (push) Has been cancelled Details Implement three-phase security upgrade: Phase 1 - RBAC: Extend JWT with roles claim, populate Spring GrantedAuthority in filter, enforce role-based access (AGENT for data/heartbeat/SSE, VIEWER+ for search/diagrams, OPERATOR+ for commands, ADMIN for user management). Configurable JWT secret via CAMELEER_JWT_SECRET env var for token persistence across restarts. Phase 2 - User persistence: ClickHouse users table with ReplacingMergeTree, UserRepository interface + ClickHouse impl, UserAdminController for CRUD at /api/v1/admin/users. Local login upserts user on each authentication. Phase 3 - OIDC: Token exchange flow where SPA sends auth code, server exchanges it server-side (keeping client_secret secure), validates id_token via JWKS, resolves roles (DB override > OIDC claim > default), issues internal JWT. Conditional on CAMELEER_OIDC_ENABLED=true. Uses oauth2-oidc-sdk for standards compliance. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-14 12:35:45 +01:00
hsiegeln	a44a0c970b	Revert to JdbcTemplate for schema init, keep comment-stripping fix All checks were successful CI / build (push) Successful in 48s Details CI / docker (push) Successful in 36s Details CI / deploy (push) Successful in 9s Details The DriverManager-based approach likely failed because the ClickHouse JDBC driver wasn't registered with DriverManager. The original JdbcTemplate approach worked for route_diagrams and agent_metrics — only route_executions was skipped due to the comment-parsing bug. Reverts to simple JdbcTemplate-based init with unqualified table names (DataSource targets cameleer3 database). The CLICKHOUSE_DB env var on the ClickHouse container handles database creation. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 22:05:12 +01:00
hsiegeln	ce0eb58b0c	Fix schema init: bypass DataSource, use direct JDBC with qualified table names All checks were successful CI / build (push) Successful in 48s Details CI / docker (push) Successful in 40s Details CI / deploy (push) Successful in 7s Details The auto-configured DataSource targets jdbc:ch://.../cameleer3 which fails if the database doesn't exist yet. Schema init now uses a direct JDBC connection to the root URL, creates the database first, then applies all schema SQL with fully qualified cameleer3.* table names. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 21:50:47 +01:00
hsiegeln	9dffa9ea81	Move schema initialization from ClickHouse init scripts to server startup All checks were successful CI / build (push) Successful in 49s Details CI / docker (push) Successful in 43s Details CI / deploy (push) Successful in 15s Details Server now applies schema via @PostConstruct using classpath SQL files. All statements use IF NOT EXISTS/IF NOT EXISTS so it's idempotent and safe to run on every startup. Removes ConfigMap and init script mount from K8s manifest since ClickHouse no longer needs to manage the schema. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 19:59:33 +01:00

22 Commits