Core features

Recommendation catalog

Every finding OptiHouse can produce, grouped into ten delivery waves — from quick storage wins to SQL rewrites and physical design advice. Each recommendation carries a severity, an expected impact and an auto-generated runbook.

How to read a recommendation

Every finding has a type (one of the rows below), a severity (low / medium / high / critical), an expected impact estimated in bytes, CPU or dollars, and a confidence score. High-confidence findings can be applied or auto-applied; low-confidence ones are surfaced as soft signals.

Status markers
Recommendation types in Wave 1–5 are fully shipped. Wave 6–10 ship at MVP depth — the analyzer, runbook and UI exist, and the runbook spells out the real integration steps.

Wave 1 — Quick wins

IDRecommendationWhat it surfaces
1.1Zombie materialized viewsMVs that still consume merges and storage but feed nothing.
1.2Storage fragmentation mapTables with too many tiny parts and low average part size.
1.3Auto-generated runbooksEvery finding gets probable causes, diagnostic SQL and remediation steps.
1.4Query schedulingConcurrent bursts at the same minute that could be staggered.
1.5Recommendation impact trackingBefore/after metrics once a fix is applied — bytes saved, parts reduced.

Wave 2 — Query intelligence

IDRecommendationWhat it surfaces
2.1Query fingerprint driftA query pattern that started reading or running noticeably more than before.
2.2Partition efficiency scoreQueries that scan far more partitions than they return rows from.
2.3Workload classificationUsers bucketed into ETL / BI / monitoring / ad-hoc traffic.
2.4Workload seasonalityRecurring daily peak hours worth planning capacity around.
2.5Query blast radiusA single user or pattern contending for a disproportionate share of resources.

Wave 3 — Operational health

IDRecommendationWhat it surfaces
3.1Insert pressure analyzerTables receiving many small inserts that create part-merge debt.
3.2Merge debt indexA growing merge backlog that the cluster is not keeping up with.
3.3Replica imbalance detectorReplicas carrying uneven load, lag or part counts.

Wave 4 — Enterprise FinOps & storage

IDRecommendationWhat it surfaces
4.1Cost attributionCompute and storage cost broken down per team, product or service.
4.2Cold data advisorLarge tables not touched for months — candidates for a colder tier.
4.3Schema evolution riskSchema changes that look risky for downstream consumers.
4.4Upgrade advisorReadiness scoring before bumping the ClickHouse version.

Wave 5 — Advanced differentiators

IDRecommendationWhat it surfaces
5.1Query replay simulatorTop tables worth a what-if rewrite (MVP: candidate surfacing).
5.2SQL CI/CD guardrailsAn anti-pattern summary you can wire into a PR check.
5.3Historical incident correlationConcurrent symptoms that tend to appear together during incidents.

Wave 6 — SQL rewriter engine

IDRecommendationWhat it surfaces
6.1Auto-PREWHERE advisorHeavy-read queries that would benefit from a PREWHERE predicate.
6.2FINAL removal rewriterFROM ... FINAL queries rewritten with argMax or LIMIT 1 BY.
6.3JOIN reorder & algorithm hintsMemory-heavy joins that should switch to grace_hash / parallel_hash.
6.4Subquery → CTE refactorRepeated subqueries collapsed into a single shared CTE.

Wave 7 — Physical design advisor

IDRecommendationWhat it surfaces
7.1Per-column codec advisorColumns that would compress better with Delta, Gorilla, T64 or ZSTD.
7.2LowCardinality / Enum advisorString columns with few distinct values worth wrapping in LowCardinality.
7.3Skipping index recommenderFilter columns that would benefit from a minmax, set or bloom_filter index.
7.4Projection advisorRead patterns that a projection inside the source table would accelerate.
7.5PK reorder & type downsizingPrimary-key column order and oversized column types worth revisiting.

Wave 8 — What-if simulator

IDRecommendationWhat it surfaces
8.1Drop column previewLarge columns that never appear in any observed query.
8.2Add projection previewA workload-wide summary of projection candidates.
8.3Repartition / TTL previewOld data on big tables that a TTL move would free up.
8.4Disk capacity forecastDisks above a utilisation threshold, with a days-until-full estimate.

Wave 9 — Integrations

IDRecommendationWhat it surfaces
9.1dbt manifest integrationQueries traced back to the dbt model that emitted them.
9.2GitHub / GitLab PR botAnti-patterns turned into a comment on the pull request that introduced them.
9.3Slack / email weekly digestA scheduled summary of new findings and dollar savings.
9.4Prometheus / OpenTelemetry exporterPlatform metrics exported to your existing observability stack.

Wave 10 — Workload governance

IDRecommendationWhat it surfaces
10.1Resource pool advisorCREATE WORKLOAD pools derived from how users actually behave.
10.2Noisy neighbor detectorA user that is both a large share of load and very bursty.
10.3Pre-flight cost estimatorAn estimate of what a query will cost before it runs.