Kinesis
Pricing & Services

Pay for what runs. Cap at the reserved rate.

True-Util™ is the one pricing model behind every Kinesis service — whether the compute is ours, sourced from a partner, or yours. Pay only for the cycles your workloads actually consume, capped at the reserved rate. Idle capacity isn't wasted — it's monetized.

Traditional cloud

$0.00

True-Util

$0.00

Idle Capacity

$0.00

Available to monetize on the Kinesis grid

00:0008:0016:0024:00
Reserved rate (traditional)True-Util (actual usage)Idle capacity · monetizable

One pricing model. Three ways to buy.

True-Util works the same way regardless of where the compute comes from. Pick the buying mode that matches your workload — the metering, caps, and telemetry are identical.

TRUE-UTIL SHARED

Metered cycles, capped at Reserved.

Multi-tenant compute on the Kinesis grid. You pay for the CPU and GPU cycles your workloads actually consume — and never more than the equivalent Reserved monthly rate. Spiky, variable, or hard-to-forecast workloads save the most.

  • Best for inference, dev/staging, agencies, MVPs
  • Runs on Intel, AMD CPUs; A100, H100, H200 GPUs
  • No upfront commitments
TRUE-UTIL DEDICATED

The whole machine, fixed cost.

Single-tenant compute, reserved monthly. Full control over the box, predictable billing, the same Kinesis orchestration and telemetry as Shared. For workloads where steady utilization is a given.

  • Best for steady training, production HPC, regulated workloads
  • Choice across providers — reduces lock-in
  • Full control over configuration, performance, privacy
TRUE-UTIL ON YOUR COMPUTE

Your hardware. Our orchestration. 80% off.

Run the Kinesis grid on compute you already own or rent — AWS savings plans, on-prem servers, your datacenter, your private cloud. Same product, same kernel-level orchestration, priced at 20% of True-Util Shared.

  • Best for portfolios with existing capacity or commitments
  • Hardware range: Raspberry Pi to H200 superclusters
  • Adds FinOps visibility across what you already pay for
Why True-Util

We make money when your processors are busy. Not when they idle.

Traditional clouds bill wall-clock time on whatever you reserved — full price whether you used 5% of the box or 95%. True-Util inverts the math: Shared meters real cycles, Dedicated is priced for steady use, and On Your Compute is priced as a percentage of Shared. The more output you get from a processor, the more we both earn from it.

Average cloud customer utilization is under 20%. The other 80% is what True-Util captures.

Pricing

Posted rates

The low end is what you pay when the machine is idle (Shared) or unused (On Your Compute). The high end is the Reserved cap — what you'd pay on a traditional cloud regardless.

GPU · H100

28 vCPUs, 96 GB RAM per card · 1×, 2×, 4×, 8× configs

$0$1.90/hr

TRUE-UTIL SHARED OR DEDICATED

CPU · C24

24 vCPUs, 96 GB RAM

$0$0.94/hr

TRUE-UTIL SHARED OR DEDICATED

CPU · Flex

4 vCPUs, 8 GB RAM · Spot-class for fault-tolerant work

$0$0.20/hr

TRUE-UTIL SHARED

Your Compute

Any hardware you own or rent · Raspberry Pi to H200 superclusters

$020% of Shared

YOUR HARDWARE, OUR SOFTWARE

Where True-Util saves the most

The same workloads that cost the most on traditional clouds save the most on True-Util.

AI startups & LLM inference
The pain

H100s sit idle between prompts. The bill is the same whether you served 100 requests or 100,000.

The Kinesis win

True-Util Shared meters inference time only. No queries, no cost. Bursty traffic caps at the Reserved rate.

Early-stage SaaS & MVPs
The pain

Overprovisioning for traffic that hasn’t shown up. Or worse — under-provisioning and falling over the first time it does.

The Kinesis win

Pay pennies at low traffic. Costs cap at the Reserved rate during spikes. Headroom without prepayment.

Dev, Staging & CI/CD
The pain

Staging servers run 24/7 to be ready, but burn nights and weekends.

The Kinesis win

True-Util drops the bill as activity drops. Same reservation, lower cost when the team’s asleep.

Enterprises with idle capacity
The pain

Reserved AWS instances, on-prem servers, donated lab GPUs — capacity already paid for, sitting underused.

The Kinesis win

Run the Kinesis grid on your hardware at 20% of Shared. Same orchestration, FinOps visibility, 80% less spend on what you already own.

Try it on a real app

$100 in free credit. No credit card required. Deploy your first container in under five minutes — bring a GitHub repo, a Dockerfile, or just describe what you want