Pricing & Services

Pay for what runs. Cap at the reserved rate.

True-Util™ is the one pricing model behind every Kinesis service — whether the compute is ours, sourced from a partner, or yours. Pay only for the cycles your workloads actually consume, capped at the reserved rate. Idle capacity isn't wasted — it's monetized.

Try Kinesis — $100 credit

Traditional cloud

$0.00

True-Util

$0.00

Idle Capacity

$0.00

Available to monetize on the Kinesis grid

00:0004:0008:0012:0016:0020:0024:00

Reserved rate (traditional)True-Util (actual usage)Idle capacity · monetizable

One pricing model. Three ways to buy.

True-Util works the same way regardless of where the compute comes from. Pick the buying mode that matches your workload — the metering, caps, and telemetry are identical.

TRUE-UTIL SHARED

Metered cycles, capped at Reserved.

Multi-tenant compute on the Kinesis grid. You pay for the CPU and GPU cycles your workloads actually consume — and never more than the equivalent Reserved monthly rate. Spiky, variable, or hard-to-forecast workloads save the most.

Best for inference, dev/staging, agencies, MVPs
Runs on Intel, AMD CPUs; A100, H100, H200 GPUs
No upfront commitments

TRUE-UTIL DEDICATED

The whole machine, fixed cost.

Single-tenant compute, reserved monthly. Full control over the box, predictable billing, the same Kinesis orchestration and telemetry as Shared. For workloads where steady utilization is a given.

Best for steady training, production HPC, regulated workloads
Choice across providers — reduces lock-in
Full control over configuration, performance, privacy

TRUE-UTIL ON YOUR COMPUTE

Your hardware. Our orchestration. 80% off.

Run the Kinesis grid on compute you already own or rent — AWS savings plans, on-prem servers, your datacenter, your private cloud. Same product, same kernel-level orchestration, priced at 20% of True-Util Shared.

Best for portfolios with existing capacity or commitments
Hardware range: Raspberry Pi to H200 superclusters
Adds FinOps visibility across what you already pay for

Why True-Util

We make money when your processors are busy. Not when they idle.

Traditional clouds bill wall-clock time on whatever you reserved — full price whether you used 5% of the box or 95%. True-Util inverts the math: Shared meters real cycles, Dedicated is priced for steady use, and On Your Compute is priced as a percentage of Shared. The more output you get from a processor, the more we both earn from it.

Average cloud customer utilization is under 20%. The other 80% is what True-Util captures.

Pricing

Posted rates

The low end is what you pay when the machine is idle (Shared) or unused (On Your Compute). The high end is the Reserved cap — what you'd pay on a traditional cloud regardless.

Service

Specs

Idle / Unused

What you pay when idle

Reserved Cap

Traditional cloud rate

Available On

GPU · H10028 vCPUs, 96 GB RAM per card · 1×, 2×, 4×, 8× configs$0$1.90/hrTRUE-UTIL SHARED OR DEDICATED

CPU · C2424 vCPUs, 96 GB RAM$0$0.94/hrTRUE-UTIL SHARED OR DEDICATED

CPU · Flex4 vCPUs, 8 GB RAM · Spot-class for fault-tolerant work$0$0.20/hrTRUE-UTIL SHARED

Your ComputeAny hardware you own or rent · Raspberry Pi to H200 superclusters$020% of SharedYOUR HARDWARE, OUR SOFTWARE

GPU · H100

28 vCPUs, 96 GB RAM per card · 1×, 2×, 4×, 8× configs

$0$1.90/hr

TRUE-UTIL SHARED OR DEDICATED

CPU · C24

24 vCPUs, 96 GB RAM

$0$0.94/hr

TRUE-UTIL SHARED OR DEDICATED

CPU · Flex

4 vCPUs, 8 GB RAM · Spot-class for fault-tolerant work

$0$0.20/hr

TRUE-UTIL SHARED

Your Compute

Any hardware you own or rent · Raspberry Pi to H200 superclusters

$020% of Shared

YOUR HARDWARE, OUR SOFTWARE

Where True-Util saves the most

The same workloads that cost the most on traditional clouds save the most on True-Util.

The pain

The Kinesis win

AI startups & LLM inference

H100s sit idle between prompts. The bill is the same whether you served 100 requests or 100,000.

True-Util Shared meters inference time only. No queries, no cost. Bursty traffic caps at the Reserved rate.

The pain

H100s sit idle between prompts. The bill is the same whether you served 100 requests or 100,000.

The Kinesis win

True-Util Shared meters inference time only. No queries, no cost. Bursty traffic caps at the Reserved rate.

Early-stage SaaS & MVPs

Overprovisioning for traffic that hasn’t shown up. Or worse — under-provisioning and falling over the first time it does.

Pay pennies at low traffic. Costs cap at the Reserved rate during spikes. Headroom without prepayment.

The pain

Overprovisioning for traffic that hasn’t shown up. Or worse — under-provisioning and falling over the first time it does.

The Kinesis win

Pay pennies at low traffic. Costs cap at the Reserved rate during spikes. Headroom without prepayment.

Dev, Staging & CI/CD

Staging servers run 24/7 to be ready, but burn nights and weekends.

True-Util drops the bill as activity drops. Same reservation, lower cost when the team’s asleep.

The pain

Staging servers run 24/7 to be ready, but burn nights and weekends.

The Kinesis win

True-Util drops the bill as activity drops. Same reservation, lower cost when the team’s asleep.

Enterprises with idle capacity

Reserved AWS instances, on-prem servers, donated lab GPUs — capacity already paid for, sitting underused.

Run the Kinesis grid on your hardware at 20% of Shared. Same orchestration, FinOps visibility, 80% less spend on what you already own.

The pain

Reserved AWS instances, on-prem servers, donated lab GPUs — capacity already paid for, sitting underused.

The Kinesis win

Run the Kinesis grid on your hardware at 20% of Shared. Same orchestration, FinOps visibility, 80% less spend on what you already own.

Pay for what runs. Cap at the reserved rate.

One pricing model. Three ways to buy.

Metered cycles, capped at Reserved.

The whole machine, fixed cost.

Your hardware. Our orchestration. 80% off.

We make money when your processors are busy. Not when they idle.

Posted rates

Where True-Util saves the most

Try it on a real app