Question 1

What's the difference between Serverless Jobs and Persistent Endpoints?

Accepted Answer

Serverless Jobs are billed per second of execution and scale to zero when idle — ideal for batch processing, experiments, and CI pipelines. Persistent Endpoints are always-on dedicated GPUs billed at a flat monthly rate — ideal for production APIs that need zero cold-start latency and consistent performance.

Question 2

How does per-second billing work for Jobs?

Accepted Answer

You are billed for the exact number of seconds your GPU is running. When your function completes or your deployment scales to zero, billing stops immediately. There are no minimum charges and no rounding up to the nearest minute or hour.
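
As a rough illustration of the arithmetic, the sketch below computes the charge for a single run from its duration in seconds. The function name `job_cost` is hypothetical, the $2.36/hr figure is the A100 Serverless Jobs rate quoted on this page, and the code assumes the per-second rate is simply the hourly rate divided by 3,600. How Velar rounds fractional cents or sub-second durations is not stated here.

```python
SECONDS_PER_HOUR = 3600
A100_HOURLY_USD = 2.36  # A100 Serverless Jobs rate quoted on this page


def job_cost(seconds_running: float, hourly_rate_usd: float) -> float:
    """Cost in USD of a run billed per second, with no minimum charge.

    Assumption: per-second rate = hourly rate / 3600 (hypothetical helper,
    not an official Velar API).
    """
    if seconds_running < 0:
        raise ValueError("seconds_running must be non-negative")
    per_second_rate = hourly_rate_usd / SECONDS_PER_HOUR
    return seconds_running * per_second_rate


# A 90-second run is charged for 90 seconds, not rounded up to 2 minutes.
cost = job_cost(90, A100_HOURLY_USD)
print(f"90 s on an A100: ${cost:.4f}")
```

Under these assumptions a 90-second A100 run comes to about $0.059, and a run of zero seconds costs nothing.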

Question 3

Why are Endpoints cheaper for 24/7 workloads?

Accepted Answer

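An A100 Persistent Endpoint costs $1,400/month, equivalent to $1.94/hr over a 30-day month, versus $2.36/hr for Serverless Jobs. Running a Job 24/7 for 30 days would cost ~$1,700 ($2.36 × 720 hours). If your model serves traffic around the clock, an Endpoint saves about 18%. The break-even point is roughly 593 GPU-hours a month ($1,400 ÷ $2.36), or just under 20 hours a day.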
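
The comparison can be reproduced with a few lines of arithmetic. This is a sketch using only the two A100 prices quoted in this answer and assuming a 30-day month; the variable and function names are illustrative, not part of any Velar tooling.

```python
HOURS_PER_30_DAYS = 24 * 30      # 720 hours, assuming a 30-day month
ENDPOINT_MONTHLY_USD = 1400.00   # A100 Persistent Endpoint, flat monthly rate
JOB_HOURLY_USD = 2.36            # A100 Serverless Jobs, hourly rate


def job_monthly_cost(hours_running: float) -> float:
    """Cost of running a Serverless Job for the given number of hours."""
    return hours_running * JOB_HOURLY_USD


def break_even_hours() -> float:
    """Monthly hours at which a Job costs the same as the Endpoint."""
    return ENDPOINT_MONTHLY_USD / JOB_HOURLY_USD


full_time_job_cost = job_monthly_cost(HOURS_PER_30_DAYS)
savings = 1 - ENDPOINT_MONTHLY_USD / full_time_job_cost
hours = break_even_hours()

print(f"Job running 24/7: ${full_time_job_cost:,.2f}")
print(f"Endpoint savings at 24/7: {savings:.1%}")
print(f"Break-even: {hours:.0f} h/month (~{hours / 30:.1f} h/day)")
```

With these figures, a Job running 24/7 costs $1,699.20, the Endpoint saves 17.6% at full utilization, and Jobs remain the cheaper option below roughly 593 GPU-hours a month (about 19.8 hours a day).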

Question 4

Which GPU providers do you use?

Accepted Answer

Velar runs on enterprise-grade GPU infrastructure. We provide secure, on-demand access to a wide range of NVIDIA GPUs from L4 to H200. Prices reflect infrastructure costs plus Velar's operational margin.

Question 5

What happens when my free credits run out?

Accepted Answer

When your free credits are exhausted, you will need to add a payment method to continue running workloads. Your existing deployments will gracefully shut down, and you can upgrade to the Pro plan or add credits at any time.

Question 6

Can I switch plans at any time?

Accepted Answer

Yes, you can upgrade or downgrade your plan at any time. When upgrading, the new plan takes effect immediately and you receive the additional credits. When downgrading, the change takes effect at the start of your next billing cycle.

Serverless Jobs pricing

GPU	VRAM	Per Hour	Per Second
L4 24GB	24 GB	$0.66	$0.000183
RTX 4090 24GB	24 GB	$1.00	$0.000278
L40S 48GB	48 GB	$1.46	$0.000406
A100 80GB	80 GB	$2.36	$0.000656
H100 PCIe 80GB	80 GB	$3.59	$0.000997
H100 SXM 80GB	80 GB	$3.77	$0.001047
H200 141GB	141 GB	$4.31	$0.001197
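
The Per Second column appears to be the hourly price divided by 3,600 and rounded to six decimal places. The sketch below checks that assumption against the rows of the table above. The rounding rule is an inference from the listed numbers, not a documented billing policy, and the names used are illustrative.

```python
# (hourly USD, listed per-second USD) copied from the table above
RATES = {
    "L4 24GB": (0.66, 0.000183),
    "RTX 4090 24GB": (1.00, 0.000278),
    "L40S 48GB": (1.46, 0.000406),
    "A100 80GB": (2.36, 0.000656),
    "H100 PCIe 80GB": (3.59, 0.000997),
    "H100 SXM 80GB": (3.77, 0.001047),
    "H200 141GB": (4.31, 0.001197),
}


def per_second_from_hourly(hourly_usd: float) -> float:
    """Derive a per-second rate, assuming hourly / 3600 rounded to 6 places."""
    return round(hourly_usd / 3600, 6)


def matches_listed(hourly_usd: float, listed_usd: float) -> bool:
    """True if the derived rate agrees with the listed one to 6 decimals."""
    return abs(per_second_from_hourly(hourly_usd) - listed_usd) < 5e-7


for gpu, (hourly, listed) in RATES.items():
    status = "ok" if matches_listed(hourly, listed) else "mismatch"
    print(f"{gpu:<16} ${hourly:.2f}/hr -> ${per_second_from_hourly(hourly):.6f}/s ({status})")
```

Every row agrees under this rule, so either column can be used for estimates; the two differ only in the rounding of the sixth decimal place.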
