Does serverless mean there are no servers?

No. There are still servers — you just do not manage them. The cloud provider handles provisioning, patching, and scaling, so from your point of view you upload code and it runs on demand without your ever touching a machine.

What is a cold start?

A cold start is the brief delay when a serverless function has to spin up a fresh execution environment because none was already warm. Once running, subsequent calls reuse that environment and respond quickly until it is idled out.

What is the difference between serverless and containers?

With containers you still define and run long-lived instances and decide how many. Serverless hides that entirely: the platform scales execution from zero to many automatically and bills per request. Containers give more control; serverless gives less operational overhead.

Is serverless cheaper?

Often, for spiky or low-to-moderate traffic, because you pay only for what runs and nothing while idle. At very high, steady volume, dedicated servers can become cheaper per request, so the economics depend on your traffic shape.

What are serverless functions good for?

They excel at event-driven and bursty workloads: API endpoints, webhook handlers, scheduled jobs, image processing, and the backend of modern web frameworks. They are less ideal for long-running or extremely latency-sensitive tasks.

Glossary · Software

What is Serverless?

Serverless is a cloud model where you upload code and the provider runs it on demand — provisioning, scaling, and patching the underlying servers for you, and billing only for the compute your code actually uses.

What serverless means

Serverless is a bit of a misnomer — there are absolutely still servers. What is gone is your responsibility for them. You do not pick instance sizes, patch operating systems, or decide how many machines to keep running. You hand the provider a unit of code, and it figures out where and when to run it, spinning capacity up the instant a request arrives and back down to nothing when traffic stops.

The most common form is Functions as a Service, where you deploy individual functions that the platform triggers on an event — an HTTP request, a message on a queue, a file upload, a scheduled timer. The broader serverless umbrella also covers managed databases, storage, and message systems that scale and bill the same way: you consume the capability and never see the server.

Where it came from

The modern serverless era began in 2014 when Amazon launched AWS Lambda, which let developers run code in response to events without ever provisioning a server and billed in fractions of a second. The other major clouds followed with their own function platforms, and the model expanded outward to databases, queues, and whole application frameworks.

The appeal was a continuation of a long trend: each generation of cloud computing removed another layer of undifferentiated heavy lifting. Virtual machines removed the data center, containers removed the operating system fuss, and serverless removed capacity planning. The promise was that engineers could spend their time on application logic and almost none on the infrastructure beneath it.

How it works

When an event hits a serverless function, the platform finds or creates an execution environment, loads your code, and runs it. If an environment is already warm from a recent call, the response is near-instant. If not, there is a brief cold start while the runtime initializes. Many requests can be handled in parallel because the platform simply spins up more environments — scaling is automatic and effectively unbounded within your configured limits.

Functions are designed to be stateless: any data that must persist lives in an external database, cache, or object store, because the environment running a function can vanish the moment it finishes. Billing follows execution — you pay per invocation and per millisecond of compute, and nothing at all while the function sits idle. That pay-for-what-you-use shape is the model's defining economic feature.

When it matters

Serverless matters most when traffic is spiky or unpredictable, when you want to ship without standing up infrastructure, and when keeping a fleet of always-on servers would mean paying for idle capacity. It is a natural fit for API endpoints, webhook receivers, scheduled jobs, and the backends of modern web frameworks. The trade-offs to weigh are cold-start latency for rarely-hit endpoints, execution-time ceilings that make it poor for very long jobs, and the fact that at sustained high volume, dedicated capacity can become cheaper per request.

At QUANT LAB

A large share of what we ship runs serverless by default. Because we build on Next.js, the application's API routes and server rendering deploy as serverless functions on the hosting platform, which means a new client's app scales from one user to a launch-day spike without us pre-provisioning anything. For event-driven work — processing a webhook, running a nightly report, resizing an upload — a single function is usually the simplest, cheapest tool for the job.

We do not treat serverless as the answer to everything. Our cloud infrastructure work includes choosing, per workload, between serverless functions, long-running containers, and dedicated machines based on your traffic shape and latency needs. For most SaaS platforms we build, a serverless-first architecture keeps the operational burden low while the product finds its footing.

Long-form deep-dives that use this term

All posts

Related terms

Talk to the engineer who would build it

If you want a 30-minute conversation about whether serverless fits your workload and budget — not a pitch — book a call.

Cloud infrastructure