Comprehensive guide to cutting serverless costs by 40-60% through memory optimization, cold start mitigation, and architecture efficiency. Includes AWS Lambda, Azure Functions, and GCP Cloud Run strategies.
Serverless platforms (AWS Lambda, Azure Functions, Google Cloud Functions) charge based on actual consumption, not reserved capacity. However, many cost drivers are hidden or misunderstood. The primary cost factors are:
AWS Lambda is priced on two dimensions: invocations and compute time (GB-seconds).
| Component | Pricing | Free Tier / Month |
|---|---|---|
| Invocations | $0.0000002 per request | 1 million requests |
| Duration (GB-seconds) | $0.0000166667 per GB-second | 400,000 GB-seconds (e.g., 1 GB for 400,000 sec) |
| Provisioned Concurrency | $0.015 per GB-hour | None |
| Ephemeral Storage | $0.0000000309 per GB-second (beyond 512 MB) | 512 MB included per function |
Example calculation: a function with 1,024 MB memory running for 200 ms, invoked 10 million times/month: (1,024 MB / 1,024) × 0.2 sec × 10,000,000 = 2,000,000 GB-seconds × $0.0000166667 ≈ $33.33, plus 10M × $0.0000002 = $2.00 in invocation charges, for roughly $35.33/month before the free tier.
AWS Lambda's free tier covers 1M invocations and 400,000 GB-seconds monthly. For development and low-volume workloads, you may stay within the free tier entirely. Overage costs are trivial until you reach 50+ million invocations/month.
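The pricing above can be folded into a small estimator. This is a hypothetical helper using the list prices quoted in this guide; verify current rates on the AWS pricing page.

```python
# Rough AWS Lambda monthly cost estimator (list prices quoted in this guide;
# verify current rates before relying on these numbers).
PRICE_PER_REQUEST = 0.0000002        # $ per invocation
PRICE_PER_GB_SECOND = 0.0000166667   # $ per GB-second

def lambda_monthly_cost(memory_mb, duration_ms, invocations,
                        free_requests=1_000_000, free_gb_seconds=400_000):
    """Return (request_cost, compute_cost, total) after the free tier."""
    gb_seconds = (memory_mb / 1024) * (duration_ms / 1000) * invocations
    billable_requests = max(0, invocations - free_requests)
    billable_gb_seconds = max(0, gb_seconds - free_gb_seconds)
    request_cost = billable_requests * PRICE_PER_REQUEST
    compute_cost = billable_gb_seconds * PRICE_PER_GB_SECOND
    return request_cost, compute_cost, request_cost + compute_cost

# The example from this guide: 1,024 MB, 200 ms, 10M invocations/month.
req, comp, total = lambda_monthly_cost(1024, 200, 10_000_000)
```

For the 1,024 MB / 200 ms / 10M example, this returns roughly $28.47 after the free tier (about $35.33 gross).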
Azure Functions offers three hosting plans, each with different cost structures:
| Hosting Plan | Base Cost | Execution Pricing | Best For |
|---|---|---|---|
| Consumption Plan | None | $0.0000002/execution + $0.000016667/GB-second | Bursty, unpredictable workloads |
| Premium Plan | $0.04–$0.40/vCPU-hour | No per-execution charge | Sustained or consistent volume |
| Dedicated App Service | $10–$100+/month | No additional charges | Always-on, predictable workloads |
Azure's Consumption Plan mirrors AWS Lambda pricing. Premium Plan reserves 1-4 vCPU cores with guaranteed execution memory, eliminating cold starts, but you pay for at least one pre-warmed instance around the clock.
| Resource | Cloud Functions | Cloud Run |
|---|---|---|
| Invocations | 2 million free/month, then $0.40 per million | 2 million free/month, then $0.40 per million |
| Compute Time | $0.0000100 per GHz-second | $0.0000240 per vCPU-second |
| Memory | $0.0000025 per GB-second | $0.0000025 per GiB-second |
| Scale-to-Zero | Yes (always) | Optional (configurable minimum instances) |
GCP Cloud Run is often more cost-effective for sustained workloads because a single instance can serve many concurrent requests (80 by default), amortizing CPU and memory cost across requests, whereas Cloud Functions bills compute per request. Cloud Run's minimum instances feature (1+ instances always warm) adds cost but eliminates cold starts.
Many teams allocate 1,024 MB or 2,048 MB "just in case." But AWS Lambda scales CPU proportionally with memory: more memory means more CPU, and teams rarely measure the trade-off. A function that runs in 2,000 ms at 512 MB may run in 500 ms at 2,048 MB, consuming the same GB-seconds (4x the memory for 1/4 the duration) while responding 4x faster.
Chaining 5-10 synchronous Lambda invocations (e.g., function A calls B, which calls C) means each upstream function stays billable while it waits for its callees, so function A is billed for the entire chain's duration. At 100,000 invocations/day, you pay for that cumulative waiting every day. Switching to asynchronous messaging (SNS/SQS) can cut billable duration 50-70%.
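A back-of-the-envelope model shows where the 50-70% range comes from. The 300 ms steps and 512 MB allocation below are illustrative assumptions, not figures from any specific workload.

```python
# Three-step chain, 300 ms of real work per step, 100,000 invocations/day.
step_ms = [300, 300, 300]
invocations = 100_000
gb = 0.5                  # 512 MB allocation (illustrative)
price = 0.0000166667      # $ per GB-second

# Synchronous: A waits on B, B waits on C, so A bills 900 ms, B 600 ms, C 300 ms.
sync_billable_ms = sum(sum(step_ms[i:]) for i in range(len(step_ms)))  # 1800 ms
# Asynchronous via SQS/SNS: each function bills only its own 300 ms of work.
async_billable_ms = sum(step_ms)                                       # 900 ms

sync_cost = gb * (sync_billable_ms / 1000) * price * invocations
async_cost = gb * (async_billable_ms / 1000) * price * invocations
savings = 1 - async_cost / sync_cost  # 50% for a 3-step chain
```

Longer chains save more: the synchronous waiting grows quadratically with chain depth while the real work grows linearly.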
Self-invoking Lambda functions (recursion without depth limits) can trigger thousands of unintended invocations during errors. One team incurred $8,000 in accidental Lambda costs in 4 hours due to recursive error handling. Always set max depth and implement exponential backoff with DLQs (Dead Letter Queues).
REST API Gateway charges $3.50 per million requests. If you serve 500M requests/month, that's $1,750 in API costs alone. HTTP API costs $0.60 per million requests, nearly 6x cheaper; the same 500M requests cost $300. Migrating high-volume workloads to HTTP API or ALB saves thousands monthly at scale.
CloudFront-triggered Lambda (Lambda@Edge) costs $0.60 per million invocations in viewer region, plus $0.50 per million in origin region. A single image optimization function at edge serving 100M requests/month costs $110—4x more than a regional function for the same logic.
Lambda logs every invocation by default. Without retention policies, logs accumulate indefinitely. A function logging 5 KB per invocation at 10M invocations/month generates 50 GB/month. CloudWatch charges $0.50 per GB for log ingestion. Without lifecycle policies, you're paying $25/month in ingestion alone, plus ever-growing storage costs.
Lambda to S3 in same region: free. Lambda to S3 in different region: $0.02 per GB. A function reading 100 TB/month from cross-region S3 incurs $2,000 extra in transfer fees. Many teams don't realize they've deployed Lambda in us-east-1 and data in eu-west-1.
VPC-enabled Lambda adds infrastructure cost (notably NAT Gateway data-processing charges for internet-bound traffic) and cold-start latency. Database connection pools managed inside Lambda functions (rather than RDS Proxy) hit connection limits and cause timeouts, forcing longer execution times.
One of the biggest misconceptions about serverless is that more memory always costs more. In reality, AWS Lambda (and similar services) scale CPU proportionally with memory allocation. More memory = faster execution, which can reduce total billable time.
AWS Lambda GB-second cost = (memory MB / 1,024) × (duration ms / 1,000) × $0.0000166667
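In code, this formula makes the sweet-spot effect easy to check, using the on-demand duration price from earlier in this guide:

```python
def gb_second_cost(memory_mb, duration_ms, price=0.0000166667):
    """AWS Lambda compute cost for a single invocation, in dollars."""
    return (memory_mb / 1024) * (duration_ms / 1000) * price

# 512 MB at 2,000 ms vs 2,048 MB at 500 ms (the sweet-spot example earlier):
slow = gb_second_cost(512, 2000)   # 1.0 GB-second
fast = gb_second_cost(2048, 500)   # 1.0 GB-second: same cost, 4x faster
```

Because memory and duration multiply, any memory increase that speeds the function up proportionally is free; increases beyond that point start costing more.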
Example scenario: A function processes 100 JSON records:
The optimal memory is where total GB-seconds (memory × duration) is minimized. AWS Lambda Power Tuning is an open-source tool that automatically tests your function at every memory tier (128 MB to 10,240 MB) and identifies the cost-optimal and speed-optimal configurations.
Run AWS Lambda Power Tuning on your top 10 functions (by invocation count). Most teams find 20-30% cost savings by moving from guessed memory to measured optimum. The tool costs $1-5 to run but pays back in days.
A cold start occurs when a function is invoked after idle time, requiring the Lambda runtime to initialize. Cold starts add 100-500 ms (Node.js/Python) to 2,000+ ms (Java/C#). For a function serving user requests, cold starts degrade UX. For batch workloads, cold starts are negligible.
AWS offers Provisioned Concurrency to keep a set number of function instances warm 24/7. Cost is $0.015 per GB-hour (vs. $0.0000166667 per GB-second on-demand).
| Configuration | Provisioned Concurrency Cost | Execution Cost (10M/month) | Total Monthly |
|---|---|---|---|
| 1 instance, 512 MB | $0.015 × 0.5 GB × 730 hours = $5.48 | $33.33 | $38.81 |
| 5 instances, 1,024 MB | $0.015 × 1 GB × 5 × 730 hours = $54.75 | $33.33 | $88.08 |
| On-demand (no provisioning) | $0 | $33.33 | $33.33 |
Provisioned Concurrency is only cost-effective if cold starts cause revenue loss or SLA breaches. For most batch and asynchronous workloads, the always-on cost is wasted, and it grows linearly with instance count and memory size. For user-facing APIs with strict latency SLAs (sub-100 ms response time), Provisioned Concurrency can be justified.
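The arithmetic is easy to script using the prices above. Note that Provisioned Concurrency also applies a lower per-GB-second duration rate to executions, which this sketch ignores.

```python
# Provisioned Concurrency monthly cost, using this guide's list prices.
PC_PER_GB_HOUR = 0.015
HOURS_PER_MONTH = 730

def pc_monthly_cost(instances, memory_mb):
    """Always-on cost of keeping `instances` warm at `memory_mb` each."""
    return instances * (memory_mb / 1024) * PC_PER_GB_HOUR * HOURS_PER_MONTH

cost_1x512 = pc_monthly_cost(1, 512)     # ~$5.48/month
cost_5x1024 = pc_monthly_cost(5, 1024)   # ~$54.75/month
```

The cost scales with instances × memory, so generous over-provisioning "for safety" is what turns a few dollars into hundreds.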
Teams often assume Lambda is cheaper than ECS Fargate or Cloud Run. This is not always true: for sustained, longer-running workloads at scale, containers become more cost-effective.
| Metric | Lambda (2,048 MB) | ECS Fargate (2 vCPU, 4 GB tasks) | Cloud Run (1 vCPU, 2 GB) |
|---|---|---|---|
| Per-invocation cost (1 s) | ~$0.0000335 | N/A (billed per second while running) | N/A (billed per second) |
| 1M invocations/month (1 s avg) | ~$34 | $2,880 (always-on fleet) | $1,440 (always-on fleet) |
| 100M invocations/month (1 s avg) | ~$3,350 | $2,880 (always-on fleet) | $1,440 (always-on fleet) |
| Cold start handling | 100-500 ms | Always warm | Configurable |
| Idle cost (24/7 running) | $0 (auto-scales to zero) | $2,880/month | $1,440/month (with min instances) |
When to switch to containers:
Serverless shines with asynchronous, event-driven patterns. Using SQS, SNS, Kinesis, and DynamoDB Streams allows functions to process work without synchronous chains.
SQS/Kinesis/DynamoDB Stream consumers are triggered with batch sizes (1-100 records). Larger batches = fewer function invocations = lower invocation cost.
| Batch Size | 100K Events/Day | Daily Invocations | Daily Invocation Cost |
|---|---|---|---|
| 1 record per batch | 100,000 events | 100,000 invocations | $0.02 |
| 10 records per batch | 100,000 events | 10,000 invocations | $0.002 |
| 100 records per batch | 100,000 events | 1,000 invocations | $0.0002 |
Batch sizes of 10-100 are typical; larger batches mean fewer invocations and lower invocation cost, but increase memory usage per invocation (the full batch must fit in memory) and add per-record latency.
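The batch-size table reduces to a short loop over invocation charges only; duration cost shifts with batch size and isn't modeled here.

```python
# Daily invocation cost at 100K events/day for various batch sizes.
PRICE_PER_REQUEST = 0.0000002  # $ per Lambda invocation
EVENTS_PER_DAY = 100_000

def daily_invocation_cost(batch_size):
    """Return (invocations, dollar cost) for one day of events."""
    invocations = EVENTS_PER_DAY // batch_size
    return invocations, invocations * PRICE_PER_REQUEST

for size in (1, 10, 100):
    n, cost = daily_invocation_cost(size)
    print(f"batch={size:>3}  invocations={n:>7,}  cost=${cost:.4f}")
```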
SQS batching windows (the MaximumBatchingWindowInSeconds setting on the event source mapping) let Lambda wait briefly to accumulate larger batches during high throughput, reducing invocation count 30-50% with zero code changes. Enable this on high-volume event consumers.
API Gateway is the HTTP entry point for Lambda. Costs differ dramatically by API type.
| API Type | Cost per Million Requests | Features | Latency |
|---|---|---|---|
| REST API | $3.50 | Full transformations, authorizers, caching | 35-100 ms |
| HTTP API | $0.60 | Basic routing, JWT auth, limited transforms | 5-10 ms |
| Application Load Balancer (ALB) | $0.225 | Basic HTTP routing, path/host rules | 1-5 ms |
| Function URL | $0 | Direct Lambda invocation, no API routing | 0-2 ms |
Migration path for high-volume APIs:
API Gateway caching stores responses for a configurable TTL (time-to-live). If 80% of requests are identical, caching cuts Lambda invocations by 80%. Note that the cache itself carries an hourly charge based on cache size.
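A quick comparison script, using the per-million prices from the table above. Note that ALB also bills an hourly charge plus LCUs, which this sketch ignores.

```python
# Monthly front-door cost at 500M requests, plus the effect of caching.
prices_per_million = {
    "REST API": 3.50,
    "HTTP API": 0.60,
    "ALB": 0.225,          # request component only; hourly/LCU charges excluded
    "Function URL": 0.0,
}
requests_m = 500  # millions of requests per month

costs = {api: p * requests_m for api, p in prices_per_million.items()}
# REST ~$1,750 vs HTTP ~$300 at this volume.

# With an 80% cache hit ratio, only 20% of requests reach Lambda at all.
cache_hit_ratio = 0.80
lambda_invocations_m = requests_m * (1 - cache_hit_ratio)  # 100M reach Lambda
```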
Lambda logs to CloudWatch by default. Without retention policies, logs are stored indefinitely. CloudWatch Logs costs $0.50 per GB ingested and $0.03 per GB-month for storage.
| Log Volume | Retention | Monthly Cost |
|---|---|---|
| 10M invocations, 5 KB logs | Never expires | $25 ingestion + storage growing ~$1.50/month, forever |
| 10M invocations, 5 KB logs | 14 days | $25 ingestion only |
| 10M invocations, 1 KB logs | 7 days (production) | $5 ingestion only |
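The difference between capped and uncapped retention can be modeled directly. This is a simplified sketch; CloudWatch bills storage on compressed bytes, so real numbers run lower.

```python
# CloudWatch Logs cost model: ingestion is fixed per month; storage without a
# retention policy accumulates forever.
INGEST_PER_GB = 0.50
STORE_PER_GB_MONTH = 0.03

def log_costs(gb_per_month, months_elapsed, retention_months=None):
    """Return (monthly ingestion $, current monthly storage $)."""
    ingestion = gb_per_month * INGEST_PER_GB
    retained_gb = gb_per_month * min(months_elapsed,
                                     retention_months or months_elapsed)
    return ingestion, retained_gb * STORE_PER_GB_MONTH

# 50 GB/month (10M invocations x 5 KB), no retention policy, after 2 years:
ingest, store = log_costs(50, 24)                   # $25 ingestion, $36/mo storage
# Same volume with ~14-day (0.5-month) retention:
ingest14, store14 = log_costs(50, 24, retention_months=0.5)  # storage ~$0.75/mo
```

In practice the fix is one command per log group, e.g. `aws logs put-retention-policy --log-group-name <group> --retention-in-days 14`.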
Log optimization tactics:
Native AWS Cost Explorer has limited serverless granularity. Third-party tools provide real-time cost insights:
Google Cloud Run defaults to scale-to-zero but allows configuration of minimum instances.
| Configuration | Min Instances | Always-On Cost/Month (1 vCPU, 512 MB) | Cold Start |
|---|---|---|---|
| Scale-to-zero | 0 | $0 | 1-2 seconds |
| Min 1 instance | 1 | $5.76 | < 100 ms |
| Min 5 instances | 5 | $28.80 | < 50 ms |
Cloud Run's minimum-instance pricing is in the same ballpark as Lambda Provisioned Concurrency at comparable memory sizes, and still adds up at higher instance counts. Use minimum instances only for user-facing APIs requiring predictable latency.
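If you do need warm instances, the setting is a single flag on the service; the service name and region below are placeholders.

```shell
# Keep one instance warm for a user-facing Cloud Run service
# ("my-api" and the region are placeholders):
gcloud run services update my-api --region=us-central1 --min-instances=1

# Revert to scale-to-zero for batch/async services:
gcloud run services update my-api --region=us-central1 --min-instances=0
```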
Effective serverless cost optimization requires a structured approach:
Negotiating serverless contracts with AWS, Azure, or GCP?
Our FinOps specialists audit serverless architectures, identify cost traps, and negotiate cloud commitments. Average client saves 40-60% within 3 months.