What is Lambda concurrency?
What is concurrency?
Concurrency is the number of requests that your function is serving at any given time.
If the function is invoked again while a request is still being processed, another instance is allocated, which increases the function’s concurrency.
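As a rough illustration of the definition above, average concurrency can be estimated as the request rate multiplied by the average request duration. This formula is a sizing rule of thumb brought in for illustration, not something the notes state, and the function name `estimate_concurrency` is hypothetical.

```python
import math


def estimate_concurrency(requests_per_second: float, avg_duration_seconds: float) -> int:
    """Estimate how many requests are in flight at once, rounded up.

    Assumption: steady traffic, so concurrency ~= rate * duration.
    """
    return math.ceil(requests_per_second * avg_duration_seconds)


# 100 requests per second, each taking 0.5 s, keeps about 50 instances busy.
print(estimate_concurrency(100, 0.5))
```

Real traffic is bursty, so a peak rate rather than an average is usually the safer input.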
Provisioned concurrency
You can enable provisioned concurrency for your Lambda functions for greater control over the performance of your serverless applications. Billing for provisioned concurrency is calculated from the time you enable it on your function until it is disabled, rounded up to the nearest 5 minutes.
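The 5-minute rounding can be sketched as follows. This is a minimal illustration that assumes the rounding applies to the whole enabled period. The helper name `billed_duration` and the timestamps are hypothetical, and no prices are modelled.

```python
from datetime import datetime, timedelta

# Assumption: the enabled period is rounded up in 5-minute increments.
BILLING_INCREMENT = timedelta(minutes=5)


def billed_duration(enabled_at: datetime, disabled_at: datetime) -> timedelta:
    """Return the enabled period rounded up to the nearest 5 minutes."""
    elapsed = disabled_at - enabled_at
    if elapsed <= timedelta(0):
        return timedelta(0)
    # divmod on two timedeltas yields (whole increments, leftover timedelta).
    whole, remainder = divmod(elapsed, BILLING_INCREMENT)
    if remainder:
        whole += 1  # any partial increment counts as a full one
    return whole * BILLING_INCREMENT


start = datetime(2024, 1, 1, 9, 0)
print(billed_duration(start, start + timedelta(minutes=12)))
```

Under this assumption, 12 minutes of enabled provisioned concurrency is treated as 15 minutes.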
- By allocating provisioned concurrency before an increase in invocations, you can ensure that all requests are served by initialized instances with very low latency.
- Lambda also integrates with Application Auto Scaling.
- To increase provisioned concurrency automatically as needed, use the Application Auto Scaling API to register a target and create a scaling policy.
- If the amount of provisioned concurrency on a function’s versions and aliases adds up to the function’s reserved concurrency, all invocations run on provisioned concurrency.
- You can manage provisioned concurrency for all aliases and versions from the function configuration page.
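The two Application Auto Scaling steps mentioned in the list (register a target, then create a scaling policy) might look like the sketch below. The function name `my-function`, the alias `BLUE`, the policy name, the capacity bounds, and the 0.7 utilization target are all hypothetical values chosen for illustration. The request parameters are built as plain dictionaries so they can be inspected without AWS access.

```python
# Hypothetical identifiers, for illustration only.
FUNCTION_NAME = "my-function"
ALIAS = "BLUE"


def scalable_target_params(function_name, alias, min_capacity, max_capacity):
    """Build arguments for Application Auto Scaling's RegisterScalableTarget."""
    return {
        "ServiceNamespace": "lambda",
        "ResourceId": f"function:{function_name}:{alias}",
        "ScalableDimension": "lambda:function:ProvisionedConcurrency",
        "MinCapacity": min_capacity,
        "MaxCapacity": max_capacity,
    }


def target_tracking_policy_params(function_name, alias, target_utilization):
    """Build arguments for PutScalingPolicy using target tracking."""
    return {
        "PolicyName": f"{function_name}-{alias}-utilization",  # hypothetical name
        "ServiceNamespace": "lambda",
        "ResourceId": f"function:{function_name}:{alias}",
        "ScalableDimension": "lambda:function:ProvisionedConcurrency",
        "PolicyType": "TargetTrackingScaling",
        "TargetTrackingScalingPolicyConfiguration": {
            "TargetValue": target_utilization,
            "PredefinedMetricSpecification": {
                "PredefinedMetricType": "LambdaProvisionedConcurrencyUtilization",
            },
        },
    }


def apply_scaling(client, target, policy):
    """Send both requests through an Application Auto Scaling client."""
    client.register_scalable_target(**target)
    client.put_scaling_policy(**policy)


target = scalable_target_params(FUNCTION_NAME, ALIAS, min_capacity=1, max_capacity=100)
policy = target_tracking_policy_params(FUNCTION_NAME, ALIAS, target_utilization=0.7)
```

With boto3 installed and credentials configured, `client` would be `boto3.client("application-autoscaling")`, and `apply_scaling(client, target, policy)` would submit both requests.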
Reserved concurrency
To ensure that a function can always reach a certain level of concurrency, you can configure the function with reserved concurrency. When a function has reserved concurrency, no other function can use that concurrency.
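The effect of reserving concurrency can be pictured as carving a slice out of a shared pool. The sketch below is a simplified model, not Lambda's actual accounting: the account limit of 1000 and the function names are assumptions for illustration, and `remaining_unreserved` is a hypothetical helper.

```python
from typing import Mapping

# Assumed account-level concurrency limit, for illustration only.
ACCOUNT_LIMIT = 1000


def remaining_unreserved(account_limit: int, reserved: Mapping[str, int]) -> int:
    """Return the concurrency left for functions without a reservation."""
    total_reserved = sum(reserved.values())
    if total_reserved > account_limit:
        raise ValueError("reservations exceed the account limit")
    return account_limit - total_reserved


# Hypothetical functions and their reserved concurrency.
reservations = {"checkout": 200, "thumbnailer": 50}
print(remaining_unreserved(ACCOUNT_LIMIT, reservations))
```

In this model `checkout` can always scale to 200 concurrent requests, and every function without a reservation shares the remaining 750. With boto3, a reservation is set through the Lambda client's `put_function_concurrency` call, passing `FunctionName` and `ReservedConcurrentExecutions`.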