Amazon Bedrock announces support for cost allocation tags on inference profiles

November 6, 2024

This content originally appeared on DEV Community and was authored by Karthik Sakthivel

What’s new at AWS

Amazon Bedrock now supports customers to allocate and track on-demand foundation model usage.

With this, customers can categorize their GenAI inference costs by department, team, or application using AWS cost allocation tags.

What is Amazon Bedrock:
It is a fully managed service that offers a choice of high-performing foundation models from leading AI companies via a single API.
It also provides a broad set of capabilities such as security, privacy, and responsible AI capabilities built in.

These capabilities help customer to build tailored applications for multiple use cases across different industries.

Importantly it is helping organizations by ensuring customer trust and data governance.

You can leverage this feature by creating an application inference profile and tagging it.

What is Inference profiles:
These profiles are a resource in Amazon Bedrock that define a model and one or more Regions
Inference profile can route model invocation requests.

Types of inference profiles:
1⃣ Cross region inference profiles
2⃣ Application inference profiles

When to use inference profiles:
Track usage metrics
Use tags to monitor costs
Cross-region inference

Explore more about cross-region inference profiles:
https://aws.amazon.com/blogs/machine-learning/getting-started-with-cross-region-inference-in-amazon-bedrock/