This content originally appeared on DEV Community and was authored by Karthik Sakthivel
What’s new at AWS
Amazon Bedrock now supports customers to allocate and track on-demand foundation model usage.
With this, customers can categorize their GenAI inference costs by department, team, or application using AWS cost allocation tags.
What is Amazon Bedrock:
It is a fully managed service that offers a choice of high-performing foundation models from leading AI companies via a single API.
It also provides a broad set of capabilities such as security, privacy, and responsible AI capabilities built in.
These capabilities help customer to build tailored applications for multiple use cases across different industries.
Importantly it is helping organizations by ensuring customer trust and data governance.
You can leverage this feature by creating an application inference profile and tagging it.
What is Inference profiles:
These profiles are a resource in Amazon Bedrock that define a model and one or more Regions
Inference profile can route model invocation requests.
Types of inference profiles:
1⃣ Cross region inference profiles
2⃣ Application inference profiles
When to use inference profiles:
Track usage metrics
Use tags to monitor costs
Cross-region inference
Explore more about cross-region inference profiles:
https://aws.amazon.com/blogs/machine-learning/getting-started-with-cross-region-inference-in-amazon-bedrock/
This content originally appeared on DEV Community and was authored by Karthik Sakthivel