Cloud Provider Surfaces

This curriculum teaches AI engineering through direct provider APIs (OpenAI, Gemini, Anthropic), hosted inference platforms (Hugging Face, Ollama Cloud, GitHub Models), and local runtimes (Ollama). But many teams access the same models through cloud platform surfaces (AWS Bedrock, Google Vertex AI, or Azure AI) rather than calling provider APIs directly.

If that's your situation, this reference explains what changes and what doesn't.

One important learning from building this curriculum: do not flatten all non-direct paths into one bucket. Bedrock and Vertex AI are cloud provider surfaces for models that already have their own direct APIs. GitHub Models is different: it is a hosted inference platform with its own catalog and auth surface. It may expose some publishers and not others on a given account or org.

The key distinction

A cloud provider surface is not a separate model family. It's a different API surface for accessing the same models. Claude on Bedrock is still Claude. Gemini on Vertex AI is still Gemini. The model behavior, prompt engineering, and every concept in this curriculum apply unchanged.

What changes:

Authentication: cloud IAM credentials instead of provider API keys
API shape: cloud-specific SDKs and endpoints instead of provider SDKs
Model IDs: cloud-specific identifiers (e.g., anthropic.claude-sonnet-4-6 on Bedrock)
Operational controls: region selection, VPC networking, compliance guardrails
Billing: through your cloud account, not the provider directly

What stays the same:

Prompt engineering, prompt contracts, temperature, structured outputs
Retrieval patterns, chunking, embeddings, reranking
Tool calling, agent loops, MCP
Evals, benchmarks, run logs
Orchestration, memory, optimization

A useful contrast: GitHub Models is not the same kind of surface

GitHub Models came up repeatedly while I was building these lessons, so it is worth placing it next to Bedrock and Vertex AI even though it is a different kind of platform.

Bedrock / Vertex AI: cloud deployment surfaces for models that already have direct provider identities. Claude on Bedrock is still Claude. Gemini on Vertex AI is still Gemini.
GitHub Models: a hosted inference catalog behind GitHub auth. You call the models GitHub exposes in that catalog, using publisher/model IDs and GitHub's inference endpoint.

That distinction matters in practice:

If you want Claude, Bedrock is a real cloud-platform route to Claude even when you are not using Anthropic's direct API.
If you want Gemini, Vertex AI is a real cloud-platform route to Gemini even when you are not using Google's direct developer API.
If you want to use GitHub Models, check the live catalog your account or organization can actually see. Do not assume it includes every publisher you might want.

On the GITHUB_TOKEN I validated on April 1, 2026, the visible GitHub Models publishers were:

OpenAI
AI21 Labs
Cohere
DeepSeek
Meta
Microsoft
Mistral AI
xAI

On that token, I did not see Anthropic or Google models in the GitHub Models catalog. That means GitHub Models was a good fit for hosted OpenAI-shaped examples in this curriculum, but it was not a path I could honestly use to teach Claude-via-GitHub or Gemini-via-GitHub.

AWS Bedrock

When to use Bedrock

You already work in AWS, need IAM-based access control, want Claude through AWS billing, or have compliance requirements that need VPC endpoints and AWS CloudTrail logging.

Setup

pip install boto3

Bedrock uses AWS credentials (IAM), not ANTHROPIC_API_KEY. Configure your AWS credentials via aws configure, environment variables, or an IAM role.

You may also need to request model access in the Bedrock console before you can call a model. This is a one-time step per model per region.

First model call

Use the Bedrock Converse API, which is the closest match for the patterns we're using:

import boto3

client = boto3.client("bedrock-runtime", region_name="us-east-1")

response = client.converse(
    modelId="anthropic.claude-sonnet-4-6",
    messages=[
        {
            "role": "user",
            "content": [{"text": "Explain tokenization in two short paragraphs."}],
        }
    ],
    inferenceConfig={
        "maxTokens": 300,
        "temperature": 0.2,
    },
)

print(response["output"]["message"]["content"][0]["text"])

Translating Anthropic examples to Bedrock

When a lesson shows Anthropic code, apply this mapping:

Anthropic SDK	Bedrock Converse API
`from anthropic import Anthropic`	`import boto3`
`client = Anthropic()`	`client = boto3.client("bedrock-runtime", region_name="us-east-1")`
`client.messages.create(model="claude-sonnet-4-6", ...)`	`client.converse(modelId="anthropic.claude-sonnet-4-6", ...)`
`response.content[0].text`	`response["output"]["message"]["content"][0]["text"]`
`ANTHROPIC_API_KEY` env var	AWS credentials (IAM)

Tool calling follows the same pattern. Bedrock Converse supports tool definitions and tool results with a shape that maps closely to Anthropic's tool-use API.

Bedrock-specific considerations

Model availability varies by region. Check the supported models list for your region.
Fine-tuning on Bedrock is the only current path for Claude fine-tuning. Anthropic does not offer fine-tuning through their direct API. This is noted in Module 8: Distillation.
Prompt caching on Bedrock may differ from Anthropic's direct API caching behavior.

Google Vertex AI

When to use Vertex AI

You already work in Google Cloud, want Gemini models through GCP billing and IAM, or need Google Cloud's MLOps tooling (pipelines, model registry, feature store) alongside your model calls.

Setup

pip install google-cloud-aiplatform

Vertex AI uses Google Cloud credentials (service account or application default credentials), not provider API keys directly.

First model call

import vertexai
from vertexai.generative_models import GenerativeModel

vertexai.init(project="your-project-id", location="us-central1")

model = GenerativeModel("gemini-2.5-pro")
response = model.generate_content("Explain tokenization in two short paragraphs.")

print(response.text)

Vertex AI considerations

Vertex AI also hosts third-party models (Claude, Llama, Mistral) through its Model Garden
The API surface differs from each provider's native SDK
Billing, quotas, and model availability are managed through GCP

How to use this reference with the curriculum

Pick your cloud surface from this page
Follow the Anthropic tab (for Bedrock) or Gemini tab (for Vertex AI with Gemini) in each lesson
Apply the translation table from this page to convert SDK calls
Use the provider's own docs for prompt engineering and model behavior, since those concepts transfer directly regardless of which API surface you call through

The curriculum's concepts (retrieval, evals, grounding, orchestration, memory, optimization) are entirely independent of which API surface you use. The surface is plumbing. The engineering is the same.

Cloud Provider Surfaces

The key distinction

A useful contrast: GitHub Models is not the same kind of surface

AWS Bedrock

When to use Bedrock

Setup

First model call

Translating Anthropic examples to Bedrock

Bedrock-specific considerations

Google Vertex AI

When to use Vertex AI

Setup

First model call

Vertex AI considerations

How to use this reference with the curriculum

References