Skip to content

AIP-C01: Exam Guide โ€‹

โ† Overview ยท Cheatsheet โ†’

How the Exam Wants You to Think โ€‹

The AIP-C01 is a Professional-level exam for developers building GenAI applications on AWS. It values architectural reasoning, cost-aware design, and responsible AI governance.

Answer Philosophy โ€‹

  1. Architecture first: Prefer the AWS-managed, purpose-built service over a custom-built solution.
  2. Cost-aware: On-demand for sporadic traffic, PTU for steady 24/7 workloads, and batch inference for non-urgent bulk jobs.
  3. Security by default: Prefer private connectivity, least-privilege IAM, and Guardrails.
  4. RAG over fine-tuning: When knowledge is dynamic or traceability matters, RAG is almost always the right answer.

Keyword Detection Table โ€‹

If you see...Look for this in the answer...
"Large context window needed"Claude (Anthropic)
"Open-source / custom fine-tuning needed"Llama (Meta)
"Efficient inference at lower cost"Mistral
"AWS-native embeddings for RAG"Titan Text Embeddings
"Generate summaries / conversational text with an AWS-native model"Amazon Titan Text
"Convert text to vectors for semantic search"Amazon Titan Embeddings
"Analyze images/video with low ops and summarize trends in dashboards"Bedrock multimodal FM + Step Functions + QuickSight
"Document Q&A / knowledge retrieval"Knowledge Bases (RAG)
"Multi-step reasoning / tool use / external API calls"Bedrock Agents + Action Groups
"Unified message-based API across models"Converse API
"Full synchronous response"InvokeModel
"Model-specific request JSON / schema varies by FM"InvokeModel
"Streaming / low-latency UX / chat interface"InvokeModelWithResponseStream
"Persistent bidirectional real-time connection / server push"API Gateway WebSocket API
"Vector store for RAG"Amazon OpenSearch Serverless
"Store embeddings + perform low-latency similarity / k-NN search"Amazon OpenSearch Service / Serverless Vector Engine
"Validate structured data quality before RAG / FM ingestion"AWS Glue Data Quality
"Fully managed RAG with minimal custom pipeline code"Bedrock Knowledge Bases
"Peak-period throttling, same FM, keep Bedrock API compatibility, low ops overhead"Cross-Region inference profile
"Content moderation / block harmful content"Guardrails for Amazon Bedrock
"Block a business-prohibited subject like investment advice"Guardrails Denied Topics
"SageMaker LLM endpoint underutilized: short real sequence lengths, low concurrency, too many GPUs"Reduce max sequence length + adjust tensor parallelism
"Predictable throughput / guaranteed no throttling"Provisioned Throughput (PTU)
"Bulk, non-real-time inference jobs"Batch inference
"Audit trail / compliance logging"AWS CloudTrail
"HIPAA + full control over encryption keys"Customer-managed KMS keys + CloudTrail
"Operational metrics / alarms / dashboards"Amazon CloudWatch
"Private connectivity to Bedrock"VPC Endpoint (PrivateLink) โ€” bedrock-runtime
"Knowledge changes frequently / need source citations"RAG
"Detect hallucinations in RAG output"Groundedness metric
"Sequential evaluation stages with approval gates before model promotion"Step Functions + Bedrock Model Evaluation / A-B comparison
"Log all prompts and responses for AI governance"Model Invocation Logging
"FM inference may exceed 15 minutes"ECS / Bedrock Batch โ€” not Lambda
"API response time could exceed 29 seconds"Async submit-and-poll โ€” not synchronous API Gateway
"Human approval needed for hours or days"Step Functions task token (Standard Workflow)
"Audit which S3 buckets contain PII before RAG ingestion"Amazon Macie
"150-page document needs to fit in one prompt"Claude (200k token context window)

Exam Traps โ€‹

Look out for these!

  • RAG vs. Fine-tuning: RAG is right when knowledge changes frequently or you need source attribution.
  • Guardrails scope: Guardrails filter BOTH inputs AND outputs.
  • Guardrails must be explicitly applied: They do NOT apply automatically to all Bedrock calls.
  • Guardrails vs. Knowledge Bases: Knowledge Bases improve relevance with retrieval; Guardrails enforce safety policy.
  • Denied topics vs. word filters: denied topics block a subject semantically; word filters block exact words and can over-block.
  • OpenSearch Serverless vs. managed OpenSearch: Knowledge Bases use OpenSearch Serverless.
  • Converse vs. InvokeAgent: Converse is a unified model interaction API, not full agent orchestration.
  • InvokeModel vs. Converse: InvokeModel request bodies can vary by model family; Converse reduces that variation with a more consistent message-based format.
  • InvokeModel vs. InvokeModelWithResponseStream: Streaming does NOT save money.
  • CloudTrail vs. CloudWatch: CloudTrail = audit; CloudWatch = operations.
  • PTU for sporadic traffic: PTU is NOT appropriate for development or unpredictable workloads.
  • Bedrock does not train on customer data: mark any answer claiming otherwise as incorrect.
  • Knowledge Base sync: After updating S3 content, you must manually trigger a sync.

Decision Quick Reference โ€‹

"Which FM?" โ€‹

text
Large context window?    โ†’ Claude
Open-source fine-tune?   โ†’ Llama
Efficient inference?     โ†’ Mistral
Embeddings?              โ†’ Titan

"RAG or fine-tuning?" โ€‹

text
Knowledge changes often? โ†’ RAG
Need source traceability? โ†’ RAG
Need new style/format?   โ†’ Fine-tuning

"Knowledge Base or Bedrock Agents?" โ€‹

text
Simple document Q&A?     โ†’ Knowledge Base
Multi-step + tool use?   โ†’ Bedrock Agents

"Which compute for Bedrock?" โ€‹

text
Per-request, < 15 min?          โ†’ Lambda
Long-running / > 15 min?        โ†’ ECS / EKS
Bulk async, non-real-time?      โ†’ Bedrock Batch Inference
Human approval, hours/days?     โ†’ Step Functions (Standard)

"Service limits to remember" โ€‹

text
Lambda max execution            โ†’ 15 minutes
API Gateway integration timeout โ†’ 29 seconds (hard limit)
Step Functions Standard max     โ†’ 1 year
Step Functions Express max      โ†’ 5 minutes
Claude context window           โ†’ 200,000 tokens
Titan context window            โ†’ shorter (varies)

โ† Overview ยท Cheatsheet โ†’

Happy Studying! ๐Ÿš€ โ€ข Privacy-friendly analytics โ€” no cookies, no personal data
Privacy Policy โ€ข AI Disclaimer โ€ข Report an issue