Skip to content

GCP-GAIL: Cheatsheet โ€‹

โ† Overview ยท โ† Exam Guide

Exam Day Reference

Print this or review 5 minutes before the exam.


Vertex AI Components โ€‹

ComponentPurpose
Model GardenBrowse and select AI models
Vertex AI StudioTest prompts, adjust parameters
Vector SearchHigh-scale embeddings search (RAG)
AutoSxSCompare model outputs side-by-side
Embeddings APIGenerate text embeddings

Gemini Models โ€‹

ModelContext WindowBest For
Gemini 1.5 Pro1M+ tokensLong documents, complex reasoning
Gemini Pro32K tokensGeneral tasks
Gemini FlashFastLow latency, simple tasks
Gemini UltraVariesMaximum capability

Key: Gemini is multimodal (text, images, video, audio)


Core Parameters โ€‹

ParameterEffectUse Case
TemperatureControls randomnessHigh (0.8+) for creative writing; Low (0.1) for technical data.
Top-KLimits vocabulary to K wordsPrevents the model from picking highly unlikely "long tail" words.
Top-PDynamic vocabulary based on probabilitySamples from smallest set of words whose cumulative probability is P.

Model Garden Providers โ€‹

ProviderModelsType
GoogleGemini, Imagen, Codey, ChirpFirst-party
Open-sourceLlama, Mistral, GemmaOpen models
Third-partyClaude (Anthropic)Partner models

Gemma = Google's open-weight model (can download and run anywhere)


Customization Ladder (3 Steps) โ€‹

  1. Prompt Design (cheapest, fastest)

    • Zero-shot: No examples
    • Few-shot: 3-5 examples
  2. Grounding/RAG (current data)

    • Google Search
    • BigQuery
    • Document AI
  3. Fine-Tuning (100+ examples)

    • Supervised Fine-Tuning (SFT)
    • Dataset in Cloud Storage (JSONL)

Always try in this order!


RAG Workflow (4 Steps) โ€‹

  1. Generate embeddings (Vertex AI Embeddings API)
  2. Store in Vector Search (Matching Engine)
  3. Retrieve similar documents
  4. Ground LLM with context

Analogy: Library where similar books are on same shelf


Grounding Sources (3) โ€‹

  • Google Search โ€” current events, web data
  • BigQuery โ€” structured data, analytics
  • Document AI โ€” your own documents/PDFs

Purpose: Connect model to verifiable source of truth


Vertex AI Studio Tabs โ€‹

TabPurpose
LanguageTest prompts, adjust temperature, export code
VisionGenerate images (Imagen 2), visual Q&A
SpeechText-to-speech, speech-to-text (Chirp)

AutoSxS (Evaluation) โ€‹

Problem: How to know if Model A > Model B?

Solution: Objective "judge" model compares outputs

Use for: Model selection, prompt comparison


Deployment & Monitoring โ€‹

ConceptMeaning
EndpointDeployed model accessible via API
Model DriftPerformance degrading over time
MonitoringTrack drift, hallucination rates

Key: Models must be deployed to Endpoint for production use


Data Privacy (3 Rules) โ€‹

  1. Google does NOT train foundation models on customer data
  2. Data stays in your GCP project
  3. Respects IAM permissions

Compliance: SOC 2, ISO 27001, GDPR


Limits & Quotas โ€‹

ResourceDefault
Max Output TokensVaries by model (2048-8192)
Context WindowGemini 1.5 Pro: 1M+ tokens
Gemini CapabilitiesText, images, video, audio

Prompt Engineering Tips โ€‹

Zero-shot: "Summarize this."

Few-shot: Give 3-5 input/output examples

Chain-of-Thought: "Think step by step"

Best Practices:

  • Be specific
  • Provide context
  • Specify output format
  • Add constraints

Common Mistakes to Avoid โ€‹

  • โŒ Fine-tuning for current data (use grounding)
  • โŒ Fine-tuning for simple tasks (use prompting)
  • โŒ Ignoring context window limits
  • โŒ Not using grounding for factual accuracy
  • โŒ Forgetting data privacy guarantees

Decision Trees โ€‹

"Which customization?" โ€‹

Simple task โ†’ Zero-shot
Custom format โ†’ Few-shot
Current data โ†’ Grounding
100+ examples โ†’ Fine-Tuning

"Which component?" โ€‹

Explore โ†’ Model Garden
Test โ†’ Vertex AI Studio
RAG โ†’ Vector Search
Compare โ†’ AutoSxS

"Which model?" โ€‹

Multimodal โ†’ Gemini
Long context โ†’ Gemini 1.5 Pro
Fast โ†’ Gemini Flash
Open-source โ†’ Llama/Mistral
Google open โ†’ Gemma

โ† Overview ยท โ† Exam Guide

Last Updated: 2026-02-05

Happy Studying! ๐Ÿš€ โ€ข Privacy-friendly analytics โ€” no cookies, no personal data
Privacy Policy โ€ข AI Disclaimer โ€ข Report an issue