Skip to content

Domain 3: Implement AI capabilities in database solutions (25-30%) โ€‹

โ† Domain 2 ยท Back to Overview

This is the DP-800 domain that makes the cert unique. It connects SQL development with modern AI retrieval patterns.


3.1 Design and implement models and embeddings โ€‹

External model thinking โ€‹

The study guide expects you to evaluate models by:

  • modality
  • language support
  • model size
  • structured output support

Embedding pipeline โ€‹

text
Relational or semi-structured source data
  -> select columns to represent meaning
  -> split content into chunks
  -> generate embeddings
  -> store vectors
  -> refresh embeddings when source data changes

Key design decisions โ€‹

DecisionWhy it matters
Which columns to embedBad input columns produce bad semantic retrieval
Chunk sizeToo large hurts precision; too small loses context
Refresh methodMust match data volatility and operational complexity
Model choiceAffects multilingual support, cost, latency, and output quality

Embedding maintenance methods called out in the guide โ€‹

  • table triggers
  • Change Tracking
  • CDC
  • CES
  • Azure Functions with SQL trigger binding
  • Azure Logic Apps
  • Microsoft Foundry

TIP

If the source data changes frequently, the correct answer usually includes a repeatable embedding refresh path, not one-time manual generation.


Search modes โ€‹

ModeBest for
Full-text searchKeyword/token matching
Semantic vector searchMeaning-based similarity
Hybrid searchBlend keywords with semantic similarity

SQL/vector concepts to know โ€‹

  • vector data type
  • vector indexes
  • vector size and dimensionality
  • vector distance and normalization functions
  • ANN vs ENN search behavior
  • reciprocal rank fusion (RRF)

ANN vs ENN โ€‹

OptionTrade-off
ANNFaster, approximate nearest neighbors, better for scale
ENNExact nearest neighbors, higher precision, usually slower

Hybrid search matters because enterprise SQL use cases often need both:

  • exact domain terms, codes, and identifiers
  • semantic similarity over meaning

Reciprocal Rank Fusion โ€‹

RRF combines ranked results from multiple retrieval methods, typically full-text and vector search, into one final ranked list. If the exam mentions combining multiple ranked result sets into a better blended answer, think RRF.


3.3 Design and implement retrieval-augmented generation (RAG) โ€‹

RAG flow in a SQL-centered app โ€‹

  1. Retrieve relevant rows or chunks.
  2. Convert the context into JSON or structured prompt text.
  3. Send it to the model.
  4. Capture and parse the response.

SQL-specific clue โ€‹

The official guide explicitly mentions using sp_invoke_external_rest_endpoint to create prompts and send results to a language model.

That means you should be ready to think in this pattern:

  • SQL retrieves data
  • SQL shapes data into JSON
  • SQL calls external AI endpoint
  • app or SQL layer parses the response

When RAG is the right answer โ€‹

  • Facts change often
  • Grounding must come from current enterprise data
  • You need retrieval traceability
  • Fine-tuning would be too slow, costly, or stale

RAG Trap

If the requirement is "use the latest database content" or "include current enterprise records", do not jump to fine-tuning. The exam will usually want retrieval + prompt grounding instead.


Quick Decision Rules โ€‹

Need keyword-only precision?
Use full-text search.

Need meaning-based similarity?
Use vector search.

Need both business keywords and semantic relevance?
Use hybrid search, often with RRF.

Need generative answers grounded in current SQL data?
Use RAG.

Need high recall at scale with faster search?
Lean toward ANN.

Need exact nearest-neighbor correctness and can afford cost?
Lean toward ENN.


Fast Recall โ€‹

  • Embeddings turn content into vectors
  • Chunking controls retrieval quality
  • ANN is approximate and fast
  • ENN is exact and slower
  • Hybrid search = keyword + semantic
  • RRF blends ranked results
  • sp_invoke_external_rest_endpoint is the SQL-native exam clue for external model invocation

โ† Domain 2 ยท Back to Overview

Happy Studying! ๐Ÿš€ โ€ข Privacy-friendly analytics โ€” no cookies, no personal data
Privacy Policy โ€ข AI Disclaimer โ€ข Report an issue