Skip to content

Domain 5: Implement natural language processing solutions (15-20%) โ€‹

โ† Domain 4 ยท Domain 6 โ†’


This domain covers analyzing text and speech using the Azure AI Language and Azure AI Speech services. At 15โ€“20% of the exam, it ties with Domains 1 and 6 as the highest-weighted areas โ€” prioritize it.

5.1 Language Service (Analyze Text) โ€‹

Pre-built Text Analytics Features โ€‹

FeatureWhat It DoesExam Trigger Phrase
Sentiment AnalysisReturns positive/negative/neutral at document + sentence level"customer feedback analysis", "sentiment per sentence"
Key Phrase ExtractionIdentifies the main topics in a document"summarize key topics"
NERExtracts people, organizations, locations, dates"extract entities from text"
Entity LinkingDisambiguates named entities using Wikipedia (e.g., "Mercury" โ†’ planet or element)"resolve entity ambiguity"
PII DetectionDetects and can redact personal data (emails, SSN, phone numbers)"remove sensitive data before storage"
Language DetectionIdentifies the language of text"detect language automatically"

Custom Language Models โ€‹

ModelUse CaseTraining Data Needed
Custom NERExtract domain-specific entities (legal terms, medical codes)Labeled examples with entity spans
Custom Text ClassificationCategorize documents into your own labelsLabeled documents per category

NER vs Entity Linking

NER โ†’ identifies entities and their type (Person, Location, etc.) Entity Linking โ†’ disambiguates entities by connecting them to a known knowledge base entry. Both can be used together.


5.2 Conversational Language Understanding (CLU) โ€‹

CLU replaces the older LUIS service. It turns spoken or typed utterances into structured intents + entities.

Core Concepts โ€‹

ConceptDefinitionExample
UtteranceWhat the user says"Set an alarm for 7am tomorrow"
IntentThe user's goalSetAlarm
EntityA parameter extracted from the utteranceTime = 7am, Date = tomorrow
Confidence Score0โ€“1 probability that the model assigned the correct intentUse threshold (e.g., > 0.7) to reject low-confidence calls

CLU Lifecycle โ€‹

Design โ†’ Label utterances โ†’ Train โ†’ Test โ†’ Deploy โ†’ Consume via SDK/REST
  • Deployment slots: Maintain separate production and staging deployments so you can test a new model version before promoting it.
  • Export/Import: Model definition exported as JSON โ€” useful for version control or migrating between projects.

CLU vs Custom QA

CLU โ†’ maps an utterance to an intent + entities (structured output). Use when your app needs to take action based on what the user wants. Custom QA โ†’ matches a question to a stored answer (Q&A pair output). Use when your app needs to return information from a knowledge base.


5.3 Custom Question Answering โ€‹

Custom QA (formerly QnA Maker) builds a knowledge base of Q&A pairs and returns the best matching answer to a user question.

Knowledge Base Sources โ€‹

Source TypeExample
URLsPublic FAQ web pages, product documentation
FilesPDF manuals, Word documents, Excel/TSV Q&A spreadsheets
Manual entryDirectly authored Q&A pairs

Key Features โ€‹

FeatureWhat It Does
Multi-turn conversationsAdds follow-up prompts to an answer (e.g., "Did that help?" โ†’ "Yes/No" branches)
Alternate phrasingAdd synonym questions to improve match rate
Chit-chatPre-built personality sets (Professional, Friendly, Witty) for small talk
Active LearningSurfaces low-confidence questions for human review and improvement
Confidence thresholdRejects answers below a set score โ€” prevents wrong answers from being returned

Deployment โ€‹

  1. Create a project in Language Studio or AI Foundry
  2. Add sources and label/edit Q&A pairs
  3. Train and test
  4. Deploy to a named endpoint
  5. Consume via REST API (POST /knowledgebases/{kbId}/generateAnswer)

Multi-turn Q&A

The exam phrase "multi-turn Q&A from documents" maps to Custom Question Answering โ€” not CLU. CLU is for intent detection; Custom QA is for retrieving stored answers.


5.4 Speech Services โ€‹

Core Capabilities โ€‹

CapabilityDescriptionSDK Method
Speech-to-Text (STT)Real-time or batch audio transcriptionSpeechRecognizer
Text-to-Speech (TTS)Synthesize neural voices with natural prosodySpeechSynthesizer
Speech TranslationReal-time STT + translation in one passTranslationRecognizer
Intent RecognitionDetect CLU intents directly from spoken audioIntentRecognizer with CLU model
Keyword RecognitionLocal on-device detection of a wake wordKeywordRecognizer

Intent Recognition vs Keyword Recognition

Intent Recognition โ†’ understands what the user wants (requires CLU model, cloud call). Keyword Recognition โ†’ detects a specific activation word (e.g., "Hey Cortana") โ€” runs locally on device, no cloud needed.

The exam uses "recognize spoken intent / commands" โ†’ Intent Recognition + Speech SDK. The exam uses "wake word / offline activation" โ†’ Keyword Recognition.

Generative Speaking (GenAI Speaking) โ€‹

Combines Azure OpenAI with Speech TTS to produce expressive, context-aware spoken responses โ€” used in AI agents that speak to users in real-time.

Batch Transcription โ€‹

For large audio files that cannot be processed in real-time:

  • Submit files stored in Azure Blob Storage
  • Use an async pattern: submit โ†’ poll status โ†’ retrieve transcript
  • Returns word-level timestamps and speaker diarization

5.5 Translator Service โ€‹

FeatureDescription
Text TranslationTranslate text across 100+ languages in a single API call
TransliterateConvert script without changing language (e.g., Japanese Kanji โ†’ Romaji)
DetectAuto-detect the source language
Dictionary LookupReturns alternate translations and examples
Document TranslationAsync translation of entire Word/PDF files, preserving layout
Custom TranslatorTrain a domain-specific translation model using parallel corpora (TMX/XLIFF files)

Document Translation Pattern

Document Translation is async โ€” submit โ†’ get Operation-Location header โ†’ poll until complete โ†’ download translated files. Same 202 โ†’ GET pattern as the Read API and batch operations.


Flashcards

1 / 8
โ“

(Click to reveal)
๐Ÿ’ก

โ† Domain 4 ยท Domain 6 โ†’

Happy Studying! ๐Ÿš€ โ€ข Privacy-friendly analytics โ€” no cookies, no personal data
Privacy Policy โ€ข AI Disclaimer โ€ข Report an issue