📊 Model Evaluation & Metrics - AIF-C01 Practice Questions

Model evaluation measures how well an ML model performs. Master accuracy, precision, recall, F1 score, AUC-ROC, confusion matrices, BLEU, ROUGE, and human evaluation for generative AI.

7Questions Available
2Exam Domains

Practice Model Evaluation Questions Now

Start a timed practice session focusing on Model Evaluation & Metrics topics from the AIF-C01 question bank.

Start AIF-C01 Practice Quiz →

AIF-C01 Model Evaluation Question Bank (7 Questions)

Browse all 7 practice questions covering Model Evaluation & Metrics for the AIF-C01 certification exam. Answers are intentionally hidden on this page so you can self-test first before checking results in quiz mode.

  1. Question 1Applications of Foundation Models

    A company wants to evaluate AI-generated responses at scale using automated metrics for relevance and accuracy without relying solely on human reviewers. Which approach provides this scalable evaluation?

    AManual human evaluation only
    BUsing LLM-as-judge with a separate evaluation model
    CTesting only on the training dataset
    DComparing response lengths

    Answer hidden for practice.

    Use the interactive quiz to reveal the correct answer and explanation.

    Start AIF-C01 Quiz
  2. Question 2Fundamentals of AI and ML

    Which metric is the harmonic mean of precision and recall?

    AAccuracy
    BAUC-ROC
    CF1 Score
    DRMSE

    Answer hidden for practice.

    Use the interactive quiz to reveal the correct answer and explanation.

    Start AIF-C01 Quiz
  3. Question 3Fundamentals of Generative AI

    Which Amazon Bedrock capability evaluates FM outputs for quality, accuracy, and safety using automated metrics or human reviewers?

    ABedrock Agents
    BBedrock Knowledge Bases
    CBedrock Model Evaluation
    DBedrock Guardrails

    Answer hidden for practice.

    Use the interactive quiz to reveal the correct answer and explanation.

    Start AIF-C01 Quiz
  4. Question 4Fundamentals of AI and ML

    What does 'precision-recall tradeoff' mean in classification?

    AThe tradeoff between model training precision and recall speed
    BAdjusting the decision threshold changes precision and recall inversely — raising the threshold increases precision but decreases recall
    CThe relationship between model accuracy and the number of training examples
    DChoosing between precision floating-point and recall hyperparameters

    Answer hidden for practice.

    Use the interactive quiz to reveal the correct answer and explanation.

    Start AIF-C01 Quiz
  5. Question 5Applications of Foundation Models

    What is 'Retrieval Precision vs. Retrieval Recall' tradeoff in RAG systems?

    AThe tradeoff between how fast and how accurately the vector store retrieves documents
    BRetrieval precision measures the fraction of retrieved documents that are relevant; recall measures the fraction of all relevant documents that are retrieved — tuning K controls this tradeoff
    CThe tradeoff between LLM response quality and retrieval speed
    DPrecision of the embeddings model vs. recall of the LLM

    Answer hidden for practice.

    Use the interactive quiz to reveal the correct answer and explanation.

    Start AIF-C01 Quiz
  6. Question 6Fundamentals of AI and ML

    What is 'recall' vs 'precision' in the context of medical diagnosis AI?

    AThey measure the same thing with different formulas
    BRecall (sensitivity) is critical — missing a real disease (false negative) is more dangerous; precision matters when false positives cause unnecessary treatment
    CPrecision is always more important than recall in healthcare
    DRecall measures image quality; precision measures text quality

    Answer hidden for practice.

    Use the interactive quiz to reveal the correct answer and explanation.

    Start AIF-C01 Quiz
  7. Question 7Applications of Foundation Models

    A startup wants to compare the performance of multiple foundation models on their summarization task before committing to one. Which Amazon Bedrock feature allows side-by-side evaluation?

    AModel versioning
    BAmazon Bedrock Model Evaluation
    CAmazon SageMaker Clarify
    DBedrock Agents

    Answer hidden for practice.

    Use the interactive quiz to reveal the correct answer and explanation.

    Start AIF-C01 Quiz

Key Model Evaluation Concepts for AIF-C01

evaluationmetricaccuracyprecisionrecallf1aucrocconfusion matrixbleurougebenchmark

AIF-C01 Model Evaluation Exam Tips

Model Evaluation & Metrics questions in AIF-C01 are typically scenario-based. Focus on generative AI fundamentals, responsible AI, and foundation model use cases. Priority concepts: evaluation, metric, accuracy, precision, recall, f1.

What AIF-C01 Expects

  • Anchor your answer in identify the safest and most practical AI implementation approach for business goals.
  • Model Evaluation scenarios for AIF-C01 are frequently mapped to Domain 1 (20%), Domain 3 (28%), so read the objective carefully before picking controls or architecture.
  • Expect multi-topic scenarios where Model Evaluation interacts with IAM, networking, storage, or observability patterns rather than appearing as an isolated question.
  • When two options are both technically valid, prefer the choice that best aligns with the exam's operational scope (Foundational) and vendor best practices.

High-Value Model Evaluation Concepts

  • Know the core Model Evaluation building blocks cold: evaluation, metric, accuracy, precision.
  • Review the edge-case features and limits for recall, f1; these details are commonly used to differentiate answer choices.
  • Practice service-integration reasoning: how Model Evaluation pairs with ML Lifecycle, Responsible AI, Supervised Learning in real deployment patterns.
  • For AIF-C01, explain why the chosen Model Evaluation design meets reliability, security, and cost expectations better than the alternatives.

Common AIF-C01 Traps

  • Watch for ignoring data governance and model safety constraints.
  • Questions in Fundamentals of AI and ML often include distractors that look correct for Model Evaluation but violate least-privilege, durability, or availability requirements.
  • Avoid picking options purely by feature name; validate data path, failure handling, and governance impact before answering.
  • If the prompt hints at automation or repeatability, eliminate manual-only operational answers first.

Fast Review Checklist

  • Can you compare at least two Model Evaluation implementation paths and justify which one best fits the scenario?
  • Can you map the chosen answer back to Fundamentals of AI and ML (20%) outcomes for AIF-C01?
  • Can you explain security and access boundaries for Model Evaluation without relying on default-open assumptions?
  • Can you describe how Model Evaluation integrates with ML Lifecycle and Responsible AI during failure, scaling, and monitoring events?

Exam Domains Covering Model Evaluation

Related Resources

More AIF-C01 Study Resources