🃏 Data Preparation Flashcards

Active recall cards for MLA-C01 data ingestion, transformation, quality, and feature preparation.

All Data Preparation Flashcards

1

Q: Which MLA-C01 domain has the largest weighting?

A: Domain 1: Data Preparation for Machine Learning at 28%.

2

Q: Which AWS service catalogs datasets for Athena and Glue ETL?

A: AWS Glue Data Catalog.

3

Q: What is data leakage?

A: Using information during training that would not be available at prediction time.
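One common way leakage creeps in is fitting preprocessing statistics on the full dataset before splitting. The sketch below is only an illustration with made-up numbers: the helper names `fit_scaler` and `transform` are hypothetical, and it uses plain Python rather than any particular AWS or ML library. It contrasts a leaky fit (train plus test) with a leak-free fit (train only).

```python
from statistics import mean, pstdev

def fit_scaler(values):
    """Learn mean and population standard deviation from the given values only."""
    mu = mean(values)
    sigma = pstdev(values) or 1.0  # fall back to 1.0 for a constant column
    return mu, sigma

def transform(values, mu, sigma):
    """Standardize values with previously learned statistics."""
    return [(v - mu) / sigma for v in values]

# Illustrative data (assumed, not from any real dataset).
train = [10.0, 12.0, 14.0, 16.0]
test = [100.0, 120.0]

# Leaky: the statistics see the test rows, which would not exist at training time.
leaky_mu, leaky_sigma = fit_scaler(train + test)

# Leak-free: fit on the training split, then reuse those statistics on the test split.
mu, sigma = fit_scaler(train)
train_scaled = transform(train, mu, sigma)
test_scaled = transform(test, mu, sigma)

print(f"leaky mean={leaky_mu:.2f}, train-only mean={mu:.2f}")
```

The same rule applies to any fitted transform (imputation values, encoders, feature selection): learn it on the training split and only apply it elsewhere.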

4

Q: Which Amazon SageMaker Feature Store storage option supports low-latency inference?

A: The online store.

5

Q: Which file formats are usually preferred for analytics over raw CSV?

A: Columnar formats such as Parquet or ORC.
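To see why a columnar layout helps analytics, the toy sketch below pivots row-oriented records into per-column lists, so an aggregate over one field touches only that field's values. It is a simplified in-memory model with invented sample data and a hypothetical `to_columns` helper; real Parquet and ORC files also add compression, encodings, and metadata that this does not model.

```python
# Row-oriented records, similar to what a CSV reader would yield (sample data is invented).
rows = [
    {"user_id": 1, "country": "DE", "amount": 20.0},
    {"user_id": 2, "country": "US", "amount": 35.5},
    {"user_id": 3, "country": "DE", "amount": 12.5},
]

def to_columns(records):
    """Pivot a list of row dicts into a dict of column lists."""
    columns = {}
    for record in records:
        for name, value in record.items():
            columns.setdefault(name, []).append(value)
    return columns

columns = to_columns(rows)

# Row layout: every full record is visited to read a single field.
total_from_rows = sum(record["amount"] for record in rows)

# Columnar layout: only the "amount" column is read.
total_from_columns = sum(columns["amount"])

print(total_from_rows, total_from_columns)
```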
