PDE Storing & Managing Question Bank (6 Questions)
Browse all 6 practice questions covering Storing and Managing Data for the PDE certification exam. Each question includes the full answer and a detailed explanation to help you understand the concepts.
- Question 1: Ingesting and Processing Data
You need to replicate changes from a Cloud SQL PostgreSQL database to BigQuery in near real-time. What approach should you use?
Correct Answer: B
Explanation: Datastream is a serverless change data capture (CDC) service. Sources: MySQL, PostgreSQL, Oracle, SQL Server, AlloyDB. Destinations: BigQuery, Cloud Storage. Features: continuous replication, schema changes propagated, minimal impact on the source. Setup: create connection profiles (source and destination), then create a stream and select the tables to replicate. Near real-time: seconds to minutes of latency.
- Question 2: Storing and Managing Data
What file format should you use when storing data in Cloud Storage for BigQuery external tables?
Correct Answer: B
Explanation: Parquet and ORC are columnar (only the needed columns are read), compressed (smaller storage), carry an embedded schema, and allow predicate pushdown (BigQuery skips irrelevant row groups). CSV and JSON, by contrast, are row-based (all columns must be scanned), offer no predicate pushdown, and produce larger files. Parquet is the preferred format on GCP. BigQuery native tables use Capacitor (Google's columnar format). For external tables, use Parquet for performance and Avro for schema evolution.
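As a minimal sketch of what defining such an external table might look like, the Python helper below only builds a BigQuery `CREATE EXTERNAL TABLE` statement as a string; it does not call BigQuery. The function name `external_table_ddl`, the table name, and the bucket path are hypothetical placeholders, not part of the question bank.

```python
from __future__ import annotations


def external_table_ddl(table: str, uris: list[str], fmt: str = "PARQUET") -> str:
    """Build a CREATE EXTERNAL TABLE statement for files in Cloud Storage.

    `table` and `uris` are caller-supplied placeholders (assumptions for
    illustration); the statement is returned as text, not executed.
    """
    allowed = {"PARQUET", "ORC", "AVRO", "CSV", "NEWLINE_DELIMITED_JSON"}
    fmt = fmt.upper()
    if fmt not in allowed:
        raise ValueError(f"unsupported format: {fmt}")
    # Quote each URI and join them into the array literal used by OPTIONS.
    uri_list = ", ".join(f"'{u}'" for u in uris)
    return (
        f"CREATE EXTERNAL TABLE `{table}`\n"
        f"OPTIONS (format = '{fmt}', uris = [{uri_list}]);"
    )


# Hypothetical project, dataset, and bucket names.
print(external_table_ddl(
    "my_project.my_dataset.events_ext",
    ["gs://my-bucket/events/*.parquet"],
))
```

Because Parquet files carry their own schema, no column list is given here; a CSV-backed table would usually need an explicit schema or auto-detection.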
- Question 3: Ingesting and Processing Data
Which approach efficiently loads large CSV files from Cloud Storage into BigQuery?
Correct Answer: B
Explanation: BigQuery load jobs are the most efficient way to bulk-load data from Cloud Storage. They offer optional schema auto-detection, support multiple formats (CSV, JSON, Avro, Parquet, ORC), and incur no charge for load operations that use the shared slot pool.
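One way to picture a load job is the `bq load` command it corresponds to. The sketch below assembles that command as an argument list without running it; the helper name `bq_load_command`, the dataset and table, and the bucket path are assumptions made for illustration.

```python
import shlex


def bq_load_command(table: str, uri: str,
                    skip_header: bool = True,
                    autodetect: bool = True) -> list[str]:
    """Assemble a `bq load` invocation for CSV files in Cloud Storage.

    `table` (DATASET.TABLE) and `uri` are hypothetical placeholders.
    The list can be passed to subprocess.run() if the bq CLI is installed.
    """
    cmd = ["bq", "load", "--source_format=CSV"]
    if skip_header:
        cmd.append("--skip_leading_rows=1")  # ignore the CSV header row
    if autodetect:
        cmd.append("--autodetect")           # let BigQuery infer the schema
    cmd += [table, uri]
    return cmd


# A wildcard URI loads many files in a single job.
cmd = bq_load_command("my_dataset.sales", "gs://my-bucket/sales/*.csv")
print(shlex.join(cmd))
```

Building the command as a list rather than a single string avoids shell-quoting problems with the wildcard in the URI.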
- Question 4: Storing and Managing Data
When should you use Firestore vs. Bigtable for NoSQL storage?
Correct Answer: B
Explanation: Firestore: document model, rich queries, real-time sync for mobile/web. Bigtable: wide-column, millisecond latency at petabyte scale, optimized for high-throughput sequential reads/writes (IoT, time-series).
- Question 5: Storing the Data
When should you use Cloud Spanner vs Cloud SQL?
Correct Answer: B
Explanation: Spanner: globally distributed, strongly consistent, horizontal scaling, 99.999% availability SLA for multi-region configurations, higher cost. Cloud SQL: regional, vertical scaling (up to 128 vCPU), standard MySQL/PostgreSQL/SQL Server. Choose Spanner for global applications and financial transactions. Choose Cloud SQL for regional, standard workloads and budget constraints.
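The rule of thumb in this explanation can be written down as a tiny decision function. This is only a sketch of the reasoning, not an official selection procedure; the `Workload` fields and the function name are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Workload:
    """Hypothetical summary of a relational workload's requirements."""
    globally_distributed: bool      # users and writes span multiple regions
    needs_horizontal_scaling: bool  # load exceeds what one large instance handles


def choose_relational_service(w: Workload) -> str:
    """Apply the Spanner vs Cloud SQL rule of thumb from the explanation."""
    if w.globally_distributed or w.needs_horizontal_scaling:
        return "Cloud Spanner"
    # Regional, standard workloads: the cheaper, familiar engine suffices.
    return "Cloud SQL"


print(choose_relational_service(Workload(True, False)))
print(choose_relational_service(Workload(False, False)))
```

In an exam scenario, the same two questions (is it global, does it need to scale out) are usually enough to eliminate one of the two services.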
- Question 6: Storing Data
When should you use Bigtable vs BigQuery?
Correct Answer: B
Explanation: Bigtable: NoSQL wide-column, <10ms reads/writes, TB-PB scale, row-key design critical, no SQL (but integrates with BigQuery for analytics). Use for: time-series (IoT, financial), real-time serving, and high-throughput workloads. BigQuery: columnar SQL analytics, seconds per query, serverless, automatic optimization. Use for: data warehouse, BI/reporting, ML (BQML), and log analytics.
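To make "row-key design critical" concrete, here is a minimal sketch of one commonly recommended pattern for time-series data in Bigtable: lead with the device identifier so writes spread across devices, then append a reversed timestamp so the newest reading for a device sorts first. The key layout, the `MAX_MILLIS` bound, and the function name are assumptions for illustration, and the code uses no Bigtable client.

```python
# Assumed upper bound for epoch-millisecond timestamps (13 digits).
MAX_MILLIS = 9_999_999_999_999


def make_row_key(device_id: str, event_millis: int) -> bytes:
    """Build a row key of the form <device_id>#<reversed timestamp>.

    Starting with the device ID avoids a timestamp-leading key, which
    would send all current writes to the same part of the key space.
    """
    if not 0 <= event_millis <= MAX_MILLIS:
        raise ValueError("event_millis out of range")
    reversed_ts = MAX_MILLIS - event_millis
    # Zero-pad so lexicographic byte order matches numeric order.
    return f"{device_id}#{reversed_ts:013d}".encode("utf-8")


newer = make_row_key("sensor-42", 1_700_000_060_000)
older = make_row_key("sensor-42", 1_700_000_000_000)
print(newer < older)  # newer readings sort ahead of older ones
```

With keys like these, a prefix scan on `sensor-42#` returns that device's readings from most recent to oldest.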
Key Storing & Managing Concepts for PDE
PDE Storing & Managing Exam Tips
Storing and Managing Data questions in PDE are typically scenario-based. Focus on service-level decision making aligned to the official exam objectives. Priority concepts: BigQuery, Cloud Storage, Cloud SQL, Spanner, Bigtable, Firestore.
What PDE Expects
- Anchor your answer in the most practical, secure, and scalable option for the stated scenario.
- Storing & Managing scenarios for PDE are frequently mapped to Domain 3 (~20%), so read the objective carefully before picking controls or architecture.
- Expect multi-service scenarios where Storing & Managing interacts with IAM, networking, storage, or observability patterns rather than appearing as an isolated service question.
- When two options are both technically valid, prefer the choice that best aligns with the exam's operational scope (Professional) and managed-service best practices.
High-Value Storing & Managing Concepts
- Know the core Storing & Managing building blocks cold: BigQuery, Cloud Storage, Cloud SQL, Spanner.
- Review the edge-case features and limits of Bigtable and Firestore; these details are commonly used to differentiate answer choices.
- Practice service-integration reasoning: how Storing & Managing pairs with Data Processing and Ingesting & Processing in real deployment patterns.
- For PDE, explain why the chosen Storing & Managing design meets reliability, security, and cost expectations better than the alternatives.
Common PDE Traps
- Watch for answers that partially solve the requirement but miss operational constraints.
- Questions in Storing and Managing Data often include distractors that look correct for Storing & Managing but violate least-privilege, durability, or availability requirements.
- Avoid picking options purely by feature name; validate data path, failure handling, and governance impact before answering.
- If the prompt hints at automation or repeatability, eliminate manual-only operational answers first.
Fast Review Checklist
- Can you compare at least two Storing & Managing implementation paths and justify which one best fits the scenario?
- Can you map the chosen answer back to Storing and Managing Data (~20%) outcomes for PDE?
- Can you explain security and access boundaries for Storing & Managing without relying on default-open assumptions?
- Can you describe how Storing & Managing integrates with Data Processing and Ingesting & Processing during failure, scaling, and monitoring events?