Perfect. Below is an interview-only AWS service selection question bank (120 questions): no tutorials, no fluff, just the “which service & why” questions commonly asked in interviews for Data Engineer / Python / PySpark roles.
Use this as:
- 🔹 mock interview practice
- 🔹 rapid revision
- 🔹 service-selection clarity
🎯 AWS SERVICE SELECTION — INTERVIEW QUESTION BANK (120)
🧮 COMPUTE (EC2 · Lambda · ECS · EMR · Glue)
- When would you choose EC2 over Lambda?
- When does Lambda become a bad choice?
- Why is Lambda not suitable for Spark?
- EC2 vs ECS — what problem does ECS solve?
- When would you choose containers instead of EC2?
- Why is EMR preferred over EC2 for Spark?
- Glue vs EMR — which one gives more Spark control?
- When should you avoid Glue Spark jobs?
- Why is EMR Serverless better than classic EMR in some cases?
- EMR vs Databricks — selection criteria?
- When would you use multiple EC2 instances instead of EMR?
- What compute service suits long-running Python services?
- What service is best for event-driven execution?
- Can Lambda trigger EMR jobs? Why would you do that?
- Why is Glue considered an ETL service, not general compute?
- When does ECS make more sense than Lambda?
- Why would you NOT use EMR Serverless?
- How does compute choice affect AWS cost?
- Why do companies still use EC2 despite serverless?
- What compute service fits batch vs streaming Spark?
🗄️ STORAGE (S3 · EBS · EFS · Glacier)
- Why is S3 not a file system?
- When would you choose EBS over S3?
- EFS vs S3 — key difference?
- Why is S3 preferred for data lakes?
- Why is rename expensive in S3?
- What happens if you store Spark shuffle data in S3?
- When would you use Glacier?
- S3 vs HDFS — which one is cheaper at scale?
- Why is S3 ideal for ephemeral EMR clusters?
- Why is EFS rarely used in data engineering?
- How does storage choice affect Spark performance?
- Can Lambda write to EBS?
- When would you attach multiple EBS volumes?
- Why does Athena require S3?
- Why do Glue crawlers need S3 paths?
- Why is S3's strong consistency important?
- When should you avoid too many S3 partitions?
- Why is S3 used instead of databases for analytics?
- How does lifecycle policy reduce cost?
- Why is S3 better than EFS for large datasets?
🧾 DATABASES (RDS · DynamoDB · Glue Catalog)
- When would you choose RDS over DynamoDB?
- Why is DynamoDB bad for joins?
- Why is Glue Catalog NOT a database?
- Why does Spark not write data into Glue Catalog?
- When is Athena preferred over RDS?
- Can Glue Catalog replace RDS? Why not?
- Why do OLTP workloads avoid S3?
- Why is DynamoDB serverless?
- Why is RDS not ideal for analytics?
- What database fits metadata storage?
- Why is schema evolution easier in Glue?
- Why is DynamoDB not used as a data lake?
- When would you store small reference data in RDS?
- Why does Athena not support row-level updates on standard (non-Iceberg) tables?
- How does database choice affect ETL design?
🔁 ORCHESTRATION (Step Functions · Airflow · EventBridge)
- Step Functions vs Airflow — when to choose which?
- Why is Step Functions sometimes described as “serverless Airflow-lite”?
- Why is Airflow still used despite Step Functions?
- Can Step Functions replace Airflow completely?
- When would you use EventBridge instead of Step Functions?
- Why is Lambda often combined with Step Functions?
- Why should orchestration be decoupled from compute?
- Why is retry logic important in workflows?
- Why does Glue integrate well with Step Functions?
- When would you avoid Airflow?
- Can Step Functions trigger EMR jobs?
- Why is EventBridge good for loosely coupled systems?
- Why are DAGs better than scripts?
- What happens if orchestration fails?
- Why is orchestration a senior-level topic?
🧪 CI/CD (CodePipeline · GitHub · CloudFormation)
- CI/CD vs Infrastructure as Code — difference?
- Why is CloudFormation not CI/CD?
- Why do many teams use GitHub instead of CodeCommit?
- When would you choose CodePipeline?
- Why should infrastructure be version-controlled?
- Can CloudFormation deploy Spark jobs?
- Why is CI/CD important for data pipelines?
- How does CI/CD help rollback?
- Why is CI/CD rarely used for Glue scripts in small teams?
- What breaks if CI/CD is missing?
🏗️ INFRASTRUCTURE (CloudFormation · Terraform)
- CloudFormation vs Terraform — when to choose which?
- Why is IaC critical in production?
- Can CloudFormation create EMR clusters?
- Why is manual console creation risky?
- Why do companies prefer Terraform for multi-cloud?
- Why is IaC important for cost control?
- Why should IAM roles be created via IaC?
- Can IaC manage Glue jobs?
- Why does IaC reduce human error?
- Why is IaC expected in senior roles?
📊 MONITORING & SECURITY (CloudWatch · IAM · CloudTrail)
- Why is CloudWatch mandatory in production?
- Why is IAM the most common failure point?
- Why should services use IAM roles instead of keys?
- What happens if IAM permissions are too broad?
- Why is CloudTrail important for audits?
- Why should Lambda always log to CloudWatch?
- How does monitoring help cost optimization?
- Why is security shared responsibility?
- Why is least privilege important?
- Why is IAM tested heavily in interviews?
🔥 SCENARIO-BASED (REAL INTERVIEW QUESTIONS)
- You need to process 5 TB daily batch data — what services?
- You need near real-time ingestion — which services?
- You want zero server management — what do you choose?
- You need SQL-only access for analysts — which tool?
- You want cheapest long-term storage — what service?
- You need retries + branching logic — which orchestration?
- You want fast Spark startup — which option?
- You need cross-account access — how?
- You want audit trail of user actions — what service?
- You want to avoid vendor lock-in — what IaC?
- You want fully serverless ETL — which stack?
- You need containerized Spark — what service?
- You want automatic schema detection — which service?
- You need metadata only — what service?
- You want to minimize AWS bill — what design choice?
- You want easy rollback — which practice?
- You need Spark streaming — which compute?
- You want no cluster lifecycle — which service?
- You want tight Spark tuning — which option?
- You want simplest architecture — which services?
🧠 HOW TO USE THIS EFFECTIVELY
- ✅ Answer verbally (no notes)
- ✅ For each question, say:
Service → Why → Why not others
- ✅ Practice 10/day → 12 days = mastery
🚀 NEXT (Highly Recommended)
Reply with one or more:
- A → Model answers for all 120
- B → Mock interview (I ask, you answer)
- C → Turn this into a printable PDF
- D → AWS + PySpark scenario-based coding interview
You’re now thinking at a senior AWS engineer level, which is how interviewers approach these questions 💯