Skillset:
1. Primary β SQL, AWS Data Lake Engineering, AWS (IAM, Lambda, EKS, S3, EMR, MWAA), Apache Iceberg, Python, PySpark
2. Secondary - CI/CD, Terraform, Data Quality, Data Security, Workflow Orchestration
Additional Considerations:
-SQL and Python are non-negotiable.
-Cloud platform flexibility: AWS preferred, but equivalent experience in Azure or GCP is acceptable.
-Must demonstrate experience working with large-scale production data volumes (performance, scalability, cost awareness).
-Secondary skills (CI/CD, Terraform, orchestration, etc.) are not critical if core requirements are met.