PySpark Developer (Python, Airflow, Cloudera mandatory) - 5+ YOE - Onsite - Chennai - Immediate -[...]
ValueLabs | Dubai, United Arab Emirates | Posted June 13, 2026
Job Description
Qualifications
- Bachelor’s or Master’s degree in Computer Science, Data Engineering, Information Systems, or a related field.
- 8+ years of experience as a Data Engineer, with a strong focus on PySpark and the Cloudera Data Platform.
PySpark Job Description
- Data Pipeline Development: Design, develop, and maintain highly scalable and optimized ETL pipelines using PySpark on the Cloudera Data Platform, ensuring data integrity and accuracy.
- Data Ingestion: Implement and manage data ingestion processes from a variety of sources (e.g., relational databases, APIs, file systems) to the data lake or data warehouse on CDP.
- Data Transformation and Processing: Use PySpark to process, cleanse, and transform large datasets into meaningful formats that support analytical needs and business requirements.
- Performance Optimization: Conduct performance tuning of PySpark code and Cloudera components, optimizing resource util...