🌿 Back to all jobs

🥝 Pyspark Developer(Python,Airflow,Cloudera is mandate) - 5+ YOE - Onsite - Chennai - Immediate -[...]

ValueLabs | dubai, United-Arab-Emirates | Posted June 13, 2026

Job Description

Qualifications

  • Bachelor’s or Master’s degree in Computer Science, Data Engineering, Information Systems, or a related field.
  • 8+ years of experience as a Data Engineer, with a strong focus on PySpark and the Cloudera Data Platform.

Pyspark Job Description

  • Data Pipeline Development: Design, develop, and maintain highly scalable and optimized ETL pipelines using PySpark on the Cloudera Data Platform, ensuring data integrity and accuracy.
  • Data Ingestion: Implement and manage data ingestion processes from a variety of sources (e.g., relational databases, APIs, file systems) to the data lake or data warehouse on CDP.
  • Data Transformation and Processing: Use PySpark to process, cleanse, and transform large datasets into meaningful formats that support analytical needs and business requirements.
  • Performance Optimization: Conduct performance tuning of PySpark code and Cloudera components, optimizing resource util...

Apply for This Position

Submit Application