Data Engineering Lead - AWS Glue & PySpark Specialist
UST
Date: 1 week ago
City: Thiruvananthapuram, Kerala
Contract type: Full time

We are seeking a skilled and experienced Data Engineer Lead to join our team. The ideal candidate will have expertise in Apache Spark, PySpark, Python, and AWS services (particularly AWS Glue). You will be responsible for designing, building, and optimizing ETL processes and data workflows in the cloud, specifically on the AWS platform. Your work will focus on leveraging Spark-based frameworks, Python, and AWS services to efficiently process and manage large datasets.
Experience Range - 5 to 7 years
Key Responsibilities:
- Spark & PySpark Development: Design and implement scalable data processing pipelines using Apache Spark and PySpark to support large-scale data transformations.
- ETL Pipeline Development: Build, maintain, and optimize ETL processes for seamless data extraction, transformation, and loading across various data sources and destinations.
- AWS Glue Integration: Utilize AWS Glue to create, run, and monitor serverless ETL jobs for data transformations and integrations in the cloud.
- Python Scripting: Develop efficient, reusable Python scripts to support data manipulation, analysis, and transformation within the Spark and Glue environments.
- Data Pipeline Optimization: Ensure that all data workflows are optimized for performance, scalability, and cost-efficiency on the AWS Cloud platform.
- Collaboration: Work closely with data analysts, data scientists, and other engineering teams to create reliable data solutions that support business analytics and decision-making.
- Documentation & Best Practices: Maintain clear documentation of processes, workflows, and code while adhering to best practices in data engineering, cloud architecture, and ETL design.
Required Skills:
- Expertise in Apache Spark and PySpark for large-scale data processing and transformation.
- Hands-on experience with AWS Glue for building and managing ETL workflows in the cloud.
- Strong programming skills in Python, with experience in data manipulation, automation, and integration with Spark and Glue.
- In-depth knowledge of ETL principles and data pipeline design, including optimization techniques.
- Proficiency in working with AWS services, such as S3, Glue, Lambda, and Redshift.
- Strong skills in writing optimized SQL queries, with a focus on performance tuning.
- Ability to translate complex business requirements into practical technical solutions.
- Familiarity with Apache Airflow for orchestrating data workflows.
- Knowledge of data warehousing concepts and cloud-native analytics tools.
Skills
Aws Glue, Pyspark, Python.
How to apply
To apply for this job you need to authorize on our website. If you don't have an account yet, please register.
Post a resumeSimilar jobs
Senior z/OS System Programmer
UST,
Thiruvananthapuram, Kerala
11 hours ago
Job Title: Senior z/OS System ProgrammerExperience: 7+ years relevant experienceLocation: Chennai, Pune, Bangalore, Trivandrum, Kochi, Hyderabad, NoidaWe have an exciting opportunity for an experienced Mainframe Systems Programmer to join our dynamic team. The ideal candidate will have over 7+ years of hands-on experience with z/OS system programming, including system software installation, maintenance, configuration, and hardware management. This role is essential...

Senior Manager, Engineering (Edge)
Armada,
Thiruvananthapuram, Kerala
1 week ago
About The CompanyArmada is an edge computing startup that provides computing infrastructure to remote areas where connectivity and cloud infrastructure is limited, as well as areas where data needs to be processed locally for real-time analytics and AI at the edge. We’re looking to bring on the most brilliant minds to help further our mission of bridging the digital divide...

Applications Developer 2
Oracle,
Thiruvananthapuram, Kerala
2 weeks ago
Job DescriptionAnalyze, design develop, troubleshoot and debug software programs for commercial or end user applications. Writes code, completes programming and performs testing and debugging of applications.Oracle's CX Service Product Development OrganisationParticipates in multiple phases of Oracle's Customer Experience Cloud Products SDLCContributes to the next generation of software capabilities to address needs of global enterprisesWe are looking for developers, who are...
