Big Data Engineer (Lead)

Infogain


Date: 9 hours ago
City: Mumbai, Maharashtra
Contract type: Full time
Roles & Responsibilities

Job Profile Summary:

In this role, you will support the Data Engineering team in setting up the Data Lake on Cloud and the implementation of standardized Data Model, single view of customer. You will develop data pipelines for new sources, data transformations within the Data Lake , implementing graphql , work on no sql database, CI/CD and data delivery as per the business requirements.

Job Description

  • Build pipelines to bring in wide variety of data from multiple sources within the organization as well as from social media and public data sources.
  • Collaborate with cross functional teams to source data and make it available for downstream consumption.
  • Work with the team to provide an effective solution design to meet business needs.
  • Ensure regular communication with key stakeholders, understand any key concerns in how the initiative is being delivered or any risks/issues that have either not yet been identified or are not being progressed.
  • Ensure dependencies and challenges (risks) are escalated and managed. Escalate critical issues to the Sponsor and/or Head of Data Engineering.
  • Ensure timelines (milestones, decisions and delivery) are managed and value of initiative is achieved, without compromising quality and within budget.
  • Ensure an appropriate and coordinated communications plan is in place for initiative execution and delivery, both internal and external.
  • Ensure final handover of initiative to business-as-usual processes, carry out a post implementation review (as necessary) to ensure initiative objectives have been delivered, and any lessons learned are fed into future initiative management processes.

Who We Are Looking For

Competencies & Personal Traits

  • Work as a team player
  • Excellent problem analysis skills
  • Good in the Azure Databricks platform
  • Experience with at least one Cloud Infra provider (Azure/AWS)
  • Experience in building data pipelines using batch processing with Apache Spark (Spark SQL, Dataframe API) or Hive query language (HQL)
  • Experience in building streaming data pipeline using Apache Spark Structured Streaming or Apache Flink on Kafka & Delta Lake
  • Knowledge of NOSQL databases. Good to have experience in Cosmos DB, Restful API’s and GraphQL
  • Knowledge of Big data ETL processing tools, Data modelling and Data mapping.
  • Experience with Hive and Hadoop file formats (Avro / Parquet / ORC)
  • Basic knowledge of scripting (shell / bash)
  • Experience of working with multiple data sources including relational databases (SQL Server / Oracle / DB2 / Netezza), NoSQL / document databases, flat files
  • Basic understanding of CI CD tools such as Jenkins, JIRA, Bitbucket, Artifactory, Bamboo and Azure Dev-ops.
  • Basic understanding of DevOps practices using Git version control
  • Ability to debug, fine tune and optimize large scale data processing jobs

Working Experience

  • 12-15 years of broad experience of working with Enterprise IT applications in cloud platform and big data environments.

Professional Qualifications

  • Certifications related to Data and Analytics would be an added advantage

Education

  • Master/bachelor’s degree in STEM (Science, Technology, Engineering, Mathematics)

Language

  • Fluency in written and spoken English

Experience

  • 11-12 Years

Skills

  • Primary Skill: Data Engineering
  • Sub Skill(s): Data Engineering
  • Additional Skill(s): Big Data, Apache Hadoop, Apache Hive, Azure-Infra, databricks, SQL, Apache Spark, Azure Datalake

About The Company

Infogain is a human-centered digital platform and software engineering company based out of Silicon Valley. We engineer business outcomes for Fortune 500 companies and digital natives in the technology, healthcare, insurance, travel, telecom, and retail & CPG industries using technologies such as cloud, microservices, automation, IoT, and artificial intelligence. We accelerate experience-led transformation in the delivery of digital platforms. Infogain is also a Microsoft (NASDAQ: MSFT) Gold Partner and Azure Expert Managed Services Provider (MSP).

Infogain, an Apax Funds portfolio company, has offices in California, Washington, Texas, the UK, the UAE, and Singapore, with delivery centers in Seattle, Houston, Austin, Kraków, Noida, Gurgaon, Mumbai, Pune, and Bengaluru.

How to apply

To apply for this job you need to authorize on our website. If you don't have an account yet, please register.

Post a resume

Similar jobs

Full Stack Engineer

PluginLive, Mumbai, Maharashtra
11 hours ago
About the Company:PluginLive is an all-in-one tech platform that bridges the gap between all its stakeholders - Corporates, Institutes Students, and Assessment & Training Partners. This ecosystem helps Corporates in brand building/positioning with colleges and the student community to scale its human capital, at the same time increasing student placements for Institutes, and giving students a real time perspective of...

Securities Services - Fund Administration Product Manager - Vice President

JPMorganChase, Mumbai, Maharashtra
12 hours ago
Job DescriptionBrief IntroductionAre you looking to take your Fund Administration experience to a broader level? Through this role within the Fund Administration product management team, you will contribute to driving our business objectives including the strategic development of our service offering, working on client, industry and regulatory changes, supporting new business opportunities and developing your skills as a subject matter...

Senior Regulatory Reporting Analyst

IDFC FIRST Bank, Mumbai, Maharashtra
3 days ago
Role/ Job Title: Senior Regulatory Reporting AnalystFunction/Department: FinanceJob PurposeThe role bearer has the responsibility to perform prepare and establish various financial analyses, opportunities quantifications, financial projections and capital adequacy calculations in order to provide management with all required financial data, with utmost accuracy, timeliness, and within set standards and guidelines. The role holder is expected to maintain the MIS system...