Data Scientist

Wipro


Date: 2 weeks ago
City: Pune, Maharashtra
Contract type: Full time
Role:

Lead ML Developer (NLP)

Job Summary:

This job requires candidate to lead the design and development work with Large Language model and application of Large Language models for different use cases. As an expert in using Generative AI, particularly LLM models, and proficient in Python programming, he/she will be responsible for developing and deploying a system that can convert natural language queries to handle the use cases like building SQL query statements for searching tables in DB or searching the documents for Q&A. This role requires a deep understanding of natural language processing (NLP) techniques, database management systems, SQL Query optimization, and passion for building intelligent systems.

Responsibilities:

  • Develop and implement a robust system for converting natural language queries with help of prompt engineering into useful format, which can be further programmatically used for search a set of SQL tables or set of documents.
  • Development and application of word embedding techniques (Woprd2vec, Transformer model based encoders, LLM based encoder, BERT based encoders etc)
  • Fine Tuning of Large Language Models with custom data sets.
  • Utilize Generative AI techniques, leveraging Large Language models, to accurately interpret and convert natural language queries into SQL statements.
  • Use of LLMs for the NLP state of art techniques like Text Classification, NER, Keyword Extraction, Text-to-SQL conversion.
  • Check the feasibility of use cases based on Large Language Models, specifically in area chatbot for sales and finance problems.
  • Develop mechanism to summarize SQL tables into user-level summary text, providing concise and meaningful insights from the data retrieved.
  • Design and build scalable architecture that can handle a large volume of queries efficiently, ensuring high performance and minimal latency.
  • Collaborate with cross-functional teams, including data scientists, software engineers, and database administrators, to understand requirements and integrate the solution into existing systems.
  • Conduct thorough research and stay up to date with the latest advancements in NLP, machine learning, and Generative AI to continuously improve the system’s accuracy and efficiency.
  • Optimize and fine-tune SQL queries to ensure efficient data retrieval, taking into account query execution plans, indexes, and query performance optimization techniques.
  • Develop testing frameworks and conduct rigorous testing to validate the system’s accuracy, reliability, and scalability.
  • Document the system architecture, design decisions, and codebase to facilitate future maintenance and enhancements.

Skills

MUST

  • Strong experience in architecting and development of ML, NLP based projects is a must.
  • Strong track record of ML led solution development from scratch.
  • Strong proficiency in Python Programming (very strong Python credentials only apply) and experience with relevant libraries and frameworks for NLP such as NLTK, spaCy, Hugging Face transformers.
  • In-depth knowledge and experience in using Generative AI techniques for NLP tasks, preferable with Large Language models (e.g., GPT-3, GPT-4), GCP PALM models (code bison, text bison) or Hugging Face models.
  • Good exposure to word embedding techniques. (BERT, Word2Vec, LLM based encoders etc.)
  • Experience with machine learning frameworks (e.g., TensorFlow, PyTorch) and deep learning architectures.
  • Hands on experience with Langchain framework.
  • Solid understanding of SQL and experience working with popular relational database management systems.
  • Proficiency in writing both simple standard SQL queries and complex joining queries to fetch data from a database.
  • Experience with query optimization techniques and understanding of indexes, execution plans, and performance tuning.
  • Familiarity with cloud-based data warehousing platforms.
  • Strong problem-solving skills and ability to translate business requirements into technical solutions.
  • Excellent communication skills to collaborate effectively with multidisciplinary teams.
  • Ability to work independently, manage priorities, and deliver high-quality results within project timelines.
  • Strong attention to detail and a commitment to producing clean, well-documented code.

Natural Language Processing - NLP

How to apply

To apply for this job you need to authorize on our website. If you don't have an account yet, please register.

Post a resume

Similar jobs

Search Engine Optimization (SEO) Internship in Dewas, Nagpur, Burhanpur, Pune (Hybrid)

Big Cats India, Pune, Maharashtra
2 days ago
As a Search Engine Optimization (SEO) intern at Big Cats India, you will have the opportunity to work with a team of dedicated professionals who are passionate about wildlife conservation and environmental education. Your role will involve optimizing our website and online content to increase visibility and engagement.Key Responsibilities Conduct keyword research and analysis to identify opportunities for improving search...

Private Equity Fund Accounting - Senior Associate

Apex Group Ltd, Pune, Maharashtra
2 days ago
DescriptionDo you have Fund Accounting experience, and are you seeking a new job in Pune? Apex Group is looking for a Private Equity Fund Accounting Senior Associate, and the hybrid role comes with an attractive salary and benefits package. This full-time hybrid role comes with a favourable salary and the chance to join a progressive company.The successful PEFA - Senior...

Workday - Consultant

Deloitte, Pune, Maharashtra
4 days ago
Summary Position Summary Work you’ll do The key job responsibilities will be to: Demonstrate commitment to continuous improvement through regular discussion with the client and/or internal teams to assess service delivery Serve as a subject matter expert in multiple domains of Workday Security Lookout for potential risks on client engagements diligently and highlight to leadership proactively Demonstrate strong people management...