Logo of Huzzle

Data Scientist

image

EY

1mo ago

  • Job
    Full-time
    Mid Level
  • Data
    IT & Cybersecurity
  • Hyderabad, +1

AI generated summary

  • You should have a relevant degree, 4+ years in data science, strong Python skills, experience with ML, LLMs, NLP, and computer vision, project management, SQL/NoSQL knowledge, and excellent communication.
  • You will analyze data to solve business problems, implement AI/ML models, collaborate on deployment, and extract information from complex documents.

Requirements

  • Excellent academic background, including at a minimum a bachelor or a master’s degree in data science, Business Analytics, Statistics, Engineering, Operational Research, or other related field with strong focus on modern data architectures, processes, and environments.
  • Solid background in Python with excellent coding skills.
  • 4+ years of core data science experience in one or more below areas:
  • Machine Learning (Regression, Classification, Decision Trees, Random Forests, Timeseries Forecasting and Clustering)
  • Understanding and usage of Large Language Models like Open AI models like ChatGPT, GPT4, frameworks like LangChain and Llama Index.
  • Good understanding of open source LLM framework like Mistral, Llama, etc. and fine tuning on custom datasets.
  • Deep Learning (DNN, RNN, LSTM, Encoder-Decoder Models)
  • Natural Language Processing- Text Summarization, Aspect Mining, Question Answering, Text Classification, NER, Language Translation, NLG, Sentiment Analysis, Sentence
  • Computer Vision- Image Classification, Object Detection, Tracking etc.
  • SQL/NoSQL Databases and its manipulation components
  • Working knowledge of API Deployment (Flask/FastAPI/Azure Function Apps) and webapps creation, Docker, Kubernetes.
  • Excellent written, oral, presentation and facilitation skills
  • Ability to coordinate multiple projects and initiatives simultaneously through effective prioritization, organization, flexibility, and self-discipline.
  • Must have demonstrated project management experience.
  • Knowledge of firm’s reporting tools and processes.
  • Proactive, organized, and self-sufficient with ability to priorities and multitask.
  • Analyses complex or unusual problems and can deliver insightful and pragmatic solutions.
  • Ability to quickly and easily create/ gather/ analyze data from a variety of sources.
  • A robust and resilient disposition able to encourage discipline in team behaviors

Responsibilities

  • Convert business problem into analytical problem and devise a solution approach.
  • Clean, aggregate, analyze and interpret the data to derive business insights from it.
  • Own the AI/ML implementation process: Model Design, Feature Planning, Testing, Production Setup, Monitoring, and release management.
  • Work closely with the Solution Architects in deployment of the AI POC’s and scaling up to production level applications.
  • Should have solid background in Python and has deployed on open-source models.
  • Work on data extraction techniques from complex PDF/Word Doc/Forms- entities extraction, table extraction, information comparison.

FAQs

What is the primary focus of the Data Scientist role at EY?

The primary focus is on implementing innovative ideas through AI research to develop impactful products and analytics-enabled solutions that help EY's sector and service line professionals gain insights from data.

What qualifications are required for this position?

A minimum of a bachelor's or master's degree in data science, Business Analytics, Statistics, Engineering, Operational Research, or a related field is required, with a strong focus on modern data architectures, processes, and environments.

How many years of core data science experience are needed for this role?

A minimum of 4 years of core data science experience in relevant areas is required.

What programming language should candidates be proficient in?

Candidates should have a solid background in Python with excellent coding skills.

What types of machine learning techniques should candidates be familiar with?

Candidates should be familiar with techniques such as Regression, Classification, Decision Trees, Random Forests, Time Series Forecasting, and Clustering.

Are there specific AI frameworks that candidates should know?

Yes, candidates should have an understanding of Large Language Models like OpenAI models (e.g., ChatGPT, GPT-4) and frameworks like LangChain and Llama Index.

What additional skills are emphasized for this role?

Additional skills include excellent written and oral communication, project management experience, ability to analyze complex problems, and proficiency in SQL/NoSQL databases.

Will the role involve working closely with Solution Architects?

Yes, the Data Scientist will work closely with Solution Architects to deploy AI proofs of concept and scale them to production-level applications.

What is the importance of data extraction techniques in this role?

Data extraction techniques from complex documents are essential for entities extraction, table extraction, and information comparison to derive insights for business solutions.

What can candidates expect in terms of personal development at EY?

Candidates can expect support, coaching, and feedback from engaged colleagues, as well as opportunities to develop new skills and progress their career with a focus on personal development and individual progression plans.

Accounting
Industry
1-10
Employees

Mission & Purpose

EY exists to build a better working world, helping create long-term value for clients, people and society and build trust in the capital markets. Enabled by data and technology, diverse EY teams in over 150 countries provide trust through assurance and help clients grow, transform and operate. Working across assurance, consulting, law, strategy, tax and transactions, EY teams ask better questions to find new answers for the complex issues facing our world today. Find out more about the EY global network http://ey.com/en_gl/legal-statement