Elsevier

Data Scientist II

Posted 11 Days Ago
India
Junior
The Data Scientist II will develop and maintain NLP solutions, manage data science projects from design to production, create production-ready Python packages, and ensure the robustness of data science pipelines against model drift.
The summary above was generated by AI

Are you interested in working with data and analytics to solve problems?  

Are you interested in bringing your NLP and generative AI expertise to projects and building on it?

About our Team  

We are a diverse team of natural language processing and gen AI experts, taxonomy experts and scientific content experts in biology and chemistry domains. We mainly develop best-in-class enrichment algorithms that deeply mine scientific literature (journals and patents) for Elsevier life science .com products such as Reaxys and Embase.  

About the Role  

You will be responsible for building, testing and maintaining our NLP solutions. You will work throughout the whole life cycle of data science projects: design, implementation, productionization and beyond. You will deliver efficient, production-ready Python code. You will collaborate with the technology team to deploy and productionize our data science pipelines.

Responsibilities

  • Data collection, data analysis, model development, defining quality metrics, quality assessment of models and regular presentations to stakeholders

  • Creating production-ready Python packages for each component of data science pipelines (such as pre-processing and model inference) and deploying them together with the technology team

  • Integration of data science components and end-to-end quality assessment

  • Keeping our data science pipelines robust against model drift and ensuring continuous output quality; developing the tools and strategies needed for maintenance, such as automatic model re-training

  • Establishing a reporting process for pipeline performance and an automatic re-training strategy for the existing pipelines

Requirements

  • At least 2 years of relevant applied experience, or an MSc/MTech in computer science, data science, artificial intelligence, mathematics, statistics, bioinformatics or another quantitative field with at least 1 year of relevant experience. International working/education experience is a plus!

  • Strong hands-on knowledge of Python; ability to write unit tests and production-ready code adhering to Python best practices and object-oriented programming principles

  • Data processing, cleaning and analysis skills: experience with pandas, numpy, matplotlib, boto3

  • Experience with state-of-the-art (SOTA) deep learning approaches in the NLP domain, such as LLMs and fine-tuning for specific use cases such as named entity recognition and relation extraction

  • Affinity with gen AI solutions, various LLMs, vectorization methodologies and LLM evaluation

  • Experience with CI/CD, Git, PyTorch, AWS services such as SageMaker. Experience with Spark/Databricks is a plus!

  • Willingness to learn, analytical thinking, problem solving and communication skills; ability to translate complex requirements into practical solutions

  • Experience in classical machine learning: classification, regression, clustering, text mining. You have an excellent understanding of neural networks, random forests, logistic regression, SVM, k-means, etc.

  • Experience in later stages of the data science life cycle, such as optimizing productionization (techniques such as parallelization and multithreading) and automated model re-training. Interest in and affinity for MLOps is a plus!

-----------------------------------------------------------------------

Elsevier is an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law. We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form: https://forms.office.com/r/eVgFxjLmAK, or please contact 1-855-833-5120.

Please read our Candidate Privacy Policy.

Top Skills

Python

Elsevier Mumbai, Maharashtra, IND Office

Mumbai, Maharashtra, India, 400059
