Data Science Intern: AI Data Curation (Paid) – Life Sciences

0 years

0 Lacs

Posted: 2 weeks ago | Platform: LinkedIn

Work Mode: On-site

Job Type: Full Time

Job Description

Data Science Intern: AI & Life Sciences (Paid)

Location:

Type:

Company:

About Dizzaroo

Dizzaroo builds AI-first tools to transform drug discovery and development.

Role Overview

We are looking for Data Science Interns who want hands-on experience at the intersection of AI, data science, and life sciences.

What You Will Do
  • Curate, clean, and annotate domain-specific datasets, including:
    • Scientific publications, clinical protocols, and regulatory documents for training large language models.
    • Biomedical and genomic data for structured AI pipelines.
  • Use advanced tools and databases (Weaviate, Neo4j, SQL/NoSQL) to organize and manage large-scale, multimodal datasets.
  • Support data pipeline validation and quality checks to ensure clean, structured training data for AI models.
  • Assist with document chunking, metadata tagging, and knowledge graph development to enhance retrieval and structuring of scientific and clinical data.
  • Collaborate with AI engineers and domain experts to align data curation with project goals.

What We’re Looking For
  • Background: Pursuing or recently completed a Bachelor’s/Master’s in Data Science, Computer Science, Life Sciences, Biomedical Engineering, or related fields.

  • Skills: Strong in at least one domain with working knowledge of the other:

Data science:

  • Proficiency in Python (libraries like pandas, NumPy, PyTorch).
  • Exposure to SQL, and ideally to graph/vector databases (Neo4j, Weaviate).
  • Experience with data cleaning, ETL workflows, or text processing (NLP preprocessing).
  • Curiosity to understand life sciences contexts.

Life sciences:

  • Knowledge of biomedical or clinical data structures, scientific literature, or genomics.
  • Ability to use Python or spreadsheets for basic data analysis.
  • Interest in applying data science tools to life sciences problems.

Mindset:

  • Comfortable with ambiguity and learning complex domain contexts.
  • High attention to detail with a commitment to data quality.
  • Aligns with Dizzaroo’s values of creativity, flexibility, and challenging the status quo.
What You Will Gain
  • Exposure to real-world AI model training pipelines using structured and unstructured data in drug discovery and development.
  • Experience with advanced data infrastructure and tooling for cutting-edge AI workflows.
  • Opportunity to contribute to impactful projects across knowledge management, multimodal data integration, and computer vision in diagnostics.
  • Potential pathway to full-time opportunities with Dizzaroo based on performance.

How to Apply

Submit your CV and a brief note on why you are interested in this role.
