25 Apr 2026

Permanent Junior Data Scientist – Discovery Vacancies

Discovery Limited – Posted by MRJobs24 – Sandton, Gauteng, South Africa

Job Description

Discovery Vacancies – Junior Data Scientist

An opportunity exists for a Junior Data Scientist to join a team working on advanced Natural Language Processing (NLP) and Large Language Model (LLM) solutions. The team is actively involved in research, development, and deployment of intelligent systems powered by NLP and LLM technologies. This role is ideal for an individual eager to contribute to innovative data science projects and grow within a highly technical environment.

Key Responsibilities

  • Work with large volumes of unstructured text data from diverse sources
  • Review and stay updated on relevant academic research and industry developments
  • Collaborate with senior team members to deliver end-to-end data science projects from concept to deployment and business adoption
  • Develop prototypes for machine learning systems, particularly those based on NLP and LLM technologies
  • Implement solutions aligned with architectural designs defined by senior data scientists and engineers
  • Evaluate models and prototypes to ensure accuracy, scientific rigor, and business value
  • Present findings, insights, and project updates to both technical and non-technical stakeholders
  • Identify new opportunities for leveraging existing and new datasets for innovative business use cases

Personal Attributes

  • Strong curiosity and passion for learning and problem-solving
  • Enthusiasm for building data-driven solutions that address real-world challenges
  • Ability to manage multiple priorities and understand broader business context
  • Innovative mindset aligned with organizational values and purpose
  • Strong communication skills and ability to collaborate effectively within a team

Technical Skills

  • Proficiency in SQL and database management
  • Strong programming skills in Python for data science and machine learning
  • Ability to define problem statements, design analytical approaches, and communicate insights clearly

Advantageous Skills

  • Experience with Git and version control systems
  • Familiarity with R programming language
  • Experience with NLP frameworks and model development
  • Exposure to TensorFlow and/or PyTorch
  • Experience working with or training LLMs
  • Knowledge of distributed computing tools such as Spark or Dask

Education and Experience

  • Honours or Master’s degree in Computer Science, Mathematics, Statistics, Data Science, Actuarial Science, Operations Research, Industrial Engineering, Applied Mathematics, or a related quantitative field
  • PhD qualification is advantageous
  • Candidates at all experience levels will be considered

Employment Equity

The organization is committed to diversity and equal opportunity. Applications from individuals with disabilities are encouraged, and all hiring decisions are aligned with the company’s Employment Equity Plan.
