Potential occupational standard
Occupational standard in development
Approved occupational standard
Occupational standard without apprenticeship
Custom occupational card
Apprenticeship
Higher Technical Qualification
T Level
Technical Qualification
Career starter apprenticeship
Royal apprenticeship
Occupational progression
Technical education progression
Mid green occupation
Dark green occupation
Favourite occupation
home Digital
Data scientist

Data scientist

Digital

Level 6 - Professional Occupation

Working in a team to find ways to improve an organisation's processes.

Reference: OCC0585

Status: assignment_turned_inApproved occupation

Average (median) salary: £43,283 per year

SOC 2020 code: 2433 Actuaries, economists and statisticians

SOC 2020 sub unit groups:

  • 2433/04 Statistical data scientists
  • 3544/00 Data analysts

Technical Education Products

ST0585:

Data scientist (integrated degree)

(Level 6)

Approved for delivery

Employers involved in creating the standard:

Arup, Astra Zeneca, Aviva, Bank of England, Barclays Bank, BBC Academy, BCS Learning and Development Limited, Boots, CGI, Civil Service Learning Communities, Department for Eduation, DWP, Energy and Utility Skills, Environment Agency, Estee Lauder, Experian, Financial Conduct Authority, First Response Finance, HEFCE, HMRC, IBM, Lloyds Bank, Lowell Financial Limited, Machinable, Microsoft, Ministry of Justice, Ministry of Justice, MOD, NHS, Office for National Statistics, Optimity, Pfizer, Planning-Inc, Public Health England, Risual, Royal Society, Royal Statistical Society, Sanger, SAS, Shop Direct, SocDM Society of Data Miners, The Co-op Group, The Tech Partnership, Thomas Cook, TUI Group, UBS, UK Apprenticeship Programmes, Unilever, Willis Towers Watson, Wipro, Institute for Apprenticeships, EasyJet, QA, Nottinghamshire County Council, Provident Financial Management Services Limited, Julius Baer International Ltd, John Lewis, Norfolk County Council, Hymans Robertson LLP, Warwickshire County Council, BookingGo, Nestle, Covea Insurance, IQVIA, Post Office, British Airways, Telegraph, Asda, The Pension Regulator

Summary

Data Science is a broad and fast-moving field spanning maths and statistics, software engineering and communications. Data Scientists blend experience and knowledge from a wide range of fields and organisations, and continuously seek to expand their range of technical skills. Data Scientists find information in diverse datasets to address complex problems and improve organisational processes. They are inquisitive, they explore and visualise data of all kinds, find and present ‘stories’ within the data in a meaningful way to a range of technical and non-technical audiences. They make recommendations to inform strategic and operational decision making through sourcing, accessing and manipulating data, and engineering data processes. They identify and address data biases, and handle private data ethically and appropriately, complying with (inter)national privacy regulations. They use the insights gathered about the data they have analysed to inform and achieve organisational goals. They achieve desired outcomes by planning, organising and managing resources effectively. Data Scientists are dynamic and adaptable, addressing varied problems with varied techniques. They actively explore innovative ways to use existing and new statistical, algorithmic, predictive, machine learning and artificial intelligence tools and techniques, to find significant and valuable patterns in data and transform this into information for their organisation. They gather new sources of data, and combine datasets to increase their value. Using a scientific approach, they perform statistical analysis, build and validate models from the data, use programming practices, and maintain data, tools and processes to implement robust and valuable data solutions. Data Scientists have an impact at a strategic and operational level by building and maintaining strong collaborative relationships with key stakeholders, subject matter experts and colleagues at all levels. They engage with the wider Data Science community to share ideas, techniques and experiences. They can work in any sector, public or private, and will often work in a multi-disciplinary team with domain experts, Data Architects, Data Engineers, Analysts, and Technology Professionals.

Employers involved in creating the standard:

Arup, Astra Zeneca, Aviva, Bank of England, Barclays Bank, BBC Academy, BCS Learning and Development Limited, Boots, CGI, Civil Service Learning Communities, Department for Eduation, DWP, Energy and Utility Skills, Environment Agency, Estee Lauder, Experian, Financial Conduct Authority, First Response Finance, HEFCE, HMRC, IBM, Lloyds Bank, Lowell Financial Limited, Machinable, Microsoft, Ministry of Justice, Ministry of Justice, MOD, NHS, Office for National Statistics, Optimity, Pfizer, Planning-Inc, Public Health England, Risual, Royal Society, Royal Statistical Society, Sanger, SAS, Shop Direct, SocDM Society of Data Miners, The Co-op Group, The Tech Partnership, Thomas Cook, TUI Group, UBS, UK Apprenticeship Programmes, Unilever, Willis Towers Watson, Wipro, Institute for Apprenticeships, EasyJet, QA, Nottinghamshire County Council, Provident Financial Management Services Limited, Julius Baer International Ltd, John Lewis, Norfolk County Council, Hymans Robertson LLP, Warwickshire County Council, BookingGo, Nestle, Covea Insurance, IQVIA, Post Office, British Airways, Telegraph, Asda, The Pension Regulator

Typical job titles include:

Data Engineer
Data Scientist
Informatics

Keywords:

Data
Data Scientist
Degree
Problem-solving
Software Engineering

Knowledge, skills and behaviours (KSBs)

K1: The context of Data Science and the Data Science community in relation to computer science, statistics and software engineering. How differing schools of thought in these disciplines have driven new approaches to data systems.
K2: How Data Science operates within the context of data governance, data security, and communications. How Data Science can be applied to improve an organisation’s processes, operations and outputs. How data and analysis may exhibit biases and prejudice. How ethics and compliance affect Data Science work, and the impact of international regulations (including the General Data Protection Regulation.)
K3: How data can be used systematically, through an awareness of key platforms for data and analysis in an organisation, including:
K4: Data processing and storage, including on-premise and cloud technologies.
K5: Database systems including relational, data warehousing & online analytical processing, “NoSQL” and real-time approaches; the pros and cons of each approach.
K6: Data-driven decision making and the good use of evidence and analytics in making choices and decisions.
K7: How to design, implement and optimise analytical algorithms – as prototypes and at production scale – using:
K8: Statistical and mathematical models and methods.
K9: Advanced and predictive analytics, machine learning and artificial intelligence techniques, simulations, optimisation, and automation.
K10: Applications such as computer vision and Natural Language Processing.
K11: An awareness of the computing and organisational resource constraints and trade-offs involved in selecting models, algorithms and tools.
K12: Development standards, including programming practice, testing, source control.
K13: The data landscape: how to critically analyse, interpret and evaluate complex information from diverse datasets:
K14: Sources of data including but not exclusive to files, operational systems, databases, web services, open data, government data, news and social media.
K15: Data formats, structures and data delivery methods including “unstructured” data.
K16: Common patterns in real-world data.

S1: Identify and clarify problems an organisation faces, and reformulate them into Data Science problems. Devise solutions and make decisions in context by seeking feedback from stakeholders. Apply scientific methods through experiment design, measurement, hypothesis testing and delivery of results. Collaborate with colleagues to gather requirements.
S2: Perform data engineering: create and handle datasets for analysis. Use tools and techniques to source, access, explore, profile, pipeline, combine, transform and store data, and apply governance (quality control, security, privacy) to data.
S3: Identify and use an appropriate range of programming languages and tools for data manipulation, analysis, visualisation, and system integration. Select appropriate data structures and algorithms for the problem. Develop reproducible analysis and robust code, working in accordance with software development standards, including security, accessibility, code quality and version control.
S4: Use analysis and models to inform and improve organisational outcomes, building models and validating results with statistical testing: perform statistical analysis, correlation vs causation, feature selection and engineering, machine learning, optimisation, and simulations, using the appropriate techniques for the problem.
S5: Implement data solutions, using relevant software engineering architectures and design patterns. Evaluate Cloud vs. on-premise deployment. Determine the implicit and explicit value of data. Assess value for money and Return on Investment. Scale a system up/out. Evaluate emerging trends and new approaches. Compare the pros and cons of software applications and techniques.
S6: Find, present, communicate and disseminate outputs effectively and with high impact through creative storytelling, tailoring the message for the audience. Use the best medium for each audience, such as technical writing, reporting and dashboards. Visualise data to tell compelling and actionable narratives. Make recommendations to decision makers to contribute towards the achievement of organisation goals.
S7: Develop and maintain collaborative relationships at strategic and operational levels, using methods of organisational empathy (human, organisation and technical) and build relationships through active listening and trust development.
S8: Use project delivery techniques and tools appropriate to their Data Science project and organisation. Plan, organise and manage resources to successfully run a small Data Science project, achieve organisational goals and enable effective change.

B1: An inquisitive approach: the curiosity to explore new questions, opportunities, data, and techniques; tenacity to improve methods and maximise insights; and relentless creativity in their approach to solutions.
B2: Empathy and positive engagement to enable working and collaborating in multi-disciplinary teams, championing and highlighting ethics and diversity in data work.
B3: Adaptability and dynamism when responding to varied tasks and organisational timescales, and pragmatism in the face of real-world scenarios.
B4: Consideration of problems in the context of organisation goals.
B5: An impartial, scientific, hypothesis-driven approach to work, rigorous data analysis methods, and integrity in presenting data and conclusions in a truthful and appropriate manner.
B6: A commitment to keeping up to date with current thinking and maintaining personal development. Including collaborating with the data science community.

Occupational Progression

This occupational progression map shows technical occupations that have transferable knowledge and skills.

In this map, the focused occupation is highlighted in yellow. The arrows indicate where transferable knowledge and skills exist between two occupations. This map shows some of the strongest progression links between the focused occupation and other occupations.

It is anticipated that individuals would be required to undertake further learning or training to progress to and from occupations. To find out more about an occupation featured in the progression map, including the learning options available, click the occupation.

Progression decisions have been reached by comparing the knowledge and skills statements between occupational standards, combined with individualised learner movement data.

Technical Occupations

Levels 2-3

Higher Technical Occupations

Levels 4-5

Professional Occupations

Levels 6-7

Progression link into focused occupation.
assignment_turned_in

Level 3

Progression link into focused occupation.
assignment_turned_in

Level 4

This is the focused occupation.
assignment_turned_in

Level 6

Progression link from focused occupation.
assignment_turned_in

Level 6

Progression link from focused occupation.
assignment_turned_in

Level 7

Progression link from focused occupation.
assignment_turned_in

Level 7

Progression link from focused occupation.
assignment_turned_in

Level 7

Progression link from focused occupation.
assignment_turned_in

Level 7

Digital

Health and science