-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge branch 'main' into 277-qa-culture-externally-commissioned-work-…
…check-and-fix-after-176
- Loading branch information
Showing
12 changed files
with
585 additions
and
532 deletions.
There are no files selected for viewing
Large diffs are not rendered by default.
Oops, something went wrong.
Large diffs are not rendered by default.
Oops, something went wrong.
Large diffs are not rendered by default.
Oops, something went wrong.
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,130 +1,123 @@ | ||
::: {.callout-important} | ||
This version of the AQuA book is a preliminary ALPHA draft. It is still in development, and we are still working to ensure that it meets user needs. | ||
This version of the AQuA Book is a preliminary ALPHA draft. It is still in development, and we are still working to ensure that it meets user needs. | ||
|
||
The draft currently has no official status. It is a work in progress and is subject to further revision and reconfiguration (possibly substantial change) before it is finalised. | ||
::: | ||
|
||
# Design | ||
|
||
## Introduction and overview | ||
|
||
The design stage is where the Analyst translates the scope for the analysis agreed with the Commissioner into an actionable analytical plan. This chapter sets out recommended practices around designing the analysis and associated assurance activities, documenting the design and assuring the design. It also discusses considerations around the treatment of uncertainty in design, and design of multi-use and AI models. | ||
During the design stage the analyst creates an actionable [analytical plan]((#the-analytical-plan) from the scope for the analysis agreed with the commissioner. This chapter sets out recommended practices around designing the analysis, deeciding on the associated assurance activities, documenting the design and assuring the design. It also discusses considerations around the treatment of uncertainty in design and the design of multi-use and Artifical Intelligence (AI) models. | ||
|
||
### The analytical plan | ||
|
||
The development of the analytical plan should consider: | ||
|
||
* Methodology for producing results, including the treatment of uncertainty; | ||
* Project management approach (for example Agile, Waterfall or a combination of approaches); | ||
* Sourcing of inputs and assumptions; | ||
* Data and file management; | ||
* Change management and version control; | ||
* Programming language and/or software; | ||
* Code management, documentation and testing; | ||
* Communication between stakeholders; | ||
* Verification and validation procedures during the project lifetime; | ||
* Documentation to be delivered; | ||
* Process for updating the analytical plan; | ||
* Process for ongoing review and maintenance of models, including reviewing inputs and calibrations and ensuring that software relied on continues to be supported and up to date; | ||
* Ethics; | ||
* Reporting; | ||
* Downstream application. | ||
|
||
The use of [Reproducible Analytical Pipelines (RAP)](#rap) is encouraged as a means of effective project design. | ||
|
||
Iteration between the Commissioner and the Analyst is normal and expected whilst the analytical design develops. | ||
* methodology for producing results, including the treatment of uncertainty | ||
* project management approach (for example agile, waterfall or a combination of approaches) | ||
* sourcing of inputs and assumptions | ||
* data and file management | ||
* change management and version control | ||
* programming language and software | ||
* code management, documentation and testing | ||
* communication between stakeholders | ||
* verification and validation procedures during the project lifetime | ||
* documentation to be delivered | ||
* process for updating the analytical plan | ||
* process for ongoing review and maintenance of models, including reviewing inputs and calibrations and ensuring that software relied on continues to be supported and up to date | ||
* ethics | ||
* reporting | ||
* downstream application | ||
|
||
Iteration of the plan between the commissioner and the analyst is normal and expected while the analytical design develops. | ||
|
||
::: {.callout-tip} | ||
# Reproducible analytical pipelines<a name="rap"></a> | ||
|
||
The recommended approach for developing analysis in code is to use a [Reproducible Analytical Pipeline (RAP)](https://analysisfunction.civilservice.gov.uk/support/reproducible-analytical-pipelines/). Reproducible Analytical Pipelines shall: | ||
The recommended approach for developing analysis in code is to use a [Reproducible Analytical Pipeline (RAP)](https://analysisfunction.civilservice.gov.uk/support/reproducible-analytical-pipelines/). RAPs shall: | ||
|
||
* Follow the practices set out in the [Analysis Function Quality Assurance of Code for Analysis and Research manual](https://best-practice-and-impact.github.io/qa-of-code-guidance/intro.html). | ||
* Meet the requirements of the [Reproducible Analytical Pipelines minimum viable product](https://github.com/best-practice-and-impact/rap_mvp_maturity_guidance/blob/master/Reproducible-Analytical-Pipelines-MVP.md). | ||
* follow the practices set out in the [Analysis Function Quality Assurance of Code for Analysis and Research manual](https://best-practice-and-impact.github.io/qa-of-code-guidance/intro.html) | ||
* meet the requirements of the [RAPs minimum viable product](https://github.com/best-practice-and-impact/rap_mvp_maturity_guidance/blob/master/Reproducible-Analytical-Pipelines-MVP.md) | ||
|
||
::: | ||
|
||
|
||
|
||
## Roles and responsibilities in the design stage | ||
|
||
### The Commissioner's responsibilities during the design stage | ||
The Commissioner should confirm that the analytical approach will satisfy their needs. To assist in this, the Commissioner may review the analytical plan. | ||
|
||
The Commissioner's domain expertise can be a useful resource for the analyst in the design stage. The Commissioner might provide information regarding the input assumptions, data requirements and the most effective ways to present the outputs, all of which can inform the design. | ||
|
||
### The commissioner's responsibilities | ||
|
||
The commissioner should confirm that the analytical approach will satisfy their needs. To assist in this, the commissioner may review the analytical plan. | ||
|
||
### The Analyst's responsibilities during the design stage | ||
The Analyst should: | ||
The commissioner's expertise can be a useful resource for the analyst in the design stage. The commissioner might provide information regarding the input assumptions, data requirements and the most effective ways to present the outputs. All of these can inform the design. | ||
|
||
* develop the method and plan to address the Commissioner’s needs; | ||
* establish assurance requirements; | ||
* develop the plan for proportionate verification and validation - see [National Audit Office Framework to review models](https://www.nao.org.uk/wp-content/uploads/2016/03/11018-002-Framework-to-review-models_External_4DP.pdf); | ||
* plan in sufficient time for the assurance activity; | ||
* document the analytical plan in a proportionate manner; and, | ||
* follow any organisation governance procedures for project design. | ||
|
||
### The analyst's responsibilities | ||
|
||
The analyst should: | ||
|
||
### The Assurer's responsibilities during the design stage | ||
The Assurer should review the analytical plan to ensure that they are able to conduct the required assurance activities. They may provide feedback on the analytical plan. The Assurer should plan sufficient time for the assurance activity. | ||
* develop the method and plan to address the commissioner’s needs | ||
* establish assurance requirements | ||
* develop a plan for proportionate verification and validation as described in the [National Audit Office Framework to review models](https://www.nao.org.uk/wp-content/uploads/2016/03/11018-002-Framework-to-review-models_External_4DP.pdf); | ||
* plan in sufficient time for the assurance activity | ||
* document the analytical plan in a proportionate manner | ||
* follow any organisation governance procedures for project design | ||
|
||
### The assurer's responsibilities | ||
|
||
The assurer should review the analytical plan to ensure that it is able to conduct the required assurance activities. They may provide feedback on the analytical plan. The assurer should plan sufficient time for the assurance activity. | ||
|
||
### The Approver's responsibilities during the design stage | ||
In smaller projects, the Approver may not be heavily involved in the design stage. However, for business critical analysis, the Approver may want to confirm that organisational governance procedures for design have been followed. | ||
|
||
### The approver's responsibilities | ||
|
||
## Assurance activities in the design stage | ||
In smaller projects, the approver may not be heavily involved in the design stage. However, for business critical analysis, the approver may want to confirm that organisational governance procedures for design have been followed. | ||
|
||
|
||
## Assurance activities | ||
|
||
On completion of the design stage, the Assurer should be aware of the quality assurance tasks that will be required of them during the project lifetime and have assured the necessary elements of the analytical plan. | ||
When the design stage has been completed the assurer should be aware of the quality assurance tasks that will be required of them during the project lifetime and have assured the necessary elements of the analytical plan. | ||
|
||
The assurance of the design stage should consider whether the analytical plan is likely to: | ||
|
||
* Address commissioner's requirements - validation; | ||
* Deliver as intended - verification; | ||
* Be robust i.e. well-structured, data driven, with a sound overall design. | ||
* address commissioner's requirements (validation) | ||
* deliver as intended (verification) | ||
* be robust (for exmaple, provide a well-structured, data driven plan with a sound overall design) | ||
|
||
The assurance of the design stage may be carried out by the Assurer. For more complex analysis, it is good practice to engage subject matter experts to provide independent assurance, and to ensure the accuracy and limitations of the chosen methods are understood, ideally with tests baselining their response against independent reference cases. | ||
The assurance of the design stage may be carried out by the assurer. For more complex analysis, it is good practice to engage subject matter experts to provide independent assurance and to ensure the accuracy and limitations of the chosen methods are understood, ideally with tests baselining their response against independent reference cases. | ||
|
||
|
||
## Documentation in the design stage | ||
## Documentation | ||
|
||
The design process should be documented in a proportionate manner. A design document that records the [analytical plan](#the-analytical-plan) should be produced by the Analyst and signed-off by the Commissioner. The design document may be reviewed by the Assurer. | ||
|
||
For modelling, an initial model map may be produced that describes data flows and transformations. This can be updated as the project progresses through the Analysis stage. | ||
The design process should be documented in a proportionate manner. A design document that records the analytical plan should be produced by the analyst and signed-off by the commissioner. The design document may be reviewed by the assurer. | ||
|
||
For modelling, an initial model map may be produced that describes data flows and transformations. This can be updated as the project progresses through the Analysis stage. | ||
|
||
It is best practice to use formal version control to track changes in the design document. | ||
|
||
|
||
|
||
## Treatment of uncertainty in the design stage | ||
|
||
During the design stage, Analysts should examine the planned analysis systematically for possible sources and types of uncertainty, to maximise the chance of identifying all that are sufficiently large to breach the acceptable margin of error. This is discussed in Chapter 3 of the [Uncertainty Toolkit for Analysts](https://analystsuncertaintytoolkit.github.io/UncertaintyWeb/chapter_3.html) | ||
During the design stage, analysts should examine the planned analysis systematically for possible sources and types of uncertainty. This is to maximise the chance of identifying all that are sufficiently large to breach the acceptable margin of error. | ||
|
||
You can read more in Chapter 3 of the [Uncertainty Toolkit for Analysts](https://analystsuncertaintytoolkit.github.io/UncertaintyWeb/chapter_3.html). | ||
|
||
## Black box models and the design stage | ||
|
||
## Black box models | ||
|
||
Using [black box models](definitions_and_key_concepts.qmd/#black-box-models) places greater weight on the design of the analysis and the assurance and validation of outputs by domain experts. | ||
|
||
This [guidance on AI assurance](https://www.gov.uk/government/publications/introduction-to-ai-assurance/introduction-to-ai-assurance) outlines considerations for the design of AI models, including risk assessment, impact assessment, bias audits and compliance audits. | ||
|
||
In the Design of AI/ML models, the Analyst should: | ||
|
||
In the design of AI and machine learning models, the analyst should: | ||
* define the situation they wish to model; | ||
* the prediction they wish to make; | ||
* the data that could be used to make the prediction; | ||
* carry out a literature review to identify appropriate modelling, valuation verification methods and document the rationale for selecting their approach; | ||
* consider how to separate the data for the design and testing of models - it's usual to design a model with a fraction of the data and then test it with the data that was not used in the design; | ||
* consider developing automatic checks to identify if the model is behaving unexpectedly, this is important if the model is likely to be used frequently to make regular decisions; and, | ||
* consider whether to refer the model to their ethics committee, or a similar group - see the [Data Ethics Framework](https://www.gov.uk/government/publications/data-ethics-framework/data-ethics-framework-2020). | ||
* assess the quality of data that could be used to make the prediction, this includes the data used in any pretrained models | ||
* carry out a literature review to identify appropriate modelling, valuation verification methods and document the rationale for selecting their approach | ||
* consider the appropriate data training, validation and testing strategy for your models - it's usual to design a model with a fraction of the data, validate with a separate portion and then test the final model with the data that was not used in the design; | ||
* consider your strategy when testing a pre-trained model, including appropriate validation methods for the models such as calculating similarity to labelled images or ground truths for generative AI | ||
* consider developing automatic checks to identify if the model is behaving unexpectedly, this is important if the model is likely to be used frequently to make regular decisions or is deployed into a production environment | ||
* consider the plan for maintenance and continuous review, including the thresholds or timeline to retrain the model and the resources required to support this - see the [maintenance and continuous review](analytical_lifecycle.qmd/#mainenance-and-continuous-review) section | ||
* consider referring the model to their ethics committee, or a similar group dependent on your interal governance structures - see the [Data Ethics Framework](https://www.gov.uk/government/publications/data-ethics-framework/data-ethics-framework-2020) | ||
* consider setting up a peer or academic review process to test the methodologies and design decisions | ||
|
||
## Multi-use models and the design stage | ||
## Multi-use models | ||
|
||
Designing multi-use models should take into account the needs of all users of the analysis. An Analysis Steering Group may be an effective means for communication about the design with a range of user groups. | ||
|
||
The design of multi-use models may entail a modular structure with different Analysts and Assurers responsible for different elements. The design of assurance activities should capture both the assurance of individual modules and their integration. | ||
The design of multi-use models may entail a modular structure with different analysts and assurers responsible for different elements. The design of assurance activities should capture both the assurance of individual modules and their integration. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.