Operational Data Store » History » Sprint/Milestone 2
Steven (eSHIFT) Uggowitzer, 21 July 2022 08:49 AM
1 | 1 | Steven (eSHIFT) Uggowitzer | h1. Logical Model - ODS |
---|---|---|---|
2 | 2 | Steven (eSHIFT) Uggowitzer | |
3 | 1 | Steven (eSHIFT) Uggowitzer | {{>toc}} |
4 | |||
5 | h2. Logical Model of the AMR One Health Operational Data Store (ODS) |
||
6 | |||
7 | There are several challenges arising from attempting to model a One Health version of AMR in a data warehousing platform such as DHIS2. These can often be framed as drivers that are pulling against each other. The success of a national implementation depends on - amongst other things - finding the right balance between: |
||
8 | |||
9 | * A national surveillance viewpoint vs. the desire for an operational viewpoint that reflects laboratory or clinical needs |
||
10 | * Cleaning data before storing it in a national data warehouse vs. the desire to capture everything and "we'll sort it out later" |
||
11 | * Model Complexity (covering all analytical scenarios) vs. Model Simplicity (covering a proportion of analytical needs) |
||
12 | * Human-centric view of AMR vs. One Health multi-sector views of AMR |
||
13 | |||
14 | This release of the AMR One Health platform has made certain decisions around these balance points, each of which has implications on the design of the data models implemented in DHIS2. |
||
15 | |||
16 | h2. Operational Data Stores |
||
17 | |||
18 | The data warehouse discipline includes the concept of an Operational Data Store (or ODS). An ODS brings together data from a number of operational systems into one location, with the data retaining much of the original properties of the source systems. While not considered as a data warehouse proper, they can act as staging areas for data before being processed into data warehouse-type analytical data cubes. Operational reports can query ODS data, and provide an environment for 'digging down into' original source data. |
||
19 | |||
20 | h2. ODS in DHIS2 |
||
21 | |||
22 | With the AMR One Health Platform, a decision was made to treat the Tracked Entity capability in DHIS2 as the equivalent of an ODS construct. Given the competing drivers indicated above and several factors about the AMR data observed from laboratory or other source systems, the following constraints were assumed: |
||
23 | |||
24 | * A single ODS per domain (animal, human, environment, food, and feed) has been created. |
||
25 | * The ODS data model is simple, reflecting the 'line listing' nature of most AMR data sources. |
||
26 | * Compatibility with the WHONET application, widely used for the management and analysis of microbiology laboratory data, was a central consideration to the design. |
||
27 | * A certain 'cleanup' of source system data is assumed as part of the preliminary ETL (extract, transform, load) process for inserting data into the domain ODS's. |
||
28 | * Common metadata lookup lists (pathogens, antimicrobials) were used across the ODS domains to support One Health views. |
||
29 | |||
30 | !AMR_OneHealth_DW_Conceptual-reduced.png! |
||
31 | |||
32 | h2. The ODS Data Model |
||
33 | |||
34 | h3. What are we tracking? |
||
35 | |||
36 | DHIS2's tracker system is designed so one can define entity types (e.g. human), to create an instance of that type (e.g. a specific person with certain attributes), and to enroll that instance in one or more programs. A program (e.g. Childhood Immunization Program) consists of a series of events, with each event (e.g. a vaccination visit) designed to capture a certain set of data items (e.g. the type of vaccine administered). |
||
37 | |||
38 | For the AMR One Health Platform, a key question to answer is what are we tracking? A person? An Animal? A sample? All of these? |
||
39 | |||
40 | To support cross-domain views of AMR activity, the development team agreed that a generic entity type to track is a pathogen sample. A sample is the core of the laboratory testing environment for AMR, and many of the historical human AMR data sets that the team have reviewed only have test results against a sample ID, with no patient data accessible in a digital format. |