I'm preparing a Data quality Report based on couple Contour analyses and would like to do a daily snapshots of the reported incorrect records. Then I want to show these daily numbers as another report in the same dashboard to see the progress on the data quality.
The main questions for me are:
- can a Contour analyses be used as a source for data storing/computation
- how to store these numbers on a daily base (e.g. Fusion spreadsheet or Code workbook etc.)
Here's one process for setting up daily snapshots of a dataset derived from a Contour analysis:
Ensure that the Contour analysis results are saved as a dataset. Let's call this dataset
mydataset:Create a Python Transform that performs daily snapshots and stores them in a dataset named
mydataset_daily_snapshots:Create Build Schedules on both
mydatasetandmydataset_daily_snapshotsthat build the datasets daily (or as frequently as desired), so thatmydataset_daily_snapshotswill have data snapshots for each day. Ensure you checkForce buildso that snapshots will always be built, even if the source data has not changed:You can then use the
mydataset_daily_snapshotsdataset within another Contour analysis to show the changes in the data over time in a Report, or create an Ontology object from it and use Workshop to show the change over time.Something to keep in mind is that this dataset can potentially get very large very quickly -- any filtering to keep the dataset smaller (e.g. to limit snapshots to just the incorrect records or a sum of incorrect records for the day, for example) is a good idea.