@@ -18,28 +18,57 @@ in the future will also be described in this vignette. We will also
18
18
provide validation metrics here whenever we make a change, and track
19
19
these validations over the different versions.
20
20
21
- ## Specific changes
21
+ ## Specific changes since the original validation (version from the paper)
22
22
23
- ### Since the original published validation (version 1)
23
+ ### Version 1.1
24
24
25
25
1 . We don't use purchases of semaglutid, dapagliflozin or
26
26
empagliflozin, neither for inclusion events nor classification of
27
27
diabetes type (due to increasing use in treatment of non-diabetes).
28
28
2 . We no longer use diabetes type reclassification based on insulin
29
29
purchases in the previous year.
30
+ 3 . The logic defining pregnancy index dates has been simplified to only
31
+ use diagnoses of pregnancy endings (no longer uses data on maternal
32
+ care visits).
33
+ 4 . De-duplicates subsequent HbA1c samples taken on the same date
34
+ (originally, if a sampling time was available in the lab data, only
35
+ samples taken at the same time were de-duplicated)
30
36
31
37
## Validity
32
38
33
39
Algorithm validity across versions. Reports ` PPV ` (* positive predictive
34
40
value* ) and ` sensitivity ` for typical cases and cases with atypical age
35
41
at onset of T1D (after age 40) and T2D (before age 40), respectively.
36
42
43
+ ### On pre-2019 data (as in the paper)
44
+
45
+ ** Overall and age at onset-stratified (paper table 1 & 2):**
46
+
47
+ | Version | Diabetes type | PPV | Sensitivity |
48
+ | ---------| ---------------| -------| -------------|
49
+ | Paper | T1D | 0.943 | 0.773 |
50
+ | Paper | T1D \> 40 yrs | 0.708 | 0.378 |
51
+ | Paper | T2D | 0.875 | 0.944 |
52
+ | Paper | T2D \< 40 yrs | 0.471 | 0.863 |
53
+
37
54
| Version | Diabetes type | PPV | Sensitivity |
38
55
| ---------| ---------------| -------| -------------|
39
- | 1 | T1D | 0.943 | 0.773 |
40
- | 1 | T1D \> 40 yrs | 0.708 | 0.378 |
41
- | 1 | T2D | 0.875 | 0.944 |
42
- | 1 | T2D \< 40 yrs | 0.471 | 0.863 |
56
+ | 1.1 | T1D | 0.943 | 0.789 |
57
+ | 1.1 | T1D \> 40 yrs | 0.871 | 0.871 |
58
+ | 1.1 | T2D | 0.883 | 0.941 |
59
+ | 1.1 | T2D \< 40 yrs | 0.519 | 0.857 |
60
+
61
+ ** Bootstrapped metrics (paper S3):**
62
+
63
+ | Version | Diabetes type | Sensitivity | Specificity | PPV | NPV |
64
+ | ---------| ---------------| -------------| -------------| -------| -------|
65
+ | Paper | T1D | 0.774 | 0.999 | 0.951 | 0.997 |
66
+ | Paper | T2D | 0.943 | 0.989 | 0.878 | 0.995 |
67
+
68
+ | Version | Diabetes type | Sensitivity | Specificity | PPV | NPV |
69
+ | ---------| ---------------| -------------| -------------| -------| -------|
70
+ | 1.1 | T1D | 0.781 | 0.999 | 0.949 | 0.997 |
71
+ | 1.1 | T2D | 0.943 | 0.989 | 0.879 | 0.995 |
43
72
44
73
## Potential future changes
45
74
@@ -49,9 +78,7 @@ at onset of T1D (after age 40) and T2D (before age 40), respectively.
49
78
to 1995 (rather than 1997 onward, as the obstetric codes are limited
50
79
to), and enable the extension of the window of valid dates of
51
80
diagnosis to 1996 onward.
52
- 2 . Simplify logic defining pregnancy index dates to remove dependency
53
- on maternal care visits (if performance in validation allows).
54
- 3 . Limit the historic scope of primary diagnoses used to evaluate
81
+ 2 . Limit the historic scope of primary diagnoses used to evaluate
55
82
majority of diabetes-specific diagnoses in type classification (e.g.
56
83
only evaluate majority among the last five type-specific diabetes
57
84
diagnoses).
0 commit comments