1
1
# epiprocess
2
-
3
- <!-- badges: start -->
4
- [ ![ R-CMD-check] ( https://github.com/cmu-delphi/epiprocess/actions/workflows/R-CMD-check.yaml/badge.svg )] ( https://github.com/cmu-delphi/epiprocess/actions/workflows/R-CMD-check.yaml )
5
- <!-- badges: end -->
2
+
3
+ <!-- badges: start -->
4
+
5
+ [ ![ R-CMD-check] ( https://github.com/cmu-delphi/epiprocess/actions/workflows/R-CMD-check.yaml/badge.svg )] ( https://github.com/cmu-delphi/epiprocess/actions/workflows/R-CMD-check.yaml )
6
+
7
+ <!-- badges: end -->
6
8
7
9
This package introduces a common data structure for epidemiological data sets
8
10
measured over space and time, and offers associated utilities to perform basic
9
11
signal processing tasks. See the getting started guide and vignettes for
10
12
examples.
11
13
14
+ ## Installation
15
+
16
+ To install (unless you're making changes to the package, use the stable version):
17
+
18
+ ``` r
19
+ # Stable version
20
+ pak :: pkg_install(" cmu-delphi/epiprocess@main" )
21
+
22
+ # Dev version
23
+ pak :: pkg_install(" cmu-delphi/epiprocess@dev" )
24
+ ```
25
+
12
26
## ` epi_df ` : snapshot of a data set
13
27
14
28
The first main data structure in the ` epiprocess ` package is called
@@ -20,28 +34,26 @@ contains the most up-to-date values of the signals variables, as of a given
20
34
time.
21
35
22
36
By convention, functions in the ` epiprocess ` package that operate on ` epi_df `
23
- objects begin with ` epi ` . For example:
37
+ objects begin with ` epi ` . For example:
24
38
25
- - ` epi_slide() ` , for iteratively applying a custom computation to a variable in
26
- an ` epi_df ` object over sliding windows in time;
27
-
28
- - ` epi_cor() ` , for computing lagged correlations between variables in an
29
- ` epi_df ` object, (allowing for grouping by geo value, time value, or any other
30
- variables).
39
+ - ` epi_slide() ` , for iteratively applying a custom computation to a variable in
40
+ an ` epi_df ` object over sliding windows in time;
41
+ - ` epi_cor() ` , for computing lagged correlations between variables in an
42
+ ` epi_df ` object, (allowing for grouping by geo value, time value, or any other
43
+ variables).
31
44
32
45
Functions in the package that operate directly on given variables do not begin
33
- with ` epi ` . For example:
34
-
35
- - ` growth_rate() ` , for estimating the growth rate of a given signal at given
36
- time values, using various methodologies;
46
+ with ` epi ` . For example:
37
47
38
- - ` detect_outlr() ` , for detecting outliers in a given signal over time, using
39
- either built-in or custom methodologies.
48
+ - ` growth_rate() ` , for estimating the growth rate of a given signal at given
49
+ time values, using various methodologies;
50
+ - ` detect_outlr() ` , for detecting outliers in a given signal over time, using
51
+ either built-in or custom methodologies.
40
52
41
53
## ` epi_archive ` : full version history of a data set
42
54
43
55
The second main data structure in the package is called
44
- [ ` epi_archive ` ] ( reference/epi_archive.html ) . This is a special class (R6 format)
56
+ [ ` epi_archive ` ] ( reference/epi_archive.html ) . This is a special class (R6 format)
45
57
wrapped around a data table that stores the archive (version history) of some
46
58
signal variables of interest.
47
59
@@ -50,20 +62,17 @@ By convention, functions in the `epiprocess` package that operate on
50
62
"archive"). These are just wrapper functions around the public methods for the
51
63
` epi_archive ` R6 class. For example:
52
64
53
- - ` epix_as_of() ` , for generating a snapshot in ` epi_df ` format from the data
54
- archive, which represents the most up-to-date values of the signal variables,
55
- as of the specified version;
56
-
57
- - ` epix_fill_through_version() ` , for filling in some fake version data following
58
- simple rules, for use when downstream methods expect an archive that is more
59
- up-to-date (e.g., if it is a forecasting deadline date and one of our data
60
- sources cannot be accessed to provide the latest versions of its data)
61
-
62
- - ` epix_merge() ` , for merging two data archives with each other, with support
63
- for various approaches to handling when one of the archives is more up-to-date
64
- version-wise than the other;
65
-
66
- - ` epix_slide() ` , for sliding a custom computation to a data archive over local
67
- windows in time, much like ` epi_slide ` for an ` epi_df ` object, but with one
68
- key difference: the sliding computation at any given reference time t is
69
- performed only on the ** data that would have been available as of t** .
65
+ - ` epix_as_of() ` , for generating a snapshot in ` epi_df ` format from the data
66
+ archive, which represents the most up-to-date values of the signal variables,
67
+ as of the specified version;
68
+ - ` epix_fill_through_version() ` , for filling in some fake version data following
69
+ simple rules, for use when downstream methods expect an archive that is more
70
+ up-to-date (e.g., if it is a forecasting deadline date and one of our data
71
+ sources cannot be accessed to provide the latest versions of its data)
72
+ - ` epix_merge() ` , for merging two data archives with each other, with support
73
+ for various approaches to handling when one of the archives is more up-to-date
74
+ version-wise than the other;
75
+ - ` epix_slide() ` , for sliding a custom computation to a data archive over local
76
+ windows in time, much like ` epi_slide ` for an ` epi_df ` object, but with one
77
+ key difference: the sliding computation at any given reference time t is
78
+ performed only on the ** data that would have been available as of t** .
0 commit comments