Skip to content

Commit 8cfb191

Browse files
authored
move dip warning to limitations section (#1587)
1 parent 60df975 commit 8cfb191

File tree

1 file changed

+10
-3
lines changed

1 file changed

+10
-3
lines changed

docs/api/covidcast-signals/google-symptoms.md

+10-3
Original file line numberDiff line numberDiff line change
@@ -16,15 +16,15 @@ nav_order: 1
1616
* **Time type:** day (see [date format docs](../covidcast_times.md))
1717
* **License:** To download or use the data, you must agree to the Google [Terms of Service](https://policies.google.com/terms)
1818

19-
<div style="background-color:#ff00001c; padding: 10px 30px;"><strong>Data issue:</strong> Between May 13 2024 and August 6 2024, signals values were 25%-50% lower compared to previous time periods. This affects <i>all</i> signals and symptom sets. Currently there is no explanation for the decrease in search volume, and the issue is under investigation by our data source partners.</div>
20-
2119
## Overview
2220

2321
This data source is based on the [COVID-19 Search Trends symptoms
2422
dataset](https://console.cloud.google.com/marketplace/product/bigquery-public-datasets/covid19-search-trends?hl=en-GB). Using
2523
this search data, we estimate the volume of searches mapped to symptom sets related
2624
to COVID-19. The resulting daily dataset for each region shows the average relative frequency of searches for each symptom set. The signals are measured in arbitrary units that are normalized for overall search users in the region and scaled by the maximum value of the normalized popularity within a geographic region across a specific time range. **Values are comparable across signals in the same location but NOT across geographic regions**. For example, within a state, we can compare `s01_smoothed_search` and `s02_smoothed_search`. However, we cannot compare `s01_smoothed_search` between states. Larger numbers represent increased relative popularity of symptom-related searches.
2725

26+
Between May 13 2024 and August 6 2024, [signal values were much lower](#limitations) compared to previous time periods due to a data outage.
27+
2828
#### Symptom sets
2929

3030
* _s01_: Cough, Phlegm, Sputum, Upper respiratory tract infection
@@ -94,7 +94,7 @@ population-weighted averaging.
9494

9595
For aggregation purposes only, we assign a value of 0 to source regions that
9696
have no data provided due to quality or privacy issues for a certain day (see
97-
Limitations for details). We do not report aggregated regions if none of their
97+
[Limitations](#limitations) for details). We do not report aggregated regions if none of their
9898
source regions have data. Because of this censoring behavior, the resulting data
9999
for aggregated regions does not fully match the _actual_ search volume for these
100100
regions (which is not provided to us).
@@ -106,6 +106,13 @@ As a result the delay can range from 3 to 10 days or even more. We check for
106106
updates every day and provide the most up-to-date data.
107107

108108
## Limitations
109+
110+
Between May 13 2024 and August 6 2024, signal values were 25%-50% lower compared to previous time periods.
111+
This affected _all_ signals and symptom sets.
112+
The drop does not reflect actual search term popularity during the affected period.
113+
The apparent decrease in search volume was caused by an outage in the data pipeline on the source side.
114+
The data was unfortunately not recoverable and the dip can not be repaired, but data outside the listed time period is unaffected.
115+
109116
When daily volume in a region does not meet quality or privacy thresholds, set
110117
by Google, no daily value is reported. Weekly data may be available from Google
111118
in these cases, but we do not yet support importation using weekly data.

0 commit comments

Comments
 (0)