Skip to content

[Feature] Enhance the alarm kernel with recovered status notification capability for alarm rules. #13492

@youjie23

Description

@youjie23

Search before asking

  • I had searched in the issues and found no similar feature requirement.

Description

Problem Description

Currently, the AlarmMessage (sent via webhook) does not include the silence-period defined in the alarm rule. This makes it difficult for external systems to accurately determine when an alarm has truly recovered, as they don't know the duration for which SkyWalking will suppress duplicate alerts.

Proposed Solution

Add a new field silencePeriod to the AlarmMessage data structure. The value of this field should be the silence-period (in minutes) configured in the triggered alarm rule.

Expected Benefits

  • Downstream systems can calculate the expected end time of the silence period (startTime + silencePeriod).
  • This enables more efficient and accurate detection of alarm recovery: if no new AlarmMessage for the same rule and entity is received after the calculated silence period end time, it can be considered recovered.
  • Facilitates the automation of alarm recovery notifications or self-healing operations.

Use case

No response

Related issues

No response

Are you willing to submit a pull request to implement this on your own?

  • Yes I am willing to submit a pull request on my own!

Code of Conduct

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions