You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When Filebeat uses the Kafka output, it can happen that a miss configured ACL causes writes to fail and the output keeps re-trying to publish the event over and over again without any sort of back off, which makes Filebeat to start consuming high amounts of CPU until the ACL is fixed.
The short term solution is to stop Filebeat, fix the ACL and ensure the credentials are correctly set and then re-start Filebeat.
The problem sees to came from the error handling code:
Unfortunately I have not managed to reproduce this specific situation, however looking at the code and talking with @faec and @mauri870 that seems to be the most likely cause.
The report I saw of this issue, the user is handling authentication with TLS certificates and setting up ACLs. Once the ACL is correct, Filebeat works as expected, however if the ACL is miss configured/missing, then Filebeat goes into this high CPU usage state.
Because the issue seems to be coming from the output code, other Beats could be affected by the same problem
The text was updated successfully, but these errors were encountered:
When Filebeat uses the Kafka output, it can happen that a miss configured ACL causes writes to fail and the output keeps re-trying to publish the event over and over again without any sort of back off, which makes Filebeat to start consuming high amounts of CPU until the ACL is fixed.
The short term solution is to stop Filebeat, fix the ACL and ensure the credentials are correctly set and then re-start Filebeat.
The problem sees to came from the error handling code:
beats/libbeat/outputs/kafka/client.go
Lines 370 to 392 in 4942592
That does not handle specific errors like
sarama.ErrTopicAuthorizationFailed
Unfortunately I have not managed to reproduce this specific situation, however looking at the code and talking with @faec and @mauri870 that seems to be the most likely cause.
The report I saw of this issue, the user is handling authentication with TLS certificates and setting up ACLs. Once the ACL is correct, Filebeat works as expected, however if the ACL is miss configured/missing, then Filebeat goes into this high CPU usage state.
Because the issue seems to be coming from the output code, other Beats could be affected by the same problem
The text was updated successfully, but these errors were encountered: