-
Notifications
You must be signed in to change notification settings - Fork 851
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Expose metric for log export failure #6709 #6779
base: main
Are you sure you want to change the base?
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -141,8 +141,10 @@ private static final class Worker implements Runnable { | |
private static final Logger logger = Logger.getLogger(Worker.class.getName()); | ||
|
||
private final LongCounter processedLogsCounter; | ||
private final LongCounter logsExportFailureCounter; | ||
private final Attributes droppedAttrs; | ||
private final Attributes exportedAttrs; | ||
private final Attributes exportFailureAttrs; | ||
|
||
private final LogRecordExporter logRecordExporter; | ||
private final long scheduleDelayNanos; | ||
|
@@ -197,6 +199,12 @@ private Worker( | |
"The number of logs processed by the BatchLogRecordProcessor. " | ||
+ "[dropped=true if they were dropped due to high throughput]") | ||
.build(); | ||
logsExportFailureCounter = | ||
meter | ||
.counterBuilder("logsExportFailure") | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The OTLP exporters already have dedicated metrics to track failures: https://github.com/open-telemetry/opentelemetry-java/blob/main/exporters/common/src/main/java/io/opentelemetry/exporter/internal/ExporterMetrics.java Would these serve your needs? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. @jack-berg Thanks. I think this does address the requirement. I tried finding if something already exists for exporter in general, as this is a generic need for any kind of exporter not just BatchLogExporter.
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. These metrics should be enabled by default if using autoconfigure. Note if not using autoconfigure, you need to carefully order the initialization so that the configured meter provider can be passed to the OTLP exporters for spans and logs to collect internal telemetry. I'm not sure what There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. My bad, I didn't backup the entire collector logs and misinterpreted that these metrics need to be enabled. These are present by default. |
||
.setUnit("1") | ||
.setDescription("Logs export failure in BatchLogRecordProcessor.") | ||
.build(); | ||
droppedAttrs = | ||
Attributes.of( | ||
LOG_RECORD_PROCESSOR_TYPE_LABEL, | ||
|
@@ -209,6 +217,8 @@ private Worker( | |
LOG_RECORD_PROCESSOR_TYPE_VALUE, | ||
LOG_RECORD_PROCESSOR_DROPPED_LABEL, | ||
false); | ||
exportFailureAttrs = | ||
Attributes.of(LOG_RECORD_PROCESSOR_TYPE_LABEL, LOG_RECORD_PROCESSOR_TYPE_VALUE); | ||
|
||
this.batch = new ArrayList<>(this.maxExportBatchSize); | ||
} | ||
|
@@ -324,6 +334,7 @@ private void exportCurrentBatch() { | |
processedLogsCounter.add(batch.size(), exportedAttrs); | ||
} else { | ||
logger.log(Level.FINE, "Exporter failed"); | ||
logsExportFailureCounter.add(1, exportFailureAttrs); | ||
} | ||
} catch (RuntimeException e) { | ||
logger.log(Level.WARNING, "Exporter threw an Exception", e); | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
While this seems like a small change, I'm reluctant to make it because there have been some attempts to standardize the SDKs' internal telemetry (e.g. OTEP#238).
The problem with continuing the pattern of these current metrics is that the structure doesn't conform to our semantic convention recommendations.
{export}
instead of1
Extending the instrumentation extends bad patterns. Fixing the bad patterns exposes our users to breaking changes, only to have more later if / when semantic conventions emerge. So we appear to be stuck. I'll bring it up at next week's java SIG to see if can reach any conclusion.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@jack-berg agree with you here that current pattern (existing as well as any proposed metric in future) doesn't conform to semantic recommendations like metric name having namespace, well defined units, etc.
So we are stuck between extending new instrumentations/ rectifying' the existing instrumentations with bad semantics AND the recommended ones. Do let us know how the discussions go with this. As this will be applicable in general, not just here.