-
Notifications
You must be signed in to change notification settings - Fork 6.6k
Implement self-monitoring for BanyanDB via OAP Server and UI #13527
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from 15 commits
Commits
Show all changes
37 commits
Select commit
Hold shift + click to select a range
befd51f
Update the Service Level Board
firebor ffcc0da
banyandb添加e2e
firebor 9691d34
banyandb添加e2e
firebor 73e4fe5
banyandb添加mal
firebor 97afa99
banyandb添加ui
firebor 311b128
Merge branch 'apache:master' into master
firebor 8191cf4
add changes
firebor 44341ea
Merge remote-tracking branch 'origin/master'
firebor 7514231
Update oap-server/server-starter/src/main/resources/ui-initialized-te…
firebor 4d0c843
e2e file adds to GHA control file
firebor c5b1c9d
fix: Add the modifications to the end rather than the beginning.
firebor aee11d0
add menu.yaml and UITemplateInitializer.java
firebor b07fb9d
Merge branch 'apache:master' into master
firebor a6254e2
add menu for self_observability_banyandb
firebor a444dd4
add menu for self_observability_banyandb
firebor cbaf6bb
Revert "add menu.yaml and UITemplateInitializer.java"
firebor b62f213
Revert "add menu for self_observability_banyandb"
firebor f276382
Revert "add menu for self_observability_banyandb"
firebor 5c36f56
Merge branch 'master' into master
wu-sheng 22ca133
fix:recover
firebor 867b2be
Merge remote-tracking branch 'origin/master'
firebor a750924
Modify the ID of the banyandb-instance
firebor d00cdf9
Merge branch 'master' into master
wu-sheng 94ddc81
Revise the UI
firebor cef2c22
Merge branch 'master' into master
wankai123 407c7a5
Merge remote-tracking branch 'origin/master'
firebor 30f58fb
Merge branch 'master' into master
wankai123 9b60111
fix:e2e No such file or directory
firebor 41ba225
Merge branch 'master' into master
firebor 00b8229
Add the BanyanDB instruction manual and menu.
firebor f3b8524
Merge remote-tracking branch 'origin/master'
firebor a842785
Merge branch 'master' into master
wankai123 0ecfc2a
fix:e2e path error
firebor f7a6ab3
Merge remote-tracking branch 'origin/master'
firebor f2d5274
fix:dead links found.
firebor 6de93e6
fix:dead links found.
firebor 9736fb6
fix:Revise opentelemetry-receiver.md
firebor File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
84 changes: 84 additions & 0 deletions
84
oap-server/server-starter/src/main/resources/otel-rules/banyandb.yaml
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,84 @@ | ||
| # Licensed to the Apache Software Foundation (ASF) under one or more | ||
| # contributor license agreements. See the NOTICE file distributed with | ||
| # this work for additional information regarding copyright ownership. | ||
| # The ASF licenses this file to You under the Apache License, Version 2.0 | ||
| # (the "License"); you may not use this file except in compliance with | ||
| # the License. You may obtain a copy of the License at | ||
| # | ||
| # http://www.apache.org/licenses/LICENSE-2.0 | ||
| # | ||
| # Unless required by applicable law or agreed to in writing, software | ||
| # distributed under the License is distributed on an "AS IS" BASIS, | ||
| # WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. | ||
| # See the License for the specific language governing permissions and | ||
| # limitations under the License. | ||
|
|
||
| # This will parse a textual representation of a duration. The formats | ||
| # accepted are based on the ISO-8601 duration format {@code PnDTnHnMn.nS} | ||
| # with days considered to be exactly 24 hours. | ||
| # <p> | ||
| # Examples: | ||
| # <pre> | ||
| # "PT20.345S" -- parses as "20.345 seconds" | ||
| # "PT15M" -- parses as "15 minutes" (where a minute is 60 seconds) | ||
| # "PT10H" -- parses as "10 hours" (where an hour is 3600 seconds) | ||
| # "P2D" -- parses as "2 days" (where a day is 24 hours or 86400 seconds) | ||
| # "P2DT3H4M" -- parses as "2 days, 3 hours and 4 minutes" | ||
| # "P-6H3M" -- parses as "-6 hours and +3 minutes" | ||
| # "-P6H3M" -- parses as "-6 hours and -3 minutes" | ||
| # "-P-6H+3M" -- parses as "+6 hours and -3 minutes" | ||
| # </pre> | ||
| filter: "{ tags -> tags.job_name == 'banyandb-monitoring' }" | ||
| expSuffix: service(['host_name'] , Layer.BANYANDB).instance(['host_name'], ['service_instance_id'], Layer.BANYANDB) | ||
| metricPrefix: meter_banyandb | ||
| metricsRules: | ||
| - name: write_rate | ||
| exp: banyandb_measure_total_written.rate('PT15S')+banyandb_stream_tst_total_written.rate('PT15S') | ||
| - name: total_memory | ||
| exp: banyandb_system_memory_state.tagEqual('kind','total') | ||
| - name: disk_usage | ||
| exp: banyandb_system_disk.tagEqual('kind','used').sum(['kind','host_name','service_instance_id']) | ||
| - name: query_rate | ||
| exp: banyandb_liaison_grpc_total_started.sum(['method','host_name','service_instance_id']) | ||
| - name: total_cpu | ||
| exp: banyandb_system_cpu_num | ||
| - name: write_and_query_errors_rate | ||
| exp: banyandb_liaison_grpc_total_err.tagEqual('method','query').sum(['method','host_name','service_instance_id']).rate('PT15S')*60 + banyandb_liaison_grpc_total_stream_msg_sent_err.sum(['host_name','service_instance_id']).rate('PT15S')*60 + banyandb_liaison_grpc_total_stream_msg_received_err.sum(['host_name','service_instance_id']).rate('PT15S')*60 + banyandb_queue_sub_total_msg_sent_err.sum(['host_name','service_instance_id']).rate('PT15S')*60 | ||
| - name: etcd_operation_rate | ||
| exp: banyandb_liaison_grpc_total_registry_started.sum(['host_name','service_instance_id']).rate('PT15S') + banyandb_liaison_grpc_total_started.sum(['host_name','service_instance_id']).rate('PT15S') | ||
| - name: active_instance | ||
| exp: up.sum(['host_name','service_instance_id']).downsampling(MIN) | ||
| - name: cpu_usage | ||
| exp: (((process_cpu_seconds_total.sum(['host_name','service_instance_id']).rate('PT15S') / banyandb_system_cpu_num.sum(['host_name','service_instance_id']))).max(['host_name','service_instance_id']))*1000 | ||
| - name: rss_memory_usage | ||
| exp: ((process_resident_memory_bytes.sum(['host_name','service_instance_id']).downsampling(MAX) / banyandb_system_memory_state.tagEqual('kind','total').sum(['host_name','service_instance_id'])).max(['host_name','service_instance_id']))*1000 | ||
| - name: disk_usage_all | ||
| exp: ((banyandb_system_disk.tagEqual('kind','used').sum(['host_name','service_instance_id']) / banyandb_system_memory_state.tagEqual('kind','total').sum(['host_name','service_instance_id'])).max(['host_name','service_instance_id']))*1000 | ||
| - name: network_usage_recv | ||
| exp: banyandb_system_net_state.tagEqual('kind','bytes_recv').sum(['host_name','service_instance_id']).rate('PT15S') | ||
| - name: network_usage_sent | ||
| exp: banyandb_system_net_state.tagEqual('kind','bytes_sent').sum(['host_name','service_instance_id']).rate('PT15S') | ||
| - name: storage_write_rate | ||
| exp: banyandb_measure_total_written.sum(['host_name','service_instance_id']).rate('PT15S')*1000 | ||
| - name: query_latency | ||
| exp: (banyandb_liaison_grpc_total_latency.tagEqual('method','query').sum(['group','host_name','service_instance_id']).rate('PT15S') / banyandb_liaison_grpc_total_started.tagEqual('method','query').sum(['group','host_name','service_instance_id']).rate('PT15S'))*1000 | ||
| - name: total_data | ||
| exp: banyandb_measure_total_file_elements.sum(['group','host_name','service_instance_id']) | ||
| - name: merge_file_data | ||
| exp: banyandb_measure_total_merge_loop_started.sum(['group','host_name','service_instance_id']).rate('PT15S') * 60 *1000 | ||
| - name: merge_file_latency | ||
| exp: (banyandb_measure_total_merge_latency.tagEqual('type','file').sum(['group','host_name','service_instance_id']).rate('PT15S') / banyandb_measure_total_merge_loop_started.sum(['group','host_name','service_instance_id']).rate('PT15S'))*1000 | ||
| - name: merge_file_partitions | ||
| exp: (banyandb_measure_total_merged_parts.tagEqual('type','file').sum(['group','host_name','service_instance_id']).rate('PT15S') / banyandb_measure_total_merge_loop_started.sum(['group','host_name','service_instance_id']).rate('PT15S'))*1000 | ||
| - name: series_write_rate | ||
| exp: (banyandb_measure_inverted_index_total_updates.sum(['group','host_name','service_instance_id']).rate('PT15S'))*1000 | ||
| - name: series_term_search_rate | ||
| exp: banyandb_stream_storage_inverted_index_total_term_searchers_started.sum(['group','host_name','service_instance_id']).rate('PT15S') | ||
| - name: total_series | ||
| exp: banyandb_measure_inverted_index_total_doc_count.sum(['group','host_name','service_instance_id']) | ||
| - name: stream_write_rate | ||
| exp: banyandb_stream_tst_inverted_index_total_updates.sum(['group','host_name','service_instance_id']).rate('PT15S') | ||
| - name: term_search_rate | ||
| exp: banyandb_stream_tst_inverted_index_total_term_searchers_started.sum(['group','host_name','service_instance_id']).rate('PT15S')* 1000 | ||
| - name: total_document | ||
| exp: banyandb_stream_tst_inverted_index_total_doc_count.sum(['group','host_name','service_instance_id']) |
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This seems an incorrect format. How do you format codes?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
modified