Skip to content

Releases: m-cmp/mc-observability

v0.5.0

03 Nov 05:20

Choose a tag to compare

Release Notes v0.5.0

What's Changed

Major Features & Enhancements

🚨 Alert & Trigger System

  • feat: Add direct alert functionality by @suahlingo in #256
  • feat: Add trigger module with email, SMS, and Slack integration by @suahlingo in #201, #205, #170
  • feat: Add RabbitMQ integration for alert message queuing by @suahlingo
  • docs: Add comprehensive RabbitMQ guide by @suahlingo

🤖 AI/LLM Integration

  • feat: Add MCP (Model Context Protocol) core integration by @GreenScreen410 in #164
  • feat: Add external MCP servers (InfluxDB, MariaDB) by @GreenScreen410 in #163
  • feat: Add log analysis API with LLM support by @GreenScreen410 in #166
  • feat: Add conversation summary features with context retention by @kyuengmanKim in #176, #154
  • feat: Add LangGraph workflow & state management by @kyuengmanKim
  • feat: Add prompt management features by @kyuengmanKim
  • feat: Integrate LLM provider APIs and DB tables by @inhun in #175
  • feat: Add anomaly detection feature aligned with o11y manager API by @inhun
  • feat: Add prediction feature aligned with o11y manager API by @inhun

📊 Logging & Tracing

  • feat: Add logging & tracing configuration with OpenTelemetry by @inhun in #190
  • feat: Add m-cmp system log collector by @kyuengmanKim
  • feat: Add Tempo configuration for distributed tracing by @kyuengmanKim
  • feat: Add Loki integration for log aggregation by @suahlingo
  • refactor: Refactor tracing and logging to use OpenTelemetry Java agent by @suahlingo

🏗️ Infrastructure & Architecture

  • refactor: Major project structure reorganization by @kyuengmanKim in #254, #249, #247, #246, #244, #241, #240, #237
  • refactor: Updates docker compose & config directory structure by @kyuengmanKim
  • feat: Add second InfluxDB instance (mcp-influxdb) by @ish-hcc
  • feat: Add MinIO for object storage by @ish-hcc
  • feat: Add Semaphore integration for deployment automation by @ish-hcc
  • feat: Add comprehensive GitHub Actions workflows for CI/CD by @ish-hcc
  • feat: Add Docker multi-stage builds for reduced image sizes by @ish-hcc

🔧 Monitoring Agent Enhancements

  • feat: Implement per metric configuration feature by @ish-hcc in #141
  • feat: Implement plugin feature for monitoring agents by @ish-hcc
  • feat: Add agent configuration update via Ansible by @ish-hcc
  • feat: Add Fluent-Bit agent for log collection by @ish-hcc
  • feat: Add Telegraf agent improvements with procstat top10 CPU/memory by @ish-hcc
  • feat: Use private key for SSH connections by @ish-hcc

Breaking Changes & Migrations

Java Module Restructuring

  • refactor: Rename java-module to java by @kyuengmanKim
  • refactor: Split old mc-o11y-agent and mc-o11y-manager into new mc-o11y-manager structure
  • refactor: Upgrade to JDK 17 and Spring Boot 3.2.12 by @ish-hcc
  • refactor: Upgrade Gradle to 9.1.0 by @ish-hcc
  • refactor: Rename "target" to "vm" throughout codebase by @ish-hcc

Configuration Changes

  • refactor: Externalize sensitive configs via environment variables by @suahlingo
  • feat: Use Grafana environment file by @ish-hcc
  • fix: Update InfluxDB database name handling by @ish-hcc

API Improvements

New Endpoints

  • feat: Add readyz API for health checks by @ish-hcc in #189
  • feat: Add OpenAI API key management endpoints by @GreenScreen410 in #143
  • feat: Add log API improvements (start/end fields optional) by @ish-hcc
  • feat: Add WebSocket support for real-time VM status updates by @ish-hcc
  • feat: Add insight module APIs by @kyuengmanKim

API Refactoring

  • refactor: Split LLM analysis API into feature-specific functions by @kyuengmanKim
  • refactor: Update Insight controller & swagger definitions by @kyuengmanKim
  • refactor: Split controllers by feature by @ish-hcc
  • refactor: Improve error responses and exception handling by @suahlingo

Documentation & Developer Experience

Documentation

API Documentation

Developer Tools

  • feat: Add Makefile with lint, test, and build commands by @ish-hcc
  • feat: Add Spotless for code formatting by @suahlingo
  • chore: Update .gitignore for better project hygiene by @GreenScreen410

Bug Fixes

Critical Fixes

  • fix: Resolve agent installation permission issues by @kyuengmanKim
  • fix: Fix agent status tracking and state management by @suahlingo
  • fix: Fix WebSocket issues when agent status changes by @ish-hcc
  • fix: Apply locks to prevent race conditions in host/VM operations by @ish-hcc
  • fix: Fix InfluxDB connection and database name issues by @ish-hcc
  • fix: Call agent uninstall when deleting VMs by @ish-hcc

Configuration Fixes

  • fix: Add missed InfluxDB2 provisioning file by @ish-hcc
  • fix: Add missed Ansible files for config updates by @ish-hcc
  • fix: Fix Grafana datasource configurations by @ish-hcc
  • fix: Fix RabbitMQ dependency and configuration issues by @ish-hcc

Monitoring Fixes

  • fix: Remove NVIDIA SMI input (fields not provided) by @ish-hcc
  • fix: Remove Fluent-Bit HTTP service port for security by @ish-hcc
  • fix: Optimize Telegraf configuration by @ish-hcc
  • fix: Fix Loki error messages and Log Explorer errors by @최낙수

Scheduler & Timezone

  • fix: Insight scheduler timezone bug by @kyuengmanKim
  • fix: Improve healthcheck configurations by @ish-hcc

Dependency Updates

Infrastructure Components

  • docker-compose: Update cb-tumblebug to v0.11.16 by @ish-hcc
  • go: Update cb-spider to v0.11.16 by @ish-hcc
  • Previous updates include cb-tumblebug v0.11.13, v0.11.9, v0.11.8, v0.11.6
  • Previous updates include cb-spider v0.11.13, v0.11.5, v0.11.4, v0.11.3, v0.11.1

Python Dependencies

  • chore: Optimize packaging with uv to reduce image size by @inhun in #203, #218
  • feat: Reduce Python image size by @ish-hcc

Java Dependencies

  • feat: Upgrade to Gradle 9.1.0 by @ish-hcc
  • feat: Upgrade to JDK 17 and Spring Boot 3.2.12 by @ish-hcc

Performance Improvements

  • feat: Reduce image size of mc-observability-manager by @ish-hcc
  • feat: Reduce image size of mc-observability-grafana by @ish-hcc
  • feat: Optimize Telegraf configuration for better performance by @ish-hcc
  • feat: Implement procstat collection of only top10 CPU/memory by @ish-hcc
  • chore: Optimize Python packaging to reduce image size by @inhun

Contributors

Special thanks to all contributors:

Full Changelog

Full Changelog: v0.4.0...v0.5.0
Swagger: https://m-cmp.github.io/mc-observability/java/swagger/index.html


Migration Guide

For Users Upgrading from v0.4.0

  1. Directory Structure Changes:

    • java-module has been renamed to java
    • Configuration files moved to config/ directory
  2. Environment Variables:

    • Review and update your environment variables
    • Check config/manager/.env for new configuration options
  3. Java Version:

    • JDK 17 is now required (upgraded from JDK 11)
  4. API Changes:

    • "target" endpoints renamed to "vm"
    • Check Swagger documentation for updated API endpoints
  5. New Features:

    • Configure RabbitMQ for alert notifications
    • Set up Loki for log aggregation
    • Configure Tempo for distributed tracing
    • Set up LLM integration if needed

For detailed migration instructions, please refer to the project documentation.

v0.4.4

30 Oct 08:10
4f27ba8

Choose a tag to compare

Notice

This release serves as a pre-release ahead of the upcoming v0.5.0 and fix issues in v0.4.3

What's Changed

  • fix: java: Prevent multiple request conflicts, Fix log API, Remove NVIDIA measurement by @ish-hcc in #255

Full Changelog: v0.4.3...v0.4.4

v0.4.3

24 Oct 07:33
30cb20a

Choose a tag to compare

Notice

This release serves as a pre-release ahead of the upcoming v0.5.0.

API

Major changes

  • Refactored overall project structure for better maintainability.
  • Improved Java module build system and cleaned Docker-related files.
  • Refactored Loki setup and enhanced Swagger API documentation.
  • Added Grafana configurations and environment variables for observability.
  • Fixed timezone issue in Insight scheduler and InfluxDB database name errors.

What's Changed

Full Changelog: v0.4.2...v0.4.3

v0.4.2

17 Oct 08:26
195701e

Choose a tag to compare

API

Swagger UI URL: https://m-cmp.github.io/mc-observability/java-module/swagger/index.html

API Detail Usage Scenario (v0.4.2)

mc-observability Agent support metric list: (link)
mc-observability v0.4.2 monitoring & logging API usage scenarios: (link)
mc-observability v0.4.2 Trigger/Event Handler API usage scenarios: (link)
mc-observability v0.4.2 Insight API usage scenarios: (link)

Major changes

  • Refactored Java module and overall codebase
  • Enhanced Insight API integration with Manager
  • Fixed RabbitMQ and configuration dependency issues
  • Improved observability container stability and healthchecks
  • Upgraded build and workflow system

What's Changed

Full Changelog: v0.4.1...v0.4.2

v0.4.1

01 Oct 07:26
d0b257f

Choose a tag to compare

Notice

This release focuses on feature integration updates, with verification limited to program startup and container health checks.
Detailed feature validation and guidance will be provided in the upcoming v0.4.2 release(mid-October).

API

Swagger UI URL: https://m-cmp.github.io/mc-observability/java-module/swagger/index.html

Major changes

  • Refactored backend architecture for integration and scalability.
  • Migrated to JDK 17 and Spring Boot 3.2.
  • Introduced LLM and log analysis APIs.
  • Added MCP integration.
  • Upgraded cb-tumblebug and cb-spider to v0.11.x.

What's Changed

New Contributors

Full Changelog: v0.4.0...v0.4.1

v0.4.0

22 Jul 06:40

Choose a tag to compare

API

Swagger UI URL: https://m-cmp.github.io/mc-observability/java-module/swagger/index.html

Major changes

  • LLM Log Analysis feature development in progress (add some WIP API)
  • CB-Tumblebug v0.11.0 Integration test complete.
  • CB-Spider v0.11.0 Integration test complete.
  • minor bug fixes

What's Changed

Full Changelog: v0.3.1...v0.4.0

v0.3.1

21 Nov 05:07
e6b67ea

Choose a tag to compare

What's Changed

Full Changelog: v0.3.0...v0.3.1

v0.3.0

31 Oct 06:52
7d5e825

Choose a tag to compare

What's Changed

Major changes

  • 클린한 환경에서 o11y 매니저를 처음 실행하였을때 Database에 필요한 정보가 없는 문제를 수정하였습니다.
  • 간혹 Monitoring Target 등록 후 Agent가 최초 실행될 때 모니터링 정보를 수집하지 못하는 현상을 수정하였습니다.
  • InfluxDB 컨테이너의 데이터가 유지되지 않는 현상을 수정하였습니다.
  • M-CMP 환경의 로그 수집을 위해 o11y 매니저의 로그 수집이 가능하도록 하였습니다.
  • o11y 매니저와 VM 로그 조회 API를 별도로 분리 하였습니다.
  • Tumblebug v0.9.22를 적용하였습니다.

Tested CSPs

  • Azure
  • AWS
  • GCP

참고 사항

  • 모니터링 수집 주기는 1분 마다 수집하도록 되어있습니다.
  • Monitoring Target 등록 후 최소 1분이 지난후에 모니터링 데이터가 정상적으로 수집되는지 확인 가능합니다.
  • CSP 기반 모니터링 API는 Azure에 배포된 VM들을 통해서만 가능합니다! (README.md: Check VM's monitoring data from CSP)

Full Changelog: v0.2.3...v0.3.0

v0.2.3

22 Oct 06:53

Choose a tag to compare

What's Changed

Fixed

  • Fix log search issue
  • Fix tumble bug model mismatch issue (Tested with CB-TB v0.9.18)
  • Fix docker compose default configuration issue

Added

  • Add o11y-manager monitoring configuration

How to Use

Alarm & Trriger

Insight Howto

Full Changelog: v0.2.2...v0.2.3

v0.2.2

15 Oct 12:57
d30e7bc

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.2.1...v0.2.2