
Anomaly Detection #398

Merged
JohT merged 19 commits into main from feature/anomaly-detection
Aug 3, 2025

JohT
Owner

@JohT JohT commented Jul 13, 2025

🚀 Features

✨Anomaly Detection

This update adds a powerful new component to the code analysis pipeline: an unsupervised machine learning module that automatically detects anomalies across your codebase—with zero manual tuning.
This pipeline generates over 10 insightful plots and CSV reports per code unit type (Java Artifacts, Java Types, Java Packages, and TypeScript Modules). Visualizations range from **intuitive 2D feature plots** that highlight unusual code patterns to advanced statistical analyses and clustering techniques.
One standout addition: an Isolation Forest model that excels at spotting the “needle in the haystack”—uncovering rare and unexpected feature combinations that could indicate deeper issues or innovation opportunities.
This feature not only flags anomalies—it explains them, helping you understand why certain parts of your code stand out.
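
To illustrate the idea behind Isolation Forest, here is a minimal from-scratch sketch in plain Python. It is not the implementation used in this pull request. The two features per code unit (incoming and outgoing dependencies), the sample values, and all function names are hypothetical. Each tree splits the data at random, and points that end up alone after only a few splits get a score close to 1.

```python
import math
import random

EULER_GAMMA = 0.5772156649


def average_path_length(n):
    """Normalization term c(n): average depth of an unsuccessful search among n points."""
    if n <= 1:
        return 0.0
    if n == 2:
        return 1.0
    return 2.0 * (math.log(n - 1) + EULER_GAMMA) - 2.0 * (n - 1) / n


def build_tree(rows, depth, max_depth, rng):
    """Isolate rows recursively using random axis-aligned splits."""
    if depth >= max_depth or len(rows) <= 1:
        return {"size": len(rows)}
    feature = rng.randrange(len(rows[0]))
    low = min(r[feature] for r in rows)
    high = max(r[feature] for r in rows)
    if low == high:
        return {"size": len(rows)}
    split = rng.uniform(low, high)
    return {
        "feature": feature,
        "split": split,
        "left": build_tree([r for r in rows if r[feature] < split], depth + 1, max_depth, rng),
        "right": build_tree([r for r in rows if r[feature] >= split], depth + 1, max_depth, rng),
    }


def path_length(row, node, depth=0):
    """Depth at which a row reaches a leaf, adjusted for rows left unsplit in that leaf."""
    while "feature" in node:
        node = node["left"] if row[node["feature"]] < node["split"] else node["right"]
        depth += 1
    return depth + average_path_length(node["size"])


def fit_forest(rows, n_trees=100, sample_size=256, seed=0):
    """Build n_trees random trees, each on a subsample of at most sample_size rows."""
    rng = random.Random(seed)
    size = min(sample_size, len(rows))
    max_depth = math.ceil(math.log2(max(size, 2)))
    trees = [build_tree(rng.sample(rows, size), 0, max_depth, rng) for _ in range(n_trees)]
    return trees, size


def anomaly_score(row, trees, size):
    """Score in (0, 1]; shorter average paths give scores closer to 1 (more anomalous)."""
    mean_depth = sum(path_length(row, tree) for tree in trees) / len(trees)
    return 2.0 ** (-mean_depth / average_path_length(size))


# Hypothetical data: (incoming dependencies, outgoing dependencies) per code unit.
data_rng = random.Random(42)
rows = [(data_rng.gauss(10, 2), data_rng.gauss(5, 1)) for _ in range(100)]
rows.append((200.0, 90.0))  # one deliberately unusual code unit

trees, size = fit_forest(rows)
scores = [anomaly_score(row, trees, size) for row in rows]
most_anomalous = max(range(len(rows)), key=scores.__getitem__)
print(most_anomalous, round(scores[most_anomalous], 2))
```

In this sketch the deliberately unusual row is separated from the rest within the first few random splits in nearly every tree, so it gets the shortest average path and the highest score. A production pipeline would more likely use an established library implementation than hand-written trees.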

📓 Example for anomaly detection with Isolation Forest

📓 Example for hyper-parameter tuning of node embeddings

⚙️ Optimization

📖 Documentation

@JohT JohT self-assigned this Jul 13, 2025
@JohT JohT force-pushed the feature/anomaly-detection branch from f5a9ca5 to e6809d8 Compare July 13, 2025 16:44
@JohT JohT changed the base branch from feature/add-community-detection-with-hierarchical-density-based-spation-clustering-hdbscan- to main July 13, 2025 16:45
@JohT JohT force-pushed the feature/anomaly-detection branch 9 times, most recently from 192feeb to c0b983a Compare July 19, 2025 08:30
@JohT JohT force-pushed the feature/anomaly-detection branch 9 times, most recently from 3b02549 to b524736 Compare July 28, 2025 10:49
@JohT JohT force-pushed the feature/anomaly-detection branch 3 times, most recently from 5064e75 to 4e4ec3d Compare July 31, 2025 05:44
@JohT JohT force-pushed the feature/anomaly-detection branch from 4e4ec3d to 80baa98 Compare July 31, 2025 07:01
@JohT JohT marked this pull request as ready for review July 31, 2025 11:03
@JohT JohT force-pushed the feature/anomaly-detection branch 5 times, most recently from 24dac69 to 19262bf Compare August 2, 2025 12:26
@JohT JohT force-pushed the feature/anomaly-detection branch from 19262bf to 246229d Compare August 2, 2025 12:59
@JohT JohT merged commit 3334b9a into main Aug 3, 2025
8 checks passed
@JohT JohT deleted the feature/anomaly-detection branch August 3, 2025 06:20