ML models for RAXE Community Edition L2 threat detection.
| Model | Version | Size | Description |
|---|---|---|---|
threat_classifier_gemma_mlp_v3 |
v0.4.0 | ~235MB | Gemma MLP 5-head classifier (INT8 quantized) |
The latest model features:
- Architecture: EmbeddingGemma-300M + 5-head MLP classifier
- Classification Heads:
- Binary (threat/safe): 2 classes
- Threat Family: 15 classes
- Severity: 3 levels (none/moderate/severe)
- Primary Technique: 35 classes
- Harm Types: 10 classes (multilabel)
- Performance: TPR 91.2%, FPR 6.4%, F1 0.94
- Quantization: INT8 for efficient CPU inference
Models are automatically downloaded on first use:
pip install raxe
raxe scan "test prompt" # Auto-downloads model on first useOr download manually:
raxe models downloadThe model package includes:
model_int8.onnx- EmbeddingGemma-300M embeddings (INT8)classifier_is_threat_int8.onnx- Binary threat classifierclassifier_threat_family_int8.onnx- 15-class family classifierclassifier_severity_int8.onnx- 3-class severity classifierclassifier_primary_technique_int8.onnx- 35-class technique classifierclassifier_harm_types_int8.onnx- 10-class harm multilabel classifiertokenizer.json- Tokenizer configurationmodel_metadata.json- Model metadata and inference pipelinelabel_config.json- Label mappings for all headspooling_config.json- Pooling configurationfeature_scaler.pkl- Feature normalization
- Inference time: ~3-5ms per prompt (CPU)
- Memory footprint: ~400MB loaded
- Model download: ~235MB compressed
- Updated to new classification schema
- Severity: 5 levels → 3 levels (none/moderate/severe)
- Threat Family: 9 classes → 15 classes
- Primary Technique: 22 classes → 35 classes
- Improved accuracy: TPR 91.2%, FPR 6.4%
- Initial Gemma MLP 5-head classifier
- TPR 90.4%, FPR 7.4%
These models are provided under the RAXE Community Edition license. See raxe-ai/raxe-ce for details.