Releases: SwayamInSync/QBLAS
Releases · SwayamInSync/QBLAS
QuadBLAS v1.0.0 - Initial Release
🎉 QuadBLAS v1.0.0 - Initial Release
QuadBLAS is a high-performance linear algebra library for IEEE 754 quadruple precision (128-bit) floating-point arithmetic, built on top of SLEEF.
✨ Features
- High Performance: Up to 75x speedup over naive implementations
- SIMD Vectorization: x86-64 SSE/AVX and ARM64 NEON support
- Multi-threading: OpenMP parallelization with excellent scaling
- Header-only: Easy integration, no separate compilation required
- Cross-platform: Ubuntu, macOS support with comprehensive CI
📊 Performance Highlights
- Vector Dot Product: 21x speedup (100K elements)
- Matrix-Vector Multiply: 75x speedup (1500×1500), 1.6 GFLOPS
- Matrix-Matrix Multiply: 2.8x speedup with cache blocking
🛠️ Quick Start
Checkout the homepage SwayamInSync/QuadBLAS for information about usage