
Releases: SwayamInSync/QBLAS

QuadBLAS v1.0.0 - Initial Release

09 Jul 10:27
6858d56



QuadBLAS is a high-performance linear algebra library for IEEE 754 quadruple precision (128-bit) floating-point arithmetic, built on top of SLEEF.

✨ Features

  • High Performance: Up to 75x speedup over naive implementations
  • SIMD Vectorization: x86-64 SSE/AVX and ARM64 NEON support
  • Multi-threading: OpenMP parallelization with excellent scaling
  • Header-only: Easy integration, no separate compilation required
  • Cross-platform: Ubuntu and macOS support, with comprehensive CI

📊 Performance Highlights

  • Vector Dot Product: 21x speedup (100K elements)
  • Matrix-Vector Multiply: 75x speedup (1500×1500), 1.6 GFLOPS
  • Matrix-Matrix Multiply: 2.8x speedup with cache blocking
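
To illustrate what "cache blocking" means in the matrix-matrix case, here is a minimal sketch of the general technique. It is not QuadBLAS's actual kernel or API: the function names `matmul_naive` and `matmul_blocked`, the row-major square layout, and the default tile size of 32 are all assumptions made for this example. The scalar type `T` is a template parameter, so a quadruple-precision type could be substituted wherever it supports `+=`, `*`, and construction from `0`.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical baseline: textbook triple loop, C = A * B.
// All matrices are row-major, n x n, stored in vectors of size n * n.
template <typename T>
void matmul_naive(const std::vector<T>& A, const std::vector<T>& B,
                  std::vector<T>& C, std::size_t n) {
    for (std::size_t i = 0; i < n; ++i) {
        for (std::size_t j = 0; j < n; ++j) {
            T sum = T(0);
            for (std::size_t k = 0; k < n; ++k) {
                sum += A[i * n + k] * B[k * n + j];  // strided access to B
            }
            C[i * n + j] = sum;
        }
    }
}

// Hypothetical cache-blocked variant: the loops are tiled so that a
// block x block tile of A, B, and C is reused while it is still in cache.
// `block` must be greater than zero; C must already have size n * n.
template <typename T>
void matmul_blocked(const std::vector<T>& A, const std::vector<T>& B,
                    std::vector<T>& C, std::size_t n, std::size_t block = 32) {
    std::fill(C.begin(), C.end(), T(0));
    for (std::size_t ii = 0; ii < n; ii += block) {
        const std::size_t i_end = std::min(ii + block, n);
        for (std::size_t kk = 0; kk < n; kk += block) {
            const std::size_t k_end = std::min(kk + block, n);
            for (std::size_t jj = 0; jj < n; jj += block) {
                const std::size_t j_end = std::min(jj + block, n);
                // Multiply one tile of A by one tile of B, accumulating into C.
                for (std::size_t i = ii; i < i_end; ++i) {
                    for (std::size_t k = kk; k < k_end; ++k) {
                        const T a = A[i * n + k];
                        for (std::size_t j = jj; j < j_end; ++j) {
                            C[i * n + j] += a * B[k * n + j];  // contiguous in j
                        }
                    }
                }
            }
        }
    }
}
```

Both functions compute the same product; the blocked version only reorders the work. Because floating-point addition is not associative, results for non-integer inputs may differ in the last bits. The best tile size depends on the element size (16 bytes for a 128-bit type) and on the cache sizes of the target CPU, so 32 is only a starting point.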

🛠️ Quick Start

Check out the SwayamInSync/QuadBLAS homepage for usage information.