A complete machine learning library built in PHP from the ground up, demonstrating fundamental concepts without external dependencies. This library implements all 12 articles of our comprehensive ML series - from basic linear algebra to production-ready neural networks!
This codebase proves that you can understand and implement complete, working machine learning systems in any language - even PHP! While Python dominates ML, working through the concepts in a familiar language builds deeper comprehension - and these networks really do learn and improve.
| Directory | Articles | Concepts Implemented |
|---|---|---|
| LinearAlgebra/ | Article 2 | Vectors, Matrices, Dot Products, Matrix Operations |
| Algorithms/ | Article 3 | Sorting (5 algorithms), Searching (Linear, Binary) |
| DataProcessing/ | Article 4 | Data Cleaning, Missing Values, Outlier Detection |
| NeuralNetwork/ | Articles 5-7 | Perceptrons, Neural Networks, Loss Functions, Backpropagation |
| Training/ | Articles 8, 10-11 | Gradient Descent, Hyperparameter Tuning, Complete Learning |
| Evaluation/ | Article 9 | Metrics, Data Splitting, Cross-Validation |
| Examples/ | Article 12 | Complete ML Pipeline |
```
ml-php/
├── src/
│   ├── LinearAlgebra/              # Article 2: Mathematical foundations
│   │   ├── Vector.php              # Vector operations and dot products
│   │   └── Matrix.php              # Matrix operations and transformations
│   ├── Algorithms/                 # Article 3: Algorithmic thinking
│   │   ├── Sorting.php             # 5 sorting algorithms with performance analysis
│   │   └── Searching.php           # Linear and Binary search
│   ├── DataProcessing/             # Article 4: The "car wash" for data
│   │   └── DataCleaner.php         # Missing values, outliers, validation
│   ├── NeuralNetwork/              # Articles 5-7: The core ML components
│   │   ├── Perceptron.php          # The "bouncer" - basic learning unit
│   │   ├── NeuralNetwork.php       # The "assembly line" with Backpropagation
│   │   └── LossFunctions.php       # The "teacher's red pen" - performance measurement
│   ├── Training/                   # Articles 8, 10-11: Complete Learning
│   │   ├── GradientDescent.php     # The "mountain hiking" optimizer
│   │   └── HyperparameterTuner.php # Automated parameter optimization
│   ├── Evaluation/                 # Article 9: Performance Measurement
│   │   ├── Metrics.php             # Accuracy, precision, recall, F1, AUC
│   │   └── DataSplitter.php        # Train/test splits, cross-validation
│   └── Examples/                   # Article 12: Full Demonstration
│       └── CompleteExample.php     # Full ML pipeline + production checks
├── comprehensive_demo.php          # COMPLETE 12-ARTICLE DEMONSTRATION
├── test_run.php                    # Quick component testing
├── tests/                          # Unit tests
├── data/                           # Sample datasets
└── README.md                       # This file
```
Run the full demo:

```bash
cd ml-php
php comprehensive_demo.php
```

This runs the complete demonstration showing all 12 articles:
- ✅ Linear algebra operations (Article 2)
- ✅ Sorting and searching algorithms (Article 3)
- ✅ Data cleaning and preprocessing (Article 4)
- ✅ Perceptron learning with logic gates (Article 5)
- ✅ Neural network forward propagation (Article 6)
- ✅ Loss function comparisons (Article 7)
- ✅ Gradient descent training (Article 8)
- ✅ Performance evaluation & metrics (Article 9)
- ✅ Hyperparameter tuning (Article 10)
- ✅ Backpropagation learning (Article 11)
- ✅ Complete ML pipeline (Article 12)
For a quick component test:

```bash
php test_run.php
```

**Linear Algebra (Article 2)** - the mathematical foundation of machine learning:

```php
use MLPyHP\LinearAlgebra\Vector;
use MLPyHP\LinearAlgebra\Matrix;
// Vector operations (like movie recommendations)
$userPrefs = new Vector([4, 2, 5, 1]); // Comedy, Action, Drama, Horror
$movie = new Vector([3, 1, 4, 0]);
$similarity = $userPrefs->dotProduct($movie); // How much you'd like this movie
// Matrix operations (data transformations)
$data = new Matrix([[25, 50000], [35, 75000]]); // Age, Income
$weights = new Matrix([[0.1], [0.001]]); // Feature weights
$scores = $data->multiply($weights); // Customer scores
```
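Vector.php itself isn't reproduced in this README, but the dot product behind that similarity score fits in a few lines. A standalone sketch (an illustrative function, not the actual Vector API):

```php
/**
 * Illustrative dot product: multiply matching components and sum the results.
 * For [4, 2, 5, 1] . [3, 1, 4, 0] that is 4*3 + 2*1 + 5*4 + 1*0 = 34.
 */
function dotProduct(array $a, array $b): float
{
    if (count($a) !== count($b)) {
        throw new InvalidArgumentException('Vectors must have the same length.');
    }
    $sum = 0.0;
    foreach ($a as $i => $value) {
        $sum += $value * $b[$i];
    }
    return $sum;
}

echo dotProduct([4, 2, 5, 1], [3, 1, 4, 0]); // 34
```

The higher the sum, the more the two vectors point in the same direction - which is exactly why it works as a similarity score.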
**Algorithms (Article 3)** - understanding efficiency and trade-offs:

```php
use MLPyHP\Algorithms\Sorting;
use MLPyHP\Algorithms\Searching;
// Compare different sorting approaches
$data = [64, 34, 25, 12, 22, 11, 90];
$bubbleSorted = Sorting::bubbleSort($data); // Simple but slow O(nΒ²)
$quickSorted = Sorting::quickSort($data); // Fast O(n log n)
// Search efficiently
$index = Searching::binarySearch($bubbleSorted, 25); // O(log n) - much faster!
```
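Searching.php is only summarized above; binary search itself is compact enough to sketch standalone (illustrative, not necessarily the library's exact code):

```php
/**
 * Illustrative binary search over a sorted array: halve the search
 * range each step, giving O(log n) lookups. Returns the index of
 * $target, or -1 if it is absent.
 */
function binarySearch(array $sorted, int $target): int
{
    $low = 0;
    $high = count($sorted) - 1;
    while ($low <= $high) {
        $mid = intdiv($low + $high, 2);
        if ($sorted[$mid] === $target) {
            return $mid;
        }
        if ($sorted[$mid] < $target) {
            $low = $mid + 1;  // target is in the upper half
        } else {
            $high = $mid - 1; // target is in the lower half
        }
    }
    return -1;
}

echo binarySearch([11, 12, 22, 25, 34, 64, 90], 25); // 3
```

Note the precondition: the input must already be sorted, which is why the example above searches $bubbleSorted rather than the raw data.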
**Data Processing (Article 4)** - the "car wash" for messy real-world data:

```php
use MLPyHP\DataProcessing\DataCleaner;
// Handle messy real-world data
$messyData = [
['age' => 25, 'income' => 50000],
['age' => null, 'income' => 75000], // Missing age
['age' => 35, 'income' => 'N/A'] // Missing income
];
// Clean it up
$cleaned = DataCleaner::handleMissingValues($messyData, 'mean');
$outliers = DataCleaner::detectOutliers([50000, 75000, 45000, 200000]); // Find the outlier
```
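The README doesn't spell out how detectOutliers decides what counts as "too far out"; one common rule is the z-score test, sketched here as a standalone function (DataCleaner.php may well use a different rule, such as the IQR method):

```php
/**
 * Illustrative z-score outlier detection: flag values more than
 * $threshold standard deviations away from the mean.
 */
function detectOutliersByZScore(array $values, float $threshold = 2.0): array
{
    $n = count($values);
    $mean = array_sum($values) / $n;
    $variance = 0.0;
    foreach ($values as $v) {
        $variance += ($v - $mean) ** 2;
    }
    $stdDev = sqrt($variance / $n);

    $outliers = [];
    foreach ($values as $v) {
        if ($stdDev > 0 && abs($v - $mean) / $stdDev > $threshold) {
            $outliers[] = $v;
        }
    }
    return $outliers;
}

// A tiny sample inflates the standard deviation, so a lower threshold
// is needed here to flag the obvious outlier.
print_r(detectOutliersByZScore([50000, 75000, 45000, 200000], 1.5)); // [200000]
```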
**Perceptron (Article 5)** - the "bouncer at the club", making binary decisions:

```php
use MLPyHP\NeuralNetwork\Perceptron;
// Train a perceptron to learn the AND gate
$perceptron = new Perceptron(2, 0.1, 'step');
$andData = Perceptron::createAndGateData();
$history = $perceptron->train($andData, 100);
// Test it
$result = $perceptron->predict([1, 1]); // Should output 1 (true AND true = true)
```
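Behind train() sits the perceptron learning rule: nudge each weight by learning rate × error × input until the predictions stop being wrong. A self-contained sketch of that loop learning the AND gate (illustrative, not the Perceptron.php internals):

```php
// Weights and bias start at zero; the step activation fires at sum >= 0.
$weights = [0.0, 0.0];
$bias = 0.0;
$lr = 0.1; // learning rate
$andData = [[[0, 0], 0], [[0, 1], 0], [[1, 0], 0], [[1, 1], 1]];

for ($epoch = 0; $epoch < 20; $epoch++) {
    foreach ($andData as [$x, $target]) {
        $sum = $weights[0] * $x[0] + $weights[1] * $x[1] + $bias;
        $prediction = $sum >= 0 ? 1 : 0;
        $error = $target - $prediction; // -1, 0, or +1
        // The perceptron learning rule: w += lr * error * x
        $weights[0] += $lr * $error * $x[0];
        $weights[1] += $lr * $error * $x[1];
        $bias += $lr * $error;
    }
}
// AND is linearly separable, so this converges; only [1, 1] now fires.
```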
**Neural Networks (Article 6)** - the "assembly line", multiple layers working together:

```php
use MLPyHP\NeuralNetwork\NeuralNetwork;
// Create a multi-layer network
$network = new NeuralNetwork([2, 4, 1], 'sigmoid'); // 2 inputs, 4 hidden, 1 output
// Forward propagation - data flows through the assembly line
$output = $network->forwardPropagate([0.5, 0.8]);
// Solve XOR (impossible for single perceptron!)
NeuralNetwork::demonstrateXorSolution();
```
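Each neuron on that assembly line computes a weighted sum plus bias, then squashes it through an activation function. A minimal standalone sketch of one fully connected sigmoid layer (illustrative; NeuralNetwork.php will differ in structure):

```php
/** Sigmoid activation: squashes any real number into (0, 1). */
function sigmoid(float $x): float
{
    return 1.0 / (1.0 + exp(-$x));
}

/**
 * Illustrative forward pass through one dense layer:
 * each output neuron computes sigmoid(w . inputs + bias).
 */
function forwardLayer(array $inputs, array $weights, array $biases): array
{
    $outputs = [];
    foreach ($weights as $n => $neuronWeights) {
        $sum = $biases[$n];
        foreach ($inputs as $i => $value) {
            $sum += $neuronWeights[$i] * $value;
        }
        $outputs[] = sigmoid($sum);
    }
    return $outputs;
}

// Two inputs feeding two hidden neurons (weights chosen arbitrarily).
print_r(forwardLayer([0.5, 0.8], [[0.4, -0.2], [0.3, 0.9]], [0.1, -0.1]));
```

A full network just chains such layers: the outputs of one become the inputs of the next.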
**Loss Functions (Article 7)** - the "teacher's red pen", measuring performance:

```php
use MLPyHP\NeuralNetwork\LossFunctions;
// For regression (predicting numbers)
$predictions = [200000, 250000, 180000];
$actual = [210000, 240000, 190000];
$mse = LossFunctions::meanSquaredError($predictions, $actual);
// For classification (predicting categories)
$probabilities = [0.9, 0.1, 0.8, 0.3];
$labels = [1, 0, 1, 0];
$crossEntropy = LossFunctions::binaryCrossEntropy($probabilities, $labels);
```
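Both losses are short formulas: MSE averages the squared differences, and binary cross-entropy heavily penalizes confident wrong answers. Standalone sketches (LossFunctions.php may add further safeguards):

```php
/** Mean squared error: the average of squared prediction errors. */
function mse(array $predictions, array $actual): float
{
    $sum = 0.0;
    foreach ($predictions as $i => $p) {
        $sum += ($p - $actual[$i]) ** 2;
    }
    return $sum / count($predictions);
}

/** Binary cross-entropy, with clipping to guard against log(0). */
function binaryCrossEntropy(array $probabilities, array $labels): float
{
    $eps = 1e-12;
    $sum = 0.0;
    foreach ($probabilities as $i => $p) {
        $p = max($eps, min(1 - $eps, $p));
        $sum += $labels[$i] * log($p) + (1 - $labels[$i]) * log(1 - $p);
    }
    return -$sum / count($probabilities);
}

echo mse([200000, 250000, 180000], [210000, 240000, 190000]); // 100000000
```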
**Complete Training (Articles 8-12)** - the full pipeline: networks that actually learn!

```php
use MLPyHP\NeuralNetwork\NeuralNetwork;
use MLPyHP\Training\GradientDescent;
use MLPyHP\Training\HyperparameterTuner;
use MLPyHP\Evaluation\DataSplitter;
use MLPyHP\Evaluation\Metrics;
// 1. Prepare data with proper splitting
$inputs = [[0,0], [0,1], [1,0], [1,1]];
$targets = [[0], [1], [1], [0]]; // XOR problem
$split = DataSplitter::trainValidationTestSplit($inputs, $targets, 0.2, 0.2);
$trainingData = $split['train'];           // assumed keys, mirroring $split['test'] below
$validationData = $split['validation'];
// 2. Find optimal hyperparameters
$searchSpace = [
'learning_rate' => [0.01, 0.05, 0.1],
'architecture' => [[2, 4, 1], [2, 8, 1], [2, 8, 4, 1]],
'momentum' => [0.0, 0.5, 0.9]
];
$tuner = new HyperparameterTuner($searchSpace, 'f1_score');
$results = $tuner->randomSearch($trainingData, $validationData, 20);
// 3. Train with best parameters using gradient descent + backpropagation
$network = new NeuralNetwork($results['best_parameters']['architecture']);
$optimizer = new GradientDescent(
$results['best_parameters']['learning_rate'],
'mini-batch',
$results['best_parameters']['momentum']
);
$history = $optimizer->train($network, $trainingData, 1000, 32);
// 4. Comprehensive evaluation
$predictions = [];
foreach ($split['test']['inputs'] as $input) {
$predictions[] = $network->forwardPropagate($input)[0];
}
$report = Metrics::evaluationReport($predictions, $split['test']['targets']);
Metrics::printReport($report);
// Networks that actually learn and improve!
```
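The heart of that optimizer is the momentum update: each weight keeps a running "velocity" so that updates accelerate along consistently downhill directions and damp out oscillation. A sketch of the standard rule (illustrative; GradientDescent.php's exact code may differ):

```php
/**
 * Illustrative gradient descent step with momentum:
 *   velocity = momentum * velocity - learningRate * gradient
 *   weight  += velocity
 */
function momentumStep(array $weights, array $gradients, array &$velocity,
                      float $learningRate, float $momentum): array
{
    foreach ($weights as $i => $w) {
        $velocity[$i] = $momentum * ($velocity[$i] ?? 0.0)
                      - $learningRate * $gradients[$i];
        $weights[$i] = $w + $velocity[$i];
    }
    return $weights;
}

// One step on a toy loss w^2 (gradient 2w), starting from w = 1.0.
$w = [1.0];
$v = [];
$w = momentumStep($w, [2.0 * $w[0]], $v, 0.1, 0.9);
print_r($w); // [0.8] - moved downhill toward the minimum at 0
```

With momentum 0.0 this reduces to plain gradient descent; the 0.9 searched by the tuner above lets steps compound when successive gradients agree.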