Recursive Abstractive Processing for Tree-Organized Retrieval

This module implements the RAPTOR retreival strategy for the Neuron PHP AI framework.

The Problem with Traditional Retrieval

Most retrieval-augmented models work by breaking down documents into small chunks and retrieving only the most relevant ones. However, this approach has some limitations:

Loss of Context: Retrieving only small, isolated chunks may miss the bigger picture especially for documents with long contexts.
Difficulty in Multi-Step Reasoning: Some questions require information from multiple sections of a document.

Use RAPTOR when:

Users ask open-ended questions that require comprehensive coverage
Your domain involves complex topics where context matters as much as facts
You need to handle queries about themes, trends, or relationships across documents

Stick with traditional RAG when:

Users primarily need quick, specific fact retrieval
Processing speed and token efficiency are critical constraints

Check out the example in the examples folder.

What is Neuron?

Neuron is a PHP framework for creating and orchestrating AI Agents. It allows you to integrate AI entities in your existing PHP applications with a powerful and flexible architecture. We provide tools for the entire agentic application development lifecycle, from LLM interfaces, to data loading, to multi-agent orchestration, to monitoring and debugging. In addition, we provide tutorials and other educational content to help you get started using AI Agents in your projects.

Go to the official documentation

Video Tutorial

Requirements

PHP: ^8.1
Neuron: ^2.0

Install RAPTOR retrieval

Install the latest version of the package:

composer require neuron-core/raptor-retreival

How to use RAPTOR in your agent

Or use the RAPTOR component directly into the agent. RAPTOR needs a vector store, an embedding provider and uses an LLM to perform the summarization:

use NeuronAI\RAG\Retrieval\RetrievalInterface;
use NeuronCore\RaptorRetrieval\RaptorRetrieval;

class WorkoutTipsAgent extends RAG
{
    protected function retrieval(): RetrievalInterface
    {
        return new RaptorRetrieval(
            $this->resolveVectorStore(),
            $this->resolveEmbeddingsProvider(),
            $this->resolveProvider(), // Used for summarization
        );
    }

    protected function embeddings(): EmbeddingsProviderInterface
    {
        return new ...
    }

    protected function vectorStore(): VectorStoreInterface
    {
        return new ...
    }
}

Clustering strategy

RAPTOR algorithm uses a clustering strategy to group the retrieved documents into clusters. Choose based on your content characteristics:

Similarity Clustering (default)

Groups documents with clear thematic boundaries. Best for already well-organized content with distinct topics.

Use Similarity Clustering when:

You have heterogeneous content with clear topic boundaries
Your documents have distinct themes that don't overlap much
Performance is important (faster processing)

use NeuronAI\RAG\Retrieval\RetrievalInterface;
use NeuronCore\RaptorRetrieval\RaptorRetrieval;
use NeuronCore\RaptorRetrieval\Clustering\SimilarityClustering;

class WorkoutTipsAgent extends RAG
{
    protected function retrieval(): RetrievalInterface
    {
        return new RaptorRetrieval(
            $this->resolveVectorStore(),
            $this->resolveEmbeddingsProvider(),
            $this->resolveProvider(), // Used for summarization
            new SimilarityClustering()
        );
    }

    protected function embeddings(): EmbeddingsProviderInterface
    {
        return new ...
    }

    protected function vectorStore(): VectorStoreInterface
    {
        return new ...
    }
}

Gaussian Mixture Clustering

Handles overlapping topics where documents may belong to multiple themes simultaneously. Useful for research papers, news articles, or any content where topics naturally blend together rather than having sharp boundaries.

Use GMM when:

Documents may relate to multiple topics simultaneously
You want the algorithm to discover the "natural" number of clusters in your data
You're dealing with research papers, news, or complex content where multiple topics blend

use NeuronAI\RAG\Retrieval\RetrievalInterface;
use NeuronCore\RaptorRetrieval\RaptorRetrieval;
use NeuronCore\RaptorRetrieval\Clustering\GaussianMixtureClustering;

class WorkoutTipsAgent extends RAG
{
    protected function retrieval(): RetrievalInterface
    {
        return new RaptorRetrieval(
            $this->resolveVectorStore(),
            $this->resolveEmbeddingsProvider(),
            $this->resolveProvider(), // Used for summarization
            new GaussianMixtureClustering()
        );
    }

    protected function embeddings(): EmbeddingsProviderInterface
    {
        return new ...
    }

    protected function vectorStore(): VectorStoreInterface
    {
        return new ...
    }
}

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
docs		docs
examples		examples
src		src
tests		tests
.editorconfig		.editorconfig
.gitattributes		.gitattributes
.gitignore		.gitignore
.php-cs-fixer.dist.php		.php-cs-fixer.dist.php
README.md		README.md
composer.json		composer.json
phpstan.neon		phpstan.neon
phpunit.xml.dist		phpunit.xml.dist
rector.php		rector.php

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Recursive Abstractive Processing for Tree-Organized Retrieval

The Problem with Traditional Retrieval

What is Neuron?

Requirements

Install RAPTOR retrieval

How to use RAPTOR in your agent

Clustering strategy

Similarity Clustering (default)

Gaussian Mixture Clustering

About

Uh oh!

Releases 6

Languages

neuron-core/raptor-retrieval

Folders and files

Latest commit

History

Repository files navigation

Recursive Abstractive Processing for Tree-Organized Retrieval

The Problem with Traditional Retrieval

What is Neuron?

Requirements

Install RAPTOR retrieval

How to use RAPTOR in your agent

Clustering strategy

Similarity Clustering (default)

Gaussian Mixture Clustering

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases 6

Languages