Anna's Archive API

A powerful, production-ready REST API for searching and retrieving book information from Anna's Archive. Features bot-detection bypass via stealth Playwright, intelligent domain rotation across all mirrors, and a multi-layer caching system.

Features

🤖 Stealth Scraping — Playwright + stealth plugin bypasses Cloudflare & bot detection
🔄 Domain Rotation — Automatically rotates across all known Anna's Archive mirrors, falling back gracefully when one is down
⚡ Smart Caching — Multi-TTL in-memory cache: search results (5 min), book details (1 hour)
📄 Pagination — Full pagination support on search results
🏊 Browser Pool — Reusable browser instances for high performance
🛡️ Rate Limiting — Built-in per-IP rate limiting
📊 Health Endpoint — Domain health & cache statistics

Setup

# Install dependencies
npm install

# Install Playwright's Chromium browser
npm run install:browsers

# Start in development mode
npm run dev

# Start in production
npm start

API Endpoints

`GET /api/search`

Search for books by any query (title, author, ISBN, DOI, MD5, etc.)

Query Parameters:

Param	Type	Default	Description
`q`	string	required	Search query
`page`	number	1	Page number
`lang`	string	—	Language filter (e.g. `en`)
`ext`	string	—	File extension filter (e.g. `pdf`, `epub`)
`sort`	string	—	Sort order (`newest`, `oldest`, `largest`, `smallest`)
`content`	string	—	Content type filter

Response:

{
  "success": true,
  "query": "python programming",
  "page": 1,
  "results": [...],
  "cached": false,
  "domain": "annas-archive.gl",
  "responseTime": 1234
}

`GET /api/book/:md5`

Get full details for a specific book by its MD5 hash.

Response:

{
  "success": true,
  "md5": "d64efd386ed7227592499460aca2044b",
  "book": {
    "title": "Data Science Essentials in Python",
    "author": "Dmitry Zinoviev",
    "publisher": "Pragmatic Bookshelf",
    "year": "2016",
    "language": "en",
    "filesize": 6432380,
    "extension": "pdf",
    "isbn": ["9781680501841", "1680501844"],
    "description": "...",
    "cover": "https://...",
    "md5": "d64efd386ed7227592499460aca2044b",
    "downloadLinks": {
      "fast": [...],
      "slow": [...],
      "external": [...]
    },
    "metadata": {...}
  },
  "cached": true,
  "responseTime": 45
}

`GET /health`

Returns API health, domain status, cache stats, and browser pool status.

`DELETE /api/cache`

Clears the entire cache. Useful for forced refresh.

Mirrors / Domain Rotation

The API will automatically try these domains in order:

annas-archive.gl
annas-archive.org
annas-archive.se
annas-archive.gs
annas-archive.gd
annas-archive.pk

If a domain is unreachable, it is temporarily blacklisted and the next one is tried.

Environment Variables

See .env for all configuration options. Key settings include:

PORT: Server port (default: 3000)
CACHE_TYPE: memory (default) or redis
REDIS_URL: Redis connection string (e.g. redis://localhost:6379)
REDIS_PREFIX: Prefix for keys in Redis (default: annas-api:)
CACHE_TTL_SEARCH: TTL for search results in seconds (default: 300)
CACHE_TTL_BOOK: TTL for book details in seconds (default: 3600)

Caching Options

The API supports two caching engines:

1. In-Memory (NodeCache)

Default option. Best for single-instance deployments.

Set CACHE_TYPE=memory
Automatic cleanup of expired keys
Lightning-fast retrieval

2. Redis

Best for multi-instance deployments or persistent caching.

Set CACHE_TYPE=redis
Configure via REDIS_URL
Shared cache across multiple API nodes
Survives application restarts

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
dev.sqlite		dev.sqlite
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Anna's Archive API

Features

Setup

API Endpoints

`GET /api/search`

`GET /api/book/:md5`

`GET /health`

`DELETE /api/cache`

Mirrors / Domain Rotation

Environment Variables

Caching Options

1. In-Memory (NodeCache)

2. Redis

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Anna's Archive API

Features

Setup

API Endpoints

GET /api/search

GET /api/book/:md5

GET /health

DELETE /api/cache

Mirrors / Domain Rotation

Environment Variables

Caching Options

1. In-Memory (NodeCache)

2. Redis

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`GET /api/search`

`GET /api/book/:md5`

`GET /health`

`DELETE /api/cache`

Packages