Page Reader Assistant

A Chrome extension that allows you to ask questions about web page content, images, and screenshots using AI. Powered by OpenAI's GPT models.

Features

🤖 Ask questions about any webpage content
📸 Take screenshots and ask questions about them
🖼️ Drag and drop images for visual analysis
🎥 Extract and analyze YouTube and Bilibili video subtitles
💬 Chat-like interface with message history
🌓 Light/Dark theme support
⌨️ Customizable prompt shortcuts
🔄 Real-time streaming responses
📱 Responsive sidebar design
🎨 Support for various image formats (PNG, JPEG, GIF, WebP, SVG)
🧠 Support for multiple AI models and custom API endpoints (including OpenAI, Gemini, and Deepseek)
🔒 Type-safe codebase with TypeScript
🖱️ Web page element selection for precise context
🌍 Multi-lingual UI support (English and Chinese)
📊 Token estimation for API calls
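
The feature list does not say how token estimation is done. As a rough illustration only, the sketch below uses a common rule of thumb: about four characters per token for Latin text and about one token per CJK character. The function name `estimateTokens` and the character ranges are assumptions made for this example, not the extension's actual implementation, and a real tokenizer will give different counts.

```typescript
// Hypothetical helper, not taken from the extension's source.
// Heuristic: ~1 token per CJK character, ~4 other characters per token.
function estimateTokens(text: string): number {
  let cjk = 0;
  let other = 0;
  for (let i = 0; i < text.length; i++) {
    const code = text.charCodeAt(i);
    const isCjk =
      (code >= 0x4e00 && code <= 0x9fff) || // CJK Unified Ideographs
      (code >= 0x3040 && code <= 0x30ff) || // Hiragana and Katakana
      (code >= 0xac00 && code <= 0xd7af); // Hangul syllables
    if (isCjk) {
      cjk++;
    } else {
      other++;
    }
  }
  return cjk + Math.ceil(other / 4);
}
```

An estimate like this is cheap enough to run on every keystroke, which is why a heuristic may be preferable to a full tokenizer inside a content script.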

Installation

Download the latest release package (web-reader.zip) from the Releases page
Extract the zip file to a local directory
Open Chrome and go to chrome://extensions/
Enable "Developer mode" in the top right
Click "Load unpacked" and select the extracted directory
Click the extension icon and set your OpenAI API key in the settings

Usage

Toggle the sidebar using the keyboard shortcut Alt+Shift+K (Windows/Linux) or Option+Shift+K (macOS), or by clicking the "Ask AI" button on any webpage.
Choose your context mode:
- Full Page: Ask about the entire page content
- Selection: Ask about selected text or selected page elements
- Screenshot/Image: Take a screenshot or drop an image to analyze
- YouTube/Bilibili: Extract and analyze video subtitles (on YouTube and Bilibili pages)
Type your question and press Enter or click "Ask Question"
View the AI's response in real-time
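
For readers curious how a real-time response could be assembled, here is a minimal sketch that accumulates text from OpenAI-style streaming chunks (`data: {...}` lines terminated by `data: [DONE]`). The class name `StreamAccumulator` is hypothetical, and the extension's own streaming code may be organized differently.

```typescript
// Hypothetical sketch, not the extension's actual code.
// Collects streamed text from OpenAI-style server-sent event chunks.
class StreamAccumulator {
  private buffer = "";
  text = "";
  done = false;

  // Feed one network chunk; chunks may split a line at any position.
  push(chunk: string): void {
    this.buffer += chunk;
    const lines = this.buffer.split("\n");
    // The last piece may be an incomplete line, so keep it for the next chunk.
    this.buffer = lines.pop() ?? "";
    for (const raw of lines) {
      const line = raw.trim();
      if (line.slice(0, 5) !== "data:") continue;
      const payload = line.slice(5).trim();
      if (payload === "[DONE]") {
        this.done = true;
        continue;
      }
      try {
        const parsed = JSON.parse(payload);
        const delta = parsed?.choices?.[0]?.delta?.content;
        if (typeof delta === "string") this.text += delta;
      } catch {
        // Skip payloads that are not valid JSON.
      }
    }
  }
}
```

A caller would pass each decoded chunk to `push` and re-render the chat message from `text` after every call, stopping once `done` is true.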

Development

Setup

Clone this repository
Install dependencies:
```
npm install
```
Build the extension:
```
npm run build
```
Open Chrome and go to chrome://extensions/
Enable "Developer mode" in the top right
Click "Load unpacked" and select the dist directory

The extension is built with TypeScript for enhanced type safety and a better development experience.

Project Structure

├── src/                    # TypeScript source files
│   ├── components/         # UI and feature components
│   │   ├── chat/           # Chat-related components
│   │   ├── context/        # Context handling (page, selection, screenshot, YouTube)
│   │   └── ui/             # UI components
│   ├── utils/              # Utility functions
│   ├── types/              # TypeScript type definitions
│   │   └── index.d.ts      # Global type definitions
│   ├── background.ts       # Service worker and request handling
│   ├── config.ts           # Configuration
│   ├── main.ts             # Content script entry
│   └── settings.ts         # Settings page logic
├── dist/                   # Compiled JavaScript (generated)
├── lib/                    # Third-party libraries
├── icons/                  # Extension icons
├── settings.html           # Settings page HTML
├── settings.css            # Settings styles
└── sidebar.css             # Sidebar styles

Development Commands

npm install - Install dependencies
npm run build - Build TypeScript files
npm run watch - Watch for changes and rebuild
npm run lint - Run ESLint
npm test - Run tests

Configuration

Set your OpenAI API key in the extension settings
Choose between different GPT models
Customize the theme (Light/Dark)
Configure a custom API endpoint for self-hosted deployments
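
To illustrate how these settings might combine into a request, the sketch below builds a streaming chat request against either the default OpenAI endpoint or a custom base URL. The `AssistantSettings` field names, `buildChatRequest`, and the system prompt wording are assumptions made for this example; the extension's real settings schema is not documented here.

```typescript
// Hypothetical settings shape; field names are illustrative only.
interface AssistantSettings {
  apiKey: string;
  model: string;
  theme: "light" | "dark";
  apiEndpoint?: string; // optional base URL of an OpenAI-compatible server
}

interface ChatRequest {
  url: string;
  headers: Record<string, string>;
  body: string;
}

const DEFAULT_ENDPOINT = "https://api.openai.com/v1";

// Builds the URL, headers, and JSON body for a streaming chat completion.
function buildChatRequest(
  settings: AssistantSettings,
  question: string,
  context: string
): ChatRequest {
  const custom = (settings.apiEndpoint ?? "").trim();
  // Fall back to the default endpoint and drop trailing slashes.
  const base = (custom === "" ? DEFAULT_ENDPOINT : custom).replace(/\/+$/, "");
  return {
    url: `${base}/chat/completions`,
    headers: {
      "Content-Type": "application/json",
      Authorization: `Bearer ${settings.apiKey}`,
    },
    body: JSON.stringify({
      model: settings.model,
      stream: true,
      messages: [
        { role: "system", content: `Answer using this page content:\n${context}` },
        { role: "user", content: question },
      ],
    }),
  };
}
```

The result can be passed to `fetch(request.url, { method: "POST", headers: request.headers, body: request.body })`. Keeping the builder free of browser APIs makes it easy to unit test.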

Credits

Built with assistance from Claude (Anthropic)
Uses OpenAI's GPT models for AI capabilities
Uses the Marked library for Markdown rendering
Icons from various sources

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Development Setup

Fork and clone the repository
Install dependencies: npm install
Make your changes in TypeScript files
Build and test: npm run build
Submit a pull request

License

MIT License

Acknowledgments

Special thanks to:

OpenAI for their powerful API
Claude (Anthropic) for development assistance and code improvements
The Chrome Extensions community
All contributors and users

Support

For issues, questions, or suggestions:

Open an issue in this repository
Check existing issues for solutions
Provide detailed information about your problem
