Skip to content

Raycast extension to use GPT, Claude, Llama, and more... all for FREE! + Full support for custom APIs.

License

Notifications You must be signed in to change notification settings

XInTheDark/raycast-g4f

Repository files navigation

Raycast-G4F (GPT4Free)

Use the powerful GPT-4, Llama-3 and more AI models on Raycast, for FREE! + Full support for custom APIs.

Homepage · Privacy Policy · FAQs · Discord

Screenshots

1 2 3 4

Getting Started

"If you like the extension, please consider giving it a ✨star✨ tysm!" - the developer, probably

Prerequisites

  • Install the Raycast app.
    • Currently, Raycast is only available for macOS. Windows support is in progress.
  • Install Node.js.
    • The minimum version recommended is v20.18.1. If you're using an older version, you may encounter issues.

Installation

This extension is currently not available on the Raycast Extension store, but installation from source is extremely simple.

  1. Download the source code from the latest release, or clone the repository.
  2. Navigate to the directory, and open a Terminal window at the downloaded folder.
  3. Run npm ci --production to install required dependencies.
  4. (Optional) Run pip3 install -r requirements.txt to install Python dependencies. These are required for some features, e.g. web search.
  5. Run npm run dev to build and import the extension.

The extension, and its full set of commands, should then show up in your Raycast app.

Troubleshooting

Please open an issue if any unexpected problems occur during installation.

Updating

Automatically

There is built-in support for updating within the extension itself! Simply run the "Check for Updates" command in the extension, and it will take care of the update process for you. Furthermore, the "Automatically Check for Updates" feature is available in the preferences (enabled by default).

Manually

In the command line, run git pull, npm ci --production and npm run dev (in that order).

You might want to update manually if the automatic update doesn't work (please also open a GitHub issue if this is the case); updating manually also allows you to fetch and view the latest changes to the source code.

Summary

  • ▶️️ Streaming support - see messages load in real-time, providing a seamless experience.
  • ⚡ Ask anything from anywhere - with 18 commands available, there's something for you no matter what you need.
  • 💪 Support for many providers & models (more info below!)
  • 💬 Chat command - interact with the AI in a conversation, and your chat history will be stored in the extension.
  • 🌐 Web search - let GPT search the web for the latest information.
  • 📄 File upload - you can upload image, video, audio and text files to the AI. (only available for a few providers, more to come!)
  • 🎨 Image generation capabilities - imagine anything, and make it reality with state-of-the-art models.
  • ✏️ Custom AI Commands - create your own commands with custom prompts!

Providers & Models

Provider Model Features Status Speed Rating and remarks by extension author
Nexra gpt-4o (default) ▶️ Active Very fast 8.5/10, the best performing model.
Nexra gpt-4-32k Active Medium 6.5/10, no streaming support but otherwise a great model.
Nexra chatgpt ▶️ Unknown Very fast 7.5/10
Nexra Bing ▶️ Active Medium 8/10, GPT-4 based with web search capabilities.
Nexra llama-3.1 ▶️ Active Fast 7/10
Nexra gemini-1.0-pro ▶️ Active Fast 6.5/10
DeepInfra meta-llama-3.3-70b ▶️ Active Fast 8.5/10, recent model with large context size.
DeepInfra meta-llama-3.2-90b-vision ▶️ 📄¹ Active Fast 8/10, recent model with vision capabilities.
DeepInfra meta-llama-3.2-11b-vision ▶️ 📄¹ Active Very fast 7.5/10
DeepInfra meta-llama-3.1-405b ▶️ Inactive Medium 8.5/10, state-of-the-art open model, suitable for complex tasks.
DeepInfra meta-llama-3.1-70b ▶️ Active Fast 8/10
DeepInfra meta-llama-3.1-8b ▶️ Active Very fast 7.5/10
DeepInfra llama-3.1-nemotron-70b ▶️ Active Fast 8/10
DeepInfra WizardLM-2-8x22B ▶️ Active Medium 7/10
DeepInfra DeepSeek-V2.5 ▶️ Active Fast 7.5/10
DeepInfra Qwen2.5-72B ▶️ Active Medium 7.5/10
DeepInfra Qwen2.5-Coder-32B ▶️ Active Fast 7/10
DeepInfra QwQ-32B-Preview ▶️ Active Very fast 7.5/10
Blackbox custom model ▶️ Active Fast 7.5/10, very fast generation with built-in web search ability, but is optimized for coding.
Blackbox llama-3.1-405b ▶️ Active Fast 8.5/10
Blackbox llama-3.1-70b ▶️ Active Very fast 8/10
Blackbox gemini-1.5-flash ▶️ Active Extremely fast 7.5/10
Blackbox qwq-32b-preview ▶️ Active Extremely fast 6.5/10
Blackbox gpt-4o ▶️ Active Very fast 7.5/10
Blackbox claude-3.5-sonnet ▶️ Active Fast 8.5/10
Blackbox gemini-pro ▶️ Active Fast 8/10
DuckDuckGo gpt-4o-mini ▶️ Active Extremely fast 8/10, authentic GPT-4o-mini model with strong privacy.
DuckDuckGo claude-3-haiku ▶️ Active Extremely fast 7/10
DuckDuckGo meta-llama-3.1-70b ▶️ Active Very fast 7.5/10
DuckDuckGo mixtral-8x7b ▶️ Active Extremely fast 7.5/10
BestIM gpt-4o-mini ▶️ Inactive Extremely fast 8.5/10
Rocks claude-3.5-sonnet ▶️ Active Fast 8.5/10
Rocks claude-3-opus ▶️ Active Fast 8/10
Rocks gpt-4o ▶️ Active Fast 7.5/10
Rocks gpt-4 ▶️ Active Fast 7.5/10
Rocks llama-3.1-405b ▶️ Active Fast 7.5/10
Rocks llama-3.1-70b ▶️ Active Very fast 7/10
ChatgptFree gpt-4o-mini ▶️ Active Extremely fast 8.5/10
AI4Chat gpt-4 Active Very fast 7.5/10
DarkAI gpt-4o ▶️ Active Very fast 8/10
Mhystical gpt-4-32k Active Very fast 6.5/10
PizzaGPT gpt-4o-mini Active Extremely fast 7.5/10
Meta AI meta-llama-3.1 ▶️ Active Medium 7/10, recent model with internet access.
Replicate mixtral-8x7b ▶️ Active Medium ?/10
Replicate meta-llama-3.1-405b ▶️ Active Medium ?/10
Replicate meta-llama-3-70b ▶️ Active Medium ?/10
Replicate meta-llama-3-8b ▶️ Active Fast ?/10
Phind Phind Instant ▶️ Active Extremely fast 8/10
Google Gemini auto (gemini-1.5-pro, gemini-1.5-flash) ▶️ 📄 Active Very fast 9/10, very good overall model but requires an API Key. (It's free, see the section below)
Google Gemini (Experimental) auto (changes frequently) ▶️ 📄 Active Very fast -
Google Gemini (Thinking) auto (changes frequently) ▶️ 📄 Active Very fast -
Custom OpenAI-compatible API - ▶️ Active - allows you to use any custom OpenAI-compatible API. read more

▶️ - Supports streaming.

📄 - Supports file upload. Note: By default, all providers support basic file upload functionality for text-based files, like .txt, .md, etc.

¹: Supports images only.

Provider-specific notes

  • Google Gemini: An API Key is required to use this model. You can get one completely for free:
  1. Go to https://aistudio.google.com/app/apikey
  2. Sign in to your Google account if you haven't done so.
  3. Click on "Create API Key" and follow the instructions there.
  4. Copy the API Key and paste it into the corresponding box in the extension preferences.

The rate limit for Google Gemini is 1500 requests per day (as of the time of writing). This should be much more than enough for any normal usage. If your use case needs an increased rate limit, you can even create multiple API Keys with different Google accounts; separate them with commas in the preferences.

  • Google Gemini: This provider supports File upload functionality, as well as the Ask About Screen Content command! To upload a file in AI Chat, press Command-Enter or select "Compose Message" from the actions. Then, simply click on the upload button to get started.

Features

Note

As of v5.0, the extension preferences are now found in the "Preferences" command. Please use this command to access the preferences instead of the Raycast preferences.

Web Search

Let GPT decide to search the web for information if it does not have enough knowledge or context. Uses DuckDuckGo search, fast and free.

Usage

Enabling web search is fast and easy. Go to the extension preferences, and the "Web Search" option will be available. There are 4 options:

  • Disabled (default)
  • Automatic: Enable Web Search only in AI Chat. GPT will automatically decide when to use it.
  • Balanced: Use Web Search in every query for AI commands¹, and automatically in AI Chat. This is basically an extension of the "Automatic" option.
  • Always: Always use Web Search for every query, both in AI Chat and in commands¹.

¹: Commands that support Web Search are: Ask AI, Ask About Selected Text, Explain. Other commands will not use Web Search.

Web Search is also available in the following commands:

  • Custom AI Commands: You can enable Web Search for each command individually.
  • AI Chat: You can enable Web Search for each chat individually.
  • AI Presets: You can enable Web Search for each preset individually.

Smart Chat Naming

Let GPT automatically come up with a name for the current chat session after you send the first message. For example, this is similar to what the ChatGPT web UI does.

Automatically Check for Updates

Let the extension automatically check for updates every day. If a new version is available, you will be notified, along with the option to update the extension with a single click.

Persistent Storage

Enable more persistent storage of the extension's data, like AI Chat data or Custom Commands. This will back up a copy of this data to files on your computer. Useful for saving large amounts of data. Note: With this option off, your data is already well preserved. Do not enable this if you have sensitive data.

Enable Cursor Icon

Show a cursor icon when the response is loading - cosmetic option only.

Code Interpreter (BETA)

Allows GPT to execute Python code locally. The model has been instructed to strictly only produce safe code, but use at your own risk!

Only models with function calling capabilities support this feature. Currently, this includes only selected DeepInfra models.


FAQs

Why is this extension not on the official Raycast store?

  • By default, this extension comes with a number of third-party providers which you can use for free. Raycast doesn't like that because it's not possible to entirely verify the quality of these providers. (I do provide all the privacy details here, though, and the extension is open-source.) ...and also because they're selling their own AI.
  • Also, for me as a developer, publishing updates to the Raycast store is too slow and troublesome; I have to submit a PR, wait a week for it to be reviewed, and possibly have it rejected. This is not a good experience for me or for users.
  • Thus, the extension will have to be installed from source. Regarding this, I apologize as it's indeed more complicated than downloading it from the store. I have tried my best to make the installation process quick and streamlined - please do provide feedback on whether it was simple enough!

How does this extension compare to the paid Raycast AI? Why should I use this instead?

I’ve noticed many users subscribing to Raycast AI, which is quite expensive, and then only using it for a few casual chats a day. Honestly, that’s unnecessary. Here’s my honest suggestion: Everyone should first try out raycast-g4f. Here’s why:

  • Freedom. You get to choose whatever provider/API to use, and can even add multiple at once. OpenAI, Anthropic, Google… basically any API out there. (“Can I use my OpenAI API Key in Raycast AI?” No, you can’t.) That also means your AI access isn’t just locked inside Raycast. (“Can I use my Raycast AI in other apps?” No, you can’t.)
  • Price. This extension is completely free to use if you're sticking with the built-in providers. And if you want to use your own API, you pay only for what you use. That’s cheaper than raycast AI. There is no fixed price - you can choose whatever provider you want. You can even run models locally and connect them to the extension.
  • Privacy. The extension is open source and it respects your privacy. Everything is stored only on your device, and you choose what data you send.

Why is the UI worse than Raycast AI?

  • Because the developer API that Raycast provides is limited, it's not possible to replicate the Raycast AI interface exactly. The components I can use are very simple, and a lot of the features in built-in Raycast commands are not available to extension developers.
  • However, I've spent a lot of effort trying to make the UI really intuitive, and I'm always open to feedback on how to improve it!

The extension isn't working! (e.g. I sent a message but there is no response)

  • Sometimes third party providers can be slow or unresponsive. If you're experiencing this issue, please try again in a few minutes; or if the problem persists, please try switching to another provider.
  • If you've tried various providers and the issue still persists, please open an issue on GitHub!

How do I contribute to the extension?

  • I welcome all contributions! If you have an idea for a new feature, or if you've found a bug, please open an issue.
  • If you'd like to contribute code, please open a pull request, and I'll make sure to review it as soon as possible.

Star History

Star History Chart

License & Acknowledgements

License: GPLv3. Full license is found in LICENSE.txt.

The code base is derived from Raycast Gemini by Evan Zhou.

Third-party libraries used for generation:

(Both packages are maintained by the extension author.)

Some of the code in this repository was inspired or ported from the original gpt4free project (written in Python).