🤖 AI
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
OCR, layout analysis, reading order, table recognition in 90+ languages
The most reliable AI agent framework that supports MCP.
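Deploy YOLOv8 with OpenCV to detect faces and facial keypoints and to assess face quality. Includes both C++ and Python versions; the programs depend only on the OpenCV library, with no dependency on any deep learning framework.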
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.
We write your reusable computer vision tools. 💜
Real-time face swap and one-click video deepfake with only a single image
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photo tool. A lightweight algorithm for making AI ID photos.
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), Knowledge Base (file upload / knowledge managemen…
AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording
Agno is a lightweight library for building Multimodal Agents. It exposes LLMs as a unified API and gives them superpowers like memory, knowledge, tools and reasoning.
AIHawk aims to ease the job hunt by automating the job application process. Using artificial intelligence, it enables users to apply for multiple jobs in a tailored way.
Get your documents ready for gen AI
Best Practices on Recommendation Systems
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source …
Official Code for DragGAN (SIGGRAPH 2023)
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Sky-T1: Train your own O1 preview model within $450
Robust Speech Recognition via Large-Scale Weak Supervision
A CLI tool to convert your codebase into a single LLM prompt with source tree, prompt templating, and token counting.
🆙 Upscayl - #1 Free and Open Source AI Image Upscaler for Linux, macOS and Windows.
Fully open reproduction of DeepSeek-R1
A simple open-source chat app that uses Exa's API for web search and Deepseek R1 for reasoning
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
Finetune Llama 3.3, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥