
- Chile
-
18:56
- 3h behind - https://fardust.tralmor.com/
- @gxfaundez
Lists (13)
Sort Name ascending (A-Z)
🏘️ Data Enginnering
The amazing place to store Data pipelines tools.🧪 Data Science
🔧 Deep Learning
General list for deep learning🎨 Front-End
🎮 Game Dev
🌆 MLOps
This list aims to store tools for mlops✏️ NLP
Natural Language Processing Tools❓ Pending
Starred repositories
can we have dict unpacking in python?
📜 Extract meaningful content from the chaos of a web page
🦜⛏️ Did you say you like data?
Barebones URL scraper w/ evaluation dataset
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
To extract main article from given URL with Node.js
WebRover is an autonomous AI agent designed to interpret user input and execute actions by interacting with web elements to accomplish tasks or answer questions. It leverages advanced language mode…
KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge ba…
SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.
Large Concept Models: Language modeling in a sentence representation space
LOTUS: A semantic query engine for fast and easy LLM-powered data processing
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and fully reproducible.
Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI
Adding guardrails to large language models.
A time-series database for high-performance real-time analytics packaged as a Postgres extension
Full toolkit for running an AI agent service built with LangGraph, FastAPI and Streamlit
RAG that intelligently adapts to your use case, data, and queries
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.