GitHub - Baharcakir/SemanticBookRecommender: Semantic book recommender

# Semantic Book Recommender

A small project that builds a semantic book recommender using OpenAI embeddings, LangChain for retrieval, pandas and numpy for data analysis, and a Gradio dashboard for interactive use. The pipeline ingests a Kaggle books dataset, preprocesses it, builds vector embeddings, stores them in a vector store, and exposes a conversational, search-style recommender through a Gradio UI.

## Project overview

This repo demonstrates how to build a content-based, semantic book recommender. Instead of relying only on metadata or co-purchases, it uses natural-language embeddings of book descriptions or other text to find semantically similar books. You can enter a natural-language query (e.g., "a book about romance") and get recommended books that match the meaning of that request.

## Features

- Ingest a Kaggle book dataset (metadata, descriptions)
- Preprocess (clean, filter, chunk long descriptions)
- Build embeddings for items with OpenAI embeddings
- Store embeddings in a vector store (Chroma)
- HuggingFace transformers (all-MiniLM-L6-v2)
- Use LangChain for similarity search
- Lightweight analytics with pandas/numpy (top genres, tone distributions)
- Interactive Gradio UI for query → recommendations
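The embed, index, and query flow described above can be sketched with the standard library alone. This is an illustration under loud assumptions, not the project's code: plain word-count vectors stand in for the OpenAI or all-MiniLM-L6-v2 embeddings, a Python dict stands in for the Chroma vector store, and the three book descriptions, the `STOPWORDS` set, and every function name are hypothetical. Word counts only match exact words, so this shows the retrieval mechanics (cosine similarity over stored vectors), not real semantic matching.

```python
import math
from collections import Counter

# Hypothetical stopword list so that filler words do not dominate the score.
STOPWORDS = {"a", "about", "and", "in", "on", "with", "the", "of"}


def tokenize(text):
    """Lowercase, split on whitespace, and drop stopwords."""
    return [w for w in text.lower().split() if w not in STOPWORDS]


def build_vocab(descriptions):
    """Map every word seen in the corpus to a vector position."""
    words = sorted({w for d in descriptions for w in tokenize(d)})
    return {w: i for i, w in enumerate(words)}


def embed(text, vocab):
    """Stand-in for an embedding model: unit-length word-count vector."""
    vec = [0.0] * len(vocab)
    for word, count in Counter(tokenize(text)).items():
        if word in vocab:  # words outside the corpus vocabulary are ignored
            vec[vocab[word]] = float(count)
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm else vec


def recommend(query, index, vocab, k=1):
    """Return the k titles whose vectors are most similar to the query."""
    q = embed(query, vocab)
    # Vectors are unit length, so the dot product equals cosine similarity.
    scored = sorted(
        ((sum(x * y for x, y in zip(q, vec)), title) for title, vec in index.items()),
        reverse=True,
    )
    return [title for _, title in scored[:k]]


# Toy corpus (made-up descriptions, not taken from the Kaggle dataset).
books = {
    "Pride and Prejudice": "a romance about love and marriage in england",
    "Dune": "a science fiction epic about politics on a desert planet",
    "The Hobbit": "a fantasy adventure with a dragon and a wizard",
}

vocab = build_vocab(books.values())
# The "vector store": one precomputed vector per book.
index = {title: embed(desc, vocab) for title, desc in books.items()}

print(recommend("a book about romance", index, vocab))  # ['Pride and Prejudice']
```

In the real pipeline the `embed` step would call an embedding model and the `index` lookup would be a similarity search against the vector store, which is what lets a query match books that share meaning rather than exact words.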

Create a virtual environment and install the dependencies:

    python -m venv venv
    source venv/bin/activate
    pip install -r requirements.txt

## Environment variables

Create a .env file or set these in your environment:

- OPENAI_API_KEY: your OpenAI API key
- HUGGINGFACEHUB_API_TOKEN: your Hugging Face API token

If you use a .env file, load it in your scripts with python-dotenv. Example .env contents:

    OPENAI_API_KEY=sk-xxxx
    HUGGINGFACEHUB_API_TOKEN=your_huggingface_token
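A minimal sketch of loading these settings at the top of a script, assuming python-dotenv is installed. The helper name `load_api_keys` is hypothetical and not part of the repository. `load_dotenv()` reads a `.env` file if it finds one and, by default, does not override variables already set in the environment. The import is guarded so the check still works when the variables are exported directly in the shell.

```python
import os

try:
    from dotenv import load_dotenv  # provided by the python-dotenv package
except ImportError:
    load_dotenv = None  # fall back to variables already set in the environment


def load_api_keys(required=("OPENAI_API_KEY",)):
    """Load .env if possible, then fail early when a required key is missing."""
    if load_dotenv is not None:
        load_dotenv()
    missing = [name for name in required if not os.environ.get(name)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return {name: os.environ[name] for name in required}
```

Calling `load_api_keys()` once before any embedding or retrieval code surfaces a missing key as a clear error instead of a failed API request later on.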

## Repository files

- .gitignore
- README.md
- books_cleaned.csv
- books_with_categories.csv
- books_with_emotions.csv
- cover-not-found.jpg
- data-exploration.ipynb
- gradio-dashboard.py
- requirements.txt
- sentiment-analysis.ipynb
- tagged_description.txt
- text-classification.ipynb
- vector-search.ipynb
