A collection of Jupyter notebooks showcasing the use of Generative AI models, including Large Language Models (LLMs), Vision-Language Models (VLMs), and Diffusion Models
- BEN2 — segmentation, HR-seg, DIS
- CLIP — contrastive, image-features, text-features
- DepthAnything — depth-estimation
- DepthPro — depth-estimation
- DiffDIS — segmentation, HR-seg, DIS
- DPT — depth-estimation
- OVSeg — segmentation, oopen-vocab segmentation
- OWL-v2 — detection, ovd
- OWL-ViT — detection, ovd
- SAM — segmentation
- SAM-2 — segmentation
- SAM-HQ — segmentation
- SAMRefiner — segmentation, seg-refinement
- SmolVLM — VLM, VQA
- VGGT — 3D
3D
contrastive
depth-estimation
detection
DIS
HR-seg
image-features
oopen-vocab segmentation
ovd
seg-refinement
segmentation
text-features
VLM
VQA