This repository provides the full evaluation pipeline for our project:
"On the Use of Large Language Models for 3D Point Cloud Understanding."
We extend the original PointLLM model beyond single-object understanding to handle complex indoor scenes using data from the ScanNet dataset. Rather than retraining, our approach focuses on automated context generation and large-scale evaluation of the modelβs performance under different settings.
-
No Model Retraining
PointLLM is used out-of-the-box without fine-tuning. -
Multi-Scene Evaluation
Evaluate on complete ScanNet scenes with rich object combinations. -
π Captioning & Classification Tasks
Includes fully automated evaluation loops for both tasks. -
π€ LLM-Based Evaluation
Replaces traditional human-based assessments with ChatGPT evaluation strategies. -
β Strict Binary Answer Format
Enforces "Yes"/"No" answers for consistent metric-based evaluation.
βββ pointllm/ # Core PointLLM model and conversation logic β βββ model/ # LLM model classes and loading utilities β βββ conversation/ # Prompt templates and dialogue handling β βββ utils/ # Utility functions for setup and decoding β βββ data/ # Dataset-related files β βββ ground_truth.json # Ground-truth annotations for object presence β βββ material_list_updated.json # Object-to-material mappings β βββ context/ # Natural language scene descriptions β βββ evaluation/ # Automated evaluation scripts β βββ evaluate_classification.py # Classification evaluation loop β βββ evaluate_captioning.py # Captioning evaluation loop β βββ analyze_results.py # Accuracy & metric calculation β βββ preprocessing/ # Data transformation and ScanNet processing β βββ process_scannet.py # Converts ScanNet to usable format β βββ generate_context.py # Creates natural language scene context β βββ results/ # Logs and outputs of evaluations β βββ evaluation_log_*.json # Per-scene evaluation outputs β βββ summary_metrics.json # Summary statistics β βββ scripts/ # Optional CLI wrappers for quick runs β βββ run_eval.sh # Shell script to launch evaluation β βββ README.md # Project overview βββ requirements.txt # Python dependencies