Skip to content

Applied PointLLM to complex scenes. For this a fully automated evaluation loop that relies and strict binary classification and ChatGPT evaluation was implemented.

Notifications You must be signed in to change notification settings

Conscht/PointLLM_Complex_Scenes

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

3 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

PointLLM 3D Evaluation Pipeline

This repository provides the full evaluation pipeline for our project:
"On the Use of Large Language Models for 3D Point Cloud Understanding."

We extend the original PointLLM model beyond single-object understanding to handle complex indoor scenes using data from the ScanNet dataset. Rather than retraining, our approach focuses on automated context generation and large-scale evaluation of the model’s performance under different settings.


Highlights

  • No Model Retraining
    PointLLM is used out-of-the-box without fine-tuning.

  • Multi-Scene Evaluation
    Evaluate on complete ScanNet scenes with rich object combinations.

  • πŸ“‹ Captioning & Classification Tasks
    Includes fully automated evaluation loops for both tasks.

  • πŸ€– LLM-Based Evaluation
    Replaces traditional human-based assessments with ChatGPT evaluation strategies.

  • βœ… Strict Binary Answer Format
    Enforces "Yes"/"No" answers for consistent metric-based evaluation.


πŸ“ Project Structure

β”œβ”€β”€ pointllm/ # Core PointLLM model and conversation logic β”‚ β”œβ”€β”€ model/ # LLM model classes and loading utilities β”‚ β”œβ”€β”€ conversation/ # Prompt templates and dialogue handling β”‚ └── utils/ # Utility functions for setup and decoding β”‚ β”œβ”€β”€ data/ # Dataset-related files β”‚ β”œβ”€β”€ ground_truth.json # Ground-truth annotations for object presence β”‚ β”œβ”€β”€ material_list_updated.json # Object-to-material mappings β”‚ └── context/ # Natural language scene descriptions β”‚ β”œβ”€β”€ evaluation/ # Automated evaluation scripts β”‚ β”œβ”€β”€ evaluate_classification.py # Classification evaluation loop β”‚ β”œβ”€β”€ evaluate_captioning.py # Captioning evaluation loop β”‚ └── analyze_results.py # Accuracy & metric calculation β”‚ β”œβ”€β”€ preprocessing/ # Data transformation and ScanNet processing β”‚ β”œβ”€β”€ process_scannet.py # Converts ScanNet to usable format β”‚ └── generate_context.py # Creates natural language scene context β”‚ β”œβ”€β”€ results/ # Logs and outputs of evaluations β”‚ β”œβ”€β”€ evaluation_log_*.json # Per-scene evaluation outputs β”‚ └── summary_metrics.json # Summary statistics β”‚ β”œβ”€β”€ scripts/ # Optional CLI wrappers for quick runs β”‚ └── run_eval.sh # Shell script to launch evaluation β”‚ β”œβ”€β”€ README.md # Project overview └── requirements.txt # Python dependencies

About

Applied PointLLM to complex scenes. For this a fully automated evaluation loop that relies and strict binary classification and ChatGPT evaluation was implemented.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published