Scripts used to detect entities in FutureObs data.
Requires Python 3.12.
Run `docker run -d --runtime nvidia --gpus all -v ollama:/root/.ollama -p 5005:11434 --name ollamaDeepseekLlama70B ollama/ollama` to start the Ollama server before running the DeepSeek scripts. The container's port 11434 is mapped to host port 5005.
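
Once the container is up, a quick way to check that the server answers is to call Ollama's `/api/generate` endpoint on the mapped host port. The following is a minimal sketch using only the standard library; the model tag `deepseek-r1:70b` and the helper names are assumptions made for illustration, not taken from the scripts in this repository, and the model has to be available in the container beforehand.

```python
import json
import urllib.request

# Host port 5005 is mapped to Ollama's default port 11434 by the docker command.
OLLAMA_URL = "http://localhost:5005"
# Assumption: illustrative model tag; replace with the model actually pulled.
MODEL = "deepseek-r1:70b"


def build_generate_request(prompt: str, model: str = MODEL,
                           base_url: str = OLLAMA_URL) -> urllib.request.Request:
    """Build a non-streaming POST request for Ollama's /api/generate endpoint."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        f"{base_url}/api/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str, timeout: float = 300.0) -> str:
    """Send the prompt to the running server and return the generated text."""
    request = build_generate_request(prompt)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))["response"]
```

Calling `generate("Bonjour")` requires the container to be running; `build_generate_request` can be inspected without a server, which is handy for checking the URL and payload.
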
Run `python -m spacy download fr_core_news_lg` before running `triple_extraction.py`.
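
After the download, the French pipeline can be loaded by name and its entities filtered by label. This is a rough sketch of that step, not the code in `triple_extraction.py`; the helper names `load_french_pipeline` and `extract_locations` are hypothetical, and keeping only the `LOC` label is an assumption about what counts as a location.

```python
def load_french_pipeline():
    """Load the large French spaCy pipeline (needs the download step above)."""
    import spacy  # imported lazily so the helper below works without spaCy

    return spacy.load("fr_core_news_lg")


# fr_core_news_lg tags entities as LOC, MISC, ORG or PER.
# Assumption: only LOC is treated as a location here.
LOCATION_LABELS = {"LOC"}


def extract_locations(doc, labels=LOCATION_LABELS):
    """Return (text, label) pairs for entities whose label is in `labels`."""
    return [(ent.text, ent.label_) for ent in doc.ents if ent.label_ in labels]
```

Typical use would be `doc = load_french_pipeline()("Le Rhône traverse Lyon.")` followed by `extract_locations(doc)`; the entities returned depend on the model's predictions.
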
script | description |
---|---|
test_ollama_instructor.py | Test script that deploys the DeepSeek model via Ollama, using the Instructor library. |
ollama_deepseek_extraction_base.py | Deployment of the DeepSeek model with Ollama for tag extraction and description. |
ollama_deepseek_extraction.py | Preprocessing of FutureObs data and deployment of the DeepSeek model with Ollama for tag extraction and description. |
triple_extraction.py | Automatic extraction of location and geographical entities with three NER models (CamemBERT, spaCy, GLiNER). |
triple_extraction_evaluation.py | Automatic extraction of entities and evaluation of the named entity recognition output against ground truth on a sample. |
comparison_models.py | Evaluation of model specificity by calculating the number of unique entities predicted by each model. |
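
The specificity measure described for `comparison_models.py` could be computed along the following lines. This sketch assumes that "unique entities" means entities predicted by one model and by none of the others, which is one possible reading; the function name and the sample predictions are hypothetical and not taken from the repository.

```python
def exclusive_entities(predictions: dict[str, set[str]]) -> dict[str, set[str]]:
    """Map each model to the entities that no other model predicted."""
    exclusive = {}
    for model, entities in predictions.items():
        # Union of everything predicted by the other models.
        others = set().union(
            *(ents for name, ents in predictions.items() if name != model)
        )
        exclusive[model] = entities - others
    return exclusive


# Hypothetical predictions, for illustration only.
sample = {
    "camembert": {"Paris", "Lyon"},
    "spacy": {"Paris", "Seine"},
    "gliner": {"Paris", "Lyon", "Alpes"},
}
counts = {model: len(ents) for model, ents in exclusive_entities(sample).items()}
print(counts)  # {'camembert': 0, 'spacy': 1, 'gliner': 1}
```

If "unique" is instead meant as the number of distinct entities per model, `len(entities)` on each set is enough and the set difference can be dropped.
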