PROJECT MODULE: EXPLAINABLE AI TASK 3

Repository for Project 3 of the course Project Module: Models that Explain Themselves, organized at the University of Potsdam in Summer Semester 2023. Textual and visual feature extraction.

Overview

Welcome to the repository for the Explainable AI course, taught as a Project Module in the Cognitive Systems master's program at the University of Potsdam! Artificial Intelligence has witnessed remarkable progress in recent years, permeating many aspects of our lives. As AI systems become more sophisticated, understanding the reasoning behind their decisions has become increasingly crucial. The need for transparency, fairness, and accountability in AI systems has led to the emergence of Explainable AI.

In this course, our primary objective is to equip students with the knowledge and skills required to design, implement, and evaluate AI models that are not only accurate but also interpretable. We delve into various domains, including language, vision, and multimodal tasks, to explore different approaches and techniques for achieving explainability in AI systems. The course is divided into tasks, each focusing on a specific aspect of Explainable AI:

Task: Textual and Visual Feature Extraction

For this task, the goal is to implement textual and visual feature extraction for the FLICKR30K Dataset.

Dataset

The FLICKR30K dataset is modified to create a binary classification task. The dataset initially consists of 30k images and their respective captions; each image has 5 captions, ranging from brief to extensive descriptions. The dataset is modified by assigning images random captions, producing image-caption pairs that may or may not match. Each random caption is chosen by calculating a similarity score, ensuring that the randomly assigned caption does not match the image.
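To make the idea concrete, here is a minimal sketch of such a similarity-based mismatch check, assuming sentence embeddings from the sentence-transformers library stand in for the similarity measure; the function name, threshold, and candidate count are illustrative, not taken from this repo:

```python
import random

from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")

def assign_mismatched_caption(own_captions, caption_pool, threshold=0.4, seed=0):
    """Pick a random caption from the pool whose similarity to the image's
    own captions stays below `threshold`, so the pair is a true mismatch."""
    rng = random.Random(seed)
    own_emb = model.encode(own_captions, convert_to_tensor=True)
    # Sample a handful of candidates instead of scoring the whole pool
    for cand in rng.sample(caption_pool, k=min(50, len(caption_pool))):
        cand_emb = model.encode(cand, convert_to_tensor=True)
        # Compare against all reference captions; keep only clear mismatches
        if util.cos_sim(cand_emb, own_emb).max().item() < threshold:
            return cand
    return None  # no sufficiently dissimilar caption found in the sample
```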

Model

Visual features are extracted with CLIP; textual features are extracted in the form of spaCy linguistic features.
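For orientation, a minimal sketch of the two extractors, assuming the openai/clip-vit-base-patch32 checkpoint from Hugging Face transformers and spaCy's en_core_web_sm model; the actual implementations live in img_features.py and text_features.py, and the particular linguistic features below are illustrative:

```python
import spacy
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

clip = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")
nlp = spacy.load("en_core_web_sm")  # requires: python -m spacy download en_core_web_sm

def visual_features(image_path: str) -> torch.Tensor:
    """Encode an image into CLIP's embedding space."""
    inputs = processor(images=Image.open(image_path), return_tensors="pt")
    with torch.no_grad():
        return clip.get_image_features(**inputs).squeeze(0)

def textual_features(caption: str) -> dict:
    """A few example spaCy linguistic features for one caption."""
    doc = nlp(caption)
    return {
        "n_tokens": len(doc),
        "n_nouns": sum(t.pos_ == "NOUN" for t in doc),
        "n_verbs": sum(t.pos_ == "VERB" for t in doc),
        "n_entities": len(doc.ents),
    }
```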

Usage

There should be a data folder in the root directory containing the images from the dataset.

  1. After cloning the repository, the directory tree should look like this:

```
---data.txt
---get_features.py
---img_features.py
---labled_instances.jsonl
---saves.py
---text_features.py
---token_frequencies.txt
```

  2. Visual features are stored in the dataframe 'image_data.csv'. Generate visual features by running:

```
python3 get_features.py --visual
```

  3. Textual features, combined with the visual features, are stored in the dataframe 'image_text_df.csv'. Generate textual features by running:

```
python3 get_features.py --textual
```

These dataframes can then be fed to an XGBoost model or other classifiers, and LIME/SHAP can be used to analyze feature importance, as sketched below.
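As a hedged sketch of that downstream step, assuming image_text_df.csv contains a binary 'label' column marking matching vs. mismatched pairs (an assumption about the generated file, not guaranteed by this repo):

```python
import pandas as pd
import shap
import xgboost as xgb

df = pd.read_csv("image_text_df.csv")
X = df.drop(columns=["label"]).select_dtypes("number")  # keep numeric feature columns
y = df["label"]                                         # assumed binary match/mismatch label

# Fit a gradient-boosted tree classifier on the extracted features
clf = xgb.XGBClassifier(n_estimators=200, max_depth=4)
clf.fit(X, y)

# TreeExplainer computes exact SHAP values for tree ensembles like XGBoost
explainer = shap.TreeExplainer(clf)
shap_values = explainer.shap_values(X)
shap.summary_plot(shap_values, X)  # global feature-importance overview
```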
