This repository provides tools for multimodal self-supervised learning (SSL) pretraining and text-only regression fine-tuning, along with scripts for prediction and for analyzing model performance and outputs.
Below are instructions for using this repository effectively.
Note: 🚧🚧 This section is currently under construction and will be updated soon 🚧🚧.
Before you begin, ensure you have met the following requirements:
- Python 3.9 or above (required by the pinned `torch==2.5.1`)
- pip (Python Package Installer)
This project requires the following packages:
```
torch==2.5.1
transformers==4.47.1
tokenizers==0.21.0
pandas==2.2.3
pydantic==2.10.3
tqdm
wandb
```
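As a quick post-install sanity check, a minimal sketch (it only verifies that the pinned core packages are importable):

```python
# Verify the core pinned dependencies are importable and print their versions.
import torch, transformers, tokenizers, pandas, pydantic

for mod in (torch, transformers, tokenizers, pandas, pydantic):
    print(mod.__name__, mod.__version__)
```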
For detailed preprocessing steps, please refer to data/README.md.
The dataset required for training and prediction consists of Equiformer embeddings and CatBERTa text strings. The data and checkpoints can be accessed through the following link: Data.
Please download the data, place it in the appropriate directory, and update the data and checkpoint paths in the YAML files.
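As an optional sanity check that the configured paths resolve, a minimal sketch (the keys `data_path` and `ckpt_dir` are hypothetical placeholders, so substitute whatever keys your YAML files actually define; PyYAML is not in the pinned requirements):

```python
# Hypothetical check that the paths configured in a YAML file exist.
# The key names below are placeholders; substitute your config's actual keys.
import pathlib
import yaml  # PyYAML; not in the pinned requirements, install separately

with open("clip_train.yml") as f:
    cfg = yaml.safe_load(f)

for key in ("data_path", "ckpt_dir"):  # hypothetical key names
    if key in cfg:
        print(f"{key} -> {cfg[key]} (exists: {pathlib.Path(cfg[key]).exists()})")
```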
To run the graph-assisted pre-training, execute the following command:
```bash
python clip_run.py
```

Adjustments to the data paths, training configuration, and other settings can be made in the `clip_train.yml` file located in the root directory. Additionally, settings specific to the multimodal SSL approach are defined in `model/clip.yml`.
For text-only regression fine-tuning, the following command should be used:
```bash
python regress_run.py
```

Specific settings should be defined in `regress_train.yml`.
To make predictions using text-only data, use the `regress_predict.py` script:

```bash
python regress_predict.py --data_path <PATH_TO_DATA> \
    --pt_ckpt_dir_path <PATH_TO_CHECKPOINT> \
    --save_path <PATH_TO_SAVE_PREDICTIONS>
```

In the paper, test predictions are made on ML-relaxed structures. To assess their accuracy, the predicted values are compared with the valid DFT energies of the ML-relaxed systems.
You can generate the comparison using the following command:
```bash
python analysis/parity_plot.py --pred_path <PATH_TO_PRED_RESULTS> \
    --save_dir <SAVE_DIRECTORY> \
    --mapping_file_path <PATH_TO_MAPPING_FILE> \
    --dft_target_path <PATH_TO_DFT_ENERGIES> \
    --model <MODEL_TYPE: gnoc, scn, or escn>
```

- OC20-Dense metadata file: `oc20dense_mapping.pkl` (link)
- OC20-Dense OCP Challenge DFT energies: `ml_relaxed_dft_targets.pkl` (link)
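For a quick numerical check alongside the parity plot, a minimal sketch (this assumes the prediction results and `ml_relaxed_dft_targets.pkl` can both be read as mappings from system ID to energy in eV; the actual file formats may differ, so adapt the loading steps accordingly):

```python
# Hypothetical comparison of predicted vs. DFT energies.
# ASSUMPTION: both files are pickles mapping system IDs to energies in eV;
# adapt the loading to the actual formats used in this repository.
import pickle
import numpy as np

with open("predictions.pkl", "rb") as f:  # hypothetical prediction file
    preds = pickle.load(f)
with open("ml_relaxed_dft_targets.pkl", "rb") as f:
    targets = pickle.load(f)

common = sorted(set(preds) & set(targets))
errors = np.array([preds[k] - targets[k] for k in common])
print(f"MAE over {len(common)} systems: {np.abs(errors).mean():.3f} eV")
```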
To extract section-wise attention from the model, use the `get_section_attention.py` script:

```bash
python analysis/get_section_attention.py --data_path <PATH_TO_DATA> \
    --pt_ckpt_dir_path <PATH_TO_CHECKPOINT> \
    --save_path <PATH_TO_SAVE_OUTPUT>
```
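To inspect the saved output, a minimal sketch (this assumes the script writes a pickle mapping section names to attention scores; the actual output format may differ, so adapt the loading step):

```python
# Hypothetical inspection of the saved section-attention output.
# ASSUMPTION: the file is a pickle of {section_name: attention_score};
# adjust to whatever get_section_attention.py actually writes.
import pickle

with open("section_attention.pkl", "rb") as f:  # hypothetical file name
    attention = pickle.load(f)

for section, score in sorted(attention.items(), key=lambda kv: -kv[1]):
    print(f"{section}: {score:.3f}")
```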
To obtain text encoder embeddings suitable for visualization with t-SNE plots, execute:

```bash
python analysis/get_text_embedding.py --data_path <PATH_TO_DATA> \
    --pt_ckpt_dir_path <PATH_TO_CHECKPOINT> \
    --save_path <PATH_TO_SAVE_EMBEDDINGS>
```
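Once the embeddings are saved, a minimal t-SNE sketch (this assumes an `(N, d)` array saved as `.npy`; `scikit-learn` and `matplotlib` are not in the pinned requirements, so install them separately and adapt the loading step to the actual output format):

```python
# Minimal t-SNE visualization of saved text-encoder embeddings.
# ASSUMPTION: the embeddings are an (N, d) NumPy array on disk;
# adapt the loading step to what get_text_embedding.py actually writes.
import numpy as np
import matplotlib.pyplot as plt
from sklearn.manifold import TSNE

embeddings = np.load("text_embeddings.npy")  # hypothetical file name
coords = TSNE(n_components=2, random_state=0).fit_transform(embeddings)

plt.scatter(coords[:, 0], coords[:, 1], s=5)
plt.title("t-SNE of text-encoder embeddings")
plt.savefig("tsne_text_embeddings.png", dpi=300)
```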
For detailed preprocessing steps, please refer to generation/README.md.

For any questions or further information, please reach out to [email protected] or [email protected].
If you use this work in your research, please cite the following two papers:
```bibtex
@misc{ock2024multimodal,
title={Multimodal Language and Graph Learning of Adsorption Configuration in Catalysis},
author={Janghoon Ock and Srivathsan Badrinarayanan and Rishikesh Magar and Akshay Antony and Amir Barati Farimani},
year={2024},
eprint={2401.07408},
archivePrefix={arXiv},
primaryClass={cs.CE},
url={https://arxiv.org/abs/2401.07408},
}
```

```bibtex
@article{ock2023catberta,
author = {Ock, Janghoon and Guntuboina, Chakradhar and Barati Farimani, Amir},
title = {Catalyst Energy Prediction with CatBERTa: Unveiling Feature Exploration Strategies through Large Language Models},
journal = {ACS Catalysis},
volume = {13},
number = {24},
pages = {16032-16044},
year = {2023},
doi = {10.1021/acscatal.3c04956},
URL = {https://doi.org/10.1021/acscatal.3c04956},
eprint = {https://doi.org/10.1021/acscatal.3c04956}
}
```