Sequence-Only Prediction of Binding Affinity Changes: A Robust and Interpretable Model for Antibody Engineering

Introduction

ProtAttBA is a protein language model that predicts binding affinity changes based solely on the sequence information of antibody-antigen complexes.

Usage

Install

Create conda environment

conda create -n protab python==3.10

Install environment dependency

# activate environment
source activate protab
# install pytorch
conda install pytorch==2.1.2 torchvision==0.16.2 torchaudio==2.1.2 pytorch-cuda=11.8 -c pytorch -c nvidia 
(or use pip: pip install torch==2.1.2 torchvision==0.16.2 torchaudio==2.1.2 --index-url https://download.pytorch.org/whl/cu118)

# install dependencies
pip install -r ./requirments.txt

dataset

The source data are located in the source_data(SKEMPI[1], AB-Bind[2]).

[1] Moal I H, Fernández-Recio J. SKEMPI: a structural kinetic and energetic database of mutant protein interactions and its use in empirical models[J]. Bioinformatics, 2012, 28(20): 2600-2607.

[2] Sirin S, Apgar J R, Bennett E M, et al. AB‐bind: antibody binding mutational database for computational affinity predictions[J]. Protein Science, 2016, 25(2): 393-409.

Cross validation dataset is located in the cross_validation/data/csv folder (Using the dataset processed by: Jin et al., 2024). The results are located in cross_validation/results.

Sequence identity dataset is located in the seq-identity_sig-mul/data/identity_data folder (Use MMseqs with --min-seq-id 0.3). The results are located in the seq-identity_sig-mul/result_idt.

Single mutation training and multi-mutation testing dataset is located in the seq-identity_sig-mul/data/sigmul_data folder. The results are located in the seq-identity_sig-mul/result_sigmul.

Training

# For cross validation you can use cross_validation/scripts/bash_cross-validation.sh with different args
cp bash_cross-validation.sh ../
bash bash_cross-validation.sh 

# For Sequence identity you can use seq-identity_sig-mul/scripts/bash_seq_identity.sh with different args
cp bash_seq_identity.sh ../ 
bash bash_seq_identity.sh

# For Single mutation training and multi-mutation testing you can use seq-identity_sig-mul/scripts/bash_seq_sigmul.sh with different args
cp bash_seq_sigmul.sh ../ 
bash bash_seq_sigmul.sh

Evaluation

# For evaluation you can use the seq-identity_sig-mul/eval.py to predict the result by change the args
python eval.py

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
attention_analysis		attention_analysis
cross_validation		cross_validation
seq-identity_sig-mul		seq-identity_sig-mul
source_data		source_data
readme.md		readme.md
requirments.txt		requirments.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Sequence-Only Prediction of Binding Affinity Changes: A Robust and Interpretable Model for Antibody Engineering

Introduction

Usage

Install

dataset

Training

Evaluation

About

Uh oh!

Releases

Packages

Languages

code4luck/ProtAttBA

Folders and files

Latest commit

History

Repository files navigation

Sequence-Only Prediction of Binding Affinity Changes: A Robust and Interpretable Model for Antibody Engineering

Introduction

Usage

Install

dataset

Training

Evaluation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages