
A deep learning framework for protein-to-metal binding prediction using protein language models

  • Fairuz Shadmani Shishir
  • Bishnu Sarker
  • Farzana Rahman
  • Sumaiya Shomaji
  • University of Kansas
  • Meharry Medical College
  • Department of Computer Science, Kingston University London, UK

Research output: Contribution to journal › Article › peer-review


Abstract

This study presents an end-to-end deep learning framework for protein-to-metal-ion binding prediction, a critical task in understanding protein function, structural stability, and metal transport mechanisms. A binding site is a residue position in a protein sequence at which a metal ion binds. Manual curation of metal binding sites is a tedious process that involves mining research articles, which makes it expensive, laborious, and time-consuming. A computational pipeline is therefore essential for predicting metal-ion binding in unannotated proteins. Existing computational methods share three significant shortcomings: they fail to capture long-range dependencies among residues, they lack positional information, and they rely on a pre-determined set of residues and metal ions. In this paper, we propose a metal-ion binding prediction pipeline using a large language model, emphasizing 1) the comparative performance of five state-of-the-art protein language models (pLMs), 2) the impact of positional encoding of binding sites, and 3) the comparison with classical machine learning techniques. A 10-fold cross-validation evaluation yielded a Matthews Correlation Coefficient (MCC) of 0.89, along with precision, recall, and F1 scores exceeding 95% for the six most extensively studied metal ions reported in the literature.
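
For readers unfamiliar with the metrics reported above, the following is a minimal sketch of how MCC, precision, recall, and F1 are computed from binary predictions. It is not the authors' evaluation code: the function names `confusion_counts` and `binary_metrics` and the toy labels are assumptions made for illustration, and the abstract does not state how per-ion results were aggregated.

```python
import math


def confusion_counts(y_true, y_pred):
    """Count true/false positives and negatives for 0/1 labels."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, tn, fp, fn


def binary_metrics(y_true, y_pred):
    """Return precision, recall, F1, and MCC; undefined ratios fall back to 0.0."""
    tp, tn, fp, fn = confusion_counts(y_true, y_pred)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    # MCC = (TP*TN - FP*FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN))
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {"precision": precision, "recall": recall, "f1": f1, "mcc": mcc}


# Hypothetical toy labels: 1 = residue binds the metal ion, 0 = it does not.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 1, 0, 0, 0, 0, 1, 1]
metrics = binary_metrics(y_true, y_pred)
print(metrics)
```

Unlike precision, recall, and F1, MCC uses all four cells of the confusion matrix, so it stays informative when binding residues are far rarer than non-binding ones.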
Original language: English
Pages (from-to): 2575-2585
Number of pages: 11
Journal: IEEE Transactions on Computational Biology and Bioinformatics
Volume: 22
Issue number: 6
DOIs
Publication status: Published - 2025

Keywords

  • Metal binding site
  • bio-transformers
  • deep learning
  • large language model
  • protein language model
  • transformers
