SMENET: a Multi-view Semantic Model for Multi-level Enzyme Function Prediction

Hanwen Zhou; Wei Zhang; Zhaohong Deng; Guanjin Wang; Zhisheng Wei; Lei Wang; Xiaoyong Pan; Hong-Bin Shen; Dong-Jun Yu; Jing Wu

doi:10.1109/TCBBIO.2025.3644035

Journal article

SMENET: a Multi-view Semantic Model for Multi-level Enzyme Function Prediction

Hanwen Zhou, Wei Zhang, Zhaohong Deng, Guanjin Wang, Zhisheng Wei, Lei Wang, Xiaoyong Pan, Hong-Bin Shen, Dong-Jun Yu and Jing Wu

IEEE Transactions on Computational Biology and Bioinformatics, Early Access

2025

DOI: https://doi.org/10.1109/TCBBIO.2025.3644035

PMID: 41396755

Abstract

Annotations

Artificial intelligence

attention mechanism

Data mining

Databases

deep learning

Encoding

Enzymes

Feature extraction

large language model

Multi-level enzyme function prediction

multi-view learning

Predictive models

Protein engineering

protein sequence embedding

Semantics

Comprehending biological reproduction and cellular metabolism is facilitated by the Enzyme Commission, which matches protein sequences to the biochemical reactions they catalyse through EC numbers. In recent years, several methods have been proposed for predicting enzyme function. However, these methods still encounter challenges. Firstly, traditional methods for manually designing enzyme features are complex and cumbersome, lacking an effective generalized method for embedding enzyme sequences. Secondly, the distribution gap between different enzymes is significant, which resulting in existing methods struggling to predict multilevel enzyme functions. Thirdly, traditional enzyme function prediction models only extract single view feature of enzyme, so there is still room for further improving the ability of these models to extract enzyme data. To address these challenges, a new multilevel enzyme function prediction model (SMENET) based on multi-view semantics is proposed. This method uses protein large language model to extract semantic information. Subsequently, this semantic information is fed into multiple information extraction network modules, followed by using Biologic Sematic Attention to integrate these views' information. Finally, a multi-view adaptive fusion network is designed to extract the best common representation between multiple semantic views. Extensive experiments were conducted on multiple datasets to validate the effectiveness of SMENET. The code and dataset of this study are available at https://github.com/zerohanwen/SMENET.

Details

Title: SMENET: a Multi-view Semantic Model for Multi-level Enzyme Function Prediction
Authors/Creators: Hanwen Zhou - Jiangnan University
Wei Zhang - Beijing Academy of Artificial Intelligence
Zhaohong Deng - Jiangnan University
Guanjin Wang - Murdoch University, School of Information Technology
Zhisheng Wei - Jiangnan University
Lei Wang - Jiangnan University
Xiaoyong Pan - Shanghai Jiao Tong University
Hong-Bin Shen - Shanghai Jiao Tong University
Dong-Jun Yu - Nanjing University of Science and Technology
Jing Wu - Jiangnan University
Publication Details: IEEE Transactions on Computational Biology and Bioinformatics, Early Access
Publisher: IEEE
Number of pages: 11
Identifiers: 991005848588207891
Murdoch Affiliation: School of Information Technology; Murdoch University
Language: English
Resource Type: Journal article

Metrics

3 Record Views