Structure-aware protein sequence alignment using contrastive learning

This article has 1 evaluations Published on
Read the full article Related papers
This article on Sciety

Abstract

Protein alignment is a critical process in bioinformatics and molecular biology. Despite structure-based alignment methods being able to achieve desirable performance, only a very small number of structures are available among the vast of known protein sequences. Therefore, developing an efficient and effective sequence-based protein alignment method is of significant importance. In this study, we propose CLAlign, which is a structure-aware sequence-based protein alignment method by using contrastive learning. Experimental results show that CLAlign outperforms the state-of-the-art methods by at least 12.5% and 24.5% on two common benchmarks, Malidup and Malisam.

Related articles

Related articles are currently not available for this article.