CRISPRCasIdentifier: Machine learning for accurate identification and classification of CRISPR-Cas systems
Abstract
CRISPR-Cas genes are extraordinarily diverse and evolve rapidly when compared to other prokaryotic genes. With the rapid increase in newly sequenced archaeal and bacterial genomes, manual identification of CRISPR-Cas systems is no longer viable. Thus, an automated approach is required for advancing our understanding of the evolution and diversity of these systems, and for finding new candidates for genome engineering in eukaryotic models. In this paper, we introduce a holistic strategy that combines regression and classification models for improving the quality of protein cascades, predicting their subtypes, detecting signature genes and extracting potential rules that reveal functional modules for CRISPR.
Related articles
Related articles are currently not available for this article.