Giant genes are rare but implicated in cell wall degradation by predatory bacteria

This article has 1 evaluations Published on
Read the full article Related papers
This article on Sciety

Abstract

Across the tree of life, gene lengths vary, but most are no more than a few thousand base pairs in length. The largest protein often reported is the ∼40,000 aa eukaryotic Titin. Even larger proteins may occur in the rapidly expanding set of metagenome-derived sequences, but their existence may be obscured by assembly fragmentation. Here, we leverage genome curation to complete metagenome-derived sequences that encode predicted proteins of up to 85,804 aa. Overall, the findings illuminate a huge knowledge gap related to giant proteins. Although predicted proteins of >30,000 aa occur in bacterial phyla such asFirmicutesandActinobacteria, they are most common in ca.Omnitrophota,ultra small bacteria that adopt predatory lifestyles. All full length giant genes encode numerous transmembrane regions and most encode divergent secA DEAD helicase domains.In silicostructural prediction of protein subregions was required to identify domains in unannotated protein segments, and revealed putative domains implicated in attachment and carbohydrate degradation. Many giant genes in new complete and near-completeOmnitrophotagenomes occur in close proximity to genes homologous to type II secretion systems as well as carbohydrate import systems. This, in combination with the domain content, suggests that many bacterial giant proteins enable prey adhesion and cell wall digestion during bacterial predation.

Related articles

Related articles are currently not available for this article.