AMULETY: A Python package to embed adaptive immune receptor sequences
Abstract
Large language models have been developed to capture relevant features of adaptive immune receptors, each with unique potential applications. However, the diversity in available models presents challenges in accessibility and usability for downstream applications. Here we present AMULETY (Adaptive imMUne receptor Language model Embedding Tool), a Python-based software package to generate language model embeddings for adaptive immune receptor sequences, enabling users to leverage the strengths of different models without the need for complex configuration. AMULETY offers functions for embedding adaptive immune receptor amino acid sequences using pre-trained protein or antibody language models for paired heavy and light chain or single chain sequences. We showcase the variability on the embedding space for several embeddings on a dataset of antibody binders to several SARS-CoV-2 epitopes and showed that different models may be effective at capturing different aspects of the distinctions between epitope groups. AMULETY is available under GPL v3 license from<ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://github.com/immcantation/amulety">https://github.com/immcantation/amulety</ext-link>or via<monospace>pip</monospace>from the Python Package Index (PyPI) from<ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://pypi.org/project/amulety/">https://pypi.org/project/amulety/</ext-link>.
Abstract Figure
<fig id="ufig1" position="float" fig-type="figure" orientation="portrait"><graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="644583v1_ufig1" position="float" orientation="portrait"/></fig>Related articles
Related articles are currently not available for this article.