LDscore: a scalable, Python 3-powered web platform for LD score regression analysis

This article has 4 evaluations Published on
Read the full article Related papers
This article on Sciety

Abstract

Linkage disequilibrium score regression (LDSC) is an important analytical tool for quantifying heritability and estimating genetic correlations between complex traits. However, the LDSC original implementation relies on an outdated Python 2 framework and deploying the standard command-line tools requires significant setup, data access, and computational expertise, creating a barrier for many researchers. To overcome these limitations, we developed LDscore, a significant technical and accessibility upgraded version of LDSC that allows for rapid analysis of GWAS data. The core advancement is the recoding of the LDSC framework in Python 3, enabling computational optimization and ensuring long-term sustainability. Built on top of this improved foundation, LDscore is implemented as a free, publicly available web application integrated within the popular NCI LDlink framework. LDscore can accelerate scientific research by providing an intuitive graphical interface for heritability estimation, genetic correlation, and LD score calculation, including access to an expanded range of reference populations for online analysis. Notably, our results show that selecting the most appropriate reference population LD panel, even at the subcontinental ancestry group level, is essential for minimizing population stratification bias in heritability estimation. By leveraging cloud computing for superior scalability and eliminating the need for local installation, LDscore adheres to FAIR principles, improving access, traceability, and reproducibility across an expanded set of reference populations, and effectively widens access to researchers worldwide providing support for in-depth genetic analyses.

Brief summary

Linkage disequilibrium score regression (LDSC), a widely-used method for quantifying heritability and genetic correlation, is limited by an outdated Python 2 framework and complex command-line deployment. We developed LDscore, a significant technical upgrade built on Python 3 for sustainability and computational optimization. LDscore is a free, cloud-based web application integrated into NCI LDlink. LDscore eliminates installation barriers, offering an intuitive interface for computing heritability estimates, LD scores, and genetic correlation. Crucially, LDscore expands the range of reference populations available in LDSC, which can reduce population-stratification-based bias. Leveraging cloud computing, LDscore accelerates and widens global researcher access to LDSC-based genetic computation.

Availability

LDscore is freely available within LDlink at <ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://ldlink.nih.gov/ldscore">https://ldlink.nih.gov/ldscore</ext-link> . Source code for the updated LDSC Python3 framework is available at <ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://github.com/CBIIT/ldsc">https://github.com/CBIIT/ldsc</ext-link> under the GNU General Public License v3.0 and the webtool code is at <ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://github.com/CBIIT/nci-webtools-dceg-linkage">https://github.com/CBIIT/nci-webtools-dceg-linkage</ext-link> (webtool code) under the MIT license.

Related articles

Related articles are currently not available for this article.