Integrating the ENCODE blocklist for machine learning quality control of ChIP-seq data with seqQscorer
Abstract
Motivation
Quality assessment of next-generation sequencing data is a complex but important task to ensure correct conclusions from experiments in molecular biology, biomedicine, and biotechnology. We previously introduced seqQscorer, a quality assessment tool using machine learning to support this process. To improve seqQscorer in terms of accuracy and processing time, we integrated the ENCODE blocklist * to derive a new type of quality-related features, supposed to be more informative and faster in generation than those conventionally used by seqQscorer.
Results
The novel seqQscorer extension, called seqBLQ, allows us to improve the quality assessment for ChIP-seq data derived from human tissues and cell lines. Furthermore, seqBLQ enhances the usability of the tool by simplifying the installation procedure and reducing the computational resources required for feature generation.
Availability and implementation
<ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://github.com/salbrec/seqQscorer">https://github.com/salbrec/seqQscorer</ext-link>
Related articles
Related articles are currently not available for this article.