gplas: a comprehensive tool for plasmid analysis using short-read graphs

Sergio Arredondo-Alonso, Martin Bootsma, Yaïr Hein, Malbert R C Rogers, Jukka Corander, Rob J L Willems, Anita C Schürch

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

SUMMARY: Plasmids can horizontally transmit genetic traits, enabling rapid bacterial adaptation to new environments and hosts. Short-read whole-genome sequencing data are often applied to large-scale bacterial comparative genomics projects but the reconstruction of plasmids from these data is facing severe limitations, such as the inability to distinguish plasmids from each other in a bacterial genome. We developed gplas, a new approach to reliably separate plasmid contigs into discrete components using sequence composition, coverage, assembly graph information and network partitioning based on a pruned network of plasmid unitigs. Gplas facilitates the analysis of large numbers of bacterial isolates and allows a detailed analysis of plasmid epidemiology based solely on short-read sequence data.

AVAILABILITY AND IMPLEMENTATION: Gplas is written in R, Bash and uses a Snakemake pipeline as a workflow management system. Gplas is available under the GNU General Public License v3.0 at https://gitlab.com/sirarredondo/gplas.git.

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Original languageEnglish
Pages (from-to)3874-3876
Number of pages3
JournalBioinformatics
Volume36
Issue number12
DOIs
Publication statusPublished - 15 Jun 2020

Fingerprint

Dive into the research topics of 'gplas: a comprehensive tool for plasmid analysis using short-read graphs'. Together they form a unique fingerprint.

Cite this