Graphite: painting genomes using a colored De Bruijn graph

Research output: Working paperPreprintAcademic

Abstract

The recent growth of microbial sequence data allows comparisons at unprecedented scales, enabling tracking of strains, mobile genetic elements, or genes. Querying a genome against a large reference database can easily yield thousands of matches that are tedious to interpret and pose computational challenges. We developed Graphite that uses a colored De Bruijn graph (cDBG) to paint query genomes, selecting the local best matches along the full query length. By focusing on the closest genomic match of each query region, Graphite reduces the number of matches while providing promising leads for genomic forensics. When applied to hundreds of Campylobacter genomes we found extensive gene sharing, including a previously undetected C. coli plasmid that matched a C. jejuni chromosome. Together, genome painting using cDBGs as enabled by Graphite, can reveal new biological phenomena by mitigating computational hurdles. Graphite is implemented in Julia, available at https://github.com/MGXlab/Graphite.Competing Interest StatementThe authors have declared no competing interest.
Original languageEnglish
PublisherbioRxiv
Number of pages19
DOIs
Publication statusPublished - 1 Jan 2023

Publication series

NamebioRxiv
PublisherCold Spring Harbor Laboratory
ISSN (Print)2692-8205

Fingerprint

Dive into the research topics of 'Graphite: painting genomes using a colored De Bruijn graph'. Together they form a unique fingerprint.

Cite this