TY - UNPB
T1 - Graphite: painting genomes using a colored De Bruijn graph
AU - Beeloo, Rick
AU - Zomer, Aldert L.
AU - Dutilh, Bas E.
PY - 2023/1/1
Y1 - 2023/1/1
N2 - The recent growth of microbial sequence data allows comparisons at unprecedented scales, enabling tracking of strains, mobile genetic elements, or genes. Querying a genome against a large reference database can easily yield thousands of matches that are tedious to interpret and pose computational challenges. We developed Graphite that uses a colored De Bruijn graph (cDBG) to paint query genomes, selecting the local best matches along the full query length. By focusing on the closest genomic match of each query region, Graphite reduces the number of matches while providing promising leads for genomic forensics. When applied to hundreds of Campylobacter genomes we found extensive gene sharing, including a previously undetected C. coli plasmid that matched a C. jejuni chromosome. Together, genome painting using cDBGs as enabled by Graphite, can reveal new biological phenomena by mitigating computational hurdles. Graphite is implemented in Julia, available at https://github.com/MGXlab/Graphite.Competing Interest StatementThe authors have declared no competing interest.
AB - The recent growth of microbial sequence data allows comparisons at unprecedented scales, enabling tracking of strains, mobile genetic elements, or genes. Querying a genome against a large reference database can easily yield thousands of matches that are tedious to interpret and pose computational challenges. We developed Graphite that uses a colored De Bruijn graph (cDBG) to paint query genomes, selecting the local best matches along the full query length. By focusing on the closest genomic match of each query region, Graphite reduces the number of matches while providing promising leads for genomic forensics. When applied to hundreds of Campylobacter genomes we found extensive gene sharing, including a previously undetected C. coli plasmid that matched a C. jejuni chromosome. Together, genome painting using cDBGs as enabled by Graphite, can reveal new biological phenomena by mitigating computational hurdles. Graphite is implemented in Julia, available at https://github.com/MGXlab/Graphite.Competing Interest StatementThe authors have declared no competing interest.
U2 - 10.1101/2023.10.08.561343
DO - 10.1101/2023.10.08.561343
M3 - Preprint
T3 - bioRxiv
BT - Graphite: painting genomes using a colored De Bruijn graph
PB - bioRxiv
ER -