Tools
GitHub links to open-source software and analysis tools, developed as part of my research in pangenomics and machine learning to study microbial epidemiology and evolution.
Featured
A pangenome-graph-based gene caller for bacterial genomes.
A Snakemake pipeline combining ggCaller, Panaroo, and Bakta for pangenome analysis.
A long-context encoder-decoder package for pangenomes.
A nucleotide-level pangenome simulator written in Rust.
Scalable models of pangenome evolution in bacteria, fitting to PopPUNK data using Approximate Bayesian Computation.
A tool for pangenome analysis from metagenome assembled genomes (MAGs).
A pangenome graph-based tool for adaptive sampling in nanopore sequencing.
More
A workflow to analyse experimental evolution data using read alignment, variant calling and annotation.
A Python script to analyse Orthofinder results and determine orthologues based on synteny.
A Snakemake workflow for clustering billions of proteins.
A set of scripts for constructing trees from millions of genomes.
Scripts for generating training datasets for Bacformer v2.
A tool for counting unitigs in a de Bruijn graph for microbial GWAS.
Scripts for applying large language models to pangenome analysis.
A tool for identifying and cutting target sequences from a genome.
Scripts to aid in analysis of adaptive sampling experiments in nanopore sequencing.