publications

My Google Scholar profile also contains a list of my publications/pre-prints. The list below is sorted in order of original pre-printing.

* indicates co-lead authors

  1. Jim Shaw, Christina Boucher, Yun William Yu, Noelle Noyes, Heng Li. devider: long-read reconstruction of many diverse haplotypes (2024). bioRxiv. Accepted to RECOMB2025.
  2. Brian Zhang, Grace Oualline, Jim Shaw, Yun William Yu. skandiver: a divergence-based analysis tool for identifying intercellular mobile genetic elements (2024). ECCB/Bioinformatics.
  3. Jim Shaw, Yun William Yu. Fairy: fast approximate coverage for multi-sample metagenomic binning (2024). Microbiome.
  4. Jim Shaw*, Jean-Sebastien Gounot*, Hanrong Chen, Niranjan Nagarajan, Yun William Yu. Floria: Fast and accurate strain haplotyping in metagenomes (2024). ISMB/Bioinformatics.
  5. Jim Shaw, Yun William Yu. Rapid species-level metagenome profiling and containment estimation with sylph (2024). Nature Biotechnology.
  6. Jim Shaw, Yun William Yu. Fast and robust metagenomic sequence comparison through sparse chaining with skani (2023). Nature Methods.
  7. Andrew Zheng, Jim Shaw, Yun William Yu. Mora: abundance aware metagenomic read re-assignment for disentangling similar strains (2024). BMC Bioinformatics.
  8. Jim Shaw, Yun William Yu. Proving sequence aligners can guarantee accuracy in almost O(m log n) time through an average-case analysis of the seed-chain-extend heuristic (2023). Genome Research.
  9. Martin Frith, Jim Shaw, John Spouge. How to optimally sample a sequence for rapid analysis (2023). Bioinformatics.
  10. Jim Shaw, Yun William Yu. Theory of local k-mer selection with applications to long-read alignment (2022). Bioinformatics.
  11. Jim Shaw, Yun William Yu. Practical probabilistic and graphical formulations of long-read polyploid haplotype phasing (2022). Journal of Computational Biology: RECOMB 2021 Issue.
  12. Tom Ouellette, Jim Shaw, Philip Awadalla. Using image-based haplotype alignments to map global adaptation of SARS-CoV-2 (2020). bioRxiv.
  13. Ryan Cotsakis*, Jim Shaw*, Julien Tierny, Joshua Levine. Implementing Persistence-Based Clustering of Point Clouds in the Topology ToolKit (2021). Topological Methods in Data Analysis and Visualization VI. Mathematics and Visualization.
  14. Simone Hu*, Oliver Schnetz*, Jim Shaw*, Karen Yeats*. Further investigations into the graph theory of phi^4-periods and the c2 invariant (2020). Annales de l’Institut Henri Poincare D.
  15. D. Bertrand, J. Shaw, M Narayan, H.Q.A. Ng, S. Kumar, C. Li, M. Dvornicic, J.P. Soldo, J.Y. Kho, O.T. Ng, T. Barkham, B. Young, K. Marimuthu, K.R. Chng, M. Sikic, N. Nagarajan. Hybrid metagenomic assembly enables high-resolution analysis of resistance determinants and mobile elements in human microbiomes (2019). Nature Biotechnology.

Talks and presentations

  1. Floria talk. (2024). ISMB 2024, Montreal.
  2. sylph talk v2. (2024). Great Lakes Bioinformatics 2024, Pittsburgh. Recorded talk available here.
  3. sylph talk v1. (2023). Genome Informatics 2023, Cold Spring Harbor Laboratory.
  4. skani talk. (2023). ISMB 2023, Lyon.
  5. Seed-chain-extend alignment rigorous average-case analysis talk. (2023). RECOMB 2023, Istanbul. Recorded talk available here (the slides in the recorded talk are delayed).