next.pathogen.watch docs
  • Welcome to Pathogenwatch
  • News & Release Notes
    • Announcements
    • Release Notes 2025
    • Release Notes 2024
    • Release Notes 2023
    • Release Notes 2022
    • Release Notes 2019-2021
  • Getting Started
    • Sign in
    • A Brief Tour of Pathogenwatch
    • Interactive Collection View tutorial
    • Useful Links
  • How to use Pathogenwatch
    • Using the documentation
    • Using The Interactive Collection View
      • The Interactive Collection View
      • The Map Panel
      • The Tree Panel
        • Tree Panel
        • Generating a new tree
      • Data Tables
      • The Timeline Panel
      • Context search panel
      • Legend, Labels, and Colours
      • Searching genomes in a collection
      • Creating sub-collections
    • Genome Uploads & Folders
    • Browsing & Searching Genomes
    • Browsing Collections
    • Creating & Sharing Collections
    • Genome Reports
    • Deleting items
    • SARS-CoV-2 Tutorial
    • Tips and Tricks
  • Technical Descriptions of Analysis Tools
    • Genome Assembly
      • Short Read Assembly
      • Assembling genomes with EToKi
    • Plasmid Annotation
      • Inctyper
    • Assigning species with Speciator
    • Trees, Clustering, and Context Search
      • Core Genome Tree
        • About SNP-based trees
        • Core Assignment
        • Core Filter
        • Reference Assignment
        • Tree Construction
      • cgMLST Clustering & Context Searching
      • SARS-CoV-2 Genome Tree
      • cgMLST Tree
    • Lineage Assignment & Genotyping Methods
      • Genotyphi
      • Kleborate
      • cgMLST
      • Klebsiella LIN Codes
      • MLST
      • NG-MAST
      • Pangolin
      • PopPUNK
      • Vista
      • Finding HierCC codes with hclink
      • SARS-CoV-2 Notable Mutations
    • Serotyping
      • Kaptive
      • SeroBA
      • SISTR
      • ECTyper
    • Antimicrobial Resistance Prediction
      • Pathogenwatch AMR
      • Kleborate AMR
      • SPN-PBP-AMR
      • Resfinder
    • Virulence
      • STECFinder
      • VirulenceFinder
      • BIGSdb schemes
  • WHO bacterial priority pathogens
  • Initiatives powered by Pathogenwatch
    • PATH-SAFE
      • PATH-SAFE Sign in
      • What is the PATH-SAFE Programme?
      • PATH-SAFE powered by Pathogenwatch
      • Two-tool Serotyping with SISTR & SeqSero2
      • S. enterica SNP tree
      • PATH-SAFE analyses
  • How to cite
  • Acknowledgements
  • Privacy and Terms Of Service
  • FAQ
  • Report an Issue
Powered by GitBook
On this page
  • About
  • Method
  • Profile construction
  • Distance scoring
  • Tree construction
  • Code repository
  • Citations
  • FASTME
  1. Technical Descriptions of Analysis Tools
  2. Trees, Clustering, and Context Search

cgMLST Tree

How cgMLST profile-based trees are constructed

PreviousSARS-CoV-2 Genome TreeNextLineage Assignment & Genotyping Methods

Last updated 2 months ago

About

For some species Pathogenwatch uses the cgMLST profiles generated by the inhouse cgMLST tool to build allele distance-based Neighbour Joining trees for genomes in a collection. The distances correspond to those used in the .

Method

Profile construction

For details about how cgMLST profiles are constructed in Pathogenwatch see the . A cgMLST profile consists of a list of gene loci with one allele code or a "gap" assigned to each.

Distance scoring

The distance is calculated by determining the number of differences over the number of positions in common (i.e. not gaps in both) between the two profiles being compared. Each profile is compared against each other to produce a matrix of distance scores.

Tree construction

The matrix is then used to create a Neighbour Joining tree using the . The resulting dendogram is mid-point rooted using .

Code repository

The software can be found at .

Citations

FASTME

Vincent Lefort, Richard Desper, Olivier Gascuel, FastME 2.0: A Comprehensive, Accurate, and Fast Distance-Based Phylogeny Inference Program, Molecular Biology and Evolution, Volume 32, Issue 10, October 2015, Pages 2798–2800,

cgMLST clustering tool
cgMLST description page
FASTME software
Biopython
https://github.com/pathogenwatch-oss/cgmlst-tree
https://doi.org/10.1093/molbev/msv150