dbCAN Logo

📊 MAG Datasets of Human Gut Microbiomes

  • This table summarizes the metagenome-assembled genomes (MAGs) from diverse human populations and lifestyles that are used across the platform.
Database Reference Populations/Lifestyles # of MAGs Used # in Non-Redundant Set # Representative
H3Africa Phase 1Tamburini et al., 2022190 women in South Africa8178073
H3Africa Phase 2Maghini et al., 20251,820 adults across 4 African countries17961779706
HadzaCarter et al., 2024167 Hadza hunter-gatherers961960682
UHGGAlmeida et al., 2021Mostly Global North countries242419801055
WISLeviatan et al., 2022Mostly Israel and USA21731540608
ELGGZeng et al., 2022Children under 3 (Global North)315299178
CGMRHuang et al., 2024Chinese Gut Microbial Reference20241989811
HRGM2Ma et al., 202441 countries (Asia, Africa, S. America)232423071743
IMGGJin et al., 2023Inner Mongolia (ONT + Illumina)481478206
SPMPGounot et al., 2022109 healthy Singaporeans1769174739
Total15,08413,8866,031

🏠 Home Page

  • Quickly browse and search genomes, CGCs, and CGC families with ease.
Home Page Help

🧬 Genome Page

  • Metadata and basic information of genomes can be downloaded after genome browsing and searching.
  • Genome Help 1
  • Each representative genome page provides the Genome list, Basic statistics plots, Taxonomy lineage, CGC list of the rep genome, as well as Read mapping on CGC regions visualized by JBrowse.
  • Genome Help 2

🧪 CGC Family Page

  • Browse CGC families by Substrate to view a list of families and explore the top abundant CAZyme families within each substrate through interactive plots.
  • CGC Family Help 1
  • Browse all CGC families to explore the substrate information of their CGCs and access links to each family’s detailed page.
  • CGC Family Help 2
  • Each CGC family page provides basic statistics plots, taxonomy distribution plot and network diagram of the family members.
  • CGC Family Help 3

🧬 CGC (Gene Cluster) Page

  • Browse CGCs by continent and substrate to explore the distribution of CGCs targeting various substrates across different continents.
  • CGC Help 1
  • Each CGC page provides a Gene composition table, links to Protein gene annotation resources, and information on Genome source and Taxonomic lineage.
  • CGC Help 2
  • The null genes without sequence homology annotation are linked to a protein structure page, containing predicted 3D structure visualization with pLDDT confidence scores and results of structural homologs against AFDB, CAZyme3D, CAZymeID50, PDB, and SwissProt.
  • CGC Help 3
  • Each CGC page also includes Read coverage plots from diet intervention studies to compare CGC coverage in gut metagenome samples across different diets, along with a gene composition diagram.
  • CGC Help 5
  • Explore read mapping of CGC across individual samples using JBrowse.
  • CGC Help 4

🗺️ CAZymes HeatMap

  • Interactive heatmap displaying the distribution of CAZyme families in CGCs across species and genera. Users can select regions of the heatmap to zoom in for a more detailed view and exploration of specific family abundances.
  • CAZymes Help 1