📊 MAG Datasets of Human Gut Microbiomes

  • This table summarizes the metagenome-assembled genomes (MAGs) from diverse human populations and lifestyles that are used across the platform.
| Database | Reference | Populations/Lifestyles | # of MAGs Used | # in Non-Redundant Set | # Representative |
| --- | --- | --- | --- | --- | --- |
| H3Africa Phase 1 | Tamburini et al., 2022 | 190 women in South Africa | 817 | 807 | 3 |
| H3Africa Phase 2 | Maghini et al., 2025 | 1,820 adults across 4 African countries | 1796 | 1779 | 706 |
| Hadza | Carter et al., 2024 | 167 Hadza hunter-gatherers | 961 | 960 | 682 |
| UHGG | Almeida et al., 2021 | Mostly Global North countries | 2424 | 1980 | 1055 |
| WIS | Leviatan et al., 2022 | Mostly Israel and USA | 2173 | 1540 | 608 |
| ELGG | Zeng et al., 2022 | Children under 3 (Global North) | 315 | 299 | 178 |
| CGMR | Huang et al., 2024 | Chinese Gut Microbial Reference | 2024 | 1989 | 811 |
| HRGM2 | Ma et al., 2024 | 41 countries (Asia, Africa, S. America) | 2324 | 2307 | 1743 |
| IMGG | Jin et al., 2023 | Inner Mongolia (ONT + Illumina) | 481 | 478 | 206 |
| SPMP | Gounot et al., 2022 | 109 healthy Singaporeans | 1769 | 1747 | 39 |
| Total | | | 15,084 | 13,886 | 6,031 |
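
The per-dataset counts can be checked against the Total row. The following is a minimal sketch in Python, with the three counts for each dataset entered by hand from the table. The names `DATASETS`, `column_totals`, and `retained_fraction` are illustrative and are not part of the platform.

```python
# Each row: (database, MAGs used, MAGs in non-redundant set, representative MAGs).
# Counts are transcribed by hand from the dataset table.
DATASETS = [
    ("H3Africa Phase 1", 817, 807, 3),
    ("H3Africa Phase 2", 1796, 1779, 706),
    ("Hadza", 961, 960, 682),
    ("UHGG", 2424, 1980, 1055),
    ("WIS", 2173, 1540, 608),
    ("ELGG", 315, 299, 178),
    ("CGMR", 2024, 1989, 811),
    ("HRGM2", 2324, 2307, 1743),
    ("IMGG", 481, 478, 206),
    ("SPMP", 1769, 1747, 39),
]

# Values from the Total row of the table.
REPORTED_TOTALS = (15084, 13886, 6031)


def column_totals(rows):
    """Sum the three count columns across all datasets."""
    return tuple(sum(row[i] for row in rows) for i in (1, 2, 3))


def retained_fraction(rows):
    """Per dataset, the fraction of used MAGs that are in the non-redundant set."""
    return {name: non_redundant / used for name, used, non_redundant, _ in rows}


if __name__ == "__main__":
    totals = column_totals(DATASETS)
    print("column sums:", totals)
    print("matches Total row:", totals == REPORTED_TOTALS)
    for name, fraction in retained_fraction(DATASETS).items():
        print(f"{name}: {fraction:.1%} of used MAGs are in the non-redundant set")
```

With the counts as transcribed, the three column sums equal the Total row (15,084, 13,886, and 6,031).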

🏠 Home Page

  • Browse and search genomes, CAZyme gene clusters (CGCs), and CGC families. [Figure: Home Page Help]

🧬 Genome Page

  • After browsing or searching genomes, you can download their metadata and basic information. [Figure: Genome Help 1]
  • Each representative genome page provides the Genome list, Basic statistics plots, Taxonomy lineage, and CGC list of the representative genome, as well as Read mapping on CGC regions visualized with JBrowse. [Figure: Genome Help 2]

🧪 CGC Family Page

  • Browse CGC families by Substrate to view a list of families and explore the most abundant CAZyme families within each substrate through interactive plots. [Figure: CGC Family Help 1]
  • Browse all CGC families to explore the substrate information of their CGCs and access links to each family’s detailed page. [Figure: CGC Family Help 2]
  • Each CGC family page provides basic statistics plots, a taxonomy distribution plot, and a network diagram of the family members. [Figure: CGC Family Help 3]

🧬 CGC (Gene Cluster) Page

  • Browse CGCs by continent and substrate to explore the distribution of CGCs targeting various substrates across different continents. [Figure: CGC Help 1]
  • Each CGC page provides a Gene composition table, links to Protein gene annotation resources, and information on Genome source and Taxonomic lineage. [Figure: CGC Help 2]
  • Null genes (genes with no sequence homology annotation) are linked to a protein structure page. This page shows the predicted 3D structure with pLDDT confidence scores and the results of structural homolog searches against AFDB, CAZyme3D, CAZymeID50, PDB, and SwissProt. [Figure: CGC Help 3]
  • Each CGC page also includes Read coverage plots from diet intervention studies, which compare CGC coverage in gut metagenome samples across different diets, along with a gene composition diagram. [Figure: CGC Help 5]
  • Explore the read mapping of each CGC across individual samples using JBrowse. [Figure: CGC Help 4]

🗺️ CAZymes HeatMap

  • An interactive heatmap displays the distribution of CAZyme families in CGCs across species and genera. Select a region of the heatmap to zoom in and explore the abundances of specific families in more detail. [Figure: CAZymes Help 1]