Welcome to the dbCAN-HGM Database!
dbCAN-HGM is a comprehensive data repository that catalogs and profiles human gut microbial CAZyme gene clusters (CGCs) and their families. It features glycan substrate inference, representation across diverse human populations, diet-associated read mapping, and enhanced functional annotations. It aims to provide a better understanding of CGC composition and carbohydrate-degrading potential in gut microbiomes, and to establish a foundation for future comparative and functional analyses of CGCs.
Similar online resources include CAZy, dbCAN, dbCAN-PUL, and dbCAN-seq.
Unique data and features of dbCAN-HGM:
- 121,883 CAZyme gene clusters (CGCs) were identified from 6,031 high-quality species-level representative MAGs, which span a global range of human populations and lifestyles, including the under-represented African gut microbiomes.
- Each CGC page includes metagenomic read mapping results from different diets (vegan, vegetarian, omnivore, flexitarian) and disease statuses (ulcerative colitis [UC] and Crohn's disease [CD]), with interactive coverage plots and JBrowse alignment tracks.
- CGCs are fully annotated with CAZymes, transporters, signal transduction proteins, transcription factors, sulfatases, peptidases, and Pfam families, as well as protein 3D structural homologs for otherwise unannotated proteins.
- CGC families are presented with metadata and visualizations of, e.g., family size, substrate inference, taxonomic composition, host geographic distribution, and the most abundant CAZyme families.
- A user-friendly, highly interactive web interface supports browsing of genomes, CGCs, and CGC families by (i) glycan substrate and (ii) continent, as well as batch downloading of sequence and annotation data.
Note: All predicted substrate assignments for CGCs in this database require experimental validation. We hope that these predicted CGCs and substrates will facilitate the experimental characterization of new polysaccharide utilization loci (PULs) by the carbohydrate community.