You are browsing environment: HUMAN GUT

CAZyme Information: MGYG000001021_01085


Basic Information

Species Ruminococcus_C sp000980705
Lineage Bacteria; Firmicutes_A; Clostridia; Oscillospirales; Ruminococcaceae; Ruminococcus_C; Ruminococcus_C sp000980705
CAZyme ID MGYG000001021_01085
CAZy Family CBM4
CAZyme Description Cellulose 1,4-beta-cellobiosidase
CAZyme Property
Protein Length (aa)  CGC                    Molecular Weight (Da)  Isoelectric Point
1054                 MGYG000001021_19|CGC1  115502.46              4.3544

Genome Property
Genome Assembly ID  Genome Size (bp)  Genome Type  Country  Continent
MGYG000001021       2360136           MAG          Sweden   Europe

Gene Location  Start: 36618; End: 39782; Strand: -
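
The Gene Location coordinates and the Protein Length listed above can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates and that the annotated gene span includes a terminal stop codon; neither convention is stated on this page, and the function name is hypothetical.

```python
def protein_length_from_coords(start: int, end: int, has_stop: bool = True) -> int:
    """Derive the protein length (aa) from gene coordinates.

    Assumes 1-based inclusive coordinates (an assumption, not stated in the record).
    """
    nucleotides = end - start + 1
    if nucleotides % 3 != 0:
        raise ValueError("gene span is not a whole number of codons")
    codons = nucleotides // 3
    # Drop the stop codon if the span is assumed to include one.
    return codons - 1 if has_stop else codons


# Coordinates copied from the Gene Location line of this record.
length = protein_length_from_coords(36618, 39782)
print(length)
```

Under these assumptions the span covers 3165 nucleotides, or 1055 codons, which is consistent with the reported protein length of 1054 plus one stop codon.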

Enzyme Prediction

EC 3.2.1.4 3.2.1.78

CAZyme Signature Domains

Family  Start  End  E-Value   Family Coverage
GH9     360    911  1.2e-105  0.9880382775119617
CBM4    42     143  2.2e-20   0.7222222222222222

CDD Domains

Cdd ID     Domain             E-Value   qStart  qEnd  sStart  sEnd
pfam00759  Glyco_hydro_9      1.04e-67  361     911   2       374
Domain Description: Glycosyl hydrolase family 9.
cd02850    E_set_Cellulase_N  4.09e-23  262     348   4       86
Domain Description: N-terminal Early set domain associated with the catalytic domain of cellulase. E or "early" set domains are associated with the catalytic domain of cellulases at the N-terminal end. Cellulases are O-glycosyl hydrolases (GHs) that hydrolyze beta 1-4 glucosidic bonds in cellulose. They are usually categorized into either exoglucanases, which sequentially release terminal sugar units from the cellulose chain, or endoglucanases, which also attack the chain internally. The N-terminal domain of cellulase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase, among others.
pfam02927  CelD_N             1.07e-21  260     343   3       83
Domain Description: Cellulase N-terminal ig-like domain.
cd14256    Dockerin_I         4.08e-10  983     1040  1       57
Domain Description: Type I dockerin repeat domain. Bacterial cohesin domains bind to a complementary protein domain named dockerin, and this interaction is required for the formation of the cellulosome, a cellulose-degrading complex. The cellulosome consists of scaffoldin, a noncatalytic scaffolding polypeptide, that comprises repeating cohesin modules and a single carbohydrate-binding module (CBM). Specific calcium-dependent interactions between cohesins and dockerins appear to be essential for cellulosome assembly. This subfamily represents type I dockerins, which are responsible for anchoring a variety of enzymatic domains to the complex.
pfam02018  CBM_4_9            2.18e-06  43      214   6       131
Domain Description: Carbohydrate binding domain. This family includes diverse carbohydrate binding domains.

CAZyme Hits

Hit ID      E-Value    Query Start  Query End  Hit Start  Hit End
CBL17684.1  1.07e-248  1            1019       1          932
BAH56283.1  2.31e-220  35           915        170        1007
AAR01216.1  3.62e-217  13           918        4          902
CAS03458.1  1.14e-216  13           931        8          920
ADU21576.1  7.68e-211  29           913        22         798

PDB Hits

Hit ID  E-Value    Query Start  Query End  Hit Start  Hit End
1UT9_A  3.22e-133  260          915        7          605
Description: Chain A, CELLULOSE 1,4-BETA-CELLOBIOSIDASE [Acetivibrio thermocellus]
1RQ5_A  9.15e-133  260          917        7          607
Description: Chain A, Cellobiohydrolase [Acetivibrio thermocellus]
3X17_A  7.35e-57   262          912        20         554
Description: Crystal structure of metagenome-derived glycoside hydrolase family 9 endoglucanase [uncultured bacterium], 3X17_B Crystal structure of metagenome-derived glycoside hydrolase family 9 endoglucanase [uncultured bacterium]
6DHT_A  1.03e-55   260          915        18         565
Description: Bacteroides ovatus GH9 Bacova_02649 [Bacteroides ovatus ATCC 8483]
5U0H_A  2.29e-41   269          916        3          542
Description: Crystal structure of GH family 9 endoglucanase J30 [Thermobacillus composti KWC4]

Swiss-Prot Hits

Hit ID  E-Value    Query Start  Query End  Hit Start  Hit End
A3DCH1  1.07e-152  36           916        38         813
Description: Cellulose 1,4-beta-cellobiosidase OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celK PE=3 SV=1
P0C2S1  4.13e-152  36           916        38         813
Description: Cellulose 1,4-beta-cellobiosidase OS=Acetivibrio thermocellus OX=1515 GN=celK PE=1 SV=1
Q05156  1.38e-101  26           917        17         745
Description: Cellulase 1 OS=Streptomyces reticuli OX=1926 GN=cel1 PE=1 SV=1
P10476  1.86e-98   262          918        39         602
Description: Endoglucanase A OS=Cellvibrio japonicus (strain Ueda107) OX=498211 GN=celA PE=3 SV=2
P14090  4.99e-97   192          917        283        910
Description: Endoglucanase C OS=Cellulomonas fimi (strain ATCC 484 / DSM 20113 / JCM 1341 / NBRC 15513 / NCIMB 8980 / NCTC 7547) OX=590998 GN=cenC PE=1 SV=2

SignalP and Lipop Annotations

This protein is predicted as SP.

Other     SP_Sec_SPI  LIPO_Sec_SPII  TAT_Tat_SPI  TATLIP_Sec_SPII  PILIN_Sec_SPIII
0.001333  0.997073    0.000232       0.000844     0.000261         0.000219
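
The "SP" call can be related to the per-class scores in the table above. The sketch below simply selects the highest-scoring class; that this is how the page derives its label is an assumption, and the helper name `top_class` is hypothetical.

```python
# Class labels and probabilities copied from the SignalP and Lipop table of this record.
labels = ["Other", "SP_Sec_SPI", "LIPO_Sec_SPII", "TAT_Tat_SPI",
          "TATLIP_Sec_SPII", "PILIN_Sec_SPIII"]
probs = [0.001333, 0.997073, 0.000232, 0.000844, 0.000261, 0.000219]


def top_class(labels, probs):
    """Return the (label, probability) pair with the highest probability."""
    best = max(range(len(probs)), key=probs.__getitem__)
    return labels[best], probs[best]


best_label, best_prob = top_class(labels, probs)
print(best_label, best_prob)
```

For this record the top class is SP_Sec_SPI (a Sec signal peptide cleaved by signal peptidase I), which agrees with the "SP" prediction shown.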

TMHMM Annotations

Start  End
13     35