logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000002573_00690

You are here: Home > Sequence: MGYG000002573_00690

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Ruminococcus_C sp900545285
Lineage Bacteria; Firmicutes_A; Clostridia; Oscillospirales; Ruminococcaceae; Ruminococcus_C; Ruminococcus_C sp900545285
CAZyme ID MGYG000002573_00690
CAZy Family GH5
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
928 MGYG000002573_13|CGC2 99970.27 4.0527
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000002573 2234911 MAG China Asia
Gene Location Start: 30964;  End: 33750  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000002573_00690.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH5 71 350 5.7e-85 0.9855072463768116

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00150 Cellulase 1.07e-50 55 354 1 272
Cellulase (glycosyl hydrolase family 5).
COG2730 BglC 7.20e-23 1 336 4 350
Aryl-phospho-beta-D-glucosidase BglC, GH1 family [Carbohydrate transport and metabolism].
cd14256 Dockerin_I 4.08e-10 862 917 1 56
Type I dockerin repeat domain. Bacterial cohesin domains bind to a complementary protein domain named dockerin, and this interaction is required for the formation of the cellulosome, a cellulose-degrading complex. The cellulosome consists of scaffoldin, a noncatalytic scaffolding polypeptide, that comprises repeating cohesion modules and a single carbohydrate-binding module (CBM). Specific calcium-dependent interactions between cohesins and dockerins appear to be essential for cellulosome assembly. This subfamily represents type I dockerins, which are responsible for anchoring a variety of enzymatic domains to the complex.
cd14253 Dockerin 3.24e-06 863 918 1 56
Dockerin repeat domain. Dockerins are modules in the cellulosome complex that often anchor catalytic subunits by binding to cohesin domains of scaffolding proteins. Three types of dockerins and their corresponding cohesin have been described in the literature. This alignment models two consecutive dockerin repeats, the functional unit.
pfam00404 Dockerin_1 1.98e-05 863 917 1 55
Dockerin type I repeat. The dockerin repeat is the binding partner of the cohesin domain pfam00963. The cohesin-dockerin interaction is the crucial interaction for complex formation in the cellulosome. The dockerin repeats, each bearing homology to the EF-hand calcium-binding loop bind calcium.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
CBL16523.1 1.13e-184 1 548 1 529
QPB75695.1 1.14e-179 1 728 1 719
CAH69214.1 1.59e-90 37 539 20 490
CAL91975.1 1.59e-90 37 539 20 490
AEQ16450.1 9.47e-89 37 543 11 492

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
6XRK_A 9.89e-86 37 381 23 373
GH5-4broad specificity endoglucanase from an uncultured bovine rumen ciliate [uncultured bovine rumen ciliate],6XRK_B GH5-4 broad specificity endoglucanase from an uncultured bovine rumen ciliate [uncultured bovine rumen ciliate]
6WQP_A 5.17e-59 36 366 14 343
GH5-4broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis],6WQP_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis],6WQV_A GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_C GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_D GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis]
6Q1I_A 3.15e-56 37 372 13 346
GH5-4broad specificity endoglucanase from Clostrdium longisporum [Clostridium longisporum],6Q1I_B GH5-4 broad specificity endoglucanase from Clostrdium longisporum [Clostridium longisporum]
4IM4_A 3.05e-52 36 367 5 323
ChainA, Endoglucanase E [Acetivibrio thermocellus],4IM4_B Chain B, Endoglucanase E [Acetivibrio thermocellus],4IM4_C Chain C, Endoglucanase E [Acetivibrio thermocellus],4IM4_D Chain D, Endoglucanase E [Acetivibrio thermocellus],4IM4_E Chain E, Endoglucanase E [Acetivibrio thermocellus],4IM4_F Chain F, Endoglucanase E [Acetivibrio thermocellus]
6MQ4_A 7.94e-51 36 367 10 337
ChainA, cellulase [Acetivibrio cellulolyticus]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P54937 1.21e-54 37 372 38 371
Endoglucanase A OS=Clostridium longisporum OX=1523 GN=celA PE=1 SV=1
P23660 4.23e-50 36 371 25 355
Endoglucanase A OS=Ruminococcus albus OX=1264 GN=celA PE=1 SV=1
P28621 8.93e-49 36 375 40 370
Endoglucanase B OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engB PE=3 SV=1
P10477 4.33e-48 36 367 55 373
Cellulase/esterase CelE OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celE PE=1 SV=2
P28623 3.51e-47 42 384 47 377
Endoglucanase D OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engD PE=1 SV=2

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000344 0.998877 0.000160 0.000240 0.000180 0.000154

TMHMM  Annotations      download full data without filtering help

start end
7 29