logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000003005_01967

You are here: Home > Sequence: MGYG000003005_01967

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species
Lineage Bacteria; Firmicutes_A; Clostridia; Oscillospirales; Ruminococcaceae; Ruminococcus;
CAZyme ID MGYG000003005_01967
CAZy Family CBM79
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
688 74452.72 4.0661
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000003005 2620760 MAG United States North America
Gene Location Start: 1998;  End: 4064  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000003005_01967.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH5 390 657 1.1e-98 0.9891304347826086
CBM79 89 199 6.5e-29 0.9545454545454546
CBM79 225 326 6.3e-25 0.9636363636363636

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00150 Cellulase 4.02e-66 378 657 4 269
Cellulase (glycosyl hydrolase family 5).
COG2730 BglC 8.97e-37 362 664 46 367
Aryl-phospho-beta-D-glucosidase BglC, GH1 family [Carbohydrate transport and metabolism].
pfam18522 DUF5620 6.78e-27 88 201 1 119
Domain of unknown function (DUF5620). This is a domain of unknown function predicted to be a carbohydrate binding module.
pfam18522 DUF5620 4.96e-17 225 327 1 118
Domain of unknown function (DUF5620). This is a domain of unknown function predicted to be a carbohydrate binding module.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
EWM54675.1 2.75e-296 59 688 53 683
CDE11886.1 9.47e-216 64 688 63 710
ERJ92483.1 1.92e-213 64 688 86 734
CCZ83818.1 1.47e-211 62 684 72 713
CDE32105.1 5.36e-205 73 686 71 700

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
6MQ4_A 1.50e-94 362 687 14 349
ChainA, cellulase [Acetivibrio cellulolyticus]
6Q1I_A 5.57e-92 357 688 11 354
GH5-4broad specificity endoglucanase from Clostrdium longisporum [Clostridium longisporum],6Q1I_B GH5-4 broad specificity endoglucanase from Clostrdium longisporum [Clostridium longisporum]
6PZ7_A 5.93e-91 359 675 8 324
GH5-4broad specificity endoglucanase from Clostridium acetobutylicum [Clostridium acetobutylicum ATCC 824]
3NDY_A 2.45e-89 360 686 12 340
Thestructure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans [Clostridium cellulovorans],3NDY_B The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans [Clostridium cellulovorans],3NDY_C The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans [Clostridium cellulovorans],3NDY_D The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans [Clostridium cellulovorans],3NDZ_A The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans bound to cellotriose [Clostridium cellulovorans],3NDZ_B The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans bound to cellotriose [Clostridium cellulovorans],3NDZ_C The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans bound to cellotriose [Clostridium cellulovorans],3NDZ_D The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans bound to cellotriose [Clostridium cellulovorans]
4IM4_A 1.42e-88 359 686 6 334
ChainA, Endoglucanase E [Acetivibrio thermocellus],4IM4_B Chain B, Endoglucanase E [Acetivibrio thermocellus],4IM4_C Chain C, Endoglucanase E [Acetivibrio thermocellus],4IM4_D Chain D, Endoglucanase E [Acetivibrio thermocellus],4IM4_E Chain E, Endoglucanase E [Acetivibrio thermocellus],4IM4_F Chain F, Endoglucanase E [Acetivibrio thermocellus]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P23660 4.93e-92 352 688 17 364
Endoglucanase A OS=Ruminococcus albus OX=1264 GN=celA PE=1 SV=1
P54937 4.55e-90 357 688 36 379
Endoglucanase A OS=Clostridium longisporum OX=1523 GN=celA PE=1 SV=1
P28621 1.23e-87 349 686 30 373
Endoglucanase B OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engB PE=3 SV=1
P28623 1.34e-87 360 686 43 371
Endoglucanase D OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engD PE=1 SV=2
P10477 1.79e-82 359 686 56 384
Cellulase/esterase CelE OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celE PE=1 SV=2

SignalP and Lipop Annotations help

This protein is predicted as LIPO

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000000 0.000001 1.000017 0.000000 0.000000 0.000000

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000003005_01967.