You are browsing environment: HUMAN GUT

CAZyme Information: MGYG000003337_00222

Basic Information

Species Ruminococcus_C sp900765125
Lineage Bacteria; Firmicutes_A; Clostridia; Oscillospirales; Ruminococcaceae; Ruminococcus_C; Ruminococcus_C sp900765125
CAZyme ID MGYG000003337_00222
CAZy Family CBM79
CAZyme Description hypothetical protein
CAZyme Property
Protein Length 715
CGC MGYG000003337_26|CGC1
Molecular Weight 78505.01
Isoelectric Point 4.0813
Genome Property
Genome Assembly ID MGYG000003337
Genome Size 2462692
Genome Type MAG
Country China
Continent Asia
Gene Location Start: 8525;  End: 10672  Strand: +
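
The gene coordinates and the reported protein length can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the span includes the stop codon; neither is stated on this page, and the function name is hypothetical.

```python
def protein_length_from_coords(start: int, end: int, includes_stop: bool = True) -> int:
    """Translate a gene span (assumed 1-based, inclusive) into a residue count."""
    nucleotides = end - start + 1
    if nucleotides % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    codons = nucleotides // 3
    # Assumption: the final codon is a stop codon and encodes no residue.
    return codons - 1 if includes_stop else codons


# Values taken from the Gene Location and Protein Length fields above.
length = protein_length_from_coords(8525, 10672)
print(length)
```

Under these assumptions the 2148-nucleotide span corresponds to 716 codons, which agrees with the 715-residue protein length listed above.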

Enzyme Prediction

No EC number is predicted for MGYG000003337_00222.

CAZyme Signature Domains

Family Start End E-Value Family Coverage
GH5 409 681 2.2e-103 0.9891304347826086
CBM79 232 340 1.7e-33 0.9545454545454546
CBM79 96 205 3.8e-27 0.9363636363636364
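
The signature-domain rows are listed by E-value rather than by position, so the domain order along the protein is easier to read once they are sorted by start coordinate. A minimal sketch, with the rows copied from the table above and a hypothetical helper name:

```python
# Signature-domain rows copied from the table above: (family, start, end).
domains = [
    ("GH5", 409, 681),
    ("CBM79", 232, 340),
    ("CBM79", 96, 205),
]


def domain_architecture(rows):
    """Return the families in N- to C-terminal order, joined by '-'."""
    ordered = sorted(rows, key=lambda row: row[1])
    # Reject overlapping domains so the linear order is unambiguous.
    for (_, _, prev_end), (_, next_start, _) in zip(ordered, ordered[1:]):
        if next_start <= prev_end:
            raise ValueError("domains overlap")
    return "-".join(family for family, _, _ in ordered)


architecture = domain_architecture(domains)
print(architecture)  # CBM79-CBM79-GH5
```

Read this way, the two CBM79 modules sit N-terminal to the GH5 catalytic domain.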

CDD Domains

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00150 Cellulase 1.03e-66 409 684 16 272
Cellulase (glycosyl hydrolase family 5).
COG2730 BglC 4.61e-38 383 709 46 389
Aryl-phospho-beta-D-glucosidase BglC, GH1 family [Carbohydrate transport and metabolism].
pfam18522 DUF5620 4.40e-29 93 206 1 118
Domain of unknown function (DUF5620). This is a domain of unknown function predicted to be a carbohydrate binding module.
pfam18522 DUF5620 8.80e-28 232 343 1 119
Domain of unknown function (DUF5620). This is a domain of unknown function predicted to be a carbohydrate binding module.

CAZyme Hits

Hit ID E-Value Query Start Query End Hit Start Hit End
CDE11886.1 9.85e-313 1 715 1 710
CCZ83818.1 1.07e-311 5 715 7 717
ERJ92483.1 4.59e-300 1 715 24 734
CDE32105.1 5.30e-291 1 714 1 701
EWM54675.1 6.46e-195 77 715 66 683

PDB Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
4IM4_A 3.34e-109 379 714 5 335
Chain A, Endoglucanase E [Acetivibrio thermocellus], 4IM4_B Chain B, Endoglucanase E [Acetivibrio thermocellus], 4IM4_C Chain C, Endoglucanase E [Acetivibrio thermocellus], 4IM4_D Chain D, Endoglucanase E [Acetivibrio thermocellus], 4IM4_E Chain E, Endoglucanase E [Acetivibrio thermocellus], 4IM4_F Chain F, Endoglucanase E [Acetivibrio thermocellus]
6MQ4_A 1.28e-107 383 714 14 349
Chain A, cellulase [Acetivibrio cellulolyticus]
6Q1I_A 3.19e-106 380 713 13 352
GH5-4 broad specificity endoglucanase from Clostridium longisporum [Clostridium longisporum], 6Q1I_B GH5-4 broad specificity endoglucanase from Clostridium longisporum [Clostridium longisporum]
6PZ7_A 4.32e-106 379 711 7 334
GH5-4 broad specificity endoglucanase from Clostridium acetobutylicum [Clostridium acetobutylicum ATCC 824]
6WQP_A 5.88e-102 375 712 10 354
GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis], 6WQP_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis], 6WQV_A GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis], 6WQV_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis], 6WQV_C GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis], 6WQV_D GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis]

Swiss-Prot Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P54937 3.26e-104 380 713 38 377
Endoglucanase A OS=Clostridium longisporum OX=1523 GN=celA PE=1 SV=1
P10477 1.13e-103 358 714 34 385
Cellulase/esterase CelE OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celE PE=1 SV=2
P23660 5.77e-98 371 715 17 364
Endoglucanase A OS=Ruminococcus albus OX=1264 GN=celA PE=1 SV=1
P28621 1.71e-96 376 714 37 374
Endoglucanase B OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engB PE=3 SV=1
P28623 1.54e-95 383 713 45 371
Endoglucanase D OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engD PE=1 SV=2

SignalP and Lipop Annotations

This protein is predicted as LIPO

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000000 0.000001 1.000041 0.000000 0.000000 0.000000
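
The LIPO call above presumably corresponds to the highest-scoring class in the score row. The sketch below pairs each column label with its score and picks the maximum; the values are copied verbatim from the page (including the one slightly above 1), and treating the call as a plain argmax is an assumption, not a documented rule of the predictor.

```python
# Column labels and scores copied verbatim from the two rows above.
labels = [
    "Other", "SP_Sec_SPI", "LIPO_Sec_SPII",
    "TAT_Tat_SPI", "TATLIP_Sec_SPII", "PILIN_Sec_SPIII",
]
scores = [0.000000, 0.000001, 1.000041, 0.000000, 0.000000, 0.000000]

# Assumption: the reported class is simply the label with the highest score.
best_label, best_score = max(zip(labels, scores), key=lambda pair: pair[1])
print(best_label, best_score)
```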

TMHMM Annotations

There are no transmembrane helices in MGYG000003337_00222.