You are browsing environment: HUMAN GUT

CAZyme Information: MGYG000002140_02644

Basic Information

Species: CAG-510 sp900551115
Lineage: Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; CAG-510; CAG-510 sp900551115
CAZyme ID: MGYG000002140_02644
CAZy Family: GH5
CAZyme Description: hypothetical protein

CAZyme Property
Protein Length (aa): 917
CGC: MGYG000002140_23|CGC1
Molecular Weight (Da): 98709.01
Isoelectric Point: 4.1079

Genome Property
Genome Assembly ID: MGYG000002140
Genome Size (bp): 3321781
Genome Type: MAG
Country: United States
Continent: North America
Gene Location: Start 21982; End 24735; Strand +

Enzyme Prediction

No EC number is predicted for MGYG000002140_02644.

CAZyme Signature Domains

Family  Start  End  E-value  Family coverage
GH5     75     350  1.9e-91  0.9927536231884058

CDD Domains

Cdd ID     Domain     E-value   qStart  qEnd  sStart  sEnd
pfam00150  Cellulase  1.15e-59  75      348   15      267
    Description: Cellulase (glycosyl hydrolase family 5).
COG2730    BglC       1.50e-26  51      353   46      363
    Description: Aryl-phospho-beta-D-glucosidase BglC, GH1 family [Carbohydrate transport and metabolism].
pfam02368  Big_2      0.009     503     579   1       77
    Description: Bacterial Ig-like domain (group 2). This family consists of bacterial domains with an Ig-like fold. Members of this family are found in bacterial and phage surface proteins such as intimins.

CAZyme Hits

Hit ID      E-value    Query Start  Query End  Hit Start  Hit End
AUO18792.1  1.66e-228  34           842        240        1044
AUO19859.1  2.11e-109  42           583        107        646
BCN29385.1  4.21e-74   44           744        349        951
AIQ53446.1  5.31e-74   13           385        5          400
QAA35398.1  1.61e-72   14           385        11         403

PDB Hits

Hit ID  E-value   Query Start  Query End  Hit Start  Hit End
6WQP_A  2.39e-72  40           383        9          354
    Description: GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis], 6WQP_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis], 6WQV_A GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis], 6WQV_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis], 6WQV_C GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis], 6WQV_D GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis]
4IM4_A  9.89e-72  44           384        2          334
    Description: Chain A, Endoglucanase E [Acetivibrio thermocellus], 4IM4_B Chain B, Endoglucanase E [Acetivibrio thermocellus], 4IM4_C Chain C, Endoglucanase E [Acetivibrio thermocellus], 4IM4_D Chain D, Endoglucanase E [Acetivibrio thermocellus], 4IM4_E Chain E, Endoglucanase E [Acetivibrio thermocellus], 4IM4_F Chain F, Endoglucanase E [Acetivibrio thermocellus]
2JEP_A  3.89e-70  49           385        35         394
    Description: Native family 5 xyloglucanase from Paenibacillus pabuli [Paenibacillus pabuli], 2JEP_B Native family 5 xyloglucanase from Paenibacillus pabuli [Paenibacillus pabuli], 2JEQ_A Family 5 xyloglucanase from Paenibacillus pabuli in complex with ligand [Paenibacillus pabuli]
6WQY_A  4.80e-66  48           386        23         383
    Description: Chain A, Cellulase [Phocaeicola salanitronis DSM 18170]
3NDY_A  5.59e-66  45           385        8          341
    Description: The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans [Clostridium cellulovorans], 3NDY_B The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans [Clostridium cellulovorans], 3NDY_C The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans [Clostridium cellulovorans], 3NDY_D The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans [Clostridium cellulovorans], 3NDZ_A The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans bound to cellotriose [Clostridium cellulovorans], 3NDZ_B The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans bound to cellotriose [Clostridium cellulovorans], 3NDZ_C The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans bound to cellotriose [Clostridium cellulovorans], 3NDZ_D The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans bound to cellotriose [Clostridium cellulovorans]

Swiss-Prot Hits

Hit ID  E-value   Query Start  Query End  Hit Start  Hit End
O08342  1.77e-69  14           385        10         399
    Description: Endoglucanase A OS=Paenibacillus barcinonensis OX=198119 GN=celA PE=1 SV=1
P10477  3.60e-67  23           384        31         384
    Description: Cellulase/esterase CelE OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celE PE=1 SV=2
P28621  1.15e-64  5            384        7          373
    Description: Endoglucanase B OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engB PE=3 SV=1
P28623  2.62e-63  5            385        8          372
    Description: Endoglucanase D OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engD PE=1 SV=2
P17901  4.27e-61  8            386        2          401
    Description: Endoglucanase A OS=Ruminiclostridium cellulolyticum (strain ATCC 35319 / DSM 5812 / JCM 6584 / H10) OX=394503 GN=celCCA PE=1 SV=1

SignalP and Lipop Annotations

This protein is predicted to carry a signal peptide (SP); the highest-scoring class is SP_Sec_SPI (0.999070).

Other     SP_Sec_SPI  LIPO_Sec_SPII  TAT_Tat_SPI  TATLIP_Sec_SPII  PILIN_Sec_SPIII
0.000256  0.999070    0.000201       0.000162     0.000148         0.000132

TMHMM Annotations

Start  End
886    908