You are browsing environment: HUMAN GUT

CAZyme Information: MGYG000003104_01605



Basic Information

Species UMGS1601 sp900545345
Lineage Bacteria; Firmicutes_A; Clostridia; Oscillospirales; Ruminococcaceae; UMGS1601; UMGS1601 sp900545345
CAZyme ID MGYG000003104_01605
CAZy Family GH5
CAZyme Description hypothetical protein
CAZyme Property
Protein Length 783
CGC (none listed)
Molecular Weight 87138.99
Isoelectric Point 9.5863
Genome Property
Genome Assembly ID MGYG000003104
Genome Size 2017717
Genome Type MAG
Country Spain
Continent Europe
Gene Location Start: 1094;  End: 3445  Strand: -
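
The gene coordinates and the protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the span includes one stop codon; neither assumption is stated on this page, and the function name `protein_length_from_coords` is hypothetical.

```python
def protein_length_from_coords(start: int, end: int, has_stop_codon: bool = True) -> int:
    """Expected protein length (amino acids) for a gene spanning start..end.

    Assumes 1-based, inclusive coordinates (an assumption, not confirmed
    by the record itself).
    """
    nt_length = end - start + 1
    if nt_length % 3 != 0:
        raise ValueError(f"span of {nt_length} nt is not a whole number of codons")
    codons = nt_length // 3
    # Drop the stop codon if the span is assumed to include it.
    return codons - 1 if has_stop_codon else codons


# Gene Location: Start 1094, End 3445 -> 2352 nt -> 784 codons -> 783 aa
print(protein_length_from_coords(1094, 3445))  # 783
```

Under these assumptions the result agrees with the reported Protein Length of 783.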


Enzyme Prediction

No EC number prediction in MGYG000003104_01605.

CAZyme Signature Domains

Family Start End E-Value Family Coverage
GH5 65 372 9.6e-98 0.9927536231884058

CDD Domains

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00150 Cellulase 8.22e-63 57 373 7 270
Cellulase (glycosyl hydrolase family 5).
COG2730 BglC 1.42e-32 21 337 26 326
Aryl-phospho-beta-D-glucosidase BglC, GH1 family [Carbohydrate transport and metabolism].
PRK15313 PRK15313 9.79e-06 612 658 569 617
intestinal colonization autotransporter adhesin MisL.
NF033839 PspC_subgroup_2 8.20e-05 611 660 320 371
pneumococcal surface protein PspC, LPXTG-anchored form. The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
NF033839 PspC_subgroup_2 1.12e-04 611 665 386 451
pneumococcal surface protein PspC, LPXTG-anchored form. The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.

CAZyme Hits

Hit ID E-Value Query Start Query End Hit Start Hit End
QAA35398.1 6.89e-153 33 399 38 401
ADL50682.1 2.58e-149 1 403 1 404
BAV13070.1 3.17e-149 1 403 7 410
QYR19601.1 9.72e-149 1 404 3 405
AAR65336.1 2.52e-148 4 397 3 395

PDB Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
2JEP_A 4.36e-146 29 397 30 390
Native family 5 xyloglucanase from Paenibacillus pabuli [Paenibacillus pabuli], 2JEP_B Native family 5 xyloglucanase from Paenibacillus pabuli [Paenibacillus pabuli], 2JEQ_A Family 5 xyloglucanase from Paenibacillus pabuli in complex with ligand [Paenibacillus pabuli]
6WQY_A 3.77e-79 41 385 26 364
Chain A, Cellulase [Phocaeicola salanitronis DSM 18170]
6PZ7_A 1.91e-74 35 399 5 335
GH5-4 broad specificity endoglucanase from Clostridium acetobutylicum [Clostridium acetobutylicum ATCC 824]
6WQP_A 5.08e-70 36 396 13 351
GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis], 6WQP_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis], 6WQV_A GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis], 6WQV_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis], 6WQV_C GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis], 6WQV_D GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis]
3NDY_A 7.70e-68 36 398 9 338
The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans [Clostridium cellulovorans], 3NDY_B The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans [Clostridium cellulovorans], 3NDY_C The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans [Clostridium cellulovorans], 3NDY_D The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans [Clostridium cellulovorans], 3NDZ_A The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans bound to cellotriose [Clostridium cellulovorans], 3NDZ_B The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans bound to cellotriose [Clostridium cellulovorans], 3NDZ_C The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans bound to cellotriose [Clostridium cellulovorans], 3NDZ_D The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans bound to cellotriose [Clostridium cellulovorans]

Swiss-Prot Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
O08342 5.61e-142 27 397 28 395
Endoglucanase A OS=Paenibacillus barcinonensis OX=198119 GN=celA PE=1 SV=1
P28621 5.82e-66 36 398 39 371
Endoglucanase B OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engB PE=3 SV=1
P28623 1.01e-65 36 398 40 369
Endoglucanase D OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engD PE=1 SV=2
P10477 4.82e-61 36 397 54 381
Cellulase/esterase CelE OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celE PE=1 SV=2
P17901 2.16e-58 43 398 51 397
Endoglucanase A OS=Ruminiclostridium cellulolyticum (strain ATCC 35319 / DSM 5812 / JCM 6584 / H10) OX=394503 GN=celCCA PE=1 SV=1

SignalP and Lipop Annotations

This protein is predicted to have a signal peptide (SP).

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000224 0.999069 0.000164 0.000199 0.000170 0.000153
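
The reported class is consistent with taking the highest of the six per-class scores above. A minimal sketch, assuming the call is a plain argmax over those scores (the page does not state the decision rule); the dictionary layout and the helper name `top_prediction` are hypothetical.

```python
# Per-class scores copied from the table above.
signalp_scores = {
    "Other": 0.000224,
    "SP_Sec_SPI": 0.999069,
    "LIPO_Sec_SPII": 0.000164,
    "TAT_Tat_SPI": 0.000199,
    "TATLIP_Sec_SPII": 0.000170,
    "PILIN_Sec_SPIII": 0.000153,
}


def top_prediction(scores: dict[str, float]) -> tuple[str, float]:
    """Return the (label, score) pair with the highest score."""
    label = max(scores, key=scores.get)
    return label, scores[label]


print(top_prediction(signalp_scores))  # ('SP_Sec_SPI', 0.999069)
```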

TMHMM Annotations

start end
7 29