logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000004402_00311

You are here: Home > Sequence: MGYG000004402_00311

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species RC9 sp000432515
Lineage Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; UBA932; RC9; RC9 sp000432515
CAZyme ID MGYG000004402_00311
CAZy Family GH5
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
512 55948.94 5.8857
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000004402 3046184 MAG Israel Asia
Gene Location Start: 9114;  End: 10652  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

EC 3.2.1.4

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH5 166 460 5.2e-94 0.9927536231884058

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00150 Cellulase 3.24e-54 158 463 7 272
Cellulase (glycosyl hydrolase family 5).
COG2730 BglC 6.07e-19 128 423 33 322
Aryl-phospho-beta-D-glucosidase BglC, GH1 family [Carbohydrate transport and metabolism].
cd14948 BACON 6.94e-16 35 117 2 83
Bacteroidetes-Associated Carbohydrate-binding (putative) Often N-terminal (BACON) domain. The BACON domain is found in diverse domain architectures and accociated with a wide variety of domains, including carbohydrate-active enzymes and proteases. It was named for its suggested function of carbohydrate binding; the latter was inferred from domain architectures, sequence conservation, and phyletic distribution. However, recent experimental data suggest that its primary function in Bacteroides ovatus endo-xyloglucanase BoGH5A is to distance the catalytic module from the cell surface and confer additional mobility to the catalytic domain for attack of the polysaccharide. No evidence for a direct role in carbohydrate binding could be found in that case. The large majority of BACON domains are found in Bacteroidetes.
pfam19190 BACON_2 2.89e-10 34 119 1 91
Viral BACON domain. This family represents a distinct class of BACON domains found in crAss-like phages, the most common viral family in the human gut, in which they are found in tail fiber genes. This suggests they may play a role in phage-host interactions.
pfam13004 BACON 4.99e-08 61 117 2 61
Putative binding domain, N-terminal. The BACON (Bacteroidetes-Associated Carbohydrate-binding Often N-terminal) domain is an all-beta domain found in diverse architectures, principally in combination with carbohydrate-active enzymes and proteases. These architectures suggest a carbohydrate-binding function which is also supported by the nature of BACON's few conserved amino-acids. The phyletic distribution of BACON and other data tentatively suggest that it may frequently function to bind mucin. Further work with the characterized structure of a member of glycoside hydrolase family 5 enzyme, Structure 3ZMR, has found no evidence for carbohydrate-binding for this domain.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
ALJ60576.1 4.61e-177 64 512 67 512
QUT88430.1 6.13e-173 141 512 225 598
AIF26005.1 7.17e-164 142 512 32 402
QRX62664.1 5.95e-160 88 512 90 513
SJX74201.1 3.50e-159 14 511 6 511

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
6WQY_A 1.53e-152 143 495 27 383
ChainA, Cellulase [Phocaeicola salanitronis DSM 18170]
4YHE_A 4.11e-130 135 511 2 385
NativeBacteroidetes-affiliated Gh5 Cellulase Linked With A Polysaccharide Utilization Locus [Bacteroidetes bacterium AC2a],4YHE_B Native Bacteroidetes-affiliated Gh5 Cellulase Linked With A Polysaccharide Utilization Locus [Bacteroidetes bacterium AC2a]
4YHG_A 3.30e-129 135 511 2 385
NativeBacteroidetes-affiliated Gh5 Cellulase Linked With A Polysaccharide Utilization Locus [Bacteroidetes bacterium AC2a],4YHG_B Native Bacteroidetes-affiliated Gh5 Cellulase Linked With A Polysaccharide Utilization Locus [Bacteroidetes bacterium AC2a]
2JEP_A 6.43e-66 139 460 34 364
Nativefamily 5 xyloglucanase from Paenibacillus pabuli [Paenibacillus pabuli],2JEP_B Native family 5 xyloglucanase from Paenibacillus pabuli [Paenibacillus pabuli],2JEQ_A Family 5 xyloglucanase from Paenibacillus pabuli in complex with ligand [Paenibacillus pabuli]
6WQP_A 4.31e-60 149 476 25 340
GH5-4broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis],6WQP_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis],6WQV_A GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_C GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_D GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
O08342 1.00e-60 143 460 43 369
Endoglucanase A OS=Paenibacillus barcinonensis OX=198119 GN=celA PE=1 SV=1
P17901 2.53e-57 138 479 45 387
Endoglucanase A OS=Ruminiclostridium cellulolyticum (strain ATCC 35319 / DSM 5812 / JCM 6584 / H10) OX=394503 GN=celCCA PE=1 SV=1
Q12647 1.28e-56 143 476 28 347
Endoglucanase B OS=Neocallimastix patriciarum OX=4758 GN=CELB PE=2 SV=1
P28623 8.58e-54 140 475 43 355
Endoglucanase D OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engD PE=1 SV=2
P23660 4.10e-53 139 476 26 347
Endoglucanase A OS=Ruminococcus albus OX=1264 GN=celA PE=1 SV=1

SignalP and Lipop Annotations help

This protein is predicted as LIPO

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000000 0.000001 1.000063 0.000000 0.000000 0.000000

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000004402_00311.