logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000000287_03396

You are here: Home > Sequence: MGYG000000287_03396

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Robinsoniella sp900539655
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; Robinsoniella; Robinsoniella sp900539655
CAZyme ID MGYG000000287_03396
CAZy Family CBM51
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
2294 MGYG000000287_56|CGC1 251455.1 4.7197
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000000287 7249439 MAG Sweden Europe
Gene Location Start: 22455;  End: 29339  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000000287_03396.

CAZyme Signature Domains help

Family Start End Evalue family coverage
CBM51 74 213 2.3e-32 0.9850746268656716

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
cd14244 GH_101_like 6.60e-44 648 968 2 294
Endo-a-N-acetylgalactosaminidase and related glcyosyl hydrolases. This family contains the enzymatically active domain of cell surface proteins that specifically cleave Gal-beta-1,3-GalNAc-alpha-Ser/Thr (T-antigen, galacto-N-biose), the core 1 type O-linked glycan common to mucin glycoproteins (EC:3.2.1.97). It has been classified as glycosyl hydrolase family 101 in the Cazy resource. Virulence of pathogenic organisms such as the Gram-positive Streptococcus pneumoniae and other commensal human bacteria is largely determined by their ability to degrade host glycoproteins and to metabolize the resultant carbohydrates.
pfam08305 NPCBM 5.42e-36 72 213 2 135
NPCBM/NEW2 domain. This novel putative carbohydrate binding module (NPCBM) domain is found at the N-terminus of glycosyl hydrolase family 98 proteins. This domain has also been called the NEW2 domain (Naumoff DG. Phylogenetic analysis of alpha-galactosidases of the GH27 family. Molecular Biology (Engl Transl). (2004)38:388-399.)
smart00776 NPCBM 3.31e-28 69 213 1 144
This novel putative carbohydrate binding module (NPCBM) domain is found at the N-terminus of glycosyl hydrolase family 98 proteins.
pfam13290 CHB_HEX_C_1 1.27e-17 1550 1612 4 66
Chitobiase/beta-hexosaminidase C-terminal domain.
sd00036 LRR_3 1.30e-15 2163 2250 62 140
leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QJD87425.1 1.51e-252 66 1265 29 1211
QTH41000.1 2.31e-94 453 1189 167 885
SYX82446.1 9.25e-94 614 1083 98 566
SMF83630.1 3.11e-93 535 1086 28 569
AZS16787.1 1.02e-57 73 878 42 691

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
7JRM_A 5.76e-13 75 212 86 222
ChainA, F5/8 type C domain protein [Clostridium perfringens ATCC 13124]
7JRL_A 6.30e-13 75 212 108 244
ChainA, F5/8 type C domain protein [Clostridium perfringens ATCC 13124]
7JS4_A 1.23e-12 75 212 838 974
ChainA, F5/8 type C domain protein [Clostridium perfringens ATCC 13124]
2VMG_A 1.18e-09 61 192 3 135
Thestructure of CBM51 from Clostridium perfringens GH95 in complex with methyl-galactose [Clostridium perfringens]
6QFO_A 2.66e-09 672 1080 305 753
EngBFDARPin Fusion 9b 3G124 [Bifidobacterium longum]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
A9WNA0 2.32e-11 74 212 1164 1302
Putative endo-alpha-N-acetylgalactosaminidase OS=Renibacterium salmoninarum (strain ATCC 33209 / DSM 20767 / JCM 11484 / NBRC 15589 / NCIMB 2235) OX=288705 GN=RSal33209_1326 PE=3 SV=2
E8MGH9 3.40e-06 1841 2031 1667 1860
Beta-L-arabinobiosidase OS=Bifidobacterium longum subsp. longum (strain ATCC 15707 / DSM 20219 / JCM 1217 / NCTC 11818 / E194b) OX=565042 GN=hypBA2 PE=1 SV=1
Q2MGH6 7.51e-06 672 777 614 718
Endo-alpha-N-acetylgalactosaminidase OS=Streptococcus pneumoniae serotype 4 (strain ATCC BAA-334 / TIGR4) OX=170187 GN=SP_0368 PE=1 SV=1
Q8DR60 7.51e-06 672 777 614 718
Endo-alpha-N-acetylgalactosaminidase OS=Streptococcus pneumoniae (strain ATCC BAA-255 / R6) OX=171101 GN=spr0328 PE=1 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000280 0.999033 0.000193 0.000171 0.000153 0.000132

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000000287_03396.