logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000000150_01565

You are here: Home > Sequence: MGYG000000150_01565

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Hungatella sp005845265
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; Hungatella; Hungatella sp005845265
CAZyme ID MGYG000000150_01565
CAZy Family GH59
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
2412 MGYG000000150_5|CGC4 265508.69 4.2612
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000000150 7470188 Isolate United Kingdom Europe
Gene Location Start: 254833;  End: 262071  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000000150_01565.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH59 53 759 3.4e-177 0.9857369255150554

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam02057 Glyco_hydro_59 3.21e-106 55 394 1 291
Glycosyl hydrolase family 59.
COG5263 COG5263 1.73e-25 2286 2412 188 313
Glucan-binding domain (YG repeat) [Carbohydrate transport and metabolism].
NF033838 PspC_subgroup_1 1.08e-23 2285 2409 499 622
pneumococcal surface protein PspC, choline-binding form. The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
NF033930 pneumo_PspA 1.46e-23 2291 2394 483 582
pneumococcal surface protein A. The pneumococcal surface protein proteins, found in Streptococcus pneumoniae, are repetitive, with patterns of localized high sequence identity across pairs of proteins given different specific names that recombination may be presumed. This protein, PspA, has an N-terminal region that lacks a cross-wall-targeting YSIRK type extended signal peptide, in contrast to the closely related choline-binding protein CbpA which has a similar C-terminus but a YSIRK-containing region at the N-terminus.
NF033838 PspC_subgroup_1 3.75e-23 2291 2410 486 603
pneumococcal surface protein PspC, choline-binding form. The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
AYA75976.1 2.04e-261 35 1094 35 1264
QNM42518.1 1.72e-252 42 1023 41 1114
QLD11141.1 1.47e-251 41 1614 62 1439
QJW36728.1 4.08e-251 41 1351 67 1193
AEI43030.1 5.46e-249 4 1084 5 1262

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
4CCC_A 2.81e-34 55 759 25 650
StructureOf Mouse Galactocerebrosidase With 4nbdg: Enzyme-substrate Complex [Mus musculus],4CCD_A Structure Of Mouse Galactocerebrosidase With D-galactal: Enzyme-intermediate Complex [Mus musculus],4CCE_A Structure Of Mouse Galactocerebrosidase With Galactose: Enzyme-product Complex [Mus musculus],4UFH_A Mouse Galactocerebrosidase complexed with iso-galacto-fagomine IGF [Mus musculus],4UFI_A Mouse Galactocerebrosidase complexed with aza-galacto-fagomine AGF [Mus musculus],4UFJ_A Mouse Galactocerebrosidase complexed with iso-galacto-fagomine lactam IGL [Mus musculus],4UFK_A Mouse Galactocerebrosidase complexed with dideoxy-imino-lyxitol DIL [Mus musculus],4UFL_A Mouse Galactocerebrosidase complexed with deoxy-galacto-noeurostegine DGN [Mus musculus],4UFM_A Mouse Galactocerebrosidase complexed with 1-deoxy-galacto-nojirimycin DGJ [Mus musculus],5NXB_A Mouse galactocerebrosidase in complex with saposin A [Mus musculus],5NXB_B Mouse galactocerebrosidase in complex with saposin A [Mus musculus],6Y6S_A Chain A, Galactocerebrosidase [Mus musculus],6Y6T_A Chain A, Galactocerebrosidase [Mus musculus]
3ZR5_A 2.86e-34 55 759 27 652
STRUCTUREOF GALACTOCEREBROSIDASE FROM MOUSE [Mus musculus],3ZR6_A STRUCTURE OF GALACTOCEREBROSIDASE FROM MOUSE IN COMPLEX WITH GALACTOSE [Mus musculus]
4FDW_A 3.67e-09 1355 1422 21 91
Crystalstructure of a putative cell surface protein (BACOVA_01565) from Bacteroides ovatus ATCC 8483 at 2.05 A resolution [Bacteroides ovatus ATCC 8483]
4FD0_A 4.61e-08 1355 1422 24 94
Crystalstructure of a putative cell surface protein (BACCAC_03700) from Bacteroides caccae ATCC 43185 at 2.07 A resolution [Bacteroides caccae ATCC 43185]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
Q0VA39 1.47e-35 33 759 31 672
Galactocerebrosidase OS=Xenopus tropicalis OX=8364 GN=galc PE=2 SV=1
Q498K0 1.43e-34 33 759 31 671
Galactocerebrosidase OS=Xenopus laevis OX=8355 GN=galc PE=2 SV=2
P54803 1.12e-33 55 759 55 681
Galactocerebrosidase OS=Homo sapiens OX=9606 GN=GALC PE=1 SV=3
P54804 1.75e-33 55 759 39 665
Galactocerebrosidase OS=Canis lupus familiaris OX=9615 GN=GALC PE=1 SV=1
P54818 1.97e-33 55 759 55 680
Galactocerebrosidase OS=Mus musculus OX=10090 GN=Galc PE=1 SV=2

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000432 0.998780 0.000181 0.000229 0.000187 0.000168

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000000150_01565.