logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000000909_00943

You are here: Home > Sequence: MGYG000000909_00943

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species UBA9502 sp900538475
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; UBA9502; UBA9502 sp900538475
CAZyme ID MGYG000000909_00943
CAZy Family GH59
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
2275 MGYG000000909_6|CGC2 249667.73 4.1314
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000000909 3035568 MAG China Asia
Gene Location Start: 78528;  End: 85355  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

EC 3.2.1.23

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH59 556 1281 4e-165 0.993660855784469

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam02057 Glyco_hydro_59 1.77e-85 561 894 1 292
Glycosyl hydrolase family 59.
NF033838 PspC_subgroup_1 8.40e-33 2007 2271 315 604
pneumococcal surface protein PspC, choline-binding form. The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
NF033930 pneumo_PspA 2.34e-31 1939 2271 239 601
pneumococcal surface protein A. The pneumococcal surface protein proteins, found in Streptococcus pneumoniae, are repetitive, with patterns of localized high sequence identity across pairs of proteins given different specific names that recombination may be presumed. This protein, PspA, has an N-terminal region that lacks a cross-wall-targeting YSIRK type extended signal peptide, in contrast to the closely related choline-binding protein CbpA which has a similar C-terminus but a YSIRK-containing region at the N-terminus.
COG5263 COG5263 4.33e-25 2150 2271 175 294
Glucan-binding domain (YG repeat) [Carbohydrate transport and metabolism].
NF033840 PspC_relate_1 2.80e-24 2172 2275 512 612
PspC-related protein choline-binding protein 1. Members of this family share C-terminal homology to the choline-binding form of the pneumococcal surface antigen PspC, but not to its allelic LPXTG-anchored forms because they lack the choline-binding repeat region. Members of this family should not be confused with PspC itself, whose identity and function reflect regions N-terminal to the choline-binding region. See Iannelli, et al. (PMID: 11891047) for information about the different allelic forms of PspC.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QUL53185.1 4.44e-280 540 1763 315 1533
QOR70567.1 5.39e-271 542 1752 341 1526
QNK59157.1 1.21e-267 541 1838 314 1650
ACL75594.1 3.14e-250 533 1418 25 910
ABG76970.1 3.14e-250 533 1418 25 910

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
3ZR5_A 1.77e-18 558 903 24 329
STRUCTUREOF GALACTOCEREBROSIDASE FROM MOUSE [Mus musculus],3ZR6_A STRUCTURE OF GALACTOCEREBROSIDASE FROM MOUSE IN COMPLEX WITH GALACTOSE [Mus musculus]
4CCC_A 1.77e-18 558 903 22 327
StructureOf Mouse Galactocerebrosidase With 4nbdg: Enzyme-substrate Complex [Mus musculus],4CCD_A Structure Of Mouse Galactocerebrosidase With D-galactal: Enzyme-intermediate Complex [Mus musculus],4CCE_A Structure Of Mouse Galactocerebrosidase With Galactose: Enzyme-product Complex [Mus musculus],4UFH_A Mouse Galactocerebrosidase complexed with iso-galacto-fagomine IGF [Mus musculus],4UFI_A Mouse Galactocerebrosidase complexed with aza-galacto-fagomine AGF [Mus musculus],4UFJ_A Mouse Galactocerebrosidase complexed with iso-galacto-fagomine lactam IGL [Mus musculus],4UFK_A Mouse Galactocerebrosidase complexed with dideoxy-imino-lyxitol DIL [Mus musculus],4UFL_A Mouse Galactocerebrosidase complexed with deoxy-galacto-noeurostegine DGN [Mus musculus],4UFM_A Mouse Galactocerebrosidase complexed with 1-deoxy-galacto-nojirimycin DGJ [Mus musculus],5NXB_A Mouse galactocerebrosidase in complex with saposin A [Mus musculus],5NXB_B Mouse galactocerebrosidase in complex with saposin A [Mus musculus],6Y6S_A Chain A, Galactocerebrosidase [Mus musculus],6Y6T_A Chain A, Galactocerebrosidase [Mus musculus]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
B5X3C1 9.17e-20 558 903 34 339
Galactocerebrosidase OS=Salmo salar OX=8030 GN=galc PE=2 SV=1
Q5SNX7 3.59e-19 558 903 30 334
Galactocerebrosidase OS=Danio rerio OX=7955 GN=galc PE=2 SV=1
P54818 5.98e-18 534 903 35 357
Galactocerebrosidase OS=Mus musculus OX=10090 GN=Galc PE=1 SV=2
P54804 6.18e-16 534 903 19 341
Galactocerebrosidase OS=Canis lupus familiaris OX=9615 GN=GALC PE=1 SV=1
O02791 1.45e-15 558 903 52 357
Galactocerebrosidase OS=Macaca mulatta OX=9544 GN=GALC PE=1 SV=2

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000352 0.998739 0.000227 0.000247 0.000218 0.000186

TMHMM  Annotations      download full data without filtering help

start end
7 29