logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000001697_01724

You are here: Home > Sequence: MGYG000001697_01724

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Erysipelatoclostridium saccharogumia
Lineage Bacteria; Firmicutes; Bacilli; Erysipelotrichales; Erysipelatoclostridiaceae; Erysipelatoclostridium; Erysipelatoclostridium saccharogumia
CAZyme ID MGYG000001697_01724
CAZy Family GH89
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
2174 MGYG000001697_33|CGC1 245029.48 4.1566
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000001697 3141483 Isolate Germany Europe
Gene Location Start: 19546;  End: 26070  Strand: -

Full Sequence      Download help

MKRILKGFLV  IAIIISTFVT  GINNTKAVSD  GEIIYPVNAD  AYIRSGNNAN  NNYNYENITQ60
AHGSQYVGKN  YKVINTKYYP  NGSKIISVMK  LELPSIEEIN  ENKLDTYGFE  FNIFKNPDFN120
KSGQTYHFYY  TDDVNWSETT  LTWNNKPAGI  GTGEILFDFT  IPQGFEYESK  SDDEKRIRID180
ITEKIKELIE  KGITAITIYV  EGELKADTSL  MIHSKESGDG  SLGAKLVASN  SNYSKNRLEQ240
LINQCMSIEK  KNYTKTSYDL  LTASLDDAKA  VLESGTTVEI  DNAYAALLAA  KEALIAEYPV300
LEDGYIRSDK  GTSVYNYENI  TQAHGSQYVG  KDYKVINSKY  YPNGNEIIGV  MKFALPTLEE360
ISESDFDTFK  LSFNMFKNPG  FNNGAQTYHF  YYSDDTNWSE  SSLTWNNKPT  SIKHDGENLL420
CDFTIVQGDE  YETKPDLQKA  VNIDISSKIK  ELTKNGINEI  TVFVCAENKL  DTSIMFHSKE480
SGDGTLGAKI  VGSYQNYRGQ  LETLINECKE  LDSSNYTEES  FSVLTQVLNE  VQQEYVEGTS540
LENHVSYEKL  LKAKEALVLV  KDLEDIDNIA  YHKPTRSNLS  KTQSDKVTDG  DLTTYWSGSF600
YPSYVDIDLM  DTYELTKLKI  YTLEGKNCWY  AVYGSNDGST  YDRIYQKHVD  SLATNEGDEI660
VFDTAQSYRI  LRVYIEYTQG  DDKAYLTEVK  AYGTALQTNE  DVLKEGTLEE  ILDVTSYDES720
EYAATITKEE  TIENVYGIID  RTIGEEYRNW  FSFELVDDNS  ENDWYEISDQ  NDKIHIKGNE780
GLSLTTGLNY  YFKNYLNVHI  SEQTMQVNMP  DELVKVNEII  KKETPYQVRY  AYNYCTLSYT840
FAFFGEEQWQ  RENDWLALNG  VNVVLDLAGQ  EATWIKFLMN  FGYSFDDAKD  WLSGPAYYAW900
QFMDNMESFG  GPVPDGYVKD  RLELARSSQR  WKRSLGMQTI  LQGYAGMVPT  NFDEYQPDVT960
VIKQGNWNGF  SRPDMIATDS  KEYDLYAQLF  YQAQEFVYGA  TSDYYAVDPF  HEGGIRPAGL1020
SDATIAAEVL  ESMLDYDQDA  VWVVQGWQSN  PTNDLLKGMG  DNREDHVLIV  DLIKYPISDW1080
TKYDQTSYGS  TKLDAKEFNG  TSWAWGLLAN  FGGNPSMHGQ  MQVMVDDIME  AQKTSSHMVG1140
LGIISEAQYD  NPVMYDLIFD  LAWADDDFDL  DDWLNRYIER  RYGGTSQNAQ  MAWQIMKDAN1200
YDHGVRYTNE  LFGMKSKAPQ  DYKEQNIPYG  ADQLESALRL  LIRDFDKFKD  SECYLYDLSE1260
IMRQQVSNYA  VLKYNDVLAA  KNANDLEAFK  KYKEEFLRAF  EILNEVEATQ  QEQLGGEWIG1320
KAQDIAANYD  DFARDSFTMN  AKALITSWGS  RAGHRSLKDY  GWRNYEGIFK  DLYTNIWSDY1380
LNRVEANMEN  GTPLNNINKN  GYFDLYWKWN  MADQEYTRDA  KDSPEEIYEV  LEKVLENCAI1440
SGDVDANIGN  LALTGIVKDS  NDLKADVNAV  NDGDATTIYS  FKAKDEKAPQ  VIVDLIGVFQ1500
LSEIDVIVND  TYDYEIWVSE  DNVDWLKIGD  GISDETKFAV  QGSVRYIKLV  GINNIAQLSI1560
GEIRAYGERV  LPTLNQLENL  IEFCESINTS  GNEESKITDF  NNILNSAKKA  VDNEAAPDEI1620
NTVYWNLYDA  VTDLNLSGIF  NLSLNKPVTA  HNDPSGKSAN  LVDGNLSSAW  DAGRLSLTGA1680
PYEDKITPAW  AIVDLKEEYQ  ISEIEINFGS  NVWHHYTVYG  SVDGEEWFEI  GNKDTDSLPN1740
DIEDTYKLEN  TFARYIKLEI  TNVQLESSGK  RTPVKVNELI  IIGKMKDLLD  KEALQQQVAF1800
AKAIDLEKYQ  DGIEKDTFII  ILTKAIEILD  SATTQQEIDN  VLTALNEAIE  ALESINIEDK1860
TALQIAVDTA  NTLKTQGALD  NVVPAVVNEF  EAALAEAEGI  LADNYADQIT  IDTSFYRLAN1920
AIHMLDFVKG  DKSALEALLE  EANGHEEENY  TTDSWTALQE  AIEAATEVMN  DENALESDVT1980
EALNNLTNAI  GNLVLRADKT  RLQEAYDMVN  GLDKSLYTEA  SVAGITEPMT  NAKAVLDDPD2040
ATQAEVDSAY  EALIRAYLDM  RLIPNKDLLQ  DLINKANGLN  AANYSAKTWN  VMQDALEKAK2100
ATLTDANATQ  EEVDNAKEVL  TKAMAGLETS  NSVKAGDTTA  SVATGDSASI  IQILSMLVML2160
GTLVIMVNVK  KIED2174

Enzyme Prediction      help

No EC number prediction in MGYG000001697_01724.

CAZyme Signature Domains help

Created with Snap10821732643454365276086997810871195130414131521163017391847195620657711427GH8916501778CBM32
Family Start End Evalue family coverage
GH89 771 1427 3e-182 0.9864253393665159
CBM32 1650 1778 1.5e-20 0.8870967741935484

CDD Domains      download full data without filtering help

Created with Snap10821732643454365276086997810871195130414131521163017391847195620658291161NAGLU11751427NAGLU_C735813NAGLU_N16531761F5_F8_type_C64227DNRLRE_dom
Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam05089 NAGLU 2.16e-114 829 1161 1 329
Alpha-N-acetylglucosaminidase (NAGLU) tim-barrel domain. Alpha-N-acetylglucosaminidase, a lysosomal enzyme required for the stepwise degradation of heparan sulfate. Mutations on the alpha-N-acetylglucosaminidase (NAGLU) gene can lead to Mucopolysaccharidosis type IIIB (MPS IIIB; or Sanfilippo syndrome type B) characterized by neurological dysfunction but relatively mild somatic manifestations. The structure shows that the enzyme is composed of three domains. This central domain has a tim barrel fold.
pfam12972 NAGLU_C 6.78e-37 1175 1427 3 256
Alpha-N-acetylglucosaminidase (NAGLU) C-terminal domain. Alpha-N-acetylglucosaminidase, a lysosomal enzyme required for the stepwise degradation of heparan sulfate. Mutations on the alpha-N-acetylglucosaminidase (NAGLU) gene can lead to Mucopolysaccharidosis type IIIB (MPS IIIB; or Sanfilippo syndrome type B) characterized by neurological dysfunction but relatively mild somatic manifestations. The structure shows that the enzyme is composed of three domains. This C-terminal domain has an all alpha helical fold.
pfam12971 NAGLU_N 2.15e-23 735 813 3 80
Alpha-N-acetylglucosaminidase (NAGLU) N-terminal domain. Alpha-N-acetylglucosaminidase, a lysosomal enzyme required for the stepwise degradation of heparan sulfate. Mutations on the alpha-N-acetylglucosaminidase (NAGLU) gene can lead to Mucopolysaccharidosis type IIIB (MPS IIIB; or Sanfilippo syndrome type B) characterized by neurological dysfunction but relatively mild somatic manifestations. The structure shows that the enzyme is composed of three domains. This N-terminal domain has an alpha-beta fold.
pfam00754 F5_F8_type_C 6.83e-11 1653 1761 9 113
F5/8 type C domain. This domain is also known as the discoidin (DS) domain family.
NF033679 DNRLRE_dom 1.06e-08 64 227 7 162
DNRLRE domain. The DNRLRE domain, with a length of about 160 amino acids, appears typically in large, repetitive surface proteins of bacteria and archaea, sometimes repeated several times. It occurs, notably, three times in the C-terminal region of the enzyme disaggregatase from the archaeal species Methanosarcina mazei, each time with the motif DNRLRE, for which the domain is named. Archaeal proteins within this family are described particularly well by the currently more narrowly defined Pfam model, PF06848. Note that the catalytic region of disaggregatase, in the N-terminal portion of the protein, is modeled by a different HMM, PF08480.

CAZyme Hits      help

Created with Snap10821732643454365276086997810871195130414131521163017391847195620655761708AMN35090.1|CBM32|GH895761708AOY53263.1|CBM32|GH895761708AXH51964.1|CBM32|GH895761708ALG48242.1|CBM32|GH895761708BAB80572.1|CBM32|GH89|3.2.1.50
Hit ID E-Value Query Start Query End Hit Start Hit End
AMN35090.1 1.68e-163 576 1708 47 1135
AOY53263.1 3.05e-161 576 1708 47 1135
AXH51964.1 4.13e-161 576 1708 47 1135
ALG48242.1 7.56e-161 576 1708 47 1135
BAB80572.1 1.38e-160 576 1708 47 1135

PDB Hits      download full data without filtering help

Created with Snap108217326434543652760869978108711951304141315211630173918471956206557614412VC9_A57614417MFK_A57614414A4A_A77314164XWH_A
Hit ID E-Value Query Start Query End Hit Start Hit End Description
2VC9_A 8.96e-169 576 1441 13 880
Family89 Glycoside Hydrolase from Clostridium perfringens in complex with 2-acetamido-1,2-dideoxynojirmycin [Clostridium perfringens],2VCA_A Family 89 glycoside hydrolase from Clostridium perfringens in complex with beta-N-acetyl-D-glucosamine [Clostridium perfringens],2VCB_A Family 89 Glycoside Hydrolase from Clostridium perfringens in complex with PUGNAc [Clostridium perfringens],2VCC_A Family 89 Glycoside Hydrolase from Clostridium perfringens [Clostridium perfringens]
7MFK_A 1.13e-168 576 1441 21 888
ChainA, Alpha-N-acetylglucosaminidase family protein [Clostridium perfringens ATCC 13124],7MFL_A Chain A, Alpha-N-acetylglucosaminidase family protein [Clostridium perfringens ATCC 13124]
4A4A_A 1.19e-167 576 1441 36 903
CpGH89(E483Q, E601Q), from Clostridium perfringens, in complex with its substrate GlcNAc-alpha-1,4-galactose [Clostridium perfringens]
4XWH_A 5.32e-67 773 1416 51 692
Crystalstructure of the human N-acetyl-alpha-glucosaminidase [Homo sapiens]

Swiss-Prot Hits      download full data without filtering help

Created with Snap10821732643454365276086997810871195130414131521163017391847195620657731416sp|P54802|ANAG_HUMAN7731384sp|Q9FNA3|NAGLU_ARATH19312108sp|E8MGH9|HYBA2_BIFL218932053sp|P26831|NAGH_CLOPE
Hit ID E-Value Query Start Query End Hit Start Hit End Description
P54802 4.43e-66 773 1416 74 715
Alpha-N-acetylglucosaminidase OS=Homo sapiens OX=9606 GN=NAGLU PE=1 SV=2
Q9FNA3 5.28e-66 773 1384 94 751
Alpha-N-acetylglucosaminidase OS=Arabidopsis thaliana OX=3702 GN=NAGLU PE=2 SV=1
E8MGH9 6.76e-11 1931 2108 1667 1856
Beta-L-arabinobiosidase OS=Bifidobacterium longum subsp. longum (strain ATCC 15707 / DSM 20219 / JCM 1217 / NCTC 11818 / E194b) OX=565042 GN=hypBA2 PE=1 SV=1
P26831 1.06e-06 1893 2053 1389 1561
Hyaluronoglucosaminidase OS=Clostridium perfringens (strain 13 / Type A) OX=195102 GN=nagH PE=1 SV=2

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000563 0.967256 0.031477 0.000249 0.000227 0.000192

TMHMM  Annotations      download full data without filtering help

start end
2150 2169