CAZyme3D



Information for CAZyme ID: ARK04035.1

Basic Information

GenBank ID: ARK04035.1
Family: GH94
Sequence Length: 1141
UniProt ID: A0A1W6DM75 (100,100)
Average pLDDT: 93.97
CAZy50 ID: 9209
CAZy50 Rep: No, UKJ62985.1
Structure Cluster: -
EC Number(s): -
Substrate(s): -

Taxonomy

Tax ID: 1980001
Kingdom: Bacteria
Phylum: Actinomycetota
Class: Actinomycetes
Order: Micrococcales
Family: Promicromonosporaceae
Genus: Cellulosimicrobium
Species: Cellulosimicrobium sp. TH-20

Protein Sequence:
pLDDT color bins: 90 < pLDDT <= 100; 70 < pLDDT <= 90; 50 < pLDDT <= 70; 0 <= pLDDT <= 50

MTLTATETPA  RATLTSGGLT  VELTGGGDVR  AVSTDGLLVN  QYLPGEHDRM  PGGILLRAAR  60
PDGTVEVARL  TGSAPAVTAV  EVGADRVVWS  GAALGLATRV  ALTLDGRTLV  WRVDLTAGPA  120
TDAGTRYDVV  HAQDLALAPP  AAALSSEPYV  CQYLLHRALE  HPDAGTVLVS  RQTMSAQPRL  180
PLAVAFLVEG  AVAHLTDSLQ  VFTARSRRDG  LPHGLLGPVQ  PGVLQYEYAM  PTLVSRPLDL  240
STGTAVVHAV  TVVDADAPGP  LAAHLVEVSG  WAAAAVAAAA  AERPTARFTP  SASALRDAPL  300
LAGDELDEAG  LLAAVGLSAD  DVLLPERDAD  GTLLSFFTAT  GTHVVDARKD  TVTERSHGHV  360
LKAGDDVLPT  DDVLSTTAFA  PGVFASHVVL  GNTTANRLAT  VHRHHLNLVR  SSGLRVLVDD  420
GRGPRLLGLP  SALVLDVGGV  RWLYETPLGR  VDVRTVAHDR  ENRIDVAVRC  ERPLHVTATL  480
ELEDEAGGWL  AEHVPAAPGD  AVVVRPVPGG  DVDAHYPDLR  YALASSARLV  VEEEAVAETG  540
AEAAASGTTR  RLTSTTGSGS  LTLALTGSLR  GTDAALGLLA  GALDPLADVD  ATLARHVETV  600
RGVVRGLRFA  PHATVETQEL  DLLVPWYAHD  ALIHFLVPHG  LEQYSGAAWG  TRDVCQGPFE  660
LALAGGRHDV  AREIVLRVLA  HQHTWGEFPQ  WFMFDAYAER  YNDSSHGDVV  VWPLFALAQY  720
LEASGDLAVL  DEHVPFWDHE  QRRPAASGPD  AAATVRDHVA  RLLDHLDRDR  LPGTALPAYG  780
EGDWDDTLQP  ADPRMRTDLA  STWTSALLVQ  AAELLARTTA  GRDDLAALSG  RASTLAAEVR  840
ADLRERALVD  GVLAGYVRHG  ADGDELVIHP  SDTVSGMRYR  LIPMTQSIIA  GILTPEEAAD  900
HERLVLEHLH  FPDGVRLMDH  PAAFDEGVPH  TFLRAEQAAN  VGREIGLMYV  HAHIRYVEAL  960
AALGRGRALD  ELLRISPVDL  GRRLAHAAPR  QRNAYFSSSD  ADFPDRESFA  RDFDGLRDGS  1020
VGVRGGWRVY  SSGPGIYLRQ  LVQGVLGLTE  RGGEIVVDPV  LPAAADGLAV  DLDLGGRTRR  1080
VAYRVTATGD  GVQVRAGSGP  DALAPVPTTA  RTGDYRQRGV  VVATRDLGDA  AYVEVTVPAG  1140
S  1141
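
The sequence block above is laid out in groups of ten residues, with a running residue count at the end of each line. A minimal Python sketch for turning such a block back into a plain sequence string is shown here. The function name `parse_numbered_sequence` is hypothetical and not part of CAZyme3D, and the sketch assumes every non-empty line ends in a counter equal to the number of residues read so far. It accepts the counter whether or not it is separated from the residues by whitespace.

```python
import re

# One line = residue letters (optionally split into groups by whitespace)
# followed by a running residue count. Assumed layout, inferred from the block.
_LINE = re.compile(r"([A-Za-z\s]+?)\s*(\d+)")


def parse_numbered_sequence(block: str) -> str:
    """Return the plain sequence, checking each line's running counter."""
    seq = ""
    for raw in block.splitlines():
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        match = _LINE.fullmatch(line)
        if match is None:
            raise ValueError(f"unrecognized sequence line: {line!r}")
        # Drop the spacing between groups of ten and append the residues.
        seq += re.sub(r"\s+", "", match.group(1)).upper()
        counter = int(match.group(2))
        if len(seq) != counter:
            raise ValueError(
                f"counter {counter} does not match {len(seq)} residues read so far"
            )
    return seq


# First two lines of the block above, used as sample input.
example = """
MTLTATETPA  RATLTSGGLT  VELTGGGDVR  AVSTDGLLVN  QYLPGEHDRM  PGGILLRAAR60
PDGTVEVARL  TGSAPAVTAV  EVGADRVVWS  GAALGLATRV  ALTLDGRTLV  WRVDLTAGPA120
"""
sequence = parse_numbered_sequence(example)
print(len(sequence))  # 120
```

Run on the full block, the same check should end at the stated sequence length of 1141 if the counters are consistent.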

3D structure predicted by AlphaFold2, with average pLDDT = 93.97

pLDDT is a per-residue confidence score for the predicted structure, on a scale from 0 to 100. A higher value indicates a more reliable prediction at that residue. For more detail, please see AlphaFold.

Residues are colored according to pLDDT (blue: high quality; red: low quality).
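
As an illustration of how a per-residue score maps onto the four pLDDT ranges listed with the protein sequence, here is a small Python sketch. The function name `plddt_bin` is hypothetical. The bin edges are taken directly from this page, and the sketch makes no claim about how the site itself assigns colors.

```python
def plddt_bin(score: float) -> str:
    """Return the pLDDT range from this page that contains the given score."""
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must lie in [0, 100], got {score}")
    # Upper edges are inclusive and lower edges exclusive, as in the legend.
    if score > 90:
        return "90 < pLDDT <= 100"
    if score > 70:
        return "70 < pLDDT <= 90"
    if score > 50:
        return "50 < pLDDT <= 70"
    return "0 <= pLDDT <= 50"


# The average pLDDT reported for this entry falls in the top range.
print(plddt_bin(93.97))  # 90 < pLDDT <= 100
```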

Carbohydrate-binding residues predicted by CAPSIF

Binding-site residues are not predicted for this entry, since it is not a representative ID (CAZyme3D-ID50).

Full Sequence:
Domain color key: AA; CE; PL; GH; GT; CBM

dbCAN3 predicted domain(s) : GH94(546-1086)
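
The domain annotation above has the form FAMILY(start-end). The following Python sketch shows one way to parse it and cut the matching region out of a sequence. The names `Domain`, `parse_domains`, and `slice_domain` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Domain:
    family: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive

    @property
    def length(self) -> int:
        return self.end - self.start + 1


def parse_domains(text: str) -> list[Domain]:
    """Find every FAMILY(start-end) annotation in a string."""
    pattern = r"([A-Z]+\d+(?:_\d+)?)\((\d+)-(\d+)\)"
    return [Domain(f, int(s), int(e)) for f, s, e in re.findall(pattern, text)]


def slice_domain(sequence: str, domain: Domain) -> str:
    """Return the residues covered by the domain (1-based inclusive range)."""
    if not 1 <= domain.start <= domain.end <= len(sequence):
        raise ValueError("domain coordinates fall outside the sequence")
    return sequence[domain.start - 1 : domain.end]


domains = parse_domains("dbCAN3 predicted domain(s) : GH94(546-1086)")
gh94 = domains[0]
print(gh94.family, gh94.length)  # GH94 541
```

Under the same assumption, the GH94 domain covers 541 of the 1141 residues, a little under half of the protein.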


Predicted CAZyme domains from dbCAN

Domains are colored according to CAZyme classification: AA, CE, PL, GH, GT, CBM, and Null.

dbCAN3 is a web server for automated Carbohydrate-active enzyme ANnotation.

Details:
The dbCAN3 server integrates three state-of-the-art tools/databases for automated CAZyme annotation:
⋆ HMMER search for CAZyme family annotation vs. the dbCAN CAZyme domain HMM database
⋆ DIAMOND search for BLAST hits in the CAZy database
⋆ HMMER search for CAZyme subfamily annotation vs. the dbCAN-sub HMM database of CAZyme subfamilies (derived from eCAMI classification of CAZyDB families)

For more details, please see dbCAN3.

Similarities between sequences in the same cluster, from DIAMOND