Information for CAZyme ID: ARK04035.1
Basic Information
GenBank ID | ARK04035.1 |
Family | GH94 |
Sequence Length | 1141 |
UniProt ID | A0A1W6DM75(100,100) |
Average pLDDT | 93.97 |
CAZy50 ID | 9209 |
CAZy50 Rep | No, UKJ62985.1 |
Structure Cluster | - |
EC Number(s) | - |
Substrate(s) | - |
Taxonomy
Tax ID | 1980001 |
Kingdom | Bacteria |
Phylum | Actinomycetota |
Class | Actinomycetes |
Order | Micrococcales |
Family | Promicromonosporaceae |
Genus | Cellulosimicrobium |
Species | Cellulosimicrobium sp. TH-20 |
Protein Sequence (pLDDT bands: 90 < pLDDT <= 100; 70 < pLDDT <= 90; 50 < pLDDT <= 70; 0 <= pLDDT <= 50)
MTLTATETPA RATLTSGGLT VELTGGGDVR AVSTDGLLVN QYLPGEHDRM PGGILLRAAR | 60 |
PDGTVEVARL TGSAPAVTAV EVGADRVVWS GAALGLATRV ALTLDGRTLV WRVDLTAGPA | 120 |
TDAGTRYDVV HAQDLALAPP AAALSSEPYV CQYLLHRALE HPDAGTVLVS RQTMSAQPRL | 180 |
PLAVAFLVEG AVAHLTDSLQ VFTARSRRDG LPHGLLGPVQ PGVLQYEYAM PTLVSRPLDL | 240 |
STGTAVVHAV TVVDADAPGP LAAHLVEVSG WAAAAVAAAA AERPTARFTP SASALRDAPL | 300 |
LAGDELDEAG LLAAVGLSAD DVLLPERDAD GTLLSFFTAT GTHVVDARKD TVTERSHGHV | 360 |
LKAGDDVLPT DDVLSTTAFA PGVFASHVVL GNTTANRLAT VHRHHLNLVR SSGLRVLVDD | 420 |
GRGPRLLGLP SALVLDVGGV RWLYETPLGR VDVRTVAHDR ENRIDVAVRC ERPLHVTATL | 480 |
ELEDEAGGWL AEHVPAAPGD AVVVRPVPGG DVDAHYPDLR YALASSARLV VEEEAVAETG | 540 |
AEAAASGTTR RLTSTTGSGS LTLALTGSLR GTDAALGLLA GALDPLADVD ATLARHVETV | 600 |
RGVVRGLRFA PHATVETQEL DLLVPWYAHD ALIHFLVPHG LEQYSGAAWG TRDVCQGPFE | 660 |
LALAGGRHDV AREIVLRVLA HQHTWGEFPQ WFMFDAYAER YNDSSHGDVV VWPLFALAQY | 720 |
LEASGDLAVL DEHVPFWDHE QRRPAASGPD AAATVRDHVA RLLDHLDRDR LPGTALPAYG | 780 |
EGDWDDTLQP ADPRMRTDLA STWTSALLVQ AAELLARTTA GRDDLAALSG RASTLAAEVR | 840 |
ADLRERALVD GVLAGYVRHG ADGDELVIHP SDTVSGMRYR LIPMTQSIIA GILTPEEAAD | 900 |
HERLVLEHLH FPDGVRLMDH PAAFDEGVPH TFLRAEQAAN VGREIGLMYV HAHIRYVEAL | 960 |
AALGRGRALD ELLRISPVDL GRRLAHAAPR QRNAYFSSSD ADFPDRESFA RDFDGLRDGS | 1020 |
VGVRGGWRVY SSGPGIYLRQ LVQGVLGLTE RGGEIVVDPV LPAAADGLAV DLDLGGRTRR | 1080 |
VAYRVTATGD GVQVRAGSGP DALAPVPTTA RTGDYRQRGV VVATRDLGDA AYVEVTVPAG | 1140 |
S | 1141 |
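The sequence above is laid out in groups of ten residues, with a running position counter in the last column. A minimal sketch of how such a block could be joined back into one continuous string is shown here; the function names `parse_sequence_block` and `check_counters` are hypothetical helpers, not part of any CAZyme3D or dbCAN tool, and only the first two rows of this entry are used as input.

```python
import re


def parse_sequence_block(lines):
    """Join 10-residue groups into one string, dropping the '| N |' counter."""
    residues = []
    for line in lines:
        body = line.split("|", 1)[0]  # text before the counter column
        residues.append(re.sub(r"\s+", "", body))
    return "".join(residues)


def check_counters(lines):
    """Return True if every '| N |' counter equals the cumulative residue count."""
    total = 0
    for line in lines:
        body, _, rest = line.partition("|")
        total += len(re.sub(r"\s+", "", body))
        if int(rest.strip(" |")) != total:
            return False
    return True


# First two rows of the sequence block for ARK04035.1.
example = [
    "MTLTATETPA RATLTSGGLT VELTGGGDVR AVSTDGLLVN QYLPGEHDRM PGGILLRAAR | 60 |",
    "PDGTVEVARL TGSAPAVTAV EVGADRVVWS GAALGLATRV ALTLDGRTLV WRVDLTAGPA | 120 |",
]
seq = parse_sequence_block(example)
counters_ok = check_counters(example)
```

Applied to all rows of the block, the joined string should have the length given under Sequence Length (1141) if the page was copied without loss.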
3D structure predicted by AlphaFold2, with average pLDDT = 93.97
pLDDT is a per-residue estimate of the accuracy of the predicted structure, indicating the quality of the prediction for each residue. A higher value indicates better prediction accuracy. For more detail, please see AlphaFold.
Residues are colored according to pLDDT (blue -> high quality; red -> low quality).
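The four pLDDT bands listed in the sequence legend on this page can be expressed as a small lookup. This is only an illustrative sketch: `plddt_band` is a hypothetical helper name, and the band boundaries are taken directly from the legend rather than from any AlphaFold API.

```python
def plddt_band(score):
    """Map a pLDDT value (0-100) to one of the four bands used in the legend."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "90 < pLDDT <= 100"
    if score > 70:
        return "70 < pLDDT <= 90"
    if score > 50:
        return "50 < pLDDT <= 70"
    return "0 <= pLDDT <= 50"


# The average pLDDT reported for this entry falls in the highest band.
average_band = plddt_band(93.97)
```

Note that the average hides per-residue variation: individual residues of a model with a high average can still fall in the lower bands.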
Carbohydrate-binding residues predicted by CAPSIF
Binding-site residues were not predicted because this entry is not a representative ID (CAZyme3D-ID50).
dbCAN3 predicted domain(s): GH94 (546-1086)
Predicted CAZyme domains from dbCAN
Domains are colored according to CAZyme classification: AA, CE, PL, GH, GT, CBM, and Null.
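The domain annotation reported for this entry, GH94(546-1086), can be turned into coordinates and used to cut the domain out of the full sequence. The sketch below assumes the coordinates are 1-based and inclusive, which is the usual convention for protein domain ranges but is not stated on this page; `parse_domain` and `domain_slice` are hypothetical helper names.

```python
import re


def parse_domain(annotation):
    """Split an annotation such as 'GH94(546-1086)' into (family, start, end)."""
    match = re.fullmatch(r"\s*(\w+)\((\d+)-(\d+)\)\s*", annotation)
    if match is None:
        raise ValueError(f"unrecognized domain annotation: {annotation!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def domain_slice(sequence, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return sequence[start - 1:end]


family, start, end = parse_domain("GH94(546-1086)")
domain_length = end - start + 1          # residues covered by the domain
coverage = domain_length / 1141          # fraction of the 1141-residue protein
```

Under that assumption the GH94 domain spans 541 residues, a little under half of the protein.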
The dbCAN3 server is a web server for automated Carbohydrate-active enzyme ANnotation.
Details:
dbCAN3 server integrates three state-of-the-art tools/databases for automated CAZyme annotation:
⋆ HMMER search for CAZyme family annotation vs. dbCAN CAZyme domain HMM database
⋆ DIAMOND search for BLAST hits in the CAZy database
⋆ HMMER search for CAZyme subfamily annotation vs. dbCAN-sub HMM database of CAZyme subfamilies (derived from eCAMI classification of CAZyDB families)
For more details, please see dbCAN3.
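When three independent tools annotate the same protein, one simple way to combine them is to keep a family only if enough tools agree on it. The sketch below illustrates that idea with made-up inputs; `consensus_families`, the `min_tools=2` threshold, and the example calls are assumptions of this sketch and do not reproduce how the dbCAN3 server itself merges results.

```python
from collections import Counter


def consensus_families(hmmer, diamond, dbcan_sub, min_tools=2):
    """Return families called by at least `min_tools` of the three tools."""
    votes = Counter()
    for calls in (hmmer, diamond, dbcan_sub):
        votes.update(set(calls))  # each tool votes at most once per family
    return sorted(family for family, n in votes.items() if n >= min_tools)


# Hypothetical per-tool calls for a single protein (illustrative values only).
agreed = consensus_families({"GH94"}, {"GH94"}, set())
disagreed = consensus_families({"GH94"}, {"GH5"}, set())
```

Raising `min_tools` to 3 makes the filter stricter, at the cost of dropping proteins that one tool misses.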