CAZyme3D



Information for CAZyme ID: BAE93687.1

Basic Information

GenBank ID: BAE93687.1
Family: GH66
Sequence Length: 1200
UniProt ID: Q1MVS0 (100,100)
Average pLDDT: 66.28
CAZy50 ID: 5609
CAZy50 Rep: No, AAA21772.1
Structure Cluster: -
EC Number(s): 3.2.1.11
Substrate(s): alpha-glucan

Taxonomy

Tax ID: 1333
Kingdom: Bacteria
Phylum: Bacillota
Class: Bacilli
Order: Lactobacillales
Family: Streptococcaceae
Genus: Streptococcus
Species: Streptococcus criceti

Protein Sequence:
Residue coloring bands by pLDDT: 90 < pLDDT <= 100; 70 < pLDDT <= 90; 50 < pLDDT <= 70; 0 <= pLDDT <= 50

MLTLPSMLVL  LACGMMFVLP  ARSAHAEDVV  NQPRVADEAS  TASRPASSQT  TLERSQWQED  60
TQPAEEDTSN  VSFKGNSQQE  VASSDSSTEL  QDPAKQTPPE  LISGDSASGS  VPASSAIEQT  120
GSNQEAAQNQ  STSNQGPGVI  RATSAQVTAT  RSVIASQAGD  AILDLSADKA  SYRPGEDVNL  180
SVDFKNTTEQ  EQDITVYADV  YYVDNKLGTY  KFTRHLKAGE  GYKMQAGDLK  IPASQFENNH  240
GYLVKIKVAD  VNNNTLGESS  RAVAVESDWT  KFPRYGIVGG  SQDTNNSLLS  KDADRYRAAL  300
EKLKNMNINS  YFFYDVYKTA  TNPFPSDEAR  FKQDWNTWSG  SEIDTQAVKD  IVNQVHEGGA  360
VAMLYNMILA  ENTNTGEAPA  LPETEYAYNS  DDRGYGAKGQ  PMSYTVKIPK  DGKEEDVQIQ  420
RYYNPTSKQW  QNYIADKMGQ  AMKNGGFDGW  QGDTIGDNEI  YSYADKDSND  PSKKHWLTEG  480
YAEFLRAIKE  KLPNYYLTVN  DVNGEQIYRL  KDGNQDVIYN  EIWPFGGSAL  KDGRNQTEYG  540
DLKARVDEVR  KVTGKSLIVG  AYMEGSEKGG  SKRDAEAGKA  LQTDAVLLTS  ASIAAAGGYH  600
MSLAALANQQ  NEIDGGQGIG  VLQTAYYPTQ  SLKTSSELTR  KNNDYQQFIT  AYENILRDGV  660
ENDDAQVNTY  NSEGKLLSTD  AKGINGHQVW  TYGKKGNNFR  TVQLLNLMGI  NSDWKNEDGS  720
EADKTPDEQT  NLTVKYALGD  VSMEDAQRMA  NQTYVTSPDD  WSKSNMQKVS  ASVETDENGK  780
PVLVINVPKL  TLWDVVYISA  EDEKSAPEQD  QTAAPADQAT  DDKAAQDQVN  QPAAPAEQPQ  840
VPTQAKPVET  DPAASPTAPE  NSAQPEATEQ  PATQDKAEEE  ASQPLAESVE  PQPEAGNQSD  900
EPVTPAGNFS  VEPAAAETPA  TPADEPSTNK  PQDPAAQAQD  PSLTEAPAQP  EQPQVAEPNS  960
ADPAADKAAA  PEAAKPDSAQ  SADPATPATE  PAATDPNQPA  VNPAPTSNEE  PSAGPSAPAM  1020
PAQPAASPEN  ELPSPNSGQD  NNNQTADLAP  EPSSPAVSDN  QGNPSGGNQS  LQVQGQGNPA  1080
DPADNPEGSA  TLPADPADQP  NAGSDQASQP  DDPAANQNSS  ELPKNPLVPG  EVGQPTTPQL  1140
PNLETSTSTS  MENESPSAGQ  SSAKSSEQLP  KTGDKTSGLI  FGISLVFLAL  TGLLAKREKN  1200
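
The listing above groups residues in tens and ends each row with a running position counter, so it cannot be pasted directly into most tools. The following is a minimal sketch of how such a listing could be joined into one contiguous sequence; the function name `parse_numbered_block` is hypothetical and not part of CAZyme3D, and only the first two rows of this entry are used as sample input.

```python
import re


def parse_numbered_block(text: str) -> str:
    """Join a grouped, numbered sequence listing into one residue string.

    Hypothetical helper: everything that is not a letter (spaces, line
    breaks, and the position counter at the end of each row) is discarded.
    """
    return re.sub(r"[^A-Za-z]", "", text).upper()


# First two rows of the listing for BAE93687.1, used as a small sample.
block = """
MLTLPSMLVL  LACGMMFVLP  ARSAHAEDVV  NQPRVADEAS  TASRPASSQT  TLERSQWQED60
TQPAEEDTSN  VSFKGNSQQE  VASSDSSTEL  QDPAKQTPPE  LISGDSASGS  VPASSAIEQT120
"""

seq = parse_numbered_block(block)
print(len(seq))  # 120, matching the counter at the end of the second row
```

Comparing the parsed length with the last position counter (1200 for the full entry) is a quick check that no residues were lost.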

3D structure predicted by AlphaFold2 (average pLDDT = 66.28)

pLDDT is a per-residue confidence score for the predicted structure, on a scale from 0 to 100. A higher value indicates a more reliable prediction for that residue. For more detail, please see AlphaFold.

Residues are colored according to pLDDT (blue: high quality; red: low quality).
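
To make the banding concrete, here is a minimal sketch that assigns a pLDDT value to one of the four bands listed in the sequence legend. The function name `plddt_band` is hypothetical; the thresholds are the ones shown on this page.

```python
def plddt_band(score: float) -> str:
    """Return the legend band that a pLDDT score falls into.

    Hypothetical helper; band edges follow the legend on this page.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "90 < pLDDT <= 100"
    if score > 70:
        return "70 < pLDDT <= 90"
    if score > 50:
        return "50 < pLDDT <= 70"
    return "0 <= pLDDT <= 50"


# Average pLDDT reported for BAE93687.1.
print(plddt_band(66.28))  # 50 < pLDDT <= 70
```

Note that the average hides per-residue variation: individual residues of the same model can fall into any of the four bands.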

Carbohydrate-binding residues predicted by CAPSIF

Binding site residues are not predicted, since this is not a representative ID (CAZyme3D-ID50).

Full Sequence:
Domain coloring classes: AA; CE; PL; GH; GT; CBM

dbCAN3 predicted domain(s): GH66 (164-799)
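
As a sketch of how the domain range might be used, the snippet below extracts a domain from a sequence string and computes its length. It assumes the coordinates are 1-based and inclusive, which is the usual convention for HMMER-style output but is not stated on this page; the helper names are hypothetical, and a toy sequence stands in for the full protein.

```python
def domain_length(start: int, end: int) -> int:
    """Length of a domain given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def domain_slice(sequence: str, start: int, end: int) -> str:
    """Extract residues start..end (1-based, inclusive; an assumption)."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("domain coordinates fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return sequence[start - 1:end]


# GH66 domain boundaries reported for BAE93687.1.
print(domain_length(164, 799))  # 636

# Toy sequence only; the real input would be the 1200-residue protein.
print(domain_slice("ABCDEFGHIJ", 3, 5))  # CDE
```

Under that assumption, the GH66 domain covers 636 of the 1200 residues.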


Predicted CAZyme domains from dbCAN

Domains are colored according to CAZyme classification: AA, CE, PL, GH, GT, CBM, and Null.

The dbCAN3 server is a web server for automated Carbohydrate-active enzyme ANnotation.

Details:
dbCAN3 server integrates three state-of-the-art tools/databases for automated CAZyme annotation:
⋆ HMMER search for CAZyme family annotation vs. dbCAN CAZyme domain HMM database
⋆ DIAMOND search for BLAST hits in the CAZy database
⋆ HMMER search for CAZyme subfamily annotation vs. dbCAN-sub HMM database of CAZyme subfamilies (derived from eCAMI classification of CAZyDB families)
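
When three tools each report family assignments, a downstream user may want to keep only assignments supported by more than one of them. The sketch below illustrates one such majority rule. It is an assumption for illustration, not a description of how dbCAN3 or CAZyme3D combine results; the function name, the threshold of two tools, and the example input (including the extra CBM35 hit) are all hypothetical.

```python
from collections import Counter


def consensus_families(calls: dict[str, set[str]], min_tools: int = 2) -> list[str]:
    """Return families reported by at least `min_tools` tools, sorted.

    Hypothetical post-processing rule, not part of dbCAN3 itself.
    """
    support = Counter(family for families in calls.values() for family in families)
    return sorted(family for family, n in support.items() if n >= min_tools)


# Made-up tool outputs for illustration only.
example_calls = {
    "HMMER (dbCAN)": {"GH66"},
    "DIAMOND (CAZy)": {"GH66", "CBM35"},
    "HMMER (dbCAN-sub)": {"GH66"},
}

print(consensus_families(example_calls))  # ['GH66']
```

Raising `min_tools` to 3 makes the rule stricter, at the cost of dropping families that one tool misses.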

For more details, please see dbCAN3.

Similarities between sequences in the same cluster, from DIAMOND