CAZyme3D

Information for CAZyme ID: QYE52146.1

Basic Information

GenBank ID: QYE52146.1
Family: CBM1
Sequence Length: 2110
UniProt ID: QYE52146.1 (MOD)
Average pLDDT: 55.84
CAZy50 ID: 1251
CAZy50 Rep: Yes, QYE52146.1
Structure Cluster: SC_CBM1_clus109
EC Number(s): -
Substrate(s): -

Taxonomy

Tax ID: 88834
Kingdom: Eukaryota
Phylum: Oomycota
Class: -
Order: Pythiales
Family: Pythiaceae
Genus: Pythium
Species: Pythium porphyrae

Protein Sequence:
Residues are colored by pLDDT band:
90 < pLDDT <= 100;
70 < pLDDT <= 90;
50 < pLDDT <= 70;
0 <= pLDDT <= 50.

MKLSVVFVGA  IHAAWMALAL  AASDDAFPMA  RVTSADLTET  TTPLATETTT  PLATETPSPT60
SPTQETATPT  ETSSAPTTPA  PVTPIEQCLA  SVIDKVMFQN  DKLISCADET  SYPFYNAPQP120
PTQAQLDAMC  KSTACIDGVA  DALKEEPVEC  IMPLNKLLLR  SEILDRIAVY  CEDGEITTPA180
PTVAEPIPLP  LFPIPGPAAT  PAPGPVPSFP  TPVCSVETLA  PVQTVTDELI  QCATDASFAF240
LPLSRPSEAT  IAKMCASESC  VKVITAALAA  NPTECTAPVG  NIQYRAEFLD  LIGSACEISS300
TTPAPTTTTP  ATTTPATTAP  STPDQVCELA  TLNKLLYEND  EFVTCALDTE  FPLLNFGAAP360
SKEQMDKFCT  SETCSKAFAD  ALEAGPTECR  FPVTQLTLVG  DFLDRVVSYC  KTGVVPTTSP420
SLPPPAPQPT  PAPTPVPAPG  PVPSFPTPVC  SVETLAPVQT  VTDELIQCAT  DASFAFLPLS480
RPSEATIAKM  CASESCVKVI  TAALAANPTE  CTAPVGNIQY  RAEFLDLIGS  ACEISSTTPA540
PTTTTPATTT  PATTAPSTPD  QVCELATLNK  LLYENDEFVT  CALDTEFPLL  NFGAAPSKEQ600
MDKFCTSETC  SKAFADALEA  GPTECRFPVT  QLTLVGDFLD  RVVSYCKTGV  VPTTSPSLPP660
PAPQPTPAPT  PVPAPGPVPS  FPTPVCSVET  LAPVQTVTDE  LIQCATDASF  AFLPLSRPSE720
ATIAKMCASE  SCVKVITAAL  AANPTECTAP  VGNIQYRAEF  LDLIGSACEI  SSTTPAPTTT780
TPATTTPATT  APSTPDQVCE  LATLNKLLYE  NDEFVTCALD  TEFPLLNFGA  APSKEQMDKF840
CTSETCSKAF  ADALEAGPTE  CRFPVTQLTL  VGDFLDRVVS  YCKTGVVPTT  SPSLPPPAPQ900
PTPAPTPVPA  PGPVPSFPTP  VCSVETLAPV  QTVTDELIQC  ATDASFAFLP  LSRPSEATIA960
KMCASESCVK  VITAALAANP  TECTAPVGNI  QYRAEFLDLI  GSACEISSTT  PAPTTTTPAT1020
TTPATTAPST  PDQVCELATL  NKLLYENDEF  VTCALDTEFP  LLNFGAAPSK  EQMDKFCTSE1080
TCSKAFADAL  EAGPTECRFP  VTQLTLVGDF  LDRVVSYCKT  GVVPTTSPSL  PPPAPQPTPA1140
PTPVPAPGPV  PSFPTPVCSV  ETLAPVQTVT  DELIQCATDA  SFAFLPLSRP  SEATIAKMCA1200
SESCVKVITA  ALAANPTECT  APVGNIQYRA  EFLDLIGSAC  EISSTTPAPT  TTTPATTTPA1260
TTAPSTPDQV  CELATLNKLL  YENDEFVTCA  LDTEFPLLNF  GAAPSKEQMD  KFCTSETCSK1320
AFADALEAGP  TECRFPVTQL  TLVGDFLDRV  VSYCKTGVVP  TTSPSLPPPA  PQPTPAPTPV1380
PAPGPVPSFP  TPVCSVETLA  PVQTVTDELI  QCATDASFAF  LPLSRPSEAT  IAKMCASESC1440
VKVITAALAA  NPTECTAPVG  NIQYRAEFLD  LIGSACEISS  TTPAPTTTTP  ATTTPATTAP1500
STPDQVCELA  TLNKLLYEND  EFVTCALDTE  FPLLNFGAAP  SKEQMDKFCT  SETCSKAFAD1560
ALEAGPTECR  FPVTQLTLVG  DFLDRVVSYC  KTGVVPTTSP  SLPPPAPQPT  PAPTPVPAPG1620
PVPSFPTPVC  SVETLAPVQT  VTDELIQCAT  DASFAFLPLS  RPSEATIAKM  CASESCVKVI1680
TAALAANPTE  CTAPVGNIQY  RAEFLDLIGS  ACEISSTTPA  PTTTTPATTT  PATTAPSTPE1740
PTTPASTTPE  PTTPAPITPT  PTAPANGDCG  NDKVGPSQCP  QGQYCQPWNP  SHYQCRSIDA1800
KCGKQEVGID  YFGDDIASLS  VLVPEQCCDK  CRATTGCKAY  TFVNFNADGR  AMCYLKKGTG1860
EKRAAPRAVS  AVIDSSAPTC  AAAGAQCGSD  REGAACCPSG  HHCQPWNPFY  YQCIPSPPKC1920
AAQEVGIDYF  GEDLQTVYGL  QPSGCCDRCA  ETAGCKAYTF  VNYNRDGRTA  CYLKKGRGEK1980
RKMVGAVSST  VITPKPPGCA  TPQWGSCGNE  LGATCCPSGF  YCQPWNPHYY  QCMATPAKCS2040
EQLTDVDFLG  NDIATVFGIT  PEQCCDRCAE  TAGCKAYTFV  NANPGRPACY  LKSSAAGRKT2100
LSGAVSGIVN  2110
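
The sequence above is printed in space-separated groups of ten residues, with a running residue count fused to the end of each line. As a minimal sketch (the function name `parse_grouped_sequence` is hypothetical and not part of CAZyme3D), the listing can be converted back into a plain sequence string like this:

```python
import re


def parse_grouped_sequence(text: str) -> str:
    """Join a sequence printed in 10-residue groups with trailing counts.

    Assumes each line ends with a running residue count (e.g. "...PSPT60")
    and that residues are written as letters only.
    """
    parts = []
    for line in text.splitlines():
        # Drop the trailing running count, e.g. "PSPT60" -> "PSPT".
        line = re.sub(r"\s*\d+\s*$", "", line)
        # Remove the whitespace separating the 10-residue groups.
        parts.append(re.sub(r"\s+", "", line))
    return "".join(parts)


# First two lines of the listing above.
example = (
    "MKLSVVFVGA  IHAAWMALAL  AASDDAFPMA  RVTSADLTET  TTPLATETTT  PLATETPSPT60\n"
    "SPTQETATPT  ETSSAPTTPA  PVTPIEQCLA  SVIDKVMFQN  DKLISCADET  SYPFYNAPQP120\n"
)
seq = parse_grouped_sequence(example)
print(len(seq))  # 120
```

Applied to the full listing, the resulting length can be checked against the Sequence Length field (2110) as a sanity test.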

3D structure predicted by AlphaFold2, with average pLDDT = 55.84

pLDDT is a per-residue confidence score for the predicted structure, reflecting the expected quality of the prediction at each residue. A higher value indicates better prediction accuracy. For more details, please see AlphaFold.

Residues are colored according to pLDDT (blue -> high quality; red -> low quality).
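
The four pLDDT bands listed in the Protein Sequence legend can be expressed as a small lookup. The sketch below is illustrative only; the function name `plddt_band` is an assumption, and the band boundaries are taken directly from the legend on this page.

```python
def plddt_band(plddt: float) -> str:
    """Return the legend band that a pLDDT value falls into."""
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT must be within [0, 100], got {plddt}")
    if plddt > 90:
        return "90 < pLDDT <= 100"
    if plddt > 70:
        return "70 < pLDDT <= 90"
    if plddt > 50:
        return "50 < pLDDT <= 70"
    return "0 <= pLDDT <= 50"


# The average pLDDT reported for this entry.
print(plddt_band(55.84))  # 50 < pLDDT <= 70
```

Note that the entry-level value of 55.84 is an average; individual residues can fall into any of the four bands.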

Carbohydrate-binding residues predicted by CAPSIF from the 3D structure

Residues were colored according to prediction score:

Nonbinder, CAPSIF:G Predicted Binder, CAPSIF:V Predicted Binder, CAPSIF:V and CAPSIF:G Predicted Binder

The CArbohydrate–Protein interaction Site IdentiFier (CAPSIF) predicts non-covalent carbohydrate-binding sites on proteins using two models: (1) a 3D-UNet voxel-based neural network model (CAPSIF:V) and (2) an equivariant graph neural network model (CAPSIF:G).

Details:
⋆B-Factor = 0.0 : Nonbinder.
⋆B-Factor = 40.0 : CAPSIF:G Predicted Binder.
⋆B-Factor = 59.9 : CAPSIF:V Predicted Binder.
⋆B-Factor = 99.9 : CAPSIF:V and CAPSIF:G Predicted Binder.

For more details, please see CAPSIF.
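
The B-factor encoding listed under Details can be decoded back into labels. The following is a minimal sketch, assuming the downloadable structure is a PDB-format file in which the temperature-factor field occupies columns 61-66 of each ATOM record; the helper names and the example ATOM line are hypothetical and not taken from CAZyme3D.

```python
# B-factor values and their meanings, as listed under "Details".
CAPSIF_LABELS = {
    0.0: "Nonbinder",
    40.0: "CAPSIF:G Predicted Binder",
    59.9: "CAPSIF:V Predicted Binder",
    99.9: "CAPSIF:V and CAPSIF:G Predicted Binder",
}


def capsif_label(b_factor: float, tol: float = 0.05) -> str:
    """Map a B-factor value to its CAPSIF prediction label."""
    for value, label in CAPSIF_LABELS.items():
        if abs(b_factor - value) <= tol:
            return label
    raise ValueError(f"Unexpected B-factor value: {b_factor}")


def b_factor_from_atom_line(line: str) -> float:
    """Read the temperature factor (columns 61-66) from a PDB ATOM record."""
    return float(line[60:66])


# Hypothetical ATOM record built with fixed-width PDB columns.
example_line = (
    "ATOM  {:5d} {:<4s}{:1s}{:3s} {:1s}{:4d}{:1s}   "
    "{:8.3f}{:8.3f}{:8.3f}{:6.2f}{:6.2f}"
).format(1, " CA ", "", "MET", "A", 1, "", 0.0, 0.0, 0.0, 1.0, 99.9)

print(capsif_label(b_factor_from_atom_line(example_line)))
# CAPSIF:V and CAPSIF:G Predicted Binder
```

A small tolerance is used when comparing values because B-factors are stored as two-decimal text and parsed as floating-point numbers.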

Predicted CAZyme domains from dbCAN

Domains are colored according to CAZyme classification: (AA), (CE), (PL), (GH), (GT), (CBM), & (Null)

The dbCAN3 server is a web server for automated Carbohydrate-active enzyme ANnotation.

Details:
The dbCAN3 server integrates three state-of-the-art tools/databases for automated CAZyme annotation:
⋆HMMER search for CAZyme family annotation vs. dbCAN CAZyme domain HMM database
⋆DIAMOND search for BLAST hits in the CAZy database
⋆HMMER search for CAZyme subfamily annotation vs. dbCAN-sub HMM database of CAZyme subfamilies (derived from eCAMI classification of CAZyDB families)

For more details, please see dbCAN3.

Similarities between sequences in the same cluster, from DIAMOND