CAZyme3D



Information for CAZyme ID: QOL01085.1

Basic Information

GenBank ID: QOL01085.1
Family: GH31
Sequence Length: 1214
UniProt ID: A0A7L9QE47 (100, 100)
Average pLDDT: 74.76
CAZy50 ID: 7101
CAZy50 Rep: No, BDA46793.1
Structure Cluster: -
EC Number(s): -
Substrate(s): -

Taxonomy

Tax ID: 464287
Kingdom: Eukaryota
Phylum: Chlorophyta
Class: Trebouxiophyceae
Order: Chlorellales
Family: Oocystaceae
Genus: Pseudococcomyxa
Species: Pseudococcomyxa simplex

Protein Sequence:
Residue color legend (pLDDT bins):
90 < pLDDT <= 100
70 < pLDDT <= 90
50 < pLDDT <= 70
0 <= pLDDT <= 50

MASASKRQQR  PFCSPSATLC  TLAIVASFGA  LVRCQSGGFG  SPPPAAGQCD  AAGPRQECGW  60
NGIEDWKCAS  KGCCYDAKTP  TQVGTANVKV  TTPVCFKPNG  GASNYDLNGG  FTAAANGNGL  120
QGTLKQSGSG  TQPELGPDIT  TLSILVENVT  PDILHAKIGA  PGRWEIPKSI  FLAPNVTASN  180
GPASYQFNYS  VSPFTFAVAR  SDSNGQALFN  TVGTRLVFKD  QYMEISTSVP  ETAALYGLGE  240
RTSSTGLELR  RDGIPLALWN  RDHQAALPDQ  NVYGSHPILM  DVREDGTAHG  VLLLNSNAMD  300
VVLTQSRVQW  RVTGGVLDFY  FLMGPTPNAI  LDQLTTIIGR  PVMPPYWSLG  LMNSKYGYGS  360
AEFYQQILNG  YGNASIPLET  FVSDSQYMNH  DEDFTLGDKF  PLSDMKDFLN  RIRAQGQRWV  420
PILDPPIHIR  KGYEPYDSGI  KEDVFMKDIS  GKPYVGQLWP  GAVHWPDFKN  PNTTTWWTRM  480
IKGVYDDLPL  DGLWIDMNEP  SNYCTGDVCW  NDDTVPPRND  FVCMIGCVSG  RDQVLATAGN  540
KSVTLNESYF  NPPYTINNGD  NAYNISYKTV  AVTAYHYDGT  LVYNAHNLYG  MLETLATTSA  600
LQSLRNKRQF  ILTRSTFLGS  GAYAAHWTGD  TNSKWEDMRW  SIPTILNNGI  AGISFSGADI  660
CGFMMKATDE  LCSRWAAVGA  FYPYARNHHS  DGWQEFFRWE  GTSIVARKVL  ATRYRLLPYL  720
YTAFFDSHTY  GCPVARPLFF  TFPADNTTRN  IAEQWMMGDA  LLVSPILYEK  TTTVRAYFPK  780
GTWYDFYSGR  VLDATNGGKW  DYVTAEMTDN  VPLHVLGGNI  IPMALGSEFM  LTQAVRNASH  840
ALVVAFPKAN  STYAGDRCGG  RCGGAPQAGV  QNACGHMYLD  QGEELNMTRS  LNNYLNLASQ  900
LVQQASGSYK  GFLSATFAGT  PGGSSGATCG  KDTWTWPTID  TVIVMGVGPV  DGDSIVIQAV  960
NAASATPGTV  QTASVDSTPG  VTNLSAGTAK  YDAALQKLTI  SGLNFQLTCP  IGLRVSWNSG  1020
APAAAPASAP  ASTGAPAALF  GTAVKEPASP  PNSVVSPPGG  PGSASSPPSN  FGSPSGASTE  1080
SSRSSSDAPA  QGSPAAFPSS  SSGSPSPSGS  SSGSPGSSSS  SSGGSSSSPS  GSGSPSYSSY  1140
SSSSGSSSSS  PSSYSSPSSY  SSSPNYGSSS  SSSPSYTPSP  TYYSSPSRSP  PRSPSGGGGG  1200
GPGGGGPGGG  NFFG  1214
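
The sequence block above is laid out in groups of ten residues, with a running residue count at the end of each line. To reuse the sequence in other tools, the spacing and counters have to be stripped first. The following is a minimal Python sketch of that cleanup; the function names `strip_counters` and `check_counters` are illustrative and not part of CAZyme3D.

```python
import re


def strip_counters(block: str) -> str:
    """Return the bare amino-acid sequence from a numbered display block.

    Everything that is not a letter (spaces, newlines, position counters)
    is discarded.
    """
    return re.sub(r"[^A-Za-z]", "", block).upper()


def check_counters(block: str) -> int:
    """Verify that each line's trailing counter equals the running length.

    Returns the total number of residues; raises ValueError on a mismatch.
    """
    total = 0
    for line in block.splitlines():
        residues = re.sub(r"[^A-Za-z]", "", line)
        if not residues:
            continue  # skip blank lines
        total += len(residues)
        counter = re.search(r"(\d+)\s*$", line)
        if counter and int(counter.group(1)) != total:
            raise ValueError(
                f"counter {counter.group(1)} does not match running length {total}"
            )
    return total


# First line of the block above; works whether or not the counter is
# separated from the last residue by whitespace.
first_line = (
    "MASASKRQQR  PFCSPSATLC  TLAIVASFGA  LVRCQSGGFG  SPPPAAGQCD  AAGPRQECGW60"
)
print(len(strip_counters(first_line)))  # 60
print(check_counters(first_line))       # 60
```

Applied to the whole block, `check_counters` should return the sequence length reported under Basic Information if the copy is intact.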

3D structure predicted by AlphaFold2, with average pLDDT = 74.76

pLDDT is a per-residue confidence score for the predicted structure and represents the quality of the prediction at each residue. A higher value indicates better prediction accuracy. For more details, please see AlphaFold.

Residues are colored according to pLDDT (blue: high quality; red: low quality).
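
The four pLDDT ranges listed in the legend of the Protein Sequence section can be expressed as a small lookup. This is only an illustrative sketch: the function name `plddt_bin` is hypothetical, and the bin boundaries are taken directly from that legend.

```python
def plddt_bin(score: float) -> str:
    """Return the legend bin that a pLDDT score (0-100) falls into."""
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be within [0, 100], got {score}")
    if score > 90:
        return "90 < pLDDT <= 100"
    if score > 70:
        return "70 < pLDDT <= 90"
    if score > 50:
        return "50 < pLDDT <= 70"
    return "0 <= pLDDT <= 50"


# The average pLDDT reported for this entry.
print(plddt_bin(74.76))  # 70 < pLDDT <= 90
```

Note that the average falls in the second bin even though individual residues may fall in any of the four.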

Carbohydrate-binding residues predicted by CAPSIF

Binding-site residues were not predicted because this entry is not a representative ID (CAZyme3D-ID50).

Full Sequence (colored by CAZyme class: AA, CE, PL, GH, GT, CBM)

dbCAN3 predicted domain(s): GH31 (319-821)


Predicted CAZyme domains from dbCAN

Domains are colored according to CAZyme class: AA, CE, PL, GH, GT, CBM, and Null.
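
The domain annotation reported for this entry has the form family(start-end), here GH31 with the range 319-821. A short Python sketch of how such a string could be parsed and used to cut the domain out of a sequence follows. It assumes the coordinates are 1-based and inclusive, which this page does not state explicitly; the names `Domain`, `parse_domain`, and `domain_slice` are hypothetical.

```python
import re
from typing import NamedTuple


class Domain(NamedTuple):
    family: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive

    @property
    def length(self) -> int:
        return self.end - self.start + 1


def parse_domain(annotation: str) -> Domain:
    """Parse a string such as 'GH31(319-821)' into a Domain."""
    m = re.fullmatch(r"\s*(\w+)\s*\(\s*(\d+)\s*-\s*(\d+)\s*\)\s*", annotation)
    if m is None:
        raise ValueError(f"unrecognized domain annotation: {annotation!r}")
    start, end = int(m.group(2)), int(m.group(3))
    if start < 1 or end < start:
        raise ValueError(f"invalid domain range: {start}-{end}")
    return Domain(m.group(1), start, end)


def domain_slice(sequence: str, domain: Domain) -> str:
    """Return the residues covered by the domain (1-based inclusive range)."""
    if domain.end > len(sequence):
        raise ValueError("domain extends beyond the end of the sequence")
    return sequence[domain.start - 1 : domain.end]


d = parse_domain("GH31(319-821)")
print(d.family, d.length)  # GH31 503
```

Under the stated coordinate assumption, the predicted GH31 domain covers 503 of the 1214 residues.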

The dbCAN3 server is a web server for automated Carbohydrate-active enzyme ANnotation.

Details:
dbCAN3 server integrates three state-of-the-art tools/databases for automated CAZyme annotation:
⋆ HMMER search for CAZyme family annotation vs. the dbCAN CAZyme domain HMM database
⋆ DIAMOND search for BLAST hits in the CAZy database
⋆ HMMER search for CAZyme subfamily annotation vs. the dbCAN-sub HMM database of CAZyme subfamilies (derived from eCAMI classification of CAZyDB families)

For more details, please see dbCAN3.
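
The text above lists the three tools that dbCAN3 integrates but does not say how their results are combined. One simple possibility, shown here purely as an assumption, is to keep a family only when at least two of the three tools report it. The function name `consensus_families`, the `min_tools` parameter, and the example inputs are all hypothetical and are not results for this entry.

```python
from collections import Counter
from typing import Iterable, List


def consensus_families(
    hmmer: Iterable[str],
    diamond: Iterable[str],
    dbcan_sub: Iterable[str],
    min_tools: int = 2,
) -> List[str]:
    """Return families reported by at least `min_tools` of the three tools.

    Assumption: each argument is a collection of family names from one tool.
    """
    votes: Counter = Counter()
    for calls in (hmmer, diamond, dbcan_sub):
        votes.update(set(calls))  # each tool votes once per family
    return sorted(family for family, n in votes.items() if n >= min_tools)


# Hypothetical tool outputs, for illustration only.
print(consensus_families({"GH31"}, {"GH31", "CBM20"}, set()))  # ['GH31']
```

Counting each tool at most once per family keeps repeated hits from a single tool from inflating the vote.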

Similarities between sequences in the same cluster, from DIAMOND