CAZyme3D

You are here: Home Cite us: 2025

Entry ID

Information for CAZyme ID: UEL47268.1

Basic Information

GenBank IDUEL47268.1
FamilyCBM32, CBM51
Sequence Length2217
UniProt IDA0A1I3NSJ0(99.7,100)Download
Average pLDDT?83.44
CAZy50 ID1070
CAZy50 RepYes, UEL47268.1
Structure ClusterSC_CBM32_clus20, SC_CBM32_clus47, SC_CBM32_clus54, SC_CBM51_clus23, SC_CBM51_clus7
EC Number(s)-
Substrates(s)-

Taxonomy

Tax ID2813371
KingdomBacteria
PhylumBacillota
ClassClostridia
OrderPeptostreptococcales
FamilyPeptostreptococcaceae
GenusTerrisporobacter
SpeciesTerrisporobacter hibernicus

Protein Sequence:
90 < plddt <=100;
70 < plddt <= 90;
50 < plddt <= 70;
0 <= plddt <= 50;     Download help

MRKKVSTLVL  ASILSANIAP  TINVFADEVL  KEEAKIIEEN  VLSTAKVSDF  NLENYSNFQE60
YNSKYRLKRD  EIKSISNNGK  QYASSSLDKA  IDGNLSTHWE  TGIENSSTFK  NEVVVEFNNV120
ESIDRIGYAT  RQDGAKGKGY  PNKFEIYASV  SGDNEDFKLV  STGSSSTSRD  MMELKFDKIT180
AKKIKFVFKE  ANQNWASASE  FWFYREDKLL  NKLNNLFTDD  NKNTLNSEFN  TLDKLEALEN240
ESKLHPFYDD  FKEDLSDAKI  ILESGEFDYS  DSKVSKFLSS  EDERLSSYDN  SYKIKQSEVK300
SISANGGQYS  DMSISKAMDG  DFQTRWHSGK  QNTDDFTNQV  VIELNEITTL  NRLVYTLGVS360
RGFAQEFDIY  VSKTSKGDTF  HKISSGTSKV  TKDSVEIRFN  PVDARRIKFV  YKNGYENWAC420
ATEFGLYKQD  KTKEKVERLF  TDSTMNTLSE  EFNTLEKIVA  LENQCKEHIL  YDDFKEDIEN480
AKALLEDGNI  EATKAQVSKF  DVFYTDYKEA  YDNTFKMDNS  NISSISTNGG  NYGSYVKDRM540
IDGDLKTFWE  TGKQNSDSFD  NEIVFTLKEA  KVLDRMSYRS  AENTIGFADE  FEIYASQTTK600
GDTFKLVSNG  TARRTSDMLE  FKFEPTKFKR  IKFVYKKCNA  NKASASEIMF  YSEDQLKDKI660
KNVFTNRLMN  ELSQEYNTLK  KIEELEEGAK  VHPLKEQYQE  IIDLAKSLLN  DQDSANTSRI720
VTGVQRGHYT  TESGKRMVNG  AAYISMESFG  KYVTPGEEIV  VYLDADPDGK  LPELWFGQVG780
KVIDGWTRRI  KLQPGKNTIK  APTNMNCAAV  YLSNQATTKE  QAYAPRARMV  GGTSFPTYFH840
GETDPQEYRK  ELEEYAKKVE  LSDSSFTNGN  PEGKVFNIAE  FVSDNVVITT  SALGALEGLK900
LAEEQDLDIS  DTMTQWEKMY  ELFQTFMGLE  KDADEEKDSF  FPNKFVARVF  QNVPFGFAAH960
GYTGYLGSDN  AQRDGGFFKM  IAAPPNMKGN  DNWCYTHEFG  HILNTKYIVD  GEVTNNLYAQ1020
EYRRINADKG  INEDRGSWDE  IFKRFNGANE  EYAMHYFERL  AVLSQLNIAY  GYDAYAKASK1080
AVRDNTDLIK  SINGYDVQRL  SVAYSLGLGV  NLLDFFEGWN  YSDIEITDQM  RDAVKDLPKP1140
NKKIEYLHGG  AYDYEGDGFT  KDIDVNVKST  LGEEENTLSL  KLSVDKNNQD  DVLGYEILKD1200
GKVVGFTKTN  LFTIREYDET  ADYTIVAYAK  DLSTAQAINS  KSQKPTLSIE  ENITLSIGDK1260
FDAKKYVTAL  DYQGNVIEDI  KVDSNVDTSK  KGNYEVKYTV  THSDLTVEKT  MKVSVVSKIT1320
YASDVQETSY  KVGWGQLGKD  KAPNNTAIEL  DRQGIVTTYT  KGLGAHANSE  VVYDVENYDK1380
FESYIGIDQS  MRDNPNSCAK  FIIYADGEKV  YESKKFTSNR  DHDFVSIDLE  GVKELKLVTD1440
GLGSNGADQT  VWADAKFINN  NTKPIINAQD  VTTVKLNSKF  DIRNGVTAKD  IEDGDLTESI1500
VVEEGNLTTS  KTGKYKVKYI  VTDSNNNTTT  KTREILVYSG  QDYASDTKYT  IQKSDWGGIK1560
NDKAPAGSAI  SVLVDDEETT  FGKGIGAHAN  SEITFDLSDK  NYEFFTSRIG  LDGKERGNTN1620
ASAKFKVLVD  GKEVFESKTF  KTNDDSQVIN  IPVNGASEIK  LITDQANNNN  ASDHTVWADA1680
KFLVTNSKPE  ITTEDVQIEV  GQVIDINKNV  TAKDAEDGDL  TSSVEVISNN  FEKNKIGRFE1740
VVYRVTNSDK  NTTEKTRYIT  AYEVFDVKKS  KYLGFDNLEQ  YNEQFKIPVS  SISNNAGNYG1800
SDVITKAIDN  NIDTHWETNK  PNSNTFKNEV  TFDLGEMQEI  SKMSYAARRA  GKGFATSFSI1860
YVSTKAEGND  FILAGKGNYN  GNASDVVEFD  INDTTARRVK  FVFDSAIENW  ASMGEMSFYK1920
KDELADKVAS  MFTNSNKEEV  TEGYDTLDEI  NALKEEVASH  PANELFQADF  DKAEELVRAT1980
FPTLNIPKSQ  SVKVGETLES  LIGKISATDT  KDGNITSKIK  VTGTDKVNFN  KVGEYEITYS2040
VTDSDNNTVS  KVRKINVVDM  KDFKYLSDYD  WKSANSGWGT  VNKDNSVSSN  VLRLTDEKGQ2100
TVNFEKGIGT  HSTSTIVYDL  SDKNSVRFTS  YVGVDRQMYN  SPGSIQFEVY  VDGEKTFDSG2160
VMNSTTPMKF  VDVDITEAKE  LKLIVKDGGN  GNGSDHATWG  DAKLHYVNEN  SVDKTSL2217

Predicted 3D structure by AlphaFold2 with pLDDT = 83.44 ; Download help

pLDDT is for per-residue accuracy of the structure, which representes the quality of the residue. A higher value indicates better prediction accuracy. More detail please see AlphaFold .

Residues were colored according to plddt ( blue-> high quality; red-> low quality ).

Full Sequence:
CAPSIF:V and CAPSIF:G =99.9;
CAPSIF:V =59.9;
CAPSIF:G =40;
Non-Binding=0;     Download help

MRKKVSTLVL  ASILSANIAP  TINVFADEVL  KEEAKIIEEN  VLSTAKVSDF  NLENYSNFQE60
YNSKYRLKRD  EIKSISNNGK  QYASSSLDKA  IDGNLSTHWE  TGIENSSTFK  NEVVVEFNNV120
ESIDRIGYAT  RQDGAKGKGY  PNKFEIYASV  SGDNEDFKLV  STGSSSTSRD  MMELKFDKIT180
AKKIKFVFKE  ANQNWASASE  FWFYREDKLL  NKLNNLFTDD  NKNTLNSEFN  TLDKLEALEN240
ESKLHPFYDD  FKEDLSDAKI  ILESGEFDYS  DSKVSKFLSS  EDERLSSYDN  SYKIKQSEVK300
SISANGGQYS  DMSISKAMDG  DFQTRWHSGK  QNTDDFTNQV  VIELNEITTL  NRLVYTLGVS360
RGFAQEFDIY  VSKTSKGDTF  HKISSGTSKV  TKDSVEIRFN  PVDARRIKFV  YKNGYENWAC420
ATEFGLYKQD  KTKEKVERLF  TDSTMNTLSE  EFNTLEKIVA  LENQCKEHIL  YDDFKEDIEN480
AKALLEDGNI  EATKAQVSKF  DVFYTDYKEA  YDNTFKMDNS  NISSISTNGG  NYGSYVKDRM540
IDGDLKTFWE  TGKQNSDSFD  NEIVFTLKEA  KVLDRMSYRS  AENTIGFADE  FEIYASQTTK600
GDTFKLVSNG  TARRTSDMLE  FKFEPTKFKR  IKFVYKKCNA  NKASASEIMF  YSEDQLKDKI660
KNVFTNRLMN  ELSQEYNTLK  KIEELEEGAK  VHPLKEQYQE  IIDLAKSLLN  DQDSANTSRI720
VTGVQRGHYT  TESGKRMVNG  AAYISMESFG  KYVTPGEEIV  VYLDADPDGK  LPELWFGQVG780
KVIDGWTRRI  KLQPGKNTIK  APTNMNCAAV  YLSNQATTKE  QAYAPRARMV  GGTSFPTYFH840
GETDPQEYRK  ELEEYAKKVE  LSDSSFTNGN  PEGKVFNIAE  FVSDNVVITT  SALGALEGLK900
LAEEQDLDIS  DTMTQWEKMY  ELFQTFMGLE  KDADEEKDSF  FPNKFVARVF  QNVPFGFAAH960
GYTGYLGSDN  AQRDGGFFKM  IAAPPNMKGN  DNWCYTHEFG  HILNTKYIVD  GEVTNNLYAQ1020
EYRRINADKG  INEDRGSWDE  IFKRFNGANE  EYAMHYFERL  AVLSQLNIAY  GYDAYAKASK1080
AVRDNTDLIK  SINGYDVQRL  SVAYSLGLGV  NLLDFFEGWN  YSDIEITDQM  RDAVKDLPKP1140
NKKIEYLHGG  AYDYEGDGFT  KDIDVNVKST  LGEEENTLSL  KLSVDKNNQD  DVLGYEILKD1200
GKVVGFTKTN  LFTIREYDET  ADYTIVAYAK  DLSTAQAINS  KSQKPTLSIE  ENITLSIGDK1260
FDAKKYVTAL  DYQGNVIEDI  KVDSNVDTSK  KGNYEVKYTV  THSDLTVEKT  MKVSVVSKIT1320
YASDVQETSY  KVGWGQLGKD  KAPNNTAIEL  DRQGIVTTYT  KGLGAHANSE  VVYDVENYDK1380
FESYIGIDQS  MRDNPNSCAK  FIIYADGEKV  YESKKFTSNR  DHDFVSIDLE  GVKELKLVTD1440
GLGSNGADQT  VWADAKFINN  NTKPIINAQD  VTTVKLNSKF  DIRNGVTAKD  IEDGDLTESI1500
VVEEGNLTTS  KTGKYKVKYI  VTDSNNNTTT  KTREILVYSG  QDYASDTKYT  IQKSDWGGIK1560
NDKAPAGSAI  SVLVDDEETT  FGKGIGAHAN  SEITFDLSDK  NYEFFTSRIG  LDGKERGNTN1620
ASAKFKVLVD  GKEVFESKTF  KTNDDSQVIN  IPVNGASEIK  LITDQANNNN  ASDHTVWADA1680
KFLVTNSKPE  ITTEDVQIEV  GQVIDINKNV  TAKDAEDGDL  TSSVEVISNN  FEKNKIGRFE1740
VVYRVTNSDK  NTTEKTRYIT  AYEVFDVKKS  KYLGFDNLEQ  YNEQFKIPVS  SISNNAGNYG1800
SDVITKAIDN  NIDTHWETNK  PNSNTFKNEV  TFDLGEMQEI  SKMSYAARRA  GKGFATSFSI1860
YVSTKAEGND  FILAGKGNYN  GNASDVVEFD  INDTTARRVK  FVFDSAIENW  ASMGEMSFYK1920
KDELADKVAS  MFTNSNKEEV  TEGYDTLDEI  NALKEEVASH  PANELFQADF  DKAEELVRAT1980
FPTLNIPKSQ  SVKVGETLES  LIGKISATDT  KDGNITSKIK  VTGTDKVNFN  KVGEYEITYS2040
VTDSDNNTVS  KVRKINVVDM  KDFKYLSDYD  WKSANSGWGT  VNKDNSVSSN  VLRLTDEKGQ2100
TVNFEKGIGT  HSTSTIVYDL  SDKNSVRFTS  YVGVDRQMYN  SPGSIQFEVY  VDGEKTFDSG2160
VMNSTTPMKF  VDVDITEAKE  LKLIVKDGGN  GNGSDHATWG  DAKLHYVNEN  SVDKTSL2217

Carbohydrate binding residues Predicted by CAPSIF from 3D structure; Download help

Residues were colored according to prediction score:

Nonbinder, CAPSIF:G Predicted Binder, CAPSIF:V Predicted Binder, CAPSIF:V and CAPSIF:G Predicted Binder

CArbohydrate–Protein interaction Site IdentiFier (CAPSIF) that predicts non-covalent carbohydrate-binding sites on proteins: (1) a 3D-UNet voxel-based neural network model (CAPSIF:V) and (2) an equivariant graph neural network model (CAPSIF:G).

Details:
⋆B-Factor = 0.0 : Nonbinder.
⋆B-Factor = 40.0 : CAPSIF:G Predicted Binder.
⋆B-Factor = 59.9 : CAPSIF:V Predicted Binder.
⋆B-Factor = 99.9 : CAPSIF:V and CAPSIF:G Predicted Binder.

For more detail please see CAPSIF.

Full Sequence:
AA;
CE;
PL;
GH;
GT;
CBM;     Download structure help

dbCAN3 predicted domain(s) : CBM32(76-201)+CBM32(300-423)+CBM32(523-648)+CBM51(1321-1457)+CBM51(1543-1682)+CBM32(1790-1916)+CBM51(2065-2204)

MRKKVSTLVL  ASILSANIAP  TINVFADEVL  KEEAKIIEEN  VLSTAKVSDF  NLENYSNFQE60
YNSKYRLKRD  EIKSISNNGK  QYASSSLDKA  IDGNLSTHWE  TGIENSSTFK  NEVVVEFNNV120
ESIDRIGYAT  RQDGAKGKGY  PNKFEIYASV  SGDNEDFKLV  STGSSSTSRD  MMELKFDKIT180
AKKIKFVFKE  ANQNWASASE  FWFYREDKLL  NKLNNLFTDD  NKNTLNSEFN  TLDKLEALEN240
ESKLHPFYDD  FKEDLSDAKI  ILESGEFDYS  DSKVSKFLSS  EDERLSSYDN  SYKIKQSEVK300
SISTNGGQYS  DMSISKAMDG  DFQTRWHSGK  QNTDDFTNQV  VIELNEITTL  NRLVYTLGVS360
RGFAQEFDIY  VSKTSKGDTF  HKISSGTSKV  TKDSVEIRFN  PVDARRIKFV  YKNGYENWAC420
ATEFGLYKQD  KTKEKVERLF  TDSTMNTLSE  EFNTLEKIVA  LENQCKEHIL  YDDFKEDIEN480
AKALLEDGNI  EATKAQVSKF  DVFYTDYKEA  YDNTFKMDNS  NISSISTNGG  NYGSYVKDRM540
IDGDLKTFWE  TGKQNSDSFD  NEIVFTLKEA  KVLDRMSYRS  AENTIGFADE  FEIYASQTTK600
GDTFKLVSNG  TARRTSDMLE  FKFEPTKFKR  IKFVYKKCNS  NKASASEIMF  YSEDQLKDKI660
KNVFTNRLMN  ELSQEYNTLK  KIEELEEGAK  VHPLKEQYQE  IIDLAKSLLN  DQDSANTSRI720
VTGVQRGHYT  TESGKRMVNG  AAYISMESFG  KYVTPGEEIV  VYLDADPDGK  LPELWFGQVG780
KVIDGWTRRI  KLQPGKNTIK  APTNMNCAAV  YLSNQATTKE  QAYAPRARMV  GGTSFPTYFH840
GETDPQEYRK  ELEEYAKKVE  LSDSSFTNGN  PEGKVFNIAE  FVSDNVVITT  SALGALEGLK900
LAEEQDLDIS  DTMTQWEKMY  ELFQTFMGLE  KDADEEKDSF  FPNKFVARVF  QNVPFGFAAH960
GYTGYLGSDN  AQRDGGFFKM  IAAPPNMKGN  DNWCYTHEFG  HILNTKYIVD  GEVTNNLYAQ1020
EYRRINADKG  INEDRGSWDE  IFKRFNGANE  EYAMHYFERL  AVLSQLNIAY  GYDAYAKASK1080
AVRDNTDLIK  SINGNDVQRL  SVAYSLGLGV  NLLDFFEGWN  YSDIEITDQM  RDAVKDLPKP1140
NKKIEYLHGG  AYDYEGDGFT  KDIDVNVKST  LGEEENTLSL  KLSVDKNNQD  DVLGYEILKD1200
GKVVGFTKTN  LFTIREYDET  ADYTIVAYAK  DLSTAQAINS  KSQKPTLSIE  ENITLSIGDK1260
FDAKKYVTAL  DYQGNVIEDI  KVDSNVDTSK  KGNYEVKYTV  THSDSTVEKT  MKVSVVSKIT1320
YASDVQETSY  KVGWGQLGKD  KAPNNTAIEL  DRQGIVTTYT  KGLGAHANSE  VVYDVENYDK1380
FESYIGIDQS  MRDNPNSYAK  FIIYADGEKV  YESKKFTSNR  DHDFVSIDLE  GVKELKLVTD1440
GLGSNGADQT  VWADAKFINN  NTKPIINAQD  VTTVKLNSKF  DIRNGVTAKD  IEDGDLTESI1500
VVEEGNLTTS  KTGKYKVKYI  VTDSNNNTTT  KTREILVYSG  QDYASDTKYT  IQKSDWGGIK1560
NDKAPAGSAI  SVLVDGEETT  FGKGIGAHAN  SEITFDLSDK  NYEFFTSRIG  LDGKERGNTN1620
ASAKFKVLVD  GKEVFESKTF  KTNDDSQVIN  IPVNGASEIK  LITDQANNNN  ASDHTVWADA1680
KFLVTNSKPE  ITTEDVQIEV  GQVIDINKNV  TAKDAEDGDL  TSSVEVISNN  FEKNKIGRFE1740
VVYRVTDSDK  NTTEKTRYIT  AYEVFDVKKS  KYLGFDNLEQ  YNEQFKIPVS  SISNNAGNYG1800
SDVITKAIDN  NIDTHWETNK  PNSNTFKNEV  TFDLGEMQEI  SKMSYAARRA  GKGFATSFSI1860
YVSTKAEGND  FILAGKGNYN  GNASDVVEFD  INDTTARRVK  FVFDSAIENW  ASMGEMSFYK1920
KDELADKVAS  MFTNSNKEEV  TEGYDTLDEI  NALKEEVASH  PANELFQADF  DKAEELVRAT1980
FPTLNIPKSQ  SVKVGETLES  LIGKISATDT  KDGNITSKIK  VTGTDKVNFN  KVGEYEITYS2040
VTDSDNNTVS  KVRKINVVDM  KDFKYLSDYD  WKSANSGWGT  VNKDNSVSSN  VLRLTDEKGQ2100
TVNFEKGIGT  HSTSTIVYDL  SDKNSVRFTS  YVGVDRQMYN  SPGSIQFEVY  VDGEKTFDSG2160
VMNSTTPMKF  VDVDITEAKE  LKLIVKDGGN  GNGSDHATWG  DAKLHYVNEN  SVDKTSL2217

Predicted CAZyme domains from dbCAN; Download help

Domains were colored according to CAZyme classification: (AA), (CE), (PL), (GH), (GT), (CBM), & (Null)

dbCAN3 server is a web server for automated Carbohydrate-active enzyme ANnotation.

Details:
dbCAN3 server integrates three state-of-the-art tools/databases for automated CAZyme annotation:
⋆HMMER search for CAZyme family annotation vs. dbCAN CAZyme domain HMM database
⋆DIAMOND search for BLAST hits in the CAZy database
⋆HMMER search for CAZyme subfamily annotation vs. dbCAN-sub HMM database of CAZyme subfamilies (derived from eCAMI classification of CAZyDB families)

For more details, please see dbCAN3.

Similarites between the same cluster seqeunces from DIAMOND; Download help