Information for CAZyme ID: QGG54517.1
Basic Information
GenBank ID | QGG54517.1 |
Family | CBM61, GH53 |
Sequence Length | 1224 |
UniProt ID | A0A5Q2NJZ1(100,100) |
Average pLDDT | 82.89 |
CAZy50 ID | 5796 |
CAZy50 Rep | No, QJC50504.1 |
Structure Cluster | - |
EC Number(s) | - |
Substrate(s) | - |
Taxonomy
Tax ID | 2660554 |
Kingdom | Bacteria |
Phylum | Bacillota |
Class | Bacilli |
Order | Bacillales |
Family | Paenibacillaceae |
Genus | Paenibacillus |
Species | Paenibacillus sp. B01 |
Protein Sequence, colored by pLDDT band: 90 < pLDDT <= 100; 70 < pLDDT <= 90; 50 < pLDDT <= 70; 0 <= pLDDT <= 50
MHTKLLRRLG AAMLALVIGL AGLGLPAGGA VADAAASSDP DGFIKGVDIS TLQALEDKGV | 60 |
AFYDDGTERD LLAILKEHGV NYVRLRLWND PVQADGYNDK AHLIEMAKRV KAAGMGLLVD | 120 |
FHYSDFWADP GQQVKPAAWA SLGFDELKAA LYGYTREVMD ELLAEGAYPD MVQIGNEINS | 180 |
GMLLPDGSTS RFAQLAQLLS EGVRAVRETT PAGHETKIML HLAEGGSNAK FRTFFDQARQ | 240 |
QGIDYDVIGL SYYPYWHGTF QELKSNMDNL AARYGKEVVV AETAYPYTLE DADGHGNIAG | 300 |
EAQTKLTGFE ASVASQKLVT ELILNTVAHV QGGKGLGVFY WEPAWLAGVG WKAGEGNAWE | 360 |
NQAMFDFDGN ALESLDAFRF VPGDLDDIKP LLVYPSLDVT ASAGSAPELP AEAEVLLSEG | 420 |
SIEKRAVVWD EPADPEQWKV PGTYALQGSV QGVDAAMRAA VTVKVVDNPN LVRNPGFEQD | 480 |
GLAGWTLSGT EGAAKLSKEA GNAHSGGHSV NYYYGTEYGY RISQTVTGLE NGTYRLSAWA | 540 |
SGGGGETKLR LFAEGYGGDP LGADAVNAGW NVWKPYAVEE IEVTNGQVTI GFDVEAPGGT | 600 |
WGYLDDFELV KAPEKNPVQN PGFESGDLTG WTLGGTAGAG KVENNAANAH AGTHAFNYWY | 660 |
EDPYRFTLTQ IVSQLPKGVY ELTAAASGGG GETKLQLYAE TGIGSRWNAD IVNTGWNVWK | 720 |
EYKVSGIEVA NGQVTIGFDV EAPGAAWGYF DDIRLTRTGD LPGTGEPPAT GEPPATGEPP | 780 |
ATGEPPATGQ PPATGGPPAT GEPPATGEPP ATGQPPATGE PPATGQPPAT PTPTPTPERT | 840 |
PAPTATPAPS AAASPTPSAG SGIVAVQAGQ LSGDAGAAAV LKLDGAATVR LGAELKPRLA | 900 |
DGLRLELPGL SAELSGDELR QLWQQAGDAP LSIRLSAADG VGEQLAPLQQ PGSRAYAAAG | 960 |
GSVELAIGSI AADGRELPLE AASIRLKLSA AAGFDPERSG IFRLDANGAP AYQLGSRLAG | 1020 |
GDWTADVRSA GRYAVLELSV AYADVPAGHW AQVGIASLSA KGIVQGAGAA GFLPAKSVSR | 1080 |
AELAAMLVRA LGLSASGSAA AGFADVSADA WYAGAVSAAL DAGILRGQAD GIAAPQALLS | 1140 |
REQMAAMLVR AAAAAGKPLE AAAGDRPAAA DAAAIGAWAA PSVAAAYEAG LLQGDANGAF | 1200 |
RPQAELTRAE AAAAVQRLLK LLQA | 1224 |
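The sequence above is laid out in groups of ten residues, with a running position count between pipes at the end of each line. A minimal sketch of turning that layout back into one contiguous string is shown here; the function name `parse_sequence_block` is hypothetical, and only the first two lines of the record are used as input.

```python
def parse_sequence_block(lines):
    """Join grouped residue lines such as 'MHTK... | 60 |' into one string."""
    residues = []
    for line in lines:
        # Keep only the text before the first '|' (the residue groups),
        # dropping the running position counter.
        body = line.split("|", 1)[0]
        # Remove the spaces that separate the ten-residue groups.
        residues.append("".join(body.split()))
    return "".join(residues)


# First two lines of the sequence block in this record.
block = [
    "MHTKLLRRLG AAMLALVIGL AGLGLPAGGA VADAAASSDP DGFIKGVDIS TLQALEDKGV | 60 |",
    "AFYDDGTERD LLAILKEHGV NYVRLRLWND PVQADGYNDK AHLIEMAKRV KAAGMGLLVD | 120 |",
]
seq = parse_sequence_block(block)
print(len(seq))  # 120
```

Applied to all 21 lines of the block, the result should have the length given in the Sequence Length field (1224).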
Predicted 3D structure by AlphaFold2 with pLDDT = 82.89
pLDDT is a per-residue confidence score for the predicted structure, on a scale from 0 to 100. A higher value indicates a more reliable prediction. For more details, please see AlphaFold.
Residues are colored according to pLDDT (blue: high quality; red: low quality).
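As an illustration of the four pLDDT bands listed in the sequence legend, the sketch below assigns a score to its band. The function name `plddt_band` is hypothetical. Note that the value reported for this entry is an average, so individual residues may fall in other bands.

```python
def plddt_band(score):
    """Return the legend band that a pLDDT score (0 to 100) falls into."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "90 < pLDDT <= 100"
    if score > 70:
        return "70 < pLDDT <= 90"
    if score > 50:
        return "50 < pLDDT <= 70"
    return "0 <= pLDDT <= 50"


# Average pLDDT reported for QGG54517.1.
print(plddt_band(82.89))  # 70 < pLDDT <= 90
```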
Carbohydrate-binding residues predicted by CAPSIF
Binding site residues are not predicted, since this is not a representative ID (CAZyme3D-ID50).
dbCAN3 predicted domain(s) : GH53(44-378)+CBM61(470-609)+CBM61(615-755)+SLH(1043-1082)+SLH(1103-1142)+SLH(1171-1211)
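The domain string above uses the pattern `NAME(start-end)` joined by `+`. A minimal sketch of parsing it into tuples follows; `parse_domains` and `DOMAIN_RE` are hypothetical names, and the format is assumed only from this one line, so other records may need a looser pattern.

```python
import re

# One domain looks like 'GH53(44-378)': a name, then start-end in parentheses.
DOMAIN_RE = re.compile(r"([A-Za-z0-9_]+)\((\d+)-(\d+)\)")


def parse_domains(architecture):
    """Return a list of (name, start, end) tuples with 1-based inclusive bounds."""
    return [(name, int(start), int(end))
            for name, start, end in DOMAIN_RE.findall(architecture)]


arch = ("GH53(44-378)+CBM61(470-609)+CBM61(615-755)"
        "+SLH(1043-1082)+SLH(1103-1142)+SLH(1171-1211)")
domains = parse_domains(arch)
for name, start, end in domains:
    # Length counts both endpoints, hence the +1.
    print(name, start, end, end - start + 1)
```

With the 1-based inclusive convention assumed here, the GH53 domain spans 335 residues, and every domain ends before the reported sequence length of 1224.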
Predicted CAZyme domains from dbCAN
Domains are colored according to CAZyme class: AA, CE, PL, GH, GT, CBM, and Null.
The dbCAN3 server is a web server for automated Carbohydrate-active enzyme ANnotation.
Details:
The dbCAN3 server integrates three state-of-the-art tools/databases for automated CAZyme annotation:
⋆ HMMER search for CAZyme family annotation vs. the dbCAN CAZyme domain HMM database
⋆ DIAMOND search for BLAST hits in the CAZy database
⋆ HMMER search for CAZyme subfamily annotation vs. the dbCAN-sub HMM database of CAZyme subfamilies (derived from eCAMI classification of CAZyDB families)
For more details, please see dbCAN3.