Information for CAZyme ID: WHY22263.1
Basic Information
GenBank ID | WHY22263.1 |
Family | CE12, PL11_1 |
Sequence Length | 2123 |
UniProt ID | A0A5B0WS76(95.4,96.9) |
Average pLDDT | 87.16 |
CAZy50 ID | 1223 |
CAZy50 Rep | Yes, WHY22263.1 |
Structure Cluster | SC_CE12_clus85, SC_PL11_clus21 |
EC Number(s) | - |
Substrate(s) | - |
Taxonomy
Tax ID | 3047872 |
Kingdom | Bacteria |
Phylum | Bacillota |
Class | Bacilli |
Order | Bacillales |
Family | Paenibacillaceae |
Genus | Paenibacillus |
Species | Paenibacillus sp. G2S3 |
Protein Sequence (residues colored by pLDDT band: 90 < pLDDT <= 100; 70 < pLDDT <= 90; 50 < pLDDT <= 70; 0 <= pLDDT <= 50)
MIASGNYSPE KYLSKEEGIR ARSELIFEER ITTKLIVSVY YNWARCVLSS FNHKKAGEAR | 60 |
KMKRNKMGNK ILIVLLSWIF ICSSLFPSTT FGVQPASADS GPITLLYDFG TATSPVMSGY | 120 |
TGVHESKLYT KELGYGLDQA VASRNRSGGD ALTNDFVLGL SYSFLVDLPN GDYDVTIFSG | 180 |
DLLAGTSTTK TTITLEGITA GSISSKQAVN QATYRTTVQD GQLTVGITGT GVGGYLNGLM | 240 |
IQQIVPGPLK APEGLAVTNL SPTAVSLGWS SVTEAVYYNI YRTELPSGTI QPVAQVAVNS | 300 |
YVDSDVNEGG GYIYNVSAVN GTGEESALSA SVTVDKIPGV EVPAAPTGLS IVSVGISSVQ | 360 |
LSWNNVAGAT RYTILRSDSA DGTFHEIGQS ETATFTDVAV DTSKRQYYGV KAANAQGESQ | 420 |
LSNKVESLVY TPPVTLPDGN VYSFDFGPGA AAEGYLKVDA GVSYSPAVKY GFTDISKVTG | 480 |
VDRGTSDPLR SDFVVPKETT FNVDLPNGDY TVSLIAGDSA GDTDIGIKVE SIQKVQQTSK | 540 |
TNGQYLEMNF DIALVDGQMN FVFSGTKPNI NALVITKQPD RPANELPAVY IAGDSTVQTY | 600 |
DPYWIPQAGW GQMIAEFFSQ EVTFKNHAIG GRSSKSFIVE GRLDEVLRKI QPGDYFLIQF | 660 |
GHNDATISVP DRYASPADYK NYLKTYVEGA RQRGATPILV TPMGRRDFNA ATGKFNVSFP | 720 |
EYVQAMKEVA NELHVDLVDL SALSVAYYNS IGFAATRSVF LHLDAGIYGA FPNGSADDTH | 780 |
FQEYGAIQMA RLLAKGIEQL NIPLSSFVQD IKQPETVPAK PKGLVAGSIS NAGAVLKWDK | 840 |
VEGADIYKIY RKLASEAESA YTLAGTATVP TLTLSGMAEG NSYSVRVTAV NGLGESQPSD | 900 |
EVKLTTKSAQ YRYDFGPVGS PVAAGYTEVN RNVLYTSERG YGLTSSEGMI DRDRGSATDA | 960 |
LRRDFVIYFG GSYEFKVDLP NGYYSVKTYT GDWIGSAKTN VAIEGKDYGT VSSGKENIAE | 1020 |
KLYNQIAVKD GQMNLVFSGT TAHLNGLEIT PLLLAPTNLK LGGLDLNSEP ITANLSWDEM | 1080 |
DGALKYRVYR QATVASSAEL LGETTGPVYT DTTADIGMEY IYTVTSVDST GLESVGSNAL | 1140 |
KVSMIDPSVA KAAVPSGLAV QSTNKNDVTF TWNEVPDARM FNIYRAKSAD GEFILIGKSF | 1200 |
EASYTDTTIL STIPYYYKVA SVNAGGISNL SATLETSAVT TLYRKMEALD RAPVAVKTDA | 1260 |
GVYISWRMLG LDSESIGFNL YRGEEKLNDS LITQSTNYLD TSGTADAKYR ITSVINGVEK | 1320 |
AASEEFSVWQ KQYLSIPLQK PADDYTKDGQ PYTYSAGDAS VGDVDGDGVY EIIMLWSPSN | 1380 |
SKDNSQAGYT GLVYMDAYKL DGTRLWRINL GPNIRAGAHY SPFMVYDLDS DGRAEIMLKT | 1440 |
ADGTVDGQGT VIGDASADYR NSSGYVLLGN EYLTVFEGAT GRAVDTVSYD PPRGDVGAWG | 1500 |
DAYGNRVDRF LAAVAYLDGE QPSVIFSRGY YTRTVLAAYN YRGGKLEKVW RFDSNDEGYG | 1560 |
EYAGQGNHNL SVGDVDGDGK DEITFGAMAI DDDGLPLYNT KLGHGDAIHF GDLDPTRPGL | 1620 |
EVFDVHEHTD SKYGIEMRDA ATGETLWGVF TGIDTGRGMS ADIDPRYTGE EVWAATITNE | 1680 |
VQIPVTGVYS AQGELITNKL PSSTNFGIWW DGDLLRELLD SNRVDKWDYT NQTTANLLTA | 1740 |
TGASSNNGTK ANPSLQADLF GDWREEVIWR ATDSSELRIY TTTDMTDYRI RTLMHDPIYR | 1800 |
LGVAWQNVGY NQPPHPGFFL GEGMELPAAP KIQYVGSPVE TEDTTPPVIT GLPSIQMSES | 1860 |
DILKVQVVAE DPESGIRSLD ITFDGKEVVY GDEIPLKGLA GSHTFIATAV NNAGLSTTEQ | 1920 |
VIVVVSGPQK ATGVPGQPVL SNNNGQDIGL LDGDYKITMN MWWGNNGTVY KLYENGTLID | 1980 |
TQTLRDDSPA AQTAMTSVTG KENGTYTYKA ELTNAFGTTA STSHVVTVKD AAPGKPVLSN | 2040 |
DNWDGDGEYK VTMNLWWGMN GKVYRLYENG VLIDTQTLTA NTPNAQTAST SITNRSPGAY | 2100 |
EYRVELMNDQ GVSESTVMKV TVK | 2123 |
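The sequence above is laid out in groups of ten residues, with a running residue count at the end of each row. A minimal sketch of turning that layout back into one contiguous string is shown here; the function name `parse_sequence_block` is hypothetical and the row format is assumed to match the listing above exactly.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Join 10-residue groups into one sequence, checking the running counts."""
    residues = []
    total = 0
    for raw in text.strip().splitlines():
        # Assumed row layout: "<residue groups> | <running count> |"
        body, _, tail = raw.partition("|")
        groups = body.split()
        residues.extend(groups)
        total += sum(len(group) for group in groups)
        count = re.search(r"\d+", tail)
        if count and int(count.group()) != total:
            raise ValueError(f"running count mismatch on row: {raw!r}")
    return "".join(residues)


# First two rows of the listing above.
example = """\
MIASGNYSPE KYLSKEEGIR ARSELIFEER ITTKLIVSVY YNWARCVLSS FNHKKAGEAR | 60 |
KMKRNKMGNK ILIVLLSWIF ICSSLFPSTT FGVQPASADS GPITLLYDFG TATSPVMSGY | 120 |
"""
seq = parse_sequence_block(example)
print(len(seq))  # 120
```

Run over all rows, the returned string should have the length given in the Sequence Length field (2123); the running-count check flags rows that were truncated or merged during copying.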
Predicted 3D structure by AlphaFold2 with pLDDT = 87.16
pLDDT is a per-residue estimate of the accuracy of the predicted structure. A higher value indicates better prediction accuracy. For more detail, please see AlphaFold.
Residues were colored according to pLDDT (blue -> high quality; red -> low quality).
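As an illustration of how a pLDDT score maps onto the four bands listed in the sequence legend, the following sketch assigns a score to its band. The function name `plddt_band` is hypothetical; the band boundaries are the ones shown on this page.

```python
def plddt_band(score: float) -> str:
    """Return the legend band that a pLDDT score (0 to 100) falls into."""
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be between 0 and 100, got {score}")
    if score > 90:
        return "90 < pLDDT <= 100"
    if score > 70:
        return "70 < pLDDT <= 90"
    if score > 50:
        return "50 < pLDDT <= 70"
    return "0 <= pLDDT <= 50"


# The average pLDDT reported for this entry.
print(plddt_band(87.16))  # 70 < pLDDT <= 90
```

The same function can be applied to per-residue scores; note that the average reported for the whole protein says nothing about how individual regions are distributed across the bands.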
Carbohydrate-binding residues predicted by CAPSIF from the 3D structure
Residues were colored according to prediction score:
Nonbinder, CAPSIF:G Predicted Binder, CAPSIF:V Predicted Binder, CAPSIF:V and CAPSIF:G Predicted Binder
CArbohydrate–Protein interaction Site IdentiFier (CAPSIF) predicts non-covalent carbohydrate-binding sites on proteins using two models: (1) a 3D-UNet voxel-based neural network (CAPSIF:V) and (2) an equivariant graph neural network (CAPSIF:G).
Details:
⋆B-Factor = 0.0 : Nonbinder.
⋆B-Factor = 40.0 : CAPSIF:G Predicted Binder.
⋆B-Factor = 59.9 : CAPSIF:V Predicted Binder.
⋆B-Factor = 99.9 : CAPSIF:V and CAPSIF:G Predicted Binder.
For more detail please see CAPSIF.
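The B-factor encoding listed above can be decoded with a short script. This is a sketch under two assumptions: that the downloadable structure is a standard fixed-column PDB file, and that the CAPSIF values are written in the B-factor column (columns 61 to 66) of its `ATOM` records. The names `capsif_label` and `residue_labels` are hypothetical.

```python
# B-factor values and labels as listed on this page.
CAPSIF_LABELS = {
    0.0: "Nonbinder",
    40.0: "CAPSIF:G Predicted Binder",
    59.9: "CAPSIF:V Predicted Binder",
    99.9: "CAPSIF:V and CAPSIF:G Predicted Binder",
}


def capsif_label(b_factor: float, tol: float = 0.05) -> str:
    """Map a B-factor value to its CAPSIF label, allowing for rounding."""
    for value, label in CAPSIF_LABELS.items():
        if abs(b_factor - value) <= tol:
            return label
    raise ValueError(f"unexpected B-factor value: {b_factor}")


def residue_labels(pdb_lines):
    """Return {(chain, residue number): label} from PDB ATOM records."""
    labels = {}
    for line in pdb_lines:
        if not line.startswith("ATOM"):
            continue
        chain = line[21]                 # column 22: chain identifier
        res_seq = int(line[22:26])       # columns 23-26: residue number
        b_factor = float(line[60:66])    # columns 61-66: B-factor
        # All atoms of a residue are assumed to carry the same value,
        # so the first atom seen decides the label.
        labels.setdefault((chain, res_seq), capsif_label(b_factor))
    return labels
```

For example, `residue_labels(open(path))` with `path` pointing at a downloaded file would give one label per residue, from which the predicted binding residues can be filtered.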