Information for CAZyme ID: AWI09507.1
Basic Information
GenBank ID | AWI09507.1 |
Family | PL0 |
Sequence Length | 3667 |
UniProt ID | A0A2U8E3N6(100,100) |
Average pLDDT | 47.70 |
CAZy50 ID | 167 |
CAZy50 Rep | Yes, AWI09507.1 |
Structure Cluster | - |
EC Number(s) | - |
Substrate(s) | - |
Taxonomy
Tax ID | 1796921 |
Kingdom | Bacteria |
Phylum | Verrucomicrobiota |
Class | Opitutae |
Order | Opitutales |
Family | Opitutaceae |
Genus | Ereboglobus |
Species | Ereboglobus luteus |
Protein Sequence (residues colored by pLDDT bin: 90 < pLDDT <= 100; 70 < pLDDT <= 90; 50 < pLDDT <= 70; 0 <= pLDDT <= 50)
MHTTTTPTRI VRIATLAVAF AASFATLRAA PAPTIPSSAY NLTGFAATTT GGGVINDTDA | 60 |
AYRKVTTALE FITAIRDSNK TAGAVKVIEV MNDLNLGWNE IGSEAQNLDS NPARAHAAPK | 120 |
LHPTLITSGV TLLDIKTKGG GLTIFSANGA TIKHCTFNIK NTSNIIIRNL KFDEMWEWDE | 180 |
ATKGDYDSND WDFITLGNGG DTTNIWIDHC TFTSAYDGIT DMKAKASNVT YSWCRITGSD | 240 |
DAPGGFIRAQ LDVLEASKSS HPMYNTLRTK AGFTIEEIAF ITRGAKKGIL MGANSLKAEN | 300 |
ANLTATFHHI LMTNLWDRAI PRLRGGNVHI YNVILDDTEA LAAKRLRDTR AAALDSAGLS | 360 |
ALNKYKFNPP LNGAISTEDG AILLEKSIYT DCLWPLRNNQ TDPSNAAYTG KILGLDVIYN | 420 |
FDSTTIRGNS TDDGNPLGPF QSAIKPFSWS FADNSQTLPY TIDNMDDPSV LPVILEAGAG | 480 |
AGVIEWADTD GKYNWLKTTY PAAPASDIAP VITLQPKSQS VIEGSSVTLS VSAIASPAPS | 540 |
YQWYKDSLPV AGGTTSTLTI ASASAASVGA YYAVVTNGIA PDATSATATI SLTSPPAAPV | 600 |
AAAATGITGE GFTANWALSA NATGYSIDVS PDPTFANCWP GYENLAVGNV ASHAITGLSP | 660 |
ATTYYYRVRA TVGSYSTADS NAIAAATAAV ANIPAEVDDP MDDTDRLSAP SLTNTRWVVA | 720 |
NSASYSQLAA TTGGITWTLA SANGTLAVGY FPKVTVPVGE TLTATLEFTP GTAGATADGF | 780 |
RFAFLSSGSD GILSNDVTSG TNDVFKTHTG YALFSKSGNV GGGSTAALAV DAYRRNDSPG | 840 |
TPADLLSKSG DWTKQSSSSG ASGNLVAGST YKLVFQVLNN GSSLTVTESL TGPGLSNVSA | 900 |
SFTDNAPAAN YFDAIALRFA KGSNQFGSIT LNSLKVTGGV VETAVDPLPA IYSAPTASGA | 960 |
QGAAFNYACV STNGATSYSA TGLPSGLSIN ASTGVISGTP AEYGSFDAIV TASNANGAGA | 1020 |
DFTLTLTIEP EPAAAPVAAE ATARTAEGFT ANWTAVPGAT SYQLDIATDS GFTTMVAGYE | 1080 |
ELNVGDVTSY AVTGLDAATT YYYRVSAVTE SVVGNASATI TVVTAGEGGG YLVNDSFTDA | 1140 |
DRIGGTDGSS THKTNGPFVA TPSSANTQWV ANQVSTLVAT GTGLVWGYTG TSSATALGYF | 1200 |
PDVTVQSGVP MTIALTFTTG TTGGTVNNIP NNLRIGLIND TASGRANNDG ISSTDARFEG | 1260 |
DTGYAVFSAS SVVGGGSTAD IGLKTYKRTP TDDKKTDLIN TDVNWTSLGA STGATGNLGV | 1320 |
NTSYTLTFTL NYNGSKMTIN TKLAGGDLAG FDYTVEDETS PVLTFNTIAF RLGKGVGQFS | 1380 |
EINFTNLKIW EGAEPASGPA APTLAEPNSI TAEGFTANWN TAAGATGYYL DVSTDPDFGS | 1440 |
FVSGYENLYV GNNLSHAITG LTAGVTYYYR VRAANADGTS ASTGASTAVV TTGGGGNTYL | 1500 |
VNDSFTDYDR IGGFDGSATA PDAPLVGTPT ATNTQWVASS TNTLVATGTG MVWNYAGTGN | 1560 |
SMTIGYFPEV TVANSGTVTV QLKFTTGVVG TGTNNLRIAL INDTPNGRFE TDGVSSASDY | 1620 |
YKGDTGYAIM SAASNIGNAT ANLVLRTYKH INLETTDLLG TAGNWGTATG TTSQIGNSSG | 1680 |
STGYFQGETD YTLTLTLAKN TSGTEMTIGT KLEGGNFTGL EYSVVDRDTT VAIPGSFNAL | 1740 |
AFRLGGSTTQ FDHLKFTSLK VWEGDEPAPV AAAPVITCPA TASATQGAAF TYNITAINSP | 1800 |
TSYALASGAL PAGVTLNTST GVISGTPTES GTFNVTLTAT NDIGASAPHS LAITVATAIT | 1860 |
EPPVINSETT ATAVIGAAFT YTITATNEPS SFAATGLPAS GNLQLDTATG VISGTAAVGD | 1920 |
LGTHTVVLTA TNAIGASAPV TLTLTVELPP ALSAPVALPA TGETTTGFIA RWEAVTGADS | 1980 |
YRLDVATDAG FTSLVSGYGD LNIGTMTGRM VTGLSADTDY YYRVRAVNEA GPSASSNTGA | 2040 |
ARTASSEVVL VDDSFNDTDR IGGFDGTSTS SSSPAINTPT SANTQWVIGN AGQLIATTGG | 2100 |
MNWNFLVTNA VSALGYFPTV NVANGTTVTL RLKFTTGTLG GATSGNNFRI VMIDSSPNGY | 2160 |
RQTDGAGSTA DPFIGDKGYA FFMPSPVAGA STMPVTLQSF KRTALTSDNL FGSDASWTRS | 2220 |
NSVTQADGHR FASNTAYTLT ITLTRSSATS ITTNAVITGG NFDNLICEIA DADTPVTKFD | 2280 |
TLGLRFGAGI NQFNQITLNS LRITTTGSSE VTEPPVITSA LTASATRGAA FSYTIVADNV | 2340 |
PTSFTAEGLP SGLELETSTG VISGTPTATG SYAVTIGAHN AIGSDTRELI ITVTAGGSDI | 2400 |
TTIVPPSGGL TSPASTVFDA AGNAYIADIA AGAIMKVAGD GTVTTFATIP QLAVVAADSS | 2460 |
GNLYAAGNDG NVTKILADGT VVSPALATGV TTPGGIAVDS DGNVYVSKTA SNTIVKITPA | 2520 |
GAVSTLAGSG SAGSADATGD AASFNGPTGL ALNGNTGTLY VADTVNCTIR AIDLATGTVT | 2580 |
TVAGRAGVAG DIGSEGSDGK ATDGTLDTPE AITIDAAGLL YIADTGNNLI RGFDPVSGAL | 2640 |
TTLAGDSGSI TLVAPAGLAF NPVSGLLNVA DTGNGALRAV TIKPVIAAQI VDRIAKLGTT | 2700 |
VTLDGTAWAS PAASYQWSVH GTALAGSEAT ITINVKSVTD SGVYTIAASN TAGESTAGMR | 2760 |
LTVTGNDSTQ GPNDNSPNMN DSGGGGGAPS LWILGAMALL ALVRKFSARR MPAKLLPLLV | 2820 |
FLSFLAIHTS PFAIAQQATP SAELPPDDEI ITMSAFEVTG QSIKGYTASE SVTGTRVASL | 2880 |
LRDLPFNVNV VTDEFIADFN AFDLADQLSN VSSFSPSENI GQFQLRGFEA STQLVDGFRR | 2940 |
VGLTGVTVTD RVEVIKGPAA SIYGAIQPGG AVNTLRKKPA AKPKYGLTLG VGTHDQARAS | 3000 |
FYATGPVGNS KKLFYRLDTE YRRTERQQEF TRTRNGYAAL QLAYKPTSRT TLSVFIDHAD | 3060 |
RHDHPVSQLA TSAARVDIGN LPSWFPYDTS DLRRTWAKYF TQYFAQDFDY YDMNFYGPTA | 3120 |
AKYSRLTSGS VTLDHKVNSL WSIRASFNMS TNLSHTENAN IGYYPFGYGV INAATTEPPT | 3180 |
AEVRITPKHA ETTNKATGFQ LDNLFRFDTG PIKHQLLVTG DYYRNSSREF SATWSNAFFY | 3240 |
DPADPYNLAG NPSYASWDYA TWDDDRSPYN RVGDNSKVIN RNYGLFVSER ATMFKGRLIA | 3300 |
MAGVRYDYVD CTATHMPTDD NYTTKVVDYS PDSWTYQLGL TAVINRNITA YVNASSAFDP | 3360 |
QPQLDEYDNP LPNKESDGYE FGFKFTLFSE ALNITLNRFY IRQENLTYSV NDPETGQKET | 3420 |
IVTGEQKAKG YEIDFNWQLT RSLNIIGGYG YVDAEITDAG KLNWLNNTTP RRVPKHNLGI | 3480 |
SVRYEFVSGP LKGLFATGGA TYYSKSLVNA GSGYNITPYK GTEDFNRMSQ ELIYNVRFPN | 3540 |
GGLPYPYLPE NAIVSYFTPA SGSTPAMLYW TDNQGATTVD ELKKYSYASP YLNGAVYVID | 3600 |
GRTQIYNRSS VVWKMGVGYK FKARGFGKKL SHKIQVNMNN VFNEKSTIGG GIPILERNVM | 3660 |
VTYSVTF | 3667 |
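The sequence listing above uses blocks of ten residues followed by a running residue count between pipes. A minimal sketch of how such lines could be joined into one contiguous string is shown here; the function name `parse_sequence_block` and the two-line example are illustrative assumptions, not part of the database's tooling.

```python
import re

def parse_sequence_block(lines):
    """Join lines like 'AAAAAAAAAA BBBBBBBBBB | 20 |' into one sequence string.

    Hypothetical helper: checks that each running count matches the number
    of residues read so far, so truncated or shifted lines are caught early.
    """
    pattern = re.compile(r"^([A-Z ]+?)\s*\|\s*(\d+)\s*\|\s*$")
    chunks = []
    total = 0
    for line in lines:
        match = pattern.match(line.strip())
        if match is None:
            raise ValueError(f"unrecognized sequence line: {line!r}")
        chunk = match.group(1).replace(" ", "")  # drop the 10-residue spacing
        total += len(chunk)
        if total != int(match.group(2)):
            raise ValueError(
                f"running count {match.group(2)} does not match {total} residues"
            )
        chunks.append(chunk)
    return "".join(chunks)

# First two lines of the listing above, used only as example input.
example_lines = [
    "MHTTTTPTRI VRIATLAVAF AASFATLRAA PAPTIPSSAY NLTGFAATTT GGGVINDTDA | 60 |",
    "AYRKVTTALE FITAIRDSNK TAGAVKVIEV MNDLNLGWNE IGSEAQNLDS NPARAHAAPK | 120 |",
]
sequence = parse_sequence_block(example_lines)
```

Applied to the full listing, the final running count should equal the Sequence Length reported under Basic Information (3667).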
3D structure predicted by AlphaFold2, with average pLDDT = 47.70
pLDDT is a per-residue measure of the accuracy of the predicted structure. A higher value indicates better prediction accuracy. For more detail, please see AlphaFold.
Residues are colored according to pLDDT (blue: high quality; red: low quality).
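The four pLDDT ranges listed in the Protein Sequence legend can be expressed as a small lookup. This is a sketch under the assumption that the bin boundaries are exactly as printed there; the function name `plddt_bin` is hypothetical.

```python
def plddt_bin(plddt):
    """Return the legend range that a pLDDT value falls into.

    Boundaries follow the ranges printed in the sequence legend:
    (90, 100], (70, 90], (50, 70], [0, 50].
    """
    if not 0.0 <= plddt <= 100.0:
        raise ValueError(f"pLDDT must lie in [0, 100], got {plddt}")
    if plddt > 90.0:
        return "90 < pLDDT <= 100"
    if plddt > 70.0:
        return "70 < pLDDT <= 90"
    if plddt > 50.0:
        return "50 < pLDDT <= 70"
    return "0 <= pLDDT <= 50"

# The average reported for this entry, used only as example input.
average_bin = plddt_bin(47.70)
```

Note that 47.70 is an average over the whole protein, while the legend ranges apply per residue, so individual residues may fall into any of the four bins.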
Full Sequence, colored by CAPSIF prediction (CAPSIF:V and CAPSIF:G = 99.9; CAPSIF:V = 59.9; CAPSIF:G = 40; Non-Binding = 0): identical to the Protein Sequence listed above.
Carbohydrate-binding residues predicted by CAPSIF from the 3D structure
Residues are colored according to prediction score:
Nonbinder; CAPSIF:G Predicted Binder; CAPSIF:V Predicted Binder; CAPSIF:V and CAPSIF:G Predicted Binder
The CArbohydrate–Protein interaction Site IdentiFier (CAPSIF) predicts non-covalent carbohydrate-binding sites on proteins using two models: (1) a 3D-UNet voxel-based neural network model (CAPSIF:V) and (2) an equivariant graph neural network model (CAPSIF:G).
Details:
⋆B-Factor = 0.0 : Nonbinder.
⋆B-Factor = 40.0 : CAPSIF:G Predicted Binder.
⋆B-Factor = 59.9 : CAPSIF:V Predicted Binder.
⋆B-Factor = 99.9 : CAPSIF:V and CAPSIF:G Predicted Binder.
For more detail, please see CAPSIF.
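The B-Factor codes listed under Details can be decoded with a simple table. The sketch below assumes the values have already been read as floats from the B-factor column of the downloaded structure file; the names `CAPSIF_LABELS` and `capsif_label`, and the tolerance, are illustrative choices rather than part of CAPSIF itself.

```python
import math

# Codes and labels copied from the Details list above.
CAPSIF_LABELS = {
    0.0: "Nonbinder",
    40.0: "CAPSIF:G Predicted Binder",
    59.9: "CAPSIF:V Predicted Binder",
    99.9: "CAPSIF:V and CAPSIF:G Predicted Binder",
}

def capsif_label(b_factor, tol=0.05):
    """Map a B-factor value to its CAPSIF prediction label.

    A small absolute tolerance (an assumption) absorbs rounding in the file.
    """
    for code, label in CAPSIF_LABELS.items():
        if math.isclose(b_factor, code, abs_tol=tol):
            return label
    raise ValueError(f"B-factor {b_factor} is not a known CAPSIF code")

# Example: tally labels for a handful of made-up per-residue values.
example_values = [0.0, 0.0, 40.0, 59.9, 99.9]
counts = {}
for value in example_values:
    label = capsif_label(value)
    counts[label] = counts.get(label, 0) + 1
```

Raising an error on unknown codes, rather than defaulting to "Nonbinder", makes it harder to misread a file whose B-factor column holds something other than CAPSIF codes, such as pLDDT values.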