Basic Information | |
---|---|
Species | Malus domestica |
Cazyme ID | MDP0000251002 |
Family | CBM43 |
Protein Properties | Length: 838; Molecular Weight: 93788.1; Isoelectric Point: 9.5798 |
Chromosome | Chromosome/Scaffold: 009311528; Start: 375; End: 10622 |
Description | O-Glycosyl hydrolases family 17 protein |
External Links |
---|
NCBI Taxonomy |
CAZyDB |
Signature Domain | |||
---|---|---|---|
Family | Start | End | Evalue |
CBM43 | 620 | 696 | 6.2e-31 |
CVAKADADPDKLQDGLNWACGQGGANCTPIQKGQRCYLPDSIVNHASYAFNDYYQKMQSAGGTCDFDGTAMTTTVDP | |||
GH17 | 284 | 604 | 0 |
IGINIGTDVSDLPSETDTVALLKAHQITHVRLYNADAHMLKALSNSGIEVMVGVTNEEVLGIGESASAAAAWINKNVAAYLPSTNITAIAVGSEVITTIPNAAPVLVSAMNYLHKALVASNLNFQIKVSTPQSMDIIPKPFPPSTATFNYSWGPTIYRILQFIKNTNSYYMLNAYPYYGYTSGDGIFPLDYALFRPLPSVKQIVDPNTLFHYNSMFDAMVDATYYSIEDFNFSGISVVVTETGWPWFGGSKEPDANTGNAQTYTNNLIQRVLNGSGPPSQPKLPINTYIYELFNEDQRPGPVSEKNWGVFYTNGSAVYPLS | |||
Full Sequence |
---|
Protein Sequence Length: 838 |
MRGRSYSPSP PPRGGYGRRG QRSPSPRGRY SGGRGSSRDL PTSLLVRNLR HDCRPEDLRR 60 PFGQFGVLKD IYLPKDYYTG EPRGFGFVQF VEPSDAEEAK YQMDGQLLLG REITVVFAEE 120 NRKKPSDMRH RERPSSRTTS RYRDRRRSPP RYSLSPPPRR ARSRSHSHDY YSPPKRRDYS 180 RSVSPQERRY SREKSFSRSP PPYEGSRSRS QSPVRGPSRS RSRSHSPRRS IRRSRSPINE 240 DYPREPNGDR SPSLAEFSSL VRAHLPCISP QNRRYSLCLG GSFIGINIGT DVSDLPSETD 300 TVALLKAHQI THVRLYNADA HMLKALSNSG IEVMVGVTNE EVLGIGESAS AAAAWINKNV 360 AAYLPSTNIT AIAVGSEVIT TIPNAAPVLV SAMNYLHKAL VASNLNFQIK VSTPQSMDII 420 PKPFPPSTAT FNYSWGPTIY RILQFIKNTN SYYMLNAYPY YGYTSGDGIF PLDYALFRPL 480 PSVKQIVDPN TLFHYNSMFD AMVDATYYSI EDFNFSGISV VVTETGWPWF GGSKEPDANT 540 GNAQTYTNNL IQRVLNGSGP PSQPKLPINT YIYELFNEDQ RPGPVSEKNW GVFYTNGSAV 600 YPLSLNDSNQ ITGNSSGVFC VAKADADPDK LQDGLNWACG QGGANCTPIQ KGQRCYLPDS 660 IVNHASYAFN DYYQKMQSAG GTCDFDGTAM TTTVDPILIQ AHLRDLHQRQ LHLQAPLEAG 720 VRIYKFPIFN ISYRLHFLYC FCFDLRFRPV CYHKQHLSFF PLNSPKSEET GASKMTTPSA 780 PSLASSSRWA STTSVSSSGK RIQREMVELN MDPPPDCCAG PKGDNLYHWI ATLFGPPX 840 |
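The Start and End values in the Signature Domain table appear to be 1-based, inclusive positions in the full sequence. The following is a minimal sketch, not part of the original record, that checks this reading on the CBM43 hit (620 to 696). The helper names `strip_counters` and `slice_region` are hypothetical, and the excerpt is copied from residues 601 to 720 of the listing above, including its running position counters.

```python
import re


def strip_counters(numbered: str) -> str:
    """Drop whitespace and running position counters from a numbered listing."""
    return re.sub(r"[\s\d]+", "", numbered)


def slice_region(fragment: str, fragment_start: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive).

    fragment_start is the full-sequence position of the fragment's first residue.
    """
    if not (fragment_start <= start <= end < fragment_start + len(fragment)):
        raise ValueError("requested region lies outside the fragment")
    return fragment[start - fragment_start : end - fragment_start + 1]


# Residues 601-720 as printed in the Full Sequence listing (counters included).
EXCERPT_601_720 = (
    "YPLSLNDSNQ ITGNSSGVFC VAKADADPDK LQDGLNWACG QGGANCTPIQ KGQRCYLPDS 660 "
    "IVNHASYAFN DYYQKMQSAG GTCDFDGTAM TTTVDPILIQ AHLRDLHQRQ LHLQAPLEAG 720"
)

fragment = strip_counters(EXCERPT_601_720)
cbm43 = slice_region(fragment, fragment_start=601, start=620, end=696)
print(len(cbm43), cbm43[:11], cbm43[-6:])
```

Under that assumption the slice should reproduce the CBM43 subsequence shown in the Signature Domain table; the same helper can be applied to the GH17 hit (284 to 604) once the full sequence has been cleaned the same way.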
Functional Domains | ||||||||
---|---|---|---|---|---|---|---|---|
Cdd ID | Domain | E-Value | Start | End | Length | Domain Description | ||
pfam07983 | X8 | 3.0e-22 | 620 | 690 | 76 | + X8 domain. The X8 domain contains at least 6 conserved cysteine residues that presumably form three disulphide bridges. The domain is found in an Olive pollen allergen as well as at the C-terminus of several families of glycosyl hydrolases. This domain may be involved in carbohydrate binding. This domain is characteristic of GPI-anchored domains. | ||
cd12559 | RRM_SRSF10 | 1.0e-23 | 42 | 125 | 84 | + RNA recognition motif in serine/arginine-rich splicing factor 10 (SRSF10) and similar proteins. This subgroup corresponds to the RRM of SRSF10, also termed 40 kDa SR-repressor protein (SRrp40), or FUS-interacting serine-arginine-rich protein 1 (FUSIP1), or splicing factor SRp38, or splicing factor, arginine/serine-rich 13A (SFRS13A), or TLS-associated protein with Ser-Arg repeats (TASR). SRSF10 is a serine-arginine (SR) protein that acts as a potent and general splicing repressor when dephosphorylated. It mediates global inhibition of splicing both in M phase of the cell cycle and in response to heat shock. SRSF10 emerges as a modulator of cholesterol homeostasis through the regulation of low-density lipoprotein receptor (LDLR) splicing efficiency. It also regulates cardiac-specific alternative splicing of triadin pre-mRNA and is required for proper Ca2+ handling during embryonic heart development. In contrast, the phosphorylated SRSF10 functions as a sequence-specific splicing activator in the presence of a nuclear cofactor. It activates distal alternative 5' splice site of adenovirus E1A pre-mRNA in vivo. Moreover, SRSF10 strengthens pre-mRNA recognition by U1 and U2 snRNPs. SRSF10 localizes to the nuclear speckles and can shuttle between nucleus and cytoplasm. It contains a single N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), followed by a C-terminal RS domain rich in serine-arginine dipeptides. | ||
cd12312 | RRM_SRSF10_SRSF12 | 2.0e-29 | 42 | 125 | 84 | + RNA recognition motif in serine/arginine-rich splicing factor SRSF10, SRSF12 and similar proteins. This subfamily corresponds to the RRM of SRSF10 and SRSF12. SRSF10, also termed 40 kDa SR-repressor protein (SRrp40), or FUS-interacting serine-arginine-rich protein 1 (FUSIP1), or splicing factor SRp38, or splicing factor, arginine/serine-rich 13A (SFRS13A), or TLS-associated protein with Ser-Arg repeats (TASR), is a serine-arginine (SR) protein that acts as a potent and general splicing repressor when dephosphorylated. It mediates global inhibition of splicing both in M phase of the cell cycle and in response to heat shock. SRSF10 emerges as a modulator of cholesterol homeostasis through the regulation of low-density lipoprotein receptor (LDLR) splicing efficiency. It also regulates cardiac-specific alternative splicing of triadin pre-mRNA and is required for proper Ca2+ handling during embryonic heart development. In contrast, the phosphorylated SRSF10 functions as a sequence-specific splicing activator in the presence of a nuclear cofactor. It activates distal alternative 5' splice site of adenovirus E1A pre-mRNA in vivo. Moreover, SRSF10 strengthens pre-mRNA recognition by U1 and U2 snRNPs. SRSF10 localizes to the nuclear speckles and can shuttle between nucleus and cytoplasm. SRSF12, also termed 35 kDa SR repressor protein (SRrp35), or splicing factor, arginine/serine-rich 13B (SFRS13B), or splicing factor, arginine/serine-rich 19 (SFRS19), is a serine/arginine (SR) protein-like alternative splicing regulator that antagonizes authentic SR proteins in the modulation of alternative 5' splice site choice. For instance, it activates distal alternative 5' splice site of the adenovirus E1A pre-mRNA in vivo. Both SRSF10 and SRSF12 contain a single N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), followed by a C-terminal RS domain rich in serine-arginine dipeptides. | ||
smart00768 | X8 | 1.0e-35 | 619 | 696 | 78 | + Possibly involved in carbohydrate binding. The X8 domain, which may be involved in carbohydrate binding, is found in an Olive pollen antigen as well as at the C terminus of family 17 glycosyl hydrolases. It contains 6 conserved cysteine residues which presumably form three disulfide bridges. | ||
pfam00332 | Glyco_hydro_17 | 1.0e-82 | 284 | 604 | 321 | + Glycosyl hydrolases family 17. |
Gene Ontology | |
---|---|
GO Term | Description |
GO:0003676 | nucleic acid binding |
GO:0004553 | hydrolase activity, hydrolyzing O-glycosyl compounds |
GO:0005975 | carbohydrate metabolic process |
Annotations - NR | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
DDBJ | BAH57260.1 | 0 | 278 | 696 | 20 | 439 | AT3G13560 [Arabidopsis thaliana] |
EMBL | CAN72077.1 | 0 | 285 | 706 | 27 | 457 | hypothetical protein [Vitis vinifera] |
RefSeq | XP_002283548.1 | 0 | 285 | 696 | 27 | 438 | PREDICTED: hypothetical protein [Vitis vinifera] |
RefSeq | XP_002283548.1 | 5e-16 | 789 | 836 | 522 | 569 | PREDICTED: hypothetical protein [Vitis vinifera] |
RefSeq | XP_002297638.1 | 0 | 295 | 696 | 1 | 401 | predicted protein [Populus trichocarpa] |
Annotations - PDB | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
PDB | 2cyg_A | 0 | 284 | 605 | 1 | 312 | A Chain A, Crystal Structure At 1.45- Resolution Of The Major Allergen Endo-Beta-1,3-Glucanase Of Banana As A Molecular Basis For The Latex-Fruit Syndrome |
PDB | 1ghs_B | 0 | 284 | 605 | 1 | 306 | A Chain A, The Three-Dimensional Structures Of Two Plant Beta-Glucan Endohydrolases With Distinct Substrate Specificities |
PDB | 1ghs_A | 0 | 284 | 605 | 1 | 306 | A Chain A, The Three-Dimensional Structures Of Two Plant Beta-Glucan Endohydrolases With Distinct Substrate Specificities |
PDB | 3ur8_B | 1.4013e-45 | 284 | 606 | 3 | 315 | A Chain A, The Three-Dimensional Structures Of Two Plant Beta-Glucan Endohydrolases With Distinct Substrate Specificities |
PDB | 3ur8_A | 1.4013e-45 | 284 | 606 | 3 | 315 | A Chain A, The Three-Dimensional Structures Of Two Plant Beta-Glucan Endohydrolases With Distinct Substrate Specificities |
EST | ||||
---|---|---|---|---|
Hit | Length | Start | End | EValue |
DV161369 | 276 | 279 | 554 | 0 |
BG648951 | 263 | 315 | 577 | 0 |
DV150956 | 253 | 309 | 561 | 0 |
GO007298 | 242 | 394 | 635 | 0 |
DV154597 | 235 | 392 | 626 | 0 |