y
Basic Information | |
---|---|
Species | Malus domestica |
Cazyme ID | MDP0000319911 |
Family | CBM43 |
Protein Properties | Length: 751 Molecular Weight: 84591.7 Isoelectric Point: 9.774 |
Chromosome | Chromosome/Scaffold: 01724544 Start: 1036 End: 9651 |
Description | O-Glycosyl hydrolases family 17 protein |
View CDS |
External Links |
---|
NCBI Taxonomy |
CAZyDB |
Signature Domain Download full data set without filtering | |||
---|---|---|---|
Family | Start | End | Evalue |
CBM43 | 615 | 691 | 3.4e-28 |
CTAKANADPDKLQDGINWACGQGGANCTPIQRGQRCYLPDTIVNHASYAYNDYYQKMKSAGGTCDFDDTAIISTVDP | |||
GH17 | 279 | 599 | 0 |
IGINIGTDVSDLPSETDTVALLKAHQITHVRLYNADAHILKALSNSGIEVMVGITNEEVLSVGESSSAAAAWINKNVAAYLPSTNITAIAVGSEVITSIP HAAPVLVDAMNYLHKALVASNLNFQVKVSTPQSMDIIPKPFPPSTATFNYSWGHTIYQILQFIKNTNSYYMLNAYPYYGYTNGDGIFPLDYALFRPLPSV KQIVDPNTVFHYNSMFDAMVDATYYSIEDFNFSGISVVVTETGWPWFGGSKEPDANTGNAQTYTNNLIRRVLNDSGPPSQPKLPINTYIYELFNEDKRPG PVSEKNWGVLYTNGSAVYPLS |
Full Sequence |
---|
Protein Sequence Length: 751 Download |
MRGRSYSPSP PPPPPRGGYG RRGQRSPSPR GRYSGGRGSG RDLPTSLLVR NLRHDCRPED 60 LRRPFGQFGV LKDIYLPKDY YTGEPRGFGF VQFVEPADAE EAKYQMDGQL LLGREVTVVF 120 AEENRKKPSD MRHRERSSSR TTSRYRDRRR SPPRYSRSPR YSRSPPPRRA RSRSHSRGYY 180 SPPKRRDYSR CFCLXXPCIY SDVWFLALPA ASVFLRSVSP QERRYSREKS FSRSPPPYEG 240 SRSRSQSPVR GPSRSRSRSH SPRRNIRRSR SPINEGAFIG INIGTDVSDL PSETDTVALL 300 KAHQITHVRL YNADAHILKA LSNSGIEVMV GITNEEVLSV GESSSAAAAW INKNVAAYLP 360 STNITAIAVG SEVITSIPHA APVLVDAMNY LHKALVASNL NFQVKVSTPQ SMDIIPKPFP 420 PSTATFNYSW GHTIYQILQF IKNTNSYYML NAYPYYGYTN GDGIFPLDYA LFRPLPSVKQ 480 IVDPNTVFHY NSMFDAMVDA TYYSIEDFNF SGISVVVTET GWPWFGGSKE PDANTGNAQT 540 YTNNLIRRVL NDSGPPSQPK LPINTYIYEL FNEDKRPGPV SEKNWGVLYT NGSAVYPLSL 600 SDSSQITGNS SGVFCTAKAN ADPDKLQDGI NWACGQGGAN CTPIQRGQRC YLPDTIVNHA 660 SYAYNDYYQK MKSAGGTCDF DDTAIISTVD PILIQANLRD LHQRQLHLQP LLEAGVRIYK 720 FPIFNISFRQ HFSHCYCFDL RFHPVYFYRA A 780 |
Functional Domains Download unfiltered results here | ||||||||
---|---|---|---|---|---|---|---|---|
Cdd ID | Domain | E-Value | Start | End | Length | Domain Description | ||
cd12560 | RRM_SRSF12 | 1.0e-21 | 45 | 128 | 84 | + RNA recognition motif in serine/arginine-rich splicing factor 12 (SRSF12) and similar proteins. This subgroup corresponds to the RRM of SRSF12, also termed 35 kDa SR repressor protein (SRrp35), or splicing factor, arginine/serine-rich 13B (SFRS13B), or splicing factor, arginine/serine-rich 19 (SFRS19). SRSF12 is a serine/arginine (SR) protein-like alternative splicing regulator that antagonizes authentic SR proteins in the modulation of alternative 5' splice site choice. For instance, it activates distal alternative 5' splice site of the adenovirus E1A pre-mRNA in vivo. SRSF12 contains a single N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), followed by a C-terminal RS domain rich in serine-arginine dipeptides. | ||
cd12559 | RRM_SRSF10 | 2.0e-23 | 45 | 128 | 84 | + RNA recognition motif in serine/arginine-rich splicing factor 10 (SRSF10) and similar proteins. This subgroup corresponds to the RRM of SRSF10, also termed 40 kDa SR-repressor protein (SRrp40), or FUS-interacting serine-arginine-rich protein 1 (FUSIP1), or splicing factor SRp38, or splicing factor, arginine/serine-rich 13A (SFRS13A), or TLS-associated protein with Ser-Arg repeats (TASR). SRSF10 is a serine-arginine (SR) protein that acts as a potent and general splicing repressor when dephosphorylated. It mediates global inhibition of splicing both in M phase of the cell cycle and in response to heat shock. SRSF10 emerges as a modulator of cholesterol homeostasis through the regulation of low-density lipoprotein receptor (LDLR) splicing efficiency. It also regulates cardiac-specific alternative splicing of triadin pre-mRNA and is required for proper Ca2+ handling during embryonic heart development. In contrast, the phosphorylated SRSF10 functions as a sequence-specific splicing activator in the presence of a nuclear cofactor. It activates distal alternative 5' splice site of adenovirus E1A pre-mRNA in vivo. Moreover, SRSF10 strengthens pre-mRNA recognition by U1 and U2 snRNPs. SRSF10 localizes to the nuclear speckles and can shuttle between nucleus and cytoplasm. It contains a single N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), followed by a C-terminal RS domain rich in serine-arginine dipeptides. | ||
cd12312 | RRM_SRSF10_SRSF12 | 4.0e-29 | 45 | 128 | 84 | + RNA recognition motif in serine/arginine-rich splicing factor SRSF10, SRSF12 and similar proteins. This subfamily corresponds to the RRM of SRSF10 and SRSF12. SRSF10, also termed 40 kDa SR-repressor protein (SRrp40), or FUS-interacting serine-arginine-rich protein 1 (FUSIP1), or splicing factor SRp38, or splicing factor, arginine/serine-rich 13A (SFRS13A), or TLS-associated protein with Ser-Arg repeats (TASR). It is a serine-arginine (SR) protein that acts as a potent and general splicing repressor when dephosphorylated. It mediates global inhibition of splicing both in M phase of the cell cycle and in response to heat shock. SRSF10 emerges as a modulator of cholesterol homeostasis through the regulation of low-density lipoprotein receptor (LDLR) splicing efficiency. It also regulates cardiac-specific alternative splicing of triadin pre-mRNA and is required for proper Ca2+ handling during embryonic heart development. In contrast, the phosphorylated SRSF10 functions as a sequence-specific splicing activator in the presence of a nuclear cofactor. It activates distal alternative 5' splice site of adenovirus E1A pre-mRNA in vivo. Moreover, SRSF10 strengthens pre-mRNA recognition by U1 and U2 snRNPs. SRSF10 localizes to the nuclear speckles and can shuttle between nucleus and cytoplasm. SRSF12, also termed 35 kDa SR repressor protein (SRrp35), or splicing factor, arginine/serine-rich 13B (SFRS13B), or splicing factor, arginine/serine-rich 19 (SFRS19), is a serine/arginine (SR) protein-like alternative splicing regulator that antagonizes authentic SR proteins in the modulation of alternative 5' splice site choice. For instance, it activates distal alternative 5' splice site of the adenovirus E1A pre-mRNA in vivo. Both, SRSF10 and SRSF12, contain a single N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), followed by a C-terminal RS domain rich in serine-arginine dipeptides. | ||
smart00768 | X8 | 1.0e-32 | 614 | 691 | 78 | + Possibly involved in carbohydrate binding. The X8 domain, which may be involved in carbohydrate binding, is found in an Olive pollen antigen as well as at the C terminus of family 17 glycosyl hydrolases. It contains 6 conserved cysteine residues which presumably form three disulfide bridges. | ||
pfam00332 | Glyco_hydro_17 | 9.0e-77 | 279 | 599 | 321 | + Glycosyl hydrolases family 17. |
Gene Ontology | |
---|---|
GO Term | Description |
GO:0003676 | nucleic acid binding |
GO:0004553 | hydrolase activity, hydrolyzing O-glycosyl compounds |
GO:0005975 | carbohydrate metabolic process |
Annotations - NR Download unfiltered results here | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
DDBJ | BAH57260.1 | 0 | 275 | 691 | 22 | 439 | AT3G13560 [Arabidopsis thaliana] |
EMBL | CAN72077.1 | 0 | 280 | 701 | 27 | 457 | hypothetical protein [Vitis vinifera] |
RefSeq | NP_187965.1 | 0 | 275 | 691 | 22 | 439 | glycosyl hydrolase family 17 protein [Arabidopsis thaliana] |
RefSeq | XP_002283548.1 | 0 | 280 | 691 | 27 | 438 | PREDICTED: hypothetical protein [Vitis vinifera] |
RefSeq | XP_002297638.1 | 0 | 290 | 691 | 1 | 401 | predicted protein [Populus trichocarpa] |
Annotations - PDB Download unfiltered results here | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
PDB | 2cyg_A | 0 | 279 | 600 | 1 | 312 | A Chain A, Crystal Structure At 1.45- Resolution Of The Major Allergen Endo-Beta-1,3-Glucanase Of Banana As A Molecular Basis For The Latex-Fruit Syndrome |
PDB | 1ghs_B | 0 | 279 | 600 | 1 | 306 | A Chain A, The Three-Dimensional Structures Of Two Plant Beta-Glucan Endohydrolases With Distinct Substrate Specificities |
PDB | 1ghs_A | 0 | 279 | 600 | 1 | 306 | A Chain A, The Three-Dimensional Structures Of Two Plant Beta-Glucan Endohydrolases With Distinct Substrate Specificities |
PDB | 3ur8_B | 8.96831e-44 | 279 | 588 | 3 | 302 | A Chain A, The Three-Dimensional Structures Of Two Plant Beta-Glucan Endohydrolases With Distinct Substrate Specificities |
PDB | 3ur8_A | 8.96831e-44 | 279 | 588 | 3 | 302 | A Chain A, The Three-Dimensional Structures Of Two Plant Beta-Glucan Endohydrolases With Distinct Substrate Specificities |
EST Download unfiltered results here | ||||
---|---|---|---|---|
Hit | Length | Start | End | EValue |
GO007298 | 242 | 389 | 630 | 0 |
DV161369 | 274 | 276 | 549 | 0 |
BG648951 | 263 | 310 | 572 | 0 |
CO418267 | 192 | 456 | 647 | 0 |
DV150956 | 253 | 304 | 556 | 0 |
Sequence Alignments (This image is cropped. Click for full image.) |
---|