Basic Information | |
---|---|
Species | Malus domestica |
Cazyme ID | MDP0000716700 |
Family | CBM43 |
Protein Properties | Length: 945; Molecular Weight: 97223.1; Isoelectric Point: 8.1091 |
Chromosome | Chromosome/Scaffold: 012993205; Start: 5021; End: 10713 |
Description | O-Glycosyl hydrolases family 17 protein |
Signature Domain | |||
---|---|---|---|
Family | Start | End | Evalue |
CBM43 | 366 | 446 | 5.2e-22 |
WCVVNNNRDLSNATASALEACLHADCSAMSPGGSCXNISWPGNISYAFNSYYQQHNQSADSCEFGSLGLITTVDPSLDNCK | |||
GH17 | 22 | 346 | 0 |
IGVNWGTTASHPLPPTKVVELLKSNNVTKVKLFDADPGVLEALSGSKLSVTVGIPNALLKVLNSSKKAAESWVHDNVTRYVSNSGGGGVKIEYVAVGDEPFLQSYGEQFHPFVIGAAMNIHAALARAKLESNVKVVVPCSFDSFLSESGHPSKGHFRADLNRTMIELLTFLSKHNSPFFAXISPFISLRQNKNISLDFTLLKENAKPHNDSHRTYKNXFDLIYDTLVTALSTVGFPKMEIVVSQIGWPTDGAANATPSTAETFMKGLMAHLRSKSGTPLRPHNPPVETYIFSLLDEDQRSISMGNFERHWGVFTFDGQAKYHFDF |
Full Sequence |
---|
Protein Sequence Length: 945 |
MPPNLRTLLL LLATTASTCG AIGVNWGTTA SHPLPPTKVV ELLKSNNVTK VKLFDADPGV 60 LEALSGSKLS VTVGIPNALL KVLNSSKKAA ESWVHDNVTR YVSNSGGGGV KIEYVAVGDE 120 PFLQSYGEQF HPFVIGAAMN IHAALARAKL ESNVKVVVPC SFDSFLSESG HPSKGHFRAD 180 LNRTMIELLT FLSKHNSPFF AXISPFISLR QNKNISLDFT LLKENAKPHN DSHRTYKNXF 240 DLIYDTLVTA LSTVGFPKME IVVSQIGWPT DGAANATPST AETFMKGLMA HLRSKSGTPL 300 RPHNPPVETY IFSLLDEDQR SISMGNFERH WGVFTFDGQA KYHFDFIQGS KNLVDAQNVE 360 YLPAKWCVVN NNRDLSNATA SALEACLHAD CSAMSPGGSC XNISWPGNIS YAFNSYYQQH 420 NQSADSCEFG SLGLITTVDP SLDNCKFSVQ LRASLSGSLH PAYLPLWMTX LIMAGMYGQE 480 GDGAPPPYGS SGGGGYGGSG GYGGGSGGYG GGGGGGSYGG SGGGGYGGKG GDXYGGGGGR 540 GGGGYGGGGG RGGGYQGDRG GGGRGGDRGG GGGRGGRGGS GRDGDWXCPN PGCGNLNFAR 600 RVECNKCGTP SPSGAAGGDR GSGGGGNYNR GGTGGGNRGG GTGGGNYDGG RSGNYEGGKG 660 SSYDGGRGGS YESRGGGGSR GGSYGGSQGR DDGGYGQVPP NAPPSYGTAG GNYPSYNASY 720 GTDAVPPPTS YTGGPASYPP SYGGPAGGYG GDGPGDARGG ARGGPPAKYD GGYGAGGRGG 780 YGSAATEAPS KVKQCDQNCD DTCDNARIYI SNLPPDVTVD ELQQLFGGIG QVGRIKQKRG 840 YKDQWPYNIK IYTDESGKNK GDACLAYEDP SAAHSAGGFY NDYELRGYKI SVAMAERSAP 900 RPSFEQGGGG GGGRGGYGGG DRRNNYRDGG ADRHQHGGNR SRPY* 960 |
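
The domain coordinates listed in this record appear to be 1-based and inclusive: residue 22 of the full sequence is the `I` that opens the GH17 domain sequence. The following is a minimal sketch, not part of the database itself, of how the displayed sequence could be cleaned of its position counters and a domain region cut out. The function names `clean_sequence` and `domain_slice` are hypothetical, and only the first 60 residues are used to keep the example short.

```python
import re


def clean_sequence(raw: str) -> str:
    """Drop position counters, spaces and a trailing stop marker ('*')."""
    # Keep only letters and '*'; this discards digits and whitespace.
    residues = re.sub(r"[^A-Za-z*]", "", raw)
    return residues.rstrip("*").upper()


def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"coordinates {start}-{end} fall outside 1-{len(seq)}")
    return seq[start - 1:end]


# First display row of the full sequence, copied from the record above.
raw_first_row = (
    "MPPNLRTLLL LLATTASTCG AIGVNWGTTA SHPLPPTKVV ELLKSNNVTK VKLFDADPGV 60"
)

seq = clean_sequence(raw_first_row)
print(len(seq))                   # 60
print(domain_slice(seq, 22, 30))  # IGVNWGTTA
```

Applied to the complete sequence, `domain_slice(seq, 22, 346)` and `domain_slice(seq, 366, 446)` would give the GH17 and CBM43 regions, under the same coordinate assumption.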
Functional Domains | ||||||||
---|---|---|---|---|---|---|---|---|
Cdd ID | Domain | E-Value | Start | End | Length | Domain Description | ||
pfam07983 | X8 | 3.0e-12 | 365 | 433 | 79 | X8 domain. The X8 domain contains at least 6 conserved cysteine residues that presumably form three disulphide bridges. The domain is found in an Olive pollen allergen as well as at the C-terminus of several families of glycosyl hydrolases. This domain may be involved in carbohydrate binding. This domain is characteristic of GPI-anchored domains. | ||
cd12534 | RRM_SARFH | 4.0e-15 | 808 | 895 | 89 | RNA recognition motif in Drosophila melanogaster RNA-binding protein cabeza and similar proteins. This subgroup corresponds to the RRM in cabeza, also termed P19, or sarcoma-associated RNA-binding fly homolog (SARFH). It is a putative homolog of human RNA-binding proteins FUS (also termed TLS or Pigpen or hnRNP P2), EWS (also termed EWSR1), TAF15 (also termed hTAFII68 or TAF2N or RPB56), and belongs to the FET (previously TET) (FUS/TLS, EWS, TAF15) family of RNA- and DNA-binding proteins whose expression is altered in cancer. It is a nuclear RNA binding protein that may play an important role in the regulation of RNA metabolism during fly development. Cabeza contains one RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain). | ||
smart00768 | X8 | 1.0e-25 | 365 | 447 | 85 | Possibly involved in carbohydrate binding. The X8 domain, which may be involved in carbohydrate binding, is found in an Olive pollen antigen as well as at the C terminus of family 17 glycosyl hydrolases. It contains 6 conserved cysteine residues which presumably form three disulfide bridges. | ||
cd12280 | RRM_FET | 2.0e-35 | 808 | 894 | 87 | RNA recognition motif in the FET family of RNA-binding proteins. This subfamily corresponds to the RRM of FET (previously TET) (FUS/TLS, EWS, TAF15) family of RNA-binding proteins. This ubiquitously expressed family of similarly structured proteins, predominantly localizing to the nucleus, includes FUS (also known as TLS or Pigpen or hnRNP P2), EWS (also known as EWSR1), TAF15 (also known as hTAFII68 or TAF2N or RPB56), and Drosophila Cabeza (also known as SARFH). The corresponding coding genes of these proteins are involved in deleterious genomic rearrangements with transcription factor genes in a variety of human sarcomas and acute leukemias. All FET proteins interact with each other and are therefore likely to be part of the very same protein complexes, which suggests a general bridging role for FET proteins coupling RNA transcription, processing, transport, and DNA repair. The FET proteins contain multiple copies of a degenerate hexapeptide repeat motif at the N-terminus. The C-terminal region consists of a conserved nuclear import and retention signal (C-NLS), a putative zinc-finger domain, and a conserved RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), which is flanked by 3 arginine-glycine-glycine (RGG) boxes. FUS and EWS might have similar sequence specificity; both bind preferentially to GGUG-containing RNAs. FUS has also been shown to bind strongly to human telomeric RNA and to small low-copy-number RNAs tethered to the promoter of cyclin D1. To date, nothing is known about the RNA binding specificity of TAF15. | ||
pfam00332 | Glyco_hydro_17 | 8.0e-69 | 22 | 346 | 326 | Glycosyl hydrolases family 17. |
Gene Ontology | |
---|---|
GO Term | Description |
GO:0003676 | nucleic acid binding |
GO:0004553 | hydrolase activity, hydrolyzing O-glycosyl compounds |
GO:0005622 | intracellular |
GO:0005975 | carbohydrate metabolic process |
GO:0008270 | zinc ion binding |
Annotations - NR | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
EMBL | CBI28434.1 | 0 | 1 | 613 | 1 | 649 | unnamed protein product [Vitis vinifera] |
EMBL | CBI28434.1 | 0 | 720 | 944 | 693 | 873 | unnamed protein product [Vitis vinifera] |
RefSeq | NP_200656.2 | 0 | 12 | 467 | 16 | 464 | glycosyl hydrolase family 17 protein [Arabidopsis thaliana] |
RefSeq | XP_002269108.1 | 0 | 21 | 484 | 25 | 488 | PREDICTED: hypothetical protein [Vitis vinifera] |
RefSeq | XP_002313970.1 | 0 | 18 | 473 | 18 | 472 | predicted protein [Populus trichocarpa] |
Annotations - PDB | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
PDB | 3f55_D | 9.80909e-45 | 22 | 346 | 2 | 315 | A Chain A, Structure Of Alfa-Galactosidase From Saccharomyces Cerevisiae With Raffinose |
PDB | 3f55_C | 9.80909e-45 | 22 | 346 | 2 | 315 | A Chain A, Structure Of Alfa-Galactosidase From Saccharomyces Cerevisiae With Raffinose |
PDB | 3f55_B | 9.80909e-45 | 22 | 346 | 2 | 315 | A Chain A, Structure Of Alfa-Galactosidase From Saccharomyces Cerevisiae With Raffinose |
PDB | 3f55_A | 9.80909e-45 | 22 | 346 | 2 | 315 | A Chain A, Structure Of Alfa-Galactosidase From Saccharomyces Cerevisiae With Raffinose |
PDB | 3em5_D | 9.80909e-45 | 22 | 346 | 2 | 315 | A Chain A, Crystal Structure Of A Native Endo Beta-1,3-Glucanase (Hev B 2), A Major Allergen From Hevea Brasiliensis |