Yin Lab Research

Publications

# corresponding author   | ^ UNL/NIU grad student or postdoc   | + Undergraduate student   | * co-first authors   | [P] Plant bioinformatics/genomics   | [M] Microbial bioinformatics/genomics

Overview

A snapshot of our publication record and impact.

Total Publications

131

Most Significant

57

Highlight-tagged

15

Latest Year Output

2025 (8 pubs)

Most Significant Publications

Selected highlights
  1. M Feidaki R^, Yan Y^, Yang B^, Argiriou A, Panas P, Kourkoutas Y, Hutkins R#, Yin Y#. (2026) Genomic characterization of three Lacticaseibacillus paracasei strains from Greek fermented olives reveals common and unique characteristics and three district clades, Applied and Environmental Microbiology, DOI: 10.1128/aem.00515-26

  2. M Akresi JE^, Do TVT^, Cui Z^, Shanmugam NRS^, Moraïs S, Bayer EA, Auchtung JA, and Yin Y#. (2026) Limousia bacteria encode mucinolysome for mucin utilization in animal gut microbiomes. bioRxiv [Preprint]. 2025 Oct 14:2025.10.14.682383. Preprint link, Gut Microbes, doi: https://doi.org/10.1080/19490976.2026.2645267

  3. P Feng X^, Patel R^, Yan Y^, Annabhemoju V^, Meinhardt L, Bailey B, Park S, Cohen S, Zhang D, Yin Y#. (2026) Chromosomal-scale and haplotype-resolved genome assembly of the first Trinitario hybrid cacao ICS 1, Scientific Data, doi: 10.1038/s41597-026-07054-0

  4. P Tan H, Lou Y, Yan Y^, Meinhardt L, Bailey B, Gutierrez O, Park S, Cohen S, Zhang D, Yin Y#. (2025) Near T2T haplotype-resolved genomes of cacao (Theobroma cacao) variety CCN51. Scientific Data. doi: 10.1038/s41597-025-06480-w

  5. M Hu L^, Zhang X^, Gao X, Yang M, Dong X, Cheng Y, Liang T, Yang B, Zhang Y, Yin Y#. (2025) Bacillus velezensis T971 genome informs starch degradation in tobacco. Frontiers in Microbiology. DOI 10.3389/fmicb.2025.1689015

  6. M Yan Y^, Patel R^, Shanmugam NRS^, Akresi J^, Yin Y#. (2025) dbCAN-HGM: CAZyme gene clusters in gut microbiomes of diverse human populations. Nucleic Acids Res. DOI: 10.1093/nar/gkaf1185

  7. M Khatri M^, Shanmugam NRS^, Zhang X^, Patel R^, and Yin Y#. (2025) AcrDB update: predicted 3D structures of anti-CRISPRs in human gut viromes. Protein Science. DOI: 10.1002/pro.70177

  8. M Shanmugam NRS^, Yin Y#. (2025) CAZyme3D: a database of 3D structures for carbohydrate-active enzymes. Journal of Molecular Biology. doi.org/10.1016/j.jmb.2025.169001

  9. P Feng X^, Permann C, Holzinger A, Yin Y#. (2024) High molecular weight DNA extraction from mucilage rich zygnematophycean green algae for long read DNA sequencing. BMC Methods. DOI: 10.1186/s44330-024-00018-z

  10. M Zheng J, Huang L, Yi H, Yan Y, Zhang X, Akresi J, Yin Y#. (2024) Carbohydrate-active enzyme annotation in microbiomes using dbCAN. bioRxiv. Preprint link

  11. P Nousias O^, Zheng J^, Li T^, Meinhardt L, Bailey B, Gutierrez O, Baruah IK, Cohen S, Zhang D, Yin Y#. (2024) Three de novo assembled wild cacao genomes from the Upper Amazon. Scientific Data. DOI: 10.1038/s41597-024-03215-1

  12. P Li T^, Zheng J^, Nousias O^, Meinhardt L, Goenaga R, Zhang D, Yin Y#. (2024) The cherimoya genome reveals insights into the intra-specific divergence, the evolution of Magnoliales, and a putative gene cluster for acetogenin biosynthesis. Plants 13(5):636. doi

  13. M Dexheimer S^, Shrestha N, Chapagain BS^, Bujarski J#, Yin Y#. (2024) Characterization of Multivariate RNAs Encapsidated During Infection by Two Bromoviruses Based on ViReMa. Pathogens 13(1):96. doi

  14. M Yan Y^, Zheng J^, Zhang X^, Yin Y#. (2023) dbAPIS: a database of anti-prokaryotic immune system genes. Nucleic Acids Res. doi:10.1093/nar/gkad932

  15. M Yang B*, Khatri M*, Zheng J, Deogun J, Yin Y#. (2023) Genome mining for anti-CRISPR operons using machine learning. Bioinformatics. doi:10.1093/bioinformatics/btad309

  16. M Zheng J*, Ge Q*, Yan Y, Zhang X, Huang L, Yin Y#. (2023) dbCAN3: automated carbohydrate-active enzyme and substrate annotation. Nucleic Acids Res. doi:10.1093/nar/gkad328 ★ ESI Highly Cited Hot (Top 0.1%) Paper — Biology & Biochemistry

  17. P Feng X*, Zheng J*, Irisarri I*, Yu H, Zheng B, Ali Z, de Vries S, Keller J, Fürst-Jansen J, Dadras A, Zegers J, Rieseberg TP, Ashok AD, Darienko T, Bierenbroodspot MJ, Gramzow L, Petroll R, Haas FB, Fernandez-Pozo N, Nousias O, Li T, Fitzek E, Grayburn WS, Rittmeier N, Permann C, Rümpler F, Archibald JM, Theißen G, Mower JP, Lorenz M, Buschmann H, von Schwartzenberg K, Boston L, Hayes RD, Daum C, Barry K, Grigoriev IV, Wang X, Li FW, Rensing SA, Ben-Ari J, Keren N, Mosquna A, Holzinger A, Delaux PM, Zhang C, Huang J, Mutwil M, de Vries J#, Yin Y#. (2023) Genomes of multicellular algal sisters to land plants illuminate signaling network evolution. Nature Genetics. doi.org/10.1038/s41588-024-01737-3. Highlighted at Nature Plants. ★ ESI Highly Cited (Top 1%) — Molecular Biology & Genetics

  18. M Yang B^, Zheng J^, Yin Y#. (2022) AcaFinder: genome mining of anti-CRISPR associated genes in bacteria and phages. mSystems. doi:10.1128/msystems.00817-22

  19. M Zheng J^, Hu B^, Zhang X^, Ge Q^, Yan Y^, Piyush V^, Huang L, Yin Y#. (2022) dbCAN-seq update: CAZyme gene clusters and substrates in microbiomes. Nucleic Acids Res. doi:10.1093/nar/gkac1068

  20. M Li T^, Yin Y#. (2022) Critical assessment of pan-genomics of metagenome-assembled genomes. Briefings in Bioinformatics. doi:10.1093/bib/bbac413

  21. P Zheng J^, Meinhardt L, Goenaga R, Matsumoto T, Zhang D, Yin Y#. (2021) The chromosome-level rambutan genome reveals a significant role of segmental duplication in the expansion of resistance genes. Horticulture Research 9:uhac014. doi

  22. M Zheng B^, He Y+, Zhang P+, Huo YX, Yin Y#. (2021) Polyphenol utilization proteins in human gut microbiome. Applied and Environmental Microbiology. doi Editor Spotlight Paper

  23. M Cheng X*, Yang B^*, Zheng J^, Wei H, Feng X^, Yin Y#. (2021) Cadmium stress triggers significant metabolic reprogramming in Enterococcus faecium CX 2-6. Computational and Structural Biotechnology Journal. doi

  24. M Peterson D^, Li T^, Calvo AM, Yin Y#. (2021) Categorization of orthologous gene clusters in 92 Ascomycota genomes reveals functions important for phytopathogenicity. Journal of Fungi 7(5):337. doi

  25. P Zheng J^, Meinhardt L, Goenaga R, Zhang D, Yin Y#. (2021) The chromosome-level genome of dragon fruit reveals whole genome duplication and chromosomal co-localization of betacyanin biosynthetic genes. Horticulture Research 8:63. doi

  26. P Feng X^, Holzinger A, Permann C, Anderson C, Yin Y#. (2021) Characterization of two Zygnema strains (SAG 698-1a and SAG 698-1b) and a rapid method to estimate nuclear genome size of Zygnematophycean green algae. Frontiers in Plant Science 12:610381. doi

  27. M Huang L^*, Yang B^*, Yi H^, Asif A, Wang J, Lithgow T, Zhang H, Minhas F, Yin Y#. (2020) AcrDB: a database of anti-CRISPR operons in prokaryotes and viruses. Nucleic Acids Res. doi:10.1093/nar/gkaa857

  28. M Ausland C^, Zheng J^, Yi H^, Yang B^, Li T^, Feng X^, Zheng B^, Yin Y#. (2020) dbCAN-PUL: a database of experimentally characterized CAZyme gene clusters and their substrates. Nucleic Acids Res. doi:10.1093/nar/gkaa742 ★ ESI Highly Cited (Top 1%) — Biology & Biochemistry

  29. P Fitzek E.^, Balazic R.+, Yin Y#. (2020) Bioinformatics Analysis of Plant Cell Wall Evolution. In: Popper Z. (eds) The Plant Cell Wall. Methods in Molecular Biology 2149:483–502. Humana, New York. doi

  30. M Cao H^, Shimura Y, Steffen MM, Yang Z, Lu J, Joel A, Jenkins L, Kawachi M, Yin Y#, Garcia-Pichel F#. (2020) The trait repertoire enabling cyanobacteria to bloom assessed through comparative genomic complexity and metatranscriptomics. mBio 11:e01155-20. doi

  31. M Yi H^, Huang L^, Yang B^, Zhang H, Yin Y#. (2020) AcrFinder: genome mining anti-CRISPR operons in prokaryotes and their viruses. Nucleic Acids Res. doi:10.1093/nar/gkaa351

  32. P Orton L^, Fitzek E^, Feng X^, Grayburn SW, Mower JP, Liu K, Zhang C, Duvall MR, Yin Y#. (2020) Zygnema circumcarinatum UTEX 1559 chloroplast and mitochondrial genomes provide insight into land plant evolution. Journal of Experimental Botany. doi:10.1093/jxb/eraa149

  33. M Xu J, Zhang H, Zheng J^, Dovoedo P^, Yin Y#. (2019) eCAMI: simultaneous classification and motif identification for enzyme annotation. Bioinformatics. doi:10.1093/bioinformatics/btz908

  34. M Yin Y#, Yang B^, Entwistle S^. (2019) Bioinformatics identification of anti-CRISPR loci using homology, guilt-by-association, and CRISPR self-targeting spacer approaches. mSystems. doi:10.1128/mSystems.00455-19

  35. P Fitzek E^, Orton F^, Entwistle S^, Grayburn SW, Ausland C^, Duvall DR, Yin Y#. (2019) Cell wall enzymes in Zygnema circumcarinatum UTEX 1559 respond to osmotic stress in a plant-like fashion. Frontiers in Plant Science. doi:10.3389/fpls.2019.00732

  36. M Entwistle S^, Li X^, Yin Y#. (2019) Orphan genes shared by pathogenic genomes are more associated with bacterial pathogenicity. mSystems 4(1):e00290-18. doi:10.1128/mSystems.00290-18

  37. M Zhang H*, Yohe T*+, Huang L, Entwistle S^, Wu P, Yang Z, Busk P, Xu Y, Yin Y#. (2018) dbCAN2: a meta server for automated carbohydrate-active enzyme annotation. Nucleic Acids Res. (Web server issue 2018) 46(W1):W95–W101. doi:10.1093/nar/gky418 ★ ESI Highly Cited (Top 1%) — Biology & Biochemistry [95+ citations]

  38. M Huang L, Zhang H, Wu P, Entwistle S+, Li X^, Yohe T+, Yi H, Yang Z, Yin Y#. (2017) dbCAN-seq: a database of carbohydrate-active enzyme (CAZyme) sequence and annotation. Nucleic Acids Res. (database issue 2018) 46(D1):D516–D521. doi:10.1093/nar/gkx894

  39. M Huo L, Zhang H, Huo X, Yang Y^, Li X^, Yin Y#. (2017) pHMM-tree: Phylogeny of Profile Hidden Markov Models. Bioinformatics 33(7):1093–1095. doi:10.1093/bioinformatics/btw779

  40. M Ekstrom A+, Yin Y#. (2016) ORFanFinder: automated identification of taxonomically restricted orphan genes. Bioinformatics 32(13):2053–2055. doi:10.1093/bioinformatics/btw122

  41. M Hu L, Taujale R^, Song J, Guo J, Yin Y#. (2016) Draft genome sequence of Talaromyces verruculosus ("Penicillium verruculosum") strain TS63-9, a fungus with great potential for industrial production of polysaccharide-degrading enzymes. Journal of Biotechnology 219:5–6. doi

  42. M Nguyen M+, Ekstrom A+, Li X^, Yin Y#. (2015) HGT-Finder: a new tool for horizontal gene transfer finding and application to Aspergillus genomes. Toxins 7(10):4035–4053. doi:10.3390/toxins7104035 Highly accessed · 14+ citations

  43. P Taujale R^, Yin Y#. (2015) Glycosyltransferase family 43 is also found in early eukaryotes and has three subfamilies in Charophycean green algae. PLoS ONE. doi:10.1371/journal.pone.0128409

  44. P Ekstrom A+, Taujale R^, McGinn N+, Yin Y#. (2014) PlantCAZyme: a database for plant carbohydrate-active enzymes. Database: The Journal of Biological Databases and Curation. doi:10.1093/database/bau079 NIU Undergraduate Research & Artistry Day Award · 17+ citations

  45. M Cao H^, Yin Y#. (2014) Rapid evolution of cellulosome modules by comparative analyses of five Clostridiales genomes. Bioenergy Research 7:1369–1381. doi

  46. P Yin Y#, Johns MA, Cao H^, Rupani M^. (2014) A survey of the genomes and transcriptomes of diverse plants and algae reveals new insights into the evolution and function of the cellulose synthase superfamily. BMC Genomics 15:260. doi Highly accessed · 34+ citations

  47. M Cao H^, Shimura Y, Masanobu K, Yin Y#. (2014) Draft genome sequence of the toxic bloom-forming cyanobacterium Aphanizomenon flos-aquae NIES-81. Genome Announc. 2(1):e00044-14. doi

  48. M Ma Q*, Yin Y*, Schell M, Li G, Xu Y. (2013) Computational analyses of transcriptomic data reveal the dynamic organization of the Escherichia coli chromosome under different conditions. Nucleic Acids Research 41(11):5594–5603. doi

  49. P Wang S*, Yin Y*, Ma Q, Tang X, Xu Y. Genome-scale identification of cell-wall related genes in Arabidopsis based on co-expression network analysis. BMC Plant Biology 2012;12:138. 39+ citations

  50. P Yin Y*, Mao X*, Yang JC, Chen X, Xu Y. dbCAN: a web resource for automated carbohydrate-active enzyme annotation. Nucleic Acids Res. (web server issue 2012) 40:W445–W451. doi ★ ESI Highly Cited (Top 1%) — Biology & Biochemistry · DOE Key Advance for Biofuels Production · 832+ citations

  51. P Yin Y, Huang J, Gu X, Bar-Peled M, Xu Y. (2011) Evolution of plant nucleotide-sugar interconversion enzymes. PLoS One 6(11):e27995. PubMed PMCID: PMC3220709. 40+ citations

  52. M Zhang H*, Yin Y*, Olman V, Xu Y. (2012) Genomic arrangement of regulons in bacterial genomes. PLoS One 7(1):e29496. PubMed PMCID: PMC3250446. 40+ citations

  53. P Yin Y, Mohnen D, Xu Y, Hahn M. (2011) Glycosyltransferase Family 8 (GT8). Book chapter of Plant Cell Wall Polysaccharides: Biosynthesis and Bioengineering. Annual Plant Review series (Blackwell) 41:167–212. 16+ citations

  54. P Yin Y, Chen H, Hahn MG, Mohnen D, Xu Y. (2010) Evolution and function of the plant cell wall synthesis-related glycosyltransferase family 8. Plant Physiol. 153(4):1729–1746. PubMed PMCID: PMC2923890. 83+ citations

  55. M Yin Y*, Zhang H*, Olman V, Xu Y. (2010) Genomic arrangement of bacterial operons is constrained by biological pathways encoded in the genome. PNAS 107(14):6310–6315. PubMed PMCID: PMC2851940. 38+ citations

  56. P Yin Y, Huang J, Xu Y. (2009) The cellulose synthase superfamily in fully sequenced plants and algae. BMC Plant Biol. 9:99. PubMed PMCID: PMC3091534. Highly accessed · 115+ citations

  57. P Shen H*, Yin Y*, Chen F, Xu Y, Dixon R. (2009) A bioinformatic analysis of NAC genes for plant cell wall development in relation to lignocellulosic bioenergy production. Bioenergy Research 2(4):217–232. 112+ citations

  58. M Yin Y, Fischer D. (2008) Identification and investigation of ORFans in the viral world. BMC Genomics 9:24. PubMed PMCID: PMC2245933. 78+ citations

  59. M Yin Y, Fischer D. (2006) On the origin of microbial ORFans: quantifying the strength of the evidence for viral lateral transfer. BMC Evol Biol. 6:63. PubMed PMCID: PMC1559721. 52+ citations

Complete Publication List

2026 4 publications
  1. Feidaki R^, Yan Y^, Yang B^, Argiriou A, Panas P, Kourkoutas Y, Hutkins R#, Yin Y#. (2026) Genomic characterization of three Lacticaseibacillus paracasei strains from Greek fermented olives reveals common and unique characteristics and three district clades. Applied and Environmental Microbiology. doi: 10.1128/aem.00515-26

  2. Feng X^, Patel R^, Yan Y^, Annabhemoju V^, Meinhardt L, Bailey B, Park S, Cohen S, Zhang D, Yin Y#. (2026) Chromosomal-scale and haplotype-resolved genome assembly of the first Trinitario hybrid cacao ICS 1. Scientific Data doi: 10.1038/s41597-026-07054-0

  3. Chen H, …, Zhang X^, Lin X, Yin Y, Huang X, Zheng J, Liang T. (2026) Causal Prediction of TP53 Variant Pathogenicity Using a Perturbation-informed Protein Language Model. Advanced Science doi:10.1002/advs.202516332

2025 8 publications
  1. Tan H, Lou Y, Yan Y, Meinhardt L, Bailey B, Gutierrez O, Park S, Cohen S, Zhang D, Yin Y#. (2025) Near T2T haplotype-resolved genomes of cacao (Theobroma cacao) variety CCN51. Scientific Data. doi: 10.1038/s41597-025-06480-w

  2. Hu L^, Zhang X^, Gao X, Yang M, Dong X, Cheng Y, Liang T, Yang B, Zhang Y, Yin Y#. (2025) Bacillus velezensis T971 genome informs starch degradation in tobacco. Frontiers in Microbiology. DOI 10.3389/fmicb.2025.1689015

  3. Akresi JE^, Do TVT^, Cui Z^, Shanmugam NRS^, Moraïs S, Bayer EA, Auchtung JA, and Yin Y#. (2026) Limousia bacteria encode mucinolysome for mucin utilization in animal gut microbiomes. bioRxiv [Preprint]. 2025 Oct 14:2025.10.14.682383. Preprint link, Gut Microbes, doi: https://doi.org/10.1080/19490976.2026.2645267

  4. Yan Y, Patel R, Shanmugam NRS, Akresi J, Yin Y#. (2025) dbCAN-HGM: CAZyme gene clusters in gut microbiomes of diverse human populations. Nucleic Acids Res. DOI: 10.1093/nar/gkaf1185

  5. Lyu H, Song J, Yin Y, Wang ML, Matsumoto T, Annabhemoju V, Meinhardt LW, Park S, Lim S, Zhang D, Yu Q. (2025) A chromosome-level genome assembly of Coffea arabica L. var. 'Kona Typica'. Scientific Data 12(1):1314. doi: 10.1038/s41597-025-05658-6

  6. Li W, Huang B, Guo M, Zeng Z, Cai T, Feng L, Zhang X^, Guo L, Jiang X, Yin Y, Wang E, Huang X, Zheng J. (2025) Unveiling the Evolution of Antimicrobial Peptides in Gut Microbes via Foundation Model-Powered Framework. Cell Reports 44(6):115773. doi: 10.1016/j.celrep.2025.115773

  7. Khatri M^, Shanmugam NRS^, Zhang X^, Patel R^, and Yin Y#. (2025) AcrDB update: predicted 3D structures of anti-CRISPRs in human gut viromes. Protein Science. DOI: 10.1002/pro.70177

  8. Shanmugam NRS^, Yin Y#. (2025) CAZyme3D: a database of 3D structures for carbohydrate-active enzymes. Journal of Molecular Biology. doi.org/10.1016/j.jmb.2025.169001

2024 10 publications
  1. Feng X^, Permann C, Holzinger A, Yin Y#. (2024) High molecular weight DNA extraction from mucilage rich zygnematophycean green algae for long read DNA sequencing. BMC Methods. DOI: 10.1186/s44330-024-00018-z

  2. Kang C, Zhang H, Yin Y. (2024) A dual-modality complex-valued fusion method for predicting side effects of drug-drug interactions based on graph neural network. IEEE Journal of Biomedical and Health Informatics. DOI: 10.1109/JBHI.2024.3422673

  3. Khatri M, Yin Y, Deogun J. (2024) Enhancing Interpretability In Medical Image Classification By Integrating Formal Concept Analysis With Convolutional Neural Networks. Biomimetics 9(7):421. doi

  4. Li M, Guo H, Wang K, Kang C, Yin Y, Zhang H. (2024) AVBAE-MODFR: A novel deep learning framework of embedding and feature selection on multi-omics data for pan-cancer classification. Computers in Biology and Medicine 177:108614. doi: 10.1016/j.compbiomed.2024.108614

  5. Zheng J, Huang L, Yi H, Yan Y, Zhang X, Akresi J, Yin Y#. (2024) Carbohydrate-active enzyme annotation in microbiomes using dbCAN. bioRxiv. Preprint link

  6. Nousias O^, Zheng J^, Li T^, Meinhardt L, Bailey B, Gutierrez O, Baruah IK, Cohen S, Zhang D, Yin Y#. (2024) Three de novo assembled wild cacao genomes from the Upper Amazon. Scientific Data. DOI: 10.1038/s41597-024-03215-1

  7. Zhan X, Yin Y, Zhang H. (2024) BERMAD: batch effect removal for single-cell RNA-seq data using a multi-layer adaptation autoencoder with dual-channel framework. Bioinformatics. doi:10.1093/bioinformatics/btae127

  8. Li T^, Zheng J^, Nousias O^, Meinhardt L, Goenaga R, Zhang D, Yin Y#. (2024) The cherimoya genome reveals insights into the intra-specific divergence, the evolution of Magnoliales, and a putative gene cluster for acetogenin biosynthesis. Plants 13(5):636. doi

  9. Dexheimer S^, Shrestha N, Chapagain BS^, Bujarski J#, Yin Y#. (2024) Characterization of Multivariate RNAs Encapsidated During Infection by Two Bromoviruses Based on ViReMa. Pathogens 13(1):96. doi

  10. Feng X*, Zheng J*, Irisarri I*, Yu H, Zheng B, Ali Z, de Vries S, Keller J, Fürst-Jansen J, Dadras A, Zegers J, Rieseberg TP, Ashok AD, Darienko T, Bierenbroodspot MJ, Gramzow L, Petroll R, Haas FB, Fernandez-Pozo N, Nousias O, Li T, Fitzek E, Grayburn WS, Rittmeier N, Permann C, Rümpler F, Archibald JM, Theißen G, Mower JP, Lorenz M, Buschmann H, von Schwartzenberg K, Boston L, Hayes RD, Daum C, Barry K, Grigoriev IV, Wang X, Li FW, Rensing SA, Ben-Ari J, Keren N, Mosquna A, Holzinger A, Delaux PM, Zhang C, Huang J, Mutwil M, de Vries J#, Yin Y#. (2023/2024) Genomes of multicellular algal sisters to land plants illuminate signaling network evolution. Nature Genetics. doi.org/10.1038/s41588-024-01737-3. Highlighted at Nature Plants. ★ ESI Highly Cited (Top 1%) — Molecular Biology & Genetics

2023 7 publications
  1. Han M, Yin Y, Zhang H, Wang H. (2023) Hierarchical Semantic Augmentation Graph Neural Network for Drug-Disease Association Prediction. IEEE BIBM 2023. DOI: 10.1109/BIBM58861.2023.10385961

  2. Quan X, Guo X, Zhang H, Han M, Li Z, and Yin Y. (2023) Unsupervised Feature Selection by Fusing Spectral Clustering and Locality Preserving Projection. IEEE BIBM 2023. DOI: 10.1109/BIBM58861.2023.10385296

  3. Yan Y^, Zheng J^, Zhang X^, Yin Y#. (2023) dbAPIS: a database of anti-prokaryotic immune system genes. Nucleic Acids Res. doi:10.1093/nar/gkad932

  4. Sandesh SP, Zheng J^, Yin Y, Lorber S, Puel O, Dhingra S, Espeso EA, Calvo AM. (2023) Homeobox Transcription Factor HbxA Influences Expression of over One Thousand Genes in the Model Fungus Aspergillus nidulans. PLoS ONE 18(7):e0286271. doi

  5. Han M, Li W, Zhang H, Yin Y. (2023) Semantic-guided Graph Neural Network for Heterogeneous Graph Embedding. Expert Systems with Applications 232:120810. doi

  6. Yang B*, Khatri M*, Zheng J, Deogun J, Yin Y#. (2023) Genome mining for anti-CRISPR operons using machine learning. Bioinformatics. doi:10.1093/bioinformatics/btad309

  7. Zheng J*, Ge Q*, Yan Y, Zhang X, Huang L, Yin Y#. (2023) dbCAN3: automated carbohydrate-active enzyme and substrate annotation. Nucleic Acids Res. doi:10.1093/nar/gkad328 ★ ESI Highly Cited Hot (Top 0.1%) — Biology & Biochemistry

2022 6 publications
  1. Huang W, Zhang H, Li Z, and Yin Y. (2022) Mutual Information Estimation-Based Disentangled Representation Network for Medical Image Fusion. IEEE BIBM 2022, Las Vegas, pp. 991–998. doi: 10.1109/BIBM55620.2022.9995221 Best Student Paper Award

  2. Yang B^, Zheng J^, Yin Y#. (2022) AcaFinder: genome mining of anti-CRISPR associated genes in bacteria and phages. mSystems. doi:10.1128/msystems.00817-22

  3. Zheng J^, Hu B^, Zhang X^, Ge Q^, Yan Y^, Piyush V^, Huang L, Yin Y#. (2022) dbCAN-seq update: CAZyme gene clusters and substrates in microbiomes. Nucleic Acids Res. doi:10.1093/nar/gkac1068

  4. Wei H, Li Y, Yan J, Peng S, Wei S, Yin Y, Li K, Cheng X. (2022) Root cell wall remodeling: a way for exopolysaccharides to mitigate cadmium toxicity in rice seedling. Journal of Hazardous Materials.

  5. Li W, Zhang H, Li M, Han M, Yin Y. (2022) MGEGFP: a multi-view graph embedding method for gene function prediction based on adaptive estimation with GCN. Briefings in Bioinformatics. doi:10.1093/bib/bbac333

  6. Li T^, Yin Y#. (2022) Critical assessment of pan-genomics of metagenome-assembled genomes. Briefings in Bioinformatics. doi:10.1093/bib/bbac413

2021 12 publications
  1. Zheng J^, Meinhardt L, Goenaga R, Matsumoto T, Zhang D, Yin Y#. (2021) The chromosome-level rambutan genome reveals a significant role of segmental duplication in the expansion of resistance genes. Horticulture Research 9:uhac014. doi

  2. Wang X, Wang J, Huang S, Zhang H, Yin Y. (2021) HDMC: a novel deep learning based framework for removing batch effects in single-cell RNA-seq data. Bioinformatics. doi:10.1093/bioinformatics/btab821

  3. Zheng B^, He Y+, Zhang P+, Huo YX, Yin Y#. (2021) Polyphenol utilization proteins in human gut microbiome. Applied and Environmental Microbiology. doi Editor Spotlight Paper

  4. Kang C, Zhang H, Liu Z, Huang S, Yin Y. (2021) LR-GNN: a graph neural network based on link representation for predicting molecular associations. Briefings in Bioinformatics. doi:10.1093/bib/bbab513

  5. Jin C, Shi Z, Zhang H, and Yin Y. (2021) Predicting lncRNA–protein interactions based on graph autoencoders and collaborative training. IEEE BIBM 2021.

  6. Li W, Zhang H, Zhao Q, Liu J, and Yin Y. (2021) Adversarial Dual-Channel Variational Graph Autoencoder for Synthetic Lethality Prediction in Human Cancers. IEEE BIBM 2021.

  7. Cheng X*, Yang B^*, Zheng J^, Wei H, Feng X^, Yin Y#. (2021) Cadmium stress triggers significant metabolic reprogramming in Enterococcus faecium CX 2-6. Computational and Structural Biotechnology Journal. doi

  8. Shi Z, Zhang H, Jin C, Quan X, Yin Y. (2021) A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations. BMC Bioinformatics 22:136. doi

  9. Peterson D^, Li T^, Calvo AM, Yin Y#. (2021) Categorization of orthologous gene clusters in 92 Ascomycota genomes reveals functions important for phytopathogenicity. Journal of Fungi 7(5):337. doi

  10. Zheng J^, Meinhardt L, Goenaga R, Zhang D, Yin Y#. (2021) The chromosome-level genome of dragon fruit reveals whole genome duplication and chromosomal co-localization of betacyanin biosynthetic genes. Horticulture Research 8:63. doi

  11. Feng X^, Holzinger A, Permann C, Anderson C, Yin Y#. (2021) Characterization of two Zygnema strains (SAG 698-1a and SAG 698-1b) and a rapid method to estimate nuclear genome size of Zygnematophycean green algae. Frontiers in Plant Science 12:610381. doi

  12. Li W, Zhao Q, Zhang H, Quan X, Xu J, and Yin Y. (2021) Bayesian Multi-scale Convolutional Neural Network for Motif Occupancy Identification. IEEE BIBM 2020, pp. 293–298. doi: 10.1109/BIBM49941.2020.9313556

2020 8 publications
  1. Huang L^*, Yang B^*, Yi H^, Asif A, Wang J, Lithgow T, Zhang H, Minhas F, Yin Y#. (2020) AcrDB: a database of anti-CRISPR operons in prokaryotes and viruses. Nucleic Acids Res. doi:10.1093/nar/gkaa857

  2. Ausland C^, Zheng J^, Yi H^, Yang B^, Li T^, Feng X^, Zheng B^, Yin Y#. (2020) dbCAN-PUL: a database of experimentally characterized CAZyme gene clusters and their substrates. Nucleic Acids Res. doi:10.1093/nar/gkaa742 ★ ESI Highly Cited (Top 1%) — Biology & Biochemistry

  3. Fitzek E.^, Balazic R.+, Yin Y#. (2020) Bioinformatics Analysis of Plant Cell Wall Evolution. In: Popper Z. (eds) The Plant Cell Wall. Methods in Molecular Biology, vol 2149. Humana, New York. doi

  4. Cao H^, Shimura Y, Steffen MM, Yang Z, Lu J, Joel A, Jenkins L, Kawachi M, Yin Y#, Garcia-Pichel F#. (2020) The trait repertoire enabling cyanobacteria to bloom assessed through comparative genomic complexity and metatranscriptomics. mBio 11:e01155-20. doi

  5. Wang X, Yi H, Wang J, Zhan D, Yin Y, Zhang H. (2020) GDASC: A GPU parallel based web server for detecting hidden batch factors. Bioinformatics. doi:10.1093/bioinformatics/btaa427

  6. Yi H^, Huang L^, Yang B^, Zhang H, Yin Y#. (2020) AcrFinder: genome mining anti-CRISPR operons in prokaryotes and their viruses. Nucleic Acids Res. doi:10.1093/nar/gkaa351

  7. Orton L^, Fitzek E^, Feng X^, Grayburn SW, Mower JP, Liu K, Zhang C, Duvall MR, Yin Y#. (2020) Zygnema circumcarinatum UTEX 1559 chloroplast and mitochondrial genomes provide insight into land plant evolution. Journal of Experimental Botany. doi:10.1093/jxb/eraa149

  8. Becker B, Feng X^, Yin Y, Holzinger A. (2020) Desiccation tolerance in streptophyte algae and the algae to land plant transition: Evolution of LEA and MIP protein families within the Viridiplantae. Journal of Experimental Botany. doi:10.1093/jxb/eraa105

2019 8 publications
  1. Su X, Xu J, Yin Y, Quan X, Zhang H. (2019) Antimicrobial Peptide Identification Using Multi-scale Convolutional Network. BMC Bioinformatics 20:730. doi

  2. Xu J, Zhang H, Zheng J^, Dovoedo P^, Yin Y#. (2019) eCAMI: simultaneous classification and motif identification for enzyme annotation. Bioinformatics. doi:10.1093/bioinformatics/btz908

  3. Li W, Yin Y, Quan X, Zhang H. (2019) Gene Expression Value Prediction Based on XGBoost Algorithm. Frontiers in Genetics. doi

  4. Satterlee T, Entwistle S, Yin Y, Cary J, Lebar M, Losada L, Calvo A. (2019) rmtA-Dependent Transcriptome and its Role in Secondary Metabolism, Environmental Stress, and Virulence in Aspergillus flavus. G3: Genes Genomes Genetics. doi

  5. Yin Y#, Yang B^, Entwistle S^. (2019) Bioinformatics identification of anti-CRISPR loci using homology, guilt-by-association, and CRISPR self-targeting spacer approaches. mSystems. doi:10.1128/mSystems.00455-19

  6. Fitzek E^, Orton F^, Entwistle S^, Grayburn SW, Ausland C^, Duvall DR, Yin Y#. (2019) Cell wall enzymes in Zygnema circumcarinatum UTEX 1559 respond to osmotic stress in a plant-like fashion. Frontiers in Plant Science. doi

  7. Han W, Fan X, Teng L, Kaczurowski M, Zhang X, Xu D, Yin Y, Ye X. (2018/2019) Classification, Evolution and Function of putative Xylosyltransferases from Algae. Protoplasma. doi:10.1007/s00709-019-01358-2

  8. Entwistle S^, Li X^, Yin Y#. (2019) Orphan genes shared by pathogenic genomes are more associated with bacterial pathogenicity. mSystems 4(1):e00290-18. doi

2018 6 publications
  1. Cary J, Entwistle S, Satterlee T, Mack B, Gilbert M, Chang P, Scharfenstein L, Yin Y, Calvo A. (2018) The transcriptional regulator Hbx1 affects the expression of thousands of genes in the aflatoxin-producing fungus Aspergillus flavus. G3: Genes Genomes Genetics.

  2. Wu G, Jurick W, Yin G, Yu J, Peng H, Gaskins V, Yin Y, Hua S, Peter K, Bennett J. (2018) Penicillium expansum and Penicillium solitum comparative genomics yields new insights into phylogeny and candidate genes involved in postharvest apple-fungus interactions. PeerJ. DOI: 10.7287/peerj.preprints.27430v1

  3. Lyu Q*, Zhang K, Zhu Q, Li Z, Liu Y, Fitzek E^, Yohe Y+, Zhao L, Li W, Liu T, Yin Y, Liu W. (2018) Structural and biochemical characterization of a multidomain alginate lyase reveals a novel role of CBM32 in CAZymes. Biochim Biophys Acta 1862(9):1862–1869.

  4. Driscoll CB, Meyer KA, Šulčius S, Brown NM, Dick GJ, Cao H^, Gasiūnas G, Timinskas A, Yin Y, Landry ZC, Otten TG, Davis TW, Watson SB, Dreher TW. (2018) A closely-related clade of globally distributed bloom-forming cyanobacteria within the Nostocales. Harmful Algae 77:93–107.

  5. Zhang H*, Yohe T*+, Huang L, Entwistle S^, Wu P, Yang Z, Busk P, Xu Y, Yin Y#. (2018) dbCAN2: a meta server for automated carbohydrate-active enzyme annotation. Nucleic Acids Res. (Web server issue 2018) 46(W1):W95–W101. doi:10.1093/nar/gky418 ★ ESI Highly Cited (Top 1%) — Biology & Biochemistry

  6. Han W, Ibarra G, Gupta M, Yin Y, Elsawa S. (2018) Elevated Expression of GLI3 in Germinal Center Diffuse Large B Cell Lymphoma. Leukemia and Lymphoma. doi

2017 5 publications
  1. Huang L, Zhang H, Wu P, Entwistle S+, Li X^, Yohe T+, Yi H, Yang Z, Yin Y#. (2017) dbCAN-seq: a database of carbohydrate-active enzyme (CAZyme) sequence and annotation. Nucleic Acids Res. (database issue 2018) 46(D1):D516–D521. doi:10.1093/nar/gkx894

  2. Yang X, …, Fitzek E^, …, Yin Y, …, Tuskan GA. (2017) The Kalanchoë genome provides insights into convergent evolution and building blocks of crassulacean acid metabolism. Nature Communications 8:1899. doi

  3. Jiang X, Zhang H, Quan X, Yin Y. (2017) A Heterogeneous Networks Fusion Algorithm Based on Local Topological Information for Neurodegenerative Disease. Current Bioinformatics 12(5):387–397.

  4. Jiang X, Quan X, Zhang H, Zhan D, Yin Y. (2017) Disease-related Gene Module Detection Based on A Multi-label Propagation Clustering Algorithm. PLoS One 12(5):e0178006. doi

  5. Huo L, Zhang H, Huo X, Yang Y^, Li X^, Yin Y#. (2017) pHMM-tree: Phylogeny of Profile Hidden Markov Models. Bioinformatics 33(7):1093–1095. doi:10.1093/bioinformatics/btw779

2016 4 publications
  1. Wysocki W, Riuz-Sanches E, Yin Y, Duvall M. (2016) The floral transcriptomes of four bamboo species (Bambusoideae; Poaceae): support for common ancestry among woody bamboos. BMC Genomics 17:384. doi

  2. Yu J, Wu G, Jurick WM Jr, Gaskins VL, Yin Y, Yin G, Bennett JW, Shelton DR. (2016) Genome sequence of Penicillium solitum RS1, which causes postharvest apple decay. Genome Announc. 4(3):e00363-16. doi

  3. Ekstrom A+, Yin Y#. (2016) ORFanFinder: automated identification of taxonomically restricted orphan genes. Bioinformatics 32(13):2053–2055. doi

  4. Hu L, Taujale R^, Song J, Guo J, Yin Y#. (2016) Draft genome sequence of Talaromyces verruculosus ("Penicillium verruculosum") strain TS63-9. Journal of Biotechnology 219:5–6. doi

2015 6 publications
  1. Yang X, Leeben-Mack J, Chen F, Yin Y. (2015) Plant Comparative and Functional Genomics. International Journal of Genomics 2015:924369. doi

  2. Nguyen M+, Ekstrom A+, Li X^, Yin Y#. (2015) HGT-Finder: a new tool for horizontal gene transfer finding and application to Aspergillus genomes. Toxins 7(10):4035–4053. doi Highly accessed

  3. Cary JW*, Han Z*, Yin Y*, Lohmar JM, Shantappa S, Harris-Coward PY, Mack B, Ehrlich KC, Wei Q, Arroyo-Manzanares N, Uka V, Vanhaecke L, Bhatnagar D, Yu J, Nierman WC, Johns MA, Sorensen D+, Shen H^, De Saeger S, Diana Di Mavungu J, Calvo AM. (2015) Transcriptome analysis of Aspergillus flavus reveals veA-dependent regulation of secondary metabolite gene clusters, including the novel aflavarin cluster. Eukaryot Cell 14:983–997. doi Articles of Significant Interest — Editors' Selection

  4. Taujale R^, Yin Y#. (2015) Glycosyltransferase family 43 is also found in early eukaryotes and has three subfamilies in Charophycean green algae. PLoS ONE. doi

  5. Cao H^, Ekstrom A+, Yin Y#. (2015) Plant carbohydrate active enzyme (CAZyme) repertoires: a comparative study. Invited book chapter in Advances in the Understanding of Biological Sciences using Next Generation Sequencing (NGS) Approaches, Springer. doi, pp. 115–134.

  6. Cui J, Yin Y, Ma Q, Wang G, Olman V, Zhang Y, Chou WC, Hong CS, Zhang C, Cao S, Mao X, Li Y, Qin S, Zhao S, Jiang J, Hastings P, Li F, Xu Y. (2015) Comprehensive Characterization of the Genomic Alterations in Human Gastric Cancer. Int J Cancer. doi

2014 9 publications
  1. Ekstrom A+, Taujale R^, McGinn N+, Yin Y#. (2014) PlantCAZyme: a database for plant carbohydrate-active enzymes. Database: The Journal of Biological Databases and Curation. doi NIU Undergraduate Research & Artistry Day Award

  2. Zhao Q, Zeng Y, Yin Y, Pu Y, Jackson LA, Engle NL, Martin MZ, Tschaplinski TJ, Ding SY, Ragauskas AJ, Dixon RA. (2014) Pinoresinol reductase 1 impacts lignin distribution during secondary cell wall biosynthesis in Arabidopsis. Phytochemistry. doi

  3. Yu J, Jurick WM, Cao H^, Yin Y, Gaskins VL, Losada L, Zafar N, Kim M, Bennett JW, Nierman WC. (2014) Draft genome sequence of Penicillium expansum (R19) that causes postharvest decay of apple fruit. Genome Announcements 2(3):e00635-14. doi

  4. Zhou C, Ma Q, Mao X, Yin Y#, Xu Y#. (2014) New Insights Revealed through Comparative Genomic Analysis of 40 Clostridium Genomes. Bioenergy Research 7:1481–1492. doi

  5. Zhou C, Mao F, Yin Y, Huang J, Gogarten JP, et al. (2014) AST: An Automated Sequence-Sampling Method for Improving the Taxonomic Diversity of Gene Phylogenetic Trees. PLoS ONE 9(6):e98844. doi

  6. Cao H^, Yin Y#. (2014) Rapid evolution of cellulosome modules by comparative analyses of five Clostridiales genomes. Bioenergy Research 7:1369–1381. doi

  7. Yin Y#, Johns MA, Cao H^, Rupani M^. (2014) A survey of the genomes and transcriptomes of diverse plants and algae reveals new insights into the evolution and function of the cellulose synthase superfamily. BMC Genomics 15:260. doi Highly accessed · 34+ citations

  8. Cao H^, Shimura Y, Masanobu K, Yin Y#. (2014) Draft genome sequence of the toxic bloom-forming cyanobacterium Aphanizomenon flos-aquae NIES-81. Genome Announc. 2(1):e00044-14. doi

  9. Escamilla-Trevino L, Shen H, Hernandez T, Yin Y, Xu Y, and Dixon RA. (2014) Early lignin pathway enzymes and routes to chlorogenic acid in switchgrass (Panicum virgatum L.). Plant Mol Biol. 84(4-5):565–576. doi

2013 6 publications
  1. Zhao Q, Nakashima Q, Chen F, Yin Y, Yun J, Shao H, Wang X, and Dixon RA. (2013) LACCASE is necessary and non-redundant with PEROXIDASE for lignin polymerization during vascular development in Arabidopsis thaliana. Plant Cell 25(10):3976–3987. doi ★ ESI Highly Cited (Top 1%) — Plant & Animal Science · Faculty 1000 Selection

  2. Liu J, Duan X, Sun J, Yin Y, Li G, Wang L, Liu B. (2013) Bi-factor Analysis Based on Noise-reduction (BIFANR): A New Algorithm for Detecting Coevolving Amino Acid Sites in Proteins. PLoS ONE 8(11):e79764. doi

  3. Ma Q, Liu B, Zhou C, Yin Y, Li G, Xu Y. (2013) An integrated toolkit for accurate prediction and analysis of cis-regulatory motifs at a genome scale. Bioinformatics 29(18):2261–2268.

  4. Yin Y#. (2013) Databases for bioenergy-related enzymes. Chapter 6 of Bioenergy Research: Advances & Applications, Elsevier Publishing, edited by Gupta VJ et al., pp. 95–107.

  5. Ma Q*, Yin Y*, Schell M, Li G, Xu Y. (2013) Elucidation of the Dynamic Supercoil Structures of folded E. coli Chromosome: a computational approach. Nucleic Acids Research 41(11):5594–5603.

  6. Zhang JY, Lee YC, Torres-Jerez I, Wang M, Yin Y, Chou WC, He J, Shen H, Srivastava AC, Pennacchio C, Lindquist E, Grimwood J, Schmutz J, Xu Y, Sharma M, Sharma R, Bartley LE, Ronald PC, Saha MC, Dixon RA, Tang Y, Udvardi MK. (2013) Development of an integrated transcript sequence database and a gene expression atlas for gene discovery and analysis in switchgrass (Panicum virgatum L.). Plant J. 74(1):160–173. doi

2012 7 publications
  1. Wang S*, Yin Y*, Ma Q, Tang X, Xu Y. (2012) Genome-scale identification of cell-wall related genes in Arabidopsis based on co-expression network analysis. BMC Plant Biology 12:138.

  2. Li G, Köllner TG, Yin Y, Jiang Y, Xu Y, Gershenzon J, Pichersky E, Chen F. (2012) Nonseed plant Selaginella moellendorffii has both seed plant and microbial types of terpene synthases. PNAS 109(36):14711–14715. Faculty 1000 — Must Read

  3. Blumer-Schuette SE, Giannone RJ, Zurawski JV, Ozdemir I, Mistry DB, Ma Q, Yin Y, Xu Y, Poole FL II, Kataeva I, Adams MWW, Hamilton-Brehm SD, Elkins JG, Larimer FW, Land ML, Hauser L, Cottingham RW, Hettich RL, and Kelly RM. (2012) Caldicellulosiruptor core and pan genomes reveal determinants for non-cellulosomal thermophilic deconstruction of lignocellulose. J Bacteriol. 194(15):4015–4028.

  4. Mao X, Zhang H, Yin Y, Xu Y. (2012) The percentage of bacterial genes on leading versus lagging strands is influenced by multiple balancing forces. Nucleic Acids Res. 40(17):8210–8218.

  5. Yin Y*, Mao X*, Yang JC, Chen X, Xu Y. (2012) dbCAN: a web resource for automated carbohydrate-active enzyme annotation. Nucleic Acids Res. (web server issue) 40:W445–W451. doi ★ ESI Highly Cited (Top 1%) — Biology & Biochemistry · DOE Key Advance for Biofuels Production · 832+ citations

  6. Zhang H*, Yin Y*, Olman V, Xu Y. (2012) Genomic arrangement of regulons in bacterial genomes. PLoS One 7(1):e29496. PubMed PMCID: PMC3250446.

  7. Kulkarni AR, Peña MJ, Avci U, Mazumder K, Urbanowicz BR, Pattathil S, Yin Y, O'Neill MA, Roberts AW, Hahn MG, Xu Y, Darvill AG, York WS. (2012) The ability of land plants to synthesize glucuronoxylans predates the evolution of tracheophytes. Glycobiology 22(3):439–451. PubMed PMID: 22048859.

2011 5 publications
  1. Yin Y, Huang J, Gu X, Bar-Peled M, Xu Y. (2011) Evolution of plant nucleotide-sugar interconversion enzymes. PLoS One 6(11):e27995. PubMed PMCID: PMC3220709.

  2. Yin Y, Mohnen D, Xu Y, Hahn M. (2011) Glycosyltransferase Family 8 (GT8). Book chapter of Plant Cell Wall Polysaccharides: Biosynthesis and Bioengineering. Annual Plant Review series (Blackwell) 41:167–212.

  3. Li G, Ma Q, Mao X, Yin Y, Zhu X, Xu Y. (2011) Integration of sequence-similarity and functional association information can overcome intrinsic problems in orthology mapping across bacterial genomes. Nucleic Acids Res. 39(22):e150. PubMed PMCID: PMC3239196.

  4. Kong Y, Zhou G, Yin Y, Xu Y, Pattathil S, Hahn MG. (2011) Molecular analysis of a family of Arabidopsis genes related to galacturonosyltransferases. Plant Physiol. 155(4):1791–1805. PubMed PMCID: PMC3091093.

  5. Dam P, Kataeva I, Yang SJ, Zhou F, Yin Y, Chou W, Poole FL 2nd, Westpheling J, Hettich R, Giannone R, Lewis DL, Kelly R, Gilbert HJ, Henrissat B, Xu Y, Adams MW. (2011) Insights into plant biomass conversion from the genome of the anaerobic thermophilic bacterium Caldicellulosiruptor bescii DSM 6725. Nucleic Acids Res. 39(8):3240–3254. PubMed PMCID: PMC3082886.

2010 7 publications
  1. Yin Y, Chen H, Hahn MG, Mohnen D, Xu Y. (2010) Evolution and function of the plant cell wall synthesis-related glycosyltransferase family 8. Plant Physiol. 153(4):1729–1746. PubMed PMCID: PMC2923890.

  2. Yin Y*, Zhang H*, Olman V, Xu Y. (2010) Genomic arrangement of bacterial operons is constrained by biological pathways encoded in the genome. PNAS 107(14):6310–6315. PubMed PMCID: PMC2851940.

  3. Chou WC, Yin Y, Xu Y. (2010) GolgiP: prediction of Golgi-resident proteins in plants. Bioinformatics 26(19):2464–2465. PubMed PMCID: PMC2944200.

  4. Zhou C, Yin Y, Dam P, Xu Y. (2010) Identification of novel proteins involved in plant cell-wall synthesis based on protein-protein interaction data. J Proteome Res. 9(10):5025–5037.

  5. Zhao Q, Wang H, Yin Y, Xu Y, Chen F, Dixon RA. (2010) Syringyl lignin biosynthesis is directly regulated by a secondary cell wall master switch. PNAS 107(32):14496–14501. PubMed PMCID: PMC2922545.

  6. Gu X, Glushka J, Yin Y, Xu Y, Denny T, Smith J, Jiang Y, Bar-Peled M. (2010) Identification of a bifunctional UDP-4-keto-pentose/UDP-xylose synthase in the plant pathogenic bacterium Ralstonia solanacearum strain GMI1000. J Biol Chem. 285(12):9030–9040. PubMed PMCID: PMC2838324.

  7. Yang SJ, Kataeva I, Wiegel J, Yin Y, Dam P, Xu Y, Westpheling J, Adams MW. (2010) Classification of 'Anaerocellum thermophilum' strain DSM 6725 as Caldicellulosiruptor bescii sp. nov. Int J Syst Evol Microbiol. 60(Pt 9):2011–2015.

2009 6 publications
  1. Yin Y, Huang J, Xu Y. (2009) The cellulose synthase superfamily in fully sequenced plants and algae. BMC Plant Biol. 9:99. PubMed PMCID: PMC3091534. Highly accessed · 115+ citations

  2. Shen H*, Yin Y*, Chen F, Xu Y, Dixon R. (2009) A bioinformatic analysis of NAC genes for plant cell wall development in relation to lignocellulosic bioenergy production. Bioenergy Research 2(4):217–232.

  3. Escamilla-Treviño LL, Shen H, Uppalapati SR, Ray T, Tang Y, Hernandez T, Yin Y, Xu Y, Dixon RA. (2009) Switchgrass (Panicum virgatum) possesses a divergent family of cinnamoyl CoA reductases with distinct biochemical properties. New Phytol. 185(1):143–155.

  4. Mao F, Yin Y, Zhou F, Chou WC, Chen H, Zhou C, Dam P, Xu Y. (2009) Computational annotation of plant and algae genomes with an emphasis on cell wall biosynthesis. Bioenergy Research 2(4):209–216.

  5. Kong Y, Zhou G, Avci U, Gu X, Jones C, Yin Y, Xu Y, Hahn MG. (2009) Two poplar glycosyltransferase genes, PdGATL1.1 and PdGATL1.2, are functional orthologs to PARVUS/AtGATL1 in Arabidopsis. Mol Plant 2(5):1040–1050.

  6. Kataeva IA, Yang SJ, Dam P, Poole FL 2nd, Yin Y, Zhou F, Chou WC, Xu Y, Goodwin L, Sims DR, Detter JC, Hauser LJ, Westpheling J, Adams MW. (2009) Genome sequence of the anaerobic, thermophilic, and cellulolytic bacterium "Anaerocellum thermophilum" DSM 6725. J Bacteriol. 191(11):3760–3761. PubMed PMCID: PMC2681903.

2006–2008 5 publications
  1. Yin Y, Fischer D. (2008) Identification and investigation of ORFans in the viral world. BMC Genomics 9:24. PubMed PMCID: PMC2245933.

  2. Huang ML, Sun DC, Lou YX, Yin Y, Li CY, Zhang Y, Gao G, Wang SQ, Bo XC, Wei LP, Li SG. (2008) Analysis and Preparation of Oligonucleotide Microarray of Apoptosis related Genes. Prog. Biochem. Biophys. 35(12):1451–1460.

  3. Yin Y, Fischer D. (2006) On the origin of microbial ORFans: quantifying the strength of the evidence for viral lateral transfer. BMC Evol Biol. 6:63. PubMed PMCID: PMC1559721.

  4. Qian Z, Yin Y, Zhang Y, Lu L, Li Y, Jiang Y. (2006) Genomic characterization of ribitol teichoic acid synthesis in Staphylococcus aureus: genes, genomic organization and gene duplication. BMC Genomics 7:74. PubMed PMCID: PMC1458327.

  5. Huang M, Wang Y, Sun D, Zhu H, Yin Y, Zhang W, Yang S, Quan L, Bai J, Wang S, Chen Q, Li S, Xu N. (2006) Identification of genes regulated by Wnt/beta-catenin pathway and involved in apoptosis via microarray analysis. BMC Cancer 6:221. PubMed PMCID: PMC1574340.

2005 & Earlier 6 publications
  1. Chen Y, Zhang Y, Yin Y, Gao G, Li S, Jiang Y, Gu X, Luo J. (2005) SPD — a web-based secreted protein database. Nucleic Acids Res. 33(Database issue):D169–D173. PubMed PMCID: PMC540047.

  2. Yin Y, Zhang Y, Yu P, Luo JC, Jiang Y, Li SG. (2005) Comparative study of apoptosis-related gene loci in human, mouse and rat genomes. Acta Biochim Biophys Sin. 37(5):341–348.

  3. Wang B, Xu Z, Xu C, Yin Y, Ding W, Yu H. (2004) A Study of Gridifying Scientific Computing Legacy Codes. Lecture Notes in Computer Science 3251:404–412.

  4. Zhang Y, Yin Y, Chen Y, Gao G, Yu P, Luo J, Jiang Y. (2003) PCAS — a precomputed proteome annotation database resource. BMC Genomics 4(1):42. PubMed PMCID: PMC293463.

  5. Yin Y, Luo J, Jiang Y. (2003) Advances in G-protein-coupled receptors and related bioinformatics study. Chinese Science Bulletin 48(6):511–516.

  6. Jiang Y, Gao G, Fang G, Gustafson EL, Laverty M, Yin Y, Zhang Y, Luo J, Greene JR, Bayne ML, Hedrick JA, Murgolo NJ. (2003) PepPat, a pattern-based oligopeptide homology search method and the identification of a novel tachykinin-like peptide. Mamm Genome 14(5):341–349. PubMed PMID: 12856286.

* co-first authors  |  # corresponding authors  |  ^ UNL/NIU grad student/postdoc  |  + undergraduate student