摘要
Non-conventional peptides(NCPs),which include small open reading frame-encoded peptides,play critical roles in fundamental biological processes.In this study,we developed an integrated peptidogenomic pipeline using high-throughput mass spectra to probe a customized six-frame translation database and applied it to large-scale identification of NCPs in plants.A total of 1993 and 1860 NCPs were unambiguously identified in maize and Arabidopsis,respectively.These NCPs showed distinct characteristics compared with conventional peptides and were derived from introns,3′UTRs,5′UTRs,junctions,and intergenic regions.Furthermore,our results showed that translation events in unannotated transcripts occur more broadly than previously thought.In addition,we found that dozens of maize NCPs are enriched within regions associated with phenotypic variations and domestication selection,indicating that they potentially are involved in genetic regulation of complex traits and domestication in maize.Taken together,our study developed an integrated peptidogenomic pipeline for large-scale identification of NCPs in plants,which would facilitate global characterization of NCPs from other plants.The identification of large-scale NCPs in both monocot(maize)and dicot(Arabidopsis)plants indicates that a large portion of plant genome can be translated into biologically functional molecules,which has important implications for functional genomic studies.
基金
This work is supported by the National Natural Science Foundation of China(nos.31872872 and U1804113)
National Key Research and Deveopment Program of China(no.2016YFD0101003),and Henan Association for Science and Technology.