摘要
期刊论文结构化加工在期刊界已经逐步形成共识,国内期刊平台多采用新版期刊文章标签集(Journal Article Tag Suite,JATS)标准进行加工,但JATS标准仅对数据属性提出建议值,自行拓展空间较大,导致实际的数据加工结果千差万别,数据交换困难重重。本文分析了国内外数字化加工和标准进化的历程及我国在XML结构化数据加工中存在的问题,进一步分析了存档及交换标签集、出版标签集等不同子集的特点,提出既能完整保留论文原始信息,又便于提取各类结构化信息的数据加工及存储解决方案,可以根据需要通过减法转换生成符合各平台标准的数据加工存储格式,从而真正实现一次加工、多渠道投放和传播。
Structured data processing of papers has gradually formed a consensus in academic journal field.Domestic journals and platforms mostly adopt the Journal Article Tag Suite(JATS)standard for processing,but the JATS standard only puts forward suggested values for data attributes,which has a large space for self-expansion,resulting in different actual data processing results and difficulties in data exchange.This study analyzed the process of digital processing and standard evolution at home and abroad and the problems existing in XML structured data processing in China,and further analyzed the characteristics of different subsets such as Journal Archiving and Interchange Tag Set and Journal Publishing Tag Set.A data processing and storage solution were proposed,which can not only completely retain the original information of the paper,but also facilitate the extraction of various structured information.It can be used to generate data compliant with each platform’s standard through subtraction and conversion as needed,thus truly realizing one-time processing and multi-channel delivery and communication.
作者
彭劲松
李璐
PENG Jinsong;LI Lu(Formax BPO Beijing Inc.,100085,Beijing,China)
出处
《数字出版研究》
2024年第2期57-64,共8页
DIGITAL PUBLISHING RESEARCH
关键词
期刊论文结构化
JATS
存档及交换标签集
出版标签集
数据加工存储标准
XML
Structured data of academic journals
JATS
Journal Archiving and Interchange Tag Set
Journal Publishing Tag Set
Data processing and storage standard
XML