Abstract
This paper analyzes the characteristics of HTML and XML, argues why converting HTML to XML is necessary, and introduces the principles behind the conversion. The method parses an HTML document into a DOM tree to obtain node information, then traverses the tree depth-first, extracting the information at each node and mapping it to an XML structure, thereby converting the HTML document into an XML document.
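The approach described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the names `Node`, `DomBuilder`, `to_xml`, `html_to_xml` and the synthetic `document` root element are assumptions introduced here for illustration, and the sketch uses Python's standard `html.parser` and `xml.etree.ElementTree` modules instead of a full DOM parser. It also simplifies mixed content by joining all of an element's text pieces into one string.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET

# HTML elements that never take a closing tag.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class Node:
    """Hypothetical DOM-like node: tag, attributes, text pieces, children."""

    def __init__(self, tag, attrs=()):
        self.tag = tag
        # Valueless attributes (e.g. <input disabled>) arrive as None;
        # XML needs a string value, so reuse the attribute name.
        self.attrs = {k: (v if v is not None else k) for k, v in attrs}
        self.text_parts = []
        self.children = []


class DomBuilder(HTMLParser):
    """Step 1: parse HTML into a tree, tracking open elements on a stack."""

    def __init__(self):
        super().__init__()
        self.root = Node("document")  # synthetic root (an assumption)
        self.stack = [self.root]

    def handle_starttag(self, tag, attrs):
        node = Node(tag, attrs)
        self.stack[-1].children.append(node)
        if tag not in VOID_TAGS:
            self.stack.append(node)

    def handle_endtag(self, tag):
        # Close the nearest matching open element; ignore stray end tags.
        for i in range(len(self.stack) - 1, 0, -1):
            if self.stack[i].tag == tag:
                del self.stack[i:]
                break

    def handle_data(self, data):
        if data.strip():
            self.stack[-1].text_parts.append(data.strip())


def to_xml(node, parent=None):
    """Step 2: depth-first traversal mapping each node to an XML element."""
    if parent is None:
        elem = ET.Element(node.tag, node.attrs)
    else:
        elem = ET.SubElement(parent, node.tag, node.attrs)
    if node.text_parts:
        elem.text = " ".join(node.text_parts)
    for child in node.children:  # visit children before later siblings
        to_xml(child, elem)
    return elem


def html_to_xml(html):
    """Convert an HTML string to a well-formed XML string."""
    builder = DomBuilder()
    builder.feed(html)
    builder.close()
    return ET.tostring(to_xml(builder.root), encoding="unicode")
```

Calling `html_to_xml("<p>Hello<br>world</p>")` yields an XML string in which the unclosed `br` becomes a properly closed empty element. A production converter would also need to handle attribute or tag names that are not valid XML names, and to preserve the order of text and child elements.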
Source
《电脑知识与技术》
2006, No. 7, pp. 64-65, 79 (3 pages)
Computer Knowledge and Technology