As historical Chinese calligraphy works are being digitized, the problem of retrieval becomes a new challenge. But, currently no OCR technique can convert calligraphy character images into text, nor can the existing H...As historical Chinese calligraphy works are being digitized, the problem of retrieval becomes a new challenge. But, currently no OCR technique can convert calligraphy character images into text, nor can the existing Handwriting Character Recognition approach does not work for it. This paper proposes a novel approach to efficiently retrieving Chinese calligraphy characters on the basis of similarity: calligraphy character image is represented by a collection of discriminative features, and high retrieval speed with reasonable effectiveness is achieved. First, calligraphy characters that have no possibility similar to the query are filtered out step by step by comparing the character complexity, stroke density and stroke protrusion. Then, similar calligraphy characters axe retrieved and ranked according to their matching cost produced by approximate shape match. In order to speed up the retrieval, we employed high dimensional data structure - PK-tree. Finally, the efficiency of the algorithm is demonstrated by a preliminary experiment with 3012 calligraphy character images.展开更多
由中国教育部、美国国家科学基金会、印度科学院联合发起,中国教育部主办,浙江大学承办的首届全球数字图书馆国际学术研讨会(The 1st International Conference on Universal Digital Library,ICUDL2005)于2005年10月31日~11月2...由中国教育部、美国国家科学基金会、印度科学院联合发起,中国教育部主办,浙江大学承办的首届全球数字图书馆国际学术研讨会(The 1st International Conference on Universal Digital Library,ICUDL2005)于2005年10月31日~11月2日在浙江大学紫金港校区国际会议中心举行。来自国内外数字图书馆领域的226位代表参加了研讨会。其中,国外代表73位,来自美国、印度、埃及、澳大利亚、新加坡等国家;国内代表153位,包括大陆代表148位、台湾代表2位、香港代表3位。全国人大常委会副委员长、中国科学院院长路甬祥院士向研讨会发来了贺信,教育部副部长吴启迪、浙江省副省长盛昌黎等领导参加了会议。浙江大学校长潘云鹤院士和美国Carnegie Mellon大学教授Raj Reddy博士任大会联合主席。现将本次会议有关学术交流的情况总结如下。展开更多
China-America Digital Academic Library Project (CADAL) is a collaborative project between universities and institutes in China and the USA, which aims to provide universal access to large scale digital resources and e...China-America Digital Academic Library Project (CADAL) is a collaborative project between universities and institutes in China and the USA, which aims to provide universal access to large scale digital resources and explore the ways of applying multimedia and virtual reality technologies to digital library. The distinct characteristic of the resources in CADAL is that it not only contains one million digital books of different languages, but also contains Terabyte level multimedia resources (image, video, and so on), which are utilized for education and research purposes. So, in the Portal to CADAL, both the traditional services of browsing and searching of digital books, and the services of quickly retrieving and structurally browsing of multimedia documents should be provided. In addition, the services of visual presentation of retrieved results are required too. In this paper, the underlying novel multimedia retrieval methods as well as visualization techniques, which are used in the CADAL portal, are investigated.展开更多
基金Supported by the National Natural Science Foundation of China(Grant Nos.60533090,60525108)the National Grand Fundamental Research 973 Program of China(Grant No.2002CB312101)+1 种基金the Science and Technology Project of Zhejiang Province(2005C13032,2005C11001-05)the China-US Million Book Digital Library Project(www.cadal.zju.edu.cn).
文摘As historical Chinese calligraphy works are being digitized, the problem of retrieval becomes a new challenge. But, currently no OCR technique can convert calligraphy character images into text, nor can the existing Handwriting Character Recognition approach does not work for it. This paper proposes a novel approach to efficiently retrieving Chinese calligraphy characters on the basis of similarity: calligraphy character image is represented by a collection of discriminative features, and high retrieval speed with reasonable effectiveness is achieved. First, calligraphy characters that have no possibility similar to the query are filtered out step by step by comparing the character complexity, stroke density and stroke protrusion. Then, similar calligraphy characters axe retrieved and ranked according to their matching cost produced by approximate shape match. In order to speed up the retrieval, we employed high dimensional data structure - PK-tree. Finally, the efficiency of the algorithm is demonstrated by a preliminary experiment with 3012 calligraphy character images.
文摘由中国教育部、美国国家科学基金会、印度科学院联合发起,中国教育部主办,浙江大学承办的首届全球数字图书馆国际学术研讨会(The 1st International Conference on Universal Digital Library,ICUDL2005)于2005年10月31日~11月2日在浙江大学紫金港校区国际会议中心举行。来自国内外数字图书馆领域的226位代表参加了研讨会。其中,国外代表73位,来自美国、印度、埃及、澳大利亚、新加坡等国家;国内代表153位,包括大陆代表148位、台湾代表2位、香港代表3位。全国人大常委会副委员长、中国科学院院长路甬祥院士向研讨会发来了贺信,教育部副部长吴启迪、浙江省副省长盛昌黎等领导参加了会议。浙江大学校长潘云鹤院士和美国Carnegie Mellon大学教授Raj Reddy博士任大会联合主席。现将本次会议有关学术交流的情况总结如下。
基金Project supported by the National Natural Science Foundation of China (Nos. 60272031 and 90412014) and the China-America Digital Academic Library Project
文摘China-America Digital Academic Library Project (CADAL) is a collaborative project between universities and institutes in China and the USA, which aims to provide universal access to large scale digital resources and explore the ways of applying multimedia and virtual reality technologies to digital library. The distinct characteristic of the resources in CADAL is that it not only contains one million digital books of different languages, but also contains Terabyte level multimedia resources (image, video, and so on), which are utilized for education and research purposes. So, in the Portal to CADAL, both the traditional services of browsing and searching of digital books, and the services of quickly retrieving and structurally browsing of multimedia documents should be provided. In addition, the services of visual presentation of retrieved results are required too. In this paper, the underlying novel multimedia retrieval methods as well as visualization techniques, which are used in the CADAL portal, are investigated.