期刊文献+
共找到1,050篇文章
< 1 2 53 >
每页显示 20 50 100
基于改进ZS细化算法的手写体汉字骨架提取 被引量:28
1
作者 常庆贺 吴敏华 骆力明 《计算机应用与软件》 北大核心 2020年第7期107-113,164,共8页
手写体汉字图像细化所提取出的骨架,突出了汉字的结构特征并减少了冗余信息,对手写体汉字的识别有着重要作用。Zhang-Suen细化算法迭代次数少、运行速度快,适合处理直线、T行交叉和拐角,但应用于细化手写体汉字图像时,细化后的汉字骨架... 手写体汉字图像细化所提取出的骨架,突出了汉字的结构特征并减少了冗余信息,对手写体汉字的识别有着重要作用。Zhang-Suen细化算法迭代次数少、运行速度快,适合处理直线、T行交叉和拐角,但应用于细化手写体汉字图像时,细化后的汉字骨架无法保证单一像素宽,并且汉字骨架有毛刺。针对该问题,提出一种改进算法。使用消除模板和保留模板在保证手写体汉字骨架连续性的基础上,实现骨架的单一像素化;引进门限机制的判定方法,通过毛刺长度值与设定的阈值进行对比的方式去除了骨架毛刺。结果表明,改进算法实现了汉字骨架的单一像素化、无毛刺,准确突出了手写体汉字的拓扑结构。 展开更多
关键词 细化算法 手写体汉字 冗余像素 毛刺 骨架
下载PDF
“有效行”特征对手写体字符的识别 被引量:7
2
作者 王贵新 刘建胜 +3 位作者 居琰 汪同庆 彭健 杨波 《电子科技大学学报》 EI CAS CSCD 北大核心 2001年第3期287-291,共5页
从无限制的手写体数字的结构出发,提出“有效行”特征的概念及其提取算法。该特征具有维数小、平移不变、字符小角度旋转不变等特点。建立了相应的字符特征库,利用BP网络对样本字符进行研究,通过大量的手写体数字识别测试表明:该... 从无限制的手写体数字的结构出发,提出“有效行”特征的概念及其提取算法。该特征具有维数小、平移不变、字符小角度旋转不变等特点。建立了相应的字符特征库,利用BP网络对样本字符进行研究,通过大量的手写体数字识别测试表明:该方法在识别速度上优于利用单一的矩特征、小波特征等传统识别方法;在误识率方面也优于一般的单一识别方法。 展开更多
关键词 特征提取 手写体 字符识别 “有效行”特征
下载PDF
基于笔划合并的手写体信函地址汉字切分识别 被引量:8
3
作者 王嵘 丁晓青 刘长松 《清华大学学报(自然科学版)》 EI CAS CSCD 北大核心 2004年第4期498-502,共5页
为了自动地处理存在着大量的笔划交叉与粘连的实际信函地址行,采用了一种基于笔划提取合并的手写体汉字切分识别方法。对于从实际信函中提取出的单行地址文本图像,首先提取出字符的横、竖、撇、捺等笔划,再根据一定的准则将笔划合并成字... 为了自动地处理存在着大量的笔划交叉与粘连的实际信函地址行,采用了一种基于笔划提取合并的手写体汉字切分识别方法。对于从实际信函中提取出的单行地址文本图像,首先提取出字符的横、竖、撇、捺等笔划,再根据一定的准则将笔划合并成字根,最终应用与地址解释相结合的动态规划算法得到最终的切分结果,获得投递区域。用从邮政分拣机上获得的443个信函地址行二值图像样本进行测试,省市一级和市县一级投递地址的正确识别率已经达到了66%。 展开更多
关键词 笔划合并 文字识别 汉字切分 手写体汉字 信函地址 自动处理方式 信函处理
原文传递
隐马尔可夫模型在脱机手写体汉字识别中的应用 被引量:8
4
作者 童学锋 邓刚 柴佩琪 《计算机应用》 CSCD 北大核心 2002年第10期1-3,共3页
介绍了一种新的脱机手写汉字识别方法———隐马尔可夫模型 (HMM )法 ,该方法对每个汉字建立 8个HMM ,通过等比重综合方法将 8个分类器的计算结果进行综合 ,从而得到识别结果 ,实践证明该方法是可行的。
关键词 隐马尔可夫模型 脱机 手写体汉字识别 模式识别 计算机
下载PDF
AN ADAPTIVELY TRAINED KERNEL-BASED NONLINEAR REPRESENTOR FOR HANDWRITTEN DIGIT CLASSIFICATION 被引量:12
5
作者 Liu Benyong Zhang Jing 《Journal of Electronics(China)》 2006年第3期379-383,共5页
In practice, retraining a trained classifier is necessary when novel data become available. This paper adopts an incremental learning procedure to adaptively train a Kernel-based Nonlinear Representor (KNR), a recentl... In practice, retraining a trained classifier is necessary when novel data become available. This paper adopts an incremental learning procedure to adaptively train a Kernel-based Nonlinear Representor (KNR), a recently presented nonlinear classifier for optimal pattern representation, so that its generalization ability may be evaluated in time-variant situation and a sparser representation is obtained for computationally intensive tasks. The addressed techniques are applied to handwritten digit classification to illustrate the feasibility for pattern recognition. 展开更多
关键词 Pattern recognition handwritten digit recognition Incremental learning Sparse representation Kernel-based Nonlinear Representor (KNR)
下载PDF
Multimodal Dependence Attention and Large-Scale Data Based Offline Handwritten Formula Recognition
6
作者 刘汉超 董兰芳 张信明 《Journal of Computer Science & Technology》 SCIE EI CSCD 2024年第3期654-670,共17页
Offline handwritten formula recognition is a challenging task due to the variety of handwritten symbols and two-dimensional formula structures.Recently,the deep neural network recognizers based on the encoder-decoder ... Offline handwritten formula recognition is a challenging task due to the variety of handwritten symbols and two-dimensional formula structures.Recently,the deep neural network recognizers based on the encoder-decoder frame-work have achieved great improvements on this task.However,the unsatisfactory recognition performance for formulas with long LTeX strings is one shortcoming of the existing work.Moreover,lacking sufficient training data also limits the capability of these recognizers.In this paper,we design a multimodal dependence attention(MDA)module to help the model learn visual and semantic dependencies among symbols in the same formula to improve the recognition perfor-mance of the formulas with long LTeX strings.To alleviate overfitting and further improve the recognition performance,we also propose a new dataset,Handwritten Formula Image Dataset(HFID),which contains 25620 handwritten formula images collected from real life.We conduct extensive experiments to demonstrate the effectiveness of our proposed MDA module and HFID dataset and achieve state-of-the-art performances,63.79%and 65.24%expression accuracy on CROHME 2014 and CROHME 2016,respectively. 展开更多
关键词 handwritten formula recognition multimodal dependence attention semantic dependence visual dependence handwritten Formula Image Dataset
原文传递
KurdSet: A Kurdish Handwritten Characters Recognition Dataset Using Convolutional Neural Network
7
作者 Sardar Hasen Ali Maiwan Bahjat Abdulrazzaq 《Computers, Materials & Continua》 SCIE EI 2024年第4期429-448,共20页
Handwritten character recognition(HCR)involves identifying characters in images,documents,and various sources such as forms surveys,questionnaires,and signatures,and transforming them into a machine-readable format fo... Handwritten character recognition(HCR)involves identifying characters in images,documents,and various sources such as forms surveys,questionnaires,and signatures,and transforming them into a machine-readable format for subsequent processing.Successfully recognizing complex and intricately shaped handwritten characters remains a significant obstacle.The use of convolutional neural network(CNN)in recent developments has notably advanced HCR,leveraging the ability to extract discriminative features from extensive sets of raw data.Because of the absence of pre-existing datasets in the Kurdish language,we created a Kurdish handwritten dataset called(KurdSet).The dataset consists of Kurdish characters,digits,texts,and symbols.The dataset consists of 1560 participants and contains 45,240 characters.In this study,we chose characters only from our dataset.We utilized a Kurdish dataset for handwritten character recognition.The study also utilizes various models,including InceptionV3,Xception,DenseNet121,and a customCNNmodel.To show the performance of the KurdSet dataset,we compared it to Arabic handwritten character recognition dataset(AHCD).We applied the models to both datasets to show the performance of our dataset.Additionally,the performance of the models is evaluated using test accuracy,which measures the percentage of correctly classified characters in the evaluation phase.All models performed well in the training phase,DenseNet121 exhibited the highest accuracy among the models,achieving a high accuracy of 99.80%on the Kurdish dataset.And Xception model achieved 98.66%using the Arabic dataset. 展开更多
关键词 CNN models Kurdish handwritten recognition KurdSet dataset Arabic handwritten recognition DenseNet121 model InceptionV3 model Xception model
下载PDF
Development of a Lightweight Model for Handwritten Dataset Recognition: Bangladeshi City Names in Bangla Script
8
作者 MdMahbubur Rahman Tusher Fahmid Al Farid +6 位作者 MdAl-Hasan Abu Saleh Musa Miah Susmita Roy Rinky Mehedi Hasan Jim Sarina Mansor MdAbdur Rahim Hezerul Abdul Karim 《Computers, Materials & Continua》 SCIE EI 2024年第8期2633-2656,共24页
The context of recognizing handwritten city names,this research addresses the challenges posed by the manual inscription of Bangladeshi city names in the Bangla script.In today’s technology-driven era,where precise t... The context of recognizing handwritten city names,this research addresses the challenges posed by the manual inscription of Bangladeshi city names in the Bangla script.In today’s technology-driven era,where precise tools for reading handwritten text are essential,this study focuses on leveraging deep learning to understand the intricacies of Bangla handwriting.The existing dearth of dedicated datasets has impeded the progress of Bangla handwritten city name recognition systems,particularly in critical areas such as postal automation and document processing.Notably,no prior research has specifically targeted the unique needs of Bangla handwritten city name recognition.To bridge this gap,the study collects real-world images from diverse sources to construct a comprehensive dataset for Bangla Hand Written City name recognition.The emphasis on practical data for system training enhances accuracy.The research further conducts a comparative analysis,pitting state-of-the-art(SOTA)deep learning models,including EfficientNetB0,VGG16,ResNet50,DenseNet201,InceptionV3,and Xception,against a custom Convolutional Neural Networks(CNN)model named“Our CNN.”The results showcase the superior performance of“Our CNN,”with a test accuracy of 99.97% and an outstanding F1 score of 99.95%.These metrics underscore its potential for automating city name recognition,particularly in postal services.The study concludes by highlighting the significance of meticulous dataset curation and the promising outlook for custom CNN architectures.It encourages future research avenues,including dataset expansion,algorithm refinement,exploration of recurrent neural networks and attention mechanisms,real-world deployment of models,and extension to other regional languages and scripts.These recommendations offer exciting possibilities for advancing the field of handwritten recognition technology and hold practical implications for enhancing global postal services. 展开更多
关键词 handwritten recognition Bangladeshi city names Bangla handwritten city name automated postal services
下载PDF
Part-based methods for handwritten digit recognition 被引量:4
9
作者 Song WANG Seiichi UCHIDA +1 位作者 Marcus LIWICKI Yaokai FENG 《Frontiers of Computer Science》 SCIE EI CSCD 2013年第4期514-525,共12页
In this paper, we intensively study the behavior of three part-based methods for handwritten digit recognition. The principle of the proposed methods is to represent a handwritten digit image as a set of parts and rec... In this paper, we intensively study the behavior of three part-based methods for handwritten digit recognition. The principle of the proposed methods is to represent a handwritten digit image as a set of parts and recognize the image by aggregating the recognition results of individual parts. Since part-based methods do not rely on the global structure of a character, they are expected to be more robust against various delormations which may damage the global structure. The proposed three methods are based on the same principle but different in their details, for example, the way of aggregating the individual results. Thus, those methods have different performances. Experimental results show that even the simplest part-based method can achieve recognition rate as high as 98.42% while the improved one achieved 99.15%, which is comparable or even higher than some state-of-the-art method. This result is important because it reveals that characters can be recognized without their global structure. The results also show that the part-based method has robustness against deformations which usually appear in handwriting. 展开更多
关键词 handwritten digit recognition local features part-based method
原文传递
A hyperspectral unmixing approach for ink mismatch detection in unbalanced clusters
10
作者 Faryal Aurooj Nasir Salman Liaquat +1 位作者 Khurram Khurshid Nor Muzlifah Mahyuddin 《Journal of Information and Intelligence》 2024年第2期177-190,共14页
With the rapid development of location-based services and online social networks,POI recommendation services considering geographic and social factors have received extensive attention.Meanwhile,the vigorous developme... With the rapid development of location-based services and online social networks,POI recommendation services considering geographic and social factors have received extensive attention.Meanwhile,the vigorous development of cloud computing has prompted service providers to outsource data to the cloud to provide POI recommendation services.However,there is a degree of distrust of the cloud by service providers.To protect digital assets,service providers encrypt data before outsourcing it.However,encryption reduces data availability,making it more challenging to provide POI recommendation services in outsourcing scenarios.Some privacy-preserving schemes for geo-social-based POI recommendation have been presented,but they have some limitations in supporting group query,considering both geographic and social factors,and query accuracy,making these schemes impractical.To solve this issue,we propose two practical and privacy-preserving geo-social-based POI recommendation schemes for single user and group users,which are named GSPR-S and GSPR-G.Specifically,we first utilize the quad tree to organize geographic data and the MinHash method to index social data.Then,we apply BGV fully homomorphic encryption to design some private algorithms,including a private max/min operation algorithm,a private rectangular set operation algorithm,and a private rectangular overlapping detection algorithm.After that,we use these algorithms as building blocks in our schemes for efficiency improvement.According to security analysis,our schemes are proven to be secure against the honest-but-curious cloud servers,and experimental results show that our schemes have good performance. 展开更多
关键词 k-means clustering Gaussian mixture model(GMM) Hyper spectral imaging(HSI) iVision handwritten hyperspectral images dataset(HHID) Document forensics
原文传递
Improving the Segmentation of Arabic Handwriting Using Ligature Detection Technique
11
作者 Husam Ahmad Al Hamad Mohammad Shehab 《Computers, Materials & Continua》 SCIE EI 2024年第5期2015-2034,共20页
Recognizing handwritten characters remains a critical and formidable challenge within the realm of computervision. Although considerable strides have been made in enhancing English handwritten character recognitionthr... Recognizing handwritten characters remains a critical and formidable challenge within the realm of computervision. Although considerable strides have been made in enhancing English handwritten character recognitionthrough various techniques, deciphering Arabic handwritten characters is particularly intricate. This complexityarises from the diverse array of writing styles among individuals, coupled with the various shapes that a singlecharacter can take when positioned differently within document images, rendering the task more perplexing. Inthis study, a novel segmentation method for Arabic handwritten scripts is suggested. This work aims to locatethe local minima of the vertical and diagonal word image densities to precisely identify the segmentation pointsbetween the cursive letters. The proposed method starts with pre-processing the word image without affectingits main features, then calculates the directions pixel density of the word image by scanning it vertically and fromangles 30° to 90° to count the pixel density fromall directions and address the problem of overlapping letters, whichis a commonly attitude in writing Arabic texts by many people. Local minima and thresholds are also determinedto identify the ideal segmentation area. The proposed technique is tested on samples obtained fromtwo datasets: Aself-curated image dataset and the IFN/ENIT dataset. The results demonstrate that the proposed method achievesa significant improvement in the proportions of cursive segmentation of 92.96% on our dataset, as well as 89.37%on the IFN/ENIT dataset. 展开更多
关键词 Arabic handwritten SEGMENTATION image processing ligature detection technique intelligent recognition
下载PDF
Method to Remove Handwritten Texts Using Smart Phone
12
作者 Haiquan Fang 《Journal of Harbin Institute of Technology(New Series)》 CAS 2024年第2期12-21,共10页
To remove handwritten texts from an image of a document taken by smart phone,an intelligent removal method was proposed that combines dewarping and Fully Convolutional Network with Atrous Convolutional and Atrous Spat... To remove handwritten texts from an image of a document taken by smart phone,an intelligent removal method was proposed that combines dewarping and Fully Convolutional Network with Atrous Convolutional and Atrous Spatial Pyramid Pooling(FCN-AC-ASPP).For a picture taken by a smart phone,firstly,the image is transformed into a regular image by the dewarping algorithm.Secondly,the FCN-AC-ASPP is used to classify printed texts and handwritten texts.Lastly,handwritten texts can be removed by a simple algorithm.Experiments show that the classification accuracy of the FCN-AC-ASPP is better than FCN,DeeplabV3+,FCN-AC.For handwritten texts removal effect,the method of combining dewarping and FCN-AC-ASPP is superior to FCN-AC-ASP alone. 展开更多
关键词 handwritten texts printed texts CLASSIFICATION FCN-AC-ASPP smart phone
下载PDF
Optimised CNN Architectures for Handwritten Arabic Character Recognition
13
作者 Salah Alghyaline 《Computers, Materials & Continua》 SCIE EI 2024年第6期4905-4924,共20页
Handwritten character recognition is considered challenging compared with machine-printed characters due to the different human writing styles.Arabic is morphologically rich,and its characters have a high similarity.T... Handwritten character recognition is considered challenging compared with machine-printed characters due to the different human writing styles.Arabic is morphologically rich,and its characters have a high similarity.The Arabic language includes 28 characters.Each character has up to four shapes according to its location in the word(at the beginning,middle,end,and isolated).This paper proposed 12 CNN architectures for recognizing handwritten Arabic characters.The proposed architectures were derived from the popular CNN architectures,such as VGG,ResNet,and Inception,to make them applicable to recognizing character-size images.The experimental results on three well-known datasets showed that the proposed architectures significantly enhanced the recognition rate compared to the baseline models.The experiments showed that data augmentation improved the models’accuracies on all tested datasets.The proposed model outperformed most of the existing approaches.The best achieved results were 93.05%,98.30%,and 96.88%on the HIJJA,AHCD,and AIA9K datasets. 展开更多
关键词 Optical character recognition(OCR) handwritten arabic characters deep learning
下载PDF
Hybrid Optimization Algorithm for Handwritten Document Enhancement
14
作者 Shu-Chuan Chu Xiaomeng Yang +2 位作者 Li Zhang Václav Snášel Jeng-Shyang Pan 《Computers, Materials & Continua》 SCIE EI 2024年第3期3763-3786,共24页
The Gannet Optimization Algorithm (GOA) and the Whale Optimization Algorithm (WOA) demonstrate strong performance;however, there remains room for improvement in convergence and practical applications. This study intro... The Gannet Optimization Algorithm (GOA) and the Whale Optimization Algorithm (WOA) demonstrate strong performance;however, there remains room for improvement in convergence and practical applications. This study introduces a hybrid optimization algorithm, named the adaptive inertia weight whale optimization algorithm and gannet optimization algorithm (AIWGOA), which addresses challenges in enhancing handwritten documents. The hybrid strategy integrates the strengths of both algorithms, significantly enhancing their capabilities, whereas the adaptive parameter strategy mitigates the need for manual parameter setting. By amalgamating the hybrid strategy and parameter-adaptive approach, the Gannet Optimization Algorithm was refined to yield the AIWGOA. Through a performance analysis of the CEC2013 benchmark, the AIWGOA demonstrates notable advantages across various metrics. Subsequently, an evaluation index was employed to assess the enhanced handwritten documents and images, affirming the superior practical application of the AIWGOA compared with other algorithms. 展开更多
关键词 Metaheuristic algorithm gannet optimization algorithm hybrid algorithm handwritten document enhancement
下载PDF
Handwritten digit recognition based on ghost imaging with deep learning 被引量:3
15
作者 Xing He Sheng-Mei Zhao Le Wang 《Chinese Physics B》 SCIE EI CAS CSCD 2021年第5期367-372,共6页
We present a ghost handwritten digit recognition method for the unknown handwritten digits based on ghost imaging(GI)with deep neural network,where a few detection signals from the bucket detector,generated by the cos... We present a ghost handwritten digit recognition method for the unknown handwritten digits based on ghost imaging(GI)with deep neural network,where a few detection signals from the bucket detector,generated by the cosine transform speckle,are used as the characteristic information and the input of the designed deep neural network(DNN),and the output of the DNN is the classification.The results show that the proposed scheme has a higher recognition accuracy(as high as 98%for the simulations,and 91%for the experiments)with a smaller sampling ratio(say 12.76%).With the increase of the sampling ratio,the recognition accuracy is enhanced.Compared with the traditional recognition scheme using the same DNN structure,the proposed scheme has slightly better performance with a lower complexity and non-locality property.The proposed scheme provides a promising way for remote sensing. 展开更多
关键词 ghost imaging handwritten digit recognition ghost handwritten recognition deep learning
下载PDF
一种基于KNN算法的手写数字识别实现 被引量:5
16
作者 迟殿委 《信息与电脑》 2019年第17期20-22,共3页
KNN是比较成熟的分类算法,关于KNN手写数字识别的分类应用实战很多都是基于sklearn提供的手写数字识别数据集traningDigits。笔者结合KNN算法原理用Python实现其手写数字识别的算法过程,并支持用户用拍照、绘图软件手写数字,方法就是将... KNN是比较成熟的分类算法,关于KNN手写数字识别的分类应用实战很多都是基于sklearn提供的手写数字识别数据集traningDigits。笔者结合KNN算法原理用Python实现其手写数字识别的算法过程,并支持用户用拍照、绘图软件手写数字,方法就是将图片处理成sklearn提供的数据集格式,然后作为测试样本应用在分类模型中进行预测,经过运行验证算法分类效果良好。 展开更多
关键词 KNN 手写 识别 分类
下载PDF
Kernel principal component analysis network for image classification 被引量:5
17
作者 吴丹 伍家松 +3 位作者 曾瑞 姜龙玉 Lotfi Senhadji 舒华忠 《Journal of Southeast University(English Edition)》 EI CAS 2015年第4期469-473,共5页
In order to classify nonlinear features with a linear classifier and improve the classification accuracy, a deep learning network named kernel principal component analysis network( KPCANet) is proposed. First, the d... In order to classify nonlinear features with a linear classifier and improve the classification accuracy, a deep learning network named kernel principal component analysis network( KPCANet) is proposed. First, the data is mapped into a higher-dimensional space with kernel principal component analysis to make the data linearly separable. Then a two-layer KPCANet is built to obtain the principal components of the image. Finally, the principal components are classified with a linear classifier. Experimental results showthat the proposed KPCANet is effective in face recognition, object recognition and handwritten digit recognition. It also outperforms principal component analysis network( PCANet) generally. Besides, KPCANet is invariant to illumination and stable to occlusion and slight deformation. 展开更多
关键词 deep learning kernel principal component analysis net(KPCANet) principal component analysis net(PCANet) face recognition object recognition handwritten digit recognition
下载PDF
Science Letters:Binary tree of posterior probability support vector machines 被引量:2
18
作者 Dong-li WANG Jian-guo ZHENG Yan ZHOU 《Journal of Zhejiang University-Science C(Computers and Electronics)》 SCIE EI 2011年第2期83-87,共5页
Posterior probability support vector machines (PPSVMs) prove robust against noises and outliers and need fewer storage support vectors (SVs). Gonen et al. (2008) extended PPSVMs to a multiclass case by both single-mac... Posterior probability support vector machines (PPSVMs) prove robust against noises and outliers and need fewer storage support vectors (SVs). Gonen et al. (2008) extended PPSVMs to a multiclass case by both single-machine and multimachine approaches. However, these extensions suffer from low classification efficiency, high computational burden, and more importantly, unclassifiable regions. To achieve higher classification efficiency and accuracy with fewer SVs, a binary tree of PPSVMs for the multiclass classification problem is proposed in this letter. Moreover, a Fisher ratio separability measure is adopted to determine the tree structure. Several experiments on handwritten recognition datasets are included to illustrate the proposed approach. Specifically, the Fisher ratio separability accelerated binary tree of PPSVMs obtains overall test accuracy, if not higher than, at least comparable to those of other multiclass algorithms, while using significantly fewer SVs and much less test time. 展开更多
关键词 Binary tree Support vector machine handwritten recognition Classification
原文传递
An Auto-Grading Oriented Approach for Off-Line Handwritten Organic Cyclic Compound Structure Formulas Recognition
19
作者 Ting Zhang Yifei Wang +3 位作者 Xinxin Jin Zhiwen Gu Xiaoliang Zhang Bin He 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第6期2267-2285,共19页
Auto-grading,as an instruction tool,could reduce teachers’workload,provide students with instant feedback and support highly personalized learning.Therefore,this topic attracts considerable attentions from researcher... Auto-grading,as an instruction tool,could reduce teachers’workload,provide students with instant feedback and support highly personalized learning.Therefore,this topic attracts considerable attentions from researchers recently.To realize the automatic grading of handwritten chemistry assignments,the problem of chemical notations recognition should be solved first.The recent handwritten chemical notations recognition solutions belonging to the end-to-end trainable category suffered fromthe problem of lacking the accurate alignment information between the input and output.They serve the aim of reading notations into electrical devices to better prepare relevant edocuments instead of auto-grading handwritten assignments.To tackle this limitation to enable the auto-grading of handwritten chemistry assignments at a fine-grained level.In this work,we propose a component-detectionbased approach for recognizing off-line handwritten Organic Cyclic Compound Structure Formulas(OCCSFs).Specifically,we define different components of OCCSFs as objects(including graphical objects and text objects),and adopt the deep learning detector to detect them.Then,regarding the detected text objects,we introduce an improved attention-based encoder-decoder model for text recognition.Finally,with these detection results and the geometric relationships of detected objects,this article designs a holistic algorithm for interpreting the spatial structure of handwritten OCCSFs.The proposedmethod is evaluated on a self-collected data set consisting of 3000 samples and achieves promising results. 展开更多
关键词 handwritten chemical structure formulas structure interpretation components detection text recognition
下载PDF
Semantic Document Layout Analysis of Handwritten Manuscripts
20
作者 Emad Sami Jaha 《Computers, Materials & Continua》 SCIE EI 2023年第5期2805-2831,共27页
A document layout can be more informative than merely a document’s visual and structural appearance.Thus,document layout analysis(DLA)is considered a necessary prerequisite for advanced processing and detailed docume... A document layout can be more informative than merely a document’s visual and structural appearance.Thus,document layout analysis(DLA)is considered a necessary prerequisite for advanced processing and detailed document image analysis to be further used in several applications and different objectives.This research extends the traditional approaches of DLA and introduces the concept of semantic document layout analysis(SDLA)by proposing a novel framework for semantic layout analysis and characterization of handwritten manuscripts.The proposed SDLA approach enables the derivation of implicit information and semantic characteristics,which can be effectively utilized in dozens of practical applications for various purposes,in a way bridging the semantic gap and providingmore understandable high-level document image analysis and more invariant characterization via absolute and relative labeling.This approach is validated and evaluated on a large dataset ofArabic handwrittenmanuscripts comprising complex layouts.The experimental work shows promising results in terms of accurate and effective semantic characteristic-based clustering and retrieval of handwritten manuscripts.It also indicates the expected efficacy of using the capabilities of the proposed approach in automating and facilitating many functional,reallife tasks such as effort estimation and pricing of transcription or typing of such complex manuscripts. 展开更多
关键词 Semantic characteristics semantic labeling document layout analysis semantic document layout analysis handwritten manuscripts clustering RETRIEVAL image processing computer vision machine learning
下载PDF
上一页 1 2 53 下一页 到第
使用帮助 返回顶部