摘要
采用一种表格识别方法实现对多种类型表格的识别,系统利用表格投影轮廓的功率谱密度作为表格的不变性特征向量。为了解决具有相互对称结构表格的识别问题,提出一种新的特征提取方法:采用区域划分的策略,综合考虑表格图像在水平方向及垂直方向上的特征,以分区投影轮廓的功率谱密度作为表格图像的特征向量。实验表明,这种方法能够有效解决具有对称结构表格的识别问题。
A method of forms identification is introduced, which can be employed in processing of documents with a certain style of application form. In order to solve the identification of such forms with a symmetry figure, a new approach of feature extraction is presented, which is a method of area division done by partition the forms image into areas. Both the horizontal feature and the vertical feature of forms image are token into account, and the power spectral density of the subareas' profile is used as forms image's feature vector. It is shown to be an effective method to identify the forms with symmetry figure.
出处
《计算机工程》
EI
CAS
CSCD
北大核心
2006年第6期215-217,共3页
Computer Engineering
基金
国家自然科学基金资助项目(60475003)
关键词
特征提取
区域划分
表格识别
功率谱密度
Feature extraction
Area division
Form identification
Power spectral density