To address the problem of recovering data from free disk sectors, this paper proposes a data recovery approach based on intelligent pattern matching. Unlike methods that rely on the file directory, this approach exploits the consistency among the data on the disk. A feature pattern library is built for different file types according to their internal structure. Data in sectors is classified automatically by clustering and evaluation. When a classification conflict occurs, it is resolved using context patterns. Based on this approach, a data recovery system was implemented that performs pattern matching for txt, Word and PDF files. Raw recovery and post-format recovery tests showed that the system works well.
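As a rough illustration of the sector classification idea in the abstract above, the following Python sketch labels raw sectors by matching them against a small pattern library. The pattern set, the 512-byte sector size, the 0.95 printable-text threshold, and the rule of inheriting the preceding sector's label are all assumptions chosen for illustration; they are not details taken from the paper.

```python
from typing import Dict, List

SECTOR_SIZE = 512  # assumed sector size, not stated in the abstract

# Hypothetical feature pattern library: byte markers that commonly occur in
# each file type. The "doc" entry is the OLE2 compound-file magic number
# used by legacy Word documents.
PATTERNS: Dict[str, List[bytes]] = {
    "pdf": [b"%PDF-", b"endobj", b"endstream", b"xref"],
    "doc": [b"\xd0\xcf\x11\xe0\xa1\xb1\x1a\xe1"],
}


def printable_ratio(sector: bytes) -> float:
    """Fraction of bytes that are printable ASCII, tab, LF, or CR."""
    if not sector:
        return 0.0
    printable = sum(1 for b in sector if 32 <= b < 127 or b in (9, 10, 13))
    return printable / len(sector)


def classify_sector(sector: bytes) -> str:
    """Label one sector by counting pattern hits, then by text-likeness."""
    scores = {
        name: sum(sector.count(p) for p in pats)
        for name, pats in PATTERNS.items()
    }
    best = max(scores, key=scores.get)
    if scores[best] > 0:
        return best
    if printable_ratio(sector) > 0.95:  # assumed threshold
        return "txt"
    return "unknown"


def classify_with_context(sectors: List[bytes]) -> List[str]:
    """Resolve unlabeled sectors from context (assumed rule: inherit the
    label of the preceding sector, since file data tends to be contiguous)."""
    labels = [classify_sector(s) for s in sectors]
    for i in range(1, len(labels)):
        if labels[i] == "unknown" and labels[i - 1] != "unknown":
            labels[i] = labels[i - 1]
    return labels
```

A real system would need a far richer pattern library and a scoring scheme for sectors that match several types at once; here the context step only fills in sectors that match nothing.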
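The emergence of the machine-readable catalog (MARC) is a major advance in bibliographic work and one of the important markers of the modernization of cataloging. Given our country's specific circumstances, establishing a machine-readable catalog system is no longer a distant prospect. Libraries and scientific and technical information institutions should begin their preparations now, so that the transition from manual book (card) catalogs to machine-readable catalogs can be completed smoothly in as short a time as possible.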