Status quo and future trends of 2015children’s publications released by the Shanghai Press and Publication shows that in the past decade,the domestic children’s book market is developing rapidly with an average annu...Status quo and future trends of 2015children’s publications released by the Shanghai Press and Publication shows that in the past decade,the domestic children’s book market is developing rapidly with an average annual growth of 10%.Children’s books are seeing an increasing ratio with a market share of over 40%.展开更多
The Bibliotheca Alexandrina (BA) has been developing and putting to use a workflow for tuming printed books into digital books as its contribution to the building of a Universal Digital Library. This workflow is a p...The Bibliotheca Alexandrina (BA) has been developing and putting to use a workflow for tuming printed books into digital books as its contribution to the building of a Universal Digital Library. This workflow is a process consisting of multiple phases, namely, scanning, image processing, OCR, digital archiving, document encoding, and publishing. Over the past couple of years, the BA has defined procedures and special techniques for the scanning, processing, OCR and publishing, especially of Arabic books. This workflow has been automated, allowing the governance of the different phases and making possible the production of 18000 books so far. The BA has also designed and implemented a framework for the encoding of digital books that allows publishing as well as a software system for managing the creation, maintenance, and publishing of the overall digital repository.展开更多
This paper briefly introduces the main ideas of a sustainable development OCR system based on open architecture techniques and then describes the construction of an optical character recognition (OCR) center built on ...This paper briefly introduces the main ideas of a sustainable development OCR system based on open architecture techniques and then describes the construction of an optical character recognition (OCR) center built on computer clusters, for the purpose of dynamically improving the recognition precision of the digitized texts of a million volumes of books produced by the China-US Million Books Digital Library (CADAL) Project. The practice of this center will provide helpful reference for other digital library projects.展开更多
文摘Status quo and future trends of 2015children’s publications released by the Shanghai Press and Publication shows that in the past decade,the domestic children’s book market is developing rapidly with an average annual growth of 10%.Children’s books are seeing an increasing ratio with a market share of over 40%.
文摘The Bibliotheca Alexandrina (BA) has been developing and putting to use a workflow for tuming printed books into digital books as its contribution to the building of a Universal Digital Library. This workflow is a process consisting of multiple phases, namely, scanning, image processing, OCR, digital archiving, document encoding, and publishing. Over the past couple of years, the BA has defined procedures and special techniques for the scanning, processing, OCR and publishing, especially of Arabic books. This workflow has been automated, allowing the governance of the different phases and making possible the production of 18000 books so far. The BA has also designed and implemented a framework for the encoding of digital books that allows publishing as well as a software system for managing the creation, maintenance, and publishing of the overall digital repository.
基金Project supported by China-US Million Books Digital Library Project
文摘This paper briefly introduces the main ideas of a sustainable development OCR system based on open architecture techniques and then describes the construction of an optical character recognition (OCR) center built on computer clusters, for the purpose of dynamically improving the recognition precision of the digitized texts of a million volumes of books produced by the China-US Million Books Digital Library (CADAL) Project. The practice of this center will provide helpful reference for other digital library projects.