With the development of big data,all walks of life in society have begun to venture into big data to serve their own enterprises and departments.Big data has been embraced by university digital libraries.The most cumb...With the development of big data,all walks of life in society have begun to venture into big data to serve their own enterprises and departments.Big data has been embraced by university digital libraries.The most cumbersome work for the management of university libraries is document retrieval.This article uses Hadoop algorithm to extract semantic keywords and then calculates semantic similarity based on the literature retrieval keyword calculation process.The fast-matching method is used to determine the weight of each keyword,so as to ensure an efficient and accurate document retrieval in digital libraries,thus completing the design of the document retrieval method for university digital libraries based on Hadoop technology.展开更多
The objective of Universal Digital Library (UDL) is to capture all books in digital format. A text to speech (TTS) interface for UDL portal would enable access to the digital content in voice mode, and also provide ac...The objective of Universal Digital Library (UDL) is to capture all books in digital format. A text to speech (TTS) interface for UDL portal would enable access to the digital content in voice mode, and also provide access to the digital content for illiterate and vision-impaired people. Our work focuses on design and implementation of text to speech interface for UDL portal primarily for Indian languages. This paper is aimed at identifying the issues involved in integrating text to speech system into UDL portal and describes the development process of Hindi, Telugu and Tamil voices under Festvox framework using unit selection techniques. We demonstrate the quality of the Tamil and Telugu voices and lay out the plan for integrating the TTS into the UDL portal.展开更多
Researchers in P.R.China commonly create triangulate irregular networks(TINs) from contours and then convert TINs into digital elevation models(DEMs).However,the DEM produced by this method can not precisely describe ...Researchers in P.R.China commonly create triangulate irregular networks(TINs) from contours and then convert TINs into digital elevation models(DEMs).However,the DEM produced by this method can not precisely describe and simulate key hydrological features such as rivers and drainage borders.Taking a hilly region in southwestern China as a research area and using ArcGISTM software,we analyzed the errors of different interpolations to obtain distributions of the errors and precisions of different algorithms and to provide references for DEM productions.The results show that different interpolation errors satisfy normal distributions,and large error exists near the structure line of the terrain.Furthermore,the results also show that the precision of a DEM interpolated with the Australian National University digital elevation model(ANUDEM) is higher than that interpolated with TIN.The DEM interpolated with TIN is acceptable for generating DEMs in the hilly region of southwestern China.展开更多
Transliteration editors are essential for keying-in Indian language scripts into the computer using QWERTY keyboard. Applications of transliteration editors in the context of Universal Digital Library (UDL) include en...Transliteration editors are essential for keying-in Indian language scripts into the computer using QWERTY keyboard. Applications of transliteration editors in the context of Universal Digital Library (UDL) include entry of meta-data and diction- aries for Indian languages. In this paper we propose a simple approach for building transliteration editors for Indian languages using Unicode and by taking advantage of its rendering engine. We demonstrate the usefulness of the Unicode based approach to build transliteration editors for Indian languages, and report its advantages needing little maintenance and few entries in the mapping table, and ease of adding new features such as adding letters, to the transliteration scheme. We demonstrate the trans- literation editor for 9 Indian languages and also explain how this approach can be adapted for Arabic scripts.展开更多
In this research paper,we have presented variable area type capacitive sensor signal conditioning system for angular displacement measurement and for this purpose we have used timer LM555 based astable multivibrator a...In this research paper,we have presented variable area type capacitive sensor signal conditioning system for angular displacement measurement and for this purpose we have used timer LM555 based astable multivibrator and universal frequency to digital converter (UFDC). Due to variation in angular displacement in the variable area type capacitor which is connected in the timer based astable circuit,capacitance changes which in turn changes the time period of the timer circuit output. The time period of the timer output waveform is linear with the capacitance and hence linear with angular displacement. The timer output is further processed with UFDC for the measurement. The experimental results show that the time period is linear with the angular displacement in the range of 0- 180° and the uncertainty we should associate it with this average time period value is the standard deviation of the mean,often called the standard error (SE),which is ± 0.023 μs. Because of the simplicity,this measurement system can be used in both electronic and industrial instrumentation.展开更多
The Bibliotheca Alexandrina (BA) has been developing and putting to use a workflow for tuming printed books into digital books as its contribution to the building of a Universal Digital Library. This workflow is a p...The Bibliotheca Alexandrina (BA) has been developing and putting to use a workflow for tuming printed books into digital books as its contribution to the building of a Universal Digital Library. This workflow is a process consisting of multiple phases, namely, scanning, image processing, OCR, digital archiving, document encoding, and publishing. Over the past couple of years, the BA has defined procedures and special techniques for the scanning, processing, OCR and publishing, especially of Arabic books. This workflow has been automated, allowing the governance of the different phases and making possible the production of 18000 books so far. The BA has also designed and implemented a framework for the encoding of digital books that allows publishing as well as a software system for managing the creation, maintenance, and publishing of the overall digital repository.展开更多
Copyright and its international complications have presented a significant barrier to the Universal Digital Library (UDL)'s mission to digitize all the published works of mankind and make them available throughout ...Copyright and its international complications have presented a significant barrier to the Universal Digital Library (UDL)'s mission to digitize all the published works of mankind and make them available throughout the world. We discuss the effect of existing copyright treaties and various proposals, such as compulsory licensing and the public lending fight that would allow access to copyrighted works without requiring permission of their owners. We argue that these schemes are ineffective for purposes of the UDL. Instead, making use of the international consensus that copyright does not protect facts, information or processes, we propose to scan works digitally to extract their intellectual content, and then generate by machine synthetic works that capture this content, and then translate the generated works automatically into multiple languages and distribute them free of copyright restriction.展开更多
文摘With the development of big data,all walks of life in society have begun to venture into big data to serve their own enterprises and departments.Big data has been embraced by university digital libraries.The most cumbersome work for the management of university libraries is document retrieval.This article uses Hadoop algorithm to extract semantic keywords and then calculates semantic similarity based on the literature retrieval keyword calculation process.The fast-matching method is used to determine the weight of each keyword,so as to ensure an efficient and accurate document retrieval in digital libraries,thus completing the design of the document retrieval method for university digital libraries based on Hadoop technology.
文摘The objective of Universal Digital Library (UDL) is to capture all books in digital format. A text to speech (TTS) interface for UDL portal would enable access to the digital content in voice mode, and also provide access to the digital content for illiterate and vision-impaired people. Our work focuses on design and implementation of text to speech interface for UDL portal primarily for Indian languages. This paper is aimed at identifying the issues involved in integrating text to speech system into UDL portal and describes the development process of Hindi, Telugu and Tamil voices under Festvox framework using unit selection techniques. We demonstrate the quality of the Tamil and Telugu voices and lay out the plan for integrating the TTS into the UDL portal.
基金Funded by the Natural Science Foundation of Chongqing under Grant No. CSTC2006AB1015.
文摘Researchers in P.R.China commonly create triangulate irregular networks(TINs) from contours and then convert TINs into digital elevation models(DEMs).However,the DEM produced by this method can not precisely describe and simulate key hydrological features such as rivers and drainage borders.Taking a hilly region in southwestern China as a research area and using ArcGISTM software,we analyzed the errors of different interpolations to obtain distributions of the errors and precisions of different algorithms and to provide references for DEM productions.The results show that different interpolation errors satisfy normal distributions,and large error exists near the structure line of the terrain.Furthermore,the results also show that the precision of a DEM interpolated with the Australian National University digital elevation model(ANUDEM) is higher than that interpolated with TIN.The DEM interpolated with TIN is acceptable for generating DEMs in the hilly region of southwestern China.
文摘Transliteration editors are essential for keying-in Indian language scripts into the computer using QWERTY keyboard. Applications of transliteration editors in the context of Universal Digital Library (UDL) include entry of meta-data and diction- aries for Indian languages. In this paper we propose a simple approach for building transliteration editors for Indian languages using Unicode and by taking advantage of its rendering engine. We demonstrate the usefulness of the Unicode based approach to build transliteration editors for Indian languages, and report its advantages needing little maintenance and few entries in the mapping table, and ease of adding new features such as adding letters, to the transliteration scheme. We demonstrate the trans- literation editor for 9 Indian languages and also explain how this approach can be adapted for Arabic scripts.
文摘In this research paper,we have presented variable area type capacitive sensor signal conditioning system for angular displacement measurement and for this purpose we have used timer LM555 based astable multivibrator and universal frequency to digital converter (UFDC). Due to variation in angular displacement in the variable area type capacitor which is connected in the timer based astable circuit,capacitance changes which in turn changes the time period of the timer circuit output. The time period of the timer output waveform is linear with the capacitance and hence linear with angular displacement. The timer output is further processed with UFDC for the measurement. The experimental results show that the time period is linear with the angular displacement in the range of 0- 180° and the uncertainty we should associate it with this average time period value is the standard deviation of the mean,often called the standard error (SE),which is ± 0.023 μs. Because of the simplicity,this measurement system can be used in both electronic and industrial instrumentation.
文摘The Bibliotheca Alexandrina (BA) has been developing and putting to use a workflow for tuming printed books into digital books as its contribution to the building of a Universal Digital Library. This workflow is a process consisting of multiple phases, namely, scanning, image processing, OCR, digital archiving, document encoding, and publishing. Over the past couple of years, the BA has defined procedures and special techniques for the scanning, processing, OCR and publishing, especially of Arabic books. This workflow has been automated, allowing the governance of the different phases and making possible the production of 18000 books so far. The BA has also designed and implemented a framework for the encoding of digital books that allows publishing as well as a software system for managing the creation, maintenance, and publishing of the overall digital repository.
文摘Copyright and its international complications have presented a significant barrier to the Universal Digital Library (UDL)'s mission to digitize all the published works of mankind and make them available throughout the world. We discuss the effect of existing copyright treaties and various proposals, such as compulsory licensing and the public lending fight that would allow access to copyrighted works without requiring permission of their owners. We argue that these schemes are ineffective for purposes of the UDL. Instead, making use of the international consensus that copyright does not protect facts, information or processes, we propose to scan works digitally to extract their intellectual content, and then generate by machine synthetic works that capture this content, and then translate the generated works automatically into multiple languages and distribute them free of copyright restriction.