期刊文献+
共找到22,205篇文章
< 1 2 250 >
每页显示 20 50 100
A Framework Based on the DAO and NFT in Blockchain for Electronic Document Sharing
1
作者 Lin Chen Jiaming Zhu +2 位作者 Yuting Xu Huanqin Zheng Shen Su 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第9期2373-2395,共23页
In the information age,electronic documents(e-documents)have become a popular alternative to paper documents due to their lower costs,higher dissemination rates,and ease of knowledge sharing.However,digital copyright ... In the information age,electronic documents(e-documents)have become a popular alternative to paper documents due to their lower costs,higher dissemination rates,and ease of knowledge sharing.However,digital copyright infringements occur frequently due to the ease of copying,which not only infringes on the rights of creators but also weakens their creative enthusiasm.Therefore,it is crucial to establish an e-document sharing system that enforces copyright protection.However,the existing centralized system has outstanding vulnerabilities,and the plagiarism detection algorithm used cannot fully detect the context,semantics,style,and other factors of the text.Digital watermark technology is only used as a means of infringement tracing.This paper proposes a decentralized framework for e-document sharing based on decentralized autonomous organization(DAO)and non-fungible token(NFT)in blockchain.The use of blockchain as a distributed credit base resolves the vulnerabilities inherent in traditional centralized systems.The e-document evaluation and plagiarism detection mechanisms based on the DAO model effectively address challenges in comprehensive text information checks,thereby promoting the enhancement of e-document quality.The mechanism for protecting and circulating e-document copyrights using NFT technology ensures effective safeguarding of users’e-document copyrights and facilitates e-document sharing.Moreover,recognizing the security issues within the DAO governance mechanism,we introduce an innovative optimization solution.Through experimentation,we validate the enhanced security of the optimized governance mechanism,reducing manipulation risks by up to 51%.Additionally,by utilizing evolutionary game analysis to deduce the equilibrium strategies of the framework,we discovered that adjusting the reward and penalty parameters of the incentive mechanism motivates creators to generate superior quality and unique e-documents,while evaluators are more likely to engage in assessments. 展开更多
关键词 Electronic document sharing blockchain DAO NFT evolutionary game
下载PDF
An explorative study on document type assignment of review articles in Web of Science,Scopus and journals’websites
2
作者 Manman Zhu Xinyue Lu +2 位作者 Fuyou Chen Liying Yang Zhesi Shen 《Journal of Data and Information Science》 CSCD 2024年第1期11-36,共26页
Purpose:Accurately assigning the document type of review articles in citation index databases like Web of Science(WoS)and Scopus is important.This study aims to investigate the document type assignation of review arti... Purpose:Accurately assigning the document type of review articles in citation index databases like Web of Science(WoS)and Scopus is important.This study aims to investigate the document type assignation of review articles in Web of Science,Scopus and Publisher’s websites on a large scale.Design/methodology/approach:27,616 papers from 160 journals from 10 review journal series indexed in SCI are analyzed.The document types of these papers labeled on journals’websites,and assigned by WoS and Scopus are retrieved and compared to determine the assigning accuracy and identify the possible reasons for wrongly assigning.For the document type labeled on the website,we further differentiate them into explicit review and implicit review based on whether the website directly indicates it is a review or not.Findings:Overall,WoS and Scopus performed similarly,with an average precision of about 99% and recall of about 80%.However,there were some differences between WoS and Scopus across different journal series and within the same journal series.The assigning accuracy of WoS and Scopus for implicit reviews dropped significantly,especially for Scopus.Research limitations:The document types we used as the gold standard were based on the journal websites’labeling which were not manually validated one by one.We only studied the labeling performance for review articles published during 2017-2018 in review journals.Whether this conclusion can be extended to review articles published in non-review journals and most current situation is not very clear.Practical implications:This study provides a reference for the accuracy of document type assigning of review articles in WoS and Scopus,and the identified pattern for assigning implicit reviews may be helpful to better labeling on websites,WoS and Scopus.Originality/value:This study investigated the assigning accuracy of document type of reviews and identified the some patterns of wrong assignments. 展开更多
关键词 document type Web of Science SCOPUS Review article
下载PDF
Hybrid Optimization Algorithm for Handwritten Document Enhancement
3
作者 Shu-Chuan Chu Xiaomeng Yang +2 位作者 Li Zhang Václav Snášel Jeng-Shyang Pan 《Computers, Materials & Continua》 SCIE EI 2024年第3期3763-3786,共24页
The Gannet Optimization Algorithm (GOA) and the Whale Optimization Algorithm (WOA) demonstrate strong performance;however, there remains room for improvement in convergence and practical applications. This study intro... The Gannet Optimization Algorithm (GOA) and the Whale Optimization Algorithm (WOA) demonstrate strong performance;however, there remains room for improvement in convergence and practical applications. This study introduces a hybrid optimization algorithm, named the adaptive inertia weight whale optimization algorithm and gannet optimization algorithm (AIWGOA), which addresses challenges in enhancing handwritten documents. The hybrid strategy integrates the strengths of both algorithms, significantly enhancing their capabilities, whereas the adaptive parameter strategy mitigates the need for manual parameter setting. By amalgamating the hybrid strategy and parameter-adaptive approach, the Gannet Optimization Algorithm was refined to yield the AIWGOA. Through a performance analysis of the CEC2013 benchmark, the AIWGOA demonstrates notable advantages across various metrics. Subsequently, an evaluation index was employed to assess the enhanced handwritten documents and images, affirming the superior practical application of the AIWGOA compared with other algorithms. 展开更多
关键词 Metaheuristic algorithm gannet optimization algorithm hybrid algorithm handwritten document enhancement
下载PDF
Multimodal Deep Neural Networks for Digitized Document Classification
4
作者 Aigerim Baimakhanova Ainur Zhumadillayeva +4 位作者 Bigul Mukhametzhanova Natalya Glazyrina Rozamgul Niyazova Nurseit Zhunissov Aizhan Sambetbayeva 《Computer Systems Science & Engineering》 2024年第3期793-811,共19页
As digital technologies have advanced more rapidly,the number of paper documents recently converted into a digital format has exponentially increased.To respond to the urgent need to categorize the growing number of d... As digital technologies have advanced more rapidly,the number of paper documents recently converted into a digital format has exponentially increased.To respond to the urgent need to categorize the growing number of digitized documents,the classification of digitized documents in real time has been identified as the primary goal of our study.A paper classification is the first stage in automating document control and efficient knowledge discovery with no or little human involvement.Artificial intelligence methods such as Deep Learning are now combined with segmentation to study and interpret those traits,which were not conceivable ten years ago.Deep learning aids in comprehending input patterns so that object classes may be predicted.The segmentation process divides the input image into separate segments for a more thorough image study.This study proposes a deep learning-enabled framework for automated document classification,which can be implemented in higher education.To further this goal,a dataset was developed that includes seven categories:Diplomas,Personal documents,Journal of Accounting of higher education diplomas,Service letters,Orders,Production orders,and Student orders.Subsequently,a deep learning model based on Conv2D layers is proposed for the document classification process.In the final part of this research,the proposed model is evaluated and compared with other machine-learning techniques.The results demonstrate that the proposed deep learning model shows high results in document categorization overtaking the other machine learning models by reaching 94.84%,94.79%,94.62%,94.43%,94.07%in accuracy,precision,recall,F-score,and AUC-ROC,respectively.The achieved results prove that the proposed deep model is acceptable to use in practice as an assistant to an office worker. 展开更多
关键词 document categorization deep learning machine learning CLASSIFICATION DIGITIZATION
下载PDF
Pre-training transformer with dual-branch context content module for table detection in document images
5
作者 Yongzhi LI Pengle ZHANG +2 位作者 Meng SUN Jin HUANG Ruhan HE 《虚拟现实与智能硬件(中英文)》 EI 2024年第5期408-420,共13页
Background Document images such as statistical reports and scientific journals are widely used in information technology.Accurate detection of table areas in document images is an essential prerequisite for tasks such... Background Document images such as statistical reports and scientific journals are widely used in information technology.Accurate detection of table areas in document images is an essential prerequisite for tasks such as information extraction.However,because of the diversity in the shapes and sizes of tables,existing table detection methods adapted from general object detection algorithms,have not yet achieved satisfactory results.Incorrect detection results might lead to the loss of critical information.Methods Therefore,we propose a novel end-to-end trainable deep network combined with a self-supervised pretraining transformer for feature extraction to minimize incorrect detections.To better deal with table areas of different shapes and sizes,we added a dualbranch context content attention module(DCCAM)to high-dimensional features to extract context content information,thereby enhancing the network's ability to learn shape features.For feature fusion at different scales,we replaced the original 3×3 convolution with a multilayer residual module,which contains enhanced gradient flow information to improve the feature representation and extraction capability.Results We evaluated our method on public document datasets and compared it with previous methods,which achieved state-of-the-art results in terms of evaluation metrics such as recall and F1-score.https://github.com/Yong Z-Lee/TD-DCCAM. 展开更多
关键词 Table detection document image analysis TRANSFORMER Dilated convolution Deformable convolution Feature fusion
下载PDF
Impact of Laboratory Value Flowsheet in Electronic Health Record (EHR) Documentation Time
6
作者 Isabel Rosado Pogozelski 《Open Journal of Nursing》 2024年第1期40-50,共11页
Research on the use of EHR is contradictory since it presents contradicting results regarding the time spent documenting. There is research that supports the use of electronic records as a tool to speed documentation;... Research on the use of EHR is contradictory since it presents contradicting results regarding the time spent documenting. There is research that supports the use of electronic records as a tool to speed documentation;and research that found that it is time consuming. The purpose of this quantitative retrospective before-after project was to measure the impact of using the laboratory value flowsheet within the EHR on documentation time. The research question was: “Does the use of a laboratory value flowsheet in the EHR impact documentation time by primary care providers (PCPs)?” The theoretical framework utilized in this project was the Donabedian Model. The population in this research was the two PCPs in a small primary care clinic in the northwest of Puerto Rico. The sample was composed of all the encounters during the months of October 2019 and December 2019. The data was obtained through data mining and analyzed using SPSS 27. The evaluative outcome of this project is that there is a decrease in documentation time after implementation of the use of the laboratory value flowsheet in the EHR. However, patients per day increase therefore having an impact on the number of patients seen per day/week/month. The implications for clinical practice include the use of templates to improve workflow and documentation as well as decreasing documentation time while also increasing the number of patients seen per day. . 展开更多
关键词 Electronic Health Record EHR Laboratory Results Template documentation Time
下载PDF
Research and Analysis of Grammatical Error Correction Technology for Chinese Documents
7
作者 Wei Jin Feng Jiang +2 位作者 Xiulai Wang Ningling Ma Yutao Zhang 《Journal of Computer and Communications》 2024年第8期202-223,共22页
With the widespread use of Chinese globally, the number of Chinese learners has been increasing, leading to various grammatical errors among beginners. Additionally, as domestic efforts to develop industrial informati... With the widespread use of Chinese globally, the number of Chinese learners has been increasing, leading to various grammatical errors among beginners. Additionally, as domestic efforts to develop industrial information grow, electronic documents have also proliferated. When dealing with numerous electronic documents and texts written by Chinese beginners, manually written texts often contain hidden grammatical errors, posing a significant challenge to traditional manual proofreading. Correcting these grammatical errors is crucial to ensure fluency and readability. However, certain special types of text grammar or logical errors can have a huge impact, and manually proofreading a large number of texts individually is clearly impractical. Consequently, research on text error correction techniques has garnered significant attention in recent years. The advent and advancement of deep learning have paved the way for sequence-to-sequence learning methods to be extensively applied to the task of text error correction. This paper presents a comprehensive analysis of Chinese text grammar error correction technology, elaborates on its current research status, discusses existing problems, proposes preliminary solutions, and conducts experiments using judicial documents as an example. The aim is to provide a feasible research approach for Chinese text error correction technology. 展开更多
关键词 Chinese Text Error Judicial documents Neural Network Deep Learning TRANSFORMER
下载PDF
Semantic Document Layout Analysis of Handwritten Manuscripts
8
作者 Emad Sami Jaha 《Computers, Materials & Continua》 SCIE EI 2023年第5期2805-2831,共27页
A document layout can be more informative than merely a document’s visual and structural appearance.Thus,document layout analysis(DLA)is considered a necessary prerequisite for advanced processing and detailed docume... A document layout can be more informative than merely a document’s visual and structural appearance.Thus,document layout analysis(DLA)is considered a necessary prerequisite for advanced processing and detailed document image analysis to be further used in several applications and different objectives.This research extends the traditional approaches of DLA and introduces the concept of semantic document layout analysis(SDLA)by proposing a novel framework for semantic layout analysis and characterization of handwritten manuscripts.The proposed SDLA approach enables the derivation of implicit information and semantic characteristics,which can be effectively utilized in dozens of practical applications for various purposes,in a way bridging the semantic gap and providingmore understandable high-level document image analysis and more invariant characterization via absolute and relative labeling.This approach is validated and evaluated on a large dataset ofArabic handwrittenmanuscripts comprising complex layouts.The experimental work shows promising results in terms of accurate and effective semantic characteristic-based clustering and retrieval of handwritten manuscripts.It also indicates the expected efficacy of using the capabilities of the proposed approach in automating and facilitating many functional,reallife tasks such as effort estimation and pricing of transcription or typing of such complex manuscripts. 展开更多
关键词 Semantic characteristics semantic labeling document layout analysis semantic document layout analysis handwritten manuscripts clustering RETRIEVAL image processing computer vision machine learning
下载PDF
Local-to-Global Causal Reasoning for Cross-Document Relation Extraction
9
作者 Haoran Wu Xiuyi Chen +3 位作者 Zefa Hu Jing Shi Shuang Xu Bo Xu 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2023年第7期1608-1621,共14页
Cross-document relation extraction(RE),as an extension of information extraction,requires integrating information from multiple documents retrieved from open domains with a large number of irrelevant or confusing nois... Cross-document relation extraction(RE),as an extension of information extraction,requires integrating information from multiple documents retrieved from open domains with a large number of irrelevant or confusing noisy texts.Previous studies focus on the attention mechanism to construct the connection between different text features through semantic similarity.However,similarity-based methods cannot distinguish valid information from highly similar retrieved documents well.How to design an effective algorithm to implement aggregated reasoning in confusing information with similar features still remains an open issue.To address this problem,we design a novel local-toglobal causal reasoning(LGCR)network for cross-document RE,which enables efficient distinguishing,filtering and global reasoning on complex information from a causal perspective.Specifically,we propose a local causal estimation algorithm to estimate the causal effect,which is the first trial to use the causal reasoning independent of feature similarity to distinguish between confusing and valid information in cross-document RE.Furthermore,based on the causal effect,we propose a causality guided global reasoning algorithm to filter the confusing information and achieve global reasoning.Experimental results under the closed and the open settings of the large-scale dataset Cod RED demonstrate our LGCR network significantly outperforms the state-ofthe-art methods and validate the effectiveness of causal reasoning in confusing information processing. 展开更多
关键词 Causal reasoning cross document graph reasoning relation extraction(RE)
下载PDF
Restoration of Folk Document Covers from the Qing Dynasty to the Republic of China
10
作者 Jiao Yao Fujiang Geng +1 位作者 Huanhuan Wang Yunpei Lu 《Paper And Biomaterials》 CAS 2023年第3期66-74,共9页
The covers of booklets and books in folk documents primarily serve to protect the pages.Owing to long-term storage limitations,a considerable number of book covers have suffered varying degrees of damage.Following the... The covers of booklets and books in folk documents primarily serve to protect the pages.Owing to long-term storage limitations,a considerable number of book covers have suffered varying degrees of damage.Following the principles of restoration,a comparative analysis and restoration of folk document covers were conducted,selecting four different types of carriers from the Taihang Mountain Documents,ranging from the Qing dynasty to the Republican Era.These carriers included hemp,mulberry bark,and machinemade paper,and cotton blue cloth.Each cover type was matched with an appropriate restoration paper,and different methods were employed during the restoration process.Through restoration,the previously damaged document covers can continue to fulfill their role in protecting the books,thereby extending the lifespan of these four folk documents. 展开更多
关键词 RESTORATION folk documents COVERS
下载PDF
Functions of Karez to Xinjiang Agriculture in the Qing Dynasty from the Perspective of Historical Documents
11
作者 Danyang GONG 《Asian Agricultural Research》 2023年第3期70-71,共2页
Desertification is increasingly serious in Xinjiang,and the construction of water conservancy is a precondition for the development of agriculture.The main project for the development of agriculture and water conserva... Desertification is increasingly serious in Xinjiang,and the construction of water conservancy is a precondition for the development of agriculture.The main project for the development of agriculture and water conservancy in Xinjiang is to build Karez,which played a vital role in the development of Xinjiang agriculture in the Qing Dynasty.It has been recorded many times in historical documents of the Qing Dynasty,such as Lin Zexu s Diary,Tao Baolian s Diary,Xinjiang Atlas and Zuo Zongtang s Memorial to the Emperor,etc.,which recorded the situation and historical origin of Karez.Karez made a significant contribution to the development of agriculture in the Qing Dynasty.It increased the cultivated land in Xinjiang at that time,and increased the types and yields of crops.It is conducive to the stability and development of Xinjiang s economy.Until today,Karez is still an important water source for agricultural irrigation in Xinjiang. 展开更多
关键词 KAREZ Historical documents in the Qing Dynasty Xinjiang agriculture
下载PDF
Human Rights in Civil Judicial Documents:Conception and Function
12
作者 郑若瀚 《The Journal of Human Rights》 2023年第4期851-868,共18页
Traditional human rights theory tends to hold that human rights should be aimed at defending public authority and that the legal issue of human rights is a matter of public law.However,the development of human rights ... Traditional human rights theory tends to hold that human rights should be aimed at defending public authority and that the legal issue of human rights is a matter of public law.However,the development of human rights concepts and practices is not just confined to this.A textual search shows that the term“human rights”exists widely in China’s civil judicial documents.Among the 3,412 civil judicial documents we researched,the concept of“human rights”penetrates all kinds of disputes in lawsuits,ranging from property rights,contracts,labor,and torts to marital property,which is embedded in both the claims of the parties concerned and the reasoning of judges.Human rights have become the discourse and yardstick for understanding and evaluating social behavior.The widespread use of the term“human rights”in civil judicial documents reflects at least three concepts related to human rights:first,the rights to subsistence and development are the primary basic human rights;second,the judicial protection of human rights is a bottom-line guarantee;third,the protection of human rights aims to achieve equal rights.Today,judges quote the theory of human rights in judicial judgments from time to time,evidencing that human rights have a practical function in judicial adjudication activities,and in practice this is mainly manifested in declaring righteous values and strengthening arguments with the values and ideas related to human rights,using the provisions concerning human rights in the Constitution to interpret the constitutionality,and using the principles of human rights to interpret blurred rules and rank the importance of different rights. 展开更多
关键词 human rights concept of human rights civil judicature judicial documents judicial reasons
下载PDF
Fostering Critical Thinking Skills in the EFL College-Level Classroom Through Online Collaborative Document Tools
13
作者 JIANG Fangzhou 《Sino-US English Teaching》 2023年第9期365-369,共5页
This paper explores the potential of applying online collaborative documents to foster critical thinking skills in EFL college-level classrooms.Considering the limitations of traditional teacher-centered approaches an... This paper explores the potential of applying online collaborative documents to foster critical thinking skills in EFL college-level classrooms.Considering the limitations of traditional teacher-centered approaches and the need for innovative methods,the study examines the integration of online collaborative tools,using Tencent Docs as an example.The discussion highlights the importance of critical thinking in the academic and professional spheres and introduces the concept of online collaborative documents for enhancing this cognitive skill.Through a detailed exploration,the paper presents a model of employing collaborative documents within a college English class,demonstrating how students collaboratively learning an article.Then,the paper discusses the pros and cons of employing this technology in classroom.The conclusion emphasizes the transformative potential of integrating technology into pedagogy and its role in creating a dynamic learning environment.The paper underscores the importance of striking a balance between technology and traditional methods,foreseeing avenues for further research and development. 展开更多
关键词 critical thinking skills online collaborative document tools EFL at college level
下载PDF
Fully Automated Paper Document Sorting Robot Design
14
作者 Guo-Long Yang Biao-Hua Zhang 《Journal of Electronic Research and Application》 2023年第6期1-9,共9页
A fully automated paper document sorting robot was developed in this project.This robot classifies documents efficiently and accurately.The objective of this project was to improve the efficiency of classifying or sor... A fully automated paper document sorting robot was developed in this project.This robot classifies documents efficiently and accurately.The objective of this project was to improve the efficiency of classifying or sorting paper documents,reduce costs,and save time.The robot can classify documents according to user-defined rules,such as keywords,dates,serial numbers,bar codes,and the meaning of paragraphs.Since it can classify or sort documents intelligently,it can complete large-scale document classification quickly.The robot is constructed using an aluminum profile to create a box-type truss gantry structure frame.It was built on the LubanCat 4 motherboard and controlled through Python language programming.Driven by a stepper motor to move the manipulator.The camera module is combined with an artificial intelligence algorithm to recognize paper in real time,and the text is recognized after taking pictures of the paper.The sorting function is performed by several sensors.In addition,a web-based human-computer interaction platform was developed using the Flask web framework in Python.Users could access this platform in a variety of ways,allowing them to easily and swiftly configure parameters and send operational instructions to perform various functions. 展开更多
关键词 Paper documents Sorting robot PYTHON Human-computer interaction
下载PDF
Documentation Concordance,Sharing and Utilization of Tea Germplasm Resources in Yunnan 被引量:3
15
作者 刘本英 宋维希 +6 位作者 孙雪梅 蒋会兵 马玲 矣兵 季鹏章 汪云刚 王平盛 《Agricultural Science & Technology》 CAS 2011年第12期1842-1848,共7页
In this paper,the research achievements and progress of Yunnan tea germplasm resource in past sixty years are systematically reviewed from the following aspects:exploration,collecting,conservation,protection,identifi... In this paper,the research achievements and progress of Yunnan tea germplasm resource in past sixty years are systematically reviewed from the following aspects:exploration,collecting,conservation,protection,identification,evaluation and shared utilization.Simultaneously,the current problems and the suggestions about subsequent development of tea germplasm resources in Yunnan were discussed,including superior and rare germplasm collection,tea genetic diversity research,biotechnology utilization in tea germplasm innovation,super gene exploration and function,the construction of utilization platform,biological base of species and population conservation. 展开更多
关键词 YUNNAN Tea germplasm resource documentation Concordance SHARING UTILIZATION
下载PDF
Document classification approach by rough-set-based corner classification neural network 被引量:1
16
作者 张卫丰 徐宝文 +1 位作者 崔自峰 徐峻岭 《Journal of Southeast University(English Edition)》 EI CAS 2006年第3期439-444,共6页
A rough set based corner classification neural network, the Rough-CC4, is presented to solve document classification problems such as document representation of different document sizes, document feature selection and... A rough set based corner classification neural network, the Rough-CC4, is presented to solve document classification problems such as document representation of different document sizes, document feature selection and document feature encoding. In the Rough-CC4, the documents are described by the equivalent classes of the approximate words. By this method, the dimensions representing the documents can be reduced, which can solve the precision problems caused by the different document sizes and also blur the differences caused by the approximate words. In the Rough-CC4, a binary encoding method is introduced, through which the importance of documents relative to each equivalent class is encoded. By this encoding method, the precision of the Rough-CC4 is improved greatly and the space complexity of the Rough-CC4 is reduced. The Rough-CC4 can be used in automatic classification of documents. 展开更多
关键词 document classification neural network rough set meta search engine
下载PDF
论学界对“record”与“document”翻译之争 被引量:1
17
作者 周莉莉 张伟斌 《办公室业务》 2017年第10期114-115,共2页
随着我国档案学界的发展,我们越来越注重国内外档案学界的融合。故本文通过对学界"record"和"document"观点的梳理,以期加强我国与国外档案学界的融合,从而促使我国档案事业的进一步发展。
关键词 RECORD document 文件 文档
下载PDF
INFORMATION RETRIEVAL FOR SHORT DOCUMENTS 被引量:2
18
作者 Qi Haoliang Li Mu +1 位作者 Gao Jianfeng Li Sheng 《Journal of Electronics(China)》 2006年第6期933-936,共4页
The major problem of the most current approaches of information models lies in that individual words provide unreliable evidence about the content of the texts. When the document is short, e.g. only the abstract is av... The major problem of the most current approaches of information models lies in that individual words provide unreliable evidence about the content of the texts. When the document is short, e.g. only the abstract is available, the word-use variability problem will have substantial impact on the Information Retrieval (IR) performance. To solve the problem, a new technology to short document retrieval named Reference Document Model (RDM) is put forward in this letter. RDM gets the statistical semantic of the query/document by pseudo feedback both for the query and document from reference documents. The contributions of this model are three-fold: (1) Pseudo feedback both for the query and the document; (2) Building the query model and the document model from reference documents; (3) Flexible indexing units, which can be ally linguistic elements such as documents, paragraphs, sentences, n-grams, term or character. For short document retrieval, RDM achieves significant improvements over the classical probabilistic models on the task of ad hoc retrieval on Text REtrieval Conference (TREC) test sets. Results also show that the shorter the document, the better the RDM performance. 展开更多
关键词 Information retrieval Short documents Reference document Model (RDM)
下载PDF
Novel Adaptive Binarization Method for Degraded Document Images 被引量:1
19
作者 Siti Norul Huda Sheikh Abdullah Saad M.Ismail +1 位作者 Mohammad Kamrul Hasan Palaiahnakote Shivakumara 《Computers, Materials & Continua》 SCIE EI 2021年第6期3815-3832,共18页
Achieving a good recognition rate for degraded document images is difficult as degraded document images suffer from low contrast,bleedthrough,and nonuniform illumination effects.Unlike the existing baseline thresholdi... Achieving a good recognition rate for degraded document images is difficult as degraded document images suffer from low contrast,bleedthrough,and nonuniform illumination effects.Unlike the existing baseline thresholding techniques that use fixed thresholds and windows,the proposed method introduces a concept for obtaining dynamic windows according to the image content to achieve better binarization.To enhance a low-contrast image,we proposed a new mean histogram stretching method for suppressing noisy pixels in the background and,simultaneously,increasing pixel contrast at edges or near edges,which results in an enhanced image.For the enhanced image,we propose a new method for deriving adaptive local thresholds for dynamic windows.The dynamic window is derived by exploiting the advantage of Otsu thresholding.To assess the performance of the proposed method,we have used standard databases,namely,document image binarization contest(DIBCO),for experimentation.The comparative study on well-known existing methods indicates that the proposed method outperforms the existing methods in terms of quality and recognition rate. 展开更多
关键词 Global and local thresholding adaptive binarization degraded document image image histogram document image binarization contest
下载PDF
DYNAMIC ENGINEERING DOCUMENT MANAGEMENT BASED ON XML TECHNOLOGY
20
作者 翟建军 陈文亮 丁秋林 《Transactions of Nanjing University of Aeronautics and Astronautics》 EI 2005年第1期38-41,共4页
The eXtensible markup language (XML) is a kind of new meta language for replacing HTML and has many advantages. Traditional engineering documents have too many expression forms to be expediently managed and have no dy... The eXtensible markup language (XML) is a kind of new meta language for replacing HTML and has many advantages. Traditional engineering documents have too many expression forms to be expediently managed and have no dynamic correlation functions. This paper introduces a new method and uses XML to store and manage engineering documents to realize the format unity of engineering documents and their dynamic correlations. 展开更多
关键词 XML dynamic engineering document MANAGEMENT
下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部