期刊文献+
共找到1篇文章
< 1 >
每页显示 20 50 100
LAME:Layout-Aware Metadata Extraction Approach for Research Articles
1
作者 jongyun choi Hyesoo Kong +2 位作者 Hwamook Yoon Heungseon Oh Yuchul Jung 《Computers, Materials & Continua》 SCIE EI 2022年第8期4019-4037,共19页
The volume of academic literature,such as academic conference papers and journals,has increased rapidly worldwide,and research on metadata extraction is ongoing.However,high-performing metadata extraction is still cha... The volume of academic literature,such as academic conference papers and journals,has increased rapidly worldwide,and research on metadata extraction is ongoing.However,high-performing metadata extraction is still challenging due to diverse layout formats according to journal publishers.To accommodate the diversity of the layouts of academic journals,we propose a novel LAyout-aware Metadata Extraction(LAME)framework equipped with the three characteristics(e.g.,design of automatic layout analysis,construction of a large meta-data training set,and implementation of metadata extractor).In the framework,we designed an automatic layout analysis using PDF Miner.Based on the layout analysis,a large volume of metadata-separated training data,including the title,abstract,author name,author affiliated organization,and keywords,were automatically extracted.Moreover,we constructed a pre-trainedmodel,Layout-Meta BERT,to extract the metadata from academic journals with varying layout formats.The experimental results with our metadata extractor exhibited robust performance(Macro-F1,93.27%)in metadata extraction for unseen journals with different layout formats. 展开更多
关键词 Automatic layout analysis layout-MetaBERT metadata extraction research article
下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部