摘要
法文本语义接受度(SAS)研究属于基于语料库的SAS研究分支。以等距离系统随机抽样方法进行对比实验。抽取公式为A+BX≤C,A为起始页码,B为抽取间距,C为文本总页码,X为可取页数集。当以L’Assommoi(r1953版)为语料,B∈(10;5;4;3;2;1)时,词句抽取率(SR)为9.55%,20.29%,25.13%,33.13%,50.36%和100%,SAS为0.0897,0.0841,0.0854,0.0847,0.0848和0.0854。依次攀升SR没有带来SAS显著变化。说明基于英日语料的SAS评价体系也适用于法文本,便于文学评论家对法文本进行量化评析。
The study on Semantic Accessibility Scale(SAS) based on French corpus comes under the umbrella of corpus-involved SAS research.Systematic random sampling skill is used here to discuss the comparison of six different groups extracted from one text.The formula,A +BX≤C, is provided to equidistantly extract L' Assommoir( 1953 ed.)."A "means the start page ; "B", the space of extraction; "C", the total number of text; "X", the set of available pages.lf the condition is B e ( 10; 5 ; 4; 3 ; 2; 1 ), sampling ratios (SR)are 9.55%,20.29%,25.13%,33.13%,50.36%,100%,correspondingly bringing different values of SAS,namely,0.0897,0.084 1, 0.085 4,0.0847,0.084 8,0.085 4.The non-relevance between SR and SAS establishes the fact that the system of SAS evaluation based on English and Japanese corpora can safely extend to French corpus,which helps the critics to analyze French texts by computer.
出处
《计算机工程与应用》
CSCD
北大核心
2009年第29期125-127,179,共4页
Computer Engineering and Applications
基金
国家社会科学项目(No.08BYY046)
山东省社会科学规划项目(No.07CWXJ03)
关键词
文学文本
法语语料库
语义接受度
抽取率
literary texts
French corpus
semantic accessibility scale
sampling ratio