As a major resource and tool for language learning, learner’s dictionaries provide ample information for L2 writing, which serves as a good guide for peer feedback. Hence, learner’s dictionaries are an indispensable part of scaffolding in the L2 writing feedback system. However, the effects of dictionary use in L2 writing have long been ignored in both L2 writing pedagogy and learner lexicography. Applying the concept of “scaffolding” to peer feedback as the theoretical framework, this study first clarifies three distinct types of scaffolding information presented in current English learner’s dictionaries, and then investigates EFL learners’ perception and practical use of scaffolding information in their English writing. Results show that most EFL learners have positive attitudes towards scaffolding information and its role in motivating effective feedback in English writing. However, their practical use of such information is unsatisfactory owing to their inadequate skills and knowledge of dictionary use. This points to a strong need for a dictionary use course in universities, which would help raise EFL learners’ dictionary use efficiency and improve English teachers’ lexicographical expertise in English writing pedagogy.
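Traditional Chinese medicine (TCM) materia medica literature contains a wealth of TCM knowledge and is an important carrier for research on TCM theory. To better mine materia medica knowledge and accurately perform named entity recognition on this literature, a feature-enhanced Bert-BiGRU-CRF model for TCM materia medica named entity recognition is proposed. A feature fuser concatenates the word vectors generated by Bert with entity features to form the input, a bi-directional gated recurrent unit (BiGRU) serves as the feature extractor, and conditional random fields (CRF) perform label prediction. Through feature enhancement, the model better recognizes materia medica entities such as drug name, drug nature, drug flavor, and meridian tropism, together with their boundary information, and so completes the named entity recognition task. Experimental results on a TCM materia medica dataset show that the model with fused features reaches an F1 score of 90.54%, demonstrating that the proposed method improves the accuracy of TCM materia medica named entity recognition.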
Chinese word segmentation is the basis of natural language processing. The dictionary mechanism significantly influences both the efficiency of word segmentation and the understanding of the user’s intention implied in the user’s query. Because traditional dictionary mechanisms cannot meet the needs of personalized mobile search, this paper presents a new dictionary mechanism that contains word classification information. The paper also puts forward an approach for improving the traditional word bank structure and proposes an improved FMM segmentation algorithm. The results show that the new dictionary mechanism significantly improves query efficiency and better meets the user’s individual requirements.
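To make the idea of a dictionary that carries word classification information concrete, here is a minimal sketch of plain forward maximum matching, the usual expansion of FMM. It is not the improved algorithm or word bank structure from the paper, which the abstract does not describe. The function name `fmm_segment`, the category labels, and the sample entries are all hypothetical.

```python
from typing import Dict, List, Optional, Tuple


def fmm_segment(
    text: str, dictionary: Dict[str, str]
) -> List[Tuple[str, Optional[str]]]:
    """Segment `text` by plain forward maximum matching.

    `dictionary` maps each known word to a category label (hypothetical
    stand-in for the "word classification information" in the abstract).
    Returns (word, category) pairs; unknown single characters get None.
    """
    # Longest dictionary entry bounds the window we need to try.
    max_len = max((len(word) for word in dictionary), default=1)
    tokens: List[Tuple[str, Optional[str]]] = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, then shrink the window.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if piece in dictionary or size == 1:
                # A single character is emitted even if unknown,
                # so the scan always advances.
                tokens.append((piece, dictionary.get(piece)))
                i += size
                break
    return tokens


# Hypothetical sample dictionary with category labels.
sample_dictionary = {
    "北京": "place",
    "大学": "organization",
    "北京大学": "organization",
    "生活": "general",
}

segments = fmm_segment("北京大学生活", sample_dictionary)
```

Because the longest match wins, "北京大学" is kept as one word rather than split into "北京" and "大学". The category attached to each token is the kind of information a personalized search layer could use to interpret a query.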
Hashing and Trie data structures are among the preeminent data mining techniques considered for ideal search. Hashing techniques have an amortized time complexity of O(1), although in the worst case searching a hash table can take as much as Θ(n) time [1]. The Trie is also a well-known data structure: the ideal lookup time for searching a string of length m in a database of n strings using a Trie is O(m) [2]. In the present study, we propose a novel Prime Box parallel search algorithm for searching a string of length m in a dictionary of dynamically increasing size, with a worst-case search time complexity of O(log₂ m). We exploit parallel techniques over this algorithm to achieve this search time complexity. Prime Box search is also independent of the total number of words in the dictionary, which makes it more suitable for dynamic dictionaries of increasing size.
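The two baselines mentioned above can be illustrated with a short sketch: a Trie whose lookup walks one node per character, which gives the O(m) bound for a string of length m, next to a Python `set`, which is a hash table with average O(1) membership tests. This is not the Prime Box algorithm, whose details the abstract does not give. The class and method names are hypothetical.

```python
class TrieNode:
    """One node per character position; children are keyed by character."""

    __slots__ = ("children", "is_word")

    def __init__(self) -> None:
        self.children = {}
        self.is_word = False


class Trie:
    """Minimal Trie supporting insertion and exact-match lookup."""

    def __init__(self) -> None:
        self.root = TrieNode()

    def insert(self, word: str) -> None:
        node = self.root
        for ch in word:
            # Create the child node on first use.
            node = node.children.setdefault(ch, TrieNode())
        node.is_word = True

    def contains(self, word: str) -> bool:
        # Visits at most len(word) nodes, independent of how many
        # words are stored: this is the O(m) lookup.
        node = self.root
        for ch in word:
            node = node.children.get(ch)
            if node is None:
                return False
        return node.is_word


words = ["car", "card", "care"]

trie = Trie()
for w in words:
    trie.insert(w)

# Hash-based baseline: average O(1) membership test.
hashed = set(words)
```

Both structures answer exact-match queries. The Trie additionally shares prefixes between stored words, which is why "ca" is reachable in the tree but is not reported as a word.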