Text segmentation of health examination item based on character statistics and information measurement 被引量：1

Text segmentation of health examination item based on character statistics and information measurement

下载PDF

导出

摘要 This study explores the segmentation algorithm of item examination. In the specific implementation, a large amount of h method of character statistics, the connection tightness values text data, especially of sing storical health examination e long length data in health data is analysed. Using the TABS between two adjacent characters are calculated Three parameters, the candidate number N, the best position BP, and balance weight BW are set. The total segmentation indexes Sis are calculated, thus determined the segmentation position Pos. The optimal parameter values are determined by the method of information measurement. Experimental results show that the accuracy rate is 78.6% and reaches 82.9% in the most frequently appeared text item. The complexity of the algorithm is O（n）. Using no existing domain knowledge, it is very simple and fast. By executed repeatedly, it is convenient to obtain the characteristics of each single item of text data, furthermore, to distinguish respective express preference of different physicians to the same item. The assumption is verified that without professional domain knowledge, a large amount of historical data can provide valuable clues for the text understanding. The results of this research are being applied and verified in the following research works in the field of health examination.

作者 Hui An Dahui Wang Zhigeng Pan Meiling Chen Xinting Wang Hui An;Dahui Wang;Zhigeng Pan;Meiling Chen;Xinting Wang(DigitalMedia & Interaction Research Center, Hangzhou Normal University, Wenzhou People's Hospital, Wenzhou 325000, People's Republic of China;Department of Health Examination, Hangzhou Normal University, Hangzhou, People's Republic of China;Institute of Industrial VR, Foshan University, Guangdong, People's Republic of China)

机构地区 DigitalMedia & Interaction Research Center Department of Health Examination Institute of Industrial VR

出处《CAAI Transactions on Intelligence Technology》 2018年第1期28-32,共5页 智能技术学报（英文）

关键词分割算法字符医疗卫生行业人工智能

分类号 TP317.2 [自动化与计算机技术—计算机软件与理论] R197 [医药卫生—卫生事业管理]

引文网络
相关文献

同被引文献19

1徐林明,李美娟.动态综合评价中的数据预处理方法研究[J].中国管理科学,2020,0(1):162-169. 被引量：52
2关文玲,王少莉,朱晓莉.基于支持向量机的化工企业安全预警模型研究[J].天津理工大学学报,2017,33(4):16-20. 被引量：7
3陈孝慈,谭章禄,单斐,高青.基于Bigram的安全隐患文本分类研究[J].中国安全科学学报,2017,27(8):156-161. 被引量：10
4刘文,赵挺生,张亚静,陈昱锟,周炜.地铁盾构施工安全风险规律分析与对策[J].中国安全科学学报,2017,27(10):130-136. 被引量：22
5唐凯,田水承,李红霞,杨鹏飞.企业安全文化培训效能路径研究[J].安全与环境学报,2019,19(5):1638-1642. 被引量：6
6翟美佳.企业事故风险管控的经济学分析——评《安全经济学》[J].中国安全科学学报,2019,29(11):192-192. 被引量：2
7杨弘,田晶,王可,张青,韩清华,张岩波.混合型缺失数据填补方法比较与应用[J].中国卫生统计,2020,37(3):395-399. 被引量：14
8周泽人,舒印彪,董存,梁志峰,王铮,陈敏.基于混合威布尔分布的风能资源分布统计分析研究[J].数理统计与管理,2020,39(4):584-594. 被引量：9
9李珏,王幼芳.基于文本挖掘的建筑施工高处坠落事故致因网络分析[J].安全与环境学报,2020,20(4):1284-1290. 被引量：34
10冉连,张曦.地方政府数据开放中的数据安全政策研究——基于全国33个地级市政策文本的内容分析[J].情报杂志,2020,39(11):96-103. 被引量：26

引证文献1

1段在鹏,张灿,谢汉青,王寓霖,李帆.面向“数值-文本”大安全数据的企业风险分析[J].安全与环境学报,2022,22(6):3164-3173.

1金镭,鞠贤玮,郭旭.基于专利分析研究中国石油行业发展现状[J].现代化工,2018,38(11):12-17. 被引量：5
2朱慧,田容雨.Vsftp在实验室运维和实践教学中的应用与研究[J].科技创新导报,2018,15(12):227-228.
3彭圳生,巩青歌,高志强,段妍羽,曾子贤.基于密度及文本特征的新闻标题抽取算法[J].中文信息学报,2018,32(10):78-86. 被引量：6
4Samir Amin.The Communist Manifesto,170 Years Later[J].学术界,2018(11):214-227.
5Ming Wang,Ting Xu,Yanli Zhu,Wenhong Yin,Hong Guo,Ertuan Zhao,Xiaoying Fang,Weiguo Wang.Evolution of interface character distribution in duplex stainless steel processed by cross-rolling and annealing[J].Journal of Materials Science & Technology,2018,34(11):2160-2166.
6王学锋,杨若鹏,朱巍.基于深度学习的军事命名实体识别方法[J].装甲兵工程学院学报,2018,32(4):94-98. 被引量：23
7Ryuichi Morishita.World Journal of Hypertension:A new bench mark in the hypertension world[J].World Journal of Hypertension,2011,1(1):1-2.
8晏丽.The Archetypal Natural Man-Joe Gargery[J].海外英语,2018(22):192-193.

CAAI Transactions on Intelligence Technology

2018年第1期

浏览历史

内容加载中请稍等...

Text segmentation of health examination item based on character statistics and information measurement 被引量：1

同被引文献19

引证文献1

相关作者

相关机构

相关主题

浏览历史