Chinese Clinical Named Entity Recognition(CNER)is a crucial step in extracting medical information and is of great significance in promoting medical informatization.However,CNER poses challenges due to the specificity...Chinese Clinical Named Entity Recognition(CNER)is a crucial step in extracting medical information and is of great significance in promoting medical informatization.However,CNER poses challenges due to the specificity of clinical terminology,the complexity of Chinese text semantics,and the uncertainty of Chinese entity boundaries.To address these issues,we propose an improved CNER model,which is based on multi-feature fusion and multi-scale local context enhancement.The model simultaneously fuses multi-feature representations of pinyin,radical,Part of Speech(POS),word boundary with BERT deep contextual representations to enhance the semantic representation of text for more effective entity recognition.Furthermore,to address the model’s limitation of focusing just on global features,we incorporate Convolutional Neural Networks(CNNs)with various kernel sizes to capture multi-scale local features of the text and enhance the model’s comprehension of the text.Finally,we integrate the obtained global and local features,and employ multi-head attention mechanism(MHA)extraction to enhance the model’s focus on characters associated with medical entities,hence boosting the model’s performance.We obtained 92.74%,and 87.80%F1 scores on the two CNER benchmark datasets,CCKS2017 and CCKS2019,respectively.The results demonstrate that our model outperforms the latest models in CNER,showcasing its outstanding overall performance.It can be seen that the CNER model proposed in this study has an important application value in constructing clinical medical knowledge graph and intelligent Q&A system.展开更多
There was an evident increase in the number of earthquakes in the Xinfengjiang Reservoir from June to July 2014 after the landing of Typhoon Hagibis.To understand the spatial and temporal evolution of this microseismi...There was an evident increase in the number of earthquakes in the Xinfengjiang Reservoir from June to July 2014 after the landing of Typhoon Hagibis.To understand the spatial and temporal evolution of this microseismicity,we built a high-precision earthquake catalog for 2014 and relocated 2275 events using recently developed methods for event picking and catalog construction.Seismicity occurred in the southeastern part of the reservoir,with the preferred fault plane orientation aligned along the Heyuan Fault.The total seismic energy peaked when the typhoon passed through the reservoir,and seismicity correlated with typhoon energy.In contrast,a limited seismic response was observed during the later Typhoon Rammasun.Combining data regarding the water level in the Xinfengjiang Reservoir and seismicity frequency changes in the Taiwan region during these two typhoon events,we suggest that typhoon activity may increase microseism energy by impacting fault stability around the Xinfengjiang Reservoir.Whether a fault can be activated also depends on how close the stress accumulation is to its failure point.展开更多
基金This study was supported by the National Natural Science Foundation of China(61911540482 and 61702324).
文摘Chinese Clinical Named Entity Recognition(CNER)is a crucial step in extracting medical information and is of great significance in promoting medical informatization.However,CNER poses challenges due to the specificity of clinical terminology,the complexity of Chinese text semantics,and the uncertainty of Chinese entity boundaries.To address these issues,we propose an improved CNER model,which is based on multi-feature fusion and multi-scale local context enhancement.The model simultaneously fuses multi-feature representations of pinyin,radical,Part of Speech(POS),word boundary with BERT deep contextual representations to enhance the semantic representation of text for more effective entity recognition.Furthermore,to address the model’s limitation of focusing just on global features,we incorporate Convolutional Neural Networks(CNNs)with various kernel sizes to capture multi-scale local features of the text and enhance the model’s comprehension of the text.Finally,we integrate the obtained global and local features,and employ multi-head attention mechanism(MHA)extraction to enhance the model’s focus on characters associated with medical entities,hence boosting the model’s performance.We obtained 92.74%,and 87.80%F1 scores on the two CNER benchmark datasets,CCKS2017 and CCKS2019,respectively.The results demonstrate that our model outperforms the latest models in CNER,showcasing its outstanding overall performance.It can be seen that the CNER model proposed in this study has an important application value in constructing clinical medical knowledge graph and intelligent Q&A system.
基金Strategic Priority Research Program(B)of the Chinese Academy of Sciences(No.XDB42020304)National Natural Science Foundation of China(No.42074059).
文摘There was an evident increase in the number of earthquakes in the Xinfengjiang Reservoir from June to July 2014 after the landing of Typhoon Hagibis.To understand the spatial and temporal evolution of this microseismicity,we built a high-precision earthquake catalog for 2014 and relocated 2275 events using recently developed methods for event picking and catalog construction.Seismicity occurred in the southeastern part of the reservoir,with the preferred fault plane orientation aligned along the Heyuan Fault.The total seismic energy peaked when the typhoon passed through the reservoir,and seismicity correlated with typhoon energy.In contrast,a limited seismic response was observed during the later Typhoon Rammasun.Combining data regarding the water level in the Xinfengjiang Reservoir and seismicity frequency changes in the Taiwan region during these two typhoon events,we suggest that typhoon activity may increase microseism energy by impacting fault stability around the Xinfengjiang Reservoir.Whether a fault can be activated also depends on how close the stress accumulation is to its failure point.