With the rapid development of information technology,the electronifi-cation of medical records has gradually become a trend.In China,the population base is huge and the supporting medical institutions are numerous,so ...With the rapid development of information technology,the electronifi-cation of medical records has gradually become a trend.In China,the population base is huge and the supporting medical institutions are numerous,so this reality drives the conversion of paper medical records to electronic medical records.Electronic medical records are the basis for establishing a smart hospital and an important guarantee for achieving medical intelligence,and the massive amount of electronic medical record data is also an important data set for conducting research in the medical field.However,electronic medical records contain a large amount of private patient information,which must be desensitized before they are used as open resources.Therefore,to solve the above problems,data masking for Chinese electronic medical records with named entity recognition is proposed in this paper.Firstly,the text is vectorized to satisfy the required format of the model input.Secondly,since the input sentences may have a long or short length and the relationship between sentences in context is not negligible.To this end,a neural network model for named entity recognition based on bidirectional long short-term memory(BiLSTM)with conditional random fields(CRF)is constructed.Finally,the data masking operation is performed based on the named entity recog-nition results,mainly using regular expression filtering encryption and principal component analysis(PCA)word vector compression and replacement.In addi-tion,comparison experiments with the hidden markov model(HMM)model,LSTM-CRF model,and BiLSTM model are conducted in this paper.The experi-mental results show that the method used in this paper achieves 92.72%Accuracy,92.30%Recall,and 92.51%F1_score,which has higher accuracy compared with other models.展开更多
The China Conference on Knowledge Graph and Semantic Computing(CCKS)2020 Evaluation Task 3 presented clinical named entity recognition and event extraction for the Chinese electronic medical records.Two annotated data...The China Conference on Knowledge Graph and Semantic Computing(CCKS)2020 Evaluation Task 3 presented clinical named entity recognition and event extraction for the Chinese electronic medical records.Two annotated data sets and some other additional resources for these two subtasks were provided for participators.This evaluation competition attracted 354 teams and 46 of them successfully submitted the valid results.The pre-trained language models are widely applied in this evaluation task.Data argumentation and external resources are also helpful.展开更多
Objective: To obtain fundamental information for the standardization of herbal medicine in Korea. Methods: We analyzed the herbal medicine prescription data of patients at the Pusan National University Korean Medici...Objective: To obtain fundamental information for the standardization of herbal medicine in Korea. Methods: We analyzed the herbal medicine prescription data of patients at the Pusan National University Korean Medicine Hospital from March 2010 to February 2013. We used the Dongui-Bogam (Dong Yi Bao Jian) to classify prescribed herbal medicines. Results: The study revealed that the most frequently prescribed herbal medicine was ‘Liuwei Dihuang Pill (LWDHP, 六味地黄丸)' which was used for invigorating ‘Shen (Kidndy)-yin'. ‘LWDHP' was most frequently prescribed to male patients aged 50-59, 60-69, 70-79 and 80-89 years, and ‘Xionggui Tiaoxue Decoction (XGTXD, 芎归调血饮)' was most frequently prescribed to female patients aged 30-39 and 40-49 years. According to the International Classification of Diseases (ICD) codes,‘Diseases of the musculoskeletal system and connective tissue' showed the highest prevalence. ‘LWDHP' and 'XGTXD' was the most frequently prescribed in categories 5 and 3, respectively. Based on the percentage of prescriptions for each sex, ‘Ziyin Jianghuo Decoction (滋阴降火汤)' was prescribed to mainly male patients, and ‘XGTXD' with ‘Guima Geban Decoction (桂麻各半汤)' were prescribed to mainly female patients. Conclusion: This study analysis successfully determined the frequency of a variety of herbal medicines, and many restorative herbal medicines were identified and frequently administered.展开更多
The development of hospital information has been carried out for nearly 50 years, and originally started Le hospital information system (HIS)1 So far HIS isas the hospital information system (HIS)J So far HIS is t...The development of hospital information has been carried out for nearly 50 years, and originally started Le hospital information system (HIS)1 So far HIS isas the hospital information system (HIS)J So far HIS is the most widely and deeply used management system for hospitals in China.2 "General function standard for hospital information system" issued by China's Ministry of Health in 2002 defined that "The hospital information system refers to using of computer hardware and software technology, network communications technology, and other modem technology to comprehensively manage personnel, logistics, and finance in various departments in hospital. Gather, store, treat, extract, transport, aggregate,and process data in various stages of the medical activities, so that provide comprehensive and automatic information management and service to the hospital."展开更多
Background: The usage of modem technology in healthcare record system is now a must throughout the world. However, many doctors and nurses has been reporting facing numerous challenges and obstacles in the implementa...Background: The usage of modem technology in healthcare record system is now a must throughout the world. However, many doctors and nurses has been reporting facing numerous challenges and obstacles in the implementation. The aim of the present study is to determine the prevalence of depression, anxiety and stress among doctors and nurses who utilize EMR (electronic medical record) and its associated factor. Methods: A comparative cross-sectional study was conducted ~om January till April 2012 among doctors and nurses in two public tertiary hospitals in Johor in which one of them uses EMR and the other one still using the MMR (manual medical record) system. Data was collected using self-administered validated Malay version of DASS-21 (Depression, Anxiety, and Stress Scales-21) items questionnaire. It comprises of socio-demographic and occupational characteristics. Findings: There were 130 respondents with a response rate of 91% for EMR and 123 respondents with a response rate of 86% for MMR. The mean (SD) age of respondents in EMR and MMR groups were 34.7 (9.42) and 29.7 (6.15) respectively. The mean (SD) duration of respondents using EMR was 46.1 (35.83) months. The prevalence of depression, anxiety and stress among respondents using EMR were 6.9%, 25.4% and 12.3%. There were no significant difference between the study groups related to the depression, anxiety and stress scores. In multivariable analysis, the significant factors associated with depression among respondents using EMR was age (OR 1.10, 95% CI 1.02, 1.19). The significant factors associated with stress among respondents using EMR was marital status (OR 3.33, 95% CI 1.10, 10.09) and borderline significant was computer skill course (OR 2.94, 95% CI 0.98, 8.78). Conclusion: The prevalence of depression, anxiety and stress of those who uses EMR were within acceptable range. Age, marital status and computer skill are the identified factor associated with the depression and stress level which need to be considered in its implementation.展开更多
Introduction: Today, information technology is considered as an important national development principle in each country which is applied in different fields. Health care as a whole and the hospitals could be regarded...Introduction: Today, information technology is considered as an important national development principle in each country which is applied in different fields. Health care as a whole and the hospitals could be regarded as a field and organizations with most remarkable IT applications respectively. Although different benchmarks and frameworks have been developed to assess different aspects of Hospital Information Systems (HISs) by various researchers, there is not any suitable reference model yet to benchmark HIS in the world. Electronic Medical Record Adoption Model (EMRAM) has been currently presented and is globally well-known to benchmark the rate of HIS utilization in the hospitals. Notwithstanding, this model has not been introduced in Iran so far. Methods: This research was carried out based on an applied descriptive method in three private hospitals of Isfahan—one of the most important provinces of Iran—in the year 2015. The purpose of this study was to investigate IT utilization stage in three selected private hospitals. Conclusion: The findings revealed that HIS is not at the center of concern in studied hospitals and is in the first maturity stage in accordance with EMRAM. However, hospital managers are enforced and under the pressure of different beneficiaries including insurance companies to improve their HIS. Therefore, it could be concluded that these types of hospitals are still far away from desirable conditions and need to enhance their IT utilization stage significantly.展开更多
The purpose of this paper is to discuss the development of medical informatization in the era of big data.Through literature review and theoretical analysis,the development of medical informatization in the era of big...The purpose of this paper is to discuss the development of medical informatization in the era of big data.Through literature review and theoretical analysis,the development of medical informatization in the era of big data is deeply discussed.The results show that medical informatization has developed rapidly in the era of big data,and its role in clinical decision-making,scientific research,teaching,and management has become increasingly prominent.The development of medical informatization in the era of big data has important purposes and methods,which can produce important results and conclusions and provide strong support for the development of the medical field.展开更多
Rationale: Medical treatment on short-term primary care medical service trips (MSTs) is generally symptom-based and supplemented by point-of-care testing. This pilot study contributes to the effective planning for suc...Rationale: Medical treatment on short-term primary care medical service trips (MSTs) is generally symptom-based and supplemented by point-of-care testing. This pilot study contributes to the effective planning for such austere settings based on predicted symptomology. Objective: We aimed to prospectively document the epidemiology of patients seen during two low-resource clinics on a MST in Honduras and apply predefined case definitions adapted from guidelines used by international healthcare organizations (e.g. World Health Organization). Methods: An observational design was used to track the epidemiology during two clinics on an MST in Limon, Honduras in March 2015. The QuickChart mobile electronic medical record (EMR) application was piloted to document diagnoses according to predefined case definitions. Results: The most commonly diagnosed syndromes were upper respiratory complaints (20.19%), nonspecific abdominal complaints (20.19%), general pain (15.38%), hypertension (9.62%), pruritus (6.73%), and asthma/ COPD (4.81%). The case definitions accounted for 94% of all complaints and diagnoses on the brigade. Discussion: The distribution of common patient diagnoses on this MST was similar to that which had been reported elsewhere. The use of broader symptom-based case definitions for epidemiologic surveillance could also facilitate the syndromic management of patients seen on MSTs, and improve the consistency of treatment offered. Conclusion: Case definitions for common syndromes on primary care MSTs may be a feasible method of standardizing patient management. Preliminary use of the QuickChart EMR was acceptable for documentation of epidemiology in the field. Further study is necessary to investigate the reliability of syndromic diagnostic criteria between different clinicians and in a variety of MST settings.展开更多
Objective:To classify the subtypes of metabolic-associated fatty liver disease(MAFLD)and provide new insights into the heterogeneity of MAFLD.Methods:Electronic medical records(EMR)of MAFLD diagnosed in accordance wit...Objective:To classify the subtypes of metabolic-associated fatty liver disease(MAFLD)and provide new insights into the heterogeneity of MAFLD.Methods:Electronic medical records(EMR)of MAFLD diagnosed in accordance with the diagnostic criteria of Hubei Provincial Hospital of Traditional Chinese Medicine from 2016-2020 were included in the study.for physical annotation,and the data on each clinical phenotype was normalized according to corresponding aspirational standards.The MAFLD heterogeneous medical record network(HEMnet)was constructed using sex,age,disease diagnosis,symptoms,and Western medicine prescriptions as nodes and the co-occurrence times between phenotypes as edges.K-means clustering was used for disease classification.Relative risk(RR)was used to assess the specificity of each phenotype.Statistical methods were used to compare differences in laboratory indicators among subtypes.Results:A total of patients(12,626)with a mean age of 55.02(±14.21)years were included in the study.MAFLD can be divided into five subtypes:digestive diseases(C0),mental disorders and gynecological diseases(C1),chronic liver diseases and decompensated complications(C2),diabetes mellitus and its complications(C3),and immune joint system diseases(C4).Conclusions:Patients with MAFLD experience various symptoms and complications.The classification of MAFLD based on the HEMnet method is highly reliable.展开更多
Sodium homeostasis disorder is one of the most common abnormal symptoms of elderly patients in intensive care unit(ICU),which may lead to physiological disorders of many organs.The current prediction of serum sodium i...Sodium homeostasis disorder is one of the most common abnormal symptoms of elderly patients in intensive care unit(ICU),which may lead to physiological disorders of many organs.The current prediction of serum sodium in ICU is mainly based on the subjective judgment of doctors’experience.This study aims at this problem by studying the clinical retrospective electronic medical record data of ICU to establish a machine learning model to predict the short-term serum sodium value of ICU patients.The data set used in this study is the open-source intensive care medical information set Medical Information Mart for Intensive Care(MIMIC)-IV.The time point of serum sodium detection was selected from the ICU clinical records,and the ICU records of 25risk factors related to serum sodium were extracted from the patients within the first 12 h for statistical analysis.A prediction model of serum sodium value within 48 h was established using a feedforward neural network,and compared with previous methods.Our research results show that the neural network learning model can predict the development of serum sodium in patients using physiological indicators recorded in clinical electronic medical records within 12 h,and has better prediction effect than the serum sodium formula and other machine learning models.展开更多
With the implementation of the“Internet+”strategy,electronic medi-cal records are generally applied in the medicalfield.Deep mining of electronic medical record content data is an effective means to obtain medical kn...With the implementation of the“Internet+”strategy,electronic medi-cal records are generally applied in the medicalfield.Deep mining of electronic medical record content data is an effective means to obtain medical knowledge and analyse patients’states,but the existing methods for extracting entities from electronic medical records have problems of redundant information,overlapping entities,and low accuracy rates.Therefore,this paper proposes an entity extrac-tion method for electronic medical records based on the network framework of BERT-BiLSTM,which incorporates a multichannel self-attention mechanism and location relationship features.First,the text input sequence was encoded using the BERT-BiLSTM network framework,and the global semantic information of the sentence was mined more deeply using the multichannel self-attention mech-anism.Then,the position relation characteristic was used to extract the local semantic message of the text,and the position relation characteristic of the word and the position embedding matrix of the whole sentence were obtained.Next,the extracted global semantic information was stitched with the positional embedding matrix of the sentence to obtain the current entity classification matrix.Finally,the proposed method was validated on the dataset of Chinese medical text entity relationship extraction and the 2010i2b2/VA relationship corpus,and the exper-imental results indicate that the proposed method surpasses existing methods in terms of precision,recall,F1 value and training time.展开更多
In the digital era,electronic medical record(EMR)has been a major way for hospitals to store patients’medical data.The traditional centralized medical system and semi-trusted cloud storage are difficult to achieve dy...In the digital era,electronic medical record(EMR)has been a major way for hospitals to store patients’medical data.The traditional centralized medical system and semi-trusted cloud storage are difficult to achieve dynamic balance between privacy protection and data sharing.The storage capacity of blockchain is limited and single blockchain schemes have poor scalability and low throughput.To address these issues,we propose a secure and efficient medical data storage and sharing scheme based on double blockchain.In our scheme,we encrypt the original EMR and store it in the cloud.The storage blockchain stores the index of the complete EMR,and the shared blockchain stores the index of the shared part of the EMR.Users with different attributes can make requests to different blockchains to share different parts according to their own permissions.Through experiments,it was found that cloud storage combined with blockchain not only solved the problem of limited storage capacity of blockchain,but also greatly reduced the risk of leakage of the original EMR.Content Extraction Signature(CES)combined with the double blockchain technology realized the separation of the privacy part and the shared part of the original EMR.The symmetric encryption technology combined with Ciphertext-Policy Attribute-Based Encryption(CP–ABE)not only ensures the safe storage of data in the cloud,but also achieves the consistency and convenience of data update,avoiding redundant backup of data.Safety analysis and performance analysis verified the feasibility and effectiveness of our scheme.展开更多
Machine learning for data-driven diagnosis has been actively studied in medicine to provide better healthcare.Supporting analysis of a patient cohort similar to a patient under treatment is a key task for clinicians t...Machine learning for data-driven diagnosis has been actively studied in medicine to provide better healthcare.Supporting analysis of a patient cohort similar to a patient under treatment is a key task for clinicians to make decisions with high confidence.However,such analysis is not straightforward due to the characteristics of medical records:high dimensionality,irregularity in time,and sparsity.To address this challenge,we introduce a method for similarity calculation of medical records.Our method employs event and sequence embeddings.While we use an autoencoder for the event embedding,we apply its variant with the self-attention mechanism for the sequence embedding.Moreover,in order to better handle the irregularity of data,we enhance the self-attention mechanism with consideration of different time intervals.We have developed a visual analytics system to support comparative studies of patient records.To make a comparison of sequences with different lengths easier,our system incorporates a sequence alignment method.Through its interactive interface,the user can quickly identify patients of interest and conveniently review both the temporal and multivariate aspects of the patient records.We demonstrate the effectiveness of our design and system with case studies using a real-world dataset from the neonatal intensive care unit of UC Davis.展开更多
Personalized medicine is the development of “tailored” therapies that reflect traditional medical approaches with the incorporation of the patient’s unique genetic profile and the environmental basis of the disease...Personalized medicine is the development of “tailored” therapies that reflect traditional medical approaches with the incorporation of the patient’s unique genetic profile and the environmental basis of the disease. These individualized strategies encompass disease prevention and diagnosis, as well as treatment strategies. Today’s healthcare workforce is faced with the availability of massive amounts of patient- and disease-related data. When mined effectively, these data will help produce more efficient and effective diagnoses and treatment, leading to better prognoses for patients at both the individual and population level. Designing preventive and therapeutic interventions for those patients who will benefit most while minimizing side effects and controlling healthcare costs requires bringing diverse data sources together in an analytic paradigm. A resource to clinicians in the development and application of personalized medicine is largely facilitated, perhaps even driven, by the analysis of “big data”. For example, the availability of clinical data warehouses is a significant resource for clinicians in practicing personalized medicine. These “big data” repositories can be queried by clinicians, using specific questions, with data used to gain an understanding of challenges in patient care and treatment. Health informaticians are critical partners to data analytics including the use of technological infrastructures and predictive data mining strategies to access data from multiple sources, assisting clinicians’ interpretation of data and development of personalized, targeted therapy recommendations. In this paper, we look at the concept of personalized medicine, offering perspectives in four important, influencing topics: 1) the availability of “big data” and the role of biomedical informatics in personalized medicine, 2) the need for interdisciplinary teams in the development and evaluation of personalized therapeutic approaches, and 3) the impact of electronic medical record systems and clinical data warehouses on the field of personalized medicine. In closing, we present our fourth perspective, an overview to some of the ethical concerns related to personalized medicine and health equity.展开更多
The lack of labeled image data poses a serious challenge to the application of artificial intelligence(AI)in medical image diagnosis.Medical image notes contain valuable patient information that could be used to label...The lack of labeled image data poses a serious challenge to the application of artificial intelligence(AI)in medical image diagnosis.Medical image notes contain valuable patient information that could be used to label images for machine learning tasks.However,most image note texts are unstructured with heterogeneity and short-paragraph characters,which fail traditional keyword-based techniques.We utilized a deep learning approach to recover missing labels for medical image notes automatically by using a combination of deep word embedding and deep neural network classifiers.Bidirectional encoder representations from transformers trained on medical image notes corpus(MinBERT)were proposed.We applied the proposed techniques to two typical classification tasks:Medical image type identification and clinical diagnosis identification.The two methods significantly outperformed baseline methods and presented high accuracies of 99.56%and 99.72%in image type identification and of 94.56%and 92.45%in clinical diagnosis identification.Visualization analysis further indicated that word embedding could efficiently capture semantic similarities and regularities across diverse expressions.Results indicated that our proposed framework could accurately recover the missing label information of medical images through the automatic extraction of electronic medical record information.Hence,it could serve as a powerful tool for exploring useful training data in various medical AI applications.展开更多
The introduction of the electronic medical record(EHR)sharing system has made a great contribution to the management and sharing of healthcare data.Considering referral treatment for patients,the original signature ne...The introduction of the electronic medical record(EHR)sharing system has made a great contribution to the management and sharing of healthcare data.Considering referral treatment for patients,the original signature needs to be converted into a re-signature that can be verified by the new organization.Proxy re-signature(PRS)can be applied to this scenario so that authenticity and nonrepudiation can still be insured for data.Unfortunately,the existing PRS schemes cannot realize forward and backward security.Therefore,this paper proposes the first PRS scheme that can provide key-insulated property,which can guarantee both the forward and backward security of the key.Although the leakage of the private key occurs at a certain moment,the forward and backward key will not be attacked.Thus,the purpose of key insulation is implemented.What’s more,it can update different corresponding private keys in infinite time periods without changing the identity information of the user as the public key.Besides,the unforgeability of our scheme is proved based on the extended Computational Diffie-Hellman assumption in the random oracle model.Finally,the experimental simulation demonstrates that our scheme is feasible and in possession of promising properties.展开更多
Electronic medical record (EMR) containing rich biomedical information has a great potential in disease diagnosis and biomedical research. However, the EMR information is usually in the form of unstructured text, whic...Electronic medical record (EMR) containing rich biomedical information has a great potential in disease diagnosis and biomedical research. However, the EMR information is usually in the form of unstructured text, which increases the use cost and hinders its applications. In this work, an effective named entity recognition (NER) method is presented for information extraction on Chinese EMR, which is achieved by word embedding bootstrapped deep active learning to promote the acquisition of medical information from Chinese EMR and to release its value. In this work, deep active learning of bi-directional long short-term memory followed by conditional random field (Bi-LSTM+CRF) is used to capture the characteristics of different information from labeled corpus, and the word embedding models of contiguous bag of words and skip-gram are combined in the above model to respectively capture the text feature of Chinese EMR from unlabeled corpus. To evaluate the performance of above method, the tasks of NER on Chinese EMR with “medical history” content were used. Experimental results show that the word embedding bootstrapped deep active learning method using unlabeled medical corpus can achieve a better performance compared with other models.展开更多
Acute Kidney Injury (AKI) is one of the most common acute and critical illnesses in general wards and intensive care units. Its high morbidity and high fatality rate have become a major global public health problem. T...Acute Kidney Injury (AKI) is one of the most common acute and critical illnesses in general wards and intensive care units. Its high morbidity and high fatality rate have become a major global public health problem. There are often serious lags in clinical diagnosis of AKI. Early diagnosis and timely intervention and effective care become critical. The use of electronic medical record data to build an AKI risk prediction model has been proven to help prevent the occurrence of AKI. However, in actual clinical applications, the distribution of historical data and new data will continue to vary over time, resulting in a significant decrease in the performance of the model. How to solve the problem of model performance degradation over time will be a core challenge for the long-term use of predictive models in clinical applications. Aiming at the above problems, this paper studies the classic Transfer-Stacking model migration algorithm. Aiming at the lack of this algorithm, such as the loss of a large amount of feature information of the target domain and poor fit when integrating the model of the target domain, the Accumulate-Transfer-Stacking algorithm is proposed to improve it. Improvements include: 1) Optimize the input vector and model integration algorithm of Transfer-Stacking’s target domain model. 2) Optimize Transfer-Stacking from a single-source domain model to a multi-source domain model. The experimental results show that for the improved algorithm proposed in this paper when the data is sufficient and insufficient, the average AUC value of the model on the data of subsequent years is 0.89 and 0.87, and the average F1 Score value is 0.45 and 0.36. Moreover, this method is significantly better than the unimproved Transfer-Stacking algorithm and baseline method, and can effectively overcome the problem of data distribution heterogeneity caused by time factors.展开更多
Clinical data have strong features of complexity and multi-disciplinarity. Clinical data are generated both from the documentation of physicians' interactions with the patient and by diagnostic systems. During the ca...Clinical data have strong features of complexity and multi-disciplinarity. Clinical data are generated both from the documentation of physicians' interactions with the patient and by diagnostic systems. During the care process, a number of different actors and roles (physicians, specialists, nurses, etc.) have the need to access patient data and document clinical activities in different moments and settings. Thus, data sharing and flexible aggregation based on different users' needs have become more and more important for supporting continuity of care at home, at hospitals, at outpatient clinics. In this paper, the authors identify and describe needs and challenges for patient data management at provider level and regional- (or inter-organizational-) level, because nowadays sharing patient data is needed to improve continuity and quality of care. For each level, the authors describe state-of-the-art Information and Communication Technology solutions to collect, manage, aggregate and share patient data. For each level some examples of best practices and solution scenarios being implemented in the Italian Healthcare setting are described as well.展开更多
Objectives: To report our experience in using an electronic database for management of breast diseases in a developing country. Materials and methods: E-Breast is a database developed on FileMaker Pro Advanced to serv...Objectives: To report our experience in using an electronic database for management of breast diseases in a developing country. Materials and methods: E-Breast is a database developed on FileMaker Pro Advanced to serve as patient file and breast diseases registry. The development of the platform, its usage and advantages on a manual filing system are described. Results: For 6 years, we use this database, which accounts more than 2000 patients and includes data from more than 10 years. An overview of the activity is easily generated by E-Breast. The generated reports are used to the routine care of patients, statistics and clinical research. Data entered are immediately useful in addition to simultaneously implement the database for clinical research. Many custom features are integrated. For research purposes, the system has the ability to perform detailed analyses on subsets defined by the user as breast cancer, breast benign diseases, etc. Conclusion: E-Breast has proven to be a useful way of documentation that has become an integral and essential part of the daily activity and also a valuable research tool.展开更多
基金This research was supported by the National Natural Science Foundation of China under Grant(No.42050102)the Postgraduate Education Reform Project of Jiangsu Province under Grant(No.SJCX22_0343)Also,this research was supported by Dou Wanchun Expert Workstation of Yunnan Province(No.202205AF150013).
文摘With the rapid development of information technology,the electronifi-cation of medical records has gradually become a trend.In China,the population base is huge and the supporting medical institutions are numerous,so this reality drives the conversion of paper medical records to electronic medical records.Electronic medical records are the basis for establishing a smart hospital and an important guarantee for achieving medical intelligence,and the massive amount of electronic medical record data is also an important data set for conducting research in the medical field.However,electronic medical records contain a large amount of private patient information,which must be desensitized before they are used as open resources.Therefore,to solve the above problems,data masking for Chinese electronic medical records with named entity recognition is proposed in this paper.Firstly,the text is vectorized to satisfy the required format of the model input.Secondly,since the input sentences may have a long or short length and the relationship between sentences in context is not negligible.To this end,a neural network model for named entity recognition based on bidirectional long short-term memory(BiLSTM)with conditional random fields(CRF)is constructed.Finally,the data masking operation is performed based on the named entity recog-nition results,mainly using regular expression filtering encryption and principal component analysis(PCA)word vector compression and replacement.In addi-tion,comparison experiments with the hidden markov model(HMM)model,LSTM-CRF model,and BiLSTM model are conducted in this paper.The experi-mental results show that the method used in this paper achieves 92.72%Accuracy,92.30%Recall,and 92.51%F1_score,which has higher accuracy compared with other models.
文摘The China Conference on Knowledge Graph and Semantic Computing(CCKS)2020 Evaluation Task 3 presented clinical named entity recognition and event extraction for the Chinese electronic medical records.Two annotated data sets and some other additional resources for these two subtasks were provided for participators.This evaluation competition attracted 354 teams and 46 of them successfully submitted the valid results.The pre-trained language models are widely applied in this evaluation task.Data argumentation and external resources are also helpful.
基金Supported by a grant to Korean Medical Science Research Center for Healthy Aging from the National Research Foundation of Korean government(No.2014R1A5A2009936)
文摘Objective: To obtain fundamental information for the standardization of herbal medicine in Korea. Methods: We analyzed the herbal medicine prescription data of patients at the Pusan National University Korean Medicine Hospital from March 2010 to February 2013. We used the Dongui-Bogam (Dong Yi Bao Jian) to classify prescribed herbal medicines. Results: The study revealed that the most frequently prescribed herbal medicine was ‘Liuwei Dihuang Pill (LWDHP, 六味地黄丸)' which was used for invigorating ‘Shen (Kidndy)-yin'. ‘LWDHP' was most frequently prescribed to male patients aged 50-59, 60-69, 70-79 and 80-89 years, and ‘Xionggui Tiaoxue Decoction (XGTXD, 芎归调血饮)' was most frequently prescribed to female patients aged 30-39 and 40-49 years. According to the International Classification of Diseases (ICD) codes,‘Diseases of the musculoskeletal system and connective tissue' showed the highest prevalence. ‘LWDHP' and 'XGTXD' was the most frequently prescribed in categories 5 and 3, respectively. Based on the percentage of prescriptions for each sex, ‘Ziyin Jianghuo Decoction (滋阴降火汤)' was prescribed to mainly male patients, and ‘XGTXD' with ‘Guima Geban Decoction (桂麻各半汤)' were prescribed to mainly female patients. Conclusion: This study analysis successfully determined the frequency of a variety of herbal medicines, and many restorative herbal medicines were identified and frequently administered.
文摘The development of hospital information has been carried out for nearly 50 years, and originally started Le hospital information system (HIS)1 So far HIS isas the hospital information system (HIS)J So far HIS is the most widely and deeply used management system for hospitals in China.2 "General function standard for hospital information system" issued by China's Ministry of Health in 2002 defined that "The hospital information system refers to using of computer hardware and software technology, network communications technology, and other modem technology to comprehensively manage personnel, logistics, and finance in various departments in hospital. Gather, store, treat, extract, transport, aggregate,and process data in various stages of the medical activities, so that provide comprehensive and automatic information management and service to the hospital."
文摘Background: The usage of modem technology in healthcare record system is now a must throughout the world. However, many doctors and nurses has been reporting facing numerous challenges and obstacles in the implementation. The aim of the present study is to determine the prevalence of depression, anxiety and stress among doctors and nurses who utilize EMR (electronic medical record) and its associated factor. Methods: A comparative cross-sectional study was conducted ~om January till April 2012 among doctors and nurses in two public tertiary hospitals in Johor in which one of them uses EMR and the other one still using the MMR (manual medical record) system. Data was collected using self-administered validated Malay version of DASS-21 (Depression, Anxiety, and Stress Scales-21) items questionnaire. It comprises of socio-demographic and occupational characteristics. Findings: There were 130 respondents with a response rate of 91% for EMR and 123 respondents with a response rate of 86% for MMR. The mean (SD) age of respondents in EMR and MMR groups were 34.7 (9.42) and 29.7 (6.15) respectively. The mean (SD) duration of respondents using EMR was 46.1 (35.83) months. The prevalence of depression, anxiety and stress among respondents using EMR were 6.9%, 25.4% and 12.3%. There were no significant difference between the study groups related to the depression, anxiety and stress scores. In multivariable analysis, the significant factors associated with depression among respondents using EMR was age (OR 1.10, 95% CI 1.02, 1.19). The significant factors associated with stress among respondents using EMR was marital status (OR 3.33, 95% CI 1.10, 10.09) and borderline significant was computer skill course (OR 2.94, 95% CI 0.98, 8.78). Conclusion: The prevalence of depression, anxiety and stress of those who uses EMR were within acceptable range. Age, marital status and computer skill are the identified factor associated with the depression and stress level which need to be considered in its implementation.
文摘Introduction: Today, information technology is considered as an important national development principle in each country which is applied in different fields. Health care as a whole and the hospitals could be regarded as a field and organizations with most remarkable IT applications respectively. Although different benchmarks and frameworks have been developed to assess different aspects of Hospital Information Systems (HISs) by various researchers, there is not any suitable reference model yet to benchmark HIS in the world. Electronic Medical Record Adoption Model (EMRAM) has been currently presented and is globally well-known to benchmark the rate of HIS utilization in the hospitals. Notwithstanding, this model has not been introduced in Iran so far. Methods: This research was carried out based on an applied descriptive method in three private hospitals of Isfahan—one of the most important provinces of Iran—in the year 2015. The purpose of this study was to investigate IT utilization stage in three selected private hospitals. Conclusion: The findings revealed that HIS is not at the center of concern in studied hospitals and is in the first maturity stage in accordance with EMRAM. However, hospital managers are enforced and under the pressure of different beneficiaries including insurance companies to improve their HIS. Therefore, it could be concluded that these types of hospitals are still far away from desirable conditions and need to enhance their IT utilization stage significantly.
文摘The purpose of this paper is to discuss the development of medical informatization in the era of big data.Through literature review and theoretical analysis,the development of medical informatization in the era of big data is deeply discussed.The results show that medical informatization has developed rapidly in the era of big data,and its role in clinical decision-making,scientific research,teaching,and management has become increasingly prominent.The development of medical informatization in the era of big data has important purposes and methods,which can produce important results and conclusions and provide strong support for the development of the medical field.
文摘Rationale: Medical treatment on short-term primary care medical service trips (MSTs) is generally symptom-based and supplemented by point-of-care testing. This pilot study contributes to the effective planning for such austere settings based on predicted symptomology. Objective: We aimed to prospectively document the epidemiology of patients seen during two low-resource clinics on a MST in Honduras and apply predefined case definitions adapted from guidelines used by international healthcare organizations (e.g. World Health Organization). Methods: An observational design was used to track the epidemiology during two clinics on an MST in Limon, Honduras in March 2015. The QuickChart mobile electronic medical record (EMR) application was piloted to document diagnoses according to predefined case definitions. Results: The most commonly diagnosed syndromes were upper respiratory complaints (20.19%), nonspecific abdominal complaints (20.19%), general pain (15.38%), hypertension (9.62%), pruritus (6.73%), and asthma/ COPD (4.81%). The case definitions accounted for 94% of all complaints and diagnoses on the brigade. Discussion: The distribution of common patient diagnoses on this MST was similar to that which had been reported elsewhere. The use of broader symptom-based case definitions for epidemiologic surveillance could also facilitate the syndromic management of patients seen on MSTs, and improve the consistency of treatment offered. Conclusion: Case definitions for common syndromes on primary care MSTs may be a feasible method of standardizing patient management. Preliminary use of the QuickChart EMR was acceptable for documentation of epidemiology in the field. Further study is necessary to investigate the reliability of syndromic diagnostic criteria between different clinicians and in a variety of MST settings.
基金supported by grants from the Key project Natural Science Foundation of Hubei Province(No.2020CFA023)Project of the State Administration of Traditional Chinese Medicine(No Z155080000004):Key Laboratory of Liver and Kidney Treatment of Chronic Liver Diseases.
文摘Objective:To classify the subtypes of metabolic-associated fatty liver disease(MAFLD)and provide new insights into the heterogeneity of MAFLD.Methods:Electronic medical records(EMR)of MAFLD diagnosed in accordance with the diagnostic criteria of Hubei Provincial Hospital of Traditional Chinese Medicine from 2016-2020 were included in the study.for physical annotation,and the data on each clinical phenotype was normalized according to corresponding aspirational standards.The MAFLD heterogeneous medical record network(HEMnet)was constructed using sex,age,disease diagnosis,symptoms,and Western medicine prescriptions as nodes and the co-occurrence times between phenotypes as edges.K-means clustering was used for disease classification.Relative risk(RR)was used to assess the specificity of each phenotype.Statistical methods were used to compare differences in laboratory indicators among subtypes.Results:A total of patients(12,626)with a mean age of 55.02(±14.21)years were included in the study.MAFLD can be divided into five subtypes:digestive diseases(C0),mental disorders and gynecological diseases(C1),chronic liver diseases and decompensated complications(C2),diabetes mellitus and its complications(C3),and immune joint system diseases(C4).Conclusions:Patients with MAFLD experience various symptoms and complications.The classification of MAFLD based on the HEMnet method is highly reliable.
基金supported by the National Natural Science Foundation of China(No.12345678)。
文摘Sodium homeostasis disorder is one of the most common abnormal symptoms of elderly patients in intensive care unit(ICU),which may lead to physiological disorders of many organs.The current prediction of serum sodium in ICU is mainly based on the subjective judgment of doctors’experience.This study aims at this problem by studying the clinical retrospective electronic medical record data of ICU to establish a machine learning model to predict the short-term serum sodium value of ICU patients.The data set used in this study is the open-source intensive care medical information set Medical Information Mart for Intensive Care(MIMIC)-IV.The time point of serum sodium detection was selected from the ICU clinical records,and the ICU records of 25risk factors related to serum sodium were extracted from the patients within the first 12 h for statistical analysis.A prediction model of serum sodium value within 48 h was established using a feedforward neural network,and compared with previous methods.Our research results show that the neural network learning model can predict the development of serum sodium in patients using physiological indicators recorded in clinical electronic medical records within 12 h,and has better prediction effect than the serum sodium formula and other machine learning models.
基金This work is partly supported by the General Project of Scientific Research Funds of Liaoning Provincial Department of Education under Grant Nos.LJKZ0085,and LJKMZ20220447the Project of PublicWelfareResearch Fund for Science(Soft Science Research Program)of Liaoning Province under Grant No.2023JH4/10700056the Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education,Jilin University under Grant No.93K172018K01.
文摘With the implementation of the“Internet+”strategy,electronic medi-cal records are generally applied in the medicalfield.Deep mining of electronic medical record content data is an effective means to obtain medical knowledge and analyse patients’states,but the existing methods for extracting entities from electronic medical records have problems of redundant information,overlapping entities,and low accuracy rates.Therefore,this paper proposes an entity extrac-tion method for electronic medical records based on the network framework of BERT-BiLSTM,which incorporates a multichannel self-attention mechanism and location relationship features.First,the text input sequence was encoded using the BERT-BiLSTM network framework,and the global semantic information of the sentence was mined more deeply using the multichannel self-attention mech-anism.Then,the position relation characteristic was used to extract the local semantic message of the text,and the position relation characteristic of the word and the position embedding matrix of the whole sentence were obtained.Next,the extracted global semantic information was stitched with the positional embedding matrix of the sentence to obtain the current entity classification matrix.Finally,the proposed method was validated on the dataset of Chinese medical text entity relationship extraction and the 2010i2b2/VA relationship corpus,and the exper-imental results indicate that the proposed method surpasses existing methods in terms of precision,recall,F1 value and training time.
基金the Natural Science Foundation of Heilongjiang Province of China under Grant No.LC2016024Natural Science Foundation of the Jiangsu Higher Education Institutions Grant No.17KJB520044Six Talent Peaks Project in Jiangsu Province No.XYDXX–108.
文摘In the digital era,electronic medical record(EMR)has been a major way for hospitals to store patients’medical data.The traditional centralized medical system and semi-trusted cloud storage are difficult to achieve dynamic balance between privacy protection and data sharing.The storage capacity of blockchain is limited and single blockchain schemes have poor scalability and low throughput.To address these issues,we propose a secure and efficient medical data storage and sharing scheme based on double blockchain.In our scheme,we encrypt the original EMR and store it in the cloud.The storage blockchain stores the index of the complete EMR,and the shared blockchain stores the index of the shared part of the EMR.Users with different attributes can make requests to different blockchains to share different parts according to their own permissions.Through experiments,it was found that cloud storage combined with blockchain not only solved the problem of limited storage capacity of blockchain,but also greatly reduced the risk of leakage of the original EMR.Content Extraction Signature(CES)combined with the double blockchain technology realized the separation of the privacy part and the shared part of the original EMR.The symmetric encryption technology combined with Ciphertext-Policy Attribute-Based Encryption(CP–ABE)not only ensures the safe storage of data in the cloud,but also achieves the consistency and convenience of data update,avoiding redundant backup of data.Safety analysis and performance analysis verified the feasibility and effectiveness of our scheme.
基金the U.S.National Science Foundation through grant IIS-1741536 and a 2019 Seed Fund Award from CITRIS and the Banatao Institute at the University of California,United States.
文摘Machine learning for data-driven diagnosis has been actively studied in medicine to provide better healthcare.Supporting analysis of a patient cohort similar to a patient under treatment is a key task for clinicians to make decisions with high confidence.However,such analysis is not straightforward due to the characteristics of medical records:high dimensionality,irregularity in time,and sparsity.To address this challenge,we introduce a method for similarity calculation of medical records.Our method employs event and sequence embeddings.While we use an autoencoder for the event embedding,we apply its variant with the self-attention mechanism for the sequence embedding.Moreover,in order to better handle the irregularity of data,we enhance the self-attention mechanism with consideration of different time intervals.We have developed a visual analytics system to support comparative studies of patient records.To make a comparison of sequences with different lengths easier,our system incorporates a sequence alignment method.Through its interactive interface,the user can quickly identify patients of interest and conveniently review both the temporal and multivariate aspects of the patient records.We demonstrate the effectiveness of our design and system with case studies using a real-world dataset from the neonatal intensive care unit of UC Davis.
文摘Personalized medicine is the development of “tailored” therapies that reflect traditional medical approaches with the incorporation of the patient’s unique genetic profile and the environmental basis of the disease. These individualized strategies encompass disease prevention and diagnosis, as well as treatment strategies. Today’s healthcare workforce is faced with the availability of massive amounts of patient- and disease-related data. When mined effectively, these data will help produce more efficient and effective diagnoses and treatment, leading to better prognoses for patients at both the individual and population level. Designing preventive and therapeutic interventions for those patients who will benefit most while minimizing side effects and controlling healthcare costs requires bringing diverse data sources together in an analytic paradigm. A resource to clinicians in the development and application of personalized medicine is largely facilitated, perhaps even driven, by the analysis of “big data”. For example, the availability of clinical data warehouses is a significant resource for clinicians in practicing personalized medicine. These “big data” repositories can be queried by clinicians, using specific questions, with data used to gain an understanding of challenges in patient care and treatment. Health informaticians are critical partners to data analytics including the use of technological infrastructures and predictive data mining strategies to access data from multiple sources, assisting clinicians’ interpretation of data and development of personalized, targeted therapy recommendations. In this paper, we look at the concept of personalized medicine, offering perspectives in four important, influencing topics: 1) the availability of “big data” and the role of biomedical informatics in personalized medicine, 2) the need for interdisciplinary teams in the development and evaluation of personalized therapeutic approaches, and 3) the impact of electronic medical record systems and clinical data warehouses on the field of personalized medicine. In closing, we present our fourth perspective, an overview to some of the ethical concerns related to personalized medicine and health equity.
基金This work was supported in part by the Shenzhen Science and Technology Program(No.JCYJ20180703145002040)the Strategic Priority Research Program of Chinese Academy of Sciences(No.XDB38050100)the Shenzhen Science and Technology Program(No.JCYJ20180507182818013).
文摘The lack of labeled image data poses a serious challenge to the application of artificial intelligence(AI)in medical image diagnosis.Medical image notes contain valuable patient information that could be used to label images for machine learning tasks.However,most image note texts are unstructured with heterogeneity and short-paragraph characters,which fail traditional keyword-based techniques.We utilized a deep learning approach to recover missing labels for medical image notes automatically by using a combination of deep word embedding and deep neural network classifiers.Bidirectional encoder representations from transformers trained on medical image notes corpus(MinBERT)were proposed.We applied the proposed techniques to two typical classification tasks:Medical image type identification and clinical diagnosis identification.The two methods significantly outperformed baseline methods and presented high accuracies of 99.56%and 99.72%in image type identification and of 94.56%and 92.45%in clinical diagnosis identification.Visualization analysis further indicated that word embedding could efficiently capture semantic similarities and regularities across diverse expressions.Results indicated that our proposed framework could accurately recover the missing label information of medical images through the automatic extraction of electronic medical record information.Hence,it could serve as a powerful tool for exploring useful training data in various medical AI applications.
基金supported by the Network and Data Security Key Laboratory of Sichuan Province under the Grant No.NDS2021-2in part by Science and Technology Project of Educational Commission of Jiangxi Province under the Grant No.GJJ190464in part by National Natural Science Foundation of China under the Grant No.71661012.
文摘The introduction of the electronic medical record(EHR)sharing system has made a great contribution to the management and sharing of healthcare data.Considering referral treatment for patients,the original signature needs to be converted into a re-signature that can be verified by the new organization.Proxy re-signature(PRS)can be applied to this scenario so that authenticity and nonrepudiation can still be insured for data.Unfortunately,the existing PRS schemes cannot realize forward and backward security.Therefore,this paper proposes the first PRS scheme that can provide key-insulated property,which can guarantee both the forward and backward security of the key.Although the leakage of the private key occurs at a certain moment,the forward and backward key will not be attacked.Thus,the purpose of key insulation is implemented.What’s more,it can update different corresponding private keys in infinite time periods without changing the identity information of the user as the public key.Besides,the unforgeability of our scheme is proved based on the extended Computational Diffie-Hellman assumption in the random oracle model.Finally,the experimental simulation demonstrates that our scheme is feasible and in possession of promising properties.
基金the Artificial Intelligence Innovation and Development Project of Shanghai Municipal Commission of Economy and Information (No. 2019-RGZN-01081)。
文摘Electronic medical record (EMR) containing rich biomedical information has a great potential in disease diagnosis and biomedical research. However, the EMR information is usually in the form of unstructured text, which increases the use cost and hinders its applications. In this work, an effective named entity recognition (NER) method is presented for information extraction on Chinese EMR, which is achieved by word embedding bootstrapped deep active learning to promote the acquisition of medical information from Chinese EMR and to release its value. In this work, deep active learning of bi-directional long short-term memory followed by conditional random field (Bi-LSTM+CRF) is used to capture the characteristics of different information from labeled corpus, and the word embedding models of contiguous bag of words and skip-gram are combined in the above model to respectively capture the text feature of Chinese EMR from unlabeled corpus. To evaluate the performance of above method, the tasks of NER on Chinese EMR with “medical history” content were used. Experimental results show that the word embedding bootstrapped deep active learning method using unlabeled medical corpus can achieve a better performance compared with other models.
文摘Acute Kidney Injury (AKI) is one of the most common acute and critical illnesses in general wards and intensive care units. Its high morbidity and high fatality rate have become a major global public health problem. There are often serious lags in clinical diagnosis of AKI. Early diagnosis and timely intervention and effective care become critical. The use of electronic medical record data to build an AKI risk prediction model has been proven to help prevent the occurrence of AKI. However, in actual clinical applications, the distribution of historical data and new data will continue to vary over time, resulting in a significant decrease in the performance of the model. How to solve the problem of model performance degradation over time will be a core challenge for the long-term use of predictive models in clinical applications. Aiming at the above problems, this paper studies the classic Transfer-Stacking model migration algorithm. Aiming at the lack of this algorithm, such as the loss of a large amount of feature information of the target domain and poor fit when integrating the model of the target domain, the Accumulate-Transfer-Stacking algorithm is proposed to improve it. Improvements include: 1) Optimize the input vector and model integration algorithm of Transfer-Stacking’s target domain model. 2) Optimize Transfer-Stacking from a single-source domain model to a multi-source domain model. The experimental results show that for the improved algorithm proposed in this paper when the data is sufficient and insufficient, the average AUC value of the model on the data of subsequent years is 0.89 and 0.87, and the average F1 Score value is 0.45 and 0.36. Moreover, this method is significantly better than the unimproved Transfer-Stacking algorithm and baseline method, and can effectively overcome the problem of data distribution heterogeneity caused by time factors.
文摘Clinical data have strong features of complexity and multi-disciplinarity. Clinical data are generated both from the documentation of physicians' interactions with the patient and by diagnostic systems. During the care process, a number of different actors and roles (physicians, specialists, nurses, etc.) have the need to access patient data and document clinical activities in different moments and settings. Thus, data sharing and flexible aggregation based on different users' needs have become more and more important for supporting continuity of care at home, at hospitals, at outpatient clinics. In this paper, the authors identify and describe needs and challenges for patient data management at provider level and regional- (or inter-organizational-) level, because nowadays sharing patient data is needed to improve continuity and quality of care. For each level, the authors describe state-of-the-art Information and Communication Technology solutions to collect, manage, aggregate and share patient data. For each level some examples of best practices and solution scenarios being implemented in the Italian Healthcare setting are described as well.
文摘Objectives: To report our experience in using an electronic database for management of breast diseases in a developing country. Materials and methods: E-Breast is a database developed on FileMaker Pro Advanced to serve as patient file and breast diseases registry. The development of the platform, its usage and advantages on a manual filing system are described. Results: For 6 years, we use this database, which accounts more than 2000 patients and includes data from more than 10 years. An overview of the activity is easily generated by E-Breast. The generated reports are used to the routine care of patients, statistics and clinical research. Data entered are immediately useful in addition to simultaneously implement the database for clinical research. Many custom features are integrated. For research purposes, the system has the ability to perform detailed analyses on subsets defined by the user as breast cancer, breast benign diseases, etc. Conclusion: E-Breast has proven to be a useful way of documentation that has become an integral and essential part of the daily activity and also a valuable research tool.