Structural choice is a significant decision having an important influence on structural function, social economics, structural reliability and construction cost. A Case Based Reasoning system with its retrieval part c...Structural choice is a significant decision having an important influence on structural function, social economics, structural reliability and construction cost. A Case Based Reasoning system with its retrieval part constructed with a KDD subsystem, is put forward to make a decision for a large scale engineering project. A typical CBR system consists of four parts: case representation, case retriever, evaluation, and adaptation. A case library is a set of parameterized excellent and successful structures. For a structural choice, the key point is that the system must be able to detect the pattern classes hidden in the case library and classify the input parameters into classes properly. That is done by using the KDD Data Mining algorithm based on Self Organizing Feature Maps (SOFM), which makes the whole system more adaptive, self organizing, self learning and open.展开更多
Thallium has been used geochemical exploration of gold deposits. However, as an indicator element in searching for hydrothermal the T1 minerals and mineralization are rare in nature. Lorandite T1AsS2, a relatively un...Thallium has been used geochemical exploration of gold deposits. However, as an indicator element in searching for hydrothermal the T1 minerals and mineralization are rare in nature. Lorandite T1AsS2, a relatively uncommon mineral, has been dominantly discovered in some Carlin gold deposits, and minor Sb- Hg, U and Pb-Zn-Ag deposits.展开更多
Objective To explore the candidate genes that play significant roles in the interconnection between abdominal aortic aneurysm(AAA)and type 2 diabetes mellitus(DM).Methods We used the Biomedical Discovery Support Syste...Objective To explore the candidate genes that play significant roles in the interconnection between abdominal aortic aneurysm(AAA)and type 2 diabetes mellitus(DM).Methods We used the Biomedical Discovery Support System(BITOLA)to screen out the candidate intermediate molecular(CIM)"Gene or Gene Product”that are related to AAA and DM.The dataset of GSE13760,GSE7084,GSE57691,GSE47472 were used to analyze the differentially expressed genes(DEGs)of AAA and DM compared to the healthy status.We used the online tool ofVenny 2.1 assisted by manual checking to identify the overlapped DEGs with the CIMs.The Human eFP Browser was applied to examine the tissue specific expression levels of the detected genes in order to recognize strong expressed genes in both human artery and pancreatic tissue.Results There were 86 CIMs suggested by the closed BITOLA system.Among all the DEGs of AAA and DM,8 genes in GSE7084(ISG20,ITGAX,DSTN,CCL5,CCR5,AGTR1,CD19,CD44)and 2 genes in GSE 13760(PSMD12,FAS)were found to be overlapped with the 86 CIMs.By manual checking and comparing with tissuespecific gene data through Human eFP Browser,the gene PSMD12(proteasome 26S subunit,non-ATPase 12)was recognized to be strongly expressed in both the aorta and pancreatic tissue.Conclusion We proposed a hypothesis through text mining that PSMD12 might be involved or potentially involved in the interconnection between AAA and DM,which may provide a new clue for studies on novel therapeutic strategies for the two diseases.展开更多
Tauopathies,diseases characterized by neuropathological aggregates of tau including Alzheimer's disease and subtypes of fro ntotemporal dementia,make up the vast majority of dementia cases.Although there have been...Tauopathies,diseases characterized by neuropathological aggregates of tau including Alzheimer's disease and subtypes of fro ntotemporal dementia,make up the vast majority of dementia cases.Although there have been recent developments in tauopathy biomarkers and disease-modifying treatments,ongoing progress is required to ensure these are effective,economical,and accessible for the globally ageing population.As such,continued identification of new potential drug targets and biomarkers is critical."Big data"studies,such as proteomics,can generate information on thousands of possible new targets for dementia diagnostics and therapeutics,but currently remain underutilized due to the lack of a clear process by which targets are selected for future drug development.In this review,we discuss current tauopathy biomarkers and therapeutics,and highlight areas in need of improvement,particularly when addressing the needs of frail,comorbid and cognitively impaired populations.We highlight biomarkers which have been developed from proteomic data,and outline possible future directions in this field.We propose new criteria by which potential targets in proteomics studies can be objectively ranked as favorable for drug development,and demonstrate its application to our group's recent tau interactome dataset as an example.展开更多
The need for the analysis of modern businesses is rapidly increasing as the supporting enterprise systems generate more and more data.This data can be extremely valuable for executing organizations because the data al...The need for the analysis of modern businesses is rapidly increasing as the supporting enterprise systems generate more and more data.This data can be extremely valuable for executing organizations because the data allows constant monitoring,analyzing,and improving the underlying processes,which leads to the reduction of cost and the improvement of the quality.Process mining is a useful technique for analyzing enterprise systems by using an event log that contains behaviours.This research focuses on the process discovery and refinement using real-life event log data collected from a large multinational organization that deals with coatings and paints.By investigating and analyzing their order handling pro-cesses,this study aims at learning a model that gives insight inspection of the processes and performance analysis.Furthermore,the animation is also performed for the better inspection,diagnostics,and compliance-related questions to specify the system.The configuration of the system and the conformance checking for further enhancement is also addressed in this research.To achieve the objectives,this research uses process mining techniques,i.e.process discovery in the form of formal Petri nets models with the help of process maps,and process refinement through conformance checking and enhancement.Initially,the identified executed process is reconstructed by using the process discovery techniques.Following the reconstruction,we perform a deep analysis for the underlying process to ensure the process improvement and redesigning.Finally,some recommendations are made to improve the enterprise management system processes.展开更多
A new structure of ESKD (expert system based on knowledge discovery system KD (D&K)) is first presented on the basis of KD (D&K)-a synthesized knowledge discovery system based on double-base (database and know...A new structure of ESKD (expert system based on knowledge discovery system KD (D&K)) is first presented on the basis of KD (D&K)-a synthesized knowledge discovery system based on double-base (database and knowledge base) cooperating mechanism. With all new features, ESKD may form a new research direction and provide a great probability for solving the wealth of knowledge in the knowledge base. The general structural frame of ESKD and some sub-systems among ESKD have been described, and the dynamic knowledge base based on double-base cooperating mechanism has been emphased on. According to the result of demonstrative experi- ment, the structure of ESKD is effective and feasible.展开更多
To overcome the problem of existing neighboring access point (AP) discovery methods in WLAN, for example they (PnP) of mul proposed. Us can not provide the accurate neighboring APs information needed for the Plug-...To overcome the problem of existing neighboring access point (AP) discovery methods in WLAN, for example they (PnP) of mul proposed. Us can not provide the accurate neighboring APs information needed for the Plug-and-Play ti-mode APs, three kinds of neighboring AP discovery and information exchange methods are ing these three neighboring AP discovery methods, passive discovery method, active discovery method and station assistant discovery method, the multi-mode AP can discover all neighboring APs and obtain needed information. We further propose two whole process flows, which combine three discovery methods in different manner, to achieve different goals. One process flow is to discover the neighboring AP as fast as possible, called fast discovery process flow. The other is to discover the neighboring AP with minimal interference to neighboring and accuracy of the method is confirmed APs, called the minimal interference process flow. The validity by the simulation.展开更多
A method is presented for performing knowledge discovery on the dynamic data of a nonlinear system. In the proposed approach, a synchronized phasor measurement technique is used to acquire the dynamic data of the nonl...A method is presented for performing knowledge discovery on the dynamic data of a nonlinear system. In the proposed approach, a synchronized phasor measurement technique is used to acquire the dynamic data of the nonlinear system and a hyper-rectangular type neural network (HRTNN) is then applied to extract crisp and fuzzy rules with which to estimate the system stability. The effectiveness of the proposed methodology is verified using the dynamic data of a typical real-world nonlinear system, namely an AEP-14 bus, and the extracted rules are relating to the knowledge discovery of the stability levels for the nonlinear system. The discovered relationships among the dynamic data (i.e., the operating state), the extracted rules, and the system stability are confirmed by means of a two-stage confirmatory factor analysis.展开更多
Discovering floating wastes,especially bottles on water,is a crucial research problem in environmental hygiene.Nevertheless,real-world applications often face challenges such as interference from irrelevant objects an...Discovering floating wastes,especially bottles on water,is a crucial research problem in environmental hygiene.Nevertheless,real-world applications often face challenges such as interference from irrelevant objects and the high cost associated with data collection.Consequently,devising algorithms capable of accurately localizing specific objects within a scene in scenarios where annotated data is limited remains a formidable challenge.To solve this problem,this paper proposes an object discovery by request problem setting and a corresponding algorithmic framework.The proposed problem setting aims to identify specified objects in scenes,and the associated algorithmic framework comprises pseudo data generation and object discovery by request network.Pseudo-data generation generates images resembling natural scenes through various data augmentation rules,using a small number of object samples and scene images.The network structure of object discovery by request utilizes the pre-trained Vision Transformer(ViT)model as the backbone,employs object-centric methods to learn the latent representations of foreground objects,and applies patch-level reconstruction constraints to the model.During the validation phase,we use the generated pseudo datasets as training sets and evaluate the performance of our model on the original test sets.Experiments have proved that our method achieves state-of-the-art performance on Unmanned Aerial Vehicles-Bottle Detection(UAV-BD)dataset and self-constructed dataset Bottle,especially in multi-object scenarios.展开更多
The continued expansion of the world population,increasingly inconsistent climate and shrinking agricultural resources present major challenges to crop breeding.Fortunately,the increasing ability to discover and manip...The continued expansion of the world population,increasingly inconsistent climate and shrinking agricultural resources present major challenges to crop breeding.Fortunately,the increasing ability to discover and manipulate genes creates new opportunities to develop more productive and resilient cultivars.Many genes have been described in papers as being beneficial for yield increase.However,few of them have been translated into increased yield on farms.In contrast,commercial breeders are facing gene decidophobia,i.e.,puzzled about which gene to choose for breeding among the many identified,a huge chasm between gene discovery and cultivar innovation.The purpose of this paper is to draw attention to the shortfalls in current gene discovery research and to emphasise the need to align with cultivar innovation.The methodology dictates that genetic studies not only focus on gene discovery but also pay good attention to the genetic backgrounds,experimental validation in relevant environments,appropriate crop management,and data reusability.The close of the gaps should accelerate the application of molecular study in breeding and contribute to future global food security.展开更多
Marine natural products(MNPs)are valuable resources for drug development.To date,17 drugs from marine sources are in clinical use,and 33 pharmaceutical compounds are in clinical trials.Presently the success of drug de...Marine natural products(MNPs)are valuable resources for drug development.To date,17 drugs from marine sources are in clinical use,and 33 pharmaceutical compounds are in clinical trials.Presently the success of drug development from the marine resources is higher than the industry average.It is a feasible strategy to conduct the discovery of druglead compounds based on marine chemical ecology by fully exploiting the pharmacological potential of marine chemical defense matters.In the search for bioactive MNPs,our group has constructed a biological resources library including more than 1500 strains of fungi.Focusing on the strategy of Blue Drug Library,we have discovered a series of novel MNPs with abundant biological functions.Highly efficient and scalable total synthesis of(+)-aniduquinolone A(44)and pesimquinolone I(48)have been completed,which will facilitate access to sufficient quantities of candidates for in vivo pharmacological and toxicological studies.As a nucleoprotein(NP)inhibitor,QLA(75)possesses significant anti-influenza A virus(IAV)activities both in vitro and in vivo.CHNQD-00803(76)is a potent and selective AMP-activated kinase(AMPK)activator that can effectively inhibit metabolic disorders and metabolic dysfunction-associated steatohepatitis(MASH)progression.Moreover,we identified two new candidate molecules with potent anti-hepatocellular carcinoma effects.Particularly,as a natural guanine-nucleotide exchange factors for ADP-ribosylation factor GTPases(Arf-GEFs)inhibitor prodrug,CHNQD-01255(78)is qualified to be developed as a targeted candidate anticancer drug,which may be promising to apply for cancer immunotherapy.Hence,it is evident that MNPs play an important role in drug development.展开更多
Unmanned and aerial systems as interactors among different system components for communications,have opened up great opportunities for truth data discovery in Mobile Crowd Sensing(MCS)which has not been properly solve...Unmanned and aerial systems as interactors among different system components for communications,have opened up great opportunities for truth data discovery in Mobile Crowd Sensing(MCS)which has not been properly solved in the literature.In this paper,an Unmanned Aerial Vehicles-supported Intelligent Truth Discovery(UAV-ITD)scheme is proposed to obtain truth data at low-cost communications for MCS.The main innovations of the UAV-ITD scheme are as follows:(1)UAV-ITD scheme takes the first step in employing UAV joint Deep Matrix Factorization(DMF)to discover truth data based on the trust mechanism for an Information Elicitation Without Verification(IEWV)problem in MCS.(2)This paper introduces a truth data discovery scheme for the first time that only needs to collect a part of n data samples to infer the data of the entire network with high accuracy,which saves more communication costs than most previous data collection schemes,where they collect n or kn data samples.Finally,we conducted extensive experiments to evaluate the UAV-ITD scheme.The results show that compared with previous schemes,our scheme can reduce estimated truth error by 52.25%–96.09%,increase the accuracy of workers’trust evaluation by 0.68–61.82 times,and save recruitment costs by 24.08%–54.15%in truth data discovery.展开更多
In this paper,we propose a Multi-token Sector Antenna Neighbor Discovery(M-SAND)protocol to enhance the efficiency of neighbor discovery in asynchronous directional ad hoc networks.The central concept of our work invo...In this paper,we propose a Multi-token Sector Antenna Neighbor Discovery(M-SAND)protocol to enhance the efficiency of neighbor discovery in asynchronous directional ad hoc networks.The central concept of our work involves maintaining multiple tokens across the network.To prevent mutual interference among multi-token holders,we introduce the time and space non-interference theorems.Furthermore,we propose a master-slave strategy between tokens.When the master token holder(MTH)performs the neighbor discovery,it decides which 1-hop neighbor is the next MTH and which 2-hop neighbors can be the new slave token holders(STHs).Using this approach,the MTH and multiple STHs can simultaneously discover their neighbors without causing interference with each other.Building on this foundation,we provide a comprehensive procedure for the M-SAND protocol.We also conduct theoretical analyses on the maximum number of STHs and the lower bound of multi-token generation probability.Finally,simulation results demonstrate the time efficiency of the M-SAND protocol.When compared to the QSAND protocol,which uses only one token,the total neighbor discovery time is reduced by 28% when 6beams and 112 nodes are employed.展开更多
Recent developments in database technology have seen a wide variety of data being stored in huge collections. The wide variety makes the analysis tasks of a generic database a strenuous task in knowledge discovery. On...Recent developments in database technology have seen a wide variety of data being stored in huge collections. The wide variety makes the analysis tasks of a generic database a strenuous task in knowledge discovery. One approach is to summarize large datasets in such a way that the resulting summary dataset is of manageable size. Histogram has received significant attention as summarization/representative object for large database. But, it suffers from computational and space complexity. In this paper, we propose an idea to transform the histogram object into a Piecewise Linear Regression (PLR) line object and suggest that PLR objects can be less computational and storage intensive while compared to those of histograms. On the other hand to carry out a cluster analysis, we propose a distance measure for computing the distance between the PLR lines. Case study is presented based on the real data of online education system LMS. This demonstrates that PLR is a powerful knowledge representative for very large database.展开更多
Identification of underlying partial differential equations(PDEs)for complex systems remains a formidable challenge.In the present study,a robust PDE identification method is proposed,demonstrating the ability to extr...Identification of underlying partial differential equations(PDEs)for complex systems remains a formidable challenge.In the present study,a robust PDE identification method is proposed,demonstrating the ability to extract accurate governing equations under noisy conditions without prior knowledge.Specifically,the proposed method combines gene expression programming,one type of evolutionary algorithm capable of generating unseen terms based solely on basic operators and functional terms,with symbolic regression neural networks.These networks are designed to represent explicit functional expressions and optimize them with data gradients.In particular,the specifically designed neural networks can be easily transformed to physical constraints for the training data,embedding the discovered PDEs to further optimize the metadata used for iterative PDE identification.The proposed method has been tested in four canonical PDE cases,validating its effectiveness without preliminary information and confirming its suitability for practical applications across various noise levels.展开更多
BACKGROUND Colorectal cancer(CRC)is the third most frequent and the second most fatal cancer.The search for more effective drugs to treat this disease is ongoing.A better understanding of the mechanisms of CRC develop...BACKGROUND Colorectal cancer(CRC)is the third most frequent and the second most fatal cancer.The search for more effective drugs to treat this disease is ongoing.A better understanding of the mechanisms of CRC development and progression may reveal new therapeutic strategies.Ubiquitin-specific peptidases(USPs),the largest group of the deubiquitinase protein family,have long been implicated in various cancers.There have been numerous studies on the role of USPs in CRC;however,a comprehensive view of this role is lacking.AIM To provide a systematic review of the studies investigating the roles and functions of USPs in CRC.METHODS We systematically queried the MEDLINE(via PubMed),Scopus,and Web of Science databases.RESULTS Our study highlights the pivotal role of various USPs in several processes implicated in CRC:Regulation of the cell cycle,apoptosis,cancer stemness,epithelial–mesenchymal transition,metastasis,DNA repair,and drug resistance.The findings of this study suggest that USPs have great potential as drug targets and noninvasive biomarkers in CRC.The dysregulation of USPs in CRC contributes to drug resistance through multiple mechanisms.CONCLUSION Targeting specific USPs involved in drug resistance pathways could provide a novel therapeutic strategy for overcoming resistance to current treatment regimens in CRC.展开更多
We propose a novel framework for learning a low-dimensional representation of data based on nonlinear dynamical systems,which we call the dynamical dimension reduction(DDR).In the DDR model,each point is evolved via a...We propose a novel framework for learning a low-dimensional representation of data based on nonlinear dynamical systems,which we call the dynamical dimension reduction(DDR).In the DDR model,each point is evolved via a nonlinear flow towards a lower-dimensional subspace;the projection onto the subspace gives the low-dimensional embedding.Training the model involves identifying the nonlinear flow and the subspace.Following the equation discovery method,we represent the vector field that defines the flow using a linear combination of dictionary elements,where each element is a pre-specified linear/nonlinear candidate function.A regularization term for the average total kinetic energy is also introduced and motivated by the optimal transport theory.We prove that the resulting optimization problem is well-posed and establish several properties of the DDR method.We also show how the DDR method can be trained using a gradient-based optimization method,where the gradients are computed using the adjoint method from the optimal control theory.The DDR method is implemented and compared on synthetic and example data sets to other dimension reduction methods,including the PCA,t-SNE,and Umap.展开更多
We present a numerical approach for modeling unknown dynamical systems using partially observed data,with a focus on biological systems with(relatively)complex dynamical behavior.As an extension of the recently develo...We present a numerical approach for modeling unknown dynamical systems using partially observed data,with a focus on biological systems with(relatively)complex dynamical behavior.As an extension of the recently developed deep neural network(DNN)learning methods,our approach is particularly suitable for practical situations when(i)measurement data are available for only a subset of the state variables,and(ii)the system parameters cannot be observed or measured at all.We demonstrate that,with a properly designed DNN structure with memory terms,effective DNN models can be learned from such partially observed data containing hidden parameters.The learned DNN model serves as an accurate predictive tool for system analysis.Through a few representative biological problems,we demonstrate that such DNN models can capture qualitative dynamical behavior changes in the system,such as bifurcations,even when the parameters controlling such behavior changes are completely unknown throughout not only the model learning process but also the system prediction process.The learned DNN model effectively creates a“closed”model involving only the observables when such a closed-form model does not exist mathematically.展开更多
The rapidly advancing field of artificial intelligence(AI)has garnered substantial attention for its potential application in drug discovery and development.This opinion review critically examined the feasibility and ...The rapidly advancing field of artificial intelligence(AI)has garnered substantial attention for its potential application in drug discovery and development.This opinion review critically examined the feasibility and prospects of integrating AI as a transformative tool in the pharmaceutical industry.AI,encompassing machine learning algorithms,deep learning,and data analytics,offers unprecedented opportunities to streamline and enhance various stages of drug development.This opinion review delved into the current landscape of AI-driven approaches,discussing their utilization in target identification,lead optimization,and predictive modeling of pharmacokinetics and toxicity.We aimed to scrutinize the integration of large-scale omics data,electronic health records,and chemical informatics,highlighting the power of AI in uncovering novel therapeutic targets and accelerating drug repurposing strategies.Despite the considerable potential of AI,the review also addressed inherent challenges,including data privacy concerns,interpretability of AI models,and the need for robust validation in realworld clinical settings.Additionally,we explored ethical considerations surrounding AI-driven decision-making in drug development.This opinion review provided a nuanced perspective on the transformative role of AI in drug discovery by discussing the existing literature and emerging trends,presenting critical insights and addressing potential hurdles.In conclusion,this study aimed to stimulate discourse within the scientific community and guide future endeavors to harness the full potential of AI in drug development.展开更多
文摘Structural choice is a significant decision having an important influence on structural function, social economics, structural reliability and construction cost. A Case Based Reasoning system with its retrieval part constructed with a KDD subsystem, is put forward to make a decision for a large scale engineering project. A typical CBR system consists of four parts: case representation, case retriever, evaluation, and adaptation. A case library is a set of parameterized excellent and successful structures. For a structural choice, the key point is that the system must be able to detect the pattern classes hidden in the case library and classify the input parameters into classes properly. That is done by using the KDD Data Mining algorithm based on Self Organizing Feature Maps (SOFM), which makes the whole system more adaptive, self organizing, self learning and open.
基金supported by the National Science Foundation of China(grants No.41372090 and 41573042)the National Special Research Programs for Non-Profit Trades (grant No.201311136)Basic Scientific Research Operation Cost of State-Leveled Public Welfare Scientific Research Courtyard(grant No.K1203)
文摘Thallium has been used geochemical exploration of gold deposits. However, as an indicator element in searching for hydrothermal the T1 minerals and mineralization are rare in nature. Lorandite T1AsS2, a relatively uncommon mineral, has been dominantly discovered in some Carlin gold deposits, and minor Sb- Hg, U and Pb-Zn-Ag deposits.
文摘Objective To explore the candidate genes that play significant roles in the interconnection between abdominal aortic aneurysm(AAA)and type 2 diabetes mellitus(DM).Methods We used the Biomedical Discovery Support System(BITOLA)to screen out the candidate intermediate molecular(CIM)"Gene or Gene Product”that are related to AAA and DM.The dataset of GSE13760,GSE7084,GSE57691,GSE47472 were used to analyze the differentially expressed genes(DEGs)of AAA and DM compared to the healthy status.We used the online tool ofVenny 2.1 assisted by manual checking to identify the overlapped DEGs with the CIMs.The Human eFP Browser was applied to examine the tissue specific expression levels of the detected genes in order to recognize strong expressed genes in both human artery and pancreatic tissue.Results There were 86 CIMs suggested by the closed BITOLA system.Among all the DEGs of AAA and DM,8 genes in GSE7084(ISG20,ITGAX,DSTN,CCL5,CCR5,AGTR1,CD19,CD44)and 2 genes in GSE 13760(PSMD12,FAS)were found to be overlapped with the 86 CIMs.By manual checking and comparing with tissuespecific gene data through Human eFP Browser,the gene PSMD12(proteasome 26S subunit,non-ATPase 12)was recognized to be strongly expressed in both the aorta and pancreatic tissue.Conclusion We proposed a hypothesis through text mining that PSMD12 might be involved or potentially involved in the interconnection between AAA and DM,which may provide a new clue for studies on novel therapeutic strategies for the two diseases.
基金supported by funding from the Bluesand Foundation,Alzheimer's Association(AARG-21-852072 and Bias Frangione Early Career Achievement Award)to EDan Australian Government Research Training Program scholarship and the University of Sydney's Brain and Mind Centre fellowship to AH。
文摘Tauopathies,diseases characterized by neuropathological aggregates of tau including Alzheimer's disease and subtypes of fro ntotemporal dementia,make up the vast majority of dementia cases.Although there have been recent developments in tauopathy biomarkers and disease-modifying treatments,ongoing progress is required to ensure these are effective,economical,and accessible for the globally ageing population.As such,continued identification of new potential drug targets and biomarkers is critical."Big data"studies,such as proteomics,can generate information on thousands of possible new targets for dementia diagnostics and therapeutics,but currently remain underutilized due to the lack of a clear process by which targets are selected for future drug development.In this review,we discuss current tauopathy biomarkers and therapeutics,and highlight areas in need of improvement,particularly when addressing the needs of frail,comorbid and cognitively impaired populations.We highlight biomarkers which have been developed from proteomic data,and outline possible future directions in this field.We propose new criteria by which potential targets in proteomics studies can be objectively ranked as favorable for drug development,and demonstrate its application to our group's recent tau interactome dataset as an example.
文摘The need for the analysis of modern businesses is rapidly increasing as the supporting enterprise systems generate more and more data.This data can be extremely valuable for executing organizations because the data allows constant monitoring,analyzing,and improving the underlying processes,which leads to the reduction of cost and the improvement of the quality.Process mining is a useful technique for analyzing enterprise systems by using an event log that contains behaviours.This research focuses on the process discovery and refinement using real-life event log data collected from a large multinational organization that deals with coatings and paints.By investigating and analyzing their order handling pro-cesses,this study aims at learning a model that gives insight inspection of the processes and performance analysis.Furthermore,the animation is also performed for the better inspection,diagnostics,and compliance-related questions to specify the system.The configuration of the system and the conformance checking for further enhancement is also addressed in this research.To achieve the objectives,this research uses process mining techniques,i.e.process discovery in the form of formal Petri nets models with the help of process maps,and process refinement through conformance checking and enhancement.Initially,the identified executed process is reconstructed by using the process discovery techniques.Following the reconstruction,we perform a deep analysis for the underlying process to ensure the process improvement and redesigning.Finally,some recommendations are made to improve the enterprise management system processes.
文摘A new structure of ESKD (expert system based on knowledge discovery system KD (D&K)) is first presented on the basis of KD (D&K)-a synthesized knowledge discovery system based on double-base (database and knowledge base) cooperating mechanism. With all new features, ESKD may form a new research direction and provide a great probability for solving the wealth of knowledge in the knowledge base. The general structural frame of ESKD and some sub-systems among ESKD have been described, and the dynamic knowledge base based on double-base cooperating mechanism has been emphased on. According to the result of demonstrative experi- ment, the structure of ESKD is effective and feasible.
基金NTT-DoCoMo Beijing Communication Labs the National High Technology Research and Development Program of China(No.2006AA01Z276).
文摘To overcome the problem of existing neighboring access point (AP) discovery methods in WLAN, for example they (PnP) of mul proposed. Us can not provide the accurate neighboring APs information needed for the Plug-and-Play ti-mode APs, three kinds of neighboring AP discovery and information exchange methods are ing these three neighboring AP discovery methods, passive discovery method, active discovery method and station assistant discovery method, the multi-mode AP can discover all neighboring APs and obtain needed information. We further propose two whole process flows, which combine three discovery methods in different manner, to achieve different goals. One process flow is to discover the neighboring AP as fast as possible, called fast discovery process flow. The other is to discover the neighboring AP with minimal interference to neighboring and accuracy of the method is confirmed APs, called the minimal interference process flow. The validity by the simulation.
文摘A method is presented for performing knowledge discovery on the dynamic data of a nonlinear system. In the proposed approach, a synchronized phasor measurement technique is used to acquire the dynamic data of the nonlinear system and a hyper-rectangular type neural network (HRTNN) is then applied to extract crisp and fuzzy rules with which to estimate the system stability. The effectiveness of the proposed methodology is verified using the dynamic data of a typical real-world nonlinear system, namely an AEP-14 bus, and the extracted rules are relating to the knowledge discovery of the stability levels for the nonlinear system. The discovered relationships among the dynamic data (i.e., the operating state), the extracted rules, and the system stability are confirmed by means of a two-stage confirmatory factor analysis.
文摘Discovering floating wastes,especially bottles on water,is a crucial research problem in environmental hygiene.Nevertheless,real-world applications often face challenges such as interference from irrelevant objects and the high cost associated with data collection.Consequently,devising algorithms capable of accurately localizing specific objects within a scene in scenarios where annotated data is limited remains a formidable challenge.To solve this problem,this paper proposes an object discovery by request problem setting and a corresponding algorithmic framework.The proposed problem setting aims to identify specified objects in scenes,and the associated algorithmic framework comprises pseudo data generation and object discovery by request network.Pseudo-data generation generates images resembling natural scenes through various data augmentation rules,using a small number of object samples and scene images.The network structure of object discovery by request utilizes the pre-trained Vision Transformer(ViT)model as the backbone,employs object-centric methods to learn the latent representations of foreground objects,and applies patch-level reconstruction constraints to the model.During the validation phase,we use the generated pseudo datasets as training sets and evaluate the performance of our model on the original test sets.Experiments have proved that our method achieves state-of-the-art performance on Unmanned Aerial Vehicles-Bottle Detection(UAV-BD)dataset and self-constructed dataset Bottle,especially in multi-object scenarios.
基金supported by the Sichuan province Science&Technology Department Crops Breeding Project(2021YFYZ0002)。
文摘The continued expansion of the world population,increasingly inconsistent climate and shrinking agricultural resources present major challenges to crop breeding.Fortunately,the increasing ability to discover and manipulate genes creates new opportunities to develop more productive and resilient cultivars.Many genes have been described in papers as being beneficial for yield increase.However,few of them have been translated into increased yield on farms.In contrast,commercial breeders are facing gene decidophobia,i.e.,puzzled about which gene to choose for breeding among the many identified,a huge chasm between gene discovery and cultivar innovation.The purpose of this paper is to draw attention to the shortfalls in current gene discovery research and to emphasise the need to align with cultivar innovation.The methodology dictates that genetic studies not only focus on gene discovery but also pay good attention to the genetic backgrounds,experimental validation in relevant environments,appropriate crop management,and data reusability.The close of the gaps should accelerate the application of molecular study in breeding and contribute to future global food security.
基金supported by the Shandong Province Special Fund ‘Frontier Technology and Free Exploration’ from Laoshan Laboratory (No. 8-01)the National Natural Science Foundation of China (No. 42376116)+3 种基金the Special Funds of Shandong Province for Qingdao National Laboratory of Marine Science and Technology (No. 2022QN LM030003)the State Key Laboratory for Chemistry and Molecular Engineering of Medicinal Resources, Guangxi Normal University (No. CMEMR2023-B16)the National Key Research and Development Program of China (No. 2022YFC2601305)the Innovation Center for Academicians of Hainan Province, and the Fundamental Research Funds for the Central Universities (No. 202461059)
文摘Marine natural products(MNPs)are valuable resources for drug development.To date,17 drugs from marine sources are in clinical use,and 33 pharmaceutical compounds are in clinical trials.Presently the success of drug development from the marine resources is higher than the industry average.It is a feasible strategy to conduct the discovery of druglead compounds based on marine chemical ecology by fully exploiting the pharmacological potential of marine chemical defense matters.In the search for bioactive MNPs,our group has constructed a biological resources library including more than 1500 strains of fungi.Focusing on the strategy of Blue Drug Library,we have discovered a series of novel MNPs with abundant biological functions.Highly efficient and scalable total synthesis of(+)-aniduquinolone A(44)and pesimquinolone I(48)have been completed,which will facilitate access to sufficient quantities of candidates for in vivo pharmacological and toxicological studies.As a nucleoprotein(NP)inhibitor,QLA(75)possesses significant anti-influenza A virus(IAV)activities both in vitro and in vivo.CHNQD-00803(76)is a potent and selective AMP-activated kinase(AMPK)activator that can effectively inhibit metabolic disorders and metabolic dysfunction-associated steatohepatitis(MASH)progression.Moreover,we identified two new candidate molecules with potent anti-hepatocellular carcinoma effects.Particularly,as a natural guanine-nucleotide exchange factors for ADP-ribosylation factor GTPases(Arf-GEFs)inhibitor prodrug,CHNQD-01255(78)is qualified to be developed as a targeted candidate anticancer drug,which may be promising to apply for cancer immunotherapy.Hence,it is evident that MNPs play an important role in drug development.
基金supported by the National Natural Science Foundation of China under Grant No.62072475.
文摘Unmanned and aerial systems as interactors among different system components for communications,have opened up great opportunities for truth data discovery in Mobile Crowd Sensing(MCS)which has not been properly solved in the literature.In this paper,an Unmanned Aerial Vehicles-supported Intelligent Truth Discovery(UAV-ITD)scheme is proposed to obtain truth data at low-cost communications for MCS.The main innovations of the UAV-ITD scheme are as follows:(1)UAV-ITD scheme takes the first step in employing UAV joint Deep Matrix Factorization(DMF)to discover truth data based on the trust mechanism for an Information Elicitation Without Verification(IEWV)problem in MCS.(2)This paper introduces a truth data discovery scheme for the first time that only needs to collect a part of n data samples to infer the data of the entire network with high accuracy,which saves more communication costs than most previous data collection schemes,where they collect n or kn data samples.Finally,we conducted extensive experiments to evaluate the UAV-ITD scheme.The results show that compared with previous schemes,our scheme can reduce estimated truth error by 52.25%–96.09%,increase the accuracy of workers’trust evaluation by 0.68–61.82 times,and save recruitment costs by 24.08%–54.15%in truth data discovery.
基金supported in part by the National Natural Science Foundations of CHINA(Grant No.61771392,No.61771390,No.61871322 and No.61501373)Science and Technology on Avionics Integration Laboratory and the Aeronautical Science Foundation of China(Grant No.201955053002 and No.20185553035)。
文摘In this paper,we propose a Multi-token Sector Antenna Neighbor Discovery(M-SAND)protocol to enhance the efficiency of neighbor discovery in asynchronous directional ad hoc networks.The central concept of our work involves maintaining multiple tokens across the network.To prevent mutual interference among multi-token holders,we introduce the time and space non-interference theorems.Furthermore,we propose a master-slave strategy between tokens.When the master token holder(MTH)performs the neighbor discovery,it decides which 1-hop neighbor is the next MTH and which 2-hop neighbors can be the new slave token holders(STHs).Using this approach,the MTH and multiple STHs can simultaneously discover their neighbors without causing interference with each other.Building on this foundation,we provide a comprehensive procedure for the M-SAND protocol.We also conduct theoretical analyses on the maximum number of STHs and the lower bound of multi-token generation probability.Finally,simulation results demonstrate the time efficiency of the M-SAND protocol.When compared to the QSAND protocol,which uses only one token,the total neighbor discovery time is reduced by 28% when 6beams and 112 nodes are employed.
文摘Recent developments in database technology have seen a wide variety of data being stored in huge collections. The wide variety makes the analysis tasks of a generic database a strenuous task in knowledge discovery. One approach is to summarize large datasets in such a way that the resulting summary dataset is of manageable size. Histogram has received significant attention as summarization/representative object for large database. But, it suffers from computational and space complexity. In this paper, we propose an idea to transform the histogram object into a Piecewise Linear Regression (PLR) line object and suggest that PLR objects can be less computational and storage intensive while compared to those of histograms. On the other hand to carry out a cluster analysis, we propose a distance measure for computing the distance between the PLR lines. Case study is presented based on the real data of online education system LMS. This demonstrates that PLR is a powerful knowledge representative for very large database.
基金supported by the National Natural Science Foundation of China(Grant Nos.92152102 and 92152202)the Advanced Jet Propulsion Innovation Center/AEAC(Grant No.HKCX2022-01-010)。
文摘Identification of underlying partial differential equations(PDEs)for complex systems remains a formidable challenge.In the present study,a robust PDE identification method is proposed,demonstrating the ability to extract accurate governing equations under noisy conditions without prior knowledge.Specifically,the proposed method combines gene expression programming,one type of evolutionary algorithm capable of generating unseen terms based solely on basic operators and functional terms,with symbolic regression neural networks.These networks are designed to represent explicit functional expressions and optimize them with data gradients.In particular,the specifically designed neural networks can be easily transformed to physical constraints for the training data,embedding the discovered PDEs to further optimize the metadata used for iterative PDE identification.The proposed method has been tested in four canonical PDE cases,validating its effectiveness without preliminary information and confirming its suitability for practical applications across various noise levels.
文摘BACKGROUND Colorectal cancer(CRC)is the third most frequent and the second most fatal cancer.The search for more effective drugs to treat this disease is ongoing.A better understanding of the mechanisms of CRC development and progression may reveal new therapeutic strategies.Ubiquitin-specific peptidases(USPs),the largest group of the deubiquitinase protein family,have long been implicated in various cancers.There have been numerous studies on the role of USPs in CRC;however,a comprehensive view of this role is lacking.AIM To provide a systematic review of the studies investigating the roles and functions of USPs in CRC.METHODS We systematically queried the MEDLINE(via PubMed),Scopus,and Web of Science databases.RESULTS Our study highlights the pivotal role of various USPs in several processes implicated in CRC:Regulation of the cell cycle,apoptosis,cancer stemness,epithelial–mesenchymal transition,metastasis,DNA repair,and drug resistance.The findings of this study suggest that USPs have great potential as drug targets and noninvasive biomarkers in CRC.The dysregulation of USPs in CRC contributes to drug resistance through multiple mechanisms.CONCLUSION Targeting specific USPs involved in drug resistance pathways could provide a novel therapeutic strategy for overcoming resistance to current treatment regimens in CRC.
文摘We propose a novel framework for learning a low-dimensional representation of data based on nonlinear dynamical systems,which we call the dynamical dimension reduction(DDR).In the DDR model,each point is evolved via a nonlinear flow towards a lower-dimensional subspace;the projection onto the subspace gives the low-dimensional embedding.Training the model involves identifying the nonlinear flow and the subspace.Following the equation discovery method,we represent the vector field that defines the flow using a linear combination of dictionary elements,where each element is a pre-specified linear/nonlinear candidate function.A regularization term for the average total kinetic energy is also introduced and motivated by the optimal transport theory.We prove that the resulting optimization problem is well-posed and establish several properties of the DDR method.We also show how the DDR method can be trained using a gradient-based optimization method,where the gradients are computed using the adjoint method from the optimal control theory.The DDR method is implemented and compared on synthetic and example data sets to other dimension reduction methods,including the PCA,t-SNE,and Umap.
基金supported by the NSF(No.DMS-1813071)(Chou)and the AFSOR(No.FA9550-22-1-0011)(Xiu).
文摘We present a numerical approach for modeling unknown dynamical systems using partially observed data,with a focus on biological systems with(relatively)complex dynamical behavior.As an extension of the recently developed deep neural network(DNN)learning methods,our approach is particularly suitable for practical situations when(i)measurement data are available for only a subset of the state variables,and(ii)the system parameters cannot be observed or measured at all.We demonstrate that,with a properly designed DNN structure with memory terms,effective DNN models can be learned from such partially observed data containing hidden parameters.The learned DNN model serves as an accurate predictive tool for system analysis.Through a few representative biological problems,we demonstrate that such DNN models can capture qualitative dynamical behavior changes in the system,such as bifurcations,even when the parameters controlling such behavior changes are completely unknown throughout not only the model learning process but also the system prediction process.The learned DNN model effectively creates a“closed”model involving only the observables when such a closed-form model does not exist mathematically.
基金Supported by the European Union-NextGenerationEU,through the National Recovery and Resilience Plan of the Republic of Bulgaria,No.BG-RRP-2.004-0008.
文摘The rapidly advancing field of artificial intelligence(AI)has garnered substantial attention for its potential application in drug discovery and development.This opinion review critically examined the feasibility and prospects of integrating AI as a transformative tool in the pharmaceutical industry.AI,encompassing machine learning algorithms,deep learning,and data analytics,offers unprecedented opportunities to streamline and enhance various stages of drug development.This opinion review delved into the current landscape of AI-driven approaches,discussing their utilization in target identification,lead optimization,and predictive modeling of pharmacokinetics and toxicity.We aimed to scrutinize the integration of large-scale omics data,electronic health records,and chemical informatics,highlighting the power of AI in uncovering novel therapeutic targets and accelerating drug repurposing strategies.Despite the considerable potential of AI,the review also addressed inherent challenges,including data privacy concerns,interpretability of AI models,and the need for robust validation in realworld clinical settings.Additionally,we explored ethical considerations surrounding AI-driven decision-making in drug development.This opinion review provided a nuanced perspective on the transformative role of AI in drug discovery by discussing the existing literature and emerging trends,presenting critical insights and addressing potential hurdles.In conclusion,this study aimed to stimulate discourse within the scientific community and guide future endeavors to harness the full potential of AI in drug development.