Aiming at the lack of professional knowledge to guide apparel recommendation,an apparel recommendation method based on image design expert knowledge has been proposed.Then,apparel recommendation knowledge graphs have ...Aiming at the lack of professional knowledge to guide apparel recommendation,an apparel recommendation method based on image design expert knowledge has been proposed.Then,apparel recommendation knowledge graphs have been created and a apparel recommendation question and answer(Q&A)system has been designed and implemented.The question templates in the apparel recommendation domain were defined,the task of recognizing the named entities of question sentences was completed by the Bi-directional encoder representations from transformer-Bi-directional long short-term memory-conditional random field(BERT-BiLSTM-CRF)model,and the question template with the highest matching degree to the user’s question was obtained by using term frequency-inverse document frequency(TF-IDF)algorithm.The corresponding cypher graph database query statement was generated to retrieve the knowledge graph for answers,and iFLYTEK’s voice application programming interface(API)was called to implement the Q&A.The experimental results have shown that the Q&A system has a high accuracy rate and application value in the field of apparel recommendations.展开更多
Performance of Video Question and Answer(VQA)systems relies on capturing key information of both visual images and natural language in the context to generate relevant questions’answers.However,traditional linear com...Performance of Video Question and Answer(VQA)systems relies on capturing key information of both visual images and natural language in the context to generate relevant questions’answers.However,traditional linear combinations of multimodal features focus only on shallow feature interactions,fall far short of the need of deep feature fusion.Attention mechanisms were used to perform deep fusion,but most of them can only process weight assignment of single-modal information,leading to attention imbalance for different modalities.To address above problems,we propose a novel VQA model based on Triple Multimodal feature Cyclic Fusion(TMCF)and Self-AdaptiveMultimodal Balancing Mechanism(SAMB).Our model is designed to enhance complex feature interactions among multimodal features with cross-modal information balancing.In addition,TMCF and SAMB can be used as an extensible plug-in for exploring new feature combinations in the visual image domain.Extensive experiments were conducted on MSVDQA and MSRVTT-QA datasets.The results confirm the advantages of our approach in handling multimodal tasks.Besides,we also provide analyses for ablation studies to verify the effectiveness of each proposed component.展开更多
Service providers - from public institutions to primary care facilities - need to constantly attend toclients' inquiries to provide useful information and directive guidelines. Ensuring high quality serviceis challen...Service providers - from public institutions to primary care facilities - need to constantly attend toclients' inquiries to provide useful information and directive guidelines. Ensuring high quality serviceis challenging as it not only demands detailed domain-specific knowledge, but also the ability toquickly understand the clients' issues through their diverse - and often casual - descriptions. Thispaper aims to provide a framework for the development of an automated information broker agent whoperforms the task of a helper. The main task of the agent is to interact with the client and direct them toobtain further services that cater their personalized need. To do so, the agent should accomplish asequence of tasks that include natural language inquiry, knowledge gathering, reasoning, and givingfeedback; in this way, it simulates a human helper to engage in interaction with the client. Theframework combines a question-answering reasoning mechanism while utilizing domain-specificknowledge base. When the users cannot describe clearly their needs, the system tries to narrow downthe possibilities by an iterative question-answering process, until it eventually identifies the target. Inrealizing our framework, we make a proof-of-concept project, M andy, a primary care chatbot systemcreated to assist healthcare staffs by automating the patient intake process. We describe in detail thesystem functionalities and design of the system, and evaluate our proof-of-concept on benchmark casestudies.展开更多
文摘Aiming at the lack of professional knowledge to guide apparel recommendation,an apparel recommendation method based on image design expert knowledge has been proposed.Then,apparel recommendation knowledge graphs have been created and a apparel recommendation question and answer(Q&A)system has been designed and implemented.The question templates in the apparel recommendation domain were defined,the task of recognizing the named entities of question sentences was completed by the Bi-directional encoder representations from transformer-Bi-directional long short-term memory-conditional random field(BERT-BiLSTM-CRF)model,and the question template with the highest matching degree to the user’s question was obtained by using term frequency-inverse document frequency(TF-IDF)algorithm.The corresponding cypher graph database query statement was generated to retrieve the knowledge graph for answers,and iFLYTEK’s voice application programming interface(API)was called to implement the Q&A.The experimental results have shown that the Q&A system has a high accuracy rate and application value in the field of apparel recommendations.
基金This work was supported by the National Natural Science Foundation of China(No.61872231)the National Key Research and Development Program of China(No.2021YFC2801000)the Major Research plan of the National Social Science Foundation of China(No.20&ZD130).
文摘Performance of Video Question and Answer(VQA)systems relies on capturing key information of both visual images and natural language in the context to generate relevant questions’answers.However,traditional linear combinations of multimodal features focus only on shallow feature interactions,fall far short of the need of deep feature fusion.Attention mechanisms were used to perform deep fusion,but most of them can only process weight assignment of single-modal information,leading to attention imbalance for different modalities.To address above problems,we propose a novel VQA model based on Triple Multimodal feature Cyclic Fusion(TMCF)and Self-AdaptiveMultimodal Balancing Mechanism(SAMB).Our model is designed to enhance complex feature interactions among multimodal features with cross-modal information balancing.In addition,TMCF and SAMB can be used as an extensible plug-in for exploring new feature combinations in the visual image domain.Extensive experiments were conducted on MSVDQA and MSRVTT-QA datasets.The results confirm the advantages of our approach in handling multimodal tasks.Besides,we also provide analyses for ablation studies to verify the effectiveness of each proposed component.
文摘Service providers - from public institutions to primary care facilities - need to constantly attend toclients' inquiries to provide useful information and directive guidelines. Ensuring high quality serviceis challenging as it not only demands detailed domain-specific knowledge, but also the ability toquickly understand the clients' issues through their diverse - and often casual - descriptions. Thispaper aims to provide a framework for the development of an automated information broker agent whoperforms the task of a helper. The main task of the agent is to interact with the client and direct them toobtain further services that cater their personalized need. To do so, the agent should accomplish asequence of tasks that include natural language inquiry, knowledge gathering, reasoning, and givingfeedback; in this way, it simulates a human helper to engage in interaction with the client. Theframework combines a question-answering reasoning mechanism while utilizing domain-specificknowledge base. When the users cannot describe clearly their needs, the system tries to narrow downthe possibilities by an iterative question-answering process, until it eventually identifies the target. Inrealizing our framework, we make a proof-of-concept project, M andy, a primary care chatbot systemcreated to assist healthcare staffs by automating the patient intake process. We describe in detail thesystem functionalities and design of the system, and evaluate our proof-of-concept on benchmark casestudies.