Video data are composed of multimodal information streams including visual, auditory and textual streams, so an approach of story segmentation for news video using multimodal analysis is described in this paper. The p...Video data are composed of multimodal information streams including visual, auditory and textual streams, so an approach of story segmentation for news video using multimodal analysis is described in this paper. The proposed approach detects the topic-caption frames, and integrates them with silence clips detection results, as well as shot segmentation results to locate the news story boundaries. The integration of audio-visual features and text information overcomes the weakness of the approach using only image analysis techniques. On test data with 135 400 frames, when the boundaries between news stories are detected, the accuracy rate 85.8% and the recall rate 97.5% are obtained. The experimental results show the approach is valid and robust.展开更多
In recent years, text visualization has been widely acknowledged as an effective approach for understanding the structure and patterns hidden in complicated textual information. In this paper, we propose a new visuali...In recent years, text visualization has been widely acknowledged as an effective approach for understanding the structure and patterns hidden in complicated textual information. In this paper, we propose a new visualization system called TextInsight with two of our contributions. Firstly, a textual entropy theory is introduced to encode the semantic importance distribution in the corpus. Based on the proposed multidimensional joint probability histogram in vector fields, the improved algorithm provides a novel way to position valuable information in massive short texts accurately. Secondly, a map-like metaphor is generated to visualize the textual topics and their relationships. For the problem of over-segmentation in the layout and clustering procedure, we propose an optimization algorithm combining Affinity Propagation(AP) and MultiDimensional Scaling(MDS), and the improved geographical representation is more comprehensible and aesthetically appealing. Our experimental results and initial user feedback suggest that this system is effective in aiding text analysis.展开更多
Text visualization is concerned with the representation of text in a graphicalform to facilitate comprehension of large textual data. Its aim is to improve the ability tounderstand and utilize the wealth of text-based...Text visualization is concerned with the representation of text in a graphicalform to facilitate comprehension of large textual data. Its aim is to improve the ability tounderstand and utilize the wealth of text-based information available. An essential task inany scientific research is the study and review of previous works in the specified domain,a process that is referred to as the literature survey process. This process involves theidentification of prior work and evaluating its relevance to the research question. With theenormous number of published studies available online in digital form, this becomes acumbersome task for the researcher. This paper presents the design and implementationof a tool that aims to facilitate this process by identifying relevant work and suggestingclusters of articles by conceptual modeling, thus providing different options that enablethe researcher to visualize a large number of articles in a graphical easy-to-analyze form.The tool helps the researcher in analyzing and synthesizing the literature and building aconceptual understanding of the designated research area. The evaluation of the toolshows that researchers have found it useful and that it supported the process of relevantwork analysis given a specific research question, and 70% of the evaluators of the toolfound it very useful.展开更多
In order to ensure the safety,quality and efficiency of computer numerical control(CNC)machine tool processing,a real-time monitoring and visible solution for CNC machine tools based on hyper text markup language(HTML...In order to ensure the safety,quality and efficiency of computer numerical control(CNC)machine tool processing,a real-time monitoring and visible solution for CNC machine tools based on hyper text markup language(HTML)5 is proposed.The characteristics of the real-time monitoring technology of CNC machine tools under the traditional Client/Server(C/S)structure are compared and analyzed,and the technical drawbacks are proposed.Web real-time communication technology and browser drawing technology are deeply studied.A real-time monitoring and visible system for CNC machine tool data is developed based on Metro platform,combining WebSocket real-time communication technology and Canvas drawing technology.The system architecture is given,and the functions and implementation methods of the system are described in detail.The practical application results show that the WebSocket real-time communication technology can effectively reduce the bandwidth and network delay and save server resources.The numerical control machine data monitoring system can intuitively reflect the machine data,and the visible effect is good.It realizes timely monitoring of equipment alarms and prompts maintenance and management personnel.展开更多
In this paper, visualization of special features in “The Tale of Genji”, which is a typical Japanese classical literature, is studied by text mining the auxiliary verbs and examining the similarity in the sentence s...In this paper, visualization of special features in “The Tale of Genji”, which is a typical Japanese classical literature, is studied by text mining the auxiliary verbs and examining the similarity in the sentence style by the correspondence analysis with clustering. The result shows that the text mining error in the number of auxiliary verbs can be as small as 15%. The extracted feature in this study supports the multiple authors of “The Tale of Genji”, which agrees well with the result by Murakami and Imanishi [1]. It is also found that extracted features are robust to the text mining error, which suggests that the classification error is less affected by the text mining error and the possible use of this technique for further statistical study in classical literatures.展开更多
基于CiteSpace软件和文献计量学方法,以Web of science核心数据库为采集对象,“Pueraria Lobata”等为主题词。将1995—2023年的1683篇文献作为研究对象,对文献的发文量、发文作者、发文国家(地区)、发文机构、出版物等进行数据挖掘。...基于CiteSpace软件和文献计量学方法,以Web of science核心数据库为采集对象,“Pueraria Lobata”等为主题词。将1995—2023年的1683篇文献作为研究对象,对文献的发文量、发文作者、发文国家(地区)、发文机构、出版物等进行数据挖掘。运用关键词共现图谱、关键词聚类图谱、关键词突现分析、关键词时区图等方法进行数据可视化分析。结果显示:(1)葛根研究发文量总体呈现上升趋势;葛根研究的核心作者主要来自于中国、美国、韩国等地;核心作者群与核心机构群逐渐形成;葛根投稿期刊呈现明显的层次划分格局。(2)中国的发文量占发文总量的50%,位居全球第一。中国科学院、中国香港中文大学等机构为世界的葛根研究做出了巨大贡献,但从篇被引频次上分析,仍与其他机构存在一定差距。(3)葛根的研究逐渐从理论走向实践,时区图结果显示,目前葛根淀粉、肠道微生物群、网络药理学等成为新兴研究热点。葛根素、异黄酮等关键词贯穿葛根研究整个发展阶段,逐渐形成以药理学为基础,食品科学、化学、植物科学等多学科综合发展的新模式。展开更多
文摘Video data are composed of multimodal information streams including visual, auditory and textual streams, so an approach of story segmentation for news video using multimodal analysis is described in this paper. The proposed approach detects the topic-caption frames, and integrates them with silence clips detection results, as well as shot segmentation results to locate the news story boundaries. The integration of audio-visual features and text information overcomes the weakness of the approach using only image analysis techniques. On test data with 135 400 frames, when the boundaries between news stories are detected, the accuracy rate 85.8% and the recall rate 97.5% are obtained. The experimental results show the approach is valid and robust.
基金Supported by the National High Technology Research and Development Program of China(863 Program)(No.2013AA7013033)
文摘In recent years, text visualization has been widely acknowledged as an effective approach for understanding the structure and patterns hidden in complicated textual information. In this paper, we propose a new visualization system called TextInsight with two of our contributions. Firstly, a textual entropy theory is introduced to encode the semantic importance distribution in the corpus. Based on the proposed multidimensional joint probability histogram in vector fields, the improved algorithm provides a novel way to position valuable information in massive short texts accurately. Secondly, a map-like metaphor is generated to visualize the textual topics and their relationships. For the problem of over-segmentation in the layout and clustering procedure, we propose an optimization algorithm combining Affinity Propagation(AP) and MultiDimensional Scaling(MDS), and the improved geographical representation is more comprehensible and aesthetically appealing. Our experimental results and initial user feedback suggest that this system is effective in aiding text analysis.
文摘Text visualization is concerned with the representation of text in a graphicalform to facilitate comprehension of large textual data. Its aim is to improve the ability tounderstand and utilize the wealth of text-based information available. An essential task inany scientific research is the study and review of previous works in the specified domain,a process that is referred to as the literature survey process. This process involves theidentification of prior work and evaluating its relevance to the research question. With theenormous number of published studies available online in digital form, this becomes acumbersome task for the researcher. This paper presents the design and implementationof a tool that aims to facilitate this process by identifying relevant work and suggestingclusters of articles by conceptual modeling, thus providing different options that enablethe researcher to visualize a large number of articles in a graphical easy-to-analyze form.The tool helps the researcher in analyzing and synthesizing the literature and building aconceptual understanding of the designated research area. The evaluation of the toolshows that researchers have found it useful and that it supported the process of relevantwork analysis given a specific research question, and 70% of the evaluators of the toolfound it very useful.
文摘In order to ensure the safety,quality and efficiency of computer numerical control(CNC)machine tool processing,a real-time monitoring and visible solution for CNC machine tools based on hyper text markup language(HTML)5 is proposed.The characteristics of the real-time monitoring technology of CNC machine tools under the traditional Client/Server(C/S)structure are compared and analyzed,and the technical drawbacks are proposed.Web real-time communication technology and browser drawing technology are deeply studied.A real-time monitoring and visible system for CNC machine tool data is developed based on Metro platform,combining WebSocket real-time communication technology and Canvas drawing technology.The system architecture is given,and the functions and implementation methods of the system are described in detail.The practical application results show that the WebSocket real-time communication technology can effectively reduce the bandwidth and network delay and save server resources.The numerical control machine data monitoring system can intuitively reflect the machine data,and the visible effect is good.It realizes timely monitoring of equipment alarms and prompts maintenance and management personnel.
文摘In this paper, visualization of special features in “The Tale of Genji”, which is a typical Japanese classical literature, is studied by text mining the auxiliary verbs and examining the similarity in the sentence style by the correspondence analysis with clustering. The result shows that the text mining error in the number of auxiliary verbs can be as small as 15%. The extracted feature in this study supports the multiple authors of “The Tale of Genji”, which agrees well with the result by Murakami and Imanishi [1]. It is also found that extracted features are robust to the text mining error, which suggests that the classification error is less affected by the text mining error and the possible use of this technique for further statistical study in classical literatures.
文摘基于CiteSpace软件和文献计量学方法,以Web of science核心数据库为采集对象,“Pueraria Lobata”等为主题词。将1995—2023年的1683篇文献作为研究对象,对文献的发文量、发文作者、发文国家(地区)、发文机构、出版物等进行数据挖掘。运用关键词共现图谱、关键词聚类图谱、关键词突现分析、关键词时区图等方法进行数据可视化分析。结果显示:(1)葛根研究发文量总体呈现上升趋势;葛根研究的核心作者主要来自于中国、美国、韩国等地;核心作者群与核心机构群逐渐形成;葛根投稿期刊呈现明显的层次划分格局。(2)中国的发文量占发文总量的50%,位居全球第一。中国科学院、中国香港中文大学等机构为世界的葛根研究做出了巨大贡献,但从篇被引频次上分析,仍与其他机构存在一定差距。(3)葛根的研究逐渐从理论走向实践,时区图结果显示,目前葛根淀粉、肠道微生物群、网络药理学等成为新兴研究热点。葛根素、异黄酮等关键词贯穿葛根研究整个发展阶段,逐渐形成以药理学为基础,食品科学、化学、植物科学等多学科综合发展的新模式。