We present a method of 3D image mosaicing for real 3D representation of roadside buildings, and implement a Web-based interactive visualization environment for the 3D video mosaics created by 3D image mosaicing. The 3...We present a method of 3D image mosaicing for real 3D representation of roadside buildings, and implement a Web-based interactive visualization environment for the 3D video mosaics created by 3D image mosaicing. The 3D image mo- saicing technique developed in our previous work is a very powerful method for creating textured 3D-GIS data without excessive data processing like the laser or stereo system. For the Web-based open access to the 3D video mosaics, we build an interactive visualization environment using X3D, the emerging standard of Web 3D. We conduct the data preprocessing for 3D video mosaics and the X3D modeling for textured 3D data. The data preprocessing includes the conversion of each frame of 3D video mosaics into concatenated image files that can be hyperlinked on the Web. The X3D modeling handles the representation of concatenated images using necessary X3D nodes. By employing X3D as the data format for 3D image mosaics, the real 3D representation of roadside buildings is extended to the Web and mobile service systems.展开更多
With the continuous development of various types of fixed marine observation equipment,satellite remote sensing technology and computer simulation technology,modern marine scientific research has entered the era of bi...With the continuous development of various types of fixed marine observation equipment,satellite remote sensing technology and computer simulation technology,modern marine scientific research has entered the era of big data.Interactive ocean visuali-zation has become ubiquitous owing to the use of ocean data in studies of marine disasters,global climate change and fisheries.However,the primary challenge in analyzing large amounts of ocean data originates from the complexity of the data themselves.Therefore,an interactive multi-scale,multivariate visualization sys-tem with dynamic expansion potential is needed for analyzing larger volumes of ocean data.In this study,a unified visual data service was constructed,and a component-based interactive visua-lization structure for multi-dimensional,spatiotemporal ocean data is presented in this paper.Based on this structure,users can easily customize the system to visualize other types of scientific data.展开更多
An approach for generating interactive 3D graphical visualization of the genetic architectures of complex traits in multiple environments is described. 3D graphical visualization is utilized for making improvements on...An approach for generating interactive 3D graphical visualization of the genetic architectures of complex traits in multiple environments is described. 3D graphical visualization is utilized for making improvements on traditional plots in quan- titative trait locus (QTL) mapping analysis. Interactive 3D graphical visualization for abstract expression of QTL, epistasis and their environmental interactions for experimental populations was developed in framework of user-friendly software QTLNetwork (http://ibi.zju.edu.cn/software/qtlnetwork). Novel definition of graphical meta system and computation of virtual coordinates are used to achieve explicit but meaningful visualization. Interactive 3D graphical visualization for QTL analysis provides geneticists and breeders a powerful and easy-to-use tool to analyze and publish their research results.展开更多
More diverse data on animal ecology are now available.This“data deluge”presents challenges for both biologists and computer scientists;however,it also creates opportunities to improve analysis and answer more holist...More diverse data on animal ecology are now available.This“data deluge”presents challenges for both biologists and computer scientists;however,it also creates opportunities to improve analysis and answer more holistic research questions.We aim to increase awareness of the current opportunity for interdisciplinary research between animal ecology researchers and computer scientists.Immersive analytics(IA)is an emerging research field in which investigations are performed into how immersive technologies,such as large display walls and virtual reality and augmented reality devices,can be used to improve data analysis,outcomes,and communication.These investigations have the potential to reduce the analysis effort and widen the range of questions that can be addressed.We propose that biologists and computer scientists combine their efforts to lay the foundation for IA in animal ecology research.We discuss the potential and the challenges and outline a path toward a structured approach.We imagine that a joint effort would combine the strengths and expertise of both communities,leading to a well-defined research agenda and design space,practical guidelines,robust and reusable software frameworks,reduced analysis effort,and better comparability of results.展开更多
Mining the core value of Industrial Internet of Things(IIoT)data safely and reducing the risk of malicious attacks are the inherent requirements of industrial data visualization.Visualization technology has become the...Mining the core value of Industrial Internet of Things(IIoT)data safely and reducing the risk of malicious attacks are the inherent requirements of industrial data visualization.Visualization technology has become the main tool for data aggregation,mining and analysis of IIoT data through graphical representation.However,visualization technology still has two shortcomings in big data calculation and analysis scenarios.On the one hand,visual results will lead to the disclosure of sensitive privacy.On the other hand,most visualization tools can't provide an interactive framework for users to select the suitable solutions.To address these problems,we present an open accessible Visual framework based on Differential Privacy theory(VisDP),which provides Multi-index Quantitative comprehensive Evaluation technology(MQE)for data mining results.Considering the advantages of interactive mechanism,VisDP provides rich optional schemes,including the operating web,calling API and the downloading SDK.Finally,we verify the availability and privacy of MQE through mathematical proofs,analyze the hospital medical waste detection system that actually applies the framework,and the experimental results have showed the effectiveness and practicality of the proposed platform.展开更多
Machine vision measurement(MVM)is an essential approach that measures the area or length of a target efficiently and non-destructively for product quality control.The result of MVM is determined by its configuration,e...Machine vision measurement(MVM)is an essential approach that measures the area or length of a target efficiently and non-destructively for product quality control.The result of MVM is determined by its configuration,especially the lighting scheme design in image acquisition and the algorithmic parameter optimization in image processing.In a traditional workflow,engineers constantly adjust and verify the configuration for an acceptable result,which is time-consuming and significantly depends on expertise.To address these challenges,we propose a target-independent approach,visual interactive image clustering,which facilitates configuration optimization by grouping images into different clusters to suggest lighting schemes with common parameters.Our approach has four steps:data preparation,data sampling,data processing,and visual analysis with our visualization system.During preparation,engineers design several candidate lighting schemes to acquire images and develop an algorithm to process images.Our approach samples engineer-defined parameters for each image and obtains results by executing the algorithm.The core of data processing is the explainable measurement of the relationships among images using the algorithmic parameters.Based on the image relationships,we develop VMExplorer,a visual analytics system that assists engineers in grouping images into clusters and exploring parameters.Finally,engineers can determine an appropriate lighting scheme with robust parameter combinations.To demonstrate the effiectiveness and usability of our approach,we conduct a case study with engineers and obtain feedback from expert interviews.展开更多
Digital phenotyping is the characterization of human behavior patterns based on data from digital devices such as smartphones in order to gain insights into the users’state and especially to identify ailments.To supp...Digital phenotyping is the characterization of human behavior patterns based on data from digital devices such as smartphones in order to gain insights into the users’state and especially to identify ailments.To support supervised machine learning,digital phenotyping requires gathering data from study participants’smartphones as they live their lives.Periodically,participants are then asked to provide ground truth labels about their health status.Analyzing such complex data is challenging due to limited contextual information and imperfect health/wellness labels.We propose INteractive PHOne-o-typing VISualization(INPHOVIS),an interactive visual framework for exploratory analysis of smartphone health data to study phone-o-types.Prior visualization work has focused on mobile health data with clear semantics such as steps or heart rate data collected using dedicated health devices and wearables such as smartwatches.However,unlike smartphones which are owned by over 85 percent of the US population,wearable devices are less prevalent thus reducing the number of people from whom such data can be collected.In contrast,the‘‘low-level"sensor data(e.g.,accelerometer or GPS data)supported by INPHOVIS can be easily collected using smartphones.Data visualizations are designed to provide the essential contextualization of such data and thus help analysts discover complex relationships between observed sensor values and health-predictive phone-o-types.To guide the design of INPHOVIS,we performed a hierarchical task analysis of phone-o-typing requirements with health domain experts.We then designed and implemented multiple innovative visualizations integral to INPHOVIS including stacked bar charts to show diurnal behavioral patterns,calendar views to visualize day-level data along with bar charts,and correlation views to visualize important wellness predictive data.We demonstrate the usefulness of INPHOVIS with walk-throughs of use cases.We also evaluated INPHOVIS with expert feedback and received encouraging responses.展开更多
Data imputation is an essential pre-processing task for data governance,aimed at filling in incomplete data.However,conventional data imputation methods can only partly alleviate data incompleteness using isolated tab...Data imputation is an essential pre-processing task for data governance,aimed at filling in incomplete data.However,conventional data imputation methods can only partly alleviate data incompleteness using isolated tabular data,and they fail to achieve the best balance between accuracy and eficiency.In this paper,we present a novel visual analysis approach for data imputation.We develop a multi-party tabular data association strategy that uses intelligent algorithms to identify similar columns and establish column correlations across multiple tables.Then,we perform the initial imputation of incomplete data using correlated data entries from other tables.Additionally,we develop a visual analysis system to refine data imputation candidates.Our interactive system combines the multi-party data imputation approach with expert knowledge,allowing for a better understanding of the relational structure of the data.This significantly enhances the accuracy and eficiency of data imputation,thereby enhancing the quality of data governance and the intrinsic value of data assets.Experimental validation and user surveys demonstrate that this method supports users in verifying and judging the associated columns and similar rows using theirdomain knowledge.展开更多
Large,complex networks are commonly found in many application domains,such as sociology,biology,and software engineering.Analyzing such networks can be a non-trivial task,as it often takes many interactions to derive ...Large,complex networks are commonly found in many application domains,such as sociology,biology,and software engineering.Analyzing such networks can be a non-trivial task,as it often takes many interactions to derive a finding.It is thus beneficial to capture and summarize the important steps in an analysis.This provenance would then effectively support recalling,reusing,reproducing,and sharing the analysis process and results.However,the provenance of analyzing a large,complex network would often be a long interaction record.To automatically compose a concise visual summarization of network analysis provenance,we introduce a ranking model together with a reduction algorithm.The model identifies and orders important interactions used in the network analysis.Based on this model,our algorithm is able to minimize the provenance,while still preserving all the essential steps for recalling and sharing the analysis process and results.We create a prototype system demonstrating the effectiveness of our model and algorithm with two usage scenarios.展开更多
We introduce the concept of time mask,which is a type of temporal filter suitable for selection of multiple disjoint time intervals in which some query conditions fulfil.Such a filter can be applied to time-referenced...We introduce the concept of time mask,which is a type of temporal filter suitable for selection of multiple disjoint time intervals in which some query conditions fulfil.Such a filter can be applied to time-referenced objects,such as events and trajectories,for selecting those objects or segments of trajectories that fit in one of the selected time intervals.The selected subsets of objects or segments are dynamically summarized in various ways,and the summaries are represented visually on maps and/or other displays to enable exploration.The time mask filtering can be especially helpful in analysis of disparate data(e.g.,event records,positions of moving objects,and time series of measurements),which may come from different sources.To detect relationships between such data,the analyst may set query conditions on the basis of one dataset and investigate the subsets of objects and values in the other datasets that co-occurred in time with these conditions.We describe the desired features of an interactive tool for time mask filtering and present a possible implementation of such a tool.By example of analysing two real world data collections related to aviation and maritime traffic,we show the way of using time masks in combination with other types of filters and demonstrate the utility of the time mask filtering.展开更多
The open and distributed connection of the powersystem makes it vulnerable to various potential cyber-attacks,which may lead to power outages and even casualties. Therefore,the construction of attack and defense drill...The open and distributed connection of the powersystem makes it vulnerable to various potential cyber-attacks,which may lead to power outages and even casualties. Therefore,the construction of attack and defense drill (ADD) platforms forattack mechanism investigation and protection strategy evaluationhas become a research hotspot. However, for the massiveand heterogeneous security analysis data generated during thedrill, it is rare to have a comprehensive and intuitive methodto visually and efficiently display the perspective of the attackerand defender. In order to solve this problem, this paper proposesa visual analysis scheme of an ADD framework for a grid cyberphysicalsystem (GCPS) based on the interactive visual analysismethod. Specifically, it realizes system weakness discovery basedon knowledge visualization, optimization of the detection modeland visualization interaction. Finally, the case study on thesimulation platform of ADD proves the effectiveness of theproposed method.展开更多
Human Bio-Behavioral Rhythms(HBRs)such as sleep-wake cycles(Circadian Rhythms),and the degree of regularity of sleep and physical activity have important health ramifications.Ubiquitous devices such as smartphones can...Human Bio-Behavioral Rhythms(HBRs)such as sleep-wake cycles(Circadian Rhythms),and the degree of regularity of sleep and physical activity have important health ramifications.Ubiquitous devices such as smartphones can sense HBRs by continuously analyzing data gathered passively by built-in sensors to discover important clues about the degree of regularity and disruptions in behavioral patterns.As human behavior is complex and smartphone data is voluminous with many channels(sensor types),it can be challenging to make meaningful observations,detect unhealthy HBR deviations and most importantly pin-point the causes of disruptions.Prior work has largely utilized computational methods such as machine and deep learning approaches,which while accurate,are often not explainable and present few actionable insights on HBR patterns or causes.To assist analysts in the discovery and understanding of HBR patterns,disruptions and causes,we propose ARGUS,an interactive visual analytics framework.As a foundation of ARGUS,we design an intuitive Rhythm Deviation Score(RDS)that analyzes users’smartphone sensor data,extracts underlying twenty-four-hour rhythms and quantifies their degree of irregularity.This score is then visualized using a glyph that makes it easy to recognize disruptions in the regularity of HBRs.ARGUS also facilitates deeper HBR insights and understanding of causes by linking multiple visualization panes that are overlaid with objective sensor information such as geo-locations and phone state(screen locked,charging),and user-provided or smartphone-inferred ground truth information.This array of visualization overlays in ARGUS enables analysts to gain a more comprehensive picture of HBRs,behavioral patterns and deviations from regularity.The design of ARGUS was guided by a goal and task analysis study involving an expert versed in HBR and smartphone sensing.To demonstrate its utility and generalizability,two different datasets were explored using ARGUS and our use cases and designs were strongly validated in evaluation sessions with expert and non-expert users.展开更多
Interactive model analysis,the process of understanding,diagnosing,and refining a machine learning model with the help of interactive visualization,is very important for users to efficiently solve real-world artificia...Interactive model analysis,the process of understanding,diagnosing,and refining a machine learning model with the help of interactive visualization,is very important for users to efficiently solve real-world artificial intelligence and data mining problems.Dramatic advances in big data analytics have led to a wide variety of interactive model analysis tasks.In this paper,we present a comprehensive analysis and interpretation of this rapidly developing area.Specifically,we classify the relevant work into three categories:understanding,diagnosis,and refinement.Each category is exemplified by recent influential work.Possible future research opportunities are also explored and discussed.展开更多
Computed tomography(CT)generates cross-sectional images of the body.Visualizing CT images has been a challenging problem.The emergence of the augmented and virtual reality technology has provided promising solutions.H...Computed tomography(CT)generates cross-sectional images of the body.Visualizing CT images has been a challenging problem.The emergence of the augmented and virtual reality technology has provided promising solutions.However,existing solutions suffer from tethered display or wireless transmission latency.In this paper,we present ARSlice,a proof-of-concept prototype that can visualize CT images in an untethered manner without wireless transmission latency.Our ARSlice prototype consists of two parts,the user end and the projector end.By employing dynamic tracking and projection,the projector end can track the user-end equipment and project CT images onto it in real time.The user-end equipment is responsible for displaying these CT images into the 3D space.Its main feature is that the user-end equipment is a pure optical device with light weight,low cost,and no energy consumption.Our experiments demonstrate that our ARSlice prototype provides part of six degrees of freedom for the user,and a high frame rate.By interactively visualizing CT images into the 3D space,our ARSlice prototype can help untrained users better understand that CT images are slices of a body.展开更多
Exploratory data analysis plays a major role in obtaining insights from data.Over the last two decades,researchers have proposed several visual data exploration tools that can assist with each step of the analysis pro...Exploratory data analysis plays a major role in obtaining insights from data.Over the last two decades,researchers have proposed several visual data exploration tools that can assist with each step of the analysis process.Nevertheless,in recent years,data analysis requirements have changed significantly.With constantly increasing size and types of data to be analyzed,scalability and analysis duration are now among the primary concerns of researchers.Moreover,in order to minimize the analysis cost,businesses are in need of data analysis tools that can be used with limited analytical knowledge.To address these challenges,traditional data exploration tools have evolved within the last few years.In this paper,with an in-depth analysis of an industrial tabular dataset,we identify a set of additional exploratory requirements for large datasets.Later,we present a comprehensive survey of the recent advancements in the emerging field of exploratory data analysis.We investigate 50 academic and non-academic visual data exploration tools with respect to their utility in the six fundamental steps of the exploratory data analysis process.We also examine the extent to which these modern data exploration tools fulfill the additional requirements for analyzing large datasets.Finally,we identify and present a set of research opportunities in the field of visual exploratory data analysis.展开更多
Neural attention-based encoders,which effectively attend sentence tokens to their associated context without being restricted by long-term distance or dependency,have demonstrated outstanding performance in embedding ...Neural attention-based encoders,which effectively attend sentence tokens to their associated context without being restricted by long-term distance or dependency,have demonstrated outstanding performance in embedding sentences into meaningful representations(embeddings).The Universal Sentence Encoder(USE)is one of the most well-recognized deep neural network(DNN)based solutions,which is facilitated with an attention-driven transformer architecture and has been pre-trained on a large number of sentences from the Internet.Besides the fact that USE has been widely used in many downstream applications,including information retrieval(IR),interpreting its complicated internal working mechanism remains challenging.In this work,we present a visual analytics solution towards addressing this challenge.Specifically,focused on semantics and syntactics(concepts and relations)that are critical to domain clinical IR,we designed and developed a visual analytics system,i.e.,USEVis.The system investigates the power of USE in effectively extracting sentences’semantics and syntactics through exploring and interpreting how linguistic properties are captured by attentions.Furthermore,by thoroughly examining and comparing the inherent patterns of these attentions,we are able to exploit attentions to retrieve sentences/documents that have similar semantics or are closely related to a given clinical problem in IR.By collaborating with domain experts,we demonstrate use cases with inspiring findings to validate the contribution of our work and the effectiveness of our system.展开更多
Non-coding regions are the major component of human genomes and the long non-coding RNA(IncRNA)is a class of pervasive genes located in noncoding regions(Morris and Mattick,2014).IncRNAs play a wide range of regul...Non-coding regions are the major component of human genomes and the long non-coding RNA(IncRNA)is a class of pervasive genes located in noncoding regions(Morris and Mattick,2014).IncRNAs play a wide range of regulatory roles in gene transcription,translation,epigenetic modification and protein function by interacting with different types of molecules including DNA,展开更多
文摘We present a method of 3D image mosaicing for real 3D representation of roadside buildings, and implement a Web-based interactive visualization environment for the 3D video mosaics created by 3D image mosaicing. The 3D image mo- saicing technique developed in our previous work is a very powerful method for creating textured 3D-GIS data without excessive data processing like the laser or stereo system. For the Web-based open access to the 3D video mosaics, we build an interactive visualization environment using X3D, the emerging standard of Web 3D. We conduct the data preprocessing for 3D video mosaics and the X3D modeling for textured 3D data. The data preprocessing includes the conversion of each frame of 3D video mosaics into concatenated image files that can be hyperlinked on the Web. The X3D modeling handles the representation of concatenated images using necessary X3D nodes. By employing X3D as the data format for 3D image mosaics, the real 3D representation of roadside buildings is extended to the Web and mobile service systems.
基金the Key R&D project of Shandong Province(2019JZZY010102)the Big Earth Data Science Engineering Project(XDA19060104)the 13th Five-year Informatization Plan of the Chinese Academy of Sciences,the Construction of Scientific Data Center System(XXH-13514).
文摘With the continuous development of various types of fixed marine observation equipment,satellite remote sensing technology and computer simulation technology,modern marine scientific research has entered the era of big data.Interactive ocean visuali-zation has become ubiquitous owing to the use of ocean data in studies of marine disasters,global climate change and fisheries.However,the primary challenge in analyzing large amounts of ocean data originates from the complexity of the data themselves.Therefore,an interactive multi-scale,multivariate visualization sys-tem with dynamic expansion potential is needed for analyzing larger volumes of ocean data.In this study,a unified visual data service was constructed,and a component-based interactive visua-lization structure for multi-dimensional,spatiotemporal ocean data is presented in this paper.Based on this structure,users can easily customize the system to visualize other types of scientific data.
基金Project supported by the National Natural Science Foundation of China (Nos. 60473106, 60273060 and 60333010)and the Ph.D. Programs Foundation of Ministry of Education,China (Nos.20030335064 and 20060335114 )
文摘An approach for generating interactive 3D graphical visualization of the genetic architectures of complex traits in multiple environments is described. 3D graphical visualization is utilized for making improvements on traditional plots in quan- titative trait locus (QTL) mapping analysis. Interactive 3D graphical visualization for abstract expression of QTL, epistasis and their environmental interactions for experimental populations was developed in framework of user-friendly software QTLNetwork (http://ibi.zju.edu.cn/software/qtlnetwork). Novel definition of graphical meta system and computation of virtual coordinates are used to achieve explicit but meaningful visualization. Interactive 3D graphical visualization for QTL analysis provides geneticists and breeders a powerful and easy-to-use tool to analyze and publish their research results.
文摘More diverse data on animal ecology are now available.This“data deluge”presents challenges for both biologists and computer scientists;however,it also creates opportunities to improve analysis and answer more holistic research questions.We aim to increase awareness of the current opportunity for interdisciplinary research between animal ecology researchers and computer scientists.Immersive analytics(IA)is an emerging research field in which investigations are performed into how immersive technologies,such as large display walls and virtual reality and augmented reality devices,can be used to improve data analysis,outcomes,and communication.These investigations have the potential to reduce the analysis effort and widen the range of questions that can be addressed.We propose that biologists and computer scientists combine their efforts to lay the foundation for IA in animal ecology research.We discuss the potential and the challenges and outline a path toward a structured approach.We imagine that a joint effort would combine the strengths and expertise of both communities,leading to a well-defined research agenda and design space,practical guidelines,robust and reusable software frameworks,reduced analysis effort,and better comparability of results.
基金supported by the National Key Research and Development Program of China under Grant No.2020YFC2006600the National Natural Science Foundation of China under Grant No.62003291the National Science and Technology Foundation Project under Grant No.2019FY100100,and the QingLan Project.
文摘Mining the core value of Industrial Internet of Things(IIoT)data safely and reducing the risk of malicious attacks are the inherent requirements of industrial data visualization.Visualization technology has become the main tool for data aggregation,mining and analysis of IIoT data through graphical representation.However,visualization technology still has two shortcomings in big data calculation and analysis scenarios.On the one hand,visual results will lead to the disclosure of sensitive privacy.On the other hand,most visualization tools can't provide an interactive framework for users to select the suitable solutions.To address these problems,we present an open accessible Visual framework based on Differential Privacy theory(VisDP),which provides Multi-index Quantitative comprehensive Evaluation technology(MQE)for data mining results.Considering the advantages of interactive mechanism,VisDP provides rich optional schemes,including the operating web,calling API and the downloading SDK.Finally,we verify the availability and privacy of MQE through mathematical proofs,analyze the hospital medical waste detection system that actually applies the framework,and the experimental results have showed the effectiveness and practicality of the proposed platform.
基金Project supported by the National Key R&D Program of China(No.2020YFB1707700)the Zhejiang Provincial Natural Science Foundation of China(No.LR23F020003)the National Nat-ural Science Foundation of China(Nos.61972356 and 62036009)。
文摘Machine vision measurement(MVM)is an essential approach that measures the area or length of a target efficiently and non-destructively for product quality control.The result of MVM is determined by its configuration,especially the lighting scheme design in image acquisition and the algorithmic parameter optimization in image processing.In a traditional workflow,engineers constantly adjust and verify the configuration for an acceptable result,which is time-consuming and significantly depends on expertise.To address these challenges,we propose a target-independent approach,visual interactive image clustering,which facilitates configuration optimization by grouping images into different clusters to suggest lighting schemes with common parameters.Our approach has four steps:data preparation,data sampling,data processing,and visual analysis with our visualization system.During preparation,engineers design several candidate lighting schemes to acquire images and develop an algorithm to process images.Our approach samples engineer-defined parameters for each image and obtains results by executing the algorithm.The core of data processing is the explainable measurement of the relationships among images using the algorithmic parameters.Based on the image relationships,we develop VMExplorer,a visual analytics system that assists engineers in grouping images into clusters and exploring parameters.Finally,engineers can determine an appropriate lighting scheme with robust parameter combinations.To demonstrate the effiectiveness and usability of our approach,we conduct a case study with engineers and obtain feedback from expert interviews.
基金This material is based on research sponsored by DARPA,United States under agreement number FA8750-18-2-0077The U.S.Government is authorized to reproduce and distribute reprints for Governmental purposes not withstanding any copyright notation thereonThe views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements,either expressed or implied,of DARPA or the U.S.Government。
文摘Digital phenotyping is the characterization of human behavior patterns based on data from digital devices such as smartphones in order to gain insights into the users’state and especially to identify ailments.To support supervised machine learning,digital phenotyping requires gathering data from study participants’smartphones as they live their lives.Periodically,participants are then asked to provide ground truth labels about their health status.Analyzing such complex data is challenging due to limited contextual information and imperfect health/wellness labels.We propose INteractive PHOne-o-typing VISualization(INPHOVIS),an interactive visual framework for exploratory analysis of smartphone health data to study phone-o-types.Prior visualization work has focused on mobile health data with clear semantics such as steps or heart rate data collected using dedicated health devices and wearables such as smartwatches.However,unlike smartphones which are owned by over 85 percent of the US population,wearable devices are less prevalent thus reducing the number of people from whom such data can be collected.In contrast,the‘‘low-level"sensor data(e.g.,accelerometer or GPS data)supported by INPHOVIS can be easily collected using smartphones.Data visualizations are designed to provide the essential contextualization of such data and thus help analysts discover complex relationships between observed sensor values and health-predictive phone-o-types.To guide the design of INPHOVIS,we performed a hierarchical task analysis of phone-o-typing requirements with health domain experts.We then designed and implemented multiple innovative visualizations integral to INPHOVIS including stacked bar charts to show diurnal behavioral patterns,calendar views to visualize day-level data along with bar charts,and correlation views to visualize important wellness predictive data.We demonstrate the usefulness of INPHOVIS with walk-throughs of use cases.We also evaluated INPHOVIS with expert feedback and received encouraging responses.
基金Project supported by the Key R&D"Pioneer"Tackling Plan Program of Zhejiang Province,China(No.2023C01119)the"Ten Thousand Talents Plan"Science and Technology Innovation Leading Talent Program of Zhejiang Province,China(No.2022R52044)+1 种基金the Major Standardization Pilot Projects for the Digital Economy(Digital Trade Sector)of Zhejiang Province,China(No.SJ-Bz/2023053)the National Natural Science Foundationof China(No.62132017)。
文摘Data imputation is an essential pre-processing task for data governance,aimed at filling in incomplete data.However,conventional data imputation methods can only partly alleviate data incompleteness using isolated tabular data,and they fail to achieve the best balance between accuracy and eficiency.In this paper,we present a novel visual analysis approach for data imputation.We develop a multi-party tabular data association strategy that uses intelligent algorithms to identify similar columns and establish column correlations across multiple tables.Then,we perform the initial imputation of incomplete data using correlated data entries from other tables.Additionally,we develop a visual analysis system to refine data imputation candidates.Our interactive system combines the multi-party data imputation approach with expert knowledge,allowing for a better understanding of the relational structure of the data.This significantly enhances the accuracy and eficiency of data imputation,thereby enhancing the quality of data governance and the intrinsic value of data assets.Experimental validation and user surveys demonstrate that this method supports users in verifying and judging the associated columns and similar rows using theirdomain knowledge.
基金This research is sponsored in part by the U.S.National Science Foundation through grants IIS-1528203 and IIS-1741536.
文摘Large,complex networks are commonly found in many application domains,such as sociology,biology,and software engineering.Analyzing such networks can be a non-trivial task,as it often takes many interactions to derive a finding.It is thus beneficial to capture and summarize the important steps in an analysis.This provenance would then effectively support recalling,reusing,reproducing,and sharing the analysis process and results.However,the provenance of analyzing a large,complex network would often be a long interaction record.To automatically compose a concise visual summarization of network analysis provenance,we introduce a ranking model together with a reduction algorithm.The model identifies and orders important interactions used in the network analysis.Based on this model,our algorithm is able to minimize the provenance,while still preserving all the essential steps for recalling and sharing the analysis process and results.We create a prototype system demonstrating the effectiveness of our model and algorithm with two usage scenarios.
基金This work was supported in part by EU in project datAcron(grant agreement 687591).
文摘We introduce the concept of time mask,which is a type of temporal filter suitable for selection of multiple disjoint time intervals in which some query conditions fulfil.Such a filter can be applied to time-referenced objects,such as events and trajectories,for selecting those objects or segments of trajectories that fit in one of the selected time intervals.The selected subsets of objects or segments are dynamically summarized in various ways,and the summaries are represented visually on maps and/or other displays to enable exploration.The time mask filtering can be especially helpful in analysis of disparate data(e.g.,event records,positions of moving objects,and time series of measurements),which may come from different sources.To detect relationships between such data,the analyst may set query conditions on the basis of one dataset and investigate the subsets of objects and values in the other datasets that co-occurred in time with these conditions.We describe the desired features of an interactive tool for time mask filtering and present a possible implementation of such a tool.By example of analysing two real world data collections related to aviation and maritime traffic,we show the way of using time masks in combination with other types of filters and demonstrate the utility of the time mask filtering.
基金the Science and Technology Project of State Grid Corporation of China(Research on key technologies of integrated electric power network security simulation and verification environment,521304190004).
文摘The open and distributed connection of the powersystem makes it vulnerable to various potential cyber-attacks,which may lead to power outages and even casualties. Therefore,the construction of attack and defense drill (ADD) platforms forattack mechanism investigation and protection strategy evaluationhas become a research hotspot. However, for the massiveand heterogeneous security analysis data generated during thedrill, it is rare to have a comprehensive and intuitive methodto visually and efficiently display the perspective of the attackerand defender. In order to solve this problem, this paper proposesa visual analysis scheme of an ADD framework for a grid cyberphysicalsystem (GCPS) based on the interactive visual analysismethod. Specifically, it realizes system weakness discovery basedon knowledge visualization, optimization of the detection modeland visualization interaction. Finally, the case study on thesimulation platform of ADD proves the effectiveness of theproposed method.
基金This material is based on research sponsored by DARPA,USA under agreement number FA8750-18-2-0077。
文摘Human Bio-Behavioral Rhythms(HBRs)such as sleep-wake cycles(Circadian Rhythms),and the degree of regularity of sleep and physical activity have important health ramifications.Ubiquitous devices such as smartphones can sense HBRs by continuously analyzing data gathered passively by built-in sensors to discover important clues about the degree of regularity and disruptions in behavioral patterns.As human behavior is complex and smartphone data is voluminous with many channels(sensor types),it can be challenging to make meaningful observations,detect unhealthy HBR deviations and most importantly pin-point the causes of disruptions.Prior work has largely utilized computational methods such as machine and deep learning approaches,which while accurate,are often not explainable and present few actionable insights on HBR patterns or causes.To assist analysts in the discovery and understanding of HBR patterns,disruptions and causes,we propose ARGUS,an interactive visual analytics framework.As a foundation of ARGUS,we design an intuitive Rhythm Deviation Score(RDS)that analyzes users’smartphone sensor data,extracts underlying twenty-four-hour rhythms and quantifies their degree of irregularity.This score is then visualized using a glyph that makes it easy to recognize disruptions in the regularity of HBRs.ARGUS also facilitates deeper HBR insights and understanding of causes by linking multiple visualization panes that are overlaid with objective sensor information such as geo-locations and phone state(screen locked,charging),and user-provided or smartphone-inferred ground truth information.This array of visualization overlays in ARGUS enables analysts to gain a more comprehensive picture of HBRs,behavioral patterns and deviations from regularity.The design of ARGUS was guided by a goal and task analysis study involving an expert versed in HBR and smartphone sensing.To demonstrate its utility and generalizability,two different datasets were explored using ARGUS and our use cases and designs were strongly validated in evaluation sessions with expert and non-expert users.
文摘Interactive model analysis,the process of understanding,diagnosing,and refining a machine learning model with the help of interactive visualization,is very important for users to efficiently solve real-world artificial intelligence and data mining problems.Dramatic advances in big data analytics have led to a wide variety of interactive model analysis tasks.In this paper,we present a comprehensive analysis and interpretation of this rapidly developing area.Specifically,we classify the relevant work into three categories:understanding,diagnosis,and refinement.Each category is exemplified by recent influential work.Possible future research opportunities are also explored and discussed.
基金the National Natural Science Foundation of China under Grant No.61872210the Guangdong Basic and Applied Basic Research Foundation under Grant Nos.2021A1515012596 and 2021B1515120064the Guangdong Academy of Sciences Special Foundation under Grant No.2021GDASYL-20210102006.
文摘Computed tomography(CT)generates cross-sectional images of the body.Visualizing CT images has been a challenging problem.The emergence of the augmented and virtual reality technology has provided promising solutions.However,existing solutions suffer from tethered display or wireless transmission latency.In this paper,we present ARSlice,a proof-of-concept prototype that can visualize CT images in an untethered manner without wireless transmission latency.Our ARSlice prototype consists of two parts,the user end and the projector end.By employing dynamic tracking and projection,the projector end can track the user-end equipment and project CT images onto it in real time.The user-end equipment is responsible for displaying these CT images into the 3D space.Its main feature is that the user-end equipment is a pure optical device with light weight,low cost,and no energy consumption.Our experiments demonstrate that our ARSlice prototype provides part of six degrees of freedom for the user,and a high frame rate.By interactively visualizing CT images into the 3D space,our ARSlice prototype can help untrained users better understand that CT images are slices of a body.
文摘Exploratory data analysis plays a major role in obtaining insights from data.Over the last two decades,researchers have proposed several visual data exploration tools that can assist with each step of the analysis process.Nevertheless,in recent years,data analysis requirements have changed significantly.With constantly increasing size and types of data to be analyzed,scalability and analysis duration are now among the primary concerns of researchers.Moreover,in order to minimize the analysis cost,businesses are in need of data analysis tools that can be used with limited analytical knowledge.To address these challenges,traditional data exploration tools have evolved within the last few years.In this paper,with an in-depth analysis of an industrial tabular dataset,we identify a set of additional exploratory requirements for large datasets.Later,we present a comprehensive survey of the recent advancements in the emerging field of exploratory data analysis.We investigate 50 academic and non-academic visual data exploration tools with respect to their utility in the six fundamental steps of the exploratory data analysis process.We also examine the extent to which these modern data exploration tools fulfill the additional requirements for analyzing large datasets.Finally,we identify and present a set of research opportunities in the field of visual exploratory data analysis.
文摘Neural attention-based encoders,which effectively attend sentence tokens to their associated context without being restricted by long-term distance or dependency,have demonstrated outstanding performance in embedding sentences into meaningful representations(embeddings).The Universal Sentence Encoder(USE)is one of the most well-recognized deep neural network(DNN)based solutions,which is facilitated with an attention-driven transformer architecture and has been pre-trained on a large number of sentences from the Internet.Besides the fact that USE has been widely used in many downstream applications,including information retrieval(IR),interpreting its complicated internal working mechanism remains challenging.In this work,we present a visual analytics solution towards addressing this challenge.Specifically,focused on semantics and syntactics(concepts and relations)that are critical to domain clinical IR,we designed and developed a visual analytics system,i.e.,USEVis.The system investigates the power of USE in effectively extracting sentences’semantics and syntactics through exploring and interpreting how linguistic properties are captured by attentions.Furthermore,by thoroughly examining and comparing the inherent patterns of these attentions,we are able to exploit attentions to retrieve sentences/documents that have similar semantics or are closely related to a given clinical problem in IR.By collaborating with domain experts,we demonstrate use cases with inspiring findings to validate the contribution of our work and the effectiveness of our system.
基金supported by the National Natural Science Foundation of China(Nos.31270885 and 31471247)the open fund of Functional Oil Laboratory Associated by Oil Crops Research Institute,Chinese Academy of Agricultural Sciences and Infinitus (China) Company Ltd
文摘Non-coding regions are the major component of human genomes and the long non-coding RNA(IncRNA)is a class of pervasive genes located in noncoding regions(Morris and Mattick,2014).IncRNAs play a wide range of regulatory roles in gene transcription,translation,epigenetic modification and protein function by interacting with different types of molecules including DNA,