This paper establishes a new framework for assessing multimodal statistical causality between cryptocurrency market(cryptomarket)sentiment and cryptocurrency price processes.In order to achieve this,we present an effi...This paper establishes a new framework for assessing multimodal statistical causality between cryptocurrency market(cryptomarket)sentiment and cryptocurrency price processes.In order to achieve this,we present an efficient algorithm for multimodal statistical causality analysis based on Multiple-Output Gaussian Processes.Signals from different information sources(modalities)are jointly modelled as a Multiple-Output Gaussian Process,and then using a novel approach to statistical causality based on Gaussian Processes(GPs),we study linear and non-linear causal effects between the different modalities.We demonstrate the effectiveness of our approach in a machine learning application by studying the relationship between cryptocurrency spot price dynamics and sentiment time-series data specific to the crypto sector,which we conjecture influences retail investor behaviour.The investor sentiment is extracted from cryptomarket news data via methods developed in the area of statistical machine learning known as Natural Language Processing(NLP).To capture sentiment,we present a novel framework for text to time-series embedding,which we then use to construct a sentiment index from publicly available news articles.We conduct a statistical analysis of our sentiment statistical index model and compare it to alternative state-of-the-art sentiment models popular in the NLP literature.In regard to the multimodal causality,the investor sentiment is our primary modality of exploration,in addition to price and a blockchain technologyrelated indicator(hash rate).Analysis shows that our approach is effective in modelling causal structures of variable degree of complexity between heterogeneous data sources and illustrates the impact that certain modelling choices for the different modalities can have on detecting causality.A solid understanding of these factors is necessary to gauge cryptocurrency adoption by retail investors and provide sentiment-and technologybased insights about the cryptocurrency market dynamics.展开更多
基金Ioannis Chalkiadakis acknowledges the support of Heriot-Watt University through a James-Watt scholarship while undertaking this work.
文摘This paper establishes a new framework for assessing multimodal statistical causality between cryptocurrency market(cryptomarket)sentiment and cryptocurrency price processes.In order to achieve this,we present an efficient algorithm for multimodal statistical causality analysis based on Multiple-Output Gaussian Processes.Signals from different information sources(modalities)are jointly modelled as a Multiple-Output Gaussian Process,and then using a novel approach to statistical causality based on Gaussian Processes(GPs),we study linear and non-linear causal effects between the different modalities.We demonstrate the effectiveness of our approach in a machine learning application by studying the relationship between cryptocurrency spot price dynamics and sentiment time-series data specific to the crypto sector,which we conjecture influences retail investor behaviour.The investor sentiment is extracted from cryptomarket news data via methods developed in the area of statistical machine learning known as Natural Language Processing(NLP).To capture sentiment,we present a novel framework for text to time-series embedding,which we then use to construct a sentiment index from publicly available news articles.We conduct a statistical analysis of our sentiment statistical index model and compare it to alternative state-of-the-art sentiment models popular in the NLP literature.In regard to the multimodal causality,the investor sentiment is our primary modality of exploration,in addition to price and a blockchain technologyrelated indicator(hash rate).Analysis shows that our approach is effective in modelling causal structures of variable degree of complexity between heterogeneous data sources and illustrates the impact that certain modelling choices for the different modalities can have on detecting causality.A solid understanding of these factors is necessary to gauge cryptocurrency adoption by retail investors and provide sentiment-and technologybased insights about the cryptocurrency market dynamics.