摘要
通过分析网站日志文件,可以清楚地知道一个网站每天的页面访问量、用户访问量、独立IP数、用户通过什么渠道和设备访问网站等,这样企业就可以通过对网站日志文件进行数据分析进而对网站进行多方面的优化建设。利用Hadoop平台存储并计算海量日志文件,利用Hive进行数据仓库建设和数据分析,使得数据更具有说服力,真正实现了让数据驱动业务,进而驱动公司发展。
By analyzing the web log file,some data of a website daily could be obtained,such as amount of page visit,independent IP numbers,number of user visiting,what channels and devices users access to the website. In this way,the enterprise could analyze the data of the website log file and optimize the website in many aspects. Hadoop platform is used to store and calculate massive log files,the data warehouse construction and data analysis are carried out by using Hive. which would make data more convincing and results more intuitive. It really enables the realizing of data driven-business and promotes the company's development.
出处
《山西电子技术》
2017年第6期71-73,82,共4页
Shanxi Electronic Technology
关键词
大数据平台
日志业务分析
数据仓库
big data platform
log business analysis
visualization
data warehouse