Abstract
As internet technology has developed, selecting films from ranking data alone no longer meets consumers' needs. This paper implements a Python crawler for the Douban TOP250 film list: it downloads pages with Requests, parses them with BeautifulSoup, and visualizes the results with pyecharts and related tools. Presenting the data as charts lets consumers see the characteristics of popular films more clearly and gives them a reference when choosing what to watch. The visual analysis finds no positive correlation between a film's rating and its number of reviews.
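The pipeline summarized in the abstract (download, parse, visualize, check correlation) might look roughly like the sketch below. It is not the paper's code. The URL, the `start` paging parameter, the CSS selectors, the "人评价" vote-count pattern, the User-Agent header, and all function names are assumptions made for illustration, and the real page markup may differ. Pearson's coefficient stands in here as one plausible way to quantify the relationship between rating and review count; the abstract does not say which measure the author used.

```python
import math
import re

# Assumed list URL; pages are assumed to be selected with a `start` offset.
BASE_URL = "https://movie.douban.com/top250"


def fetch_page(start: int) -> str:
    """Download one list page with Requests (hypothetical helper)."""
    import requests  # third-party; imported lazily so the rest runs without it

    resp = requests.get(
        BASE_URL,
        params={"start": start},
        headers={"User-Agent": "Mozilla/5.0"},  # assumed to be needed
        timeout=10,
    )
    resp.raise_for_status()
    return resp.text


def parse_movies(html: str) -> list[dict]:
    """Extract title, rating and review count from one page.

    The selectors and the vote-count pattern are assumptions about the markup.
    """
    from bs4 import BeautifulSoup  # third-party; imported lazily

    soup = BeautifulSoup(html, "html.parser")
    movies = []
    for item in soup.select("div.item"):
        title = item.select_one("span.title")
        rating = item.select_one("span.rating_num")
        votes = re.search(r"(\d+)人评价", item.get_text())
        if title and rating and votes:
            movies.append({
                "title": title.get_text(strip=True),
                "rating": float(rating.get_text(strip=True)),
                "votes": int(votes.group(1)),
            })
    return movies


def pearson(xs: list[float], ys: list[float]) -> float:
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


def render_scatter(movies: list[dict], path: str = "rating_vs_votes.html") -> None:
    """Plot review count against rating with pyecharts (output path is arbitrary)."""
    from pyecharts import options as opts  # third-party; imported lazily
    from pyecharts.charts import Scatter

    chart = Scatter()
    chart.add_xaxis([m["rating"] for m in movies])
    chart.add_yaxis("reviews", [m["votes"] for m in movies])
    # Treat the x axis as numeric rather than categorical.
    chart.set_global_opts(xaxis_opts=opts.AxisOpts(type_="value"))
    chart.render(path)
```

Under these assumptions, a full run would call `fetch_page` for each offset from 0 to 225 in steps of 25, concatenate the `parse_movies` results, and pass the rating and vote columns to `pearson`. A coefficient near zero or below would be consistent with the abstract's finding.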
Author
WANG Chen (王晨), Xi'an Peihua University, Xi'an 710125, China
Source
Modern Information Technology (《现代信息科技》), 2024, No. 16, pp. 93-97 (5 pages)