摘要
随着“互联网+”和大数据时代的到来,网络上充斥着各种各样的数据,过滤并获取有用的数据在当今环境下至关重要。文章提出一种基于Python和Requests模块的快速获取网页数据的方法,使用该方法可以获取解析前的网页源代码文本和图片数据,并保存为本地文件,为之后的数据分析和深入学习大数据技术奠定基础。实验结果表明,该方法步骤和代码编写简单易学,运行结果较好,具有一定的实用性。
With the arrival of the“Internet+”and big data era,the network is full of all kinds of data.Filtering and obtaining useful data is crucial in today's environment.This paper proposes a method to quickly acquire web data based on Python and Requests modules.Using this method,you can obtain the text and image data of the web source code before parsing,and save them as local files,laying the foundation for later data analysis and in-depth study of big data technology.The experimental results show that the steps and coding of this method are easy to learn,the running results are good,and it has certain practicability.
作者
姜庆玲
张樊
JIANG Qingling;ZHANG Fan(Wuchang Institute of Technology,Wuhan 430065,China)
出处
《现代信息科技》
2023年第16期100-103,108,共5页
Modern Information Technology
基金
基于计算机设计大赛视角的计算机类专业应用型创新能力培养的研究与实践(2023JY11)。