摘要
为了快速地获取职位信息,根据"前程无忧"的网页特点,设计了3种基于Python的爬虫程序,进行职位相关数据的抓取。通过对关键字的提取,匹配符合条件的职位信息,并且抓取相关内容存入Excel文件中,便于寻找相关职位信息及具体要求。实验结果表明:该程序能够快速且大量地抓取相关职位信息,针对性强,简单易读,有利于对职位信息的进一步挖掘及分析。
In order to obtain job information quickly,according to the characteristics of web pages with"Worry-free Future",three kinds of Python-based crawler programs are designed to capture job-related data. Through the extraction of the keywords,the job information is matched,and the relevant content is captured in an Excel file,so that the related job information and specific requirements can be easily found. The experimental results show that this program can quickly and massively capture relevant job information,and it is highly targeted and easy to read,which is conducive to further mining and analysis of job information.
作者
崔玉娇
孙结冰
祁晓波
凌强
朱勇
CUI Yujiao;SUN Jiebing;QI Xiaobo;LING Qiang;ZHU Yong(School of Electronic Engineering,Heilongjiang University,Harbin 150080,China)
出处
《无线电通信技术》
2018年第4期416-419,共4页
Radio Communications Technology