In this paper we investigate the effectiveness of ensemble-based learners for web robot session identification from web server logs. We also perform multi fold robot session labeling to improve the performance of lear...In this paper we investigate the effectiveness of ensemble-based learners for web robot session identification from web server logs. We also perform multi fold robot session labeling to improve the performance of learner. We conduct a comparative study for various ensemble methods (Bagging, Boosting, and Voting) with simple classifiers in perspective of classification. We also evaluate the effectiveness of these classifiers (both ensemble and simple) on five different data sets of varying session length. Presently the results of web server log analyzers are not very much reliable because the input log files are highly inflated by sessions of automated web traverse software’s, known as web robots. Presence of web robots access traffic entries in web server log repositories imposes a great challenge to extract any actionable and usable knowledge about browsing behavior of actual visitors. So web robots sessions need accurate and fast detection from web server log repositories to extract knowledge about genuine visitors and to produce correct results of log analyzers.展开更多
采用ASP.NET技术设计了基于B/S模式的质量管理信息系统平台.在Internet客户端采用Visual Studio C#作为开发工具,运用ASP发布动态网页,SQL Server作为后台数据库服务器.系统构建了质量检验、质量评价、质量体系和文档管理几大功能子模块...采用ASP.NET技术设计了基于B/S模式的质量管理信息系统平台.在Internet客户端采用Visual Studio C#作为开发工具,运用ASP发布动态网页,SQL Server作为后台数据库服务器.系统构建了质量检验、质量评价、质量体系和文档管理几大功能子模块,实现了离散企业质量信息计算机管理的网络化和快捷化;嵌入了质量管理信息系统的知识库开发模块,为后续知识库创建、维护和学习等提供开放接口.展开更多
文摘In this paper we investigate the effectiveness of ensemble-based learners for web robot session identification from web server logs. We also perform multi fold robot session labeling to improve the performance of learner. We conduct a comparative study for various ensemble methods (Bagging, Boosting, and Voting) with simple classifiers in perspective of classification. We also evaluate the effectiveness of these classifiers (both ensemble and simple) on five different data sets of varying session length. Presently the results of web server log analyzers are not very much reliable because the input log files are highly inflated by sessions of automated web traverse software’s, known as web robots. Presence of web robots access traffic entries in web server log repositories imposes a great challenge to extract any actionable and usable knowledge about browsing behavior of actual visitors. So web robots sessions need accurate and fast detection from web server log repositories to extract knowledge about genuine visitors and to produce correct results of log analyzers.
文摘采用ASP.NET技术设计了基于B/S模式的质量管理信息系统平台.在Internet客户端采用Visual Studio C#作为开发工具,运用ASP发布动态网页,SQL Server作为后台数据库服务器.系统构建了质量检验、质量评价、质量体系和文档管理几大功能子模块,实现了离散企业质量信息计算机管理的网络化和快捷化;嵌入了质量管理信息系统的知识库开发模块,为后续知识库创建、维护和学习等提供开放接口.