网络爬虫技术在品牌维权系统中的应用
时间:2023-01-28 10:33 来源:毕业论文 作者:毕业论文 点击:次
摘 要随着时代的迁移,互联网技术迅速发展,移动智能产品得到大量的普及,网民数量也大幅度增长,这些因素使得近年来电子商务快速发展。在电子商务带来便利的同时,因为消费者无法在线立即看到实物辨别真伪,这就使得假冒伪劣产品泛滥不止。因此构建品牌维权系统,帮助用户维权商家打假变得至关重要。87240 本文提出的品牌维权系统利用商品信息,比如图片、价格、买家评论等,帮助用户进行维权打假。其主要的功能是信息采集,这就要需要用到网络爬虫技术。一个好的网络爬虫算法使品牌维权系统不仅能精确定位商品信息而且有着较高的信息抓取效率。因此本文的主要目的是研究网络爬虫算法并且将其运用到品牌维权系统中。本文的主要工作如下: 1、研究和分析通用网络爬虫算法、主题网络爬虫算法,并对两种算法进行比较。 2、针对淘宝网介绍了三种算法,分别为基于htmlunit的爬虫算法、基于httpclient的爬虫算法以及基于淘宝API的爬虫算法,并对三者进行比较。 3、根据研究的算法将其运用到品牌维权系统中,并实现系统相应的模块。 品牌维权系统的出现在一定程度上维护了各商家品牌的权益,也在一定程度上帮助了消费者做出正确的购买决策,维护自身的权益。 关键词:品牌维权;数据采集;网络爬虫;淘宝 Abstract With the migration of the times, the rapid development of Internet technology, mobile intelligent products get a lot of popularity and the number of Internet users is also greatly increased, which make the rapid development of e-commerce in recent years。 E-commerce brings much convenience 。However, consumers can not immediately see the authenticity of the real products to identify the authenticity, which makes the proliferation of fake and shoddy products。 Therefore,it’s important to build the brand rights system to help businesses and consumer maintain their legal rights。 The brand rights system use commodity information such as pictures, the price, the buyer reviews and so on to help users to maintain their legal rights。 Its main function is to collect information, which requires the use of web crawler technology。 A good web crawler algorithm makes the brand rights system not only can accurately locate the commodity information but also have a higher efficiency of information capture。 So the main purpose of this paper is to study the network crawler algorithm and apply it to the brand rights protection system。 The main work of this paper is as follows: (1)Research and analysis of the general web crawler algorithm, the theme of web crawler algorithm, and the comparison of the two algorithms。 (2)Introduce three algorithms, respectively,which are the crawler algorithm based on the htmlunit, the crawler algorithm based on the httpclient and Taobao API based crawler algorithm。Then the comparison of the three algorithms。 (3)The algorithm which was researched will be applied to the brand rights system。 Then realize the corresponding modules of the system。 To a certain extent, brand right protection system not only protects the rights and interests of the business brand of sellers, but also helps consumers make the right purchase decision and safeguard their own rights and interests。 Keywords: Brand Rights Protection; Data Collection; Web Crawler; Taobao 目 录 第一章 绪论 1 1。1课题研究的背景及意义 1 1。1。1研究背景 1 1。1。2研究意义 1 1。2 国内外研究现状 2 1。2。1 通用网络爬虫研究现状 2 1。2。2 主题网络爬虫研究现状 3 1。3 本文研究内容与组织结构 4 (责任编辑:qin) |