Abstract
With the migration of the times, the rapid development of Internet technology, mobile intelligent products get a lot of popularity and the number of Internet users is also greatly increased, which make the rapid development of e-commerce in recent years。 E-commerce brings much convenience 。However, consumers can not immediately see the authenticity of the real products to identify the authenticity, which makes the proliferation of fake and shoddy products。 Therefore,it’s important to build the brand rights system to help businesses and consumer maintain their legal rights。
The brand rights system use commodity information such as pictures, the price, the buyer reviews and so on to help users to maintain their legal rights。 Its main function is to collect information, which requires the use of web crawler technology。 A good web crawler algorithm makes the brand rights system not only can accurately locate the commodity information but also have a higher efficiency of information capture。 So the main purpose of this paper is to study the network crawler algorithm and apply it to the brand rights protection system。 The main work of this paper is as follows:
(1)Research and analysis of the general web crawler algorithm, the theme of web crawler algorithm, and the comparison of the two algorithms。
(2)Introduce three algorithms, respectively,which are the crawler algorithm based on the htmlunit, the crawler algorithm based on the httpclient and Taobao API based crawler algorithm。Then the comparison of the three algorithms。
(3)The algorithm which was researched will be applied to the brand rights system。 Then realize the corresponding modules of the system。
To a certain extent, brand right protection system not only protects the rights and interests of the business brand of sellers, but also helps consumers make the right purchase decision and safeguard their own rights and interests。
Keywords: Brand Rights Protection; Data Collection; Web Crawler; Taobao