摘要在目前各种社交网络大规模兴起的时代,网络上的信息数据量飞快增长,这些数据量带来很多可以挖掘的信息,并可以借助可视化的技术表现出来。
本论文是从过程以及方法结合起来的角度来阐述,即获取数据—导入数据—分析数据这一整体过程进行了完整的阐述。此次设计方案是以QQ空间里面的自我网络分析为中心,预先在网站后台分析json数据,然后设计出完整的数据库模型,使用java爬虫框架Webmagic编写爬虫程序,使用爬虫下载得到数据存入Mysql,这一部分是数据获取;然后思考结合Jfreechart以及E-charts接受的数据形式,对数据进行了一定的预处理,将数据导入并生成图形之,以个人为中心,并从人际关系的角度来分析每个图形,最后得到与个人相关的人际网络以及个人习惯。25871
关键词 社交网络数据 过程 数据获取 数据可视化 毕业论文设计说明书外文摘要
Title Research and application of social data visualization
Abstract
With the rapid development of social networks,the amount of information on the internet is growing rapidly. A lot of useful information can be found using data visualization technologies with the large amount of data from the social networks .
In this paper,I just combine the procedures and means to expound my own ideas on this project whose procedures include the access of data,importing data and the analysis of the acquired data. As for the schema of this project aimed at the analysis of the social networks with the center of mysef,my general idea should be pided into several steps.
First I have to complete designing the database model on the basis of the data who takes the format of json from the website background ,then a spider using java with the frame of webmagic must be designed and started to download large amount of data from the website of QQzone,what is next must be to get the data strored in the database which I just design.Up to now I have the acess of data,for the target of displaying the data in the form of several kinds of charts,the pretreatment of the data set should be done to get a new data object for Jfreechart and Echarts to use to generate different forms of charts.With these charts,I can analyse the charts to get the cincept of the social relations networks setting myself centered.
Keywords social network data data access of data data acquisition
目 次
1 绪论 1
1.1 社交网络大数据背景 1
1.2 社交网络数据可视化的概念和意义 1
1.3 社交网络数据可视化的现状和发展 1
1.4 本文的主要工作及章节安排 2
2 数据库设计 4
2.1 数据库需求分析 4
2.2 E-R图 7
2.3 关系设计 7
3 爬虫设计 10
3.1 webmagic爬虫框架介绍 10
3.2 PageProcessor的定制 11
3.3 Pipeline的定制 16
3.4 dao文件的设计 18
4 数据的预处理 21
4.1 可视化工具的需求分析 21
4.2 数据预处理的设计和计算 23
5 Jfreechart的使用以及图形设计 27
5.1 Jfreechart的介绍以及配置安装 27
5.2 Jfreechart的核心类库以及图形设计的基本步骤 27
5.3 Jfreechart中数据的导入 28