Development of Web-Based Software for Integrating and Analyzing Bamboo Information (Graduation Thesis)

 2021-04-10 10:04

Abstract

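Bamboo is closely tied to people's daily lives and to economic production. In recent years the bamboo industry has grown into a large industry: more than 35 million people worldwide work in bamboo production and research, and its annual output value is close to 5 billion US dollars. An industry of this size depends on abundant natural resources and advanced science and technology, and agriculture and forestry also urgently need informatization. Combining database management systems (DBMS), Internet technology (IT), and other advanced technologies to turn traditional bamboo production and research into innovative, comprehensive, and systematic modern bamboo research has therefore become a major trend. The program described in this thesis is written in Python, a programming language that has been popular in recent years, and uses search engine technology, which is now widely applied across industries, to build a system that supports search queries and information browsing.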

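The work covers the following. First, a crawler program is written to fetch news, introductions, reference materials, and other content from China Bamboo Network (bamboo.forestry.gov.cn), a site under China Forestry Network (www.forestry.gov.cn), the official website of the National Forestry and Grassland Administration, and to store that content in a local database. The crawled content is then integrated with other collected bamboo-related information to build a multifunctional system named "Bamboo Encyclopedia" that crawls the website and updates its information in real time. The system supports keyword queries over the crawled data as well as create, delete, update, and query operations on the database.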
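The storage and keyword-query steps described above can be sketched roughly as follows. This is a minimal illustration using Python's built-in `sqlite3` module, not the thesis's actual implementation: the table name `articles`, its columns, the function names, and the `example.invalid` URLs are all hypothetical, and the database engine the author used is not stated in this excerpt.

```python
import sqlite3


def open_db(path=":memory:"):
    """Open a database and create the (hypothetical) articles table if missing."""
    conn = sqlite3.connect(path)
    conn.execute(
        """CREATE TABLE IF NOT EXISTS articles (
               url    TEXT PRIMARY KEY,   -- one row per crawled page
               title  TEXT NOT NULL,
               source TEXT,
               clicks INTEGER DEFAULT 0,
               body   TEXT
           )"""
    )
    return conn


def save_article(conn, url, title, source="", clicks=0, body=""):
    """Insert a crawled article; re-crawling the same URL overwrites the old row."""
    conn.execute(
        "INSERT OR REPLACE INTO articles (url, title, source, clicks, body) "
        "VALUES (?, ?, ?, ?, ?)",
        (url, title, source, clicks, body),
    )
    conn.commit()


def search_articles(conn, keyword):
    """Return (url, title, source, clicks) rows whose title or body contains keyword."""
    # Note: % and _ inside keyword act as LIKE wildcards in this simple sketch.
    pattern = f"%{keyword}%"
    return conn.execute(
        "SELECT url, title, source, clicks FROM articles "
        "WHERE title LIKE ? OR body LIKE ? ORDER BY clicks DESC",
        (pattern, pattern),
    ).fetchall()


def delete_article(conn, url):
    """Remove one article by its URL."""
    conn.execute("DELETE FROM articles WHERE url = ?", (url,))
    conn.commit()


if __name__ == "__main__":
    db = open_db()
    save_article(db, "http://example.invalid/a", "Moso bamboo cultivation",
                 "Example source", 10, "Notes on planting.")
    for row in search_articles(db, "bamboo"):
        print(row)
```

Using the URL as the primary key keeps repeated crawls from creating duplicate rows, which is one simple way to support the "crawl and update" behavior; a real system might instead track a last-modified time per page.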

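The system aims to give bamboo researchers a fast and convenient tool. Within the system, researchers can browse the URL, source, click count, and other information for every article on the website, which ties bamboo research to agricultural and forestry informatization. Information that was arranged in a complicated way on the website and was hard to find becomes usable, the sharing capability of the database management system enables resource sharing, and the openness of the Python language allows the system to be applied widely in agricultural and forestry research.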

Keywords: bamboo; Python; web crawler; search engine; information system

Development of Web-Based Software for Bamboo Information Integration and Analysis

ABSTRACT

Bamboo is closely related to people's livelihood and to economic production. In recent years the bamboo industry has grown into a very large industry, with more than 35 million people worldwide engaged in bamboo production and research and an annual output value of nearly 5 billion US dollars. An industry of this scale must be backed by abundant natural resources and advanced science and technology, and there is also an urgent need for informatization in agriculture and forestry. Combining database management systems (DBMS), Internet technology (IT), and other advanced technologies to transform traditional bamboo production and research into innovative, comprehensive, and systematic modern bamboo research has become a clear trend. The program presented here is written in Python, a language that has become popular in recent years, and applies search engine technology, which is widely used across industries, to build a system that offers search queries, information browsing, and related functions.

This thesis focuses on the following work. A crawler is written to collect news, introductions, reference materials, and other content from China Bamboo Network (bamboo.forestry.gov.cn), which belongs to China Forestry Network (www.forestry.gov.cn), the official website of the National Forestry and Grassland Administration, and to save that content in a local database. The crawled content is integrated with other collected bamboo-related information into a multifunctional system called "Bamboo Encyclopedia", which crawls the website and updates its information in real time. The system supports keyword queries over the crawled data and can add, delete, modify, and query records in the database.
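One part of such a crawler, collecting the article links found on a listing page, might look like the sketch below. It is an illustration under stated assumptions rather than the author's code: the class and function names are hypothetical, only the standard library is used, the page is passed in as a string (downloading it is left out), and the real markup of bamboo.forestry.gov.cn is not known from this excerpt, so the sample HTML is invented.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collect the href of every <a> tag, resolved against a base URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if href:
            # Turn relative links such as /news/1.html into absolute URLs.
            self.links.append(urljoin(self.base_url, href))


def extract_same_site_links(html, base_url):
    """Return unique links on the same host as base_url, in page order."""
    collector = LinkCollector(base_url)
    collector.feed(html)
    host = urlparse(base_url).netloc
    seen = set()
    result = []
    for link in collector.links:
        if urlparse(link).netloc == host and link not in seen:
            seen.add(link)
            result.append(link)
    return result


if __name__ == "__main__":
    # Invented sample markup; the real site's structure is an assumption.
    sample = (
        '<a href="/news/1.html">Article 1</a>'
        '<a href="http://other.example/x">External</a>'
        '<a href="/news/1.html">Article 1 again</a>'
    )
    for url in extract_same_site_links(
        sample, "http://bamboo.forestry.gov.cn/index.html"
    ):
        print(url)
```

Restricting results to the starting host keeps the crawl from wandering onto unrelated sites, and the de-duplication step avoids fetching the same article twice.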

The system is intended to give bamboo researchers a quick and convenient experience. Researchers can use it to browse the URL, source, number of clicks, and other information for all articles on the website, bringing bamboo research together with agricultural and forestry informatization. Information resources that were arranged in a complicated way on the website and were difficult to find can be used effectively, resources can be shared through the database management system, and the openness of the Python language means the system can be applied widely in agricultural and forestry research.

Key words: Bamboo; Python; Web crawler; Search engine; Information system

Contents

1 Introduction
    1.1 Research Background and Significance
        1.1.1 Significance of Bamboo Research
        1.1.2 Background of Crawler Technology
    1.2 Overview of Bamboo Materials
        1.2.1 Traditional Uses
        1.2.2 Modern Uses
    1.3 Research Status in China and Abroad
        1.3.1 Research Status in China
        1.3.2 Research Status Abroad
    1.4 Main Work of This Thesis
2 Core Technologies
    2.1 Introduction to Python
        2.1.1 Features
        2.1.2 Development History
        2.1.3 Design Positioning
    2.2 Introduction to Web Crawler Technology
3 Design Approach
    3.1 User Requirements Analysis
        3.1.1 Business Process Analysis
        3.1.2 Database Analysis
    3.2 Building the Database
    3.3 Fetching URLs and Writing Them to the Database
    3.4 Main Program
        3.4.1 Search Module
        3.4.2 Browsing Module
4 System Architecture and Program Overview
    4.1 System Architecture
