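Abstract: With the rapid development of computer networks, the heterogeneity of information resources has become increasingly prominent. To enable sharing among heterogeneous data sources, the data integration problem must be solved first. Mapping is a fundamental problem in data integration, and it still suffers from problems such as inefficient deduplication and low accuracy. Relational algebra is the theoretical foundation of relational database design, so it has many intrinsic connections with data integration. This thesis introduces the basics of relational algebra and data integration and, building on the connections between the two, focuses on the application of relational algebra in data integration. Using relational algebra theory, it studies methods for establishing semantic mappings between source schemas and the intermediate schema, and in the process proposes methods for resolving semantic conflicts in schema matching.
Keywords: relational algebra; data integration; schema matching; schema mapping; semantic conflict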
Title: Application of Relational Algebra in Data Integration
Abstract
With the rapid development of computer networks, the heterogeneity of
information resources is becoming increasingly prominent. To share data
among heterogeneous data sources, the data integration problem must be
solved first. Mapping is one of the basic problems in data integration,
yet it still suffers from shortcomings such as inefficient duplicate
elimination and low accuracy. Relational algebra is the theoretical basis
of relational database design, so it is closely connected to data
integration. This thesis describes the basic theory of relational algebra
and data integration and, drawing on the links between the two, focuses on
the application of relational algebra in data integration. Using relational
algebra theory, it studies methods for establishing semantic mappings
between source schemas and the intermediate schema, and in the process
proposes methods for resolving semantic conflicts in schema matching.
Keywords: Relational algebra; Data integration; Schema matching;
Schema mapping; Semantic conflict