Multiversion-based view maintenance over distributed data sources

Cited by: 20
Authors
Chen, ST [1]
Liu, B [1]
Rundensteiner, EA [1]
Affiliation
[1] Worcester Polytech Inst, Worcester, MA 01609 USA
Source
ACM TRANSACTIONS ON DATABASE SYSTEMS | 2004 / Vol. 29 / No. 4
Keywords
view maintenance; transaction processing; theory; algorithms; experimentation;
DOI
10.1145/1042046.1042049
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Materialized views can be maintained by submitting maintenance queries to the data sources. However, the query results may be erroneous due to concurrent source updates. State-of-the-art maintenance strategies typically apply compensations to resolve such conflicts and assume all source schemata remain stable over time. In a loosely coupled dynamic environment, the sources may autonomously change not only their data but also their schema or semantics. Consequently, either the maintenance or the compensation queries may be broken. Unlike compensation-based approaches found in the literature, we instead model the complete materialized view maintenance process as a view maintenance transaction (VM_Transaction). This way, the anomaly problem can be rephrased as the serializability of VM_Transactions. To achieve VM_Transaction serializability, we propose a multiversion concurrency control algorithm, called TxnWrap, which is shown to be the appropriate design for loosely coupled environments with autonomous data sources. TxnWrap is complementary to the maintenance algorithms proposed in the literature, since it removes concurrency issues from consideration, allowing the designer to focus on the maintenance logic. We show several optimizations of TxnWrap, in particular, (1) space optimizations on versioned data materialization and (2) parallel maintenance scheduling. With these optimizations, TxnWrap even outperforms state-of-the-art view maintenance solutions in terms of refresh time. Further, several design choices of TxnWrap are studied, each having its respective advantages for certain environmental settings. A correctness proof based on transaction theory for TxnWrap is also provided. Last, we have implemented TxnWrap. The experimental results confirm that TxnWrap achieves predictable performance under a varying rate of concurrency.
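The general idea behind multiversion reads, as opposed to compensating for concurrent updates after the fact, can be illustrated with a minimal sketch. This is not TxnWrap's algorithm as published; the class `VersionedSource` and its methods are hypothetical names introduced for illustration only, assuming a single in-memory source with a global version counter. A maintenance query pinned to the version of the update it is processing sees the same state no matter how many later updates arrive.

```python
from collections import defaultdict


class VersionedSource:
    """Toy multiversion store (hypothetical): every write is kept and
    tagged with a monotonically increasing global version number."""

    def __init__(self):
        # key -> list of (version, value), in ascending version order
        self._history = defaultdict(list)
        self.version = 0

    def write(self, key, value):
        """Record a new version of `key` and return its version number."""
        self.version += 1
        self._history[key].append((self.version, value))
        return self.version

    def read_as_of(self, key, version):
        """Return the value of `key` visible at `version`, or None if the
        key had not been written yet at that version."""
        result = None
        for v, value in self._history.get(key, []):
            if v > version:
                break
            result = value
        return result


source = VersionedSource()
v1 = source.write("price", 10)   # the update being maintained
v2 = source.write("price", 25)   # a concurrent update arriving later

# A maintenance query pinned to v1 is unaffected by the later write.
pinned = source.read_as_of("price", v1)
latest = source.read_as_of("price", source.version)
print(pinned, latest)
```

Reading the latest state while maintaining the first update would mix in the effect of the second update; the pinned read avoids that without any compensation query.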
Pages: 675-709
Page count: 35