* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download CS245A3
Survey
Document related concepts
Transcript
Active Databases Active Database Systems Ref: Modern Database Systems, Won Kim (ed), Addison Wesley, 1994 (chapter 21). Conventional databases are passive Active databases Monitor situations of interest Trigger a timely response when the situations occur E.g., Inventory control systems monitor the quantity in stock. When the stock falls below a threshold, a reordering activity may be initiated Possible Implementations Check inventory level periodically (not timely) Check the semantics of the condition in every program that updates the inventory database (poor software engineering approach) These methods are not general. 2 A General Approach (Active Databases) Express the derived behavior in rules (conditions ---> actions) The rule can be shared by many application programs Implementation can be optimized Contains an inference engine which: Applies all the rules in the system Matches the condition parts of the rules with the data in the working memory Selects and finds the rule that best matches the conditions 3 A General Approach (Active Databases) The action part of the rule may modify the working memory, which may cause further actions. The cycle continues until no more rules match. on event if condition then action Rules can be triggered by the following events: Database operations Occurrence of database status Transitions between database status 4 Characteristics of Rules Rules are: Defined and stored in the database Evaluated by the database system Subject to: Authorization Concurrency control Recovery Rules can be used to: Enforce integrity constraints Implement triggers and actions Maintain derived data Enforce access constraints Implement version control policies Gather statistics for database reorganization 5 Rule Models And Languages Rules are defined as metadata in the schema together with table, view, and integrity constraints. Rules operations are provided to: Add Drop Modify Rules are structured objects with the components of: Events Conditions Action Special operations of rules are: Fire (triggers a rule) Enable (activates a rule) Disable (deactivates a rule) 6 Triggering an event may be implicit or explicit. Explicit: flexibility in expressing transactions. e.g., Keep Bob and Alice’s salaries the same. If Alice’s salary constraint changes, then change Bob’s too. If Bob’s constraint is violated because of changes by the user transaction, then abort the transaction. Implicit: any change to the database that can cause the condition to become true is treated as a triggering event. Languages vary based on the complexity of specified events, conditions, and actions. 7 Event Specification Explicitly triggered by database modification Relational database define rule monitor new events on insert to employer if... then... where employer is a table of employee information Object-Oriented Database define rules check raises on employee salary raises if... then... where salary raises are a method defined by our objects in an employee class 8 Rules to be triggered by data retrieval: define rule monitor sal access on retrieve salary from employee if... then... 9 Temporal events absolute (8:00 on January 1, 1994) relative (5 seconds after take-off) periodic (17:00 hour every Friday) In object-oriented databases, events can be: generic database operations (retrieve, insert, delete, update) type-specific operations (method invocation) an operation on rule objects transaction operations external events (messages or signals from devices) a composition of events (disjunction, sequence, and repetition) Events are defined to have formal parameters. Salary-raise (e: employee, oldsal: integer, newsal: integer) e of type employee oldsal and newsal of type integer 10 Condition Specification The condition is satisfied by a predicate or query over data in the database. The condition is satisfied if the predicate is true or if the query returns a non-empty answer. Rule condition are arbitrary predicates over database status, thus modified data can be referenced and transition conditions can be specified. define rule on update to if then... monitor raise employee.salary employee.salary is > 1.1 old employee.salary 11 Action Specification Action part of a rule usually performs: inserts deletes updates Data in the working memory based on data matching the rule’s condition define rule favor employee or insert to employee then deletes employee e where e.name = employee.name Whenever a new employee is inserted, its action deletes any existing employees with the same name. Rule actions can also be: arbitrary database operations transaction operations rule operations signals for the occurrence of user-defined events calls to application procedures 12 Event-Condition-Action Binding Database production rule language may have explicitly specific events. There is a need to specify different and varied conditions and actions. The motion of binding is different from Al production rules (e.g. triggering an event of a rule may be parameterized and the parameters may be referenced in the rule’s condition and action.) e.g. If rule is triggered by salary. raise (e: employee; oldsal: integer; newsal: integer), then e in the condition or action refers to the employee object. Oldsal and newsal refer to the intergers bound when events occurred. Composite event allows events (eg. salary raise) to occur one or more times (set of queries) Rules can be triggered by insertions, deletions, and/or updates on a particular table. For example: Create trigger DepDel before deletion on department when department budget<100,000 delete employee where employee.dno=department.dno The rule is triggered before the deletion of a department. The condition (where clause) and action refer to the department being deleted. 13 Key words, old and new, can also be used. define rules on append to then delete form new emps employee employee when employee.name = new.name define rules on update to of 100 then Ave Too Big employee.salary (select ave (salary) from new updated) > rollback Abort the transaction whenever the average of the update exceeds 100. 14 Rule Ordering Choice of which rule to execute when more than one has been triggered is made: arbitrary numeric priorities partially ordered For any 2 rules, one rule can be specified as having higher priority than the other rule, but an ordering is not required. In HiPAC, multiple trigger rules can be executed concurrently. The rules are relatively ordered for serialization (concurrency control). 15 Rule Organization In relational systems Rules are defined in the schema Rules refer to particular tables Rules subject to the same controls as other meta data objects (e.g. views, constraints) If a table dropped, all rules defined for it are no longer operative In HiPAC (object oriented systems) Rules are first class objects Rules are organized in types like other objects Rule types are participants in system hierarchies; they can have attributes in relation to other objects Rules can be included in collections which can be explicitly named or defined by queries r In Flight-rule Where Effective Date (r) After 1/1/90 + Flight rule is rule type + Effective date is an attribute defined for this query + Collections of rules can be selectively activated or deactivated by the enable or disable operation 16 Rule Execution Semantics Rule behavior can be complex and unpredictable so a precise execution semantics is important Using an inference engine is not adequate in a database system Active database system rule processing must integrate with conventional database activities: Queries Modifications Transactions which cause the rules to be triggered and initiate rule processing. Granularity of rule processing Set of tuples firing the rules after modifying each tuple or firing after modifying all the tuples Set of objects firing the rule after the end of a transaction Triggering of more than one rule by the same event Conflict resolution Nested triggering (termination) 17 Error Recovery Cause for generating an error during execution of database rule Data has been deleted Data access privileges have been revoked Dead lock is created from concurrently executing transactions System generates error Rule action has uncovered an error condition Error recovery techniques Delete data and revoke access privilege, invalidating the corresponding rules Abort the current rule processing if error occurred Propagate the failure to its parent which may initiate other subtransactions to repair the error Handle error recovery during system crashes For recoverable events, their occurrences and parameter bindings have to be reliability logged For the decoupled conditions and actions, restart uncommitted transactions 18 Implementation Issues Active databases must provide mechanisms for Event detection and rule triggering Condition testing Rule action execution User development of rule applications 19 Characteristics Of Representative Systems ARIEL (build using the Exodus database tool kit) Curey et.al.1991. A rule manager/rule catalog for handling rule definition and manipulation tasks. A rule execution monitor for maintaining the set of triggered rules and scheduling their execution. A rule execution planner to produce optimized execution strategy. 20 POSTGRES Tuple level processing (run time approach) Places a marker on each tuple for each rule that has a condition matching that tuple When a tuple is modified or retrieved If a tuple has one or more marker on it, then the rule or rules associated with the marker(s) are located and their actions are executed. Marker installed when rules are created Marker deleted when rules are deleted Query rewrite (compiler time approach) A module is added between command parser and the query processor to intercept user command Argument with additional commands reflecting the effect of rules (triggering by the original command) which may trigger other rules Recursively triggering may not terminate Compiled time approach can be more efficient than runtime approach Select which mechanism to use when a rule is created 21 STARBURST (IBM Research, Hass et. al. 1990) Extensibility Use attachment feature to monitor data modification that are of interest to the rules Modifications are stored in a transaction log Rules are processed at the end of a transaction or a user command Trigger rules are indexed according to the priorities Reference to transition tables are implemented Mechanisms are provided for concurrency control, authorization, and crash recovery 22 HiPAC (An Object Oriented Database System), Dayal, U. et al, 1988, SIGMOD Record, Vol. 17, No. 7, pp. 51-70 Rule manager Coordinates rule processing Stores rule objects Implements operations Implements the coupling modes of the execution model. Concurrency control Recovery for rule object Event detector detects different events Bi-directional interaction between application programs and database rule system Application can invoke database operations Rules running inside the database can invoke application operation 23 Rule Programming Support Trace rule execution Display the current set of trigger rules Query and browse the set of rules Cross reference rules and data Activate and deactivate the selected rules for processing the database transactions 24 Rule Termination Impose sufficient syntactic restrictions on rule definitions (limit the expressiveness of the rules) Impose a rule triggering limit. The limit is specified by the use or system default (needs to monitor the number of rules executed during the rule processing). Detect if the same rule is triggered a second time with the same set of parameters 25 Future Directions Support for application development Application developments treat database rules as an assembly language Rules are generated automatically from high level specifications An increase in communication capabilities between database rules and applications Increasing expressive power of rules Improved algorithms Distribution and parallelism Distribution and parallelism Algorithms that guarantee equivalence with centralized rule processing 26 Mediators In The Architecture Of Future Information Systems Reference: Gio Wiederhold, IEEE Computer Magazine, March 1992, pp. 38-49. 27 28 29 Abstraction Functions 30 Mismatches in Data Resources 31 Knowledge and Data Feedback Loops and Their Interaction 32 33 Mediators A mediator is a software module that exploits encoded knowledge about certain sets of subsets of data to create information for a higher layer of applications. Requirements: small and simple inspectable by potential users (for selection) meta mediator (e.g., catalog list of mediators and data resource) Given a data source, locate a knowledgeable mediator; for a desirable mediator, locate an adequate data resource. 34 35 Mediator Interfaces User’s workstation interface to the mediators A language is needed to provide flexibility, composability, iteration, and evaluation in this interface Descriptive but static interface specification unable to support the variety of control and information flow The language should provide incremental growth and support new functionality as additional mediator joins the network Machine and communication friendly interfaces Support high level functions for decision making 36 Three Layers of a Mediator Architecture 37 Interfaces for Information Flow 38 39 40 41 42 43 44 Interface of Mediator to the Database Management Systems SQL RDA SELECTION and VIEW are good starting points A mediator combines information from multiple databases, can use its knowledge to control the merging process, specifying relational operations directly. New access languages are needed to manage sensor-based and simulation process (e.g., abstraction, fusion). Mediator can create objects for a wide variety of orthogonal views. Making complex objects persistent. 45 Sharing Of Mediator Modules Process tasks needed within a mediator: selection, fusion, reduction, abstraction, and generalization effective use of mediator modules - sharing use for different applications e.g., mediator for inflation adjustment can be used by many applications mediator understands postal codes can be used by post offices, express delivery services, and corporate mailrooms. 46 47 Distribution Of Mediators Locality of maintenance Economy for access Autonomy Reasons For Mediators Not Attached To DB: Mediator contains knowledge beyond the scope of DB e.g., factory production control system cannot expect to foresee all the strategies used of the collected information Concept of abstraction is not part of DB technology today Intelligent processing data will involve dealing with uncertainty Mediator may access multiple DB and combine disjoint sets of data prior to analysis and reduction 48 49 50 51 52 53 54 55 Active Infomediation Services 56 Layered Information Architecture & Processes 57