Download File Processing

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Design of the FAT file system wikipedia , lookup

Clusterpoint wikipedia , lookup

Data analysis wikipedia , lookup

Lustre (file system) wikipedia , lookup

File system wikipedia , lookup

Asynchronous I/O wikipedia , lookup

File Allocation Table wikipedia , lookup

Data vault modeling wikipedia , lookup

Information privacy law wikipedia , lookup

Business intelligence wikipedia , lookup

XFS wikipedia , lookup

Open data in the United Kingdom wikipedia , lookup

Database model wikipedia , lookup

Search engine indexing wikipedia , lookup

Computer file wikipedia , lookup

Files-11 wikipedia , lookup

File locking wikipedia , lookup

Extensible Storage Engine wikipedia , lookup

Transcript
DBMS Physical Design
Physical design is
concerned with the
placement of data and
selection of access
methods for efficiency
and ongoing maintenance
File Organization
Physical arrangement of the records of a file on
secondary storage devices
•Sequential
•Linked List
•Indexed
•Hashed
Sequential File
Sequential file sorted in alphabetical order.
Sequential files are usually sorted in ID
sequence order to facilitate batch processing.
a ddr
00
01
02
03
Ayers
Buckley
Daley
Dejoie
ACCT
MGT
ACCT
MGT
04
Kenderdine
MKT
05
Linn
FIN
06
Lusch
MKT
07
Price
MGT
08
Razook
MKT
09
Schwarzkopf
MGT
Sequential File Processing
Old Master
Process
New Master
Transaction
Sequential files must be recopied from the point of
any insertion or deletion to the end of the file. They
are commonly used in batch processing where a
new master file will be generated each time the file
is updated.
Linked List
Linked list to sort data alphabetically within department.
An external reference must point to the start record (05).
a ddr
00
Price
MGT
pointe r
01
01
02
03
Schwarzkopf
Kenderdine
Lusch
MGT
MKT
MKT
02
03
08
04
Buckley
MGT
09
05
Ayers
ACCT
06
06
Daley
ACCT
07
07
Linn
FIN
04
08
09
Razook
Dejoie
MKT
MGT
##
00
Linked List File Processing
The next record in a linked list is found at the address
stored in the record. Records are added at any
location in the DASD and pointers adjusted to include
them. Deletions are not erased, but pointers
changed to omit the deleted record.
Indexed File
(sequential index)
Index to access data by
department abbreviation.
addr
00
01
02
03
04
Price
Schwarzkopf
Kenderdine
Lusch
Buckley
MGT
MGT
MKT
MKT
MGT
ACCT
ACCT
FIN
MGT
MGT
MGT
MKT
Ayers
Daley
Linn
Razook
Dejoie
00
01
02
00
01
04
03
ACCT
ACCT
FIN
MKT
MGT
Indexed File Processing
Index
Index
Data File
When a record is inserted or deleted in a file the data
can be added at any location in the data file. Each index
must also be updated to reflect the change. For a
simple sequential index this may mean rewriting the
index for each insertion.
Segmented Index
addr
100
101
102
103
200
201
202
203
204
205
206
Root
Nodes
Leaf
Index
pointer
101
Kenderdine
200
Buckley
203
Lusch
205
Schwarzkopf
00
Ayers
01
Daley
00
Price
02
Linn
02
Kenderdine
01
Schwarzkopf
5
Van Horn
pointer
102
Razook
201
Dejoie
202
206
04
Buckley
04
Dejoie
03
Razook
03
Lusch
Data
addr
00
Price
MGT
Ayers
ACCT
01
Schwarzkopf
MGT
Daley
ACCT
02
Kenderdine
MKT
Linn
FIN
03
Lusch
MKT
Razook
MKT
04
Buckley
MGT
Dejoie
MGT
05
Van Horn
MGT
pointer
103
204
201
204
205
202
203
206
Indexed File Processing
(segmented index)
Index
Data File
Data can be inserted or deleted at any location in the
data file. The index(es) must be updated for each
change, but only the affected segments need to be
rewritten.
Physical Design
Volume and Usage analysis
Distribution Strategy
File Organizations
Indexes and Access Methods
Integrity Constraints
Physical Design Issues
Size
Speed of access
Speed of update
Growth issues: performance and
degradation
Security
Maintenance
DBMS Organization
Structured
 Relationships: physical
address pointers
 Links generated when data is
entered
 Efficient but not flexible
 Ad hoc design
 Query dependent on specific
DBMS (may support SQL)
Relational
 Relationships: logical data
references
 Links generated when data is
retrieved
 Flexible but not efficient
 Theoretical base
 SQL
DBMS Technology
• CPU
• Components
• Operation
• DASD
• Technology
• Organization
• Data Transfer
• Access methods
Physical Design
Data Distribution
Centralized
Partitioned
Horizontal
Vertical
Replicated
Hybrid