* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download lec 13 IT
Survey
Document related concepts
Data center wikipedia , lookup
Data analysis wikipedia , lookup
Concurrency control wikipedia , lookup
Versant Object Database wikipedia , lookup
3D optical data storage wikipedia , lookup
Entity–attribute–value model wikipedia , lookup
Data vault modeling wikipedia , lookup
Information privacy law wikipedia , lookup
Business intelligence wikipedia , lookup
Open data in the United Kingdom wikipedia , lookup
Clusterpoint wikipedia , lookup
Transcript
INFORMATION AND DATABASES Part 1 LEARNING OBJECTIVES • The importance of high-quality information and issues involved in managing business data. • Advantages of the Database approach. • Data Modeling and Database Design with Entity Relationship Diagram (ERD) • How organizations can maximize their strategic potential with databases : Data Warehouse, Data Marts, Data Mining, Business Intelligence Levels, Formats & Granularities of Information Information Type: Transactional And Analytical Information Quality • Business decisions are only as good as the quality of the information used to make the decisions • Characteristics of High-quality Information • Accurate • Complete • Consistent • Unique • Timely Information Quality Information Quality Low Quality Information Example The Costs Of Using Low-quality Information • The four primary sources of low quality information include 1. Customers intentionally enter inaccurate information to protect their privacy 2. Different entry standards and formats 3. Operators enter abbreviated or erroneous information by accident or to save time 4. Third party and external information contains inconsistencies, inaccuracies, and errors The Costs Of Using Low-quality Information . . . • Potential business effects resulting from low quality information include • Inability to accurately track customers • Difficulty identifying valuable customers • Inability to identify selling opportunities • Marketing to nonexistent customers • Difficulty tracking revenue • Inability to build strong customer relationships The Benefits Of Good Information • High quality information can significantly improve the chances of making a good decision • Good decisions can directly impact an organization's bottom line Information Timeliness • Real-time information – Immediate, up-to-date information • Real-time system – Provides real-time information in response to requests Difficulties in Managing Business Data • Amount of data increases exponentially. • Data are scattered and collected by many individuals using various methods and devices. • Data come from many sources. • Data security, quality and integrity are critical. • An ever-increasing amount of data needs to be considered in making organizational decisions. Data Life Cycle ( in modern businesses ) Data Management • File Processing Systems • Stand-alone applications with their own data files. • Data are NOT shared across applications • Redundancy , Inaccurate • DBMS – Database Management Systems • Use a DBMS software to create, store, organize, and retrieve data from a single database or several databases • Examples: Microsoft Access, SQL Server, Oracle Conventional Files vs. the Database File – a collection of similar records. • Files are unrelated to each other except in the code of an application program. • Data storage is built around the applications that use the files. Database – a collection of interrelated files • Records in one file (or table) are physically related to records in another file (or table). • Applications are built around the integrated database Difficulties of Non-Relational Data Files • Update Anomaly: not changing all occurrence of a data item (in many places) • Insert Anomaly: add an invalid (null record) to the database • Delete Anomaly: not remove all info (in many places) about a deleted record A File Processing System Database Management System Application1 Application2 Application3 DBMS DB Files vs. Database Database Management System (DBMS) Field Individual characteristics/attributes about an ENTITY. (Also called columns) Record A group of fields or attributes to describe a single instance/member of an ENTITY. (Also called rows) File A collection of records or instances for a given ENTITY. (Also called tables) A collection of files or entities containing Database information to support a given system or a particular topic area Main Database Activities/Functions • Recording Data (Input) • Entry Form • Processing Data (Query) • Structured Query Language (SQL) • Query by example (QBE) • Reporting Data (Output) • Report – a compilation of data that is organized and produced in printed format • Report Generators The Database Approach • DBMS provides all users with access to all the data. • DBMS minimize the following problems: • Data redundancy • Data isolation • Data inconsistency • DBMS maximizes the following issues: • Data security • Data integrity • Data independence Using A Relational Database For Business Advantages • Database advantages from a business perspective include • Increased flexibility • Increased scalability and performance • Reduced information redundancy • Increased information integrity (quality) • Increased information security Increased Flexibility • A well-designed database should • Handle changes quickly and easily • Provide users with different views • Have only one physical view • Physical view – Deals with the physical storage of information on a storage device • Have multiple logical views • Logical view – Focuses on how individual users logically access information to meet their own particular business needs Increased Scalability & Performance • A database must scale to meet increased demand, while maintaining acceptable performance levels • Scalability – Refers to how well a system can adapt to increased demands • Performance – Measures how quickly a system performs a certain process or transaction Database Approach • Advantages of the Database Approach • Program-data independence • Minimal data redundancy • Improved data consistency • Improved data sharing • Increased productivity of application development • Enforcement of standards • Improved data quality • Improved data accessibility • Reduced program maintenance Database Approach . . . • Costs/Risks of the Database Approach • New, specialized personnel • Installation/management cost and complexity • Conversion cost: what to do with legacy systems (old data) • Need for explicit backup and recovery • Data ownership: organizational conflict Data Hierarchy Database Field / Column File / Table Record / Row Student ID 2144 Last Name Arnold First Name Betty 3122 Taylor John 3843 Simmons Lisa 9844 Macy Bill 2837 Leath Heather 2293 Wrench Tim Database Management Systems Database Components Form Builder Report Writer Interactive Query Tool Application Program Database Front-end Database Engine To other computer systems Database Database Gateway To other DBMS brands Data Modeling • ERD (Entity Relationship Diagram) Blue print of Relational Database • Entity (object / info of interest) • Attributes (characteristics / fields) • Relationship (business rules) Entities • Entity is a group of attributes corresponding to the same conceptual thing about which we need to capture and store data (in a file/table) • Entity is a set of instances / members of the object that it represents (records) • Entity must have a unique name, unique identifier, and at least one attribute (the identifier itself is sufficient) Entities: Examples • • • • • Persons: agency, contractor, customer, department, division, employee, instructor, student, supplier. Places: sales region, building, room, branch office, campus. Objects: book, machine, part, product, raw material, software license, software package, tool, vehicle model, vehicle. Events: application, award, cancellation, class, flight, invoice, order, registration, renewal, requisition, reservation, sale, trip. Concepts: account, block of time, bond, course, fund, qualification, stock. Entity Instance: Example Entity instance – a single member of an entity. Attributes Entity Student ID Last Name First Name Instances 2144 Arnold Betty 3122 Taylor John 3843 Simmons Lisa 9844 Macy Bill 2837 Lea Heather 2293 Wrench Tim Attributes • An attribute is a descriptive property or characteristic of interest of an entity. Also called field. • The data type for an attribute defines what type of data can be stored in that attribute. • The domain of an attribute defines what values an attribute can legitimately take on. • The default value for an attribute is the value that will be recorded if not specified by the user. Entities & Attributes Entities & Attributes . . . ENTITY NAME CUSTOMER - entity id - attribute 1 - attribute 2 - ………….. - attribute n - Customer_ID - Cust_Name - Cust_Address - Cust_Phone Attributes • Simple vs. composite • Single-valued vs. multi-valued • Stored vs. derived • Null-valued Simple Vs. Composite • Composite attributes can be divided into smaller subparts, which represent more basic attributes that have their own meanings • Address can be broken down into a number of subparts, such as Street, City, State, Zip Code • Street may be further broken down by Number, Street Name, and Apartment/Unit Number • Attributes that are not divisible into subparts are called simple attributes Single-valued Vs. Multi-valued • Single-valued attribute means having only a single value of each attribute of an entity at any given time • A CUSTOMER entity allows only one Telephone Number for each CUSTOMER • If a CUSTOMER has more than one Phone Number and wants them all included in the database, then CUSTOMER entity cannot handle them Multi-valued • Multi-valued attribute means having the potential to contain more than one value for an attribute at any given time • An entity in a relational database cannot have multivalued attributes, must create another entity to hold them • Relational databases do not allow multi-valued attributes because they can cause problems: • Confuses the meaning of data in the database • Significantly slow down searching • Place unnecessary restrictions on the amount of data that can be stored Stored Vs. Derived • If an attribute can be calculated using the value of another attribute, it is called a derived attribute • The attribute that is used to derive the attribute is called a stored attribute • Derived attributes are not stored in the file, but can be derived when needed from the stored attributes Null-valued • Null-valued attribute – Assigned to an attribute when no other value applies or when a value is unknown THANKYOU (PART 2 IN NEXT LECTURE)