What is a database? What are the types of databases? What is flat-file database? What are the types of database software? What is a database? The basic electronic information storage unit A database is a collection of information organized to provide efficient retrieval. The collected information could be in any number of formats (electronic, printed, graphic, audio, statistical, combinations). There are physical (paper/print) and electronic databases. A database could be as simple as an alphabetical arrangement of names in an address book or as complex as a database that provides information in a combination of formats. Examples: phone book address book Census Bureau data What are the types of databases? 1. Single file (sometimes called as flat-file) 2. Multi file (relational) Flat file database Hierarchical database model Network database model Relational Database Model What is flat-file database? A relatively simple database system in which each database is contained in a single table. In contrast, relational database systems can use multiple tables to store information, and each table can have a different record format. A flat file can be a plain text file or a binary file. There are usually no structural relationships between the records. What are the types of database software? MySQL Freeware R Open Source (Recommended)
Database organisations and structure : relational, hierarchical network implementation issues and query languages. Methods to gain reliability, protection and integrity of database operation and management, Distributed Databases, Concurrency Control, Consistency and Introduction to Recovery in Distributed Databases. Q: You are writing a database application to run on your DBMS. You do not want your users to be able to view the underlying table structures. At the same time you want to allow certain update operations. Referring to the above scenario, what structure will you deploy? 1 Cursor table 2 Table filter 3 Dynamic procedure 4 View 5 Summary table ------------------------------------------------------------ You are defining the operational process of your RDBMS. Referring to the scenario above, which one of the following is a valid ongoing "operational process?" 1 OS requirement 2 User analysis 3 Performance monitoring 4 Data dictionary specification 5 System requirement ------------------------------------------------------------ You have been asked to construct a query in the company's RDBMS. You have deployed a Right Outer Join operation. Referring to the scenario above, what will happen to the final results when there is NO match between the tables? 1 The right table will return ALL rows. 2 The right table will return NULL. 3 Both tables will return NULL. 4 The left table will return ALL rows. 5 The left table will return NULL. ------------------------------------------------------------ Which phase of the data modeling process contains security review? 1 Structure 2 Design issue 3 Data source 4 Storage issue 5 Operational process ------------------------------------------------------------ Which one of the following is NOT a characteristic of metadata? 1 Data about data 2 Describes a data dictionary 3 Self-describing 4 Includes user data 5 Supports its own structure ------------------------------------------------------------ Which one of the following capabilities do you expect to see in a majority of RDBMS extensions to ANSI SQL-92? 1 Encryption key management 2 Graphical User Interface Widgets 3 Thread creation, execution, & coordination 4 Network socket creation/operation 5 If/Then, for, do/while statements ------------------------------------------------------------ What can a mandatory one to one relationship indicate? 1 More entities are needed. 2 The model should be denormalized. 3 The tables are not properly indexed. 4 The model cannot be implemented physically. 5 More attributes are needed. ------------------------------------------------------------ For performance, you denormalize your database design and create some redundant columns. Referring to the scenario above, what RDBMS construct can you use to automatically prevent the repeated columns from getting out of sync? 1 Cursors 2 Constraints 3 Views 4 Stored procedures 5 Trigger ------------------------------------------------------------ You are running a query against a relational database. Referring to the scenario above, what clause or command do you use in the query to help avoid a costly tablescan? 1 GROUP BY clause 2 INDEX command 3 HAVING clause 4 FROM clause 5 WHERE clause -------------------------------------- Which of the following uses a series of logically related two-dimensional tables or files to store information in the form of a database? A) Database B) Database management system C) Data warehouse D) None of the above ----------------------------------------------------- All of the following terms describe OLAP, except: A) The gathering of input information B) Processing input information C) Updating existing information to reflect to the gathered and processed information D) None of the above ------------------------------- Which tool is used to help an organization build and use business intelligence? A) Data warehouse B) Data mining tools C) Database management systems D) All of the above ------------------------------------ What does the data dictionary identify? A) Field names B) Field types C) Field formats D) All of the above ----------------------------------- What DBMS component contains facilities to help you develop transaction-intensive applications? A) DBMS engine B) Data definition subsystem C) Application generation subsystem D) Data administration subsystem --------------------------------------- Which of the following is a data manipulation tool? A) File generators B) Query by example tool C) Structure question language D) All of the above -------------------------------------------- What data manipulation tool is a standardized fourth-generation query language found in most DBMSs? A) Report generator B) Query-by-example tool C) Statistical tool D) None of the above ---------------------------------------------- The data administration subsystem helps you perform all of the following, except: A) Backups and recovery B) Query optimization C) Security management D) Create, change, and delete information -------------------------------------------------- Which data administration subsystem periodically backs up information contained in a database? A) Concurrency control facilities B) Reorganization facilities C) Backup and recovery facilities D) Security management facilities ----------------------------------------------------- Who is the person responsible for the more technical and operational aspects of managing the information contained in organizational databases? A) Chief information officer B) Data administration C) Data information officer D) None of the above ----------------------------- Which of the following uses a series of logically related two-dimensional tables or files to store information in the form of a database? A) Database B) Database management system C) Data warehouse D) None of the above ---------------------------- All of the following terms describe OLAP, except: A) The gathering of input information B) Processing input information C) Updating existing information to reflect to the gathered and processed information D) None of the above ------------------------------------- Which tool is used to help an organization build and use business intelligence? A) Data warehouse B) Data mining tools C) Database management systems D) All of the above ----------------------------------- What does the data dictionary identify? A) Field names B) Field types C) Field formats D) All of the above -------------------------------- What DBMS component contains facilities to help you develop transaction-intensive applications? A) DBMS engine B) Data definition subsystem C) Application generation subsystem D) Data administration subsystem ----------------------------------- Which of the following is a data manipulation tool? A) File generators B) Query by example tool C) Structure question language D) All of the above ----------------------------- What data manipulation tool is a standardized fourth-generation query language found in most DBMSs? A) Report generator B) Query-by-example tool C) Statistical tool D) None of the above --------------------------------- The data administration subsystem helps you perform all of the following, except: A) Backups and recovery B) Query optimization C) Security management D) Create, change, and delete information ------------------------------------- Which data administration subsystem periodically backs up information contained in a database? A) Concurrency control facilities B) Reorganization facilities C) Backup and recovery facilities D) Security management facilities ----------------------------------------- Who is the person responsible for the more technical and operational aspects of managing the information contained in organizational databases? A) Chief information officer B) Data administration C) Data information officer D) None of the above ----------------------------- Business intelligence is information about your customers, your competitors, your partners, your competitive environment, and your own internal operations. A) True B) False ---------------------------- Operational databases are databases that support OLTP. A) True B) False --------------------------------- A primary key field can be blank. A) True B) False ------------------------------ A data warehouse is a logical collection of information - gathered from many different operational databases - used to create business intelligence that supports business analysis activities and decision-making tasks. A) True B) False --------------------------------- Online transaction processing (OLTP) is the manipulation of information to support decision making. A) True B) False ------------------------------------- Data warehouses support only OLAP. A) True B) False --------------------------------- A database is a collection of information that you organize and access according to the physical structure of that information. A) True B) False ---------------------------- The physical view of information focuses on how you need to arrange and access information to meet the needs of the business. A) True B) False ----------------------------- A view helps you add, change, and delete information in a database and mine it for valuable information. A) True B) False ------------------------------- Structured query language (SQL) is a standardized fourth generation query language found in most DBMSs. A) True B) False Data Fundamentals Section No. Course Content (Data Fundamentals Section) Text Reference # of Lectures Introduction to 1 1 Introduction to Databases Chap. 1 1 2 Database Design Chap. 2 2 3 The Relational Model Chap. 3 1 4 SQL Chap 3 & 5 2 5 Relational Database Design Chap 4 2 6 Object Oriented databases Chap 7 1 7 MySQL 1 Lectures 8 Introduction to Knowledge and Knowledge Processing 1 9 Search Strategies Chap 4 3 10 Knowledge Representation Chap 2 3 11 Expert Systems Chap. 3 3 Important Notes: bullet The syllabus given here is tentative and can be modified by the instructor. The instructor will inform the students of any changes to the syllabus. bullet The number of lectures for each topic is based on an informed estimate, however depending on the overall class understanding of the topic the number of lectures may vary. bullet The assignment dates correlate with the lecture schedule. Any changes to the lecture schedule may consequently affect the assignment schedule. If the lectures proceed as scheduled above the assignment dates will remain valid, else the assignment schedule may be modified to ensure that the students have ample time to complete the assignments. ____________ What does the abbreviation DBMS stand for? Digital Base Mapping System. Database Manipulation Software. Data Borrowing and Movement Software. Database Management System. ___________________ Redundancy is minimised with a computer based database approach. True False ------------------ The relational database model is based on concepts proposed in the 1960s and 1970s. True False ------------- A row in a database can also be called a domain. True False ----------------- A first step in database creation should be needs analysis. True False ------------------------ In entity attribute modelling a many to many relationship is represented by M:M. True False -------------------------- In a networked web based GIS all communications must go through an internet map server. True False --------------------------- In an OO database approach 'object = attributes + behaviour'. True False ----------------------- In an OO database objects may inherit some or all of the characteristics of other objects. True --------------------------- What is a 'tuple'? A row or record in a database table. Another name for the key linking different tables in a database. Another name for a table in an RDBMS. An attribute attached to a record. More than one correct The advantages of Standard Query Language (SQL) include which of the following in relation to GIS databases? It is simple and easy to understand. It is widely used. It is good at handling geographical concepts. It uses a pseudo-English style of questioning. ---------------------------- Which of the following are characteristics of an RDBMS? Data are organized in a series of two-dimensional tables each of which contains records for one entity. It cannot use SQL. Queries are possible on individual or groups of tables. Keys may be unique or have multiple occurrences in the database. Tables are linked by common data known as keys. -------------------------------- Which of the following are issues to be considered by users of large corporate GIS databases? The need for concurrent access and multi-user update. The need for multiple copies of the same data and subsequent merging after separate updates. The need for manual transfer of records to paper. The need for multiple views or different windows into the same databases. The need to manage long transactions. ---------------------------- Which of the following are features of the object-oriented approach to databases? The ability to represent the world in a non-geometric way. The need to split objects into their component parts. The ability to develop databases using natural language approaches. The ability to develop database models based on location rather than state and behaviour. The ability to develop more realistic models of the real world. E-mail: admin@qureshiuniversity.com Telephone: (000) 0000000 Course website: Course Objectives The objectives of this course are twofold: (a) Introduce students to the fundamentals of data management leading towards the design and development of database systems; and (b) Introduce students to the fundamentals of knowledge-based problem solving, leading towards artificial intelligence search techniques used to develop intelligent systems. In essence, this course attempts to introduce concepts related to the abstraction/transition of ‘raw’ data to ‘useful’ information to ‘decision-support’ knowledge (as shown in the below diagram). Course Description This is an entry-level course regarding concepts related to database and intelligent systems. The introductory concepts presented during this course can be studied in more detail in advance courses at the third and fourth year level. In line with the course objectives, this course is divided into two interrelated sections: [1] data fundamentals section deals with issues pertaining to database design and development, such as database systems, relational data model, entity-relationship modelling, data normalization, and database programming languages—Sequential Query Language (SQL) [2] knowledge fundamentals section includes topics related to the representation knowledge such as propositional and predicate calculus, search strategies and automated reasoning strategies. An introduction to Prolog (an AI programming language) will be provided. The first half of the course will comprise lectures on data fundamentals, and will conclude with a mid-term test covering all data fundamental topics. The second half of the course will deal with knowledge fundamentals, and will conclude with a final exam that will comprise questions on all knowledge fundamental topics. A detailed description of the topics, for each section, covered during this course can be found in the syllabus document. Suggested Textbooks 1. The Essence of Databases by FD Rolland, Prentice Hall. 2. The Essence of Artificial Intelligence by A. Cawsey, Prentice Hall. Course Evaluation Scheme Course assessment will be based on the following components, each with an attached weight. At the end of the term, the weighted marks for each component will be linearly accumulated to give the total marks (out of 100) which will be translated into the final letter grade using the standard grade conversion table. Component Weight Logistics Assignments (6 in total) 18 % Due Date: To be specified with the assignment. Submission: In the class. Database Project 15 % Multiple Choice Questions (MCQs) to test the conceptual understanding of the subject. I find this style of questioning to be a more accurate and objective reflection of the student’s understanding of the subject, as opposed to ask students to regurgitate text material and write essay-type answers. Negative marking will be applied to MCQs. Analytical, problem-solving questions based on techniques covered during the lectures. You should bring your calculators for such questions. Short answer questions designed to make you think, analyze and co-relate the various topics covered in the course in order to make an informed answer. Such questions can be answered within 5-8 lines. 3. Assignments: There will be a maximum of 6 assignments. Students are expected to individually attempt ALL assignments. There will be assignments for both the sections of this course (see the schedule in the syllabus) (a) Data fundamentals section bullet The assignments are designed from the review and exercise questions given at the conclusion of each chapter in the text book. bullet The assignments largely focus on analysis and design related problems. bullet Each assignment will be given only after the topics pertaining to an assignment have been discussed by the instructor in the class. (b) Knowledge fundamentals section The assignments require students to exercise selected problem-solving techniques covered during the lectures. Assignment Marking Scheme: I have devised an interesting assignment marking scheme that you will find to maximize the impact of your best assignment result and minimize the impact of your worst assignment result. The marking scheme is as follows: Each assignment will be worth 100 marks. At the end of the term, all your assignments will be ranked based on the marks you score for each assignment. A weighted contribution of each assignment towards the final grade will be determined based on the following table. No. Ranking Weight % 1 1st (Highest) 20 2 2nd 18 3 3rd 17 4 4th 16 5 5th 15 6 6th (Lowest) 14 Total 100% 4. Project: The project will be about an applied perspective to database systems, requiring students to use any DBMS system of their choice and work with SQL to design a simple database system. Details of the project will be made available later at the course website (see project sidebar). Course Regulations Students are strongly advised to TURN-OFF their cellular phones and pagers during lectures—under no circumstances they should not ring during a lecture. bullet Students are expected to attend each lecture. In case you miss a lecture, it is your responsibility to find out about the proceedings of that particular lecture from your class-mates and act accordingly. Assignments and announcements will be posted on the course website. It is the student’s responsibility to routinely check the course website for assignments and announcements. Students should properly submit their assignments, i.e. staple all loose pages. Loss of marks due to missing pages cannot be rectified at a later stage by producing the missing material. Each assignment should have the following information: Name, Banner Id, Course Id, Assignment Number, Date and Email contact. Students can meet me in my office during my office hours. To meet me outside office hours you need to make an appointment either by email, phone or in person. The assignments and project will be graded by a designated marker. If you are not satisfied with the marks, you may contact the marker and explain your point/answer. The marker may try to re-examine your assignment in view of your explanation. However, you are discouraged to routinely approach the marker after each assignment as it may reflect adversely. If you are still unsatisfied than contact the instructor. Make-Up Test Policy: There will be no make-up test. However, exemption will be given to students who were either sick or involved in an emergency situation (such as the loss of a loved one) on the test day. The possibility of a make-up test is subject to the production of (a) letter from a physician detailing the student’s inability to take the test; or (b) evidence of an emergency situation. The make-up test may not necessarily follow the test format given above. Assignment Late Policy: All assignments are expected to be submitted on time right after the lecture on the specified due date. A grace period of 1 hour after the lecture is provided, after which the assignment will be declared as being late (even if it is submitted on the due date). Late submission will be penalized as per the below late assignment submission penalty scheme. The maximum late submission allowance is 3 days (including the due date), after which the assignment will not be accepted for evaluation. After 2 late assignment submissions, the student will be required to provide an explanation for each subsequent late assignment submission. And, the late assignment submission penalty will be increased to: –12.5%, -22.5% and –32.5% for window 1, window 2 and window 3, respectively (instead of the standard –10%, -20% and –30%). If you are unable to submit on time due to illness or an emergency situation than you need to inform me with proof of illness or emergency. No make-up assignment will be given, rather the submission deadline will be extended. The mid-term test and the final exam are both "closed book". However, you may bring ONE Single-Sided 8.5 x 11 inch reference page containing material of your choice. The reference page should have your name and banner id on it and should be typed using a font not less than 12 points. No handwritten notes will be allowed. Databases |