Download Design - LordSeeks

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Team Project: Index Writer
Bharat Vallarapu
Hiren Pandya
Junshan Li
Fall, 2001
Index Writer Project Overview
• Problem Statement – Bharat
• Identify Nouns and Verbs – Hiren
• Classes, attributes, operations, and constraints – Junshan
• Operation Model (Schemata) – Junshan
• Class diagrams – Hiren
• Questions ? – Junshan
AN INDEX WRITER PROBLEM STATEMENT
An index writer will produce an alphabetized list (an index) of
important words (word list) and phrases found in a document of
any length. Associated with each word and phrase in the index
will be a listing of all of the page numbers (in ascending order) where
that word or phrase may be found in the document.
The index writer will exclude lists of common words (e.g., articles
and prepositions) that will not appear in the index created. The index
writer will ignore the textual separators (such as spaces, line
endings, punctuation marks). The index writer will determine the
maximum number of lines for a page and the maximum number of
words for a line in a page.
(Problem Statement – continued )
The index writer will provide a user interface which will allow the
user to put a certain word either on common word list, word list, or
ignored word list.
For fine tuning, the index writer will form valid phrases from the
word list. The user interface should also provide the user the ability
to build phrases from the word list, then add the phrase into the
word list.
The final product will be an index list with two columns and a
common word list. The common word list is for the user to modify
if one wants to add or delete a word from the common word list.
The words or phrases of the final index list will be on the left
column, the page number on the right, such as an instance
commonly seen in the textbooks.
Identify Nouns and Verbs – Hiren
Nouns (Qualified Nouns)
Verbs (Verb Phrases)
Index Writer
Produces an index
Document
Consists of pages
Page
Contains lines
Line
Contains words or phrases
Word, Phrase
Belong to word lists
Word List
Contains important words
Common Word List
Contains common words
Common Word
Is articles or prepositions, or user defined
Important Word
Belongs to a Word List
Page Number
Belongs to a page
Index
Contains alphabetized list & page numbers
List of Page Numbers
Is page numbers for a certain word or phrase
File
Contains words
Article & Preposition
Belongs to the common words
User
Uses Index Writer
User Interface
Provides functionalities to users
Separator, Space, Punctuation mark
Ignored by the Index Writer
Selected Classes, Attributes, Operations,
and Constraints – Junshan
Classes
Comment
IndexWriter
Produces an index & excludes common
words
WordList
A super class of lists of words.
Document
Contains pages
Page
Contains words and phrases
User
Selects text document
Index List
Contains an alphabetized list and page
numbers in ascending orders
ComWordList
A list with articles & prepositions, etc
UserWordList
Contains common words in a file
IgnoreWordList
Contains the common word user defined
Word
A basic class
Candidates for Inheritance:
WordList
• ComWordList
• Index
• UserWordList
Verbs & Verb phrases for associations:
• User selects text documents
• IndexWriter processes document
• IndexWriter produces word lists
System Object Model:
The IndexWriter Class will play a central role in this model. It will take a
text document as an input and produces an index from the word list of
the document.
System Operations:
• readDocumnet
• addToIndex
• addToNewComWList
• deleteFromWordList
• displayWordList
Class Diagrams -- Hiren
User
Uses
IndexList
IndexWriter
produces
*
WordList
Process
UserWList
Text
Document
*
Page
IgnoreWList
*
Phrase
Word
2
*
Operation Model Schemata
System Operation Schema For readDocument
• Description: Document is read to the IndexWriter
• Reads: Document, size of a page
• Changes: WordList (initially empty)
• Assumes:
• Result: The initial wordList is built.
System Operation Schema For addToNewComWList
• Description: add the chosen common word to
the newComWList
• Reads:
• Changes: wordList, newComWList
• Assumes:
• Result: wordList, newComWList are updated.
Operation Model Schemata (Continued) :
System Operation Schema For deleteFromWordList
• Description: delete the chosen common word from wordList
• Reads:
• Changes: wordList
• Assumes:
• Result: wordList is updated.
System Operation Schema For displayWordList
• Description: display the selected wordList
• Reads:
• Changes: nothing
• Assumes:
• Result: The chosen wordList is printed out to the screen.
Questions ?