Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
others Research in Content and Knowledge Management - ICONS Project Prof David Bell Queen’s University, Belfast others ICONS formal information • Rodan Systems - co-ordinator, initiator, project management, architecture, prototype development, procedural knowledge • Queen’s University /Univ of Ulster - knowledge management paradigms • IPI Polish Academy of Sc - tools, standards, methods, user interface • University of Dauphine - distributed content repository • Universita della Calabria - inference machine Datalog • Sema Group - development of the e-Government Portal • InfoVide - ICONS deployment methodology • Budget: > 3 million EURO; founded 1,9 • Duration: 24 months; effort 350 man-months 2 others ICONS Partners • • • • • • • Rodan Systems S.A., Warsaw, Poland. Chief responsibility: project management, ICONS architecture, prototype development. Contact persons: dr Witold Staniszkis, dr Bartosz Nowicki. Website: www.rodan.pl. Corporate profile presentation: Rodan.ppt (PowerPoint, 1.573 KB). Institute of Computer Science, The Polish Academy of Sciences (ICS), Warsaw, Poland. Chief responsibility: assessment of tools, standards, and methods, advanced graphic user interface. Contact person: prof. Kazimierz Subieta. Website: www.ipipan.waw.pl. Corporate profile presentation: ICS_PAS.ppt (PowerPoint, 140 KB). Centro di Ingegneria Economica e Sociale (CIES), Arcavacata di Rende, Italy. Chief responsibility: multi-paradigm knowledge representation Contact persons: prof. Nicola Leone, prof. Pasquale Rullo. Corporate profile presentation: CIES.ppt (PowerPoint, 147 KB). Infovide SA, Warsaw, Poland. Chief responsibility: design and development of the "Structural Fund Project Knowledge Portal". Contact person: Marcin Lewandowski. Website: www.infovide.pl. Corporate profile presentation: Infovide.ppt (PowerPoint, 339 KB). Sema Group Belgium, Brussels, Belgium. Chief responsibility: design and development of the "Structural Fund Project Knowledge Portal". Contact persons: Jules Georges, Stoimir Djoudjev. Website: be.sema.com. Corporate profile presentation: SchlumbergerSema.ppt (PowerPoint, 2.626 KB). University Paris 9 Dauphine, Centre Des Etudes Et De Recherches En Informatique Appliquee, Paris, France. Chief responsibility: distributed content repository. Contact person: prof. Witold Litwin. Website: ceria.dauphine.fr. Corporate profile presentation: Dauphine.ppt (PowerPoint, 195 KB). University of Ulster/Queen’s University Belfast,, U.K. Chief responsibility: multi-paradigm knowledge representation. Contact person: Prof. David Bell. Web: http://www.qub.ac.uk/researchcentres/KDE/ Corporate profile presentation: Ulster.ppt (PowerPoint, 144 KB). 3 others Project goals • Developing a stable prototype • Demonstration of the viability of the ICONS prototype in a real application – e-Government demonstrator • Supporting uniform, knowledge-based access to – distributed information resources available in the form of web pages, – pre-existing heterogeneous databases, as well as – legacy information processing systems. • Managing knowledge base comprising – meta-information representing the domain ontology of various nature (structural, procedural, declarative, knowledge maps) – multimedia content 4 others Specific Issues QUB are addressing Prof David Bell Yaxin Bi Dr Hui Wang Gongde Guo QUB and UU others Introduction • Knowledge Representation • Collaboration • Information Integration 6 others 1. Knowledge Representation (D-S) • Dempster-Shafer Theory – Data Representation using Dempster-Shafer theory • Extended relational database model – Evidence and Rules Rule Strengths Data and Evidence From Evidence Strengths And Rule Strengths To Hypothesis Strength 7 others A Well-known Example: The universal set is the set of all possible conclusions. A particular piece of evidence supports a Subset of these. eg. Evidence from a lab supports conclusion that Tom’s disease is Cirrhosis of the liver or Hepatitis to the degree 0.60. This can be combined with other evidence and the information can be stored in an extended relational model eg. in test 2 relation below others Extending the Relational Model to support Dempster-Shafer Methods Test 1 Patient Name Test Name ……. Result …….. ……. ……. ……. Tom Lab …… <{Cirr,Hep}0.6,>,<,0.4> Tom Image …… <{Cirr,Hep,Pan},0.7>,<,0.3> …… …… …… …… 9 others Combining these 2 pieces of evidence….. Lab This gives: . 60 {C,H} .40 .70 {C,H,P} .42 {C,H} .28 {C,H,P} .30 .18 {C,H} .12 Image • • • m({C,H}) = .60 m({C,H,P}) = .28 m() = .12 10 others Test 2 Patient Name …….. Result ……. …… …….. Tom ……. ……. ……. <{Cirr,Hep,Pan},0.28>, <{Cirr,Hep},0.60>,<,0.12> ……. ……. 11 others Weighing Evidence Use with Data Mining (eg Text Categorisation) and/or Risk Analysis? 12 others Knowledge Representation (some other paradigms) • Bayesian Belief Networks – Generating causal trees – Reasoning using a causal tree with activated data Belief updating Belief propagation The PropBel algorithm • Hyperrelations used for representing mined knowledge TD HR TF WR WW 13 others 2.Collaboration • Data Mining – – – – Clustering Classification/Categorisation Rule Extraction Summarization 14 others Architecture of Text Categorization Predefined Document Document Converting Function word Removal Word Stemming Classifier Construction Feature weighting Unclassified Documents Document Categorization Feature Selection Dictionary Construction Data Input TC Engine Write the Results into XML File Data Pre-processing The arrow with dash line represents the data flow in categorization process as the arrow with solid line represents the data flow in classifier construction process 15 others Implementation • Implement a platform for text categorization • Implement a k-NN Model classification algorithm • Integrate three classification algorithms, including SVM, KNN and ROCCHIO • Support multi-types of documents • Support multi-language documents 16 others Evaluation Category SVM K-NN Rocchio kNNModel Alt.atheism 94.85 88.67 83.85 91.38 … … … … … Sci.cry 94.33 89.16 89.43 88.61 sci.med 93.91 88.40 88.22 90.54 Sci.space 89.80 83.45 86.76 86.83 Soc.religion.christian 86.70 83.17 81.00 83.33 Talk.politics.guns 93.64 8905 85.51 88.94 Talk.politics.mideast 96.12 91.39 93.09 91.54 Talk.politics.misc 87.38 85.11 75.37 83.29 Talk.religion.misc 88.38 80.38 74.18 79.47 macroaveraged F1 83.60 79.07 78.80 80.79 17 others 3. Intelligent Agents 18 others Integration Intelligent, web services based agents a software entity that carries out some set of operations on behalf of a user or another program with some degree of independence or autonomy, and in so doing, employing some knowledge or representation of the user’s goals or desires 19 others Implementation of the IADE • • • • An Intelligent Agent Development Environment has been implemented which is built on the JADE agent platform Two applications – Communities of Practice and Web Services, as part of a knowledge management system. Web Services return explicit knowledge from which a user can formulate a request for tacit knowledge to be returned by an expert through the CoP applications. Admin and user GUI has been written to allow admin of the agents and ontology manipulation for Web Services. 20 others Two kinds of Agent Answering hard questions from user Helping user see more 21