Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Integration of Agent and Data Mining Longbing Cao University of Technology, Sydney Content • Introduction • Agents can enrich data mining • Data mining can improve agents • Ontology-based integration of agents and data mining • Demo • Conclusions and directions INTRODUCTION Data mining & multiagent research group at UTS • Cross disciplinary researchers interacting at the group • Integrated research of data mining and multi-agent system – http://datamining.it.uts.edu.au • Real-world applications of the integration – Capital markets – F-Trade Agents as a new computing paradigm for complex problems • Strengths – Analyze and understand complex systems – Deal with nonfunctional requirements – Handle social complexity such as distribution, dynamics, interaction, evolution, self-organization – Build flexible infrastructure • Weaknesses – Lack machine learning capability – Lack in-depth analytics – Lack knowledge representation Data mining and knowledge discovery as an effective tool for indepth analysis • Strengths – Deep data analysis – Deep knowledge discovery • Weaknesses – Nothing related to system infrastructure – Deal with social complexity such as distribution, dynamics Bilateral enhancement of agents and data mining by the integration • Agents can enrich data mining • Data mining can improve agents • Mutual enhancement: integration between data mining and multi-agent system AGENTS can ENRICH DATA MINING Building agent-based data mining systems • Agent-based data mining system – F-Trade • Agent-based distributed data mining system – Agent-based distributed data mining systems, such as BODHI, PADMA, JAM, Papyrus • Agents for multiple data source mining • Agents for web mining Data mining models as agents • Intelligent data mining agents – modeling data mining algorithms as agents • Data mining model integrator – integrating data mining algorithms • Data mining model planner – smartly managing data mining algorithms • Data mining model recommender – recommending appropriate algorithms Agent-based mediation and management of distributed and large-scale data sources • Data gateway agents for connecting data sources • Distributed data preprocessor agent • Data integrator agents for data integration • Agents for data clustering • Agents for ensemble mining in distributed data • Agents for data sampling and assumption User and interaction agents for data mining • Human agent interaction for data mining • Agents for interactive mining • Agents in human-guided mining • Domain knowledge management using agents • User agents for preparing mining reports • Agents for circulating mining results Case study 1 -- F-Trade Users/CMCRC/Instituations (Anybody,anytime,anywhere, from MAS & KDD & Finance) Applications developers KDD Researchers (Frequent and abnormal patterns discovery, optimization of trading strategies, correlation analysis) Aims/Motivations: Network (Internet & LAN) AAMAS Researchers (OCAS, AOSE, OADI, OSOAD) (Services for system components,algorithm and multiple data sources) F-Trade (open automated enterprise services, and personalized services) Data Sources (Diff. Providers: AC3, HK market, CSFB, etc. Diff. Formats: FAV, ODBC, JDBC, OLEDB, etc. ) • Research Service Provider for AAMAS and data mining • Integrated Infrastructure for Financial Trading and Mining Support Case study 1 -- F-Trade System infrastructure Case study 1 -- F-Trade F-TRADE: Financial Trading Rules Automated Development & Evaluation Case study 1 -- F-Trade Algorithm as an agent Case study 1 -- F-Trade AgentService RegisterAlgorithm(algoname;inputlist;inputconstraint;outputlist;outputconstraint;) Description: This agent service involves accepting registration application submitted by role PluginPerson, checking validity of attribute items, creating name and directory of the algorithm, and generating universal agent identifier and unique algorithm id. Role: PluginPerson Pre-conditions: -A request of registering an algorithm has been activated by protocol SubmitAlgoPluginRequest -A knowledge base storing rules for agent and service naming and directory Type: algorithm.[datamining/tradingsignal] Location: algo.[algorithmname] Inputs: inputlist InputConstraints: inputconstraint[;] Outputs: outputlist OutputConstraints: outputconstraint[;] Activities: Register the algorithm Permissions: -Read supplied knowledge base storing algorithm agent ontologies -Read supplied algorithm base storing algorithm information Post-conditions: -Generate unique agent identifier, naming, and locator for the algorithm agent -Generate unique algorithm id Exceptions: -Cannot find target algorithm -There are invalid format existing in the input attributes Agent plugand-play Case study 1 -- F-Trade Agent for multiple data sources management Case study 1 -- F-Trade Agent for reporting Visual Reports Point Reports Transactions Reports Summary Reports Input Reports Case study 2 – agent-based WEKA Case study 3 – ensemble DATA MINING can IMPROVE AGENTS Data mining-driven multiagent learning • DM-driven learning in MAS – Coordination learning – Individual learning – Group/collective learning – Distributed learning – Online/offline learning Data mining-driven evolution and adaptation in MAS • Evolution of MAS based on hidden rules, so mine these rules and fill into the agent knowledge base for designing evolutionary agent systems • Adaptive capability mining for enhancing agent’s adaptation • Self-organization rule mining Data mining for agent communication, planning and dispatching • Cluster and classification • Class/segment-based communication • Class-based planning and dispatching DM-based User modeling • Modeling user behavior from DM – Game player modeling – Trader’s behavior modeling – Trader’s role modeling • User-agent interaction based on user modeling – Trader agents’ interface design – Trader-agent interaction rule design DM-based User servicing • DM-based agents for serving users – Visualization mining for reporting – Customer-relationship management for customer care • DM-based recommender agents – Stock recommender – In-depth rule recommender – Trading rule-stock recommender Case study - learning • Agent learning via machine learning – Reinforcement learning – Evolutionary multiobjective methods – Evolutionary algorithm – Markov decision process – Temporal difference method Case study – user modeling • Trader’s behavior modeling • Trader’s role modeling – Market order – Limit order MarketOrder LargeMarketOrder January February Large market orders analysis Case study - servicing • Pairs trading – Mining correlated stock pairs – Correlated stock miner agent – Stock pairs recommender – Pairs trading strategy solution Case study - servicing • Optimized rules – Mining in-depth rules – In-depth rule miner agent – User interface agent – Optimized rules recommender – Optimized trading strategy solution Case study - servicing • Rule-stock pairs – Mining rule-stock pairs – Rule-stock pair mining agent – User interface agent – Rule-stock pair recommender – Trading strategy solution Return on investment ONTOLOGY-BASED INTEGRATION OF AGENTS AND DATA MINING Ontology for domain understanding and interaction • Domain ontology for understanding the domain problems • Problem-solving ontology • Task ontology • Method ontology Ontology for knowledge management • Ontology for organizing agent systems • Ontology for organizing mining algorithms • Ontology for user interaction • Managing domain ontology/task ontology/problem-solving ontology/method ontology Ontology-based system architecture • Multi-domain ontological space – Related problem domains – Agent ontology domain – Data mining ontology domain • Hybrid ontology structure for organizing ontologies crossing multiple domains Ontological engineering for the integration • Ontology namespace • Ontology mapping structure • Semantic rules for ontology mapping • Ontology transformation • Ontology query • Ontology search and discovery Business Profiles Task View Domain Ontology Task Ontology Businesslogic View BL Ontology Method View Problemsolving System Resource View Method Ontology Resource Ontology PS Ontology DO-to-PSO linkage Internal PSO linkage Stock FinancialOrder f Limit Order Market Order ... Stop Order in s pa ta rtnc of eof su bc la ss -o OrderOperation Amend Price Enter Dealer Trade Delete Date Time Cantr Volume Algorithm Agent Input ontologies ... ... Output ontologies Resource ontologies ... ... Knowledge ontologies ... …. M1 ? M2 N1 = N2 || N1 N2 N1 N2 M1 M2 Equivalence, similarity Synonyms, encoding, conventions, paradigms, scaling M1 M2 Scope, coverage, granularity Generalization, specialization =M1M2 Naming conflict, homonymy Disjointness, antonyms M1M2 <min(M1, M2) Scope, coverage, granularity Overlapping M1 M2 Naming, encoding Instantiation - (part_of (A, B) part_of (B, C)) part_of (A, C) - (substitute_to (A, B) substitute_to (B, C)) substitute_to (A, C) Ontology 1 Root Concept - fixed - resident - id - local_fee - remote_fee - IP - business Ontology 2 Root Concept - home - local_call - intraprovince - interprovince - international - Hongkong - Taiwan - Macau - IP - business conceptto-concept attributeto-concept attributeto-attribute 1 attribute-tom*attribute Rule 4. - (A AND B), B ::= substitute_to(A, B) A OR B, the resulting output is A or B Rule 5. - (A AND B), B ::= disjoint_to(A, B) A AND B, the resulting output is A and B DEMO CONCLUSIONS and DIRECTIONS