Download Metodi Decisionali per l`e

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
MIND
Models in decision making & data @nalysis
Enza Messina and Francesco Archetti
Main Activities
Research Areas
o Machine Learning Algorithms
o Probabilistic and Relational Models
o Optimization Under Uncertainty
o
o
o
o
Applicative Domains
Multimedia Document
Life Sciences
Ambient Intelligence
Finance
Faculty:
Post Doc:
PhD:
Others:
Francesco Archetti
Enza Messina
Guglielmo Lulli
Elisabetta Fersini
Federica Bargna
Daniele Toscani
Ilaria Giordani
Gaia Arosio
Luigi Quarenghi
Machine Learning and Relational Data
-
Traditional learning methods are consistent with the classical
statistical inference problem formulation

-
istances are independent and identically distributed (i.i.d.)
but do not reflect the real world!
 We need a solution able to deal with relationships and
with uncertainty in more general terms
Probabilistic
Models
SL
Learning
Techniques
Probabilistic
Models
Learning
Techniques
SRL
Relational
Representation
The World is Uncertain
Graphical Models (here e.g. a Bayesian network) - model uncertainty explicitly by
representing the joint distribution
Fever
Ache
Random Variables
Direct Influences
Influenza
Propositional Model!
Real-World Data are structured
Non- i.i.d
PatientID Gender Birthdate
P1
M
3/22/63
PatientID Date Physician Symptoms
P1
P1
1/1/01
2/1/03
Smith
Jones
palpitations hypoglycemic
fever, aches influenza
First-Order
Logic / Relational
Lab Test
Result
PatientID SNP1 SNP2
Databases
PatientID Date
P1
P1
1/1/01 blood glucose
1/9/01 blood glucose
42
45
PatientID Date Prescribed Date Filled
P1
5/17/98
5/18/98
P1
P2
Diagnosis
AA
AB
… SNP500K
AB
BB
BB
AA
Physician Medication Dose
Jones
prilosec
10mg
Duration
3 months
Probabilistic Relational Models

Integrate uncertainty with relational model

Convenient language for specifying complex models



“Web of influence”: subtle & intuitive reasoning
Framework for incorporating heterogeneous data by connecting
related entities (consider also relation uncertainty)
New problems:


Relational clustering
Collective classification
Heterogeneous
Information
L N
E E
A R
R
Gene Cluster
Exp. type
GCN4
HSF
Lipid
Exp. cluster
Endoplasmatic
Level

Open Problems: Inference and Learning
Inference
Uncertainty, Relations, Dynamics
DPRM
Causal
Relationship
s
PRM,RBN,SLP…
Bayes Net
MRDM,
ILP
DBN
Relational
Markov Model
Sequence
(Hidden)
Markov Model
Some Applications
Document Analysis

Learning Models for Relational Data:

Relational Clustering
1. Constraint Learning
2. Objective Function Adaptation
 Relational Classification:


Probabilistic Relational Models with Relational Uncertainty
Conditional Random Fields
Document
Link
♦ document_id
class
Rlv
#origin_ref
#destination_ref
Publications
E. Fersini, E. Messina, F. Archetti, “A probabilistic relational approach for web document clustering”, Journal of Information
Processing and Management, Vol. 46, no 2, p. 117-130, 2010.
E. Fersini, E. Messina, F. Archetti. “Web page classification: A probabilistic model with relational uncertainty”. In Proc. of the 2010
Conference on Information Processing and Management of Uncertainty, 2010.
E. Fersini, E. Messina, F. Archetti, Probabilistic relational models with relational uncertainty: an early study in web page classification,
IEEE WI-IAT Workshop, 2009.
Submitted
E. Fersini, E. Messina, F. Archetti, “Probabilistic relational models with relational uncertainty”, Journal of Information Processing and
Management, (second revision).
Document Analysis
E-Forensics

JUdicial MAnagement by Digital Libraries Semantics
Information Extraction
Hearing Summarization
Proceedings n° ……..
Accused Name
XXXXXX
Witness Name
KKKKKK
Prosecutor Name
-
Lawyer Name
YYYYYY
ZZZZZZ
Meeting Date
1989
Meeting Location
Civitanova Marche
Emotion Recognition
Document Analysis
E-Forensics
Publications
E. Fersini, E. Messina, F. Archetti. “Multimedia Summarization in Law Courts: A Clustering-based Environment for Browsing and
Consulting Judicial Folders”. In proc. of the 10th Industrial Conference on Data Mining, 2010.
E. Fersini, G. Arosio, E. Messina, F. Archetti, “Emotion recognition in judicial domain: a multilayer SVM approach, LNAI, in Proc. of
the 6th International Conference on Machine Learning and Data Mining, Leipzig, 2009.
E. Fersini, G. Arosio, E. Messina, F. Archetti, D. Toscani. Multimedia Summarization in Law Courts: An Environment for Browsing and
Consulting Judicial Folders. In Proc. of the 2nd International Conference on ICT Solutions for Justice, Skopje, 2009.
E. Fersini, F. Callegaro, M. Cislaghi, R. Mazzilli, S. Somaschini, R. Muscillo, D. Pellegrini,. Managing Knowledge Extraction and
Retrieval from Multimedia Contents: a Case Study in Judicial Domain. In Proc. of the 2nd International Conference on ICT
Solutions for Justice, Skopje, 2009.
G. Felici, E. Fersini, E. Messina, Information extraction through constrained inference in Conditional Random Fields, AIRO 2010,
september 2010.
Submitted
E. Fersini, E. Messina, F. Archetti. “Emotional States in Judicial Courtrooms: An Experimental Investigation”. Sumbitted to Journal of
Speech Commiunication.
E. Fersini, E. Messina, D. Toscani, F. Archetti, M. Cislaghi. Semantics and machine learning for building the next generation of judicial
case and court management systems. Submitted to the Int. Conference on Knowledge Management and Information Sharing
Submitted Projects
 Progetto PON
eJRM - electronic Justice Relationship Management
Life Sciences
Systems Biology Applications
Learning gene regulatory networks
Gene
DNA
Control
Coding
+
Transcription
RNA
single
strand
Regulatory modules
Modelling the pharmacology of cancer
Human
cancer
Gene
expressio
n
Collaborations
Drug
Activity
Gene drug interaction
identification of a drug treatment for a given cell
line based both on drug activity pattern and gene
expression profile
Pharmacogenomics Application:
Predict drug response to oral anticoagulation therapy (OAT)
Grouping (Profiling) patients based on their clinical and genotypic features
in order to suggest doctors the correct drug dosage
Data of about 4000 patients:
Haemorragic risk
Thrombotic risk
 Clinical and therapeutical data: personal patients data,
medical diagnosis, therapy, INR and dosage measurements
 Genetic data: polymorphism of three genes: CYP2C9,
VKORC1 and CYP4F2 that contribute to differences in
patients’ response.
In collaboration with
14
Publications
G. Ogliari, I. Giordani, A. Mihalich, D. Castaldi, A. Di Blasio, A. Dubini, E. Messina, F. Archetti, D. Mari, Nuova classificazione
clinica e Farmacogenetica per predire la dinamica dell'inr nell'anziano in tao. Giornale di gerontologia, vol. lvii; p. 495-496, issn:
0017-0305, dicembre 2009
F. Archetti, I. Giordani, E. Messina, G. Ogliari, D. Mari, "A comparison of data mining approaches in the categorization of oral
anticoagulant patients", International Workshop of Applications of Machine Learning in Bioinformatics (satellite workshop of IEEE
International Conference on Bioinformatics and Biomedicine- BIBM, november 2009
E. Fersini, C. Manfredotti, E. Messina, F. Archetti Relational K-Means for Gene Expression Profiles and Drug Activity Pattern
Analysis, to appear on Int. Journal of Mathematical Modelling and Algorithms.
F. Archetti, I.Giordani, L. Vanneschi, “Genetic Programming for Anticancer Therapeutic Response Prediction using the NCI-60
Dataset”, Computers & Operations Research, Vol.37, No.8, pp.1395-1405, August 2010.
E. Fersini, I.Giordani, E.Messina, F. Archetti, "Relational Clustering and Bayesian Networks for Linking Gene Expression
Profiles and Drug Activity Patterns", International Workshop of Applications of Machine Learning in Bioinformatics (satellite
workshop of IEEE International Conference on Bioinformatics and Biomedicine- BIBM, november 2009.
L. Vanneschi , F. Archetti, M. Castelli, I. Giordani, "Classification of Oncologic Data with Genetic Programming," Journal of
Artificial Evolution and Applications, vol. 2009, Article ID 848532, 13 pages, 2009. doi:10.1155/2009/848532.
F. Archetti, I.Giordani, L. Vanneschi, “Genetic Programming for QSAR Investigation of Docking Energy”, Applied Soft
Computing, Vol. 10, No. 1, pp. 170-182, issn: 1568-4946, Jan 2010.
Submitted
F. Archetti, I.Giordani, G.Mauri, E.Messina. “A new clustering approach for learning transcriptional regulatory modules”, submitted to Int.
Journal of Data Mining and Bioinformatics, (second revision).
Projects
Submitted proposals:
Associazione lotta alla trombosi - Call for applications 2010
Oral Anticoagulation Therapy in the elderly and women
Partners:
Brunel University, Centre for Intelligent Data Analysis
Harvard Medical School, Biomedical Cybernetics Laboratory
Univ. of Milano, Dept. of Medical Sciences, Geriatrics Unit
Ist. Clinico Humanitas - Thrombosis Unit (Corrado Lodigiani, MD, PhD)
Ist. Auxologico Italiano, IRCCS Centro di Ricerche e Tecnologie Biomediche,
 PON
HEARTDRIVE
Project Coordinator: Calpark – Parco Tecnologico e Scientifico della Calabria
 PRIN
Revealing
common patterns among insuline resistance, osteoporosis
diseases by using Bayesian Networks.
Project Coordinator: Università degli Studi "Magna Graecia" di CATANZARO
and
chronic
inflammatory
Ambient Intelligence
Multi-target tracking
Multi-target tracking: finding the tracks of an unknown number of moving targets
from noisy observations.


Exploiting relations can improve the efficiency of the tracker
Monitoring relations can be a goal in itself
We model the transition probability of the system with a RDBN.

A new representation modelling not only objects but also their relations
A new computational strategy based on a family of Sequential Monte Carlo methods
called Particle Filter

Statistical techniques for the detection of anomalous behaviours

Publications
Cristina E. Manfredotti, Enza Messina: Relational Dynamic Bayesian Networks to Improve Multi-target Tracking. ACIVS
2009: 528-539.
C. Manfredotti, E. Messina, D.J. Fleet, Relations to improve multi-target tracking in an activity recognition system.
Proceedings of the International Conference on Imaging for Crime Detection and Prevention, London, 2009.
In collaboration with
Wireless Sensor Networks



Bayesian abstractions for virtual sensing through low cost data aggregation and netwide anomaly detection
Modelling Cluster Heads as nodes of a BN
Inference to know sensor values also in presence of temporary faults:


Lack of communication (sensor failure or sleep)
Outlier due to sensor malfunctioning
CH2
CH1
BN
CH5
CH4
sink
CH3
WSN
Publications
F. Archetti, E. Messina, D. Toscani and M. Frigerio - IKNOS – Inference and Knowledge in Networks Of
Sensors. International Journal of Sensor Networks (IJSNet), Vol.8 No. 3, 2010
F. Chiti, R. Fantacci, F. Archetti, E. Messina, D. Toscani, Integrated Communications Framework for Context
aware Continuous Monitoring with Body Sensor Networks, IEEE Journal on Selected Areas in
Communications - Wireless and Pervasive Communications for Healthcare. Volume 27, Issue 4, 2009.
Submitted
D. Toscani, I. Giordani, M. Cislaghi, L. Quarenghi. Querying Sensor Data for Environmental Monitoring.
Submitted to International Journal of Sensor Networks (IJSNet), 2010
D. Toscani, I. Giordani, L. Quarenghi, F. Archetti . A software Environment For Supporting Sensor Querying.
Submitted to IEEE Sensors 2010 Conference, Hawaii, 2010
19
Transportation & Logistics
ori
gf
u
L
ufv
P
hjf
w
k
w
w
f
j ,t
f
u ,T
Data
Models
Decisions
In collaboration with:
Projects
PRIN MIUR
Enhancing the European Air Transportation System
Partners: Università di Padova, Università di Trieste.
j
 wh ,t l  wk ,t l
f
f
 wv,T  ww,T  1
f
f
de
stf
Ambient Intelligence
Projects
LENVIS - Localised environmental and health information services for all (EU-FP7)
sviluppo di una rete collaborativa di supporto alle decisioni, per lo scambio di informazioni e servizi
riguardanti l'ambiente e la salute
Publications
D. Toscani, L. Quarenghi, F.Bargna, F. Archetti, E. Messina, "A DSS for Assessing the Impact of Environmental Quality on
Emergency Hospital Admissions", In proceedings of the WHCM 2010 - IEEE Workshop on Health Care Management, February
18-20, 2010 - Venice, Italy.
Submitted
D. Toscani, I. Giordani, F. Bargna, L. Quarenghi, F. Archetti. A software System for Data Integration and Decision Support for
Evaluation of Air Pollution Health Impact. Submitted to ICEIS 2010 - 12th International Conference on Enterprise Information
Systems. Funchal, Madeira – Portugal, 2010
Projects
In collaboration with SAL Lab.
INSYEME – Integrated Systems for Emergencies (MIUR - FIRB)
GREIS - Gestione del Risparmio Energetico attraverso Informazioni di Sicurezza (MIUR)
In collaboration withNOMADIS Lab.
H-CIM Health Care through Intelligent Monitoring (MIUR)
Submitted
FP7 ICT call 6 - STREP
OPENCITY Open framework for Transport Demand Management for smart and sustainable
urban mobility in an open and accessible city
Project Coordinator: Consorzio Milano Ricerche
In collaboration with SAL Lab. e Imaging & Vision Lab.
FLECS – FLy’s eyes for Collaborative Surveillance – (Progetto PON)
Financial Time Series
Financial Time Series & Scenario
Generation

Regime Switching Models

Observations: prices
St

Hidden var.: Regime
xt
Transition Model
p( xt | xt 1 )
Markov Chain
Observation Model
p( zStt | xt )
Mixture of Gaussians
(Autoregressive Process)
(Autoregressive) Hidden Markov Model
24
Financial Time Series

Extend state space models to more general Relational Dynamic Bayesian Networks to
account not only prices but also, through CPT, “exogenous” economic factors and
unstructured information

Algorithms for managing risk tracking portfolio using all available evidence and taking
into account all uncertainties
“Markets are good at gathering information from many heterogeneous sources and
combining it appropriately, the same we would expect from models”
Publications
G. Consigli, C. Manfredotti, E. Messina, A sequential learning method for tracking stochastic volatility, EURO XXIV, July
2010, Lisbon
Projects & Collaborations
PRIN 2007 "Modelli probabilistici per la rappresentazione dell’incertezza per la definizione di metodologie di
selezione del portafoglio” (Università di Bergamo, Università della Calabria)
Collaboration with Brunel University and CARISMA Research Centre:
Workshop “Application of Hidden Markov Models and Filters to Time Series Methods in Finance”, London,
September 2010
The cooperation network
University
of Toronto
Brunel
University
CARISMA
Research
Center
Norwegian University of
Science and Technology
SB RAS Russia
Aachen University
Massachusset
Institute of
Technology
Harvard
Medical
School
Hungarian Academy
of Sciences
Centre of Research
and Technology
Hellas