Download Proposal for a Track at the 2006 ACM Symposium on Applied

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Nonlinear dimensionality reduction wikipedia , lookup

Transcript
Proposal for a Track at the 2006 ACM Symposium on Applied Computing on
Data Mining
Hasan Jamil
Department of Computer Science
Mississippi State University, USA
[email protected]
and
Rosa Meo
Department of Computer Science
University of Torino, Italy
[email protected]
Data mining from traditional relational databases as
well as from non-traditional ones such as
semi-structured data, web data and scientific
databases such as biological, earth and
atmospheric, solar system, ecological,
animal behavior, linguistic and sensor data
have recently become a popular way of discovering
hidden knowledge. In the context of relational and
traditional data, methods such as association rules,
chi square rules, ratio rules, implication rules, etc.
have been proposed in multiple, varied
contexts. In the context of non-traditional data,
newer, more experimental yet novel techniques are
being proposed. There is an agreement among the
researchers across communities that data mining is a
key ingredient for success in their respective area of
research and development. Consequently, interest in
developing new techniques for data mining has peaked
and a tremendous stride is being made to answer
interesting and fundamental questions in various
disciplines using data mining. There is a new interest in
developing techniques for obtaining solid data mining
models from distributed databases with privacy and autonomy
guarantees.
In the past, researchers mainly focused on algorithmic
issues in data mining and placed much emphasis on
scalability. Recently, the focus has shifted towards a
more declarative way of answering questions using data
mining that gave rise to the concept of mining
queries. Declarative queries facilitate building
larger systems using small mining building blocks. In
such a paradigm, the system assumes the responsibility of
optimization and scalability. Such an approach will be
extremely useful in developing solutions in complex
systems such as scientific databases where numerous
domain specific knowledge interact with mining
queries. In such an environment, the choice of
algorithm, execution method, and processing strategy
using secondary information often become complicated
and time consuming. A well-developed and robust
implementation
help eliminate
relational and
focus on their
details.
strategy for declarative systems can
these obstacles in a way similar to
deductive databases and let the users
application rather than on low level
As part of the 2006 ACM SAC meeting, we propose to
organize a Track on "Data Mining" that will encourage
submissions in all areas of data mining in traditional
as well as emerging non-standard databases. We will
emphasize submissions on declarative data mining,
intelligent querying and associated issues such as
optimization, indexing, query processing, languages
and constraints. We also encourage submissions in
the area of data preprocessing such as data cleaning,
discretization and sampling. The study of new data
models and techniques for privacy preserving
data mining, and data security will be encouraged.
This will allow also to exploit the synergy of mining
in different databases, in parallel, distributed or grid
environments.
We aim at organizing at least five sessions consisting
of about twenty papers in total. Because of our focus and
emphasis on the issues presented before, this ACM track will
be distinct from others such as SIGMOD, VLDB, ICDE and
even SIGKDD, and PAKDD, and specialized workshops such as
DMKD as their focus is too general and covers a broad
range of issues.
This will be our fifth such Track in ACM SAC. As you
are aware, the previous editions of this Track on Data
Mining were successful, and we would like to
continue with the tradition and see this Track grow and
evolve. If approved, as before, we will develop a web-site
to manage the activities of the Track, and electronically
advertise in the specialized user groups and research
networks and institutions. We will advertise in DBWORLD list,
SIGKDD list, KD-net, the European Network of
Excellence in Knowledge Discovery, and our own list of
data mining researchers that includes about 200 active
researchers. We will develop a system to collect,
review, select and put together an attractive program
solely through the electronic media. In light of
our experience and the increase in submissions last
year, we plan to form a large program committee with
almost 60 distinguished members including eminent
researchers such as Jiawei Han, Carlo Zaniolo,
Mohammed Zaki, and others for the purpose of reviewing
submissions and developing an attractive Track
program. Like previous years, we will also consider
inviting selected authors to submit an extended
edition of their contributions for a special issue of
a journal or an edited book.
We would like to continue our tradition and organize
and manage the 2006 ACM SAC Data Mining jointly. Like
the past year, Hasan Jamil and Rosa Meo will serve as
the Program Co-Chairs for the 2006 track. We are also
considering adding third Co-Chair to increase circulation,
number of submission and increase diversity of geographic
presentation. We will let SAC know about our decision soon.
Dr. Hasan Jamil has significant experiences in
organizing and managing such events and is involved in
declarative data mining. Dr. Jamil has been involved
in organizing AMAST Montreal and Sydney meetings. He
was also one of the organizers of the 2000 IEEE BIBE
Symposium and was the PC Chair for the 2001 and 2003
IEEE BIBE Symposiums. He was member of the PC of DaWaK
2001/2002 and 2001/2002 ACM SIGKDD workshop on Data
Mining in Bioinformatics, DBFusion 2002, NGITS 2002,
and so on. He is also a member of the IASTED Technical
Committee on Databases, and the Chair of the IFIP TC 5
Special Interest Group on Bioinformatics. Dr. Hasan
Jamil's research interests include databases and
Bioinformatics. He has published several articles in
leading database, logic programming and Bioinformatics
conferences such as ACM SIGMOD, VLDB, ICDT, KR, ILPS,
etc. He has organized the ACM SAC Data Mining track for
the last four years. He also holds grants from NSF and
USDA for his Bioinformatics projects. Dr. Jamil's home
page at www.cs.wayne.edu/~jamil/ may be consulted
for more information.
Rosa Meo actively works in database and data mining
research since the last ten years. She has developed
database system prototypes for data mining and worked
in European funded projects (V Framework) on data
mining themes and inductive databases. She published
papers on major database and data mining International
Conferences and Journals, such as ACM TODS, Kluwer
DMKD, IEEE IT, VLDB, ICDE, EDBT, etc. She has been
co-chair in 2002 for DTDM (DataBase Technologies for
Data Mining) Workshop at EDBT, and KDID (Knowledge
Discovery in Inductive Database) at ECML/PKDD.
This year (2005), she is a member of the PC of ACM
SIGKDD, VLDB (core database and IIS tracks),
IEEE ICDM, ECML/PKDD. Rosa Meo's home page at
http://www.di.unito.it/~meo/ can be consulted for
more information.
As in the past, we hope to form a large Program
Committee for the purpose of selecting outstanding
papers and developing an interesting program.
Tentatively, we propose the following PC for 2006 ACM
SAC DM track.
Proposed PC:
Reda Alhajj
Canada
Elena Baralis
[email protected], University of Calgary,
[email protected], Politecnico di Torino, Italy
Roberto Bayardo
[email protected], IBM Almaden Research Center,
USA
Christian Bohm
[email protected], UMIT, Austria
Francesco Bonchi
[email protected], ISTI-CNR, Pisa,
Italy
Marco Botta
[email protected], University of Torino, Italy
Jean-Francois Boulicaut [email protected], INSA
LISI, Lyon, France
Toon Calders
[email protected], University of Antwerp,
BelgiumSaso Dzeroski
Bruno Cremilleux
[email protected], GREYC Department
d'Informatique, France
Ding Chris
[email protected], Lawrence Berkeley National
Laboratory, USA
Saso Dzeroski
[email protected], Jozef Stefan Institute,
Slovenia
Johannes Gerke
[email protected], Cornell University, USA
Fosca Giannotti
[email protected], CNUCE-CNR of Pisa,
Italy
Bart Goethals
[email protected], Helsinki Institute
for
Information Technology (HIIT), Finland
Le Gruenwald
[email protected], University of Oklahoma, USA
Dimitrios Gunopulos [email protected], University of California,
Riverside, USA
Jiawei Han
[email protected], University of Illinois at
Urbana-Champaign
David Hand
[email protected], Imperial College, London, UK
Sherri Harms
[email protected], University of Nebraska, Kearney,
USA
Tomasz Imielinski
[email protected], Rutgers, the State
University of New Jersey, USA
Thorsten Joachims
[email protected], Cornell University, USA
Daniel A. Keim
[email protected] , Martin-LutherUniversity
Halle-Wittenberg, Germany
Kristian Kersting
[email protected],
Albert-Ludwigs-University, Freiburg, Germany
Marzena Kryszkiewicz [email protected], Warsaw University of Technology,
Poland
Krzysztof Koperski
[email protected], Insightful Corporation
Stefan Kramer
[email protected], Institut für
Informatik, Albert-Ludwigs-Universität Freiburg, Germany
Pier Luca Lanzi
[email protected], Politecnico di Milano, Italy
Dominique Laurent
[email protected], University of Tours, France
Nada Lavrac
[email protected], Jozef Stefan Institute,
Slovenia
Donato Malerba
[email protected], University of Bari, Italy
Giuseppe Manco
[email protected], ICAR-CNR, Italy
Andrew W. Moore
[email protected], Carnegie Mellon University, USA
Katharina Morik
[email protected], University of
Dortmund, Germany
Raymond T. Ng
[email protected], University of British Columbia, USA
Salvatore Orlando
[email protected], Università di Ca' Foscari,
Italy
Stefano Paraboschi
[email protected], University of Bergamo,
Italy
Jian Pei
[email protected], University at Buffalo,
The
State University of New York, USA
Giuseppe Psaila
[email protected], University of Bergamo, Italy
Rauch Jan
[email protected], University of Economics, Czech
Republic
Christophe Rigotti
[email protected], INSA LISI,
Lyon, France
Domenico Sacca'
[email protected], Universita' della Calabria, Italy
Lorenza Saitta
[email protected], AMEDEO AVOGADRO
University, Eastern Piedmont, Italy
Maria Luisa Sapino
[email protected]. it, University of Torino, Italy
Sunita Sarawagi
[email protected], KR School of Information
Technology, IIT Bombay, India
Savinov Alexandr
[email protected], Fraunhofer
AIS,
Germany
Arno Siebes
[email protected], Utrecht University, The
Netherland
Ramakrishnan Srikant [email protected], IBM Almaden Research Center,
USA
Einoshin Suzuki
[email protected], Yokohama National University,
Japan
Hannu TT Toivonen
[email protected], University of
Helsinki, Finland
Franco Turini
[email protected], University of Pisa, Italy
Jiong Yang
[email protected], University of Illinois at
Urbana
Champaign, USA
Philip S. Yu
[email protected], IBM T.J. Watson Research Center,
USA
Raymond Wong
[email protected], University of New South
Wales,
Australia
Osmar Zaiane
[email protected], University of Alberta,
Edmonton, Alberta, Canada
Mohammed Zaki
[email protected], Rensselaer Polytechnic Institute,
USA
Kang Zhang
[email protected], The University of Texas at
Dallas, USA
Carlo Zaniolo
[email protected], University California Los
Angeles, USA