Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Proposal for a Track at the 2006 ACM Symposium on Applied Computing on Data Mining Hasan Jamil Department of Computer Science Mississippi State University, USA [email protected] and Rosa Meo Department of Computer Science University of Torino, Italy [email protected] Data mining from traditional relational databases as well as from non-traditional ones such as semi-structured data, web data and scientific databases such as biological, earth and atmospheric, solar system, ecological, animal behavior, linguistic and sensor data have recently become a popular way of discovering hidden knowledge. In the context of relational and traditional data, methods such as association rules, chi square rules, ratio rules, implication rules, etc. have been proposed in multiple, varied contexts. In the context of non-traditional data, newer, more experimental yet novel techniques are being proposed. There is an agreement among the researchers across communities that data mining is a key ingredient for success in their respective area of research and development. Consequently, interest in developing new techniques for data mining has peaked and a tremendous stride is being made to answer interesting and fundamental questions in various disciplines using data mining. There is a new interest in developing techniques for obtaining solid data mining models from distributed databases with privacy and autonomy guarantees. In the past, researchers mainly focused on algorithmic issues in data mining and placed much emphasis on scalability. Recently, the focus has shifted towards a more declarative way of answering questions using data mining that gave rise to the concept of mining queries. Declarative queries facilitate building larger systems using small mining building blocks. In such a paradigm, the system assumes the responsibility of optimization and scalability. Such an approach will be extremely useful in developing solutions in complex systems such as scientific databases where numerous domain specific knowledge interact with mining queries. In such an environment, the choice of algorithm, execution method, and processing strategy using secondary information often become complicated and time consuming. A well-developed and robust implementation help eliminate relational and focus on their details. strategy for declarative systems can these obstacles in a way similar to deductive databases and let the users application rather than on low level As part of the 2006 ACM SAC meeting, we propose to organize a Track on "Data Mining" that will encourage submissions in all areas of data mining in traditional as well as emerging non-standard databases. We will emphasize submissions on declarative data mining, intelligent querying and associated issues such as optimization, indexing, query processing, languages and constraints. We also encourage submissions in the area of data preprocessing such as data cleaning, discretization and sampling. The study of new data models and techniques for privacy preserving data mining, and data security will be encouraged. This will allow also to exploit the synergy of mining in different databases, in parallel, distributed or grid environments. We aim at organizing at least five sessions consisting of about twenty papers in total. Because of our focus and emphasis on the issues presented before, this ACM track will be distinct from others such as SIGMOD, VLDB, ICDE and even SIGKDD, and PAKDD, and specialized workshops such as DMKD as their focus is too general and covers a broad range of issues. This will be our fifth such Track in ACM SAC. As you are aware, the previous editions of this Track on Data Mining were successful, and we would like to continue with the tradition and see this Track grow and evolve. If approved, as before, we will develop a web-site to manage the activities of the Track, and electronically advertise in the specialized user groups and research networks and institutions. We will advertise in DBWORLD list, SIGKDD list, KD-net, the European Network of Excellence in Knowledge Discovery, and our own list of data mining researchers that includes about 200 active researchers. We will develop a system to collect, review, select and put together an attractive program solely through the electronic media. In light of our experience and the increase in submissions last year, we plan to form a large program committee with almost 60 distinguished members including eminent researchers such as Jiawei Han, Carlo Zaniolo, Mohammed Zaki, and others for the purpose of reviewing submissions and developing an attractive Track program. Like previous years, we will also consider inviting selected authors to submit an extended edition of their contributions for a special issue of a journal or an edited book. We would like to continue our tradition and organize and manage the 2006 ACM SAC Data Mining jointly. Like the past year, Hasan Jamil and Rosa Meo will serve as the Program Co-Chairs for the 2006 track. We are also considering adding third Co-Chair to increase circulation, number of submission and increase diversity of geographic presentation. We will let SAC know about our decision soon. Dr. Hasan Jamil has significant experiences in organizing and managing such events and is involved in declarative data mining. Dr. Jamil has been involved in organizing AMAST Montreal and Sydney meetings. He was also one of the organizers of the 2000 IEEE BIBE Symposium and was the PC Chair for the 2001 and 2003 IEEE BIBE Symposiums. He was member of the PC of DaWaK 2001/2002 and 2001/2002 ACM SIGKDD workshop on Data Mining in Bioinformatics, DBFusion 2002, NGITS 2002, and so on. He is also a member of the IASTED Technical Committee on Databases, and the Chair of the IFIP TC 5 Special Interest Group on Bioinformatics. Dr. Hasan Jamil's research interests include databases and Bioinformatics. He has published several articles in leading database, logic programming and Bioinformatics conferences such as ACM SIGMOD, VLDB, ICDT, KR, ILPS, etc. He has organized the ACM SAC Data Mining track for the last four years. He also holds grants from NSF and USDA for his Bioinformatics projects. Dr. Jamil's home page at www.cs.wayne.edu/~jamil/ may be consulted for more information. Rosa Meo actively works in database and data mining research since the last ten years. She has developed database system prototypes for data mining and worked in European funded projects (V Framework) on data mining themes and inductive databases. She published papers on major database and data mining International Conferences and Journals, such as ACM TODS, Kluwer DMKD, IEEE IT, VLDB, ICDE, EDBT, etc. She has been co-chair in 2002 for DTDM (DataBase Technologies for Data Mining) Workshop at EDBT, and KDID (Knowledge Discovery in Inductive Database) at ECML/PKDD. This year (2005), she is a member of the PC of ACM SIGKDD, VLDB (core database and IIS tracks), IEEE ICDM, ECML/PKDD. Rosa Meo's home page at http://www.di.unito.it/~meo/ can be consulted for more information. As in the past, we hope to form a large Program Committee for the purpose of selecting outstanding papers and developing an interesting program. Tentatively, we propose the following PC for 2006 ACM SAC DM track. Proposed PC: Reda Alhajj Canada Elena Baralis [email protected], University of Calgary, [email protected], Politecnico di Torino, Italy Roberto Bayardo [email protected], IBM Almaden Research Center, USA Christian Bohm [email protected], UMIT, Austria Francesco Bonchi [email protected], ISTI-CNR, Pisa, Italy Marco Botta [email protected], University of Torino, Italy Jean-Francois Boulicaut [email protected], INSA LISI, Lyon, France Toon Calders [email protected], University of Antwerp, BelgiumSaso Dzeroski Bruno Cremilleux [email protected], GREYC Department d'Informatique, France Ding Chris [email protected], Lawrence Berkeley National Laboratory, USA Saso Dzeroski [email protected], Jozef Stefan Institute, Slovenia Johannes Gerke [email protected], Cornell University, USA Fosca Giannotti [email protected], CNUCE-CNR of Pisa, Italy Bart Goethals [email protected], Helsinki Institute for Information Technology (HIIT), Finland Le Gruenwald [email protected], University of Oklahoma, USA Dimitrios Gunopulos [email protected], University of California, Riverside, USA Jiawei Han [email protected], University of Illinois at Urbana-Champaign David Hand [email protected], Imperial College, London, UK Sherri Harms [email protected], University of Nebraska, Kearney, USA Tomasz Imielinski [email protected], Rutgers, the State University of New Jersey, USA Thorsten Joachims [email protected], Cornell University, USA Daniel A. Keim [email protected] , Martin-LutherUniversity Halle-Wittenberg, Germany Kristian Kersting [email protected], Albert-Ludwigs-University, Freiburg, Germany Marzena Kryszkiewicz [email protected], Warsaw University of Technology, Poland Krzysztof Koperski [email protected], Insightful Corporation Stefan Kramer [email protected], Institut für Informatik, Albert-Ludwigs-Universität Freiburg, Germany Pier Luca Lanzi [email protected], Politecnico di Milano, Italy Dominique Laurent [email protected], University of Tours, France Nada Lavrac [email protected], Jozef Stefan Institute, Slovenia Donato Malerba [email protected], University of Bari, Italy Giuseppe Manco [email protected], ICAR-CNR, Italy Andrew W. Moore [email protected], Carnegie Mellon University, USA Katharina Morik [email protected], University of Dortmund, Germany Raymond T. Ng [email protected], University of British Columbia, USA Salvatore Orlando [email protected], Università di Ca' Foscari, Italy Stefano Paraboschi [email protected], University of Bergamo, Italy Jian Pei [email protected], University at Buffalo, The State University of New York, USA Giuseppe Psaila [email protected], University of Bergamo, Italy Rauch Jan [email protected], University of Economics, Czech Republic Christophe Rigotti [email protected], INSA LISI, Lyon, France Domenico Sacca' [email protected], Universita' della Calabria, Italy Lorenza Saitta [email protected], AMEDEO AVOGADRO University, Eastern Piedmont, Italy Maria Luisa Sapino [email protected]. it, University of Torino, Italy Sunita Sarawagi [email protected], KR School of Information Technology, IIT Bombay, India Savinov Alexandr [email protected], Fraunhofer AIS, Germany Arno Siebes [email protected], Utrecht University, The Netherland Ramakrishnan Srikant [email protected], IBM Almaden Research Center, USA Einoshin Suzuki [email protected], Yokohama National University, Japan Hannu TT Toivonen [email protected], University of Helsinki, Finland Franco Turini [email protected], University of Pisa, Italy Jiong Yang [email protected], University of Illinois at Urbana Champaign, USA Philip S. Yu [email protected], IBM T.J. Watson Research Center, USA Raymond Wong [email protected], University of New South Wales, Australia Osmar Zaiane [email protected], University of Alberta, Edmonton, Alberta, Canada Mohammed Zaki [email protected], Rensselaer Polytechnic Institute, USA Kang Zhang [email protected], The University of Texas at Dallas, USA Carlo Zaniolo [email protected], University California Los Angeles, USA