Download CURRICULUM VITAE Reuven Kashi

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Nonlinear dimensionality reduction wikipedia , lookup

Cluster analysis wikipedia , lookup

Transcript
CURRICULUM VITAE
Reuven Kashi
CONTACT INFORMATION:
RUTCOR – Rutgers University
640 Bartholomew Road
Piscataway, NJ 08854, USA
Telephone: (+1)-732-445-4859
Fax: (+1)-732-445-5472
Email: [email protected]
Url: http://www.cs.biu.ac.il/∼kashi
CURRENT POSITION:
Postdoctoral Researcher at RUTCOR – Rutgers Center for Operations Research, Rutgers University, New
Jersey, USA.
RESEARCH INTERESTS:
My main research interests are in the field of data mining and knowledge discovery with a focus on massive
data sets. This research is mainly concerned with the development of new methodologies and advanced
algorithms that scale to the huge bodies of data that have been gathered and stored on many kinds of database
systems. The interdisciplinary nature of this field involves techniques from databases, algorithms design and
analysis, statistics, data visualization, machine learning and artificial intelligence. I have worked on the
development of new automatic tools for finding implicit patterns (hypotheses) and display useful summaries
of quantitative data. I have also solved some of the limitations and shortcomings of traditional data mining
models (in areas such as clustering, classification, link analysis, etc.), and explored new application areas
enabled by the new mining capabilities.
EDUCATION:
1997 – 2003:
Bar-Ilan University (Ramat Gan, Israel).
Ph.D. in Computer Science
Field of Research: Data Mining and Analysis
Thesis subject: Quantitative Data Mining Using Data Visualization.
Advisors: Amihood Amir and Nathan Netanyahu
1995 – 1997:
Bar-Ilan University (Ramat Gan, Israel).
M.Sc. in Computer Science
Magna cum Laude
University’s Award for Excellence of best M.Sc. thesis
Field of Research: Data Mining
Thesis subject: A New and Versatile Method for Association Generation.
Advisors: Amihood Amir and Ronen Feldman
–1–
1992 – 1995:
Bar-Ilan University (Ramat Gan, Israel).
B.Sc. in Computer Science and Mathematics
Magna cum Laude
FIELD OF INTERESTS:
• Algorithms Design and Analysis
• Databases, Data Mining and Knowledge Discovery
◦ Quantitative Queries
◦ Clustering Large Databases
◦ Mining Association Rules
◦ Approximation Algorithms
• Algorithms for Massive Data Sets
◦ Automatic Hypotheses Generation
◦ Streaming Algorithms
• Data and Information Visualization
• Pattern Matching
AWARDS AND FELLOWSHIPS:
• RUTCOR Postdoctoral Fellowship, 2003–2004.
• ICDM Student Travel Award provided by IBM Research, 2003.
• Travel supported by NSF grant CCR-9610170, Data Mining, Information Retrieval and Pattern Matching – Application-driven Algorithmic Research, 2002–2003
• ISF fellowship 282/01, On The Theory of Search and Applications, 2002–2003.
• AT&T Labs - Research Internship, Shannon Laboratory, Florham Park, NJ, USA, February 2002.
• Ph.D. and M.Sc. Fellowships, Department of Mathematics and Computer Science, Bar-Ilan University,
Ramat Gan, Israel, 1996–2002.
• Rector’s Award for Excellence of best M.Sc. thesis, Bar Ilan University, Ramat Gan, Israel, 1996.
• Dean’s Award for Undergraduate Top-10 List of Excellence, Bar Ilan University.
INDUSTRIAL BACKGROUND:
• Programming: C, C++.
• Programming Languages and Packages: HTML, PERL, JavaScript, MATLAB and SAS.
• Operating Systems: MS Windows, Unix.
• Profound knowledge and experience with developing and implementing algorithms, in particular:
◦ Algorithms that requires mathematical knowledge and some probabilistic analysis.
◦ Developing and implementing various Knowledge Discovery in Databases (KDD) algorithms.
• Automatic Hypotheses Generation: Developed and implemented new techniques for finding local and
non-linear interrelations in large multivariate quantitative databases. The proposed methodology generates and ranks hypotheses of subsets of data and attributes, according to the significance of their
relation in a given database. This enables to discover implicit but potentially useful patterns in the
data, which cannot be easily discovered by traditional analysis tools.
• High Dimensional (Projected) Clustering: Suggested and implemented an approach that makes it possible to automatically analyze high dimensional data for arbitrary low dimensional (projected) clusters.
This approach is robust as far as noise is concerned; it enables the discovery of projected clusters with
different topologies without any prior domain knowledge, and allows separating overlapping clusters.
–2–
• Visualization of High Dimensional Data: Developed and implemented an approach to aid the automatic
process of analyzing high-dimensional data for distinguishable structures. The basic idea consists of
supporting the decisive steps of a further developed automated clustering process with visualization
techniques. This visualization technique allows an easy identification of important data characteristics,
which can be used for the calculation of the best projections and separators.
• Shape-Embedded-Histograms for Visual Data Mining: Designed and implemented a new visualization
technique, that using the available 2-D screen space to present more detailed information. This technique retain the intuitiveness of traditional techniques such as Bar Charts and Scatter Plots, while
allowing the visualization of large amounts of data in an effective way, and avoiding problems with
over plotting or occlusion.
ACADEMIC TEACHING:
Lectured and TA’ed graduate and undergraduate courses in the fields of Computer Science and Discrete
Mathematics, such as: Introduction to Algorithms, Algorithms I , Advanced Algorithms, Algorithms II, Data
Mining, Seminar on Data Mining, Seminar on Pattern Matching, Seminar on Algorithms, Projects Instructor, Introduction to Databases, Database Systems, Introduction to Computing, Data Structures, Advanced
Data Structures, and Complex Functions.
PUBLICATIONS:
A. Journal Papers
1. A. Amir, R. Kashi and N. S. Netanyahu, Analyzing Quantitative Databases: Image is Everything,
Submitted.
2. A. Amir, R. Feldman and R Kashi, A New and Versatile Method for Association Generation,
Journal of Information Systems (IS), 22(6/7): 333-347, 1997.
B. Conference Papers (refereed - with Proceedings)
1. A. Amir, R. Kashi, Daniel A. Keim, N. S. Netanyahu and M. Wawryniuk, Shape-EmbeddedHistograms for Visual Data Mining, In Proceedings of the VisSym - Joint Eurographics - IEEE
TCVG Symposium on Visualization (VisSym04), May 19-21, Konstanz, Germany, 2004.
2. A. Amir, R. Kashi, Daniel A. Keim, N. S. Netanyahu and M. Wawryniuk, Analyzing HighDimensional Data by Subspace Validity, In Proceedings of the 3rd IEEE International Conference
on Data Mining (ICDM), November 19-22, 2003, Melbourne, Florida, USA, pages 473-476.
3. A. Amir, R. Kashi and N. S. Netanyahu, Efficient Multidimensional Quantitative Hypotheses
Generation,In Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM),
November 19-22, 2003, Melbourne, Florida, USA, pages 3-10.
4. A.Amir, R. Kashi and N. S. Netanyahu: Analyzing Quantitative Databases: Image is Everything,
In Proceedings of 27th International Conference on Very Large Data Bases (VLDB), September
11-14, 2001, Roma, Italy, pages 89-98.
5. A. Amir, R. Feldman and R. Kashi, A New and Versatile Method for Association Generation,
In Proceedings of the First European Symposium on Principles of Data Mining and Knowledge
Discovery (PKDD), Trondheim, Norway, June 24-27, 1997, pages 221-231. (Also in Lecture Notes
in Computer Science, Vol. 1263, Springer, 1997)
C. Work In Progress
–3–
1. Optimal Support Set Selection in Data Analysis with Applications to Bioinformatics.
2. Efficient Subspace Validity Computation for Projected Clustering.
3. Exploratory Data Analysis of High-Dimensional Data by Pixel Validity Plots.
CONFERENCE, WORKSHOP AND SEMINAR PRESENTATIONS:
Conferences
1. The 3rd IEEE International Conference on Data Mining (ICDM), Melbourne, Florida, USA,
November 2003.
2. The 27th International Conference on Very Large Data Bases (VLDB), Roma, Italy, September
2001.
3. The First European Symposium on Principles of Data Mining and Knowledge Discovery (PKDD),
Trondheim, Norway, June 1997.
Workshops
1. Second Haifa Winter Workshop on Computer Science and Statistics University of Haifa, Israel,
December 2003.
2. Haifa Winter Workshop on Computer Science and Statistics, University of Haifa, Israel, December
2001.
3. The 13th Israeli Symposium on Artificial Intelligence, Computer Vision and Neural Networks, Tel
Aviv University, Israel, February 1997.
Seminars
1. Department of Statistics, University of Connecticut, Storrs, CT, USA, April 2004.
2. RUTCOR Seminar, Rutgers University, Piscataway, NJ, USA, April 2004.
3. Computer Science Colloquium, Center for Information Science and Technology, Temple University,
Philadelphia, PA, USA, February 2004.
4. Research Seminar, IBM T.J. Watson Research Center, Yorktown Heights, NY, USA, April 2002.
5. Computer Science Seminar, Rutgers University, Camden, NJ, USA, April 2002.
6. Research Seminar, AT&T Labs – Research, Shannon Laboratory, Florham Park, NJ, USA, February 2002.
7. Research Seminar, DIMACS Discrete Math – Theory of Computing Seminar, Rutgers University,
Piscataway, NJ, USA, February 2002.
8. Computer Science Colloquium, Bar-Ilan University, Israel, November 2001.
9. Computer Science Colloquium, Bar-Ilan University, Israel, May 1997.
PROFESSIONAL ACTIVITIES:
• Refereeing: IEEE TKDE – Transactions on Knowledge and Data Engineering, IEEE TNN – Transactions on Neural Networks, DKE – Journal on Data & Knowledge Engineering.
• Program committee: ICAI’04.
REFERENCES:
Available upon request.
–4–