Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
DBSI Teaser presentation The Beckman Report On Database Research Presented by: Akshita Anand (2012015) Priyanka Balotra (MT14018) Sakshi Agarwal (MT14043) 1 Content Characteristics of Big Data Research Challenges Community Challenges Conclusion 2 Characteristics Of Big Data Big Data is identified as a defining challenge for the field of Database. 3 Research Challenges 1. Scalable big/fast data infrastructures 2. Coping with diversity in data management 3. End-to-end processing of data 4 Challenge #1:Scalable Big Data Infrastructure 5 Challenge #2:Diversity in data management No one-size-fits-all. Cross-platform integration Integration of platforms Hiding heterogeneity Optimization of performance Programming models. Diversity in programming abstractions and reusuablilty Need of more than one language! Focus on domain- specific language Data processing workflows platforms that can span both "raw" and "cooked" data. example, querying data with SQL and then analyzing it with R 6 Challenge #3:End-to-end processing of data Data-to-knowledge pipeline steps of the raw-data-to-knowledge pipeline data acquisition; selection, assessment, cleaning, and transformation, extraction and integration etc. greater diversity of data and users Tool diversity need of multiple tools to solve each step of raw-data-to-knowledge pipeline Tool customizability domain knowledge, such as dictionaries, knowledge bases, and rules. easy to customize to a new domain Hand crafted rules are needed along with machine learning Ex- precision sensitive applications like e-commerce 7 Open source Few open source tools Mostly expensive proprietory products Understanding data Capturing and managing appropriate meta-information Eg. Facebook automatically identifies faces in the image so users can optionally tag them Knowledge base The more knowledge about a target domain, the better that tools can analyze the domain 8 Community Challenges Some of these are new, some old, brought by big data and are becoming increasingly important: Database education Data science Conclusion Database research has been restricted by the rigors of the enterprise and relational database systems Handling data diversity; exploiting new hardware, software, and cloudbased platforms; It is also time to rethink approaches to education, involvement with data consumers, and our value system and its impact on how we evaluate 9 THANK YOU