* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download The Beckman Report On Database Research
Survey
Document related concepts
Transcript
DBSI Teaser presentation The Beckman Report On Database Research Presented by: Akshita Anand (2012015) Priyanka Balotra (MT14018) Sakshi Agarwal (MT14043) 1 Content Characteristics of Big Data Research Challenges Community Challenges Conclusion 2 Characteristics Of Big Data Big Data is identified as a defining challenge for the field of Database. 3 Research Challenges 1. Scalable big/fast data infrastructures 2. Coping with diversity in data management 3. End-to-end processing of data 4 Challenge #1:Scalable Big Data Infrastructure 5 Challenge #2:Diversity in data management No one-size-fits-all. Cross-platform integration Integration of platforms Hiding heterogeneity Optimization of performance Programming models. Diversity in programming abstractions and reusuablilty Need of more than one language! Focus on domain- specific language Data processing workflows platforms that can span both "raw" and "cooked" data. example, querying data with SQL and then analyzing it with R 6 Challenge #3:End-to-end processing of data Data-to-knowledge pipeline steps of the raw-data-to-knowledge pipeline data acquisition; selection, assessment, cleaning, and transformation, extraction and integration etc. greater diversity of data and users Tool diversity need of multiple tools to solve each step of raw-data-to-knowledge pipeline Tool customizability domain knowledge, such as dictionaries, knowledge bases, and rules. easy to customize to a new domain Hand crafted rules are needed along with machine learning Ex- precision sensitive applications like e-commerce 7 Open source Few open source tools Mostly expensive proprietory products Understanding data Capturing and managing appropriate meta-information Eg. Facebook automatically identifies faces in the image so users can optionally tag them Knowledge base The more knowledge about a target domain, the better that tools can analyze the domain 8 Community Challenges Some of these are new, some old, brought by big data and are becoming increasingly important: Database education Data science Conclusion Database research has been restricted by the rigors of the enterprise and relational database systems Handling data diversity; exploiting new hardware, software, and cloudbased platforms; It is also time to rethink approaches to education, involvement with data consumers, and our value system and its impact on how we evaluate 9 THANK YOU