* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download The Beckman Report On Database Research
		                    
		                    
								Survey							
                            
		                
		                
                            
                            
								Document related concepts							
                        
                        
                    
						
						
							Transcript						
					
					DBSI Teaser presentation The Beckman Report On Database Research Presented by: Akshita Anand (2012015) Priyanka Balotra (MT14018) Sakshi Agarwal (MT14043) 1 Content  Characteristics of Big Data  Research Challenges  Community Challenges  Conclusion 2 Characteristics Of Big Data Big Data is identified as a defining challenge for the field of Database. 3 Research Challenges 1. Scalable big/fast data infrastructures 2. Coping with diversity in data management 3. End-to-end processing of data 4 Challenge #1:Scalable Big Data Infrastructure 5 Challenge #2:Diversity in data management  No one-size-fits-all.  Cross-platform integration  Integration of platforms  Hiding heterogeneity  Optimization of performance  Programming models.  Diversity in programming abstractions and reusuablilty  Need of more than one language!  Focus on domain- specific language  Data processing workflows  platforms that can span both "raw" and "cooked" data.  example, querying data with SQL and then analyzing it with R 6 Challenge #3:End-to-end processing of data  Data-to-knowledge pipeline  steps of the raw-data-to-knowledge pipeline  data acquisition; selection, assessment, cleaning, and transformation, extraction and integration etc.  greater diversity of data and users  Tool diversity  need of multiple tools to solve each step of raw-data-to-knowledge pipeline  Tool customizability  domain knowledge, such as dictionaries, knowledge bases, and rules.  easy to customize to a new domain  Hand crafted rules are needed along with machine learning  Ex- precision sensitive applications like e-commerce 7  Open source  Few open source tools  Mostly expensive proprietory products  Understanding data  Capturing and managing appropriate meta-information  Eg. Facebook automatically identifies faces in the image so users can optionally tag them  Knowledge base  The more knowledge about a target domain, the better that tools can analyze the domain 8 Community Challenges  Some of these are new, some old, brought by big data and are becoming increasingly important:  Database education  Data science Conclusion  Database research has been restricted by the rigors of the enterprise and relational database systems  Handling data diversity; exploiting new hardware, software, and cloudbased platforms;  It is also time to rethink approaches to education, involvement with data consumers, and our value system and its impact on how we evaluate 9 THANK YOU