Download The Beckman Report On Database Research

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Database wikipedia , lookup

Relational model wikipedia , lookup

Big data wikipedia , lookup

Clusterpoint wikipedia , lookup

Functional Database Model wikipedia , lookup

Object-relational impedance mismatch wikipedia , lookup

Database model wikipedia , lookup

Transcript
DBSI Teaser presentation
The Beckman Report On Database Research
Presented by: Akshita Anand (2012015)
Priyanka Balotra (MT14018)
Sakshi Agarwal (MT14043)
1
Content

Characteristics of Big Data

Research Challenges

Community Challenges

Conclusion
2
Characteristics Of Big Data
Big Data is identified as a defining challenge for the field of Database.
3
Research Challenges
1. Scalable big/fast data infrastructures
2. Coping with diversity in data management
3. End-to-end processing of data
4
Challenge #1:Scalable Big Data Infrastructure
5
Challenge #2:Diversity in data management
 No one-size-fits-all.
 Cross-platform integration
 Integration of platforms
 Hiding heterogeneity
 Optimization of performance
 Programming models.
 Diversity in programming abstractions and reusuablilty
 Need of more than one language!
 Focus on domain- specific language
 Data processing workflows
 platforms that can span both "raw" and "cooked" data.
 example, querying data with SQL and then analyzing it with R
6
Challenge #3:End-to-end processing of data
 Data-to-knowledge pipeline
 steps of the raw-data-to-knowledge pipeline
 data acquisition; selection, assessment, cleaning, and
transformation, extraction and integration etc.
 greater diversity of data and users
 Tool diversity
 need of multiple tools to solve each step of raw-data-to-knowledge
pipeline
 Tool customizability
 domain knowledge, such as dictionaries, knowledge bases, and
rules.
 easy to customize to a new domain
 Hand crafted rules are needed along with machine learning
 Ex- precision sensitive applications like e-commerce
7
 Open source
 Few open source tools
 Mostly expensive proprietory products
 Understanding data
 Capturing and managing appropriate meta-information
 Eg. Facebook automatically identifies faces in the image so users
can optionally tag them
 Knowledge base
 The more knowledge about a target domain, the better that tools
can analyze the domain
8
Community Challenges
 Some of these are new, some old, brought by big data and are
becoming increasingly important:
 Database education
 Data science
Conclusion
 Database research has been restricted by the rigors of the enterprise
and relational database systems
 Handling data diversity; exploiting new hardware, software, and cloudbased platforms;
 It is also time to rethink approaches to education, involvement with data
consumers, and our value system and its impact on how we evaluate
9
THANK YOU