Download Topics for PhD research in data management and systems

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Data analysis wikipedia , lookup

Asynchronous I/O wikipedia , lookup

Expense and cost recovery system (ECRS) wikipedia , lookup

Information privacy law wikipedia , lookup

Versant Object Database wikipedia , lookup

Database wikipedia , lookup

Data vault modeling wikipedia , lookup

Concurrency control wikipedia , lookup

Open data in the United Kingdom wikipedia , lookup

Operational transformation wikipedia , lookup

Object storage wikipedia , lookup

Business intelligence wikipedia , lookup

Clusterpoint wikipedia , lookup

Database model wikipedia , lookup

Transcript
Topics for PhD research in data
management and systems
Stratis D. Viglas
School of Informatics
University of Edinburgh
[email protected]
Stratis D. Viglas
www.inf.ed.ac.uk
Why study databases?
Databases? Isn’t that the boring part of accounting?
Stratis D. Viglas
www.inf.ed.ac.uk
Why study databases?
Databases? Isn’t that the boring part of accounting?
• Not really; database technology makes the world go round
• It’s not just about money from/to the bank
•
•
•
•
Digital asset management (multimedia, data streams)
Scientific applications: biological sciences, geosciences, DNA mapping
Decision support, data analysis, data mining
Big Data
Primary focus
Systems research; database systems are a well-understood area of
immediate impact
Stratis D. Viglas
www.inf.ed.ac.uk
Storage technology
• Support for flash memory and solid-state drives
• How does one optimise database systems for flash memory?
• What sort of algorithms and techniques should there be in place?
• Disparate storage layer
• Data structure optimisation
• Placement in storage across media with widely different performance
characteristics
• Persistent memory
• It is not volatile memory; it is not a disk
• Revisit all problems with a clean sheet
• Algorithms, placement, concurrency, logging, etc.
Stratis D. Viglas
www.inf.ed.ac.uk
Parallel processing
• Small-scale
• Multicore and manycore setups
• How does one orchestrate query processing in such environments?
• Parallel algorithms, scheduling techniques, software support for parallel
processing
• Large-scale
•
•
•
•
Stratis D. Viglas
Cluster-based processing and distributed computing
Expressive file structures for storage
Algorithms for massive parallelisation of data analytics
Data-stream management on clusters
www.inf.ed.ac.uk
Code generation
Front-end
Schemas
Query
Results
Parser
Evaluator
Syntax tree
Statistics
Optimizer
Library file
Linker
Plan
Types
Pages
Binary
Code
Generator
Compiler
Preparator
Executor
Catalog
DB tables
Buffer pool
Storage manager
Stratis D. Viglas
www.inf.ed.ac.uk