Download Plantilla PowerPoint

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Choice modelling wikipedia , lookup

Transcript
MongoDB
Lugar y Fecha
Telefónica
I+D
Mongodb is...
A
document-oriented storage.
JSON-style
power.
documents
with
dynamic
schemas
offer
simplicity
Querying.
Rich, document-base queries.
Fast
In-Place Updates.
Atomic modifiers for contention-free performance.
Full
index Support.
Index on any attribute.
Auto-Sharding.
Scale horizontally without compromising funcionality.
Replication
& High Availability.
Mirror across LAN for scale.
Map/Reduce
Flexible (but limited) data processing.
Telefónica I+D
2
and
Mongodb is Not...
A
A
ACID database.
batch procesing platform.
Telefónica I+D
3
Mongodb Architecture
 MongoDB
is a document-oriented database, but this not implica
that it has the same concepts of RDMS, one of this concepts is
the “database”, within a MongoDB instance you can have zero or
more databases, each acting as high-level containers for
everything else
A
database can have zero or more “collections”, A collection
shares enough in common with a traditional table, a collection is
the same as a table with a indeterminate number of columns.
 Collections
is made up of zero or more documents.
document can safely be thought of as a “row”.
Again,
a
A
document is made up one or more fields, which yo can probable
guess are a lot like “columns”.
 Indexes
 Cursors
in MongoDB function much like their RDBMS counterparts.
are different than the other five concepts but they are
important enough. The import thing to understand about cursors is
that when you ask MongoDB for data, it returns a cursor, which we
can do things to, such as counting or skipping ahead, without
actually pulling down data.
Telefónica I+D
4
Mongodb Architecture
 Mongo
uses memory mapped files to access data, which results in
large numbers being displayed in tools like top for the mongod
process.
 Mongo
eliminates the need (in some cases) for a separate object
caching layer. Queries that result in file system RAM cache hits
are very fast as the object's representation in the database is
very close to its representation in application memory. Also, the
MongoDB can scale to any level and provides an object cache and
database integrated together, which is very helpful as there is
no risk of retrieving stale data from the cache. In addition, the
complex queries a full DBMS provides are also possible.
 MongoDB
supports
write-ahead
journaling
of
operations
facilitate fast crash recovery and durability in the storage
Telefónica I+D
5
to
Mongodb Api
Telefónica I+D
Data Type
Type Number
Double
1
String
2
Object
3
Array
4
Binary data
5
Object id
7
Boolean
8
Date
9
Null
10
Regular Expresion
11
6
Mongodb Api
 The
conection
to
the
databases
front
the
aplications are a propiatery protocol that runs over
TCP/IP in binary format. The apis have a control the
write behavior for with various options, as well as
exception raising on error conditions.
 NONE: No exceptions are
 NORMAL: Exceptions are
raised, even for network issues
raised for network issues, but
not server errors
 SAFE:
Exceptions are raised for network issues, and
server errors; waits on a server for the write
operation
 FSYNC_SAFE:
Exceptions are raised for network issues,
and server errors and the write operation waits for the
server to flush the data to disk
 REPLICAS_SAFE:
Exceptions are raised for network
issues, and server errors; waits for at least 2 servers
for the write operation
Telefónica I+D
7
Mongodb Api
 Insert:
it is the basic method for adding data into
MongoDB. When we insert a new document into a
collection this will add an “_id” key to the
document before save it. When insert a new object
into the collection, the driver coverts the data
structure into BSON, which
it then sends to the
database. Then it saves the document, and regenerate
the _id index, and all the secondary index that the
new object has.
 db.collection.insert(doc);
 db.collection.save(doc); // updates if exists; inserts if new (_id)
 Delete: it is the basic method for deleting data.
 db.videos.remove( { rating : { $lt : 3.0 }, $atomic : true } )
Telefónica I+D
8
Mongodb Api
 Query:
MongoDB's support for dynamic (ad hoc)
queries. MongoDB supports a number of query objects
for fetching data.
 Db.collection.find(query,fields)
 Method:
 count()
 sort()
 limit()
 skip()
Telefónica I+D
9
Mongodb Api
Query Modifier Operations
$gt (>) , $gte(>=), $lt (<), $gle(<=)
db.collection.find({ "field" : { $gt: value } } );
$all
db.things.find( { a: { $all: [ 2, 3 ] } } );
$exists
db.things.find( { a : { $exists : true } } ); present
$mod
db.things.find( { a : { $mod : [ 10 , 1 ] } } )
$ne
db.things.find( { x : { $ne : 3 } } );
$in
db.things.find({j:{$in: [2,4,6]}});
$nin
db.things.find({j:{$nin: [2,4,6]}});
$or
db.foo.find( { $or : [ { a : 1 } , { b : 2 } ] } )
$size
db.things.find( { a : { $size: 1 } } );
$type
db.things.find( { a : { $type : 2 } } );
$elemMatch
t.find( { x : { $elemMatch : { a : 1}})
$regex
db.customers.find( { name : { $regex : 'acme.*corp', $options: 'i' } } );
$where
db.myCollection.find( { $where: "this.a > 3" } );
Telefónica I+D
10
Mongodb Api
 Update:
replaces the document matching criteria
entirely with objNew. If you only want to modify
some fields, you should use the atomic modifiers
below.
 db.collection.update( criteria, objNew, upsert, multi ).
 Arguments
 criteria - query which selects the record to update;
 objNew - updated object or $ operators (e.g., $inc) which manipulate
the object

upsert - if this should be an "upsert"; that is, if the record does
not exist, insert it

multi - if all documents matching criteria should be updated
Telefónica I+D
11
Mongodb Api
Update Modifier Operations
$inc
{ $inc : { field : value } }
$set
{ $set : { field : value } }
$unset
{ $unset : { field : 1} }
$push
{ $push : { field : value } }
$pushAll
{ $pushAll : { field : value_array } }
$addToSet
{ $addToSet : { field : value } }
$pop
{ $pop : { field : 1 } }
{ $pop : { field : -1 } }
$pull
{ $pull : { field : _value } }
$pullAll
{ $pullAll : { field : value_array } }
$rename
{ $rename : { old_field_name : new_field_name } }
$bit
{$bit : {field : {and : 5, or : 2}}}
$ ( positional number)
Telefónica I+D
{$inc:{'comments.$.votes':1}}
12
Mongodb Api
 Upserts
: it is a special type of update, in where
if no document is found to upate, a new document
will be created in a atomic operation.

db.jobs.findAndModify({query: {},update : {}, fields: {},
Arguments
Query
{}
Sort
{}
Remove
N/A
Update
N/A
New
False
Fields
All fields
upsert
False
Telefónica I+D
13
new: false } )
Mongodb Api
Example in console.
Telefónica I+D
14
Mongodb Index
These indexes are implemented as "B-Tree" indexes.
the indexes in MongoDB work a lot like indexes in
a
relational database: they help improve query and
sorting performance, the indexes are created manual,
and could be unique, ascending or descending.
Index on _id is always created and can`t be deleted.
I can index on a key inside of an embedded document.
Telefónica I+D
15
Mongodb Index
Example in console.
Telefónica I+D
16
Mongodb Sharding
MongoDB supports auto-sharding, and auto.balancing.
Sharding is an approach to scalability which
separates your data across multiple servers.
With sharding we instead scale horizontally to
achieve
the
same
computacional/storage/memory
footprint from smaller servers.
We will show the vertically scale db and the
horizontally scaled db for comparison.
A sharded collection has a shard key. The collection
is partitioned using this key. In this example we
uses ident.
 {ident:…, data:…}
Telefónica I+D
17
Mongodb Sharding
There are three components in sharding.
 Mongos: routing procesing.
 Config servers: store the metadata.
 Mongod: store the data.
Range
Shard
Ident in [-inf,200)
2
Ident in (200, 400]
4
…
Ident in [1000,inf)
Telefónica I+D
18
10
Mongodb Sharding
Metadata
is
maintained
on
chunks
represented by shard key ranges.
Range
which
Shard
Ident in [-inf,200)
2
Ident in (200, 400]
4
…
Ident in [1000,inf)
Telefónica I+D
19
are
10
Mongodb Sharding
Querys
with the shard key are sent to the node who
has this shard.
Query
Find({ident :{$gt 100, $lt:150})
Telefónica I+D
20
Shards
2
Mongodb Sharding
Querys
with the shard key and the data are in some
shard, it as sent to these nodes.
Query
Find({ident :{$gt 100, $lt:250})
Telefónica I+D
21
Shards
2,3
Mongodb Sharding
Querys
with the no shard key are sent to all nodes.
Query
Find({data:{$gt 100, $lt:250})
Telefónica I+D
22
Shards
2,3
Mongodb Sharding
Example in console.
Telefónica I+D
23
Mongodb Replication
MongoDB
replication
works
similarly
to
how
relational database replication works. Writes are
sent to a single server, the master, which then
synchronizes itself to one or more other servers,
the slaves. If the master goes down, a slave can be
promoted to act as the new master.
While replication can improve performance (by
distributing reads), its main purpose is to increase
reliability.
There are two types of replication:
 Master slaver replication.
 Replica sets.
Telefónica I+D
24
Mongodb Replication
Master
Setup
slaver replication.
master:
 bin/mongod –master
Setup slaver:
 bin/mongod --slave
Telefónica I+D
–oplogSize 100
--source <IP>[:<port>] --slavedelay 1 --autoresync
25
Mongodb Replication
Replica
sets.
 mongod --replSet “setname”
Shell in one node
cfg = {
... _id : “setname",
... members : [
... { _id : 0, host : “host1" },
... { _id : 1, host : “host2" },
... { _id : 2, host : “host3" } ] }
rs.initiate(cfg)
rs.status()
Options
arbiterOnly
buildIndexes
hidden
priority
slaveDelay
votes
tags
Telefónica I+D
26
Mongodb Map&Reduce
DB
is useful for batch
aggregation operations.
Telefónica I+D
27
processing
of
data
and
Mongodb Tools
Mongodump: export to binary data
Mongoexport: export to csv
Mongofiles: load data into mongodb
MongoImport: import txt, csv or json data
mongoRestore: mongorestore takes the output
from
mongodump: and restores it.
mongoSniff: This utility is to MongoDB what tcpdump
is to TCP/IP
Mongostat: Use the mongostat utility to quickly view
statistics on a running mongod instance
Telefónica I+D
28