Difference between revisions of "Data Science"

From TedYunWiki
Jump to navigation Jump to search
Line 4: Line 4:
 
* Velocity: number of rows/bytes ''per unit time''
 
* Velocity: number of rows/bytes ''per unit time''
 
(Veracity: Can we trust this data?)
 
(Veracity: Can we trust this data?)
 +
 +
=== Data Model ===
 +
Three components:
 +
* Structures
 +
* Constraints
 +
* Operations

Revision as of 00:54, 1 September 2013

The Three V's of Big Data

  • Volume: number of rows/objects/bytes
  • Variety: number of columns/dimensions/sources
  • Velocity: number of rows/bytes per unit time

(Veracity: Can we trust this data?)

Data Model

Three components:

  • Structures
  • Constraints
  • Operations