Difference between revisions of "Data Science"

From TedYunWiki
Jump to navigation Jump to search
Line 12: Line 12:
  
 
What is a database? '''A collection of information organized to afford efficient retrieval.'''
 
What is a database? '''A collection of information organized to afford efficient retrieval.'''
 +
Why do we need a database?
 +
* Sharing
 +
* Data model enforcement
 +
* Scale
 +
* Flexibility

Revision as of 00:58, 1 September 2013

The Three V's of Big Data

  • Volume: number of rows/objects/bytes
  • Variety: number of columns/dimensions/sources
  • Velocity: number of rows/bytes per unit time

(Veracity: Can we trust this data?)

Data Model

Three components:

  • Structures
  • Constraints
  • Operations

What is a database? A collection of information organized to afford efficient retrieval. Why do we need a database?

  • Sharing
  • Data model enforcement
  • Scale
  • Flexibility