Advanced analytics with Spark

"In this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark. The authors bring Spark, statistical methods, and real-world data sets together to teach you how to approach analytics problems by example. You&#...

Full description

Permalink: http://skupni.nsk.hr/Record/fer.KOHA-OAI-FER:45711/TOC
Vrsta građe: Knjiga
Jezik: eng
Impresum: 2015.
Izdanje: 1. ed
Predmet:
Sadržaj:
  • Analyzing big data
  • Introduction to data analysis with Scala and Spark
  • Recommending music and the audioscrobbler data set
  • Predicting forest cover with decision trees
  • Anomaly detection in network traffic with K-means clustering
  • Understanding Wikipedia with latent semantic analysis
  • Analyzing co-occurrence networks with GraphX
  • Geospatial and temporal data analysis on the New York City taxi trip data
  • Estimating financial risk through Monte Carlo simulation
  • Analyzing genomics data and the BDG project
  • Analyzing neuroimaging data with PySpark and Thunder.