• No shipping costs from € 15, -
  • Lists and tips from our own specialists
  • Possibility of ordering without an account
  • No shipping costs from € 15, -
  • Lists and tips from our own specialists
  • Possibility of ordering without an account
Paperback | English
  • Available, delivery time is 4-5 working days
  • Not in stock in our shop
€84.50
  • From €15,- no shipping costs.
  • 30 days to change your mind and return physical products

Description

Updated to emphasize new features in Spark 2.4., this second edition shows data engineers and scientists why structure and unification in Spark matters. Specifically, this book explains how to perform simple and complex data analytics and employ machine-learning algorithms.

Jules S. Damji is an Apache Spark Community and Developer Advocate at Databricks. He is a hands-on developer with over 20 years of experience and has worked at leading companies, such as Sun Microsystems, Netscape, @Home, LoudCloud/Opsware, VeriSign, ProQuest, and Hortonworks, building large-scale distributed systems. He holds a B.Sc and M.Sc in Computer Science and MA in Political Advocacy and Communication from Oregon State University, Cal State, and Johns Hopkins University respectively. Denny Lee is a Technical Product Manager at Databricks. He is a hands-on distributed systems and data sciences engineer with extensive experience developing internet-scale infrastructure, data platforms, and predictive analytics systems for both on-premise and cloud environments. He also has a Masters of Biomedical Informatics from Oregon Health and Sciences University and has architected and implemented powerful data solutions for enterprise Healthcare customers. His current technical focuses include Distributed Systems, Apache Spark, Deep Learning, Machine Learning, and Genomics. Brooke Wenig is the Machine Learning Practice Lead at Databricks. She guides and assists customers in implementing machine learning pipelines, as well as teaching Distributed Machine Learning & Deep Learning courses. She received an MS in Computer Science from UCLA with a focus on distributed machine learning. She speaks Mandarin Chinese fluently and enjoys cycling. Tathagata Das is an Apache Spark committer and a member of the PMC. He's the lead developer behind Spark Streaming and currently develops Structured Streaming. Previously, he was a grad student in the UC Berkeley at AMPLab, where he conducted research about data-center frameworks and networks with Scott Shenker and Ion Stoica.

Specifications

  • Publisher
    O'Reilly Media
  • Edition
    2
  • Pub date
    Aug 2020
  • Pages
    300
  • Theme
    Databases
  • Dimensions
    233 x 178 mm
  • EAN
    9781492050049
  • Paperback
    Paperback
  • Language
    English

related products

Werken met een NAS

Werken met een NAS

Henk van de Kamer
€29.99
Handboek Access 2021

Handboek Access 2021

Wilfred de Feiter
€42.99
Wie niet tech is, is gezien

Wie niet tech is, is gezien

Martijn Vet
€10.00
Datacratisch werken

Datacratisch werken

Daan van Beek
€32.65
Handboek Marketing 4.0

Handboek Marketing 4.0

Paul Postma
€41.50