Scaling Machine Learning with Spark: Distributed ML with MLlib, TensorFlow, and PyTorch

by Adi Polak

$58.62

List Price: $79.99
Save: $21.37 (26%)
  • In stock. Ships within 24 hours with free online tracking.
  • FREE DELIVERY by Tuesday, April 29, 2025
Description

Learn how to build end-to-end scalable machine learning solutions with Apache Spark. With this practical guide, author Adi Polak introduces data and ML practitioners to creative solutions that supersede today's traditional methods. You'll learn a more holistic approach that takes you beyond specific requirements and organizational goals, allowing data and ML practitioners to collaborate and understand each other better.

Scaling Machine Learning with Spark examines several technologies for building end-to-end distributed ML workflows based on the Apache Spark ecosystem with Spark MLlib, MLflow, TensorFlow, and PyTorch. If you're a data scientist who works with machine learning, this book shows you when and why to use each technology.

You will:

  • Explore machine learning, including distributed computing concepts and terminology
  • Manage the ML lifecycle with MLflow
  • Ingest data and perform basic preprocessing with Spark
  • Explore feature engineering, and use Spark to extract features
  • Train a model with MLlib and build a pipeline to reproduce it
  • Build a data system to combine the power of Spark with deep learning
  • Get a step-by-step example of working with distributed TensorFlow
  • Use PyTorch to scale machine learning, and explore its internal architecture

Product Details

  • Brand: O'Reilly Media
  • Pub Date: Apr 11, 2023
  • ISBN-13: 9781098106829
  • ISBN-10: 1098106822
  • Language: English
  • Dimensions: 8.75 in × 0.75 in × 6.75 in
  • Weight: 0 lb