{"product_id":"spark-in-action-second-edition","title":"Spark In Action Second Edition","description":"\u003cp\u003eThe Spark distributed data processing platform provides an easy-to-implement tool for ingesting, streaming, and processing data from any source. In \u003ci\u003eSpark in Action, Second Edition\u003c\/i\u003e, you’ll learn to take advantage of Spark’s core features and incredible processing speed, with applications including real-time computation, delayed evaluation, and machine learning. \u003c\/p\u003e \u003cp\u003e \u003c\/p\u003e \u003cp\u003eUnlike many Spark books written for data scientists, \u003ci\u003eSpark in Action, Second Edition \u003c\/i\u003eis designed for data engineers and software engineers who want to master data processing using Spark without having to learn a complex new ecosystem of languages and tools. You’ll instead learn to apply your existing Java and SQL skills to take on practical, real-world challenges.\u003c\/p\u003e \u003cp\u003e\u003cb\u003e \u003c\/b\u003e\u003c\/p\u003e \u003cp\u003e\u003cb\u003eKey Features\u003c\/b\u003e\u003c\/p\u003e \u003cp\u003e·  Lots of examples based in the Spark Java APIs using real-life dataset and scenarios \u003c\/p\u003e \u003cp\u003e·  Examples based on Spark v2.3 Ingestion through files, databases, and streaming \u003c\/p\u003e \u003cp\u003e·  Building custom ingestion process \u003c\/p\u003e \u003cp\u003e·  Querying distributed datasets with Spark SQL \u003c\/p\u003e \u003cp\u003e\u003cb\u003e \u003c\/b\u003e\u003c\/p\u003e \u003cp\u003eFor beginning to intermediate developers and data engineers comfortable programming in Java. No experience with functional programming, Scala, Spark, Hadoop, or big data is required.\u003c\/p\u003e \u003cp\u003e \u003c\/p\u003e \u003cp\u003e\u003cb\u003eAbout the technology \u003c\/b\u003e\u003c\/p\u003e \u003cp\u003e\u003cb\u003e \u003c\/b\u003e\u003c\/p\u003e \u003cp\u003eSpark is a powerful general-purpose analytics engine that can handle massive amounts of data distributed across clusters with thousands of servers. Optimized to run in memory, this impressive framework can process data up to 100x faster than most Hadoop-based systems.\u003cb\u003e\u003c\/b\u003e\u003c\/p\u003e \u003cp\u003e\u003cb\u003e \u003c\/b\u003e\u003c\/p\u003e \u003cp\u003e\u003cb\u003eAuthor Bio\u003c\/b\u003e\u003c\/p\u003eAn experienced consultant and entrepreneur passionate about all things data, \u003cb\u003eJean-Georges Perrin\u003c\/b\u003e was the first IBM Champion in France, an honor he’s now held for ten consecutive years. Jean-Georges has managed many teams of software and data engineers.","brand":"MediaPlace","offers":[{"title":"Default Title","offer_id":57316847747454,"sku":"NW9781617295522","price":50.95,"currency_code":"EUR","in_stock":true}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0817\/1379\/1261\/files\/9781617295522.jpg?v=1778719934","url":"https:\/\/mediaplace.com\/en-eu\/products\/spark-in-action-second-edition","provider":"MediaPlace","version":"1.0","type":"link"}