This book provides readers the “big picture” and a comprehensive
survey of the domain of big data processing systems. For the past
decade, the Hadoop framework has dominated the world of big data
processing, yet recently academia and industry have started to
recognize its limitations in several application domains and thus, it
is now gradually being replaced by a collection of engines that are
dedicated to specific verticals (e.g. structured data, graph data, and
streaming data). The book explores this new wave of systems, which it
refers to as Big Data 2.0 processing systems. After Chapter 1 presents
the general background of the big data phenomena, Chapter 2 provides
an overview of various general-purpose big data processing systems
that allow their users to develop various big data processing jobs for
different application domains. In turn, Chapter 3 examines various
systems that have been introduced to support the SQL flavor on top of
the Hadoop infrastructure and provide competing and scalable
performance in the processing of large-scale structured data. Chapter
4 discusses several systems that have been designed to tackle the
problem of large-scale graph processing, while the main focus of
Chapter 5 is on several systems that have been designed to provide
scalable solutions for processing big data streams, and on other sets
of systems that have been introduced to support the development of
data pipelines between various types of big data processing jobs and
systems. Next, Chapter 6 focuses on covering the emerging frameworks
and systems in the domain of scalable machine learning and deep
learning processing. Lastly, Chapter 7 shares conclusions and an
outlook on future research challenges. This new and considerably
enlarged second edition not only contains the completely new chapter
6, but also offers a refreshed content for the state-of-the-art in all
domains of big data processing over the last years. Overall, the book
offers a valuable reference guide for professional, students, and
researchers in the domain of big data processing systems. Further, its
comprehensive content will hopefully encourage readers to pursue
further research on the subject.
Les mer
A Systems Overview
Produktdetaljer
ISBN
9783030441876
Publisert
2020
Utgave
2. utgave
Utgiver
Vendor
Springer
Språk
Product language
Engelsk
Format
Product format
Digital bok
Forfatter