Machine Learning with PySpark

Pramod Singh

Pramod Singh
Machine Learning with PySpark [PDF ebook]
With Natural Language Processing and Recommender Systems

Sokongan

perlindungan daripada Pramod Singh: Machine Learning with PySpark (PDF)

Build machine learning models, natural language processing applications, and recommender systems with Py Spark to solve various business challenges. This book starts with the fundamentals of Spark and its evolution and then covers the entire spectrum of traditional machine learning algorithms along with natural language processing and recommender systems using Py Spark.

Machine Learning with Py Spark shows you how to build supervised machine learning models such as linear regression, logistic regression, decision trees, and random forest. You’ll also see unsupervised machine learning models such as K-means and hierarchical clustering. A major portion of the book focuses on feature engineering to create useful features with Py Spark to train the machine learning models. The natural language processing section covers text processing, text mining, and embedding for classification.

After reading thisbook, you will understand how to use Py Spark’s machine learning library to build and train various machine learning models. Additionally you’ll become comfortable with related Py Spark components, such as data ingestion, data processing, and data analysis, that you can use to develop data-driven intelligent applications.

What You Will Learn

Build a spectrum of supervised and unsupervised machine learning algorithms

Implement machine learning algorithms with Spark MLlib libraries

Develop a recommender system with Spark MLlib libraries

Handle issues related to feature engineering, class balance, bias and variance, and cross validation for building an optimal fit model

Who This Book Is For

Data science and machine learning professionals.

€34.99

Jadual kandungan

Chapter 1: Evolution of Data, – Chapter 2: Introduction to Machine Learning. – Chapter 3: Data Processing. – Chapter 4: Linear Regression. – Chapter 5: Logistic Regression. – Chapter 6: Random Forests. – Chapter 7: Recommender Systems. – Chapter 8: Clustering. – Chapter 9: Natural Language Processing.

Mengenai Pengarang

Pramod Singh is an established data scientist with over eight years of experience in data and solving business challenges. He has worked in organizations such as Infosys, Tally and Sapient Razorfish. Also, president of a data science meet-up group and regular speaker at various webinars. Recently spoke at major conference: GIDS 2018 and presented a session on “Sequence Embedding in Spark” which was well received. He has an online Udemy course on machine learning.

Beli ebook ini dan dapatkan 1 lagi PERCUMA!

Bahasa Inggeris ● Format PDF ● Halaman-halaman 223 ● ISBN 9781484241318 ● Saiz fail 7.4 MB ● Penerbit Apress ● Bandar raya CA ● Negara US ● Diterbitkan 2018 ● Muat turun 24 bulan ● Mata wang EUR ● ID 6775810 ● Salin perlindungan Social DRM