Intro
Table of Contents
About the Author
About the Technical Reviewer
Acknowledgments
Introduction
Chapter 1: An Easy Transition
PySpark and Pandas Integration
Similarity in Syntax
Loading Data
Selecting Columns
Aggregating Data
Filtering Data
Joining Data
Saving Data
Modeling Steps
Pipelines
Summary
Chapter 2: Selecting Algorithms
The Dataset
Selecting Algorithms with Cross-Validation
Scikit-Learn
PySpark
Bringing It All Together
Scikit-Learn
PySpark
Summary

Chapter 3: Multiple Linear Regression with Pandas, Scikit-Learn, and PySpark
The Dataset
Multiple Linear Regression
Multiple Linear Regression with Scikit-Learn
Multiple Linear Regression with PySpark
Summary
Chapter 4: Decision Tree Regression with Pandas, Scikit-Learn, and PySpark
The Dataset
Decision Tree Regression
Decision Tree Regression with Scikit-Learn
The Modeling Steps
Decision Tree Regression with PySpark
The Modeling Steps
Bringing It All Together
Scikit-Learn
PySpark
Summary

Chapter 5: Random Forest Regression with Pandas, Scikit-Learn, and PySpark
The Dataset
Random Forest Regression
Random Forest with Scikit-Learn
Random Forest with PySpark
Bringing It All Together
Scikit-Learn
PySpark
Summary
Chapter 6: Gradient-Boosted Tree Regression with Pandas, Scikit-Learn, and PySpark
The Dataset
Gradient-Boosted Tree (GBT) Regression
GBT with Scikit-Learn
GBT with PySpark
Bringing It All Together
Scikit-Learn
PySpark
Summary
Chapter 7: Logistic Regression with Pandas, Scikit-Learn, and PySpark
The Dataset
Logistic Regression
Logistic Regression with Scikit-Learn
Logistic Regression with PySpark
Putting It All Together
Scikit-Learn
PySpark
Summary
Chapter 8: Decision Tree Classification with Pandas, Scikit-Learn, and PySpark
The Dataset
Decision Tree Classification
Scikit-Learn and PySpark Similarities
Decision Tree Classification with Scikit-Learn
Decision Tree Classification with PySpark
Bringing It All Together
Scikit-Learn
PySpark
Summary
Chapter 9: Random Forest Classification with Scikit-Learn and PySpark
Random Forest Classification
Scikit-Learn and PySpark Similarities for Random Forests
Random Forests with Scikit-Learn
Random Forests with PySpark
Bringing It All Together
Scikit-Learn
PySpark
Summary
Chapter 10: Support Vector Machine Classification with Pandas, Scikit-Learn, and PySpark
The Dataset
Support Vector Machine Classification
Linear SVM with Scikit-Learn
Linear SVM with PySpark
Bringing It All Together
Scikit-Learn
PySpark
Summary
Chapter 11: Naive Bayes Classification with Pandas, Scikit-Learn, and PySpark
The Dataset
Naive Bayes Classification
