更新时间:2021-07-14 10:13:46
封面
版权页
Credits
About the Author
About the Reviewer
www.PacktPub.com
eBooks discount offers and more
Preface
What this book covers
What you need for this book
Who this book is for
Conventions
Reader feedback
Customer support
Chapter 1. From Data to Decisions – Getting Started with Analytic Applications
Designing an advanced analytic solution
Case study: sentiment analysis of social media feeds
Case study: targeted e-mail campaigns
Summary
Chapter 2. Exploratory Data Analysis and Visualization in Python
Exploring categorical and numerical data in IPython
Time series analysis
Working with geospatial data
Introduction to PySpark
Chapter 3. Finding Patterns in the Noise – Clustering and Unsupervised Learning
Similarity and distance metrics
Affinity propagation – automatically choosing cluster numbers
k-medoids
Agglomerative clustering
Streaming clustering in Spark
Chapter 4. Connecting the Dots with Models – Regression Methods
Linear regression
Tree methods
Scaling out with PySpark – predicting year of song release
Chapter 5. Putting Data in its Place – Classification Methods and Analysis
Logistic regression
Fitting the model
Evaluating classification models
Separating Nonlinear boundaries with Support vector machines
Comparing classification methods
Case study: fitting classifier models in pyspark
Chapter 6. Words and Pixels – Working with Unstructured Data
Working with textual data
Principal component analysis
Images
Case Study: Training a Recommender System in PySpark
Chapter 7. Learning from the Bottom Up – Deep Networks and Unsupervised Features
Learning patterns with neural networks
The TensorFlow library and digit recognition
Chapter 8. Sharing Models with Prediction Services
The architecture of a prediction service
Clients and making requests
Server – the web traffic controller
Persisting information with database systems
Case study – logistic regression service
Chapter 9. Reporting and Testing – Iterating on Analytic Systems
Checking the health of models with diagnostics
Iterating on models through A/B testing
Guidelines for communication
Index