LEVERAGE THE POWER OF PYTHON TO BUILD REAL-WORLD FEATURE ENGINEERING
AND MACHINE LEARNING PIPELINES READY TO BE DEPLOYED TO PRODUCTION
KEY FEATURES
* Craft powerful features from tabular, transactional, and
time-series data
* Develop efficient and reproducible real-world feature engineering
pipelines
* Optimize data transformation and save valuable time
* Purchase of the print or Kindle book includes a free PDF eBook
BOOK DESCRIPTION
Streamline data preprocessing and feature engineering in your machine
learning project with this third edition of the Python Feature
Engineering Cookbook to make your data preparation more efficient.
This guide addresses common challenges, such as imputing missing
values and encoding categorical variables using practical solutions
and open source Python libraries. You’ll learn advanced techniques
for transforming numerical variables, discretizing variables, and
dealing with outliers. Each chapter offers step-by-step instructions
and real-world examples, helping you understand when and how to apply
various transformations for well-prepared data. The book explores
feature extraction from complex data types such as dates, times, and
text. You’ll see how to create new features through mathematical
operations and decision trees and use advanced tools like Featuretools
and tsfresh to extract features from relational data and time series.
By the end, you’ll be ready to build reproducible feature
engineering pipelines that can be easily deployed into production,
optimizing data preprocessing workflows and enhancing machine learning
model performance.
WHAT YOU WILL LEARN
* Discover multiple methods to impute missing data effectively
* Encode categorical variables while tackling high cardinality
* Find out how to properly transform, discretize, and scale your
variables
* Automate feature extraction from date and time data
* Combine variables strategically to create new and powerful features
* Extract features from transactional data and time series
* Learn methods to extract meaningful features from text data
WHO THIS BOOK IS FOR
If you're a machine learning or data science enthusiast who wants to
learn more about feature engineering, data preprocessing, and how to
optimize these tasks, this book is for you. If you already know the
basics of feature engineering and are looking to learn more advanced
methods to craft powerful features, this book will help you. You
should have basic knowledge of Python programming and machine learning
to get started.
Les mer
Produktdetaljer
ISBN
9781835883594
Publisert
2024
Utgave
3. utgave
Utgiver
Vendor
Packt Publishing
Språk
Product language
Engelsk
Format
Product format
Digital bok
Forfatter