Python Feature Engineering Cookbook

Python Feature Engineering Cookbook PDF Author: Soledad Galli
Publisher: Packt Publishing Ltd
ISBN: 1789807824
Category : Computers
Languages : en
Pages : 364

Get Book Here

Book Description
Extract accurate information from data to train and improve machine learning models using NumPy, SciPy, pandas, and scikit-learn libraries Key FeaturesDiscover solutions for feature generation, feature extraction, and feature selectionUncover the end-to-end feature engineering process across continuous, discrete, and unstructured datasetsImplement modern feature extraction techniques using Python's pandas, scikit-learn, SciPy and NumPy librariesBook Description Feature engineering is invaluable for developing and enriching your machine learning models. In this cookbook, you will work with the best tools to streamline your feature engineering pipelines and techniques and simplify and improve the quality of your code. Using Python libraries such as pandas, scikit-learn, Featuretools, and Feature-engine, you’ll learn how to work with both continuous and discrete datasets and be able to transform features from unstructured datasets. You will develop the skills necessary to select the best features as well as the most suitable extraction techniques. This book will cover Python recipes that will help you automate feature engineering to simplify complex processes. You’ll also get to grips with different feature engineering strategies, such as the box-cox transform, power transform, and log transform across machine learning, reinforcement learning, and natural language processing (NLP) domains. By the end of this book, you’ll have discovered tips and practical solutions to all of your feature engineering problems. What you will learnSimplify your feature engineering pipelines with powerful Python packagesGet to grips with imputing missing valuesEncode categorical variables with a wide set of techniquesExtract insights from text quickly and effortlesslyDevelop features from transactional data and time series dataDerive new features by combining existing variablesUnderstand how to transform, discretize, and scale your variablesCreate informative variables from date and timeWho this book is for This book is for machine learning professionals, AI engineers, data scientists, and NLP and reinforcement learning engineers who want to optimize and enrich their machine learning models with the best features. Knowledge of machine learning and Python coding will assist you with understanding the concepts covered in this book.

Python Feature Engineering Cookbook

Python Feature Engineering Cookbook PDF Author: Soledad Galli
Publisher: Packt Publishing Ltd
ISBN: 1789807824
Category : Computers
Languages : en
Pages : 364

Get Book Here

Book Description
Extract accurate information from data to train and improve machine learning models using NumPy, SciPy, pandas, and scikit-learn libraries Key FeaturesDiscover solutions for feature generation, feature extraction, and feature selectionUncover the end-to-end feature engineering process across continuous, discrete, and unstructured datasetsImplement modern feature extraction techniques using Python's pandas, scikit-learn, SciPy and NumPy librariesBook Description Feature engineering is invaluable for developing and enriching your machine learning models. In this cookbook, you will work with the best tools to streamline your feature engineering pipelines and techniques and simplify and improve the quality of your code. Using Python libraries such as pandas, scikit-learn, Featuretools, and Feature-engine, you’ll learn how to work with both continuous and discrete datasets and be able to transform features from unstructured datasets. You will develop the skills necessary to select the best features as well as the most suitable extraction techniques. This book will cover Python recipes that will help you automate feature engineering to simplify complex processes. You’ll also get to grips with different feature engineering strategies, such as the box-cox transform, power transform, and log transform across machine learning, reinforcement learning, and natural language processing (NLP) domains. By the end of this book, you’ll have discovered tips and practical solutions to all of your feature engineering problems. What you will learnSimplify your feature engineering pipelines with powerful Python packagesGet to grips with imputing missing valuesEncode categorical variables with a wide set of techniquesExtract insights from text quickly and effortlesslyDevelop features from transactional data and time series dataDerive new features by combining existing variablesUnderstand how to transform, discretize, and scale your variablesCreate informative variables from date and timeWho this book is for This book is for machine learning professionals, AI engineers, data scientists, and NLP and reinforcement learning engineers who want to optimize and enrich their machine learning models with the best features. Knowledge of machine learning and Python coding will assist you with understanding the concepts covered in this book.

Python Feature Engineering Cookbook

Python Feature Engineering Cookbook PDF Author: Soledad Galli
Publisher: Packt Publishing Ltd
ISBN: 1804615390
Category : Computers
Languages : en
Pages : 386

Get Book Here

Book Description
Create end-to-end, reproducible feature engineering pipelines that can be deployed into production using open-source Python libraries Key Features Learn and implement feature engineering best practices Reinforce your learning with the help of multiple hands-on recipes Build end-to-end feature engineering pipelines that are performant and reproducible Book DescriptionFeature engineering, the process of transforming variables and creating features, albeit time-consuming, ensures that your machine learning models perform seamlessly. This second edition of Python Feature Engineering Cookbook will take the struggle out of feature engineering by showing you how to use open source Python libraries to accelerate the process via a plethora of practical, hands-on recipes. This updated edition begins by addressing fundamental data challenges such as missing data and categorical values, before moving on to strategies for dealing with skewed distributions and outliers. The concluding chapters show you how to develop new features from various types of data, including text, time series, and relational databases. With the help of numerous open source Python libraries, you'll learn how to implement each feature engineering method in a performant, reproducible, and elegant manner. By the end of this Python book, you will have the tools and expertise needed to confidently build end-to-end and reproducible feature engineering pipelines that can be deployed into production.What you will learn Impute missing data using various univariate and multivariate methods Encode categorical variables with one-hot, ordinal, and count encoding Handle highly cardinal categorical variables Transform, discretize, and scale your variables Create variables from date and time with pandas and Feature-engine Combine variables into new features Extract features from text as well as from transactional data with Featuretools Create features from time series data with tsfresh Who this book is for This book is for machine learning and data science students and professionals, as well as software engineers working on machine learning model deployment, who want to learn more about how to transform their data and create new features to train machine learning models in a better way.

Practical Data Science Cookbook

Practical Data Science Cookbook PDF Author: Prabhanjan Tattar
Publisher: Packt Publishing Ltd
ISBN: 178712326X
Category : Computers
Languages : en
Pages : 428

Get Book Here

Book Description
Over 85 recipes to help you complete real-world data science projects in R and Python About This Book Tackle every step in the data science pipeline and use it to acquire, clean, analyze, and visualize your data Get beyond the theory and implement real-world projects in data science using R and Python Easy-to-follow recipes will help you understand and implement the numerical computing concepts Who This Book Is For If you are an aspiring data scientist who wants to learn data science and numerical programming concepts through hands-on, real-world project examples, this is the book for you. Whether you are brand new to data science or you are a seasoned expert, you will benefit from learning about the structure of real-world data science projects and the programming examples in R and Python. What You Will Learn Learn and understand the installation procedure and environment required for R and Python on various platforms Prepare data for analysis by implement various data science concepts such as acquisition, cleaning and munging through R and Python Build a predictive model and an exploratory model Analyze the results of your model and create reports on the acquired data Build various tree-based methods and Build random forest In Detail As increasing amounts of data are generated each year, the need to analyze and create value out of it is more important than ever. Companies that know what to do with their data and how to do it well will have a competitive advantage over companies that don't. Because of this, there will be an increasing demand for people that possess both the analytical and technical abilities to extract valuable insights from data and create valuable solutions that put those insights to use. Starting with the basics, this book covers how to set up your numerical programming environment, introduces you to the data science pipeline, and guides you through several data projects in a step-by-step format. By sequentially working through the steps in each chapter, you will quickly familiarize yourself with the process and learn how to apply it to a variety of situations with examples using the two most popular programming languages for data analysis—R and Python. Style and approach This step-by-step guide to data science is full of hands-on examples of real-world data science tasks. Each recipe focuses on a particular task involved in the data science pipeline, ranging from readying the dataset to analytics and visualization

Python for Finance Cookbook

Python for Finance Cookbook PDF Author: Eryk Lewinson
Publisher: Packt Publishing Ltd
ISBN: 1789617324
Category : Computers
Languages : en
Pages : 426

Get Book Here

Book Description
Solve common and not-so-common financial problems using Python libraries such as NumPy, SciPy, and pandas Key FeaturesUse powerful Python libraries such as pandas, NumPy, and SciPy to analyze your financial dataExplore unique recipes for financial data analysis and processing with PythonEstimate popular financial models such as CAPM and GARCH using a problem-solution approachBook Description Python is one of the most popular programming languages used in the financial industry, with a huge set of accompanying libraries. In this book, you'll cover different ways of downloading financial data and preparing it for modeling. You'll calculate popular indicators used in technical analysis, such as Bollinger Bands, MACD, RSI, and backtest automatic trading strategies. Next, you'll cover time series analysis and models, such as exponential smoothing, ARIMA, and GARCH (including multivariate specifications), before exploring the popular CAPM and the Fama-French three-factor model. You'll then discover how to optimize asset allocation and use Monte Carlo simulations for tasks such as calculating the price of American options and estimating the Value at Risk (VaR). In later chapters, you'll work through an entire data science project in the financial domain. You'll also learn how to solve the credit card fraud and default problems using advanced classifiers such as random forest, XGBoost, LightGBM, and stacked models. You'll then be able to tune the hyperparameters of the models and handle class imbalance. Finally, you'll focus on learning how to use deep learning (PyTorch) for approaching financial tasks. By the end of this book, you’ll have learned how to effectively analyze financial data using a recipe-based approach. What you will learnDownload and preprocess financial data from different sourcesBacktest the performance of automatic trading strategies in a real-world settingEstimate financial econometrics models in Python and interpret their resultsUse Monte Carlo simulations for a variety of tasks such as derivatives valuation and risk assessmentImprove the performance of financial models with the latest Python librariesApply machine learning and deep learning techniques to solve different financial problemsUnderstand the different approaches used to model financial time series dataWho this book is for This book is for financial analysts, data analysts, and Python developers who want to learn how to implement a broad range of tasks in the finance domain. Data scientists looking to devise intelligent financial strategies to perform efficient financial analysis will also find this book useful. Working knowledge of the Python programming language is mandatory to grasp the concepts covered in the book effectively.

Modern Python Cookbook

Modern Python Cookbook PDF Author: Steven F. Lott
Publisher: Packt Publishing Ltd
ISBN: 1786463849
Category : Computers
Languages : en
Pages : 686

Get Book Here

Book Description
The latest in modern Python recipes for the busy modern programmer About This Book Develop succinct, expressive programs in Python Learn the best practices and common idioms through carefully explained and structured recipes Discover new ways to apply Python for the new age of development Who This Book Is For The book is for web developers, programmers, enterprise programmers, engineers, big data scientist, and so on. If you are a beginner, Python Cookbook will get you started. If you are experienced, it will expand your knowledge base. A basic knowledge of programming would help. What You Will Learn See the intricate details of the Python syntax and how to use it to your advantage Improve your code readability through functions in Python Manipulate data effectively using built-in data structures Get acquainted with advanced programming techniques in Python Equip yourself with functional and statistical programming features Write proper tests to be sure a program works as advertised Integrate application software using Python In Detail Python is the preferred choice of developers, engineers, data scientists, and hobbyists everywhere. It is a great scripting language that can power your applications and provide great speed, safety, and scalability. By exposing Python as a series of simple recipes, you can gain insight into specific language features in a particular context. Having a tangible context helps make the language or standard library feature easier to understand. This book comes with over 100 recipes on the latest version of Python. The recipes will benefit everyone ranging from beginner to an expert. The book is broken down into 13 chapters that build from simple language concepts to more complex applications of the language. The recipes will touch upon all the necessary Python concepts related to data structures, OOP, functional programming, as well as statistical programming. You will get acquainted with the nuances of Python syntax and how to effectively use the advantages that it offers. You will end the book equipped with the knowledge of testing, web services, and configuration and application integration tips and tricks. The recipes take a problem-solution approach to resolve issues commonly faced by Python programmers across the globe. You will be armed with the knowledge of creating applications with flexible logging, powerful configuration, and command-line options, automated unit tests, and good documentation. Style and approach This book takes a recipe-based approach, where each recipe addresses specific problems and issues. The recipes provide discussions and insights and an explanation of the problems.

Modern Python Cookbook

Modern Python Cookbook PDF Author: Steven F. Lott
Publisher: Packt Publishing Ltd
ISBN: 1800205805
Category : Computers
Languages : en
Pages : 823

Get Book Here

Book Description
Complete recipes spread across 15 chapters to help you overcome commonly faced issues by Python for everybody across the globe. Each recipe takes a problem-solution approach to resolve for effective Python. Key Features Develop expressive and effective Python programs Best practices and common idioms through carefully explained recipes Discover new ways to apply Python for data-focused development Make use of Python’s optional type annotations Book DescriptionPython is the preferred choice of developers, engineers, data scientists, and hobbyists everywhere. It is a great language that can power your applications and provide great speed, safety, and scalability. It can be used for simple scripting or sophisticated web applications. By exposing Python as a series of simple recipes, this book gives you insight into specific language features in a particular context. Having a tangible context helps make the language or a given standard library feature easier to understand. This book comes with 133 recipes on the latest version of Python 3.8. The recipes will benefit everyone, from beginners just starting out with Python to experts. You'll not only learn Python programming concepts but also how to build complex applications. The recipes will touch upon all necessary Python concepts related to data structures, object oriented programming, functional programming, and statistical programming. You will get acquainted with the nuances of Python syntax and how to effectively take advantage of it. By the end of this Python book, you will be equipped with knowledge of testing, web services, configuration, and application integration tips and tricks. You will be armed with the knowledge of how to create applications with flexible logging, powerful configuration, command-line options, automated unit tests, and good documentation.What you will learn See the intricate details of the Python syntax and how to use it to your advantage Improve your coding with Python readability through functions Manipulate data effectively using built-in data structures Get acquainted with advanced programming techniques in Python Equip yourself with functional and statistical programming features Write proper tests to be sure a program works as advertised Integrate application software using Python Who this book is for The Python book is for web developers, programmers, enterprise programmers, engineers, and big data scientists. If you are a beginner, this book will get you started. If you are experienced, it will expand your knowledge base. A basic knowledge of programming would help.

Python Machine Learning Cookbook

Python Machine Learning Cookbook PDF Author: Prateek Joshi
Publisher: Packt Publishing Ltd
ISBN: 1786467682
Category : Computers
Languages : en
Pages : 304

Get Book Here

Book Description
100 recipes that teach you how to perform various machine learning tasks in the real world About This Book Understand which algorithms to use in a given context with the help of this exciting recipe-based guide Learn about perceptrons and see how they are used to build neural networks Stuck while making sense of images, text, speech, and real estate? This guide will come to your rescue, showing you how to perform machine learning for each one of these using various techniques Who This Book Is For This book is for Python programmers who are looking to use machine-learning algorithms to create real-world applications. This book is friendly to Python beginners, but familiarity with Python programming would certainly be useful to play around with the code. What You Will Learn Explore classification algorithms and apply them to the income bracket estimation problem Use predictive modeling and apply it to real-world problems Understand how to perform market segmentation using unsupervised learning Explore data visualization techniques to interact with your data in diverse ways Find out how to build a recommendation engine Understand how to interact with text data and build models to analyze it Work with speech data and recognize spoken words using Hidden Markov Models Analyze stock market data using Conditional Random Fields Work with image data and build systems for image recognition and biometric face recognition Grasp how to use deep neural networks to build an optical character recognition system In Detail Machine learning is becoming increasingly pervasive in the modern data-driven world. It is used extensively across many fields such as search engines, robotics, self-driving cars, and more. With this book, you will learn how to perform various machine learning tasks in different environments. We'll start by exploring a range of real-life scenarios where machine learning can be used, and look at various building blocks. Throughout the book, you'll use a wide variety of machine learning algorithms to solve real-world problems and use Python to implement these algorithms. You'll discover how to deal with various types of data and explore the differences between machine learning paradigms such as supervised and unsupervised learning. We also cover a range of regression techniques, classification algorithms, predictive modeling, data visualization techniques, recommendation engines, and more with the help of real-world examples. Style and approach You will explore various real-life scenarios in this book where machine learning can be used, and learn about different building blocks of machine learning using independent recipes in the book.

Python Automation Cookbook

Python Automation Cookbook PDF Author: Jaime Buelta
Publisher: Packt Publishing Ltd
ISBN: 1800202598
Category : Computers
Languages : en
Pages : 527

Get Book Here

Book Description
Get a firm grip on the core processes including browser automation, web scraping, Word, Excel, and GUI automation with Python 3.8 and higher Key FeaturesAutomate integral business processes such as report generation, email marketing, and lead generationExplore automated code testing and Python’s growth in data science and AI automation in three new chaptersUnderstand techniques to extract information and generate appealing graphs, and reports with MatplotlibBook Description In this updated and extended version of Python Automation Cookbook, each chapter now comprises the newest recipes and is revised to align with Python 3.8 and higher. The book includes three new chapters that focus on using Python for test automation, machine learning projects, and for working with messy data. This edition will enable you to develop a sharp understanding of the fundamentals required to automate business processes through real-world tasks, such as developing your first web scraping application, analyzing information to generate spreadsheet reports with graphs, and communicating with automatically generated emails. Once you grasp the basics, you will acquire the practical knowledge to create stunning graphs and charts using Matplotlib, generate rich graphics with relevant information, automate marketing campaigns, build machine learning projects, and execute debugging techniques. By the end of this book, you will be proficient in identifying monotonous tasks and resolving process inefficiencies to produce superior and reliable systems. What you will learnLearn data wrangling with Python and Pandas for your data science and AI projectsAutomate tasks such as text classification, email filtering, and web scraping with PythonUse Matplotlib to generate a variety of stunning graphs, charts, and mapsAutomate a range of report generation tasks, from sending SMS and email campaigns to creating templates, adding images in Word, and even encrypting PDFsMaster web scraping and web crawling of popular file formats and directories with tools like Beautiful SoupBuild cool projects such as a Telegram bot for your marketing campaign, a reader from a news RSS feed, and a machine learning model to classify emails to the correct department based on their contentCreate fire-and-forget automation tasks by writing cron jobs, log files, and regexes with Python scriptingWho this book is for Python Automation Cookbook - Second Edition is for developers, data enthusiasts or anyone who wants to automate monotonous manual tasks related to business processes such as finance, sales, and HR, among others. Working knowledge of Python is all you need to get started with this book.

Feature Engineering for Machine Learning

Feature Engineering for Machine Learning PDF Author: Alice Zheng
Publisher: "O'Reilly Media, Inc."
ISBN: 1491953195
Category : Computers
Languages : en
Pages : 218

Get Book Here

Book Description
Feature engineering is a crucial step in the machine-learning pipeline, yet this topic is rarely examined on its own. With this practical book, you’ll learn techniques for extracting and transforming features—the numeric representations of raw data—into formats for machine-learning models. Each chapter guides you through a single data problem, such as how to represent text or image data. Together, these examples illustrate the main principles of feature engineering. Rather than simply teach these principles, authors Alice Zheng and Amanda Casari focus on practical application with exercises throughout the book. The closing chapter brings everything together by tackling a real-world, structured dataset with several feature-engineering techniques. Python packages including numpy, Pandas, Scikit-learn, and Matplotlib are used in code examples. You’ll examine: Feature engineering for numeric data: filtering, binning, scaling, log transforms, and power transforms Natural text techniques: bag-of-words, n-grams, and phrase detection Frequency-based filtering and feature scaling for eliminating uninformative features Encoding techniques of categorical variables, including feature hashing and bin-counting Model-based feature engineering with principal component analysis The concept of model stacking, using k-means as a featurization technique Image feature extraction with manual and deep-learning techniques

Python Data Analysis

Python Data Analysis PDF Author: Armando Fandango
Publisher: Packt Publishing Ltd
ISBN: 1787127923
Category : Computers
Languages : en
Pages : 320

Get Book Here

Book Description
Learn how to apply powerful data analysis techniques with popular open source Python modules About This Book Find, manipulate, and analyze your data using the Python 3.5 libraries Perform advanced, high-performance linear algebra and mathematical calculations with clean and efficient Python code An easy-to-follow guide with realistic examples that are frequently used in real-world data analysis projects. Who This Book Is For This book is for programmers, scientists, and engineers who have the knowledge of Python and know the basics of data science. It is for those who wish to learn different data analysis methods using Python 3.5 and its libraries. This book contains all the basic ingredients you need to become an expert data analyst. What You Will Learn Install open source Python modules such NumPy, SciPy, Pandas, stasmodels, scikit-learn,theano, keras, and tensorflow on various platforms Prepare and clean your data, and use it for exploratory analysis Manipulate your data with Pandas Retrieve and store your data from RDBMS, NoSQL, and distributed filesystems such as HDFS and HDF5 Visualize your data with open source libraries such as matplotlib, bokeh, and plotly Learn about various machine learning methods such as supervised, unsupervised, probabilistic, and Bayesian Understand signal processing and time series data analysis Get to grips with graph processing and social network analysis In Detail Data analysis techniques generate useful insights from small and large volumes of data. Python, with its strong set of libraries, has become a popular platform to conduct various data analysis and predictive modeling tasks. With this book, you will learn how to process and manipulate data with Python for complex analysis and modeling. We learn data manipulations such as aggregating, concatenating, appending, cleaning, and handling missing values, with NumPy and Pandas. The book covers how to store and retrieve data from various data sources such as SQL and NoSQL, CSV fies, and HDF5. We learn how to visualize data using visualization libraries, along with advanced topics such as signal processing, time series, textual data analysis, machine learning, and social media analysis. The book covers a plethora of Python modules, such as matplotlib, statsmodels, scikit-learn, and NLTK. It also covers using Python with external environments such as R, Fortran, C/C++, and Boost libraries. Style and approach The book takes a very comprehensive approach to enhance your understanding of data analysis. Sufficient real-world examples and use cases are included in the book to help you grasp the concepts quickly and apply them easily in your day-to-day work. Packed with clear, easy to follow examples, this book will turn you into an ace data analyst in no time.