Author: Rajalingappaa Shanmugamani
Publisher: Packt Publishing Ltd
ISBN: 1788293355
Category : Computers
Languages : en
Pages : 304
Book Description
Learn how to model and train advanced neural networks to implement a variety of Computer Vision tasks
Key Features
- Train different kinds of deep learning model from scratch to solve specific problems in Computer Vision
- Combine the power of Python, Keras, and TensorFlow to build deep learning models for object detection, image classification, similarity learning, image captioning, and more
- Includes tips on optimizing and improving the performance of your models under various constraints
Book Description
Deep learning has shown its power in several application areas of Artificial Intelligence, especially in Computer Vision. Computer Vision is the science of understanding and manipulating images, and finds enormous applications in the areas of robotics, automation, and so on. This book will also show you, with practical examples, how to develop Computer Vision applications by leveraging the power of deep learning. In this book, you will learn different techniques related to object classification, object detection, image segmentation, captioning, image generation, face analysis, and more. You will also explore their applications using popular Python libraries such as TensorFlow and Keras. This book will help you master state-of-the-art deep learning algorithms and their implementation.
What you will learn
- Set up an environment for deep learning with Python, TensorFlow, and Keras
- Define and train a model for image and video classification
- Use features from a pre-trained Convolutional Neural Network model for image retrieval
- Understand and implement object detection using the real-world Pedestrian Detection scenario
- Learn about various problems in image captioning and how to overcome them by training images and text together
- Implement similarity matching and train a model for face recognition
- Understand the concept of generative models and use them for image generation
- Deploy your deep learning models and optimize them for high performance
Who this book is for
This book is targeted at data scientists and Computer Vision practitioners who wish to apply the concepts of Deep Learning to overcome any problem related to Computer Vision. A basic knowledge of programming in Python, and some understanding of machine learning concepts, is required to get the best out of this book.
Deep Learning for Computer Vision
Advanced Methods and Deep Learning in Computer Vision
Author: E. R. Davies
Publisher: Academic Press
ISBN: 0128221496
Category : Technology & Engineering
Languages : en
Pages : 584
Book Description
Advanced Methods and Deep Learning in Computer Vision presents advanced computer vision methods, emphasizing machine and deep learning techniques that have emerged during the past 5–10 years. The book provides clear explanations of principles and algorithms supported with applications. Topics covered include machine learning, deep learning networks, generative adversarial networks, deep reinforcement learning, self-supervised learning, extraction of robust features, object detection, semantic segmentation, linguistic descriptions of images, visual search, visual tracking, 3D shape retrieval, image inpainting, novelty and anomaly detection. This book provides easy learning for researchers and practitioners of advanced computer vision methods, but it is also suitable as a textbook for a second course on computer vision and deep learning for advanced undergraduates and graduate students.
- Provides an important reference on deep learning and advanced computer methods that was created by leaders in the field
- Illustrates principles with modern, real-world applications
- Suitable for self-learning or as a text for graduate courses
Practical Machine Learning for Computer Vision
Author: Valliappa Lakshmanan
Publisher: "O'Reilly Media, Inc."
ISBN: 1098102339
Category : Computers
Languages : en
Pages : 481
Book Description
This practical book shows you how to employ machine learning models to extract information from images. ML engineers and data scientists will learn how to solve a variety of image problems including classification, object detection, autoencoders, image generation, counting, and captioning with proven ML techniques. This book provides a great introduction to end-to-end deep learning: dataset creation, data preprocessing, model design, model training, evaluation, deployment, and interpretability. Google engineers Valliappa Lakshmanan, Martin Görner, and Ryan Gillard show you how to develop accurate and explainable computer vision ML models and put them into large-scale production using robust ML architecture in a flexible and maintainable way. You'll learn how to design, train, evaluate, and predict with models written in TensorFlow or Keras.
You'll learn how to:
- Design ML architecture for computer vision tasks
- Select a model (such as ResNet, SqueezeNet, or EfficientNet) appropriate to your task
- Create an end-to-end ML pipeline to train, evaluate, deploy, and explain your model
- Preprocess images for data augmentation and to support learnability
- Incorporate explainability and responsible AI best practices
- Deploy image models as web services or on edge devices
- Monitor and manage ML models
Deep Learning for Vision Systems
Author: Mohamed Elgendy
Publisher: Manning Publications
ISBN: 1617296198
Category : Computers
Languages : en
Pages : 478
Book Description
How does the computer learn to understand what it sees? Deep Learning for Vision Systems answers that by applying deep learning to computer vision. Using only high school algebra, this book illuminates the concepts behind visual intuition. You'll understand how to use deep learning architectures to build vision system applications for image generation and facial recognition.
Summary
Computer vision is central to many leading-edge innovations, including self-driving cars, drones, augmented reality, facial recognition, and much, much more. Amazing new computer vision applications are developed every day, thanks to rapid advances in AI and deep learning (DL). Deep Learning for Vision Systems teaches you the concepts and tools for building intelligent, scalable computer vision systems that can identify and react to objects in images, videos, and real life. With author Mohamed Elgendy's expert instruction and illustration of real-world projects, you’ll finally grok state-of-the-art deep learning techniques, so you can build, contribute to, and lead in the exciting realm of computer vision! Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.
About the technology
How much has computer vision advanced? One ride in a Tesla is the only answer you’ll need. Deep learning techniques have led to exciting breakthroughs in facial recognition, interactive simulations, and medical imaging, but nothing beats seeing a car respond to real-world stimuli while speeding down the highway.
About the book
How does the computer learn to understand what it sees? Deep Learning for Vision Systems answers that by applying deep learning to computer vision. Using only high school algebra, this book illuminates the concepts behind visual intuition. You'll understand how to use deep learning architectures to build vision system applications for image generation and facial recognition.
What's inside
- Image classification and object detection
- Advanced deep learning architectures
- Transfer learning and generative adversarial networks
- DeepDream and neural style transfer
- Visual embeddings and image search
About the reader
For intermediate Python programmers.
About the author
Mohamed Elgendy is the VP of Engineering at Rakuten. A seasoned AI expert, he has previously built and managed AI products at Amazon and Twilio.
Table of Contents
PART 1 - DEEP LEARNING FOUNDATION
1 Welcome to computer vision
2 Deep learning and neural networks
3 Convolutional neural networks
4 Structuring DL projects and hyperparameter tuning
PART 2 - IMAGE CLASSIFICATION AND DETECTION
5 Advanced CNN architectures
6 Transfer learning
7 Object detection with R-CNN, SSD, and YOLO
PART 3 - GENERATIVE MODELS AND VISUAL EMBEDDINGS
8 Generative adversarial networks (GANs)
9 DeepDream and neural style transfer
10 Visual embeddings
Deep Learning in Computer Vision
Author: Mahmoud Hassaballah
Publisher: CRC Press
ISBN: 1351003801
Category : Computers
Languages : en
Pages : 261
Book Description
Deep learning algorithms have brought a revolution to the computer vision community by introducing non-traditional and efficient solutions to several image-related problems that had long remained unsolved or partially addressed. This book presents a collection of eleven chapters where each individual chapter explains the deep learning principles of a specific topic, introduces reviews of up-to-date techniques, and presents research findings to the computer vision community. The book covers a broad scope of topics in deep learning concepts and applications such as accelerating the convolutional neural network inference on field-programmable gate arrays, fire detection in surveillance applications, face recognition, action and activity recognition, semantic segmentation for autonomous driving, aerial imagery registration, robot vision, tumor detection, and skin lesion segmentation as well as skin melanoma classification. The content of this book has been organized such that each chapter can be read independently from the others. The book is a valuable companion for researchers, for postgraduate and possibly senior undergraduate students who are taking an advanced course in related topics, and for those who are interested in deep learning with applications in computer vision, image processing, and pattern recognition.
Deep Learning for Computer Vision
Author: Jason Brownlee
Publisher: Machine Learning Mastery
ISBN:
Category : Computers
Languages : en
Pages : 564
Book Description
Step-by-step tutorials on deep learning neural networks for computer vision in Python with Keras.
Computer Vision Using Deep Learning
Author: Vaibhav Verdhan
Publisher: Apress
ISBN: 9781484266151
Category : Computers
Languages : en
Pages : 308
Book Description
Organizations spend huge resources in developing software that can perform the way a human does. Image classification, object detection and tracking, pose estimation, facial recognition, and sentiment estimation all play a major role in solving computer vision problems. This book will bring into focus these and other deep learning architectures and techniques to help you create solutions using Keras and the TensorFlow library. You'll also review multiple neural network architectures, including LeNet, AlexNet, VGG, Inception, R-CNN, Fast R-CNN, Faster R-CNN, Mask R-CNN, YOLO, and SqueezeNet, and see how they work alongside Python code via best practices, tips, tricks, shortcuts, and pitfalls. All code snippets will be broken down and discussed thoroughly so you can implement the same principles in your respective environments. Computer Vision Using Deep Learning offers a comprehensive yet succinct guide that stitches DL and CV together to automate operations, reduce human intervention, increase capability, and cut costs.
What You'll Learn
- Examine deep learning code and concepts to apply guiding principles to your own projects
- Classify and evaluate various architectures to better understand your options in various use cases
- Go behind the scenes of basic deep learning functions to find out how they work
Who This Book Is For
Professional practitioners working in the fields of software engineering and data science. A working knowledge of Python is strongly recommended. Students and innovators working on advanced degrees in areas related to computer vision and Deep Learning.
Practical Computer Vision Applications Using Deep Learning with CNNs
Author: Ahmed Fawzy Gad
Publisher: Apress
ISBN: 1484241673
Category : Computers
Languages : en
Pages : 421
Book Description
Deploy deep learning applications into production across multiple platforms. You will work on computer vision applications that use the convolutional neural network (CNN) deep learning model and Python. This book starts by explaining the traditional machine-learning pipeline, where you will analyze an image dataset. Along the way you will cover artificial neural networks (ANNs), building one from scratch in Python, before optimizing it using genetic algorithms. For automating the process, the book highlights the limitations of traditional hand-crafted features for computer vision and why the CNN deep-learning model is the state-of-the-art solution. CNNs are discussed from scratch to demonstrate how they are different from and more efficient than the fully connected ANN (FCNN). You will implement a CNN in Python to give you a full understanding of the model. After consolidating the basics, you will use TensorFlow to build a practical image-recognition model that you will deploy to a web server using Flask, making it accessible over the Internet. Using Kivy and NumPy, you will create cross-platform data science applications with low overheads. This book will help you apply deep learning and computer vision concepts from scratch, step-by-step from conception to production.
What You Will Learn
- Understand how ANNs and CNNs work
- Create computer vision applications and CNNs from scratch using Python
- Follow a deep learning project from conception to production using TensorFlow
- Use NumPy with Kivy to build cross-platform data science applications
Who This Book Is For
Data scientists, machine learning and deep learning engineers, software developers.
Elements of Deep Learning for Computer Vision
Author: Bharat Sikka
Publisher: BPB Publications
ISBN: 9390684684
Category : Computers
Languages : en
Pages : 224
Book Description
Conceptualizing deep learning in computer vision applications using PyTorch and Python libraries.
KEY FEATURES
● Covers a variety of computer vision projects, including face recognition and object recognition with models such as YOLO and Faster R-CNN.
● Includes graphical representations and illustrations of neural networks and teaches how to program them.
● Includes deep learning techniques and architectures introduced by Microsoft, Google, and the University of Oxford.
DESCRIPTION
Elements of Deep Learning for Computer Vision gives a thorough understanding of deep learning and provides highly accurate computer vision solutions while using libraries like PyTorch. This book introduces you to Deep Learning and explains all the concepts required to understand the basic working, development, and tuning of a neural network using PyTorch. The book then addresses the field of computer vision using two libraries, including the Python wrapper/version of OpenCV and PIL. After establishing and understanding both the primary concepts, the book addresses them together by explaining Convolutional Neural Networks (CNNs). CNNs are further elaborated using top industry standards and research to explain how they provide complicated Object Detection in images and videos, while also explaining their evaluation. Towards the end, the book explains how to develop a fully functional object detection model, including its deployment over APIs. By the end of this book, you will be well equipped to understand the role of deep learning in the field of computer vision, along with a guided process to design deep learning solutions.
WHAT YOU WILL LEARN
● Get to know the mechanism of deep learning and how neural networks operate.
● Learn to develop a highly accurate neural network model.
● Access rich Python libraries to address computer vision challenges.
● Build deep learning models using PyTorch and learn how to deploy using the API.
● Learn to develop Object Detection and Face Recognition models along with their deployment.
WHO THIS BOOK IS FOR
This book is for readers who aspire to gain a strong fundamental understanding of how to infuse deep learning into computer vision and image processing applications. Readers are expected to have intermediate Python skills. No previous knowledge of PyTorch and Computer Vision is required.
TABLE OF CONTENTS
1. An Introduction to Deep Learning
2. Supervised Learning
3. Gradient Descent
4. OpenCV with Python
5. Python Imaging Library and Pillow
6. Introduction to Convolutional Neural Networks
7. GoogLeNet, VGGNet, and ResNet
8. Understanding Object Detection
9. Popular Algorithms for Object Detection
10. Faster RCNN with PyTorch and YoloV4 with Darknet
11. Comparing Algorithms and API Deployment with Flask
12. Applications in Real World
Modern Computer Vision with PyTorch
Author: V Kishore Ayyadevara
Publisher: Packt Publishing Ltd
ISBN: 1839216530
Category : Computers
Languages : en
Pages : 805
Book Description
Get to grips with deep learning techniques for building image processing applications using PyTorch with the help of code notebooks and test questions Key FeaturesImplement solutions to 50 real-world computer vision applications using PyTorchUnderstand the theory and working mechanisms of neural network architectures and their implementationDiscover best practices using a custom library created especially for this bookBook Description Deep learning is the driving force behind many recent advances in various computer vision (CV) applications. This book takes a hands-on approach to help you to solve over 50 CV problems using PyTorch1.x on real-world datasets. You’ll start by building a neural network (NN) from scratch using NumPy and PyTorch and discover best practices for tweaking its hyperparameters. You’ll then perform image classification using convolutional neural networks and transfer learning and understand how they work. As you progress, you’ll implement multiple use cases of 2D and 3D multi-object detection, segmentation, human-pose-estimation by learning about the R-CNN family, SSD, YOLO, U-Net architectures, and the Detectron2 platform. The book will also guide you in performing facial expression swapping, generating new faces, and manipulating facial expressions as you explore autoencoders and modern generative adversarial networks. You’ll learn how to combine CV with NLP techniques, such as LSTM and transformer, and RL techniques, such as Deep Q-learning, to implement OCR, image captioning, object detection, and a self-driving car agent. Finally, you'll move your NN model to production on the AWS Cloud. By the end of this book, you’ll be able to leverage modern NN architectures to solve over 50 real-world CV problems confidently. 
What you will learn
● Train a NN from scratch with NumPy and PyTorch
● Implement 2D and 3D multi-object detection and segmentation
● Generate digits and DeepFakes with autoencoders and advanced GANs
● Manipulate images using CycleGAN, Pix2PixGAN, StyleGAN2, and SRGAN
● Combine CV with NLP to perform OCR, image captioning, and object detection
● Combine CV with reinforcement learning to build agents that play Pong and self-drive a car
● Deploy a deep learning model on the AWS server using FastAPI and Docker
● Implement over 35 NN architectures and common OpenCV utilities
Who this book is for
This book is for beginners to PyTorch and intermediate-level machine learning practitioners who are looking to get well-versed with computer vision techniques using deep learning and PyTorch. If you are just getting started with neural networks, you’ll find the use cases in this book, and the accompanying notebooks on GitHub, useful. Basic knowledge of the Python programming language and machine learning is all you need to get started with this book.