Author: Joachim Diederich
Publisher: Springer Science & Business Media
ISBN: 3540753893
Category : Mathematics
Languages : en
Pages : 267
Book Description
Support vector machines (SVMs) are one of the most active research areas in machine learning. SVMs have shown good performance in a number of applications, including text and image classification. However, the learning capability of SVMs comes at a cost – an inherent inability to explain in a comprehensible form, the process by which a learning result was reached. Hence, the situation is similar to neural networks, where the apparent lack of an explanation capability has led to various approaches aiming at extracting symbolic rules from neural networks. For SVMs to gain a wider degree of acceptance in fields such as medical diagnosis and security sensitive areas, it is desirable to offer an explanation capability. User explanation is often a legal requirement, because it is necessary to explain how a decision was reached or why it was made. This book provides an overview of the field and introduces a number of different approaches to extracting rules from support vector machines developed by key researchers. In addition, successful applications are outlined and future research opportunities are discussed. The book is an important reference for researchers and graduate students, and since it provides an introduction to the topic, it will be important in the classroom as well. Because of the significance of both SVMs and user explanation, the book is of relevance to data mining practitioners and data analysts.
Rule Extraction from Support Vector Machines
Author: Joachim Diederich
Publisher: Springer Science & Business Media
ISBN: 3540753893
Category : Mathematics
Languages : en
Pages : 267
Book Description
Support vector machines (SVMs) are one of the most active research areas in machine learning. SVMs have shown good performance in a number of applications, including text and image classification. However, the learning capability of SVMs comes at a cost – an inherent inability to explain in a comprehensible form, the process by which a learning result was reached. Hence, the situation is similar to neural networks, where the apparent lack of an explanation capability has led to various approaches aiming at extracting symbolic rules from neural networks. For SVMs to gain a wider degree of acceptance in fields such as medical diagnosis and security sensitive areas, it is desirable to offer an explanation capability. User explanation is often a legal requirement, because it is necessary to explain how a decision was reached or why it was made. This book provides an overview of the field and introduces a number of different approaches to extracting rules from support vector machines developed by key researchers. In addition, successful applications are outlined and future research opportunities are discussed. The book is an important reference for researchers and graduate students, and since it provides an introduction to the topic, it will be important in the classroom as well. Because of the significance of both SVMs and user explanation, the book is of relevance to data mining practitioners and data analysts.
Publisher: Springer Science & Business Media
ISBN: 3540753893
Category : Mathematics
Languages : en
Pages : 267
Book Description
Support vector machines (SVMs) are one of the most active research areas in machine learning. SVMs have shown good performance in a number of applications, including text and image classification. However, the learning capability of SVMs comes at a cost – an inherent inability to explain in a comprehensible form, the process by which a learning result was reached. Hence, the situation is similar to neural networks, where the apparent lack of an explanation capability has led to various approaches aiming at extracting symbolic rules from neural networks. For SVMs to gain a wider degree of acceptance in fields such as medical diagnosis and security sensitive areas, it is desirable to offer an explanation capability. User explanation is often a legal requirement, because it is necessary to explain how a decision was reached or why it was made. This book provides an overview of the field and introduces a number of different approaches to extracting rules from support vector machines developed by key researchers. In addition, successful applications are outlined and future research opportunities are discussed. The book is an important reference for researchers and graduate students, and since it provides an introduction to the topic, it will be important in the classroom as well. Because of the significance of both SVMs and user explanation, the book is of relevance to data mining practitioners and data analysts.
Rule Extraction from Support Vector MacHine
Author: Mohammed Farquad
Publisher: GRIN Verlag
ISBN: 365618965X
Category : Computers
Languages : en
Pages : 261
Book Description
Doctoral Thesis / Dissertation from the year 2010 in the subject Computer Science - Applied, grade: none, course: Department of Computers and Information Sciences - Ph.D., language: English, abstract: Although Support Vector Machines have been used to develop highly accurate classification and regression models in various real-world problem domains, the most significant barrier is that SVM generates black box model that is difficult to understand. The procedure to convert these opaque models into transparent models is called rule extraction. This thesis investigates the task of extracting comprehensible models from trained SVMs, thereby alleviating this limitation. The primary contribution of the thesis is the proposal of various algorithms to overcome the significant limitations of SVM by taking a novel approach to the task of extracting comprehensible models. The basic contribution of the thesis are systematic review of literature on rule extraction from SVM, identifying gaps in the literature and proposing novel approaches for addressing the gaps. The contributions are grouped under three classes, decompositional, pedagogical and eclectic/hybrid approaches. Decompositional approach is closely intertwined with the internal workings of the SVM. Pedagogical approach uses SVM as an oracle to re-label training examples as well as artificially generated examples. In the eclectic/hybrid approach, a combination of these two methods is adopted. The thesis addresses various problems from the finance domain such as bankruptcy prediction in banks/firms, churn prediction in analytical CRM and Insurance fraud detection. Apart from this various benchmark datasets such as iris, wine and WBC for classification problems and auto MPG, body fat, Boston housing, forest fires and pollution for regression problems are also tested using the proposed appraoch. In addition, rule extraction from unbalanced datasets as well as from active learning based approaches has been explored. For
Publisher: GRIN Verlag
ISBN: 365618965X
Category : Computers
Languages : en
Pages : 261
Book Description
Doctoral Thesis / Dissertation from the year 2010 in the subject Computer Science - Applied, grade: none, course: Department of Computers and Information Sciences - Ph.D., language: English, abstract: Although Support Vector Machines have been used to develop highly accurate classification and regression models in various real-world problem domains, the most significant barrier is that SVM generates black box model that is difficult to understand. The procedure to convert these opaque models into transparent models is called rule extraction. This thesis investigates the task of extracting comprehensible models from trained SVMs, thereby alleviating this limitation. The primary contribution of the thesis is the proposal of various algorithms to overcome the significant limitations of SVM by taking a novel approach to the task of extracting comprehensible models. The basic contribution of the thesis are systematic review of literature on rule extraction from SVM, identifying gaps in the literature and proposing novel approaches for addressing the gaps. The contributions are grouped under three classes, decompositional, pedagogical and eclectic/hybrid approaches. Decompositional approach is closely intertwined with the internal workings of the SVM. Pedagogical approach uses SVM as an oracle to re-label training examples as well as artificially generated examples. In the eclectic/hybrid approach, a combination of these two methods is adopted. The thesis addresses various problems from the finance domain such as bankruptcy prediction in banks/firms, churn prediction in analytical CRM and Insurance fraud detection. Apart from this various benchmark datasets such as iris, wine and WBC for classification problems and auto MPG, body fat, Boston housing, forest fires and pollution for regression problems are also tested using the proposed appraoch. In addition, rule extraction from unbalanced datasets as well as from active learning based approaches has been explored. For
Rule Extraction from Support Vector Machine
Author: Mohammed Farquad
Publisher: GRIN Verlag
ISBN: 3656188084
Category : Computers
Languages : en
Pages : 260
Book Description
Doctoral Thesis / Dissertation from the year 2010 in the subject Computer Science - Applied, grade: none, , course: Department of Computers and Information Sciences - Ph.D., language: English, abstract: Although Support Vector Machines have been used to develop highly accurate classification and regression models in various real-world problem domains, the most significant barrier is that SVM generates black box model that is difficult to understand. The procedure to convert these opaque models into transparent models is called rule extraction. This thesis investigates the task of extracting comprehensible models from trained SVMs, thereby alleviating this limitation. The primary contribution of the thesis is the proposal of various algorithms to overcome the significant limitations of SVM by taking a novel approach to the task of extracting comprehensible models. The basic contribution of the thesis are systematic review of literature on rule extraction from SVM, identifying gaps in the literature and proposing novel approaches for addressing the gaps. The contributions are grouped under three classes, decompositional, pedagogical and eclectic/hybrid approaches. Decompositional approach is closely intertwined with the internal workings of the SVM. Pedagogical approach uses SVM as an oracle to re-label training examples as well as artificially generated examples. In the eclectic/hybrid approach, a combination of these two methods is adopted. The thesis addresses various problems from the finance domain such as bankruptcy prediction in banks/firms, churn prediction in analytical CRM and Insurance fraud detection. Apart from this various benchmark datasets such as iris, wine and WBC for classification problems and auto MPG, body fat, Boston housing, forest fires and pollution for regression problems are also tested using the proposed appraoch. In addition, rule extraction from unbalanced datasets as well as from active learning based approaches has been explored. For classification problems, various rule extraction methods such as FRBS, DT, ANFIS, CART and NBTree have been utilized. Additionally for regression problems, rule extraction methods such as ANFIS, DENFIS and CART have also been employed. Results are analyzed using accuracy, sensitivity, specificity, fidelity, AUC and t-test measures. Proposed approaches demonstrate their viability in extracting accurate, effective and comprehensible rule sets in various benchmark and real world problem domains across classification and regression problems. Future directions have been indicated to extend the approaches to newer variations of SVM as well as to other problem domains.
Publisher: GRIN Verlag
ISBN: 3656188084
Category : Computers
Languages : en
Pages : 260
Book Description
Doctoral Thesis / Dissertation from the year 2010 in the subject Computer Science - Applied, grade: none, , course: Department of Computers and Information Sciences - Ph.D., language: English, abstract: Although Support Vector Machines have been used to develop highly accurate classification and regression models in various real-world problem domains, the most significant barrier is that SVM generates black box model that is difficult to understand. The procedure to convert these opaque models into transparent models is called rule extraction. This thesis investigates the task of extracting comprehensible models from trained SVMs, thereby alleviating this limitation. The primary contribution of the thesis is the proposal of various algorithms to overcome the significant limitations of SVM by taking a novel approach to the task of extracting comprehensible models. The basic contribution of the thesis are systematic review of literature on rule extraction from SVM, identifying gaps in the literature and proposing novel approaches for addressing the gaps. The contributions are grouped under three classes, decompositional, pedagogical and eclectic/hybrid approaches. Decompositional approach is closely intertwined with the internal workings of the SVM. Pedagogical approach uses SVM as an oracle to re-label training examples as well as artificially generated examples. In the eclectic/hybrid approach, a combination of these two methods is adopted. The thesis addresses various problems from the finance domain such as bankruptcy prediction in banks/firms, churn prediction in analytical CRM and Insurance fraud detection. Apart from this various benchmark datasets such as iris, wine and WBC for classification problems and auto MPG, body fat, Boston housing, forest fires and pollution for regression problems are also tested using the proposed appraoch. In addition, rule extraction from unbalanced datasets as well as from active learning based approaches has been explored. For classification problems, various rule extraction methods such as FRBS, DT, ANFIS, CART and NBTree have been utilized. Additionally for regression problems, rule extraction methods such as ANFIS, DENFIS and CART have also been employed. Results are analyzed using accuracy, sensitivity, specificity, fidelity, AUC and t-test measures. Proposed approaches demonstrate their viability in extracting accurate, effective and comprehensible rule sets in various benchmark and real world problem domains across classification and regression problems. Future directions have been indicated to extend the approaches to newer variations of SVM as well as to other problem domains.
Rule-extraction from Support Vector Machines
Author: Nahla Hosny Barakat
Publisher:
ISBN:
Category : Artificial intelligence
Languages : en
Pages : 256
Book Description
Publisher:
ISBN:
Category : Artificial intelligence
Languages : en
Pages : 256
Book Description
Rule extraction from Support Vector Machines
Author: Lu Ren
Publisher:
ISBN:
Category :
Languages : en
Pages : 0
Book Description
Publisher:
ISBN:
Category :
Languages : en
Pages : 0
Book Description
Soft Computing for Knowledge Discovery and Data Mining
Author: Oded Maimon
Publisher: Springer Science & Business Media
ISBN: 038769935X
Category : Computers
Languages : en
Pages : 431
Book Description
Data Mining is the science and technology of exploring large and complex bodies of data in order to discover useful patterns. It is extremely important because it enables modeling and knowledge extraction from abundant data availability. This book introduces soft computing methods extending the envelope of problems that data mining can solve efficiently. It presents practical soft-computing approaches in data mining and includes various real-world case studies with detailed results.
Publisher: Springer Science & Business Media
ISBN: 038769935X
Category : Computers
Languages : en
Pages : 431
Book Description
Data Mining is the science and technology of exploring large and complex bodies of data in order to discover useful patterns. It is extremely important because it enables modeling and knowledge extraction from abundant data availability. This book introduces soft computing methods extending the envelope of problems that data mining can solve efficiently. It presents practical soft-computing approaches in data mining and includes various real-world case studies with detailed results.
Support Vector Machines
Author: Ingo Steinwart
Publisher: Springer Science & Business Media
ISBN: 0387772421
Category : Computers
Languages : en
Pages : 611
Book Description
Every mathematical discipline goes through three periods of development: the naive, the formal, and the critical. David Hilbert The goal of this book is to explain the principles that made support vector machines (SVMs) a successful modeling and prediction tool for a variety of applications. We try to achieve this by presenting the basic ideas of SVMs together with the latest developments and current research questions in a uni?ed style. In a nutshell, we identify at least three reasons for the success of SVMs: their ability to learn well with only a very small number of free parameters, their robustness against several types of model violations and outliers, and last but not least their computational e?ciency compared with several other methods. Although there are several roots and precursors of SVMs, these methods gained particular momentum during the last 15 years since Vapnik (1995, 1998) published his well-known textbooks on statistical learning theory with aspecialemphasisonsupportvectormachines. Sincethen,the?eldofmachine learninghaswitnessedintenseactivityinthestudyofSVMs,whichhasspread moreandmoretootherdisciplinessuchasstatisticsandmathematics. Thusit seems fair to say that several communities are currently working on support vector machines and on related kernel-based methods. Although there are many interactions between these communities, we think that there is still roomforadditionalfruitfulinteractionandwouldbegladifthistextbookwere found helpful in stimulating further research. Many of the results presented in this book have previously been scattered in the journal literature or are still under review. As a consequence, these results have been accessible only to a relativelysmallnumberofspecialists,sometimesprobablyonlytopeoplefrom one community but not the others.
Publisher: Springer Science & Business Media
ISBN: 0387772421
Category : Computers
Languages : en
Pages : 611
Book Description
Every mathematical discipline goes through three periods of development: the naive, the formal, and the critical. David Hilbert The goal of this book is to explain the principles that made support vector machines (SVMs) a successful modeling and prediction tool for a variety of applications. We try to achieve this by presenting the basic ideas of SVMs together with the latest developments and current research questions in a uni?ed style. In a nutshell, we identify at least three reasons for the success of SVMs: their ability to learn well with only a very small number of free parameters, their robustness against several types of model violations and outliers, and last but not least their computational e?ciency compared with several other methods. Although there are several roots and precursors of SVMs, these methods gained particular momentum during the last 15 years since Vapnik (1995, 1998) published his well-known textbooks on statistical learning theory with aspecialemphasisonsupportvectormachines. Sincethen,the?eldofmachine learninghaswitnessedintenseactivityinthestudyofSVMs,whichhasspread moreandmoretootherdisciplinessuchasstatisticsandmathematics. Thusit seems fair to say that several communities are currently working on support vector machines and on related kernel-based methods. Although there are many interactions between these communities, we think that there is still roomforadditionalfruitfulinteractionandwouldbegladifthistextbookwere found helpful in stimulating further research. Many of the results presented in this book have previously been scattered in the journal literature or are still under review. As a consequence, these results have been accessible only to a relativelysmallnumberofspecialists,sometimesprobablyonlytopeoplefrom one community but not the others.
Knowledge Discovery with Support Vector Machines
Author: Lutz H. Hamel
Publisher: John Wiley & Sons
ISBN: 1118211030
Category : Computers
Languages : en
Pages : 211
Book Description
An easy-to-follow introduction to support vector machines This book provides an in-depth, easy-to-follow introduction to support vector machines drawing only from minimal, carefully motivated technical and mathematical background material. It begins with a cohesive discussion of machine learning and goes on to cover: Knowledge discovery environments Describing data mathematically Linear decision surfaces and functions Perceptron learning Maximum margin classifiers Support vector machines Elements of statistical learning theory Multi-class classification Regression with support vector machines Novelty detection Complemented with hands-on exercises, algorithm descriptions, and data sets, Knowledge Discovery with Support Vector Machines is an invaluable textbook for advanced undergraduate and graduate courses. It is also an excellent tutorial on support vector machines for professionals who are pursuing research in machine learning and related areas.
Publisher: John Wiley & Sons
ISBN: 1118211030
Category : Computers
Languages : en
Pages : 211
Book Description
An easy-to-follow introduction to support vector machines This book provides an in-depth, easy-to-follow introduction to support vector machines drawing only from minimal, carefully motivated technical and mathematical background material. It begins with a cohesive discussion of machine learning and goes on to cover: Knowledge discovery environments Describing data mathematically Linear decision surfaces and functions Perceptron learning Maximum margin classifiers Support vector machines Elements of statistical learning theory Multi-class classification Regression with support vector machines Novelty detection Complemented with hands-on exercises, algorithm descriptions, and data sets, Knowledge Discovery with Support Vector Machines is an invaluable textbook for advanced undergraduate and graduate courses. It is also an excellent tutorial on support vector machines for professionals who are pursuing research in machine learning and related areas.
Rough Sets, Fuzzy Sets, Data Mining and Granular Computing
Author: Hiroshi Sakai
Publisher: Springer
ISBN: 3642106463
Category : Computers
Languages : en
Pages : 539
Book Description
This book constitutes the refereed proceedings of the 12th International Conference on Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing, RSFDGrC 2009, held in Delhi, India in December 2009 in conjunction with the Third International Conference on Pattern Recognition and Machine Intelligence, PReMI 2009. RSFDGrC 2009 is the core component of a broader Rough Set Year in India initiative, RSIndia09. The 56 revised full papers presented together with 6 invited papers and a report on the Rough Set Year in India 2009 project were carefully reviewed and selected from a total of 130 submissions. The papers are organized in topical sections on foundations of rough sets and beyond; rought set algorithms and applications; fuzzy set foundations and applications; data mining and knowledge discovery; clustering and current trends in computing; and information retrieval and text mining.
Publisher: Springer
ISBN: 3642106463
Category : Computers
Languages : en
Pages : 539
Book Description
This book constitutes the refereed proceedings of the 12th International Conference on Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing, RSFDGrC 2009, held in Delhi, India in December 2009 in conjunction with the Third International Conference on Pattern Recognition and Machine Intelligence, PReMI 2009. RSFDGrC 2009 is the core component of a broader Rough Set Year in India initiative, RSIndia09. The 56 revised full papers presented together with 6 invited papers and a report on the Rough Set Year in India 2009 project were carefully reviewed and selected from a total of 130 submissions. The papers are organized in topical sections on foundations of rough sets and beyond; rought set algorithms and applications; fuzzy set foundations and applications; data mining and knowledge discovery; clustering and current trends in computing; and information retrieval and text mining.
Comprehensible Credit Scoring Models Using Rule Extraction from Support Vector Machines
Author: David Martens
Publisher:
ISBN:
Category :
Languages : en
Pages : 21
Book Description
In recent years, Support Vector Machines (SVMs) were successfully applied to a wide range of applications. Their good performance is achieved by an implicit non-linear transformation of the original problem to a high-dimensional (possibly infinite) feature space in which a linear decision hyperplane is constructed that yields a nonlinear classifier in the input space. However, since the classifier is described as a complex mathematical function, it is rather incomprehensible for humans. This opacity property prevents them from being used in many real- life applications where both accuracy and comprehensibility are required, such as medical diagnosis and credit risk evaluation. To overcome this limitation, rules can be extracted from the trained SVM that are interpretable by humans and keep as much of the accuracy of the SVM as possible. In this paper, we will provide an overview of the recently proposed rule extraction techniques for SVMs and introduce two others taken from the artificial neural networks domain, being Trepan and G-REX. The described techniques are compared using publicly avail- able datasets, such as Ripley's synthetic dataset and the multi-class iris dataset. We will also look at medical diagnosis and credit scoring where comprehensibility is a key requirement and even a regulatory recommendation. Our experiments show that the SVM rule extraction techniques lose only a small percentage in performance compared to SVMs and therefore rank at the top of comprehensible classification techniques.
Publisher:
ISBN:
Category :
Languages : en
Pages : 21
Book Description
In recent years, Support Vector Machines (SVMs) were successfully applied to a wide range of applications. Their good performance is achieved by an implicit non-linear transformation of the original problem to a high-dimensional (possibly infinite) feature space in which a linear decision hyperplane is constructed that yields a nonlinear classifier in the input space. However, since the classifier is described as a complex mathematical function, it is rather incomprehensible for humans. This opacity property prevents them from being used in many real- life applications where both accuracy and comprehensibility are required, such as medical diagnosis and credit risk evaluation. To overcome this limitation, rules can be extracted from the trained SVM that are interpretable by humans and keep as much of the accuracy of the SVM as possible. In this paper, we will provide an overview of the recently proposed rule extraction techniques for SVMs and introduce two others taken from the artificial neural networks domain, being Trepan and G-REX. The described techniques are compared using publicly avail- able datasets, such as Ripley's synthetic dataset and the multi-class iris dataset. We will also look at medical diagnosis and credit scoring where comprehensibility is a key requirement and even a regulatory recommendation. Our experiments show that the SVM rule extraction techniques lose only a small percentage in performance compared to SVMs and therefore rank at the top of comprehensible classification techniques.