Machine Learning Methods for Stylometry

Machine Learning Methods for Stylometry PDF Author: Jacques Savoy
Publisher: Springer Nature
ISBN: 3030533603
Category : Computers
Languages : en
Pages : 294

Get Book Here

Book Description
This book presents methods and approaches used to identify the true author of a doubtful document or text excerpt. It provides a broad introduction to all text categorization problems (like authorship attribution, psychological traits of the author, detecting fake news, etc.) grounded in stylistic features. Specifically, machine learning models as valuable tools for verifying hypotheses or revealing significant patterns hidden in datasets are presented in detail. Stylometry is a multi-disciplinary field combining linguistics with both statistics and computer science. The content is divided into three parts. The first, which consists of the first three chapters, offers a general introduction to stylometry, its potential applications and limitations. Further, it introduces the ongoing example used to illustrate the concepts discussed throughout the remainder of the book. The four chapters of the second part are more devoted to computer science with a focus on machine learning models. Their main aim is to explain machine learning models for solving stylometric problems. Several general strategies used to identify, extract, select, and represent stylistic markers are explained. As deep learning represents an active field of research, information on neural network models and word embeddings applied to stylometry is provided, as well as a general introduction to the deep learning approach to solving stylometric questions. In turn, the third part illustrates the application of the previously discussed approaches in real cases: an authorship attribution problem, seeking to discover the secret hand behind the nom de plume Elena Ferrante, an Italian writer known worldwide for her My Brilliant Friend’s saga; author profiling in order to identify whether a set of tweets were generated by a bot or a human being and in this second case, whether it is a man or a woman; and an exploration of stylistic variations over time using US political speeches covering a period of ca. 230 years. A solutions-based approach is adopted throughout the book, and explanations are supported by examples written in R. To complement the main content and discussions on stylometric models and techniques, examples and datasets are freely available at the author’s Github website.

Machine Learning Methods for Stylometry

Machine Learning Methods for Stylometry PDF Author: Jacques Savoy
Publisher: Springer Nature
ISBN: 3030533603
Category : Computers
Languages : en
Pages : 294

Get Book Here

Book Description
This book presents methods and approaches used to identify the true author of a doubtful document or text excerpt. It provides a broad introduction to all text categorization problems (like authorship attribution, psychological traits of the author, detecting fake news, etc.) grounded in stylistic features. Specifically, machine learning models as valuable tools for verifying hypotheses or revealing significant patterns hidden in datasets are presented in detail. Stylometry is a multi-disciplinary field combining linguistics with both statistics and computer science. The content is divided into three parts. The first, which consists of the first three chapters, offers a general introduction to stylometry, its potential applications and limitations. Further, it introduces the ongoing example used to illustrate the concepts discussed throughout the remainder of the book. The four chapters of the second part are more devoted to computer science with a focus on machine learning models. Their main aim is to explain machine learning models for solving stylometric problems. Several general strategies used to identify, extract, select, and represent stylistic markers are explained. As deep learning represents an active field of research, information on neural network models and word embeddings applied to stylometry is provided, as well as a general introduction to the deep learning approach to solving stylometric questions. In turn, the third part illustrates the application of the previously discussed approaches in real cases: an authorship attribution problem, seeking to discover the secret hand behind the nom de plume Elena Ferrante, an Italian writer known worldwide for her My Brilliant Friend’s saga; author profiling in order to identify whether a set of tweets were generated by a bot or a human being and in this second case, whether it is a man or a woman; and an exploration of stylistic variations over time using US political speeches covering a period of ca. 230 years. A solutions-based approach is adopted throughout the book, and explanations are supported by examples written in R. To complement the main content and discussions on stylometric models and techniques, examples and datasets are freely available at the author’s Github website.

Versification and Authorship Attribution

Versification and Authorship Attribution PDF Author: Petr Plecháč
Publisher: Charles University in Prague, Karolinum Press
ISBN: 8024648717
Category : Literary Criticism
Languages : en
Pages : 96

Get Book Here

Book Description
The technique known as contemporary stylometry uses different methods, including machine learning, to discover a poem’s author based on features like the frequencies of words and character n-grams. However, there is one potential textual fingerprint stylometry tends to ignore: versification, or the very making of language into verse. Using poetic texts in three different languages (Czech, German, and Spanish), Petr Plecháč asks whether versification features like rhythm patterns and types of rhyme can help determine authorship. He then tests its findings on two unsolved literary mysteries. In the first, Plecháč distinguishes the parts of the Elizabethan verse play The Two Noble Kinsmen written by William Shakespeare from those written by his coauthor, John Fletcher. In the second, he seeks to solve a case of suspected forgery: how authentic was a group of poems first published as the work of the nineteenth-century Russian author Gavriil Stepanovich Batenkov? This book of poetic investigation should appeal to literary sleuths the world over.

Authorship Attribution

Authorship Attribution PDF Author: Patrick Juola
Publisher: Now Publishers Inc
ISBN: 160198118X
Category : Authorship, Disputed
Languages : en
Pages : 116

Get Book Here

Book Description
Authorship Attribution surveys the history and present state of the discipline, presenting some comparative results where available. It also provides a theoretical and empirically-tested basis for further work. Many modern techniques are described and evaluated, along with some insights for application for novices and experts alike.

Computational Intelligence in Data Mining

Computational Intelligence in Data Mining PDF Author: Himansu Sekhar Behera
Publisher: Springer
ISBN: 9811038740
Category : Technology & Engineering
Languages : en
Pages : 825

Get Book Here

Book Description
The book presents high quality papers presented at the International Conference on Computational Intelligence in Data Mining (ICCIDM 2016) organized by School of Computer Engineering, Kalinga Institute of Industrial Technology (KIIT), Bhubaneswar, Odisha, India during December 10 – 11, 2016. The book disseminates the knowledge about innovative, active research directions in the field of data mining, machine and computational intelligence, along with current issues and applications of related topics. The volume aims to explicate and address the difficulties and challenges that of seamless integration of the two core disciplines of computer science.

Intelligent Systems Technologies and Applications

Intelligent Systems Technologies and Applications PDF Author: Sabu M. Thampi
Publisher: Springer
ISBN: 3319683853
Category : Technology & Engineering
Languages : en
Pages : 442

Get Book Here

Book Description
This book constitutes the thoroughly refereed post-conference proceedings of the third International Symposium on Intelligent Systems Technologies and Applications (ISTA’17), September 13-16, 2017, Manipal, Karnataka, India. All submissions were evaluated on the basis of their significance, novelty, and technical quality. This proceedings contains 34 papers selected for presentation at the Symposium.

Internet of Behaviors Implementation in Organizational Contexts

Internet of Behaviors Implementation in Organizational Contexts PDF Author: Carvalho, Luísa Cagica
Publisher: IGI Global
ISBN: 1668490412
Category : Computers
Languages : en
Pages : 494

Get Book Here

Book Description
Internet of behaviors (IoB), also known as the internet of behavior, emerged as a natural consequence of the internet of things (IoT) and artificial intelligence (AI). IoB is an area of investigation that compiles three fields of study: IoT, data analysis, and behavioral science. IoB seeks to explain the data obtained from a behavioral point of view, analyzing human interaction with technology and referring to the process by which user-controlled data is evaluated from a behavioral psychology perspective. Internet of Behaviors Implementation in Organizational Contexts explores internet of behaviors solutions that promote people's quality of life. This book explores and discusses, through innovative studies, case studies, systematic literature reviews, and reports. The content within this publication represents research encompassing the internet of behaviors, internet of things, big data, artificial intelligence, blockchain, smart cities, human-centric approach for digital technologies, ICT sustainability, and more. This vital reference source led by an editor with over two decades of experience is optimized for university professors, researchers, undergraduate and graduate level students, and business managers and professionals across several industries related to or utilizing the internet of things (IoT).

›Prometheus Bound‹ - A Separate Authorial Trace in the Aeschylean Corpus

›Prometheus Bound‹ - A Separate Authorial Trace in the Aeschylean Corpus PDF Author: Nikos Manousakis
Publisher: Walter de Gruyter GmbH & Co KG
ISBN: 3110687674
Category : Literary Criticism
Languages : en
Pages : 297

Get Book Here

Book Description
Classics, Computer Science, and Linguistics are brought together in this book, in an attempt to provide an answer to the authorship question concerning Prometheus Bound, a disputed play in the Aeschylean corpus, by applying some well-established Computer Stylistics methods. One of the main objectives of Stylometry, which, broadly speaking, is the study of quantified style, is Authorship Attribution. In its traditional form it can range from manually calculating descriptive statistics to the use of computer-assisted methodologies. However, non-traditional Authorship Attribution drastically changed the field. It brought together modern Linguistics and Artificial Intelligence applications (machine learning, natural language processing), and its key characteristic is that it aims at developing fully-automated systems for the attribution of texts of unknown authorship. In this book the author employs a series of supervised and unsupervised techniques used in non-traditional Authorship Attribution–applied here for the first time in ancient drama. The outcome of the analysis indicates a significant distance between the disputed text and the secure plays of Aeschylus, but also various interesting (micro-linguistic) ties of affinity with other authors, especially Sophocles and Euripides.

Computational Stylistics in Poetry, Prose, and Drama

Computational Stylistics in Poetry, Prose, and Drama PDF Author: Anne-Sophie Bories
Publisher: Walter de Gruyter GmbH & Co KG
ISBN: 3110781506
Category : Literary Criticism
Languages : en
Pages : 242

Get Book Here

Book Description
This volume responds to the current interest in computational and statistical methods to describe and analyse metre, style, and poeticity, particularly insofar as they can open up new research perspectives in literature, linguistics, and literary history. The contributions are representative of the diversity of approaches, methods, and goals of a thriving research community. Although most papers focus on written poetry, including computer-generated poetry, the volume also features analyses of spoken poetry, narrative prose, and drama. The contributions employ a variety of methods and techniques ranging from motif analysis, network analysis, machine learning, and Natural Language Processing. The volume pays particular attention to annotation, one of the most basic practices in computational stylistics. This contribution to the growing, dynamic field of digital literary studies will be useful to both students and scholars looking for an overview of current trends, relevant methods, and possible results, at a crucial moment in the development of novel approaches, when one needs to keep in mind the qualitative, hermeneutical benefit made possible by such quantitative efforts.

Computational Legal Studies

Computational Legal Studies PDF Author: Ryan Whalen
Publisher: Edward Elgar Publishing
ISBN: 1788977459
Category : Law
Languages : en
Pages : 375

Get Book Here

Book Description
Featuring contributions from a diverse set of experts, this thought-provoking book offers a visionary introduction to the computational turn in law and the resulting emergence of the computational legal studies field. It explores how computational data creation, collection, and analysis techniques are transforming the way in which we comprehend and study the law, and the implications that this has for the future of legal studies.

New Perspectives on Corpus Translation Studies

New Perspectives on Corpus Translation Studies PDF Author: Vincent X. Wang
Publisher: Springer Nature
ISBN: 9811649189
Category : Language Arts & Disciplines
Languages : en
Pages : 325

Get Book Here

Book Description
The book features recent attempts to construct corpora for specific purposes – e.g. multifactorial Dutch (parallel), Geasy Easy Language Corpus (intralingual), HK LegCo interpreting corpus – and showcases sophisticated and innovative corpus analysis methods. It proposes new approaches to address classical themes – i.e. translation pedagogy, translation norms and equivalence, principles of translation – and brings interdisciplinary perspectives – e.g. contrastive linguistics, cognition and metaphor studies – to cast new light. It is a timely reference for the researchers as well as postgraduate students who are interested in the applications of corpus technology to solving translation and interpreting problems.