Author: R. Harald Baayen
Publisher: Springer Science & Business Media
ISBN: 9780792370178
Category : Computers
Languages : en
Pages : 374
Book Description
This book is a comprehensive introduction to the statistical analysis of word frequency distributions, intended for computational linguists, corpus linguists, psycholinguists, and researchers in the field of quantitative stylistics. It aims to make these techniques more accessible for non-specialists, both theoretically, by means of a careful introduction to the underlying probabilistic and statistical concepts, and practically, by providing a program library implementing the main models for word frequency distributions.
Word Frequency Distributions
Author: R. Harald Baayen
Publisher: Springer Science & Business Media
ISBN: 9780792370178
Category : Computers
Languages : en
Pages : 374
Book Description
This book is a comprehensive introduction to the statistical analysis of word frequency distributions, intended for computational linguists, corpus linguists, psycholinguists, and researchers in the field of quantitative stylistics. It aims to make these techniques more accessible for non-specialists, both theoretically, by means of a careful introduction to the underlying probabilistic and statistical concepts, and practically, by providing a program library implementing the main models for word frequency distributions.
Publisher: Springer Science & Business Media
ISBN: 9780792370178
Category : Computers
Languages : en
Pages : 374
Book Description
This book is a comprehensive introduction to the statistical analysis of word frequency distributions, intended for computational linguists, corpus linguists, psycholinguists, and researchers in the field of quantitative stylistics. It aims to make these techniques more accessible for non-specialists, both theoretically, by means of a careful introduction to the underlying probabilistic and statistical concepts, and practically, by providing a program library implementing the main models for word frequency distributions.
What's in a Word-list?
Author: Dawn Archer
Publisher: Routledge
ISBN: 1134761481
Category : Language Arts & Disciplines
Languages : en
Pages : 214
Book Description
The frequency with which particular words are used in a text can tell us something meaningful both about that text and also about its author because their choice of words is seldom random. Focusing on the most frequent lexical items of a number of generated word frequency lists can help us to determine whether all the texts are written by the same author. Alternatively, they might wish to determine whether the most frequent words of a given text (captured by its word frequency list) are suggestive of potentially meaningful patterns that could have been overlooked had the text been read manually. This edited collection brings together cutting-edge research written by leading experts in the field on the construction of word-lists for the analysis of both frequency and keyword usage. Taken together, these papers provide a comprehensive and up-to-date survey of the most exciting research being conducted in this subject.
Publisher: Routledge
ISBN: 1134761481
Category : Language Arts & Disciplines
Languages : en
Pages : 214
Book Description
The frequency with which particular words are used in a text can tell us something meaningful both about that text and also about its author because their choice of words is seldom random. Focusing on the most frequent lexical items of a number of generated word frequency lists can help us to determine whether all the texts are written by the same author. Alternatively, they might wish to determine whether the most frequent words of a given text (captured by its word frequency list) are suggestive of potentially meaningful patterns that could have been overlooked had the text been read manually. This edited collection brings together cutting-edge research written by leading experts in the field on the construction of word-lists for the analysis of both frequency and keyword usage. Taken together, these papers provide a comprehensive and up-to-date survey of the most exciting research being conducted in this subject.
Word Frequency Studies
Author: Ioan-IoviÈ› Popescu
Publisher: Walter de Gruyter
ISBN: 3110218526
Category : Electronic books
Languages : en
Pages : 291
Book Description
The present book finds and collects absolutely new aspects of word frequency. First, eminent characteristics (such as the h-point, first used in scientometrics, the k-, m-, and n-points) are introduced - it can be shown that the geometry of word frequency is fundamentally based on them. Furthermore, various indicators of text properties are proposed for the first time, such as thematic concentration, autosemantic text compactness, autosemantic density, etc. In detail, the autosemantic structure of a given text is evaluated by means of a graph representation and its properties (according to a problem from network research). Special emphasis is given to the part-of-speech differentiation, which plays a significant role in stylistics. On the basis of a general theory, which has been developed especially for linguistic research, problems of the frequency structure of texts with respect to word occurrence are investigated and discussed in detail. Methodologically, specific reference is made to synergetic linguistics, including some exemplary analyses, showing that there are points of contact with this field. A separate chapter is dedicated to within-sentence word position; this issue considers grammar as well as language genesis; another chapter is dedicated to the type-token ratio, discussing all established methods and their relevance for word frequency analysis. All methods presented in the book are statistically tested; to this end, some new tests have been developed. All procedures and calculations are conducted for 20 languages, ranging from Polynesia, Indonesia, India, and Europe to a North American Indian language. The broad distribution of the data and texts from all genres allows generalizations with respect to language typology.
Publisher: Walter de Gruyter
ISBN: 3110218526
Category : Electronic books
Languages : en
Pages : 291
Book Description
The present book finds and collects absolutely new aspects of word frequency. First, eminent characteristics (such as the h-point, first used in scientometrics, the k-, m-, and n-points) are introduced - it can be shown that the geometry of word frequency is fundamentally based on them. Furthermore, various indicators of text properties are proposed for the first time, such as thematic concentration, autosemantic text compactness, autosemantic density, etc. In detail, the autosemantic structure of a given text is evaluated by means of a graph representation and its properties (according to a problem from network research). Special emphasis is given to the part-of-speech differentiation, which plays a significant role in stylistics. On the basis of a general theory, which has been developed especially for linguistic research, problems of the frequency structure of texts with respect to word occurrence are investigated and discussed in detail. Methodologically, specific reference is made to synergetic linguistics, including some exemplary analyses, showing that there are points of contact with this field. A separate chapter is dedicated to within-sentence word position; this issue considers grammar as well as language genesis; another chapter is dedicated to the type-token ratio, discussing all established methods and their relevance for word frequency analysis. All methods presented in the book are statistically tested; to this end, some new tests have been developed. All procedures and calculations are conducted for 20 languages, ranging from Polynesia, Indonesia, India, and Europe to a North American Indian language. The broad distribution of the data and texts from all genres allows generalizations with respect to language typology.
Natural Language Processing with Python
Author: Steven Bird
Publisher: "O'Reilly Media, Inc."
ISBN: 0596555717
Category : Computers
Languages : en
Pages : 506
Book Description
This book offers a highly accessible introduction to natural language processing, the field that supports a variety of language technologies, from predictive text and email filtering to automatic summarization and translation. With it, you'll learn how to write Python programs that work with large collections of unstructured text. You'll access richly annotated datasets using a comprehensive range of linguistic data structures, and you'll understand the main algorithms for analyzing the content and structure of written communication. Packed with examples and exercises, Natural Language Processing with Python will help you: Extract information from unstructured text, either to guess the topic or identify "named entities" Analyze linguistic structure in text, including parsing and semantic analysis Access popular linguistic databases, including WordNet and treebanks Integrate techniques drawn from fields as diverse as linguistics and artificial intelligence This book will help you gain practical skills in natural language processing using the Python programming language and the Natural Language Toolkit (NLTK) open source library. If you're interested in developing web applications, analyzing multilingual news sources, or documenting endangered languages -- or if you're simply curious to have a programmer's perspective on how human language works -- you'll find Natural Language Processing with Python both fascinating and immensely useful.
Publisher: "O'Reilly Media, Inc."
ISBN: 0596555717
Category : Computers
Languages : en
Pages : 506
Book Description
This book offers a highly accessible introduction to natural language processing, the field that supports a variety of language technologies, from predictive text and email filtering to automatic summarization and translation. With it, you'll learn how to write Python programs that work with large collections of unstructured text. You'll access richly annotated datasets using a comprehensive range of linguistic data structures, and you'll understand the main algorithms for analyzing the content and structure of written communication. Packed with examples and exercises, Natural Language Processing with Python will help you: Extract information from unstructured text, either to guess the topic or identify "named entities" Analyze linguistic structure in text, including parsing and semantic analysis Access popular linguistic databases, including WordNet and treebanks Integrate techniques drawn from fields as diverse as linguistics and artificial intelligence This book will help you gain practical skills in natural language processing using the Python programming language and the Natural Language Toolkit (NLTK) open source library. If you're interested in developing web applications, analyzing multilingual news sources, or documenting endangered languages -- or if you're simply curious to have a programmer's perspective on how human language works -- you'll find Natural Language Processing with Python both fascinating and immensely useful.
Word Knowledge and Word Usage
Author: Vito Pirrelli
Publisher: Walter de Gruyter GmbH & Co KG
ISBN: 3110432447
Category : Language Arts & Disciplines
Languages : en
Pages : 670
Book Description
Word storage and processing define a multi-factorial domain of scientific inquiry whose thorough investigation goes well beyond the boundaries of traditional disciplinary taxonomies, to require synergic integration of a wide range of methods, techniques and empirical and experimental findings. The present book intends to approach a few central issues concerning the organization, structure and functioning of the Mental Lexicon, by asking domain experts to look at common, central topics from complementary standpoints, and discuss the advantages of developing converging perspectives. The book will explore the connections between computational and algorithmic models of the mental lexicon, word frequency distributions and information theoretical measures of word families, statistical correlations across psycho-linguistic and cognitive evidence, principles of machine learning and integrative brain models of word storage and processing. Main goal of the book will be to map out the landscape of future research in this area, to foster the development of interdisciplinary curricula and help single-domain specialists understand and address issues and questions as they are raised in other disciplines.
Publisher: Walter de Gruyter GmbH & Co KG
ISBN: 3110432447
Category : Language Arts & Disciplines
Languages : en
Pages : 670
Book Description
Word storage and processing define a multi-factorial domain of scientific inquiry whose thorough investigation goes well beyond the boundaries of traditional disciplinary taxonomies, to require synergic integration of a wide range of methods, techniques and empirical and experimental findings. The present book intends to approach a few central issues concerning the organization, structure and functioning of the Mental Lexicon, by asking domain experts to look at common, central topics from complementary standpoints, and discuss the advantages of developing converging perspectives. The book will explore the connections between computational and algorithmic models of the mental lexicon, word frequency distributions and information theoretical measures of word families, statistical correlations across psycho-linguistic and cognitive evidence, principles of machine learning and integrative brain models of word storage and processing. Main goal of the book will be to map out the landscape of future research in this area, to foster the development of interdisciplinary curricula and help single-domain specialists understand and address issues and questions as they are raised in other disciplines.
Text Mining with R
Author: Julia Silge
Publisher: "O'Reilly Media, Inc."
ISBN: 1491981628
Category : Computers
Languages : en
Pages : 193
Book Description
Chapter 7. Case Study : Comparing Twitter Archives; Getting the Data and Distribution of Tweets; Word Frequencies; Comparing Word Usage; Changes in Word Use; Favorites and Retweets; Summary; Chapter 8. Case Study : Mining NASA Metadata; How Data Is Organized at NASA; Wrangling and Tidying the Data; Some Initial Simple Exploration; Word Co-ocurrences and Correlations; Networks of Description and Title Words; Networks of Keywords; Calculating tf-idf for the Description Fields; What Is tf-idf for the Description Field Words?; Connecting Description Fields to Keywords; Topic Modeling.
Publisher: "O'Reilly Media, Inc."
ISBN: 1491981628
Category : Computers
Languages : en
Pages : 193
Book Description
Chapter 7. Case Study : Comparing Twitter Archives; Getting the Data and Distribution of Tweets; Word Frequencies; Comparing Word Usage; Changes in Word Use; Favorites and Retweets; Summary; Chapter 8. Case Study : Mining NASA Metadata; How Data Is Organized at NASA; Wrangling and Tidying the Data; Some Initial Simple Exploration; Word Co-ocurrences and Correlations; Networks of Description and Title Words; Networks of Keywords; Calculating tf-idf for the Description Fields; What Is tf-idf for the Description Field Words?; Connecting Description Fields to Keywords; Topic Modeling.
The American Heritage Word Frequency Book
Author: John Bissell Carroll
Publisher:
ISBN:
Category : Language Arts & Disciplines
Languages : en
Pages : 924
Book Description
Publisher:
ISBN:
Category : Language Arts & Disciplines
Languages : en
Pages : 924
Book Description
Statistics in Corpus Linguistics
Author: Vaclav Brezina
Publisher: Cambridge University Press
ISBN: 1107125707
Category : Foreign Language Study
Languages : en
Pages : 317
Book Description
A comprehensive and accessible introduction to statistics in corpus linguistics, covering multiple techniques of quantitative language analysis and data visualisation.
Publisher: Cambridge University Press
ISBN: 1107125707
Category : Foreign Language Study
Languages : en
Pages : 317
Book Description
A comprehensive and accessible introduction to statistics in corpus linguistics, covering multiple techniques of quantitative language analysis and data visualisation.
The Psycho-Biology Of Language
Author: George Kingsley Zipf
Publisher: Routledge
ISBN: 1136310533
Category : Medical
Languages : en
Pages : 360
Book Description
This is Volume XXI in a series of twenty-one on the Cognitive Psychology. Orignally published in 1936, this is a study on the introduction to Dynamic Philology.
Publisher: Routledge
ISBN: 1136310533
Category : Medical
Languages : en
Pages : 360
Book Description
This is Volume XXI in a series of twenty-one on the Cognitive Psychology. Orignally published in 1936, this is a study on the introduction to Dynamic Philology.
Encyclopedia of Survey Research Methods
Author: Paul J. Lavrakas
Publisher: SAGE Publications
ISBN: 150631788X
Category : Social Science
Languages : en
Pages : 1073
Book Description
To the uninformed, surveys appear to be an easy type of research to design and conduct, but when students and professionals delve deeper, they encounter the vast complexities that the range and practice of survey methods present. To complicate matters, technology has rapidly affected the way surveys can be conducted; today, surveys are conducted via cell phone, the Internet, email, interactive voice response, and other technology-based modes. Thus, students, researchers, and professionals need both a comprehensive understanding of these complexities and a revised set of tools to meet the challenges. In conjunction with top survey researchers around the world and with Nielsen Media Research serving as the corporate sponsor, the Encyclopedia of Survey Research Methods presents state-of-the-art information and methodological examples from the field of survey research. Although there are other "how-to" guides and references texts on survey research, none is as comprehensive as this Encyclopedia, and none presents the material in such a focused and approachable manner. With more than 600 entries, this resource uses a Total Survey Error perspective that considers all aspects of possible survey error from a cost-benefit standpoint. Key Features Covers all major facets of survey research methodology, from selecting the sample design and the sampling frame, designing and pretesting the questionnaire, data collection, and data coding, to the thorny issues surrounding diminishing response rates, confidentiality, privacy, informed consent and other ethical issues, data weighting, and data analyses Presents a Reader′s Guide to organize entries around themes or specific topics and easily guide users to areas of interest Offers cross-referenced terms, a brief listing of Further Readings, and stable Web site URLs following most entries The Encyclopedia of Survey Research Methods is specifically written to appeal to beginning, intermediate, and advanced students, practitioners, researchers, consultants, and consumers of survey-based information.
Publisher: SAGE Publications
ISBN: 150631788X
Category : Social Science
Languages : en
Pages : 1073
Book Description
To the uninformed, surveys appear to be an easy type of research to design and conduct, but when students and professionals delve deeper, they encounter the vast complexities that the range and practice of survey methods present. To complicate matters, technology has rapidly affected the way surveys can be conducted; today, surveys are conducted via cell phone, the Internet, email, interactive voice response, and other technology-based modes. Thus, students, researchers, and professionals need both a comprehensive understanding of these complexities and a revised set of tools to meet the challenges. In conjunction with top survey researchers around the world and with Nielsen Media Research serving as the corporate sponsor, the Encyclopedia of Survey Research Methods presents state-of-the-art information and methodological examples from the field of survey research. Although there are other "how-to" guides and references texts on survey research, none is as comprehensive as this Encyclopedia, and none presents the material in such a focused and approachable manner. With more than 600 entries, this resource uses a Total Survey Error perspective that considers all aspects of possible survey error from a cost-benefit standpoint. Key Features Covers all major facets of survey research methodology, from selecting the sample design and the sampling frame, designing and pretesting the questionnaire, data collection, and data coding, to the thorny issues surrounding diminishing response rates, confidentiality, privacy, informed consent and other ethical issues, data weighting, and data analyses Presents a Reader′s Guide to organize entries around themes or specific topics and easily guide users to areas of interest Offers cross-referenced terms, a brief listing of Further Readings, and stable Web site URLs following most entries The Encyclopedia of Survey Research Methods is specifically written to appeal to beginning, intermediate, and advanced students, practitioners, researchers, consultants, and consumers of survey-based information.