Cross-Language Information Retrieval

Cross-Language Information Retrieval PDF Author: Jian-Yun Nie
Publisher: Springer Nature
ISBN: 303102138X
Category : Computers
Languages : en
Pages : 125

Book Description
Search for information is no longer limited to the user's native language; it increasingly extends to other languages. This gives rise to the problem of cross-language information retrieval (CLIR), whose goal is to find relevant information written in a language different from that of the query. In addition to the problems of monolingual information retrieval (IR), translation is the key problem in CLIR: one should translate either the query or the documents from one language to another. However, this translation problem is not identical to full-text machine translation (MT): the goal is not to produce a human-readable translation, but a translation suitable for finding relevant documents. Specific translation methods are thus required. The goal of this book is to provide a comprehensive description of the specific problems arising in CLIR, the solutions proposed in this area, and the problems that remain. The book starts with a general description of the monolingual IR and CLIR problems. Different classes of approaches to translation are then presented: approaches using an MT system, dictionary-based translation, and approaches based on parallel and comparable corpora. In addition, the typical retrieval effectiveness of the different approaches is compared. It will be shown that translation approaches specifically designed for CLIR can rival and outperform high-quality MT systems. Finally, the book offers a look into the future that draws a strong parallel between query expansion in monolingual IR and query translation in CLIR, suggesting that many approaches developed in monolingual IR can be adapted to CLIR. The book can be used as an introduction to CLIR. Advanced readers can also find more technical details and discussions of the remaining research challenges. It is suitable for new researchers who intend to carry out research on CLIR.
Table of Contents: Preface / Introduction / Using Manually Constructed Translation Systems and Resources for CLIR / Translation Based on Parallel and Comparable Corpora / Other Methods to Improve CLIR / A Look into the Future: Toward a Unified View of Monolingual IR and CLIR? / References / Author Biography

Cross-Language Information Retrieval and Evaluation

Cross-Language Information Retrieval and Evaluation PDF Author: Cross-Language Evaluation Forum. Workshop
Publisher: Springer Science & Business Media
ISBN: 3540424466
Category : Computers
Languages : en
Pages : 396

Book Description
This book presents the thoroughly refereed post-proceedings of the international Cross-Language Evaluation Forum Workshop organized by the CLEF activity of the European DELOS Network of Excellence for Digital Libraries. The 25 revised papers presented together with an introduction were carefully selected based on two rounds of reviewing. All current aspects of cross-language information retrieval are addressed, ranging from foundational issues and systems evaluation to applications in a variety of fields.

Cross-Language Information Retrieval

Cross-Language Information Retrieval PDF Author: Gregory Grefenstette
Publisher: Springer Science & Business Media
ISBN: 1461556619
Category : Computers
Languages : en
Pages : 190

Book Description
Most of the papers in this volume were first presented at the Workshop on Cross-Linguistic Information Retrieval that was held August 22, 1996 during the SIGIR'96 Conference. Alan Smeaton of Dublin University and Paraic Sheridan of the ETH, Zurich, were the two other members of the Scientific Committee for this workshop. SIGIR is the Association for Computing Machinery (ACM) Special Interest Group on Information Retrieval, and they have held conferences yearly since 1977. Three additional papers have been added: Chapter 4 Distributed Cross-Lingual Information Retrieval describes the EMIR retrieval system, one of the first general cross-language systems to be implemented and evaluated; Chapter 6 Mapping Vocabularies Using Latent Semantic Indexing, which originally appeared as a technical report in the Laboratory for Computational Linguistics at Carnegie Mellon University in 1991, is included here because it was one of the earliest, though hard-to-find, publications showing the application of Latent Semantic Indexing to the problem of cross-language retrieval; and Chapter 10 A Weighted Boolean Model for Cross-Language Text Retrieval describes a recent approach to solving the translation term weighting problem, specific to Cross-Language Information Retrieval. Gregory Grefenstette
Contributors: Lisa Ballesteros and W. Bruce Croft (Center for Intelligent Information Retrieval, Computer Science Department, University of Massachusetts); David Hull and Gregory Grefenstette (Xerox Research Centre Europe, Grenoble Laboratory); Thomas K. Landauer (Department of Psychology and Institute of Cognitive Science, University of Colorado, Boulder); Mark W. Davis (Computing Research Lab, New Mexico State University); Michael L. Littman; Bonnie J.

Advances in Multilingual and Multimodal Information Retrieval

Advances in Multilingual and Multimodal Information Retrieval PDF Author: Valentin Jijkoun
Publisher: Springer
ISBN: 3540857605
Category : Computers
Languages : en
Pages : 942

Book Description
The eighth campaign of the Cross Language Evaluation Forum (CLEF) for European languages was held from January to September 2007. There were seven distinct evaluation tracks in CLEF 2007, designed to test the performance of a wide range of multilingual information access systems or system components. CLEF is by now an established international evaluation initiative and, in 2007, 81 groups from all over the world submitted results for one or more of the different evaluation tracks. Full details regarding the design of the tracks, the methodologies used for evaluation, and the results obtained by the participants can be found in the different sections of these proceedings. As always the results of the campaign were reported and discussed at the annual workshop, held in Budapest, Hungary, 19-21 September, immediately following the eleventh European Conference on Digital Libraries. The workshop plays an important role by providing the opportunity for all the groups that have participated in the evaluation campaign to get together to compare approaches and exchange ideas.

Introduction to Information Retrieval

Introduction to Information Retrieval PDF Author: Christopher D. Manning
Publisher: Cambridge University Press
ISBN: 1139472100
Category : Computers
Languages : en
Pages :

Book Description
Class-tested and coherent, this textbook teaches classical and web information retrieval, including web search and the related areas of text classification and text clustering from basic concepts. It gives an up-to-date treatment of all aspects of the design and implementation of systems for gathering, indexing, and searching documents; methods for evaluating systems; and an introduction to the use of machine learning methods on text collections. All the important ideas are explained using examples and figures, making it perfect for introductory courses in information retrieval for advanced undergraduates and graduate students in computer science. Based on feedback from extensive classroom experience, the book has been carefully structured in order to make teaching more natural and effective. Slides and additional exercises (with solutions for lecturers) are also available through the book's supporting website to help course instructors prepare their lectures.

Information Retrieval

Information Retrieval PDF Author: Ayse Goker
Publisher: John Wiley & Sons
ISBN: 9780470033630
Category : Technology & Engineering
Languages : en
Pages : 320

Book Description
This book is an essential reference to cutting-edge issues and future directions in information retrieval. Information retrieval (IR) can be defined as the process of representing, managing, searching, retrieving, and presenting information. Good IR involves understanding information needs and interests, developing an effective search technique, system, presentation, distribution and delivery. The increased use of the Web and wider availability of information in this environment led to the development of Web search engines. This change has brought fresh challenges to a wider variety of users’ needs, tasks, and types of information. Today, search engines are seen in enterprises, on laptops, in individual websites, in library catalogues, and elsewhere. Information Retrieval: Searching in the 21st Century focuses on core concepts, and current trends in the field. This book focuses on: Information Retrieval Models; User-centred Evaluation of Information Retrieval Systems; Multimedia Resource Discovery; Image Users’ Needs and Searching Behaviour; Web Information Retrieval; Mobile Search; Context and Information Retrieval; Text Categorisation and Genre in Information Retrieval; Semantic Search; The Role of Natural Language Processing in Information Retrieval: Search for Meaning and Structure; Cross-language Information Retrieval; Performance Issues in Parallel Computing for Information Retrieval. This book is an invaluable reference for graduate students on IR courses or courses in related disciplines (e.g. computer science, information science, human-computer interaction, and knowledge management), academic and industrial researchers, and industrial personnel tracking information search technology developments to understand the business implications. Intermediate-advanced level undergraduate students on IR or related courses will also find this text insightful. Chapters are supplemented with exercises to stimulate further thinking.

EuroWordNet: A multilingual database with lexical semantic networks

EuroWordNet: A multilingual database with lexical semantic networks PDF Author: Piek Vossen
Publisher: Springer Science & Business Media
ISBN: 9401714916
Category : Computers
Languages : en
Pages : 180

Book Description
This book describes the main objective of EuroWordNet, which is the building of a multilingual database with lexical semantic networks or wordnets for several European languages. Each wordnet in the database represents a language-specific structure due to the unique lexicalization of concepts in languages. The concepts are inter-linked via a separate Inter-Lingual-Index, where equivalent concepts across languages should share the same index item. The flexible multilingual design of the database makes it possible to compare the lexicalizations and semantic structures, revealing answers to fundamental linguistic and philosophical questions which could never be answered before. How consistent are lexical semantic networks across languages, what are the language-specific differences of these networks, is there a language-universal ontology, how much information can be shared across languages? First attempts to answer these questions are given in the form of a set of shared or common Base Concepts that has been derived from the separate wordnets and their classification by a language-neutral top-ontology. These Base Concepts play a fundamental role in several wordnets. Nevertheless, the database may also serve many practical needs with respect to (cross-language) information retrieval, machine translation tools, language generation tools and language learning tools, which are discussed in the final chapter. The book offers an excellent introduction to the EuroWordNet project for scholars in the field and raises many issues that set the directions for further research in semantics and knowledge engineering.

Language Modeling for Information Retrieval

Language Modeling for Information Retrieval PDF Author: W. Bruce Croft
Publisher: Springer Science & Business Media
ISBN: 9401701717
Category : Computers
Languages : en
Pages : 253

Book Description
A statistical language model, or more simply a language model, is a probabilistic mechanism for generating text. Such a definition is general enough to include an endless variety of schemes. However, a distinction should be made between generative models, which can in principle be used to synthesize artificial text, and discriminative techniques to classify text into predefined categories. The first statistical language modeler was Claude Shannon. In exploring the application of his newly founded theory of information to human language, Shannon considered language as a statistical source, and measured how well simple n-gram models predicted or, equivalently, compressed natural text. To do this, he estimated the entropy of English through experiments with human subjects, and also estimated the cross-entropy of the n-gram models on natural text. The ability of language models to be quantitatively evaluated in this way is one of their important virtues. Of course, estimating the true entropy of language is an elusive goal, aiming at many moving targets, since language is so varied and evolves so quickly. Yet fifty years after Shannon's study, language models remain, by all measures, far from the Shannon entropy limit in terms of their predictive power. However, this has not kept them from being useful for a variety of text processing tasks, and moreover can be viewed as encouragement that there is still great room for improvement in statistical language modeling.

Multimodal Interface for Human-machine Communication

Multimodal Interface for Human-machine Communication PDF Author: P. C. Yuen
Publisher: World Scientific
ISBN: 9789810245948
Category : Computers
Languages : en
Pages : 288

Book Description
With the advance of speech, image and video technology, human-computer interaction (HCI) will reach a new phase. In recent years, HCI has been extended to human-machine communication (HMC) and the perceptual user interface (PUI). The final goal in HMC is that the communication between humans and machines is similar to human-to-human communication. Moreover, the machine can support human-to-human communication (e.g. an interface for the disabled). For this reason, various aspects of human communication are to be considered in HMC. The HMC interface, called a multimodal interface, includes different types of input methods, such as natural language, gestures, face and handwriting characters. The nine papers in this book have been selected from the 92 high-quality papers constituting the proceedings of the 2nd International Conference on Multimodal Interface (ICMI '99), which was held in Hong Kong in 1999. The papers cover a wide spectrum of the multimodal interface.

Advances in Computer and Computational Sciences

Advances in Computer and Computational Sciences PDF Author: Sanjiv K. Bhatia
Publisher: Springer
ISBN: 9811037736
Category : Technology & Engineering
Languages : en
Pages : 689

Book Description
The exchange of information and innovative ideas is necessary to accelerate the development of technology. With the advent of technology, intelligent and soft computing techniques came into existence with a wide scope of implementation in the engineering sciences. With this in mind, the book brings together insights that reflect the ‘Advances in Computer and Computational Sciences’ from upcoming researchers and leading academicians across the globe. It contains high-quality peer-reviewed papers from the International Conference on Computer, Communication and Computational Sciences (ICCCCS 2016), held 12-13 August 2016 in Ajmer, India. These papers are arranged in the form of chapters. The content of the book is divided into two volumes that cover a variety of topics such as intelligent hardware and software design, advanced communications, power and energy optimization, intelligent techniques used in the internet of things, intelligent image processing, advanced software engineering, evolutionary and soft computing, security and many more. The book helps prospective readers from the computer industry and academia draw on advances in next-generation computer and communication technology and shape them into real-life applications.