Expérimentations et évaluations en fouille de textes : Un panorama des campagnes DEFT

Expérimentations et évaluations en fouille de textes : Un panorama des campagnes DEFT PDF Author: GROUIN Cyril
Publisher: Lavoisier
ISBN: 2746288362
Category :
Languages : en
Pages : 258

Get Book Here

Book Description
La fouille de textes est une activité combinant traitements informatiques et données linguistiques avec comme objectif principal l’extraction et l’organisation automatique des informations présentes dans les textes. Deux familles de méthodes permettent d’atteindre ce but : celles à base de connaissances d’experts et celles reposant sur un apprentissage automatique supervisé. Une campagne d’évaluation consiste à confronter les systèmes développés par plusieurs équipes sur un même jeu de données et en un temps limité. Créé en 2005 à l’image des campagnes anglo-saxonnes, le défi fouille de textes (DEFT) est aujourd’hui la seule campagne d’évaluation francophone en fouille de textes. Cet ouvrage rassemble les méthodes utilisées lors des différentes éditions du défi. Les thématiques relèvent de la classification de documents en genres et thèmes, de la fouille d’opinions et de l’identification de la période de parution d’un document.

Expérimentations et évaluations en fouille de textes : Un panorama des campagnes DEFT

Expérimentations et évaluations en fouille de textes : Un panorama des campagnes DEFT PDF Author: GROUIN Cyril
Publisher: Lavoisier
ISBN: 2746288362
Category :
Languages : en
Pages : 258

Get Book Here

Book Description
La fouille de textes est une activité combinant traitements informatiques et données linguistiques avec comme objectif principal l’extraction et l’organisation automatique des informations présentes dans les textes. Deux familles de méthodes permettent d’atteindre ce but : celles à base de connaissances d’experts et celles reposant sur un apprentissage automatique supervisé. Une campagne d’évaluation consiste à confronter les systèmes développés par plusieurs équipes sur un même jeu de données et en un temps limité. Créé en 2005 à l’image des campagnes anglo-saxonnes, le défi fouille de textes (DEFT) est aujourd’hui la seule campagne d’évaluation francophone en fouille de textes. Cet ouvrage rassemble les méthodes utilisées lors des différentes éditions du défi. Les thématiques relèvent de la classification de documents en genres et thèmes, de la fouille d’opinions et de l’identification de la période de parution d’un document.

Expérimentations et évaluations en fouille de textes

Expérimentations et évaluations en fouille de textes PDF Author: Cyril Grouin
Publisher:
ISBN: 9782746238367
Category :
Languages : fr
Pages : 248

Get Book Here

Book Description


European Language Grid

European Language Grid PDF Author: Georg Rehm
Publisher: Springer Nature
ISBN: 3031172582
Category : Computers
Languages : en
Pages : 380

Get Book Here

Book Description
This open access book provides an in-depth description of the EU project European Language Grid (ELG). Its motivation lies in the fact that Europe is a multilingual society with 24 official European Union Member State languages and dozens of additional languages including regional and minority languages. The only meaningful way to enable multilingualism and to benefit from this rich linguistic heritage is through Language Technologies (LT) including Natural Language Processing (NLP), Natural Language Understanding (NLU), Speech Technologies and language-centric Artificial Intelligence (AI) applications. The European Language Grid provides a single umbrella platform for the European LT community, including research and industry, effectively functioning as a virtual home, marketplace, showroom, and deployment centre for all services, tools, resources, products and organisations active in the field. Today the ELG cloud platform already offers access to more than 13,000 language processing tools and language resources. It enables all stakeholders to deposit, upload and deploy their technologies and datasets. The platform also supports the long-term objective of establishing digital language equality in Europe by 2030 – to create a situation in which all European languages enjoy equal technological support. This is the very first book dedicated to Language Technology and NLP platforms. Cloud technology has only recently matured enough to make the development of a platform like ELG feasible on a larger scale. The book comprehensively describes the results of the ELG project. Following an introduction, the content is divided into four main parts: (I) ELG Cloud Platform; (II) ELG Inventory of Technologies and Resources; (III) ELG Community and Initiative; and (IV) ELG Open Calls and Pilot Projects.

Text Analysis Pipelines

Text Analysis Pipelines PDF Author: Henning Wachsmuth
Publisher: Springer
ISBN: 3319257412
Category : Computers
Languages : en
Pages : 317

Get Book Here

Book Description
This monograph proposes a comprehensive and fully automatic approach to designing text analysis pipelines for arbitrary information needs that are optimal in terms of run-time efficiency and that robustly mine relevant information from text of any kind. Based on state-of-the-art techniques from machine learning and other areas of artificial intelligence, novel pipeline construction and execution algorithms are developed and implemented in prototypical software. Formal analyses of the algorithms and extensive empirical experiments underline that the proposed approach represents an essential step towards the ad-hoc use of text mining in web search and big data analytics. Both web search and big data analytics aim to fulfill peoples’ needs for information in an adhoc manner. The information sought for is often hidden in large amounts of natural language text. Instead of simply returning links to potentially relevant texts, leading search and analytics engines have started to directly mine relevant information from the texts. To this end, they execute text analysis pipelines that may consist of several complex information-extraction and text-classification stages. Due to practical requirements of efficiency and robustness, however, the use of text mining has so far been limited to anticipated information needs that can be fulfilled with rather simple, manually constructed pipelines.

String Processing and Information Retrieval

String Processing and Information Retrieval PDF Author: Roberto Grossi
Publisher: Springer Science & Business Media
ISBN: 364224582X
Category : Computers
Languages : en
Pages : 440

Get Book Here

Book Description
This book constitutes the proceedings of the 18th International Symposium on String Processing and Information Retrieval, SPIRE 2011, held in Pisa, Italy, in October 2011. The 30 long and 10 short papers together with 1 keynote presented were carefully reviewed and selected from 102 submissions. The papers are structured in topical sections on introduction to web retrieval, sequence learning, computational geography, space-efficient data structures, algorithmic analysis of biological data, compression, text and algorithms.

Text Mining

Text Mining PDF Author: Chris Biemann
Publisher: Springer
ISBN: 3319126555
Category : Computers
Languages : en
Pages : 243

Get Book Here

Book Description
This book comprises a set of articles that specify the methodology of text mining, describe the creation of lexical resources in the framework of text mining and use text mining for various tasks in natural language processing (NLP). The analysis of large amounts of textual data is a prerequisite to build lexical resources such as dictionaries and ontologies and also has direct applications in automated text processing in fields such as history, healthcare and mobile applications, just to name a few. This volume gives an update in terms of the recent gains in text mining methods and reflects the most recent achievements with respect to the automatic build-up of large lexical resources. It addresses researchers that already perform text mining, and those who want to enrich their battery of methods. Selected articles can be used to support graduate-level teaching. The book is suitable for all readers that completed undergraduate studies of computational linguistics, quantitative linguistics, computer science and computational humanities. It assumes basic knowledge of computer science and corpus processing as well as of statistics.

Handbook of Research on Text and Web Mining Technologies

Handbook of Research on Text and Web Mining Technologies PDF Author: Song, Min
Publisher: IGI Global
ISBN: 1599049910
Category : Computers
Languages : en
Pages : 901

Get Book Here

Book Description
Examines recent advances and surveys of applications in text and web mining which should be of interest to researchers and end-users alike.

New Trends in Database and Information Systems

New Trends in Database and Information Systems PDF Author: Silvia Chiusano
Publisher: Springer Nature
ISBN: 3031157435
Category : Computers
Languages : en
Pages : 675

Get Book Here

Book Description
This book constitutes the proceedings of the 26th European Conference on Advances in Databases and Information Systems, ADBIS 2022, held in Turin, Italy, in September 2022. The 29 short papers presented were carefully reviewed and selected from 90 submissions. The selected short papers are organized in the following sections: data understanding, modeling and visualization; fairness in data processing; data management pipeline, information and process retrieval; data access optimization; data pre-processing and cleaning; data science and machine learning. Further, papers from the following workshops and satellite events are provided in the volume: DOING: 3rd Workshop on Intelligent Data – From Data to Knowledge; K-GALS: 1st Workshop on Knowledge Graphs Analysis on a Large Scale; MADEISD: 4th Workshop on Modern Approaches in Data Engineering and Information System Design; MegaData: 2nd Workshop on Advanced Data Systems Management, Engineering, and Analytics; SWODCH: 2nd Workshop on Semantic Web and Ontology Design for Cultural Heritage; Doctoral Consortium.

Survey of Text Mining

Survey of Text Mining PDF Author: Michael W. Berry
Publisher: Springer Science & Business Media
ISBN: 147574305X
Category : Computers
Languages : en
Pages : 251

Get Book Here

Book Description
Extracting content from text continues to be an important research problem for information processing and management. Approaches to capture the semantics of text-based document collections may be based on Bayesian models, probability theory, vector space models, statistical models, or even graph theory. As the volume of digitized textual media continues to grow, so does the need for designing robust, scalable indexing and search strategies (software) to meet a variety of user needs. Knowledge extraction or creation from text requires systematic yet reliable processing that can be codified and adapted for changing needs and environments. This book will draw upon experts in both academia and industry to recommend practical approaches to the purification, indexing, and mining of textual information. It will address document identification, clustering and categorizing documents, cleaning text, and visualizing semantic models of text.

Ontology Learning from Text

Ontology Learning from Text PDF Author: Paul Buitelaar
Publisher: IOS Press
ISBN: 9781586035235
Category : Computers
Languages : en
Pages : 188

Get Book Here

Book Description
The latest title in Black Library's premium line. Perturabo - master of siegecraft, and executioner of Olympia. Long has he lived in the shadow of his more favoured primarch brothers, frustrated by the mundane and ignominious duties which regularly fall to his Legion. When Fulgrim offers him the chance to lead an expedition in search of an ancient and destructive xenos weapon, the Iron Warriors and the Emperor's Children unite and venture deep into the heart of the great warp-rift known only as 'the Eye'. Pursued by a ragged band of survivors from Isstvan V and the revenants of a dead eldar world, they must work quickly if they are to unleash the devastating power of the Angel Exterminatus