Se connecter

Mot de passe oublié ?

Aide

Centre de Documentation Méditerranéen

CIHEAM Montpellier

Nouvelle recherche

Ajouter au panier

Lien vers la notice

Titre :	Text mining with R: a tidy approach
Auteurs :	J. Silge ; D. Robinson
Type de document :	E-Book
Editeur :	Sebastopol [Etats-Unis] : O'Reilly, 2017
ISBN/ISSN/EAN :	978-1-4919-8165-8
Format :	178 p.
Langues :	Anglais
Langues du résumé :	Anglais
Catégories :	Catégories principales 12 - EDUCATION. FORMATION. INFORMATION. GESTION DES SAVOIRS ; 12.5 - Formation Continue, Professionnelle. Gestion des Cursus Thésaurus IAMM LOGICIEL ; DONNEE STATISTIQUE ; ANALYSE DE DONNEES ; TRAITEMENT DES DONNEES ; LANGAGE
Mots-clés:	EXPLORATION DE TEXTE ; ANALYSE DE TEXTE
Résumé :	Much of the data available today is unstructured and text-heavy, making it challenging for analysts to apply their usual data wrangling and visualization tools. With this practical book, you’ll explore text-mining techniques with tidytext, a package that authors Julia Silge and David Robinson developed using the tidy principles behind R packages like ggraph and dplyr. You’ll learn how tidytext and other tidy tools in R can make text analysis easier and more effective. The authors demonstrate how treating text as data frames enables you to manipulate, summarize, and visualize characteristics of text. You’ll also learn how to integrate natural language processing (NLP) into effective workflows. Practical code examples and data explorations will help you generate real insights from literature, news, and social media. • Learn how to apply the tidy text format to NLP • Use sentiment analysis to mine the emotional content of text • Identify a document’s most important terms with frequency measurements • Explore relationships and connections between words with the ggraph and widyr packages • Convert back and forth between R’s tidy and non-tidy text formats • Use topic modeling to classify document collections into natural groups • Examine case studies that compare Twitter archives, dig into NASA metadata, and analyze thousands of Usenet messages
Note de contenu :	• 1. The Tidy Text Format • 2. Sentiment Analysis with Tidy Data • 3. Analyzing Word and Document Frequency: tf-idf • 4. Relationships Between Words: N-grams and Correlations • 5. Converting to and from Nontidy Formats • 6. Topic Modeling • 7. Case Study: Comparing Twitter Archives • 8. Case Study: Mining NASA Metadata • 9. Case Study: Analyzing Usenet Text
Cote :	Réservé lecteur CIHEAM
URL / DOI :	https://www.tidytextmining.com/

Accueil

Adresse

ÉQUIPE
Nous contacter

Informations pratiques

contact

pmb

A-
A
A+

CIHEAM Montpellier

Adresse

Centre de Documentation Méditerranéen

CIHEAM Montpellier
3191 route de Mende
CS 43999

34093 Montpellier Cedex 5

France

Contact

docassistance@iamm.fr
+33 (0)4 67 04 60 49

Horaires

Du 1er avril

jusqu'au 30 septembre :

du lundi au jeudi :

de 13h30 à 17h30

Fermé le vendredi

Du 1er octobre

jusqu'au 31 mars :

du lundi au jeudi :

de 13h30 à 18h30

Fermé le vendredi

Se connecter

Centre de Documentation Méditerranéen

Accueil

Sélection de la langue

Adresse

CIHEAM Montpellier

Adresse

Contact

Horaires

Plan d'accès

Nos labels

Membre de