에너지경제연구원 전자도서관

로그인

에너지경제연구원 전자도서관

자료검색

  1. 메인
  2. 자료검색
  3. 통합검색

통합검색

단행본

Applied Text Analysis with Python: Enabling Language-Aware Data Products with Machine Learning

발행사항
Sebastopol, CA : O'REILLY, 2018
형태사항
xviii, 310 p. : illustrations ; 24 cm
서지주기
Includes bibliographical references and index
소장정보
위치등록번호청구기호 / 출력상태반납예정일
이용 가능 (1)
자료실E207071대출가능-
이용 가능 (1)
  • 등록번호
    E207071
    상태/반납예정일
    대출가능
    -
    위치/청구기호(출력)
    자료실
책 소개

From news and speeches to informal chatter on social media, natural language is one of the richest and most underutilized sources of data. Not only does it come in a constant stream, always changing and adapting in context; it also contains information that is not conveyed by traditional data sources. The key to unlocking natural language is through the creative application of text analytics. This practical book presents a data scientist's approach to building language-aware products with applied machine learning.

You'll learn robust, repeatable, and scalable techniques for text analysis with Python, including contextual and linguistic feature engineering, vectorization, classification, topic modeling, entity resolution, graph analysis, and visual steering. By the end of the book, you'll be equipped with practical methods to solve any number of complex real-world problems.

  • Preprocess and vectorize text into high-dimensional feature representations
  • Perform document classification and topic modeling
  • Steer the model selection process with visual diagnostics
  • Extract key phrases, named entities, and graph structures to reason about data in text
  • Build a dialog framework to enable chatbots and language-driven interaction
  • Use Spark to scale processing power and neural networks to scale model complexity
목차
Preface 1. Language and Computation 2. Building a Custom Corpus 3. Corpus Preprocessing and Wrangling 4. Text Vectorization and Transformation Pipelines 5. Classification for Text Analysis 6. Clustering for Text Similarity 7. Context-Aware Text Analysis 8. Text Visualization 9. Graph Analysis of Text 10. Chatbots 11. Scaling Text Analytics with Multiprocessing and spark 12. Deep Learning and Beyond Glossary Index