
Natural Language Processing with Spark NLP (Reprint Edition)

Price: ¥132.00

Author: Alex Thomas
Publisher: Southeast University Press

ISBN: 9787564195113  Publication date: 2021-07-01  Binding: Paperback
Format: 16mo  Pages: 347

About the Book

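If you want to build an enterprise-grade application that uses natural language text but aren't sure where to begin or which tools to use, this practical guide will help you get started. Alex Thomas, principal data scientist at Wisecube, shows software engineers and data scientists how to build scalable natural language processing (NLP) applications using deep learning and the Apache Spark NLP library. Through concrete examples, practical and theoretical explanations, and hands-on exercises for using NLP on the Spark processing framework, this book teaches you everything from basic linguistics and writing systems to sentiment analysis and search engines. You will also explore concerns that need special attention when developing text-based applications, such as performance.

In the book's four parts, you will learn NLP basics and building blocks before diving into application and system building:

Basics: Understand the fundamentals of natural language processing, NLP on Apache Spark, and deep learning.
Building blocks: Learn techniques for building NLP applications, including tokenization, sentence segmentation, and named-entity recognition, and discover how and why they work.
Applications: Explore the design, development, and experimentation process for building your own NLP applications.
Building NLP systems: Consider options for productionizing and deploying NLP models, including which human languages to support.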

About the Author

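Alex Thomas is a principal data scientist at Wisecube. He has applied natural language processing and machine learning to clinical data, identity data, employer and job-seeker data, and now biochemical data. Alex has used Apache Spark since version 0.9 and has worked with a variety of NLP libraries and frameworks, including UIMA and OpenNLP.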

Table of Contents

Preface
Part I. Basics
1. Getting Started
Introduction
Other Tools
Setting Up Your Environment
Prerequisites
Starting Apache Spark
Checking Out the Code
Getting Familiar with Apache Spark
Starting Apache Spark with Spark NLP
Loading and Viewing Data in Apache Spark
Hello World with Spark NLP
2. Natural Language Basics
What Is Natural Language?
Origins of Language
Spoken Language Versus Written Language
Linguistics
Phonetics and Phonology
Morphology
Syntax
Semantics
Sociolinguistics: Dialects, Registers, and Other Varieties
Formality
Context
Pragmatics
Roman Jakobson
How To Use Pragmatics
Writing Systems
Origins
Alphabets
Abjads
Abugidas
Syllabaries
Logographs
Encodings
ASCII
Unicode
UTF-8
Exercises: Tokenizing
Tokenize English
Tokenize Greek
Tokenize Ge'ez (Amharic)
Resources
3. NLP on Apache Spark
Parallelism, Concurrency, Distributing Computation
Parallelization Before Apache Hadoop
MapReduce and Apache Hadoop
Apache Spark
Architecture of Apache Spark
Physical Architecture
Logical Architecture
Spark SQL and Spark MLlib
Transformers
Estimators and Models
Evaluators
NLP Libraries
Functionality Libraries
Annotation Libraries
NLP in Other Libraries
Spark NLP
Annotation Library
Stages
Pretrained Pipelines
Finisher
Exercises: Build a Topic Model
Resources
4. Deep Learning Basics
Gradient Descent
Backpropagation
Convolutional Neural Networks
Filters
Pooling
Recurrent Neural Networks
Backpropagation Through Time
Elman Nets
LSTMs
Exercise 1
Exercise 2
Resources
Part II. Building Blocks
5. Processing Words
6. Information Retrieval
7. Classification and Regression
8. Sequence Modeling with Keras
9. Information Extraction
10. Topic Modeling
11. Word Embeddings
Part III. Applications
12. Sentiment Analysis and Emotion Detection
13. Building Knowledge Bases
14. Search Engine
15. Chatbot
16. Object Character Recognition
Part IV. Building NLP Systems
17. Supporting Multiple Languages
18. Human Labeling
19. Productionizing NLP Applications
Glossary
Index
