注冊 | 登錄讀書好,好讀書,讀好書!
讀書網(wǎng)-DuShu.com
當前位置: 首頁出版圖書科學技術(shù)計算機/網(wǎng)絡軟件工程及軟件方法學基于開源工具的數(shù)據(jù)分析(影印版)

基于開源工具的數(shù)據(jù)分析(影印版)

基于開源工具的數(shù)據(jù)分析(影印版)

定 價:¥82.00

作 者: (美)雅奈特 著
出版社: 東南大學出版社
叢編項:
標 簽: 軟件工程/開發(fā)項目管理

ISBN: 9787564126742 出版時間: 2011-05-01 包裝: 平裝
開本: 16開 頁數(shù): 509 字數(shù):  

內(nèi)容簡介

  數(shù)據(jù)收集相對比較簡單,而要把原始信息轉(zhuǎn)化為有用的數(shù)據(jù)則需要知道如何精確地抽取你想要的內(nèi)容。通過《基于開源工具的數(shù)據(jù)分析(影印版·英文版)》的深入講解,那些對數(shù)據(jù)分析感興趣的中等或者富有經(jīng)驗的程序員將可以學習到在商業(yè)環(huán)境中與數(shù)據(jù)打交道的技術(shù)。你將了解到如何觀察數(shù)據(jù)來找出它所包含的信息,如何在概念模型里捕捉到這些想法,然后把你的理解通過商業(yè)計劃、度量標準的精確報告和其他方式反饋給你所在的機構(gòu)。你將會通過《基于開源工具的數(shù)據(jù)分析(影印版·英文版)》每章結(jié)束部分的動手實踐來慢慢體驗各種概念。最重要的是,你將了解到如何思考你所希望獲取的數(shù)據(jù):而不是依賴于工具來替你思考。

作者簡介

  PhilippK.Janert目前提供數(shù)據(jù)分析和數(shù)學模型的咨詢服務,他曾經(jīng)是物理學家和軟件工程師。他是《Gnuplot inAction:Understanding Data with Graphs》(Manning出版)的作者,他為O’ReillvNetwork.IBMdeVeloperWorks和IEEEsoftware寫過文章。他擁有Washington大學理論物理學的博士學位。

圖書目錄

PREFACE
1 INTRODUCTION
 Data Analysis
 What's in This Book
 What's with the Workshops?
 What's with the Math?
 What You'll Need
 What's Missing
PART I Graphics: Looking at Data
2 A SINGLE VARIABLE: SHAPE AND DISTRIBUTION
 Dot andJitter Plots
 Histograms and Kernel Density Estimates
 The Cumu/atiue Distribution Function
 Rank-Order Plots and Lilt Charts
 Only When Appropriate: Summary Statistics and Box Plots
 Workshop: NumPy
 Further Reading
3 TWO VARIABLES: ESTABLISHING RELATIONSHIPS
 Scatter Plots
 Conquering Noise: 5moothing
 Logarithmic Plots
 Banking
 Linear ReRression and All That
 Shouwing What's Important
 Graphical Analysis and Presentation Graphics
 Workshop: matplotlib
 Further Reading
 TIME AS A VARIABLE: TIME-SERIES ANALYSIS
 Examples
 The Task
 Smoothing
 Don't Ouerlook the Obuious!
 The Correlation Function
 Optional: Filters and Conuolutions
 Workshop: scipy.signal
 Further ReadinR
5 MORE THAN TWO VARIABLES: GRAPHICAL MULTIVARIATE ANALYSIS
 False-Color Plots
 A Lot at a Glance: Multiplots
 Composition Problems
 Nouel Plot Types
 Interactiue Explorations
 Workshop: Tools for Multiuariate Graphics
 Further ReadinR
6 INTERMEZZO: A DATA ANALYSIS SESSION
 A Data Analysis Session
 Workshop: gnuplot
 Further ReadinR
PART II Analyticg: Modeling Data
7 GUESSTIMATION AND THE BACK OF THE ENVELOPE
 Principles of Guesstimation
 How Good Are Those Numbers?
 Optional: A Closer Look at Perturbation Theory and
 Error PropaRation
 Workshop: The Gnu Scientific Library (GSL)
 Further Reading
8 MODELS FROM SCALING ARGUMENTS
 Models
 ArRuments from Scale
 Mean-Field Approximations
 Common Time-Euolution Scenarios
 Case Study: How Many Seruers Are Best?
 Why Modeling?
 Workshop: Sage
 Further Reading
9 ARGUMENTS FROM PROBABILITY MODELS
 The. Binomial Distribution and Bernoulli Trials
 The Gaussian Distribution and the Central Limit Theorem
 Power-Law Distributions and Non-Normal Statistics
 Other Distributions
 Optional: Case Study--Unique Visitors ouer Time
 Workshop: Power-Law Distributions
 Further Reading
10 WHAT YOU REALLY NEED TO KNOW ABOUT CLASSICAL STATISTICS
 Genesis
 Statistics Defined
 Statistics Explained
 Controlled Experiments Versus Obseruationa} Studies
 Optional: Bayesian Statistics--The Other Point of View
 Workshop: R
 Further Reading
11 INTERMEZZO:MYTHBUSTING--BIGFOOT, LEAST SQUARES, AND ALLTHAT
 How to Auerage Auerages
 The Standard Deuiation
 Least Squares
 Further Reading
PART III Computation: Mininhg Data
12 SIMULATIONS
 A Warm-Up Question
 Monte Carlo Simulations
 Resampling Methods
 Workshop: Discrete Euent Simulations with Simpy
 Further Reading
13 FINDING CLUSTERS
 What Constitutes a Cluster?
 Distance and Similarity Measures
 Clustering Methods
 Pre-and Postprocessing
 Other ThouRhts
 A Special Case: Market BasketAnalysis
 A Word of WarninR
 Workshop: P/cluster and the C Clustering Library
 Further Reading
14 SEEING THE FOREST FOR THE TREES: FINDING
 IMPORTANT ATTRIBUTES
 Principal Component Analysis
 Visual Techniques
 Kohonen Maps
 Workshop: PCA with R
 Further Readin2
15 INTERMEZZO:WHEN MORE IS DIFFERENT
 A Horror Story
 Some Suggestions
 What About Map/Reduce?
 Workshop: Generating Permutations
 Further ReadingPART IV Applications: Using Data
16 REPORTING, BUSINESS INTELLIGENCE, AND DASHBOARDS
 Business Intelligence
 Corporate Metrics and Dashboards
 Data Quality Issues
 Workshop: Berkeley DB and SQLite
 Further Reading
17 FINANCIAL CALCULATIONS AND MODELING
 The Time Value o[ Money
 Uncertainty in Planning and Opportunity Costs
 Cost Concepts and Depreciation
 Should You Care?
 Is This All That Matters?
 Workshop: The Newsuendor Problem
 Further Reading
18 PREDICTIVE ANALYTICS
 Introduction
 Some Classification Terminology
 Algorithms for Classification
 The Process
 The Secret Sauce
 The Nature o[ Statistical Learning
 Workshop: Two Do-lt-Yoursel Classifiers
 Further Reading
19 EPILOGUE: FACTS ARE NOT REALITY
 A PROGRAMMING ENVIRONMENTS FOR SCIENTIFIC COMPUTATION
 AND DATA ANALYSIS
 Software Tools
 A Catalog of Scientific Software
 Writing Your Own
 Further Reading
 B RESULTS FROM CALCULUS
 Common Functions
 Calculus
 Useful Tricks
  Notation and Basic Math
  Where to Go from Here
  Further Readin9
  WORKING WITH DATA
  Sources for Data
  Cleanin9 and ConditioninR
  Sarnplin9
  Data File Formats
  The Care and Feeding of Your Data Zoo
  Skills
  Terminology
  Further Fleadin9
INDEX

本目錄推薦

掃描二維碼
Copyright ? 讀書網(wǎng) ranfinancial.com 2005-2020, All Rights Reserved.
鄂ICP備15019699號 鄂公網(wǎng)安備 42010302001612號