Data Science: Statistics and Machine Learning Specialization

Johns Hopkins University

Coursera

Computer Science

Intermediate

6 months

This course is offered jointly by Coursera and Linkshare
  • English
  • 611

Course Overview

Build models, make inferences, and deliver interactive data products.
This specialization builds on the material from the Data Science: Foundations using R specialization. It covers statistical inference, regression models, machine learning, and the development of data products. In the Capstone Project, you’ll apply the skills you have learned by building a data product using real-world data. On completion, learners will have a portfolio demonstrating their mastery of the material.

The five courses in this specialization are the same courses that make up the second half of the Data Science Specialization. This specialization is intended for learners who have already mastered the fundamentals and want to skip straight to the more advanced courses.

What You Will Learn

Perform regression analysis, least squares, and inference using regression models.

Build and apply prediction functions.

Develop public data products.

Courses Included

Course 1
Statistical Inference

Statistical inference is the process of drawing conclusions about populations or scientific truths from data. There are many modes of performing inference, including statistical modeling, data-oriented strategies, and explicit use of designs and randomization in analyses. Furthermore, there are broad theories (frequentist, Bayesian, likelihood, design-based, …) and numerous complexities (missing data, observed and unobserved confounding, biases) for performing inference. A practitioner can often be left in a debilitating maze of techniques, philosophies, and nuance. This course presents the fundamentals of inference in a practical approach for getting things done. After taking this course, students will understand the broad directions of statistical inference and use this information to make informed choices in analyzing data.

Course 2
Regression Models

Linear models, as their name implies, relate an outcome to a set of predictors of interest using linear assumptions. Regression models, a subset of linear models, are the most important statistical analysis tool in a data scientist’s toolkit. This course covers regression analysis, least squares, and inference using regression models. Special cases of the regression model, ANOVA and ANCOVA, will be covered as well. Analysis of residuals and variability will be investigated. The course will cover modern thinking on model selection and novel uses of regression models, including scatterplot smoothing.
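As a rough illustration of the least squares idea mentioned above, the following minimal sketch fits a one-predictor linear model and computes its residuals. It is not taken from the course materials (the specialization builds on an R-based foundation); Python is used here only for illustration, and the function names `fit_least_squares` and `residuals` are hypothetical.

```python
from statistics import mean


def fit_least_squares(xs, ys):
    """Return (intercept, slope) minimizing the sum of squared residuals.

    Illustrative helper, not part of any course library.
    """
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    x_bar, y_bar = mean(xs), mean(ys)
    # Sum of squared deviations of x, and cross-deviations of x and y.
    sxx = sum((x - x_bar) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical; slope is undefined")
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return intercept, slope


def residuals(xs, ys, intercept, slope):
    """Observed outcome minus fitted value for each point."""
    return [y - (intercept + slope * x) for x, y in zip(xs, ys)]


if __name__ == "__main__":
    xs = [1, 2, 3, 4]
    ys = [3, 5, 7, 9]
    b0, b1 = fit_least_squares(xs, ys)
    print(b0, b1)
    print(residuals(xs, ys, b0, b1))
```

With an intercept in the model, the least squares residuals sum to zero (up to floating-point error), which is one of the properties a residual analysis would typically check.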

Course 3
Practical Machine Learning

Prediction and machine learning are among the most common tasks performed by data scientists and data analysts. This course will cover the basic components of building and applying prediction functions, with an emphasis on practical applications. The course will provide basic grounding in concepts such as training and test sets, overfitting, and error rates. The course will also introduce a range of model-based and algorithmic machine learning methods, including regression, classification trees, Naive Bayes, and random forests. The course will cover the complete process of building prediction functions, including data collection, feature creation, algorithms, and evaluation.
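To make the ideas of training and test sets, overfitting, and error rates more concrete, here is a minimal sketch that is not taken from the course. It assumes a synthetic one-feature data set with noisy labels and a 1-nearest-neighbor predictor, chosen only because it memorizes its training data; all function names are hypothetical and Python is used purely for illustration.

```python
import random


def make_noisy_data(n=100, noise=0.2, seed=1):
    """Synthetic rows (x, label): label is x > 0.5, flipped with probability `noise`."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        x = rng.random()
        label = x > 0.5
        if rng.random() < noise:
            label = not label
        rows.append((x, label))
    return rows


def train_test_split(rows, test_fraction=0.25, seed=0):
    """Shuffle a copy of the rows and hold out a fraction as the test set."""
    rng = random.Random(seed)
    shuffled = rows[:]
    rng.shuffle(shuffled)
    n_test = max(1, int(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


def predict_1nn(train, x):
    """Predict the label of the training row whose x is closest."""
    nearest = min(train, key=lambda row: abs(row[0] - x))
    return nearest[1]


def error_rate(train, rows):
    """Fraction of rows whose predicted label differs from the true label."""
    wrong = sum(1 for x, label in rows if predict_1nn(train, x) != label)
    return wrong / len(rows)


if __name__ == "__main__":
    train, test = train_test_split(make_noisy_data())
    print("training error:", error_rate(train, train))
    print("test error:", error_rate(train, test))
```

Because each training point is its own nearest neighbor, the training error of this predictor is zero even though the labels are noisy. The held-out test error is the more honest estimate of how the predictor behaves on new data, and the gap between the two is a simple symptom of overfitting.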

Course 4
Developing Data Products

A data product is the production output from a statistical analysis. Data products automate complex analysis tasks or use technology to expand the utility of a data-informed model, algorithm, or inference. This course covers the basics of creating data products using Shiny, R packages, and interactive graphics. The course will focus on the statistical fundamentals of creating a data product that can be used to tell a story about data to a mass audience.

Course 5
Data Science Capstone

The capstone project class will allow students to create a usable, public data product that can be used to show their skills to potential employers. Projects will be drawn from real-world problems and will be conducted with industry, government, and academic partners.

Prerequisites

Learners should complete the Data Science: Foundations using R specialization to gain the right foundation before starting this specialization.
