Big Data - Capstone Project

1,767 views
University of California San Diego
Coursera
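  • Approximately 19 hours to complete
  • Mixed difficulty
  • English, Korean, other
Note: This course is offered jointly by Coursera and Linkshare. Because conditions on the hosting platform change, the start dates above are for reference only.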

What you will learn

Big Data

Neo4j

KNIME

Splunk

Course overview

Welcome to the Capstone Project for Big Data! In this culminating project, you will build a big data ecosystem using tools and methods from the earlier courses in this specialization. You will analyze a data set simulating big data generated by a large number of users playing our imaginary game “Catch the Pink Flamingo”. During the five-week Capstone Project, you will walk through the typical big data science steps: acquiring, exploring, preparing, analyzing, and reporting. In the first two weeks, we will introduce you to the data set and guide you through some exploratory analysis using tools such as Splunk and Open Office. Then we will move on to more challenging big data problems that require the more advanced tools you have learned, including KNIME, Spark’s MLlib, and Gephi. Finally, during the fifth and final week, we will show you how to bring it all together to create engaging and compelling reports and slide presentations. As a result of our collaboration with Splunk, a software company focused on analyzing machine-generated big data, learners with the top projects will be eligible to present to Splunk and meet Splunk recruiters and engineering leadership.

Syllabus

Week 1
1 hour to complete
Simulating Big Data for an Online Game
This week we provide an overview of the Eglence, Inc. Pink Flamingo game, including the data the company has about the game and its users, and the questions we might want that data to answer.
4 videos (18 minutes total), 4 readings

4 hours to complete
Acquiring, Exploring, and Preparing the Data
Next, we begin working with the simulated game data by exploring it and preparing it for ingestion into big data analytics applications.
6 readings, 2 quizzes

Week 2
5 hours to complete
Data Classification with KNIME
This week we perform data classification using KNIME.
4 readings, 1 quiz

Week 3
5 hours to complete
Clustering with Spark
This week we perform clustering with Spark.
2 readings, 1 quiz

Week 4
4 hours to complete
Graph Analytics of Simulated Chat Data With Neo4j
This week we use Neo4j to apply what we learned in the 'Graph Analytics With Big Data' course to simulated chat data from Catch the Pink Flamingo. We analyze player chat behavior to find ways of improving the game.
2 readings, 1 quiz

Week 5
9 minutes to complete
Reporting and Presenting Your Work
1 video (2 minutes total), 1 reading

Week 6
4 hours to complete
Final Submission


© 2008-2020 MOOC.CN. MOOCs change you, and you change the world.