Student-led Development of Open Source Materials for Hadoop

Created: Tue, 09/03/2019 - 13:08
Completed: Tue, 09/03/2019 - 13:08
Changed: Thu, 05/05/2022 - 05:00

Remote IP address: 130.215.55.243
Submitted by: Northeast Cyberteam
Language: English

Is draft: No
Webform: Project
Northeast
big-data (4), ceph (56), data-wrangling (6), hadoop (12), storage (47)
Complete

Project Leader

Christopher Bennet
2073331609
2077787114

Project Personnel

{Empty}
{Empty}
{Empty}

Project Information

As part of a system-wide Data Science Degree, numerous modules have been developed that can be offered at a distance. These include VBA in Excel, SQL, R, and others. No module currently exists for Hadoop, nor is there an instance of Hadoop that can be used for student training. This project aims to create a suitable Hadoop environment on University of Maine System resources and to create materials for a one-credit micro-course that can be delivered at a distance.

Project Information Subsection

Course materials including relevant assignments, readings, lectures, and tutorials for a class on Hadoop will be produced. The course is expected to be offered during or before the Fall of 2019.
Steve Nutting, Undergraduate at University of Maine Farmington
University of Maine Farmington
228 Main St
Brinkman House
Farmington, Maine 04938
NE-University of Maine
09/07/2018
No
Already behind
3
Start date is flexible
Materials that can be added to the NE Cyberteam Portal include all written materials, relevant code, and tutorials covering Hadoop installation and exploration.
A publication detailing the course is planned.
The student will gain knowledge of setting up a Hadoop instance as well as a deeper understanding of Data Science.
The Cyberteam will create better ties between research and education related to data science in the New England region.
The Hadoop cluster will initially be deployed on the OpenStack cloud run by the Advanced Computing Group of the University of Maine System. It will be migrated to the HPC cluster if deemed necessary.

Final Report

This work will positively impact the ability of the University of Maine System to train the next generation of data scientists.
Any discipline that uses big data can benefit from the proposed materials.
The proposed work will increase the number of people with a background in data science.
This work will increase the use of computing and data resources in Maine.