Machine Learning on Big Data with MapReduce
Status:
Approved
Start:
Wednesday, February 1 2012 at 7:00pm
End:
Wednesday, February 1 2012 at 9:00pm
Member:
mike.bowles
Type:
Meetup
Estimated size:
40
Contact:
Mike Bowles
Fee:
300
Rooms:
140b
Details:
This class meets Wednesday and Thursday evenings from 7pm to 9pm for 5 weeks.
Overview of the course
Participants will learn to adapt and execute machine learning algorithms in the map-reduce framework. By the end of the class, participants should be able to write their own machine learning algorithms for map-reduce and run them on Amazon Web Services. Amazon is providing AWS credits for class participants.
Participants will learn to write mappers and reducers in Python for “hadoop-streaming”. For most of the class we will use “mrjob”, an open-source framework developed at Yelp. With mrjob, a job written in Python can run locally without Hadoop, on Amazon Web Services, or on a private Hadoop cluster, which simplifies the programming tasks.
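As a rough illustration of the mapper and reducer pattern described above, here is a minimal word-count sketch in plain Python. It is not taken from the course materials and does not use mrjob or Hadoop; the function names (mapper, reducer, run_local) and the sample input are hypothetical, and the sort step stands in for the shuffle that the framework would perform between the two phases.

```python
from itertools import groupby
from operator import itemgetter


def mapper(lines):
    """Emit a (word, 1) pair for every word, as a streaming mapper would."""
    for line in lines:
        for word in line.lower().split():
            yield word, 1


def reducer(pairs):
    """Sum the counts for each word. Pairs must arrive sorted by key."""
    for word, group in groupby(pairs, key=itemgetter(0)):
        yield word, sum(count for _, count in group)


def run_local(lines):
    """Simulate map, then shuffle/sort, then reduce in a single process."""
    shuffled = sorted(mapper(lines), key=itemgetter(0))
    return dict(reducer(shuffled))


# Hypothetical sample input, for illustration only.
sample = ["big data", "big cluster"]
for word, total in sorted(run_local(sample).items()):
    print(f"{word}\t{total}")
```

In a real job the mapper and reducer would read from standard input and write tab-separated key/value lines, and a framework such as mrjob would take care of submitting them and sorting the intermediate pairs.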
Schedule:
Week 1 - Intro to map-reduce, AWS (Amazon Web Services), Mahout and mrjob
Week 2 - Unsupervised Learning - Clustering
Week 3 - Supervised Learning
Week 4 - Other Machine Learning Topics (text mining, recommender systems, SVD)
Week 5 - Student Projects
Class Web Page: http://machinelearningbigdata.pbworks.com/w/page/37651454/FrontPage
Meetup: http://www.meetup.com/HandsOnProgrammingEvents/events/44371702/
Register: http://machinelearningbigdata2.eventbrite.com/
Notes:
Hacker Dojo members may log in to reserve space in the event room up to 48 hours before the event.
A member RSVP does not count as event registration where registration is required.