This course prepares you to process large data sets efficiently. You will be introduced to nonrelational databases and algorithms that allow for the distributed processing of large data sets across clusters.
- Implement algorithms that allow for the distributed processing of large data sets across computing clusters.
- Create parallel algorithms that can process large data sets.
- Use tools and software such as Hadoop, Pig, Hive, and Python to compare large data-processing tasks using cloud-computing services.
Prerequisites: DS 710: Programming for Data Science
Return to the Courses page.
Call 1-877-895-3276 or send an email to email@example.com. Our enrollment advisers are available Monday through Thursday, 8 a.m. to 7:30 p.m.; Fridays, 8 a.m. to 4:30 p.m. CT; or by appointment.