Home > Data Science > Big Data Analytics using Hadoop

Ask us everything about the program

Big Data Analytics using Hadoop

_
Overview
  • manipal overview icon
    Lucrative Career Option
    73% of organizations have already invested or plan to invest in big data.
  • manipal overview icon
    Job Opportunities
    Complete the Hadoop online training become eligible for over 1,500,000 jobs.
  • manipal overview icon
    Who Should Attend
    Beginners in analytics domain, engineering & non-engineering graduates.
  • manipal overview icon
    Salary Packages
    The average salary for an entry-level Big Data engineer is ₹5 lakhs per annum.
  • manipal overview icon
    Expert Faculty
    The Big Data certification course is taught by eminent big data experts.
manipal overview icon
Lucrative Career Option
73% of organizations have already invested or plan to invest in big data.
manipal overview icon
Job Opportunities
Complete the Hadoop online training become eligible for over 1,500,000 jobs.
manipal overview icon
Who Should Attend
Beginners in analytics domain, engineering & non-engineering graduates.
manipal overview icon
Salary Packages
The average salary for an entry-level Big Data engineer is ₹5 lakhs per annum.
manipal overview icon
Expert Faculty
The Big Data certification course is taught by eminent big data experts.
_
Course Curriculum
1.1 Introduction to BIG DATA and Hadoop
manipal icon

Introduction
Challenges of Processing Big Data
Distributed Systems
History of Hadoop
Hadoop Overview
Ecosystem of Hadoop
HDFS and Map Reduce Paradigm
Processing Pipeline
Big Data Technologies
Use Cases
Features of Hadoop
Summary

1.2 Hadoop Environment Setup
manipal icon

Introduction
Hadoop Installation and Configuration
Hive Installation and Configuration
Pig Installation and Configuration
Sqoop Installation and Configuration
Oozie Installation and Configuration
Flume Installation and Configuration
Hbase Installation and Configuration
Hue Installation and Configuration

1.3 HDFS Architecture
manipal icon

Introduction
HDFS Configuration Files
Data Storage in HDFS
Blocks and Splits
Metadata Files
Name Node – Demo
HDFS Data Storage – Demo
High Availability
Data Replication – Demo
Reliable Storage – Demo
HDFS Client
Data Node – Demo
HDFS Clients – Demo
HDFS Clients – Demo

1.4 HDFS Commands
manipal icon

Introduction
HDFS Commands
Basic HDFS Commands - Demo
Read Anatomy in HDFS
Write Anatomy in HDFS
Additional HDFS Commands - Demo
HDFS File System API
HDFS File System API - Demo
HDFS Permission Management
Permission Management – Demo
Summary

1.5 MapReduce
manipal icon

MapReduce 1 Architecture
MR and Traditional Approach
Architecture 1
Introduction to YARN
Architecture 2
Summary

2.1 MapReduce Programs
manipal icon

Introduction
Executing a MapReduce Program - Demo
Datatypes and APIs
MapReduce Concepts
Mapper – Demo
MapReduce – Demo
Combiners – Demo
Partitioners – Demo
Debug logs & Printing in MR Jobs – Demo
Path Filters – Demo
Splits – Demo
Named Output – Demo Summary
Write MapReduce Keys and Values – Demo
Identity Mappers – Demo
Identity Reducers – Demo
Counters in Hadoop
MapReduce Counters – Demo
Input and Output Formats
About MR Unit
MR Unit – Demo
Summary

2.2 MapReduce and Job Execution
manipal icon

Introduction
Job Flow
Job Submission
Job Initializing
Job Scheduling
Map Task Execution
Sort and Shuffle
Reduce Task Execution
Job Cleanup
Job Failure – Demo
Staggering Job – Demo
Scheduler
Summary

2.3 Hadoop Serialization and Compression
manipal icon

Introduction -Serialization
Uses of Serialization
Serialization Techniques
Summary
Introduction - Compression
Uses of Compression
Compression Techniques
Summary

2.4 Advanced MapReduce Programming
manipal icon

Introduction
Customization of Input Format APIs
Input Format and Record Readers – Demo
Distributed Cache
Distributed Cache – Demo
Map Side Joins
Sideways Joins
Map Side Joins – Demo
Reduce Side Joins
Reduce Side Joins – Demo
Sequence File Format
Sequence File Creation – Demo
Sequence File with MapReduce -Demo
Hadoop Streaming
Hadoop Streaming – Demo
Configuring Development Environment using Eclipse – Demo
Running MapReduce Jobs – Demo
Summary

2.5 Apache Hive
manipal icon

Introduction
Hive vs RDBMS
Hive Architecture
Hive Components
Hive Schema Model
Hive Integration with Hadoop
Hive Query Language
Transformation in Hive
Hive Database Creation - Demo
Hive Tables - Demo
Hive Queries - Demo
Advanced Hive Partitioning - Demo
Bucketing - Demo
Advanced Concepts - Demo
Manage an XML or JSON files - Demo
Use a predefined SERDE - Demo
Summary

3.1 Apache Pig
manipal icon

Introduction
MapReduce and Pig
Modes of Execution in Pig
Pig Client
Pig Datatypes and Operators
SQL vs Apache Pig
Pig Usage
Loading Data in Pig – Demo
Pig Dialects – Demo
Transformations in Pig – Demo
Debugging in Pig – Demo
Other capabilities in Pig - Demo

3.2 Apache Hbase
manipal icon

Introduction
Categories of NoSQL Databases
Hbase Evolution
Hbase vs RDBMS
Hbase Architecture
Hbase Components
Column Family
Hbase Fundamentals
Hbase Client
Basic CRUD Operation Demo
Basic CRUD Operation
Zookeeper
Zookeeper Demo
Summary

3.3 Apache Sqoop, Oozie and Hue
manipal icon

Introduction - Apache Sqoop
Sqoop Usage
Working with Sqoop – Demo
Advanced Sqoop
Hive Integration – Demo
Hbase Integration – Demo
Sqoop Scripts - Demo
Introduction - Apache Oozie
Oozie Client
Basic Workflow Setup
Types of Oozie Actions
Control Statements
Defining a Workflow
Run MapReduce with Oozie - Demo
Summary
Introduction - Apache Hue
Hue User Interface
Working with Hive using Hue
Working with Pig using Hue
Monitoring an Oozie Job using Hue
Apache Hue – Demo
Summary

3.4 Streaming
manipal icon

Introduction Summary
Apache Flume Introduction
Flume Core Components
Launch Flume
Apache Flume - Demo
Apache Spark & Storm Introduction
Storm Concepts
Spark Streaming Concepts
Deployment Architecture
Summary

3.5 Hadoop Real Time Deployment and Distribution
manipal icon

Introduction - Real time deployment
System architecture
Logical deployment overview
Physical deployment overview
Summary
Introduction - Big data software and tools
Streaming tools
NoSQL tools
Workflow Tools
Administration tools
Other ecosystem tools
Summary

_
Frequent Questions we get
Why should I take up a Manipal ProLearn course?
manipal icon

Manipal ProLearn, a part of Manipal Global Education Services, offers a variety of professional certification courses across Technology, Digital Marketing, Data Sciences, Project Management, and Finance domains.

Carrying forward the Manipal legacy of over six decades in education, Manipal ProLearn helps working professionals and students to enhance their skills and fast-track their careers. Manipal ProLearn has partnered with industry leaders like Google, Microsoft, AWS Educate, EY, Sandbox, Manipal Academy of Higher Education, and Chartered Institute of Management Accountants (CIMA) to provide quality courses that add to your skill set.

We have redefined learning in the professional certification program space with a wide range of course options taught by an expert faculty on an award-winning learning platform. Benchmarked against global certification standards of PMI, CIMA, PMBOK etc., our course content is designed and developed by industry experts. Not only are these certification courses good for skill enhancement, but with industry recognised certification, they are also a great value-add to one's resume. Over the last two years, more than 80,000 learners have advanced their careers with the help of our courses.

Do I have to clear any entrance (screening) test before taking up the course?
manipal icon

Currently, none of the courses has an entrance test. But some certifications might have their prerequisites which are mentioned on the respective course pages.

What type of support can I expect in terms of course material, assessment, evaluation, feedback, discussion forums, mentoring etc.?
manipal icon

Once you have enrolled in a course, you will receive a welcome e-mail and a telephonic call from our customer support team. The features offered as part of the course will be explained clearly by our representative and also will also be listed in the mail correspondence.

Besides, our Customer Support team is available between 8 AM to 8 PM every day to help students on any course related issues.

Will I be able to access the course material post-completion too?
manipal icon

The course material access will be active till the validity of your course which is 3 months from the date of your enrolment.

How will successful completion of the course improve my job prospects?
manipal icon

Manipal ProLearn gives learners an edge when it comes to employment opportunities. By partnering with industry leaders to provide quality courses, we add to your individual skill set and make you industry ready. Also, the constantly expanding list of top-end knowledge partners ensures that learners are exposed to the latest developments and trends across sectors.

What should I do if I face any issues?
manipal icon

In case you come across any issue, please write to us at support@manipalprolearn.com and we will be happy to assist you. For a quicker response, use your registered e-mail id to reach out to us. Our Helpdesk is also available at 1800-103-5941 (Toll-free within India) and +91-80-42515887 and it is operational between 10:00 AM and 7:00 PM IST (Monday through Saturday). All tickets will be addressed within 48 hours.

Do you have a refund policy if I am not satisfied with the course and want to withdraw? Is there a timeframe for the same?
manipal icon

All refund requests received by Manipal ProLearn are processed through the same gateway within 14 working days of receiving the request. For any exceptional cases where you have not received the refund amount, please write to us at support@manipalprolearn.com. We shall get back to you with an update on your refund request or valid reasons in case of rejections.

Didn’t find your question here? What would you like to know?
Ask a Question   add_circle_outline
_
Read up our trending blogs
Back To Topmanipal icon

Find Big Data Analytics using Hadoop in Other Cities

Similar topics you may be interested in