CUSTOMISED
Expert-led training for your team

Dismiss

Big Data Introduction training course

Leverage big data analysis tools and techniques to foster better business decision-making

"Our tailored course provided a well rounded introduction and also covered some intermediate level topics that we needed to know. Clive gave us some best practice ideas and tips to take away. Fast paced but the instructor never lost any of the delegates"

Brian Leek, Data Analyst, May 2022

Public Courses

20/05/24 - 3 days

£1600 +VAT

Enrol

01/07/24 - 3 days

£1600 +VAT

Enrol

12/08/24 - 3 days

£1600 +VAT

Enrol

Customised Courses

* Train a team
* Tailor content
* Flex dates

From £1200 / day

Highlights Details Audience Feedback Prices/Dates

Public Courses

Highlights

Gain an introduction to Big Data
Learn how to define Big Data
Select the correct Big Data stores for disparate data sets
Process large data sets using Hadoop to extract value
Store, manage and analyse unstructured data
Leverage Big Data analysis tools and techniques to foster better business decision-making
Query large data sets in near real-time with Pig and Hive
Plan and implement a Big Data strategy for your organisation

Course Details

Introduction to Big Data

Defining Big Data
The four dimensions of Big Data: volume, velocity, variety, veracity
Introducing the Storage, MapReduce and Query Stack
Delivering business benefit from Big Data
Establishing the business importance of Big Data
Addressing the challenge of extracting useful data
Integrating Big Data with traditional data

Storing Big Data

Analysing your data characteristics
Selecting data sources for analysis
Eliminating redundant data
Establishing the role of NoSQL

Overview of Big Data stores

Data models: key value, graph, document, column–family
Hadoop Distributed File System
HBase
Hive
Cassandra
Hypertable
Amazon S3
BigTable
DynamoDB
MongoDB
Redis
Riak
Neo4J

Selecting Big Data stores

Choosing the correct data stores based on your data characteristics
Moving code to data
Implementing polyglot data store solutions
Aligning business goals to the appropriate data store

Processing Big Data

Integrating disparate data stores
Mapping data to the programming framework
Connecting and extracting data from storage
Transforming data for processing
Subdividing data in preparation for Hadoop MapReduce
Employing Hadoop MapReduce
Creating the components of Hadoop MapReduce jobs
Distributing data processing across server farms
Executing Hadoop MapReduce jobs
Monitoring the progress of job flows
The building blocks of Hadoop MapReduce
Distinguishing Hadoop daemons
Investigating the Hadoop Distributed File System
Selecting appropriate execution modes: local, pseudo–distributed and fully distributed
Handling streaming data
Comparing real–time processing models
Leveraging Storm to extract live events
Lightning–fast processing with Spark and Shark

Tools and Techniques to Analyse Big Data

Abstracting Hadoop MapReduce jobs with Pig
Communicating with Hadoop in Pig Latin
Executing commands using the Grunt Shell
Streamlining high–level processing
Performing ad hoc Big Data querying with Hive
Persisting data in the Hive MegaStore
Performing queries with HiveQL
Investigating Hive file formats
Creating business value from extracted data
Mining data with Mahout
Visualising processed results with reporting tools
Querying in real time with Impala

Developing a Big Data Strategy

Defining a Big Data strategy for your organisation
Establishing your Big Data needs
Meeting business goals with timely data
Evaluating commercial Big Data tools
Managing organisational expectations
Enabling analytic innovation
Focusing on business importance
Framing the problem
Selecting the correct tools
Achieving timely results

Implementing a Big Data Solution

Selecting suitable vendors and hosting options
Balancing costs against business value
Keeping ahead of the curve

Who should attend

IT professionals looking to learn about how to implement and enhance a corporate big data environment and looking to get a better elementary practical skills relating to Big Data

Feedback

4.8 out of 5 average

Brian Leek, Data Analyst, May 2022

“JBI did a great job of customizing their syllabus to suit our business needs and also bringing our team up to speed on the current best practices. Our teams varied widely in terms of experience and the Instructor handled this particularly well - very impressive”

Brian F, Team Lead, RBS, Data Analysis Course, 20 April 2022

Hadoop Administration

SQL

Sign up for the JBI Training newsletter to stay updated with world-class technology training opportunities, including Analytics, AI, ML, DevOps, Web, Backend and Security. Our Power BI Training Course is especially popular. Gain new skills, useful tips, and validate your expertise with an industry-leading organisation, all tailored to your schedule and learning preferences.

More about this course

In this Introduction to Big Data training course, you will learn ways of storing data that allow for efficient processing and analysis. You will also gain the skills you need to store, manage, process and analyse massive amounts of unstructured data to create an appropriate data lake.

Get introduced to leading products such as Hadoop to learn how to apply Big Data in the real world.

FAQs

CONTACT
+44 (0)20 8446 7555

[email protected]

SHARE

Copyright © 2023 JBI Training. All Rights Reserved.
JB International Training Ltd - Company Registration Number: 08458005
Registered Address: Wohl Enterprise Hub, 2B Redbourne Avenue, London, N3 2BS

Modern Slavery Statement & Corporate Policies | Terms & Conditions | Contact Us