EXCEPTIONAL TRAINING FOR IT PROFESSIONALS

Big Data Introduction Training Course

NEXT COURSE
7 Jan London
request info

Capita Marks and Spencer Telefonica Cisco BBC Lloyds Sony

Big Data Introduction training course (code: Bigdata)

Leverage big data analysis tools and techniques to foster better business decision-making


SEE MORE COURSES IN THIS CATEGORY

TRAINING COURSE OVERVIEW

In this Introduction to Big Data training course you will learn ways of storing data that allow for efficient processing and analysis, and gain the skills you need to store, manage, process, and analyse massive amounts of unstructured data to create an appropriate data lake.

Get introduced to leading products such as Hadoop to learn how to apply Big Data in the real world.

AUDIENCE

IT professionals looking to learn about how to implement and  enhance a corporate big data environment and looking to get a better elementary practical skills relating to Big Data


DETAILHIGHLIGHTS

Introduction to Big Data

Defining Big Data

    The four dimensions of Big Data: volume, velocity, variety, veracity
    Introducing the Storage, MapReduce and Query Stack

Delivering business benefit from Big Data

    Establishing the business importance of Big Data
    Addressing the challenge of extracting useful data
    Integrating Big Data with traditional data

Storing Big Data

Analysing your data characteristics

    Selecting data sources for analysis
    Eliminating redundant data
    Establishing the role of NoSQL

Overview of Big Data stores

    Data models: key value, graph, document, column–family
    Hadoop Distributed File System
    HBase
    Hive
    Cassandra
    Hypertable
    Amazon S3
    BigTable
    DynamoDB
    MongoDB
    Redis
    Riak
    Neo4J

Selecting Big Data stores

    Choosing the correct data stores based on your data characteristics
    Moving code to data
    Implementing polyglot data store solutions
    Aligning business goals to the appropriate data store

Processing Big Data

Integrating disparate data stores

    Mapping data to the programming framework
    Connecting and extracting data from storage
    Transforming data for processing
    Subdividing data in preparation for Hadoop MapReduce

Employing Hadoop MapReduce

    Creating the components of Hadoop MapReduce jobs
    Distributing data processing across server farms
    Executing Hadoop MapReduce jobs
    Monitoring the progress of job flows

The building blocks of Hadoop MapReduce

    Distinguishing Hadoop daemons
    Investigating the Hadoop Distributed File System
    Selecting appropriate execution modes: local, pseudo–distributed and fully distributed

Handling streaming data

    Comparing real–time processing models
    Leveraging Storm to extract live events
    Lightning–fast processing with Spark and Shark

Tools and Techniques to Analyse Big Data

Abstracting Hadoop MapReduce jobs with Pig

    Communicating with Hadoop in Pig Latin
    Executing commands using the Grunt Shell
    Streamlining high–level processing

Performing ad hoc Big Data querying with Hive

    Persisting data in the Hive MegaStore
    Performing queries with HiveQL
    Investigating Hive file formats

Creating business value from extracted data

    Mining data with Mahout
    Visualising processed results with reporting tools
    Querying in real time with Impala

Developing a Big Data Strategy

Defining a Big Data strategy for your organisation

    Establishing your Big Data needs
    Meeting business goals with timely data
    Evaluating commercial Big Data tools
    Managing organisational expectations

Enabling analytic innovation

    Focusing on business importance
    Framing the problem
    Selecting the correct tools
    Achieving timely results

Implementing a Big Data Solution

    Selecting suitable vendors and hosting options
    Balancing costs against business value
    Keeping ahead of the curve

 

  • Store, manage, and analyse unstructured data
  • Select the correct big data stores for disparate data sets
  • Process large data sets using Hadoop to extract value
  • Query large data sets in near real time with Pig and Hive
  • Plan and implement a big data strategy for your organisation
Receive the latest version of this course into your inbox


PUBLIC COURSES (LONDON, UK)
 

7th Jan 2019 - 3 days £1600
Book

see all dates


X

Show Discount for this course


PRIVATE COURSES


  Bring a JBI course to your office
  and train a whole team onsite
  0800 028 6400
or request quote


  You can customise this course to
  suit your exact needs here
  0800 028 6400 or request quote


Get in touch
0800 028 6400




Excellent feedback, consistently !

►"great tips help reduce build times"
► "we got access to exclusive content"
► "Short course meant less time off"

►"what an inspiring trainer !"
► "colleagues at 2 sites joined via web"
► "I passed my exam the next day"


Newsletter ! Get exclusive news about upcoming programs, technical insights & special offers