Our Python Data Analysis course for Data Scientists covers an introduction to the core concepts of the Python language, ultimately focusing on Big Data Analytics including how to best manipulate and visualise your data with Python's excellent library support.
The course is intensive and is intended for Data Scientists, Data Analysts, and Business Intelligence experts who want to understand how to use Python in their data-oriented environment,
Practical exercises and interactive walk-throughs are used throughout, so attendees have the opportunity to apply the proposed concepts on real Data Science applications, from exploratory data analysis to predictive analytics.
Quants, Data Scientists, Data Analysts, Financial Analysts, Business Intelligence experts who are new to Python. Python developers who are new to Data Science or want to know more about the Python tools for Data Analysis.
Installation, packaging and virtualisation of Python using Conda.
We'll set up Python using the Anaconda distribution, a free and enterprise-ready Python distribution that includes hundreds of the most popular Python packages for science, math, engineering and data analysis.
Anaconda comes with Conda, a cross-platform tool for managing packages and virtual environments. We'll also set up Jupyter, a web-based interactive environment where users can organise, write and run their Python code in notebooks.
Introduction to Python basic concepts, data structures and control flow structures.
Overview of how Python is used for Data Science and Data Analytics projects.
Notions of Object-Oriented Programming and Functional Programming, applied to the design of Python applications and analysis pipelines using best practices.
We'll explore the most important Python tools for Data Science.
NumPy, short for Numerical Python, is one of the main building blocks for scientific computing in Python. It provides high speed manipulation of multi-dimensional arrays and it's used by higher level libraries (like pandas) to support sophisticated analytics with high speed computation.
Pandas is a highly performant library for data manipulation and data analysis in Python.
It's built on top of NumPy and optimised for performance, while offering a high-level interface.
We'll discuss how to create and manipulate Series and DataFrame objects in pandas, accessing data from multiple sources, cleaning and transforming data sets to get them in the right shape for advanced analysis.
Data can come in multiple formats and from multiple sources. We'll examine how to read and write data from local files in different formats, and how to access data from remote source.
Data cleaning and data preparation are the first steps in a data analysis project, so we'll discuss how to perform data transformation to get ready for further analysis.
With our data in the right shape, we're ready to analyse them in order to extract useful insights.
We'll perform the computation of summary information and basic statistics from data sets.
We'll approach split-apply-combine operations with Data Frames, in order to perform advanced transformations and reshaping our data with pandas.
We'll query our Data Frames using the powerful group-by method.
Data analysis benefits from the visualisation of data. If a picture if worth a thousand words,
complex data structures can be easier to understand and analyse using effective visualisation
Communicating the results with non-technical users is also a challenge that
visualisation techniques help to overcome.
See why people choose JBI
26/02/2018: Analyzing data is not a goal unto itself; there is a larger objective: to provide useful information and make business recommendations, all in...
04/02/2018: As the volume of data held by your business continues to grow, your data science team will need a new set of tools to make sense of it. Although...
28/01/2018: It’s not really news that the world is heading in a technological direction; new advancements are being developed each day; new innovations are...
19/10/2017: Nowadays, there is a significant business advantage in being able analyse, process and visualize "big data". While there is no agreed definition...
13/10/2017: This organisation needed their Supply Chain department to get fully involved with Microsoft’s Power BI reporting product as soon as possible....
12/10/2017: The Graduate Programme provided a gateway into technology within investment banking. Graduates (Computer Science, Engineering, Maths, Physics...
Bring a JBI course to your office
and train a whole team onsite
0800 028 6400 or request quote
Get in touch
0800 028 6400
Excellent feedback, consistently !
"great tips help reduce build times"
"we got access to exclusive content"
"Short course meant less time off"
"what an inspiring trainer !"
"colleagues at 2 sites joined via web"
"I passed my exam the next day"