Quantitative Data management, analysis and Visualization with Python
Learn at the comfort of your home or office

Quantitative Data management, analysis and Visualization with Python

10 Days Online - Virtual Training

# Start Date End Date Duration Location Course fee: Registration
13 04/04/2022 15/04/2022 10 Days Live Online Training USD 1,200
14 27/06/2022 08/07/2022 10 Days Live Online Training USD 1,200
15 19/09/2022 30/09/2022 10 Days Live Online Training USD 1,200
16 12/12/2022 23/12/2022 10 Days Live Online Training USD 1,200

Introduction

This comprehensive course will be your guide to learning how to use the power of Python to analyze big data, create beautiful visualizations, and use powerful machine learning algorithms. This course is designed for both beginners with basic programming experience or experienced developers looking to make the jump to Data Science and big data Analysis. Python has been one of the most adaptable, and robust open-source languages that are easy to learn and uses powerful libraries for data manipulation and analysis. For many years now, Python has been used in scientific computing and mathematical domains such as physics, finance, oil and gas, and signal processing.This Big Data Analytics with Python course provides a complete overview of data analysis techniques using Python. A Data Scientist is one of the strongest professions today and Python is a crucial skill for such roles.The Big Data Analytics with Python course teaches you to master the concepts of Python programming. Through this training, you will gain knowledge of the essential tools of Data Analytics with Python.

 

DURATION

10 Days

 

Course Objective

  • Research Design
  • Python for Data Science and Machine
  • Spark for Big Data Analysis
  • Programmatically download and analyse data
  • Practise techniques to manage various types of data – ordinal, categorical, encoding
  • Learn data visualisation
  • Master the art of performing step-by-step data analysis
  • Gain information on the roles of a Machine Learning Engineer
  • Describe Machine Learning
  • Work with real-time data
  • Learn tools and techniques for predictive modelling
  • Discuss Machine Learning algorithms and their implementation
  • Validate Machine Learning algorithms
  • Explain Time Series and its related concepts
  • Perform Text Mining and Sentimental analysis

 

WHO SHOULD ATTEND?

  • Analytics Team Managers
  • Business Analysts who want to comprehend Machine Learning concepts
  • Information Architects who want to gain proficiency in Predictive Analytics
  • Programmers, Developers, Technical Leads, Architects
  • Individuals aspiring to be Machine Learning Engineers
  • Professionals who want to develop automatic predictive models using data

 

Course content

Module1: Basic statistical terms and concepts

·         Introduction to statistical concepts

·         Descriptive Statistics

·         Inferential statistics

Module 2:Research Design

·         The role and purpose of research design

·         Types of research designs

·         The research process

·         Which method to choose?

·         Exercise: Identify a project of choice and developing a research design

Module 3: Survey Planning, Implementation and Completion

·         Types of surveys

·         The survey process

·         Survey design

·         Methods of survey sampling

·         Determining the Sample size

·         Planning a survey

·         Conducting the survey

·         After the survey

·         Exercise: Planning for a survey based on the research design selected

 

MODULE 4: DATA SCIENCE OVERVIEW

·         Introduction to Data Science

·         Different Sectors Using Data Science

·         Purpose and Components of Python

MODULE 5: DATA ANALYTICS OVERVIEW

·         Data Analytics Process

·         Knowledge Check

·         Exploratory Data Analysis (EDA)

·         EDA-Quantitative Technique

·         EDA – Graphical Technique

·         Data Analytics Conclusion or Predictions

·         Data Analytics Communication

·         Data Types for Plotting

·         Data Types and Plotting

MODULE 6: STATISTICAL ANALYSIS AND BUSINESS APPLICATIONS

·         Introduction to Statistics

·         Statistical and Non-statistical Analysis

·         Major Categories of Statistics

·         Statistical Analysis Considerations

·         Population and Sample

·         Statistical Analysis Process

·         Data Distribution

·         Dispersion

·         Histogram

·         Correlation and Inferential Statistics

MODULE 7 PYTHON ENVIRONMENT SETUP AND ESSENTIALS

·         Anaconda

·         Installation of Anaconda Python Distribution

·         Data Types with Python

·         Basic Operators and Functions

MODULE 8: MATHEMATICAL COMPUTING WITH PYTHON (NUMPY)

·         Introduction to NumPy

·         Activity-Sequence it Right

·         Creating and Printing an nd array

·         Class and Attributes of nd array

·         Basic Operations

·         Copy and Views

·         Mathematical Functions of NumPy

·         Evaluate the datasets containing GDPs of different countries

·         Evaluate the datasets of Summer Olympics 2012

MODULE 9: SCIENTIFIC COMPUTING WITH PYTHON (SCIPY)

·         Introduction to SciPy

·         SciPy Sub Package – Integration and Optimisation

·         SciPy Sub package

·         Demo – Calculate Eigenvalues and Eigenvector

·         Use SciPy to solve a linear algebra problem

·         Use SciPy to define 20 random variables for random values

MODULE 10: DATA MANIPULATION WITH PANDAS

·         Introduction to Pandas

·         Understanding DataFrame

·         View and Select Data Demo

·         Missing Values

·         Data Operations

·         File Read and Write Support

·         Pandas SQL Operation

·         Analyse the Federal Aviation Authority (FAA) dataset using Pandas

·         Analyse the dataset in CSV format given for fire department

MODULE 11: MACHINE LEARNING WITH SCIKIT–LEARN

·         Machine Learning Approach

·         Understand data sets and extract its features

·         Identifying problem type and learning model

·         How it Works

·         Train, test and optimising the model

·         Supervised Learning Model Considerations

·         Scikit-Learn

·         Supervised Learning Models – Linear Regression

·         Supervised Learning Models – Logistic Regression

·         Unsupervised Learning Models

·         Pipeline

·         Model Persistence and Evaluation

·         Analyse a dataset to find the features and response label of it

MODULE 12: NATURAL LANGUAGE PROCESSING WITH SCIKIT LEARN

·         NLP Overview

·         NLP Applications

·         NLP Libraries-Scikit

·         Extraction Considerations

·         Scikit Learn-Model Training and Grid Search

·         Analyse a given spam collection dataset

·         Analyse the sentiment dataset using NLP

MODULE 13: DATA VISUALISATION IN PYTHON USING MATPLOT-LIB

·         Introduction to Data Visualisation

·         Line Properties

·         (x, y) Plot and Subplots

·         Types of Plots

·         Analyse the “auto mpg data” and draw a pair plot

·         Draw a pie chart to visualise a dataset

MODULE 14: WEB SCRAPING WITH BEAUTIFUL SOUP

·         Web Scraping and Parsing

·         Knowledge Check

·         Understanding and Searching the Tree

·         Navigating options

·         Demo3 Navigating a Tree

·         Knowledge Check

·         Modifying the Tree

·         Parsing and Printing the Document

·         Scrape the Simplilearn website page to perform some tasks

·         Scrape the Simplilearn website page to perform some tasks

MODULE 15: INTEGRATION WITH HADOOP MAP-REDUCE AND SPARK

·         Why Big Data Solutions are Provided for Python0

·         Big Data and Hadoop

·         Hadoop Core Components

·         Python Integration with HDFS using Hadoop Streaming

·         Using Hadoop Streaming for Calculating Word Count

·         Python Integration with Spark using PySpark

·         Using PySpark to Determine Word Count

·         Determine the word count for Amazon dataset

 

General Notes

·         All our courses can be Tailor-made to participants needs

·         The participant must be conversant with English

·         Presentations are well guided, practical exercise, web based tutorials and group work. Our facilitators are expert with more than 10years of experience.

·         Upon completion of training the participant will be issued with Foscore development center certificate (FDC-K)

·         Training will be done at Foscore development center (FDC-K) center in Nairobi Kenya. We also offer more than five participants training at requested location within Kenya, more than ten participant within east Africa and more than twenty participant all over the world.

·         Course duration is flexible and the contents can be modified to fit any number of days.

·         The course fee includes facilitation training materials, 2 coffee breaks, buffet lunch and a Certificate of successful completion of Training. Participants will be responsible for their own travel expenses and arrangements, airport transfers, visa application dinners, health/accident insurance and other personal expenses.

·         Accommodation, pickup, freight booking and Visa processing arrangement, are done on request, at discounted prices.

·         One year free Consultation and Coaching provided after the course.

·         Register as a group of more than two and enjoy discount of (10% to 50%) plus free five hour adventure drive to the National game park.

·         Payment should be done two week before commence of the training, to FOSCORE DEVELOPMENT CENTER account, so as to enable us prepare better for you.

·         For any enquiry to: training@fdc-k.org or +254712260031

·         Website:www.fdc-k.org

 

 

 

Need help? Chat with us WhatsApp