Practical Big Data Analysis (in English)

Course length: 40 academic hours (5 days)

Target audience: Data analysis practitioners and fledgling data scientists who wish to leverage Big Data technologies such as NoSQL databases, Hadoop and Spark.

Prerequisites:

  • Basic knowledge of database architecture and SQL
  • Basic knowledge of programming: variables, control flow, scope and functions
  • Prior experience of working with data
  • Experience with Python or another scripting language such as Perl is an advantage

The training is held in cooperation with our partner QA.

Course description

An introduction to Python, Data Science and Big Data, plus an in-depth introduction to the major Big Data technologies for the practitioners who work with them.

This 5-day course is ideal for people who currently work with data as software engineers, or in business intelligence, and who want to move up to large-scale data analysis skills and contemporary Data Science patterns.

You will learn how to work with and model large data sets and understand the statistical and mathematical models behind them. You will also look at SQL and NoSQL trends and learn to take an effective hypothesis-driven approach to working with data and identifying measurable statistical outcomes.

Course topics

  • Introduction to Data Science
  • Data Mining and Machine Learning
  • Data Models
  • NoSQL
  • Introduction to Python
  • Python and Data
  • Python Databases and SQL
  • Data Science and Numerical Python
  • MongoDB
  • Neo4j and Graph Analytics
  • Functional Programming
  • Hadoop and Ecosystem
  • Spark MapReduce
  • Spark SQL
  • Python Machine Learning

Learning outcomes

At the end of this course attendees will know:

  • fundamentals of Data Science;
  • fundamentals of Machine Learning;
  • fundamentals of Python programming;
  • Python’s data and numerical packages;
  • how to visualise data using Python;
  • different data models;
  • what a NoSQL database is and how it differs from a (traditional) relational database;
  • what Hadoop is;
  • what Spark is.

At the end of this course attendees will be able to:

  • write Python programs to manipulate data;
  • use Python to visualise data;
  • query the graph database Neo4j;
  • query the column-store database Cassandra;
  • query the document-based database MongoDB;
  • use Hadoop and Spark;
  • use Python Machine Learning libraries to perform predictive analysis.

Completion requirements: achievement of the learning outcomes is checked and assessed through an independent practical assignment.

The course price includes:

  • the training;
  • study materials;
  • a certificate.

Continuing education curriculum group: interdisciplinary curriculum group of information and communication technology

Course participation
Places: available


19 July 2021 - 23 July 2021


5 days
11:00 - 19:30


4290€ + VAT